1. Prepare the migration ideally
Preparation is the decisive step for the success of a DWH migration. An important tool here is a modern data catalog. It supports three central aspects of preparation and planning.
Overview of your data landscape
A data catalog enables you to get a holistic overview of your own data assets such as tables, views, ETLs, reports and more. A correct overview is the basis for estimating how extensive the migration to a cloud DWH will be.
Understand relations between data assets
Additionally, we recommend identifying existing dependencies and documenting them where not already done. With the knowledge thus gained and the transparency about the data landscape, you can then draw up a prioritized migration plan.
Sustainably improve data quality
A relevant issue for many customers in the context of a migration is the topic of data quality. The preparation phase can be optimally used to
check already existing data and tables
archive unused data
improve the data quality.
In addition to a data catalog, we recommend the use of open source tools such as Great Expectations.
2. Actively shape the migration
Even during the actual migration, certain activities have emerged as success factors in numerous projects.
Monitoring the migration progress
A data catalog is also a great supporter of the process during migration! It allows the teams involved to get a complete overview of already migrated assets at any time.
Establish control mechanisms
Migration is a good time to establish automated checks and redefine data lifecycles - because this ensures high data quality and reduces necessary repairs and subsequent maintenance. Modern tools offer very convenient, powerful functions for this purpose.
Modern tools for bridging
Gaps between different DWH environments often occur. With suitable tools, you can bridge these efficiently. Another helpful technology during the transition period is data virtualization. This allows you to easily perform cross-application queries and combine data from multiple DWHs and other sources for ad-hoc queries.
Improve the documentation
Preserve existing documentation - a data catalog is also suitable for this purpose. It is best to enrich new components with meaningful context directly during migration.
3. Optimize your cloud DWH usage
Cloud DWHs support the long-term and smooth scalability of the data infrastructure. Collaborative work in different teams can also be simplified enormously. Cloud DWHs unfold their advantages best in combination with other tools in a modern data landscape.
Bringing together different tools allows business teams, for example, to independently create ad hoc reports on prepared data sets at any time. It also allows available data structures and other assets to be permanently reused - across different teams. This saves data engineers a lot of work. Our advice is: bring your data catalog, your BI tools and your ETL tools together for maximum added value.
Another benefit of linking the DWH with many other tools is that collaborative data engineering is greatly simplified. Different teams of experts can share information about dependencies, SQL statements, schedules, and related assets from different applications.