To rely on data and truly become a Data-Driven organization, you must first of all trust it; one of the fundamental elements in this sense is to use Data Lineage techniques and solutions.
The Data Lineage is the technique that allows you to identify and represent the life cycle of the data from when it enters our organization until it leaves it, keeping track of the transformations it undergoes over time, which are the main processes that involve it and the interdependencies of the latter with other company systems.
It is therefore evident, since its definition, that the Data Lineage is a fundamental element, in general for Data Governance and even more so for Data Quality, because the more significant the knowledge of systems, processes, and transformations that characterize the data, so much the implementation of controls on them is more effective.
Table of Contents
The risk of not adopting effective Data Lineage solutions is that which occurs when playing on the wireless telephone: the words entered into the system by the first player can be wholly distorted during a word of mouth that characterizes the game, with the risk that the last player pronounces completely different words from the original one by changing the meaning of the initial sentence, without being able to reconstruct where the words have been changed.
Replacing words with data, this is precisely what happens in a company in the absence of a Data Lineage system: data, during its life cycle, flow within a flow-through complex architecture, solutions, and processes and, if not adequately managed, they can suffer the application of wrong rules, or they may lack information, and reach the end of their cycle (for example the production of a report) without anyone being able to explain where and why the error or change has been generated.
Customers, legislators, and business users need secure and compliant data. This data must be available when and where it is needed and, at all times, it is necessary to be able to reconstruct its history; this need becomes even greater within complex organizations where we typically find large platforms used by many users, data in various formats and hybrid architectures (both cloud and traditional).
The Data Lineage offers numerous advantages to all those who have to do with data and Data Governance systems in general:
Often, data errors are found only at the end of the process that involves them, typically when a report is read; often, however, the error originated much further upstream. The Data lineage allows you to easily and quickly reconstruct the chain that produced that data and, therefore, quickly identify the moment the error was introduced.
It is often necessary to change existing systems for regulatory or business reasons or to change some software systems. When the change involves data, the Data Lineage allows you to evaluate a priori which applications, which data, and which downstream processes will be affected by the change and helps plan the updates of the various applications.
The Data Lineage makes it possible to identify any “bottlenecks” in data management and, therefore, allows intervention with alternative solutions that can be assessed through impact analysis.
As previously anticipated, there is a strong interdependence bond between Data Lineage, Data Governance, and Data Quality. The Data Lineage indicates where the data comes from, where they are going, and which transformations are applied to them through multiple processes: one of the essential information processes for Metadata Management.
From the point of view of Data Quality and Data Governance, it is essential to use the Data Lineage to ensure that business rules exist or not, that they are applied where required, that the calculation rules and other transformations are correct and that the inputs and outputs of systems are compatible. The traceability of data, guaranteed by the Date Line, is also the best tool for data validation and represents a control tool relating to their use for audit and regulatory compliance purposes.
Mapping processes from a business point of view and doing it from a technical/operational point of view is quite different. Still, there is a connection: data is behind the business and all operations. By creating the business metadata and their technical counterpart, following the life cycle of the data, or using the Data Lineage, it is possible to build this connection. The result is surprising, as it is possible to locate specific business metadata and track it across multiple applications, data sources, interfaces, models, analytics, reports, and more. This guarantees the entire company organization maximum data reliability, transparency in its management, and possible collaboration at all levels.
When your two year mobile phone contract comes to an end, you might find yourself… Read More
In an era where business dynamics shift with dizzying speed, the difference between success and… Read More
Introduction Generative AI and Machine Learning models have exploded in recent times, and organizations and… Read More
Quick advances in information science are opening up additional opportunities for organizations. They can extend… Read More
When thinking about the future, financial stability is an important factor that provides us with… Read More
It may have been a long time since you had to pull a handle on… Read More