Jim Harris shares examples of how and why AI applications are dependent on high-quality data.
Jim Harris says learn the lineage of the data that fed the analysis before you get dazzled by visualizations or algorithms.
The post Growing importance of data lineage when managing data for analytics appeared first on The Data Roundtable.
Get faster value out of your data by empowering business users to work with data on their own.
The post Discoverability enables self-service data preparation appeared first on The Data Roundtable.
As the application stack supporting big data has matured, it has demonstrated the feasibility of ingesting, persisting and analyzing potentially massive data sets that originate both within and outside of conventional enterprise boundaries. But what does this mean from a data governance perspective? Most aspects of data governance for internally […]
In the previous three blogs in this series, we talked about what metadata can be available from source systems, transformation and movement, and operational usage. For this final blog in the series, I want to discuss the analytical usage of metadata. Let’s set up the scenario. Let's imagine I'm a […]
The post Importance of metadata – Bridging the gap (Part 4: analytical metadata usage) appeared first on The Data Roundtable.
As I discussed in the first two blogs of this series, metadata is useful in a variety of ways. Its importance starts at the source system, and continues through the data movement and transformation processes and into operations. Operational metadata, in particular, gives us information about the execution and completion […]
The post Importance of metadata – Bridging the gap (Part 3: operational metadata usage) appeared first on The Data Roundtable.
In the first blog of this four-part series, we discussed traditional data management and how it can apply these principles to our big data platforms. We also discussed how metadata can help bridge the gap of understanding the data as we move to newer technologies. Part 2 will focus on […]
The post Importance of metadata – Bridging the gap (Part 2: transformation and movement) appeared first on The Data Roundtable.
Traditional data management includes all the disciplines required to manage data resources. More specifically, data management usually includes: Architectures that encompass data, process and infrastructure. Policies and governance surrounding data privacy, data quality and data usage. Procedures that manage a data life cycle from creation of the data to sunset […]
The post Importance of metadata – Bridging the gap (Part 1: source system) appeared first on The Data Roundtable.