data management for analytics

11月 282016
 

One aspect of high-quality information is consistency. We often think about consistency in terms of consistent values. A large portion of the effort expended on “data quality dimensions” essentially focuses on data value consistency. For example, when we describe accuracy, what we often mean is consistency with a defined source […]

The post Harmonizing semantics for consistency in interpreting analytical results appeared first on The Data Roundtable.

9月 142016
 

In Part 1 of this two-part series, I defined data preparation and data wrangling, then raised some questions about requirements gathering in a governed environment (i.e., ODS and/or data warehouse). Now – all of us very-managed people are looking at the horizon, and we see the data lake. How do […]

The post Data preparation and data wrangling, Part 2 (yippee, bring your lasso) appeared first on The Data Roundtable.

9月 062016
 

I'm a very fortunate woman. I have the privilege of working with some of the brightest people in the industry. But when it comes to data, everyone takes sides. Do you “govern” the use of all data, or do you let the analysts do what they want with the data to […]

The post Data preparation and data wrangling, Part 1 (yippee, bring your lasso) appeared first on The Data Roundtable.

9月 022016
 

Critical business applications depend on the enterprise creating and maintaining high-quality data. So, whenever new data is received – especially from a new source – it’s great when that source can provide data without defects or other data quality issues. The recent rise in self-service data preparation options has definitely improved the quality of […]

The post Is data quality a component of data preparation? Or vice versa? appeared first on The Data Roundtable.

8月 292016
 

Hadoop has driven an enormous amount of data analytics activity lately. And this poses a problem for many practitioners coming from the traditional relational database management system (RDBMS) world. Hadoop is well known for having lots of variety in the structure of data it stores and processes. But it's fair to […]

The post Data preparation strengthens Hadoop information chain appeared first on The Data Roundtable.