enterprise data hub

An enterprise data hub is a big data management model that uses a Hadoop platform as the central data repository.  The goal of an enterprise data hub is to provide an organization with a centralized, unified data source that can quickly provide diverse business users with the information they need to do their jobs.  

Enterprise data hubs differ from traditional data management models because the data remains in place. In the traditional extract, transform and load (ETL) model, data is extracted from one system, transformed into the required format and then loaded another system for analysis or other business purposes. In an enterprise data hub model, however, data is first loaded into the Hadoop platform, and then analytics and data mining tools are applied to the data where it resides in the hub. Offloading ELT processing to Hadoop can provide provide better integration for disparate data sources and reduce data movement and data latency, which can cause bottlenecks.

Enterprise data hubs are designed to address the challenge of data that is ballooning in volume, variety and velocity (3Vs). This term is largely associated with Cloudera and MapR

See also: extract, load and transform (ELT)



This was last updated in April 2014

Continue Reading About enterprise data hub

Dig Deeper on IT applications, infrastructure and operations