TechTarget.com/searchdatamanagement

https://www.techtarget.com/searchdatamanagement/definition/data-silo

What are data silos and what problems do they cause?

By Scott Robinson

A data silo is a repository of data that's controlled by one department or business unit and isolated from the rest of an organization, much like grass and grain in a farm silo are closed off from outside elements. Siloed data typically is stored in a standalone system and often is incompatible with other data sets. That makes it hard for users in other parts of the organization to access and use company data. And when an organization aspires to be data-driven, siloed data can be a huge obstacle.

Data silos can have technical, organizational or cultural roots. They tend to arise naturally in large companies because separate business units often operate independently and have their own goals, priorities and IT budgets. But any organization can end up with data silos if it doesn't have a well-planned data management strategy.

Why are data silos a problem?

Data silos hinder business operations and the data analytics initiatives that support them. Silos limit executives' ability to use data to manage business processes and make informed business decisions. They also prevent call center agents, sales reps and other operational workers from accessing relevant data about customers, products and supply chains. This is a particular problem for organizations implementing customer relationship management (CRM), which is increasingly essential to enhancing customer experience.

The specific ways that data silos can harm an organization include the following:

How data silos occur

A department or end user might go rogue and create a data silo even in an organization that has solid data management processes. More often, though, data silos are a consequence of how organizations are structured and managed as a whole, including their IT operations. The following factors commonly cause silos to occur:

How do you identify data silos?

Because of their disconnected nature, data silos can be hard to detect. Ideally, IT and data management teams will create an inventory of the systems in their organizations and regularly update it to add new ones. Doing so should help identify and document data silos. But finding them all can be a challenge, especially in large organizations with business units that operate autonomously.
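As a rough illustration of how such an inventory could help, the following Python sketch flags systems that might be silos. It is only a minimal example under stated assumptions: the `SystemRecord` fields, the sample systems and the two checks (no access outside the owning department, and the same data domain held in more than one system) are hypothetical and not taken from any particular inventory tool.

```python
from dataclasses import dataclass, field


@dataclass
class SystemRecord:
    """One entry in a hypothetical system inventory."""
    name: str
    owner: str                 # department that controls the system
    data_domains: set[str]     # kinds of data held, e.g. {"customers"}
    shared_with: set[str] = field(default_factory=set)  # other departments with access


def find_silo_candidates(inventory):
    """Return names of systems that no department other than the owner can access."""
    return [s.name for s in inventory if not (s.shared_with - {s.owner})]


def find_overlapping_domains(inventory):
    """Map each data domain to the systems that hold it, keeping only duplicates."""
    by_domain = {}
    for system in inventory:
        for domain in system.data_domains:
            by_domain.setdefault(domain, []).append(system.name)
    return {d: sorted(names) for d, names in by_domain.items() if len(names) > 1}


# Illustrative inventory; all names are made up.
inventory = [
    SystemRecord("crm", "sales", {"customers", "leads"}, {"marketing"}),
    SystemRecord("helpdesk", "support", {"customers", "tickets"}),
    SystemRecord("warehouse", "it", {"orders"}, {"sales", "finance"}),
]

silo_candidates = find_silo_candidates(inventory)
overlaps = find_overlapping_domains(inventory)
print(silo_candidates)
print(overlaps)
```

A flagged system is only a candidate for review. An inventory can be incomplete, and a system with a single owner might still feed other platforms through integrations that the inventory doesn't record.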

Evidence of data silos might come to light, though. Signs that point to them include the following:

How do you break down data silos?

Breaking down data silos lets an organization manage and use data more effectively. It often also helps lower technology and data management costs. The following approaches can be used separately or in tandem to remove silos and connect data assets to better support business operations:

  1. Data integration. Integrating data with other systems is the most straightforward method for breaking down silos. The most popular form of data integration is extract, transform and load (ETL), which extracts data from source systems, transforms and consolidates it, and loads it into a target system or application. Other data integration techniques that can be used against silos include real-time integration, data virtualization and extract, load and transform (ELT), a variation of ETL.
  2. Data warehouses and data lakes. The most common target system in data integration jobs is a data warehouse, which stores structured transaction data for BI, analytics and reporting applications. Increasingly, organizations also build data lakes to hold sets of big data, which can include large volumes of structured, unstructured and semi-structured data used in data science applications. Those two types of platforms provide centralized repositories for data from different systems, making them a natural way to address silos.
  3. Enterprise data management and governance. Ultimately, it's best to not only eliminate existing data silos but also prevent new ones from being created. A more comprehensive data management strategy helps achieve both of those goals. For example, designing a data architecture involves documenting data assets, mapping data flows and creating a blueprint for data platform deployments. An enterprise data strategy better aligns the data management process with business operations. And a strong data governance program can directly reduce the number of data silos in an organization and promote common data standards and policies.
  4. Culture change. To really put a stop to data silos, it might be necessary to change an organization's culture. Efforts to do so can be part of the data strategy development process or a data governance initiative. In some cases, a change management program might be needed to implement the cultural changes and ensure that departments and business units adopt them.
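
To make the ETL approach described in the first item more concrete, here is a minimal Python sketch that consolidates two siloed sources into one target table. Everything specific in it is an assumption made for illustration: the sales and support records, their field names, the cleanup rules and the in-memory SQLite database standing in for a real target system. Production ETL tools handle far more, such as scheduling, error handling and much larger data volumes.

```python
import sqlite3

# Hypothetical siloed sources: two departments hold overlapping customers
# under different field names and formats.
SALES_ROWS = [
    {"cust_id": "C001", "full_name": "Ada Lovelace", "email": "ADA@EXAMPLE.COM"},
    {"cust_id": "C002", "full_name": "Alan Turing", "email": "alan@example.com"},
]
SUPPORT_ROWS = [
    {"customer": "c002", "name": "Alan Turing", "contact": "alan@example.com ", "open_tickets": 3},
    {"customer": "c003", "name": "Grace Hopper", "contact": "grace@example.com", "open_tickets": 1},
]


def extract():
    """Extract: read raw records from each source system."""
    return SALES_ROWS, SUPPORT_ROWS


def transform(sales_rows, support_rows):
    """Transform: map both schemas to one shape and merge by customer ID."""
    merged = {}
    for row in sales_rows:
        cid = row["cust_id"].upper()
        merged[cid] = {
            "customer_id": cid,
            "name": row["full_name"],
            "email": row["email"].strip().lower(),
            "open_tickets": 0,
        }
    for row in support_rows:
        cid = row["customer"].upper()
        # Reuse the sales record if one exists; otherwise create a new one.
        record = merged.setdefault(cid, {
            "customer_id": cid,
            "name": row["name"],
            "email": row["contact"].strip().lower(),
            "open_tickets": 0,
        })
        record["open_tickets"] = row["open_tickets"]
    return [merged[cid] for cid in sorted(merged)]


def load(records, conn):
    """Load: write the consolidated records into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS customers ("
        "customer_id TEXT PRIMARY KEY, name TEXT, email TEXT, open_tickets INTEGER)"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO customers VALUES "
        "(:customer_id, :name, :email, :open_tickets)",
        records,
    )
    conn.commit()


conn = sqlite3.connect(":memory:")  # stand-in for a real target system
sales_rows, support_rows = extract()
load(transform(sales_rows, support_rows), conn)

rows = conn.execute(
    "SELECT customer_id, email, open_tickets FROM customers ORDER BY customer_id"
).fetchall()
for row in rows:
    print(row)
```

In this sketch, the customer who appears in both sources ends up as a single row in the target table that combines the sales contact details with the support ticket count. That kind of consolidated view is what siloed systems can't provide on their own.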

What are the business costs of data silos?

According to IDC Market Research, incorrect or siloed data can cost a company up to 30% of its annual revenue. An organization can measure these inefficiencies by how many silos it has, how successful efforts to eliminate them are, and whether they continue to proliferate. In general, increased IT and data management expenses are the most tangible cost. But data silos also have the following intangible costs:

The terms data silo and information silo are sometimes used as synonyms. More often, though, information silos are considered a cultural problem caused by departments or individual workers who don't want to share information. In addition to cultural change, one way to address the latter problem is to create an information architecture along with a data architecture.

On-premises and cloud-based ETL tools

Increasingly, organizations are moving their digital assets into cloud-based data storage. However, moving data around in any domain -- on premises or in the cloud -- requires tools for ETL to do the actual moving and to modify the data as needed in transit.

On-premises ETL tools can be automated, simplifying the process of consolidating and cleaning up disparate siloed data sources for centralized access. These tools are generally platform-specific and must be purchased, although most organizations with significant database or data warehouse assets already possess them.

Cloud-based platforms generally include built-in ETL tools to facilitate migration, and these can be used in much the same way -- to integrate data as it's migrated out of siloed data sources, transforming it as needed along the way.


25 Jul 2024
