Data warehousing
Data warehousing captures data from a variety of sources so it can be accessed and analyzed by business analysts, data scientists and other end users. One goal is to enhance data quality and consistency for analytics uses while improving business intelligence. Read how data warehousing provides these and other unique benefits to overall data management strategy.
Top Stories
-
News
11 Dec 2024
Aerospike adds new vector search capabilities to database
With vector search a critical component of AI development, the vendor's latest vector search and storage capabilities target simplifying data discovery for training AI tools. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
03 Dec 2024
AWS reimagines SageMaker as suite for data, analytics, AI
The service is now a unified platform featuring a data catalog, lakehouse and integrations with other AWS platforms. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
19 Nov 2024
With new Fabric features, Microsoft aims at AI development
New data management and analytics suite features include databases and a data catalog to enable enterprises to develop and operationalize advanced applications. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
10 Sep 2024
Oracle keeps AI focus with database updates, new data lake
Updates to HeatWave and Database 23ai, along with the introduction of Intelligent Data Lake, are all aimed at better enabling customers to develop artificial intelligence tools. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
01 Aug 2024
Google Cloud's BigQuery gets AI injection, Looker to follow
The tech giant's latest analytics and data management moves include the general availability of Gemini in BigQuery and the imminent availability of Gemini in Looker. Continue Reading
By- Eric Avidon, Senior News Writer
-
Tip
24 Jul 2024
Cloud database comparison: AWS, Microsoft, Google and Oracle
Here's a look at the rival cloud database offerings from AWS, Google, Microsoft and Oracle based on their product breadth, migration capabilities and pricing models. Continue Reading
By -
Opinion
27 Jun 2024
Databricks bids to marry AI and BI
At its recent Data+AI Summit, data platform provider Databricks introduced a product that might be a harbinger of a new wave of AI-backed Business Intelligence Continue Reading
By- Brian McKenna, Senior Analyst, Business Applications
-
Feature
14 Jun 2024
Graph database vs. relational database: Key differences
Graph databases offer plenty of advantages for enterprises, but relational databases still top the market. Both emphasize relationships between data, but how do they compare? Continue Reading
-
Definition
12 Jun 2024
data mesh
Data mesh is a decentralized data management architecture for analytics and data science. Continue Reading
By- Kinza Yasar, Technical Writer
- George Lawton
-
Definition
03 May 2024
data center
A data center is a facility composed of networked computers, storage systems and computing infrastructure that organizations use to assemble, process, store and disseminate large amounts of data. Continue Reading
By- Kinza Yasar, Technical Writer
- Peter Loshin, Former Senior Technology Editor
- Ben Lutkevich, Site Editor
-
News
30 Apr 2024
Tableau adds generative AI tools, tightens Databricks bond
The analytics vendor's new features include a tool that enables customers to explore metrics using natural language as well as connectors that improve its link to Databricks. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
09 Apr 2024
Google Cloud to inject Gemini into data, analytics tools
The tech giant unveiled integrations between its LLM and BigQuery, Looker and its databases to provide customers with a foundation for developing GenAI models and applications. Continue Reading
By- Eric Avidon, Senior News Writer
-
Definition
21 Mar 2024
big data
Big data is a combination of structured, semi-structured and unstructured data that organizations collect, analyze and mine for information and insights. Continue Reading
By- Cameron Hashemi-Pour, Site Editor
- Bridget Botelho, Editorial Director, News
- Stephen J. Bigelow, Senior Technology Editor
-
Definition
19 Mar 2024
off-site backup
Off-site backup is a method of backing up data to a remote server or to media that's transported off-site. Continue Reading
By- Paul Kirvan
- Kinza Yasar, Technical Writer
- Brien Posey
-
Tip
18 Mar 2024
On-premises vs. cloud data warehouses: Pros and cons
Data warehouses increasingly are being deployed in the cloud. But both on-premises and cloud data warehouses have pluses and minuses to consider, as explained here. Continue Reading
By -
News
14 Mar 2024
Databricks partners with Mistral AI to aid GenAI development
The data cloud vendor joins Microsoft and Snowflake in partnering with -- and investing in -- the startup to provide customers with access to Mistral's open source language models. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
29 Feb 2024
Snowflake CEO Slootman steps down, Ramaswamy takes over
Slootman resigns after five years at the helm of the data cloud vendor. Revenues grew fivefold under him and the company went public in a record-setting initial public offering. Continue Reading
By- Eric Avidon, Senior News Writer
-
Definition
26 Feb 2024
data warehouse as a service (DWaaS)
Data warehouse as a service (DWaaS) is an outsourcing model in which a cloud service provider configures and manages the hardware and software resources a data warehouse requires, and the customer provides the data and pays for the managed service. Continue Reading
By- Cameron Hashemi-Pour, Site Editor
- Craig S. Mullins, Mullins Consulting
-
News
14 Feb 2024
Alteryx, Databricks expand complementary partnership
The expanded partnership features new integrations designed to better enable joint customers to combine self-service data preparation and analysis with data science and AI. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
26 Sep 2023
MongoDB reveals new generative AI, vector search tools
After unveiling an integration with Google's LLM suite in June, the vendor moved a set of NLP tools into preview and introduced new data migration and vector search capabilities. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
14 Sep 2023
Dremio launches updated SQL query acceleration capabilities
The data lakehouse specialist's new version of its SQL query acceleration tool, Reflections, includes automated recommendations and automatic data refresh capabilities. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
06 Sep 2023
InfluxData launches new database for self-managed users
The time series database vendor completed the InfluxDB 3.0 product line with the release of InfluxDB Clustered, a version tailored for private cloud and on-premises deployments. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
30 Aug 2023
Couchbase intros generative AI feature for its Capella DBaaS
The database vendor's new tool -- now in private preview -- uses LLM technology to make application developers more efficient by helping them more easily generate code. Continue Reading
By- Eric Avidon, Senior News Writer
-
Definition
08 Aug 2023
dimension
In data warehousing, a dimension is a collection of reference information that supports a measurable event, such as a customer transaction. Continue Reading
-
News
24 Jul 2023
Oracle targets speed with launch of MySQL HeatWave Lakehouse
The tech giant's new lakehouse enables users of its database management suite to combine structured and unstructured data to develop a more complete view of their operations. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
18 Jul 2023
Confluent partner plan aids streaming data platform delivery
The vendor's Connect With Confluent program enables technology partners to deliver event data to end users in real time through integrations with Confluent Cloud. Continue Reading
By- Eric Avidon, Senior News Writer
-
Feature
06 Jul 2023
Generative AI hype evolving into reality in data, analytics
Organizations are already beginning to apply the technology to their data operations, helping expand analytics use to more employees and boosting the efficiency of data experts. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
22 Jun 2023
MongoDB unveils new AI, migration tools for database
The vendor, with its latest slate of new and updated capabilities, is adding generative AI with its partnership with Google Cloud and launching a new data migration tool. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
20 Jun 2023
Starburst Galaxy update targets governance, data access
The vendor's latest update includes the public preview of Gravity, a centralized access and governance layer that enables users to better control and connect data across clouds. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
16 Jun 2023
Dremio adds first generative AI-infused tool, intros others
The vendor's initial generative AI-infused tool is Text-to-SQL, which enables customers to work with data using natural language that automatically gets translated to code. Continue Reading
By- Eric Avidon, Senior News Writer
-
News
06 Jun 2023
Collibra update targets data quality, lineage and discovery
The data management vendor's Data Intelligence Cloud now includes pushdowns that enable work within Snowflake and Databricks and prebuilt workflows focused on data visibility. Continue Reading
By- Eric Avidon, Senior News Writer
-
Feature
02 Jun 2023
Data mesh helping fuel Sloan Kettering's cancer research
The cancer hospital and research center began using tools from data management vendor Dremio two years ago to decentralize its data operations and improve speed-to-insight. Continue Reading
By- Eric Avidon, Senior News Writer
-
Definition
31 May 2023
data lakehouse
A data lakehouse is a data management architecture that combines the key features and the benefits of a data lake and a data warehouse. Continue Reading
By- Craig Stedman, Industry Editor
- George Lawton
-
Feature
17 May 2023
Peloton rides, runs, rows with AWS for data management
The connected fitness company has long used AWS tools. When its data volume surged during COVID-19, Redshift was critical -- and still is as the company attempts a fiscal comeback. Continue Reading
By- Eric Avidon, Senior News Writer
-
Tip
15 May 2023
Mainframe databases teach an old dog new survival tricks
Long predicted to fade away in favor of more modern architectures, mainframes still play an integral role in corporate IT strategies, thanks to advances in database software. Continue Reading
By- Ed Scannell, Freelancer
-
News
24 Apr 2023
IBM acquires Ahana, steward of open source PrestoDB
The purchase not only gives IBM a managed SaaS and AWS marketplace version of the popular open-source Presto database, but membership in the Presto Foundation as well. Continue Reading
By- Tim McCarthy, News Writer
-
News
04 Apr 2023
Alation unveils enhanced partnerships with Databricks, DBT
The data catalog vendor launched new connectors with its partners designed to help joint customers better understand data in their lakehouses and more easily transform the data. Continue Reading
By- Eric Avidon, Senior News Writer
-
Opinion
16 Mar 2023
Data lakes: Key to the modern data management platform
Data lakes influence the modern data management platform at all levels. Organizations can gain faster insights, save costs, improve governance and boost self-service data access. Continue Reading
By- Stephen Catanzano, Senior Analyst
-
Enterprise Strategy Group
We provide market insights, research and advisory, and technical validations for tech buyers.
-
Definition
28 Feb 2023
data warehouse
A data warehouse is a repository of data from an organization's operational systems and other sources that supports analytics applications to help drive business decision-making. Continue Reading
By- Mary K. Pratt
- Jacqueline Biscobing, Senior Managing Editor, News
-
News
30 Jan 2023
Expanded AtScale, Databricks integration adds functionality
The semantic layer platform vendor's tools are now listed on Databricks' Partner Connect, and existing customers can now connect to Databricks SQL and Unity Catalog. Continue Reading
By- Eric Avidon, Senior News Writer
-
Feature
25 Jan 2023
Data lake vs. data warehouse: Key differences explained
Data lakes and data warehouses are both commonly used in enterprises. Here are the main differences between them to help you decide which is best for your data needs. Continue Reading
By- Bridget Botelho, Editorial Director, News
-
Definition
29 Dec 2022
DataOps
DataOps is an Agile approach to designing, implementing and maintaining a distributed data architecture that will support a wide range of open source tools and frameworks in production. Continue Reading
-
Feature
05 Dec 2022
What is a data warehouse analyst?
Data warehouse analysts help organizations manage the repositories of analytics data and use them effectively. Here's a look at the role and its responsibilities. Continue Reading
-
News
30 Nov 2022
AWS adds data quality, scalability services for cloud data
The cloud giant expanded its data portfolio with a series of features designed to help organizations more easily scale database and data warehouse deployments. Continue Reading
-
News
29 Nov 2022
AWS expands cloud data options with Amazon DataZone
The cloud computing giant at its AWS re:Invent 2022 conference introduced a series of new capabilities to help organizations better integrate and manage data across services. Continue Reading
-
News
28 Nov 2022
Alation Connected Sheets extends data intelligence platform
The data intelligence vendor's Connected Sheets lets spreadsheet users directly pull in data sets from a data catalog to improve data governance and visibility. Continue Reading
-
News
07 Nov 2022
Snowflake data cloud adds Python, multi-cloud collaboration
The cloud data vendor released preview updates to its platform to accelerate data queries, better support multi-cloud operations and boost developer productivity. Continue Reading
-
Feature
04 Oct 2022
How to design a data architecture for business success
To gain business value from data, enterprises need to get their data architecture right – and the right business leadership and culture is critical to that Continue Reading
-
Feature
23 Sep 2022
How Lufthansa is flying its data warehouse to the cloud
Moving from an on-premises data system to the cloud can be a complex operation. Lufthansa is looking to remove some of the complexity with virtualization. Continue Reading
-
News
30 Aug 2022
Alation adds Snowflake service, updates data catalog
The vendor launched the Alation Cloud Service for Snowflake designed to enable Snowflake users to more easily use Alation's data intelligence capabilities. Continue Reading
-
News
17 Aug 2022
Cloudera users get fully managed data lakehouse platform
The vendor is expanding its set of offerings with the launch of CDP One, a service initially available only on AWS that enables serverless deployment in the cloud. Continue Reading
-
News
11 Aug 2022
GridGain, Apache Ignite founder talks in-memory databases
Nikita Ivanov details the origin of his company and discusses the growing need organizations have for real-time database processing capabilities to complete modern transactions. Continue Reading
-
Feature
09 Aug 2022
A look at Presto, Trino SQL query engines
The co-creator of the open source project at Facebook reflects on 10 years of growth as he helps lead one of its resulting tools into the future. Continue Reading
-
News
23 Jun 2022
Starburst acquires Varada to accelerate data lake queries
After a year of partnering, the data lake query vendor decided to acquire fellow Trino SQL query engine supporter Varada to help boost query performance. Continue Reading
-
News
14 Jun 2022
Yellowbrick 6 advances cloud data warehouse deployments
The data warehouse vendor is growing its hybrid data warehouse capabilities with version 6.0 of its namesake platform that is now enabled to run on AWS. Continue Reading
-
News
14 Jun 2022
Snowflake Data Cloud expands with Unistore Hybrid Tables
Snowflake adds more capabilities, including support for Apache Iceberg data lake tables and both transactional and analytics workloads, with Hybrid Tables. Continue Reading
-
Tip
24 May 2022
How to evaluate and optimize data warehouse performance
Organizations build data warehouses to satisfy their information management needs. Data warehouse optimization can help ensure that these warehouses achieve their full potential. Continue Reading
By -
Tip
17 May 2022
6 key steps to develop a data governance strategy
Data governance shouldn't be built around technology, but the other way around. Existing infrastructure, executive support, data literacy, metrics and proper tools are essential. Continue Reading
By- Donald Farmer, TreeHive Strategy
-
Tip
13 May 2022
7 best practices for successful data governance programs
A comprehensive, companywide data governance program strengthens data infrastructure, improves compliance initiatives, supports strategic intelligence and boosts customer loyalty. Continue Reading
By- Donald Farmer, TreeHive Strategy
-
News
28 Apr 2022
Ocient Hyperscale Data Warehouse scales data operations
The data warehouse vendor is targeting enterprises that need to use a trillion rows of data or more for analysis, with hyperscale technology that is now ready for broader adoption. Continue Reading
-
News
21 Apr 2022
Databricks aims data lakehouse at media and entertainment
Gaming vendor Sega is using the data vendor's technology to unify its data for sales as well as game balancing to enable players to get the best experience. Continue Reading
-
News
12 Apr 2022
Arcion brings change data capture platform to the cloud
Getting data out of one system and into another in the right format as quickly as possible is a challenge the Arcion Cloud service is aiming to address for organizations. Continue Reading
-
News
06 Apr 2022
Google grows cloud capabilities with BigLake data lakehouse
Google is continuing to build out its cloud data services with a new approach that will serve as a central cloud service for unifying data across different clouds and formats. Continue Reading
-
News
05 Apr 2022
Databricks enables Delta Live Tables in data lakehouse
Databricks is looking to make it easier for users to build data pipelines to extract, transform and load data from any source using a Spark SQL data query. Continue Reading
-
News
24 Mar 2022
DataStax Astra DB integrates real-time data streaming
The database-as-a-service vendor advanced the change data capture capabilities of its cloud database with technology from its Apache Pulsar-based streaming service. Continue Reading
-
Feature
10 Mar 2022
Cloud data platforms spark partners' consulting business
Service providers can find a range of consulting opportunities as they help clients navigate a plethora of emerging data management platforms and associated tools. Continue Reading
By- John Moore, Industry Editor
-
News
09 Mar 2022
Databricks extends data lakehouse platform to healthcare
Healthcare data exists in widely varying formats. Getting it into a data lake where it can be used for analytics and machine learning is a challenge Databricks is looking to meet. Continue Reading
-
News
08 Mar 2022
DataStax scales K8ssandra for cloud-native Cassandra
The real-time data vendor is out with a new release of its K8ssandra Kubernetes operator that enables organizations to deploy, configure and run the Apache Cassandra database. Continue Reading
-
News
02 Mar 2022
Dremio opens up data lakehouse with new engine
The data lakehouse vendor is expanding its cloud platform with a new SQL query engine and data metastore for data lakes that builds on top of the Apache Iceberg table format. Continue Reading
-
Definition
23 Feb 2022
operational data store (ODS)
An operational data store (ODS) is a type of database that's often used as an interim logical area for a data warehouse. Continue Reading
By- Ben Lutkevich, Site Editor
-
Feature
26 Jan 2022
NLP and AI boost the automated data warehouse
Businesses are working to automate as many elements of their data warehouses as they can through nascent tools like augmented analytics and natural language processing. Continue Reading
By -
Tip
30 Dec 2021
Top 5 elements needed for a successful data warehouse
While conventional data warehouses may struggle to keep up with growing volumes of data, these five elements best give the ability to tap into valuable BI. Continue Reading
-
Definition
24 Nov 2021
tree structure
A tree data structure is an algorithm for placing and locating files (called records or keys) in a database. Continue Reading
-
News
23 Nov 2021
American Airlines flies its data warehouse to the cloud
The pandemic hit American Airlines hard, but its management took an optimistic view, seeing at as an opportunity to use data more effectively to improve operations. Continue Reading
-
News
16 Nov 2021
Snowflake grows cloud data platform with unstructured data
The cloud data vendor's winter release updates its data platform with new capabilities to enable organizations to query and manage more data types across different environments. Continue Reading
-
Definition
02 Nov 2021
What is a data mart (datamart)?
A data mart is a repository of data that is designed to serve a particular community of knowledge workers. Continue Reading
-
News
27 Oct 2021
Informatica goes public again as data management grows
Informatica has transformed in recent years from an on-premises software vendor to a SaaS-based subscription model in the cloud as new services for data have emerged. Continue Reading
-
Tip
19 Oct 2021
The challenges of cloud data management
Cloud platforms are expanding rapidly, causing organizations to face new cloud management challenges keeping pace with cloud data management advancements. Continue Reading
By -
News
14 Sep 2021
Snowflake aims at financial services with data cloud
Snowflake launches its new Data Cloud offering as it continues to expand the range of applications and capabilities for its customers, including Western Union and Capital One. Continue Reading
-
Tip
24 Aug 2021
6 strategies to tap into data warehouse BI
Data warehouse BI benefits include data storage, summarization and transformation and can be unlocked with these six strategies leveraging cloud architectures. Continue Reading
-
Feature
17 Aug 2021
Bill Inmon's data warehouse approach tackles text analysis
Learn the fine points of a concept at the heart of 'The Textual Warehouse' a new book that aims to help organizations profit through textual analysis. Continue Reading
By- Technics Publications, Technics Publications
-
News
10 Aug 2021
Kyligence 4.5 adds Clickhouse to Intelligent Data Cloud
The open source Clickhouse OLAP database joins the Apache Kylin analytical data warehouse in a new update that provides a platform for different types of data queries. Continue Reading
-
News
24 Jun 2021
Firebolt raises $127M to fuel cloud data warehouse efforts
Firebolt looks to grow in the cloud data warehouse market with a proprietary file format and a focus on enabling developers to build data-driven apps on top of cloud data lakes. Continue Reading
-
News
27 May 2021
Google Datastream advances change data capture in the cloud
At Google's Data Cloud Summit, a new service released in preview to enable users to capture data from other sources to bring into Google's cloud data and analytics services. Continue Reading
-
News
07 Apr 2021
Yellowbrick Manager embraces Kubernetes for data warehouse
Yellowbrick is building out a new unified control plane to help users manage distributed cloud data warehouse deployments. The vendor also advanced its data lake integration. Continue Reading
-
Tip
05 Apr 2021
Data warehouse environment modernization tools and tips
A data warehouse environment is made up of many tools and systems. Read on to learn the history of the modern data warehouse and how they're currently evolving. Continue Reading
By- Andy Hayler, Information Difference
-
News
26 Mar 2021
Presto users detail what's next for open source SQL engine
The open source Presto project is gaining adoption beyond just Uber and Facebook as the need to connect and query disparate sources of data continues to be in demand. Continue Reading
-
News
17 Mar 2021
Oracle Autonomous Data Warehouse updated with new data tools
Oracle is updating its cloud data warehouse platform with new tools that aim to enable users to more easily benefit from data analytics and machine learning predictions. Continue Reading
-
News
16 Feb 2021
Matillion raises $100M for ETL to enable data middleware
The co-founder and CEO of Matillion, talks about his company’'s latest round of Series D funding and how data will be the key to success for many organizations in 2021. Continue Reading
-
News
01 Feb 2021
Databricks fueling data lakehouse goals with $1B funding round
Databricks gets a major vote of confidence from big name investors, including Franklin Templeton, as it raises new funding to help it advance its cloud data management technology efforts. Continue Reading
-
News
21 Jan 2021
Kyligence builds out data cloud for OLAP and big data
Kyligence is advancing the Apache Kylin project with a cloud-native offering that can help organizations more efficiently execute and manage data queries against large data sets. Continue Reading
-
News
08 Jan 2021
Starburst raises $100M as PrestoSQL rebrands as Trino
Starburst is advancing its enterprise data platform with new funding that will help advance its cloud efforts around the open source Trino query engine formerly known as PrestoSQL. Continue Reading
-
Guest Post
17 Dec 2020
Apache Pulsar vs. Kafka and other data processing technologies
David Kjerrumgaard looks at how the distributed messaging platform Apache Pulsar handles storage compared to Apache Kafka and other data processing technologies. Continue Reading
By- David Kjerrumgaard
-
News
15 Dec 2020
AWS expands cloud databases with data virtualization
At AWS re:invent 2020 the public cloud giant unveiled enhancements to its database and analytics portfolio, including the Babelfish project for migrating from SQL Server to PostgreSQL. Continue Reading
-
News
20 Nov 2020
Yellowbrick data warehouse update boosts workload management
Hybrid cloud data warehouse vendor updates platform with self-healing cluster capabilities and a "penalty box" feature to improve workload management. Continue Reading
-
News
17 Nov 2020
Snowflake builds out data cloud with Snowpark
Snowflake is continuing to grow the capabilities of its cloud architecture with new developer and security capabilities that aim to help users optimize information. Continue Reading
-
News
14 Oct 2020
Idera targets cloud data lake market with Qubole acquisition
Qubole joins the Idera Inc. group of software companies, bringing data pipeline capabilities into the group to target customers that need more cloud data storage. Continue Reading
-
News
07 Oct 2020
SODA Foundation improves Open Data Framework
An open source data management group enhanced its Open Data Framework with the Greenland 1.1 update, improving multi-cloud file support and expanding connectivity to data storage options. Continue Reading
-
News
16 Sep 2020
Explosive Snowflake IPO advances cloud data warehouse fortunes
Snowflake goes public in a big way on the New York Stock Exchange, raising more than $3 billion as enterprises increasingly move data and analytics workloads to the cloud. Continue Reading
-
News
26 Aug 2020
Snowflake IPO shows strength of cloud data warehouse
The cloud data warehouse services vendor files for an IPO as it looks to build out and expand its data portfolio to help organizations manage and analyze data assets. Continue Reading
-
News
23 Jul 2020
Starburst advances Presto to handle Hadoop data better
Enterprise Presto SQL vendor Starburst updated its data query platform with expanded support for legacy Hadoop workloads as well as modern cloud data lake deployments. Continue Reading
-
News
14 Jul 2020
BigQuery Omni enables Google multi-cloud data analytics
Google BigQuery data analytics users can now query data across clouds from multiple providers, including AWS and Microsoft Azure, without the need to first move the data. Continue Reading