Data lake	Data warehouse
Supported data types	Data lakes can handle a combination of structured, semistructured and unstructured data, which is commonly stored in its native format to make the full sets of raw data available for analysis.	Data warehouses typically store structured data from transaction processing systems and other business applications. In most cases, organizations cleanse and curate the data before loading it in a data warehouse.
Analytics uses	Organizations primarily use data lakes for data science applications that involve machine learning, predictive modeling and other advanced analytics techniques. Analytics goals aren't always predefined.	Data warehouses support less-complex BI, ad hoc analysis, reporting and data visualization applications, usually with a predefined purpose for analyzing business operations and tracking KPIs.
Users	Data scientists and lower-level data analysts are the primary users of data lakes. Data engineers often support them by building data pipelines and helping to prepare data for analysis as needed.	Business analysts, executives and operational workers use data warehouses through self-service BI tools. Alternatively, BI analysts and developers run queries in data warehouses for business users.
Data processing methods	Data lakes support traditional extract, transform and load (ETL) processes, but organizations are more likely to use extract, load and transform (ELT), where they load raw data first and transform it later for specific needs.	Data teams commonly use ETL processes for data integration and preparation in data warehouses. They finalize the data structure before loading data sets to support planned BI and analytics applications.
Schema approach	Data teams can define the schema for data sets after they're stored in a data lake, using a schema-on-read approach.	Data teams define schemas in data warehouses before loading data sets, following schema-on-write practices.
Data storage	Data is typically stored in platforms other than relational databases, such as the Hadoop Distributed File System, cloud object storage services or NoSQL databases.	Organizations commonly store data in relational databases using conventional disk storage. They can also build data warehouses on columnar databases, similar to disk storage.
Costs	Hardware costs can be less expensive because data lakes use lower-cost servers and storage. Data management might cost less, too. But the large size of some data lakes can erase the cost advantages.	In general, the large servers and disk storage systems required for data warehouses make them more expensive to deploy than data lakes. Managing a data warehouse can also be more costly.
Business benefits	Data lakes enable data science teams to analyze diverse sets of structured and unstructured data and create analytical models that provide insights for strategic planning and business decision-making.	Organizations use data warehouses as a centralized repository of consolidated and curated data sets to analyze business performance and support operational decisions.

What is data management and why is it important? Full guide

What is a data lake?

What is a data warehouse?

Data lake vs. data warehouse: 8 important differences

Selecting the right platform based on organizational goals

Next Steps

Related Resources

Dig Deeper on Data management strategies

Data warehouse vs. data mart: Key differences and use cases

What is a data lake?

What is AWS Glue?

Dremio: Understanding Apache Iceberg (the data lakehouse backbone)