What are the differences between data mining, data warehousing and data querying?The definitions of data warehousing,...
data mining and data querying can be confusing because they are related. Learn the differences between the terms below.
A data warehouse is a repository of data designed to facilitate information retrieval and analysis. The data contained within a data warehouse is often consolidated from multiple systems making analysis across those systems quicker and easier.
Data mining and data querying represent two methods of retrieval and analysis.
Data querying is the process of asking questions of data in search of a specific answer. Unlike many forms of search (i.e. Google), queries are normally structured and require specific parameters or code, known as SQL (Structured Query Language). A query could be written to answer questions like, "How many items were sold in Region 2 last month?"
Data mining is the process of sorting through large amounts of data to identify patterns and relationships using statistical algorithms. These relationships may help us to understand which factors affected the outcome of something, or they may be used to predict future outcomes. Data mining might be used to answer questions like, "What factors affected sales in Region 2 last month?" Knowing which factors drive sales in the past could help to predict or make estimates about sales in the future.
More about data warehousing
More about data mining
Dig Deeper on Data science and analytics
Related Q&A from William McKnight
Learn the advantages and disadvantages of data mining tools and of implementing data mining technology. Continue Reading
Learn about the debate between the Ralph Kimball and Bill Inmon approaches to data warehouse design. Continue Reading
Find out the major difference between business intelligence (BI) in the public sector and BI in the private sector and get an expert's take on trends... Continue Reading