Get started
Bring yourself up to speed with our introductory content.
Get started
Bring yourself up to speed with our introductory content.
data fabric
A data fabric is an architecture and software offering a unified collection of data assets, databases and database architectures within an enterprise. Continue Reading
data architect
A data architect is an IT professional responsible for defining the policies, procedures, models and technologies to be used in collecting, organizing, storing and accessing company information. Continue Reading
NoSQL database types explained: Column-oriented databases
Learn about the uses of column-oriented databases and the large data model, data warehouses and high-performance querying benefits the NoSQL database brings to organizations. Continue Reading
-
How to choose exactly the right data story for your audience
A data practitioner has two jobs: tell the right data story and in the right way to win over project stakeholders, data expert Larry Burns says in his latest book. Continue Reading
Quiz: Test your understanding of the Hadoop ecosystem
This quiz will test your knowledge of Hadoops basics including framework, capabilities and related technologies. Continue Reading
stream processing
Stream processing is a data management technique that involves ingesting a continuous data stream to quickly analyze, filter, transform or enhance the data in real time.Continue Reading
7 data modeling techniques and concepts for business
Three types of data models and various data modeling techniques are available to data management teams to help convert data into valuable business information.Continue Reading
9 steps to a dynamic data architecture plan
Learn the nine steps to a comprehensive data architecture plan, including C-suite support, data personas, user needs, governance, catalogs, SWOT, lifecycles, blueprints and maps.Continue Reading
How to build a successful cloud data architecture
As enterprises vacate the premises and migrate their operations skyward, a cloud data architecture can provide the long-term flexibility to improve workflows, costs and security.Continue Reading
columnar database
A columnar database is a database management system (DBMS) that stores data in columns instead of rows.Continue Reading
-
relational database
A relational database is a collection of information that organizes data points with defined relationships for easy access.Continue Reading
Db2
Db2 is a family of database management system (DBMS) products from IBM that serve a number of different operating system (OS) platforms.Continue Reading
hashing
Hashing is the process of transforming any given key or a string of characters into another value.Continue Reading
spatial data
Spatial data is any type of data that directly or indirectly references a specific geographical area or location.Continue Reading
query
A query is a question or a request for information expressed in a formal manner. In computer science, a query is essentially the same thing, the only difference is the answer or retrieved information comes from a database.Continue Reading
schema
In computer programming, a schema (pronounced SKEE-mah) is the organization or structure for a database, while in artificial intelligence (AI) a schema is a formal expression of an inference rule.Continue Reading
information
Information is stimuli that has meaning in some context for its receiver. When information is entered into and stored in a computer, it is generally referred to as data.Continue Reading
RFM analysis (recency, frequency, monetary)
RFM analysis is a marketing technique used to quantitatively rank and group customers based on the recency, frequency and monetary total of their recent transactions to identify the best customers and perform targeted marketing campaigns.Continue Reading
denormalization
Denormalization is the process of adding precomputed redundant data to an otherwise normalized relational database to improve read performance of the database.Continue Reading
Building a big data architecture: Core components, best practices
To process the infinite volume and variety of data collected from multiple sources, most enterprises need to get with the program and build a multilayered big data architecture.Continue Reading
Establish big data integration techniques and best practices
A big data integration strategy departs from traditional techniques, embraces several data processes working together and accounts for the volume, variety and velocity of data.Continue Reading
Who belongs on a high-performance data governance team?
Putting together a high-quality data governance team can be a challenge. Explore the necessary team members and best practices for a high-performing team.Continue Reading
NoSQL (Not Only SQL database)
NoSQL is an approach to database management that can accommodate a wide variety of data models, including key-value, document, columnar and graph formats.Continue Reading
How parallelization works in streaming systems
Dive into this book excerpt from 'Grokking Streaming Systems' and learn the crucial role the parallelization process plays in the design of a streaming system.Continue Reading
How a DataOps pipeline can support your data
DataOps has created a lot of hype as a data management pipeline because of its focus on collaboration and flexibility. Read on to find out how these priorities support your data.Continue Reading
data structures
A data structure is a specialized format for organizing, processing, retrieving and storing data.Continue Reading
Hadoop Distributed File System (HDFS)
The Hadoop Distributed File System (HDFS) is the primary data storage system used by Hadoop applications.Continue Reading
Open source database migration guide: How to transition
Open source database transitions have been on the rise as they prove to be worthy competitors to commercial database options, but that transition requires strategy and user buy-in.Continue Reading
Why your data story matters and how to tell it
Data storytelling isn't just for business analysts. Find out how to build a data management story and why you need to have one in the first place.Continue Reading
Creating a data advantage by building a data ecosystem
Developing a data ecosystem will improve personalization and customer retention. Find out how data mining across channels can build a data advantage for your organization.Continue Reading
Enterprise data lakes hold the key to actionable insights
Technological pillars of sound business decisions, AI, machine learning and advanced analytics depend on the quantity, quality and integrity of information in data lakes.Continue Reading
Graph database vs. relational database: Key differences
Relational databases and graph databases both focus on the relationships between data but not in the same ways. Here are some key differences between the two.Continue Reading
feature engineering
Feature engineering is the process that takes raw data and transforms it into features that can be used to create a predictive model using machine learning or statistical modeling, such as deep learning.Continue Reading
Top 5 U.S. open data use cases from federal data sets
The U.S. government has made data sets from many federal agencies available for public access to use and analyze. Check out some of the ways that data is being used.Continue Reading
Quiz on MongoDB 4 new features and database updates
Check out this excerpt from the new book Learn MongoDB 4.x from Packt Publishing, then quiz yourself on new updates and features to the database.Continue Reading
Google BigQuery
Google BigQuery is a cloud-based big data analytics web service for processing very large read-only data sets.Continue Reading
Why understanding data structures is so important to coders
Jay Wengrow talks about how his new book on data structures and algorithms and considerations for making your choices as efficient as possible.Continue Reading
Key steps in the feature engineering process
Feature engineering is key to machine learning algorithms. Read on to learn how those features are created and chosen to increase the accuracy of those models.Continue Reading
Apache Hadoop YARN
Apache Hadoop YARN is the resource management and job scheduling technology in the open source Hadoop distributed processing framework.Continue Reading
data aggregation
Data aggregation is any process whereby data is gathered and expressed in a summary form.Continue Reading
How to ensure your data lake security
Your data lake is full of sensitive information and securing that data is a top priority. These are the best practices to keep that information safe from hackers.Continue Reading
When a DIY database management system design is the best fit
Learn how a combination of homegrown, off-the-shelf and open source tools, plus proper motivation, can yield a DIY DBMS that meets corporate expectations, needs and ROI.Continue Reading
Building a database application the DIY way
Business users experience the trials, tribulations and exultations of building a DIY DBMS, especially when IT expertise is not readily available or costs are too high.Continue Reading
Developing an enterprise data strategy: 10 steps to take
Consultants detail 10 to-do items for data management teams looking to create a data strategy to help their organization use data more effectively in business operations.Continue Reading
corporate performance management (CPM)
Corporate performance management (CPM) is a term used to describe the various processes and methodologies involved in aligning an organization's strategies and goals to its plans and executions in order to control the success of the company.Continue Reading
Extract, Load, Transform (ELT)
Extract, Load, Transform (ELT) is a data integration process for transferring raw data from a source server to a data system (such as a data warehouse or data lake) on a target server and then preparing the information for downstream uses.Continue Reading
Data warehousing design and value change with the times
Big data, the cloud and analytics profoundly shape data warehouse purpose and design. Learn how companies derive value from a repository that at times needs definition.Continue Reading
RDBMS (relational database management system)
A relational database management system (RDBMS) is a collection of programs and capabilities that enable IT teams and others to create, update, administer and otherwise interact with a relational database.Continue Reading
Third-party database tools boast attractive alternatives
For companies considering third-party database tools, this handbook provides expert advice on evaluating and deploying on-premises and cloud options from third parties.Continue Reading
T-SQL (Transact-SQL)
T-SQL (Transact-SQL) is a set of programming extensions from Sybase and Microsoft that add several features to the Structured Query Language (SQL), including transaction control, exception and error handling, row processing and declared variables.Continue Reading
database normalization
Database normalization is intrinsic to most relational database schemes. It is a process that organizes data into tables so that results are always unambiguous.Continue Reading
pivot table
A pivot table is a statistics tool that summarizes and reorganizes selected columns and rows of data in a spreadsheet or database table to obtain a desired report.Continue Reading
data
In computing, data is information that has been translated into a form that is efficient for movement or processing.Continue Reading
SQL Server database design best practices and tips for DBAs
Good database design is a must to meet processing needs in SQL Server systems. In a webinar, consultant Koen Verbeeck offered advice on how to make that happen.Continue Reading
Big data containers gain wider appeal in system deployments
This handbook examines the use of Docker containers in Kubernetes clusters to run big data systems and offers insight on container deployment and management issues.Continue Reading
Data virtualization tools promote anywhere, anytime data access
This online handbook examines data virtualization software and how organizations are deploying and using the technology as part of their data integration processes.Continue Reading
Check SQL Server Query Store performance impact before using
Many IT teams hesitate to use SQL Server Query Store due to performance concerns. Consultant Andy Warren offers tips on how to test and get started with Query Store.Continue Reading
Data as a Service (DaaS)
Data as a Service (DaaS) is an information provision and distribution model in which data files (including text, images, sounds, and videos) are made available to customers over a network, typically the Internet.Continue Reading
Advice on enterprise data cleansing from an SAP VP
SAP's Kristin McMahon details data cleansing best practices and explains why a good data cleanse needs continual communication, collaboration and oversight.Continue Reading
Data model design tips to help standardize business data
Data models should be understandable to business users and kept to a reasonable scope, say the leaders of a data modeling initiative at England's Environment Agency.Continue Reading
USAA adds data engineering skills to speed data science work
When the United Services Automobile Association's data science team wasn't getting data in the right format, the team lead realized the USAA needed more data engineers.Continue Reading
5 things to know about deploying big data systems in data containers
Planning for security and container APIs, and watching out for infrastructure sprawls are some issues to be aware of before deploying big data in containers.Continue Reading
DataOps is more than DevOps for data, Delphix CTO says
Data operations is young compared to DevOps, but it is increasingly used as part of projects that put data at the center of development. Here, Delphix CTO Eric Schrock makes observations about the trend.Continue Reading
HR makes major strides toward improving employee engagement
5 FAQs on SQL Server containers and how to manage them
Running SQL Server in containers creates new challenges for database administrators. The answers to these questions can guide you through some of them.Continue Reading
The Power BI-PowerShell cmdlet cheat sheet
DBAs can manage Power BI data sets, workspaces and reports with PowerShell. Using the two tools together makes for a more efficient and effective workflow.Continue Reading
Azure Data Studio (formerly SQL Operations Studio)
Azure Data Studio is a Microsoft tool, originally named SQL Operations Studio, for managing SQL Server databases and cloud-based Azure SQL Database and Azure SQL Data Warehouse systems.Continue Reading
SQL vs. NoSQL: What do you know about the database designs?
The decision to use a SQL database or a NoSQL database can be made wisely only if the ins and outs of both are understood. See how well you know the database architectures.Continue Reading
11 features to look for in data quality management tools
As the need for quality data has increased, so have the capabilities of data quality tools. Learn how collaboration, data lineage and other features enable data quality.Continue Reading
AI for analytics augments and bolsters business intelligence
What is an enterprise data strategy?
Defining a data strategy can help focus an organization's data management initiatives -- but it isn't the same as data governance. Expert Anne Marie Smith explains why.Continue Reading
customer data integration (CDI)
Customer data integration (CDI) is the process of defining, consolidating and managing customer information across an organization's business units and systems to achieve a "single version of the truth" for customer data.Continue Reading
5 to-dos for your GDPR compliance checklist
It's never too late to fine-tune your GDPR strategy. Expert Anne Marie Smith suggests a current state analysis of your PII protections, drafting a data privacy policy and more.Continue Reading
2 ways to attach SQL Server database files to Linux containers
SQL Server files can be stored outside of Docker containers in host directories or volumes. Here's how to set up SQL Server on Linux databases and attach them to containers.Continue Reading
Cloud vs. legacy ERP systems: Tug of war intensifies for SMBs
Aging legacy ERP systems at SMBs seem to be getting plenty of scrutiny these days. Heightened consumer demands, shifting technology landscapes and relentless market disruptions, not to mention maintenance costs, technical support and obsolescence, ...Continue Reading
How to attach databases to custom SQL Server containers
Deploying SQL Server in Docker containers for production applications typically requires custom containers. Here are guidelines on how to attach databases to them.Continue Reading
Good data quality for machine learning is an analytics must
As companies add machine learning applications, they need to really understand -- and be able to improve -- their data. That's where data quality initiatives come in.Continue Reading
Six sample databases for SQL Server and how to find them
SQL Server sample databases are useful for test and dev, but they can be difficult to parse. Use this SQL database sample overview to decide which to use and how to access them.Continue Reading
The benefits of columnar storage and the Parquet file format
What's behind Apache Parquet's growing popularity? It may be the file format's columnar storage orientation, which leads to benefits including improved query performance.Continue Reading
Four first steps for customer data management
Forrester's Mike Gualtieri details how to develop a unified plan to manage customer data that gives business users what they need to manage CRM programs.Continue Reading
Three factors for protecting sensitive data in the GDPR era
Data privacy is a hot topic nowadays thanks to GDPR and the Facebook data scandal. But how do data security, access control and data protection differ?Continue Reading
What's the difference between DDL and DML?
What's the difference between DDL and DML? Get the answer and see examples of data manipulation language and data definition language commands for SQL databases.Continue Reading
What goes into a customer analytics data integration framework
Customer data integration is a minefield for IT teams to navigate. But incorporating a set of core technical functions into an integration architecture can ease the process.Continue Reading
Google Cloud data lake fuels cloud payment processing flow
To create a cloud payment processing system, Global Payments first had to deploy a data lake in the Google Cloud. Getting quick user feedback was another early step.Continue Reading
Develop smart AI in CRM strategies to win and keep customers
Of the three words that comprise customer relationship management, one word binds the other two. As necessity and competition dictate that CRM upgrade itself with artificial intelligence and flights to the cloud, what counts most in ...Continue Reading
GDPR compliance requirements drive new winds of data privacy
Hello, GDPR. May 25 is the witching hour for enforcement of the EU's much-discussed GDPR compliance requirements -- and it's a harbinger of more changes to come.Continue Reading
Why running SQL Server on Docker is no longer frowned upon
Microsoft now lets SQL Server databases run in Docker containers, a capability that depends on using volumes to store data in a persistent way outside the containers.Continue Reading
What does the GDPR definition of personal data include?
The definition of personal data in the EU's GDPR data protection rules is broad enough to include any type of data that can be used to directly or indirectly identify a person.Continue Reading
U-SQL
U-SQL is a Microsoft query language that combines a declarative SQL-like syntax with C# programming, enabling it to be used to process both structured and unstructured data in big data environments.Continue Reading
Data expert: GDPR deadline is an opportunity, not a burden
There is stress as the EU's General Data Protection Regulation compliance deadline nears, but the GDPR privacy movement is a good thing for data policies, advises consultant Daragh O Brien.Continue Reading
TensorFlow
TensorFlow is an open source framework developed by Google researchers to run machine learning, deep learning and other statistical and predictive analytics workloads.Continue Reading
Hyperledger Fabric offers path to enterprise blockchain future
Blockchain arose from bitcoin, but it's looking to find a place in the enterprise. Frameworks like Hyperledger Fabric could smooth out the technology's rough side for business uses.Continue Reading
Data lake concept needs firm hand to pay big data dividends
Data lakes pose technology deployment and data management challenges that can leave analytics users high and dry if the implementation process isn't handled properly.Continue Reading
Slow to gain traction, AI apps on the verge of explosion
From chatbots ("Can I help you?") to killer bots ("I'll be back."), artificial intelligence runs the gamut of applications and emotions like no other technology. It's been nearly 70 years since AI first came into consciousness with humankind, yet ...Continue Reading
Three ways to turn old files into Hadoop data sets in a data lake
Hadoop data lakes offer a new home for legacy data that still has analytical value. But there are different ways to convert the data for use in Hadoop depending on your analytics needs.Continue Reading
Hadoop data lake
A Hadoop data lake is a data management platform comprising one or more Hadoop clusters.Continue Reading
How AI and IoT will influence data management in 2018
AI and IoT will alter the data management landscape in 2018, according to analyst James Kobielus. AI will need regular updates, and DevOps will become more prevalent as a result.Continue Reading
MariaDB
MariaDB is an open source relational database management system (DBMS) that is a compatible drop-in replacement for the widely used MySQL database technology.Continue Reading
Microsoft SSIS (SQL Server Integration Services)
Microsoft SSIS (SQL Server Integration Services) is an enterprise data integration, data transformation and data migration tool built into Microsoft's SQL Server database.Continue Reading
Microsoft Visual FoxPro (Microsoft VFP)
Microsoft Visual FoxPro (VFP) is an object-oriented programming environment with a built-in relational database engine.Continue Reading