Apache Hadoop News
February 22, 2018
In big data news, we find Google TPUs, or Tensor Processing Units, offered as a cloud service, while LinkedIn is open sourcing a Hadoop test simulator called Dynamometer.
January 03, 2018
Is this the post-Hadoop era? Not in the eyes of Hadoop 3.0 backers, who see the latest update to the big data framework succeeding in machine learning applications and cloud systems.
December 01, 2017
Born at Cloudera, the MPP query engine known as Apache Impala has become a top-level open source project. It's one of various tools bringing SQL-style interactivity to big data analytics.
November 29, 2017
The Apache Software Foundation (ASF) has graduated Apache Impala to become a Top-Level Project (TLP). Apache Impala itself is an analytic database for Apache Hadoop, the open source software ...
Apache Hadoop Get Started
Bring yourself up to speed with our introductory content
How can CIOs support an enterprise machine learning initiative? They can start by building a data lake. Continue Reading
Data lakes pose technology deployment and data management challenges that can leave analytics users high and dry if the implementation process isn't handled properly. Continue Reading
Hadoop data lakes offer a new home for legacy data that still has analytical value. But there are different ways to convert the data for use in Hadoop depending on your analytics needs. Continue Reading
Evaluate Apache Hadoop Vendors & Products
Weigh the pros and cons of technologies, products and projects you are considering.
Before purchasing big data analytics software, companies must first identify their specific needs and then evaluate how the product features address those needs. Continue Reading
As part of its Big Data Cloud Service, Oracle provides a set of internal and external tools designed to help users efficiently deploy and manage Hadoop-based big data systems. Continue Reading
The NewSQL database was almost hidden when Hadoop and NoSQL arose. Now, as more big data teams move toward production uses, MemSQL, Cloud Spanner and similar products may get a second look. Continue Reading
Manage Apache Hadoop
Learn to apply best practices and optimize your operations.
Data security needs to be addressed upfront in deployments of big data systems -- and users are likely to find they have to build some security capabilities themselves. Continue Reading
Today, analytics work is about speed. That means rapidly building clusters and transforming and querying data. Learn how users are streamlining digital business. Continue Reading
Organizations hungry for more revenue are using Hadoop and other big data technologies to break their existing business molds and pursue new strategies and product offerings. Continue Reading
Problem Solve Apache Hadoop Issues
We’ve gathered up expert advice and tips from professionals like you so that the answers you need are always available.
The new thing in big data is Kubernetes container orchestration. While it's still early, there are signs of activity, which are cited in this edition of the Talking Data podcast. Continue Reading
Flooding a Hadoop cluster with data that isn't organized and managed properly can stymie analytics efforts. Take these steps to help make your data lake accessible and usable. Continue Reading
In the era of more and more digital orders, Panera Bread encountered big data challenges that led the restaurant chain to deploy a new cluster architecture with Hadoop, Spark and other technologies. Continue Reading