TechTarget.com/searchaws

https://www.techtarget.com/searchaws/definition/Amazon-Elastic-MapReduce-Amazon-EMR

Amazon EMR (Elastic MapReduce)

By TechTarget Contributor

What is Amazon EMR?

Amazon EMR (previously known as Amazon Elastic MapReduce) is an Amazon Web Services (AWS) tool for big data processing and analysis. Amazon markets EMR as an expandable, low-configuration service that provides an alternative to running on-premises cluster computing.

Amazon EMR is based on Apache Hadoop, a Java-based programming framework that supports the processing of large data sets in a distributed computing environment. Using MapReduce, a core component of the Hadoop software framework, developers can write programs that process massive amounts of unstructured data across a distributed cluster of processors or standalone computers. It was developed by Google for indexing webpages and replaced its original indexing algorithms and heuristics in 2004.

Amazon EMR processes big data across a Hadoop cluster of virtual servers on Amazon Elastic Compute Cloud (EC2) and Amazon Simple Storage Service (S3). The Elastic in EMR's name refers to its dynamic resizing ability, which enables administrators to increase or reduce resources, depending on their current needs.

Amazon EMR is used for data analysis in log analysis, web indexing, data warehousing, machine learning (ML), financial analysis, scientific simulation and bioinformatics. It also supports workloads based on Apache Spark, Apache Hive, Presto and Apache HBase -- the latter of which integrates with Hive and Pig, which are open source data warehouse tools for Hadoop. Hive uses queries and analyzes data, and Pig offers a high-level mechanism for programming MapReduce jobs to be executed in Hadoop.

Amazon EMR use cases

There are several ways enterprises can use Amazon EMR, including:

Amazon EMR deployment options

As a cloud service, Amazon EMR can be deployed in a variety of settings, such as:

Amazon EMR features

Amazon EMR's features are designed to make the following tasks easier and more convenient for administrators and developers:

Learn how to use AWS Outposts to bring Amazon cloud services to your data center.

26 Aug 2021

All Rights Reserved, Copyright 2014 - 2025, TechTarget | Read our Privacy Statement