
Amazon EMR

4.0
(45)

Amazon EMR is a web-based service that simplifies big data processing, providing a managed Hadoop framework that makes it easy, fast, and cost-effective to distribute and process vast amounts of data across dynamically scalable Amazon EC2 instances.


Amazon EMR Reviews

Showing 45 Amazon EMR reviews
Amazon EMR review by User in Luxury Goods & Jewelry
User in Luxury Goods & Jewelry
Validated Reviewer

"AWS EMR at a glance"

What do you like best?

EMR does well in managing cost, as it uses the task node cores to process the data, and these instances are cheaper when the data is stored on S3. It is really cost-efficient. There is no need to maintain any libraries to connect to AWS resources.

What do you dislike?

There is no UI client for saving workbooks or code snippets. Everything has to go through the job submission process, which is also not convenient for tracking jobs.

What business problems are you solving with the product? What benefits have you realized?

EMR is suited to jobs that are long running and don't really need much monitoring. EMR is really flexible in processing data on S3, as a developer doesn't need to spend time debugging the connections to S3 from a big data framework; most of the configuration is taken care of by Amazon. It is very cheap compared to most of the solutions on the market, and the ready-to-go configuration at launch time reduces the time required for admin tasks. So, considering the low cost, the processing options on S3, and scalability via adding task nodes, EMR is a good fit for startups considering open source and cost-efficient options.

Amazon EMR review by Chris H.
Chris H.
Validated Reviewer
Verified Current User

"We've moved our hadoop processing here"

What do you like best?

Ease of spinning up clusters on-demand to save $$$

What do you dislike?

Not much! We had a fairly easy time getting Spark on EMR running. It was a little troublesome at first, but once we learned the platform, we love it!

What business problems are you solving with the product? What benefits have you realized?

We have moved our Hadoop/Sqoop data processing here. This allows us to offload the MV refresh process we used to have in place inside Oracle, thus saving us valuable throughput and DB load.

Amazon EMR review by User in Computer Software
User in Computer Software
Validated Reviewer

"Easy way to run Big data applications in the cloud"

What do you like best?

Amazon EMR is easy to set up and use. There are plenty of ways to get started, such as the AWS EMR console, or you can automate the whole thing using the AWS command line tool awscli or the boto3 API for Python. We use a combination of awscli and boto3 for automation. It provides best-in-class tools built in, and integration with other Amazon services such as S3 for data as well as log aggregation, etc. It also provides a way to use AWS EC2 Spot instances, which reduce the cost of running MapReduce jobs by 50-80% on average.

What do you dislike?

There are no major dislikes about EMR, but in general, they could provide more options to monitor the cluster, and also provide ways to troubleshoot the failed jobs and provide ways to recover failed jobs. But Amazon is going in the right direction and hope they address those things as well.

Recommendations to others considering the product

If you are on AWS and looking to run Hadoop/Spark, look no further than Amazon EMR, and it might save a lot of time in setting up your cluster, and helps you focus more on the application business logic than worrying about the infrastructure configuration and setup.

What business problems are you solving with the product? What benefits have you realized?

We use EMR primarily for data analytics and big data processing using Spark, Hadoop and we also use S3 for storing the output.
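
The boto3 automation and Spot usage this reviewer describes might look roughly like the following. This is a minimal sketch, not the reviewer's actual setup: the function names, the instance type, the release label, the log bucket, and the use of the default EMR role names are all assumptions chosen for illustration. It builds the arguments for boto3's `run_job_flow` call, placing only the task group on Spot.

```python
def build_cluster_request(name, core_count, task_count, instance_type="m5.xlarge"):
    """Build keyword arguments for boto3's EMR run_job_flow call.

    The master and core groups run On-Demand; the optional task group
    runs on Spot, since task nodes do not store HDFS data.
    """
    def group(role, market, count):
        return {
            "Name": role.lower(),
            "InstanceRole": role,
            "Market": market,
            "InstanceType": instance_type,
            "InstanceCount": count,
        }

    groups = [group("MASTER", "ON_DEMAND", 1), group("CORE", "ON_DEMAND", core_count)]
    if task_count > 0:
        groups.append(group("TASK", "SPOT", task_count))

    return {
        "Name": name,
        "ReleaseLabel": "emr-6.15.0",               # assumed release label
        "Applications": [{"Name": "Hadoop"}, {"Name": "Spark"}],
        "LogUri": "s3://example-bucket/emr-logs/",  # hypothetical bucket
        "Instances": {
            "InstanceGroups": groups,
            # Transient cluster: shut down once all steps have finished.
            "KeepJobFlowAliveWhenNoSteps": False,
        },
        "JobFlowRole": "EMR_EC2_DefaultRole",       # assumes default roles exist
        "ServiceRole": "EMR_DefaultRole",
    }


def launch_cluster(request, region="us-east-1"):
    """Submit the request. Requires boto3 and valid AWS credentials."""
    import boto3  # imported lazily so the builder works without boto3

    emr = boto3.client("emr", region_name=region)
    return emr.run_job_flow(**request)["JobFlowId"]
```

Keeping the request builder separate from the call that talks to AWS makes the configuration easy to inspect or test before any cluster is started.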

Amazon EMR review by Administrator
Administrator
Validated Reviewer

"Mapreduce for the Cloud"

What do you like best?

EMR is simple to use, configure, and maintain. The number of components available in EMR is very good and serves almost all use cases; you will rarely have to install software on your own. The ability to mix instance lifecycles (spot, on-demand, reserved) is very helpful for optimizing costs.

What do you dislike?

There isn't an easy way for setting up authorization and authentication in EMR. Cloudera does a better job in maintaining that.

Recommendations to others considering the product

EMR is the way to go if you're going to run big data workloads in AWS.

What business problems are you solving with the product? What benefits have you realized?

We run analytics workload for processing large volumes of data. Primarily the data is stored in S3. We have both transient and permanent clusters, and it is really helpful to setup and teardown the clusters using

Amazon EMR review by User
User
Validated Reviewer

"Useful cloud-based platform"

What do you like best?

The best part is its ease of use for the user. We use it for big data storage. The platform is very useful in regards to its processing and storage of big data.

What do you dislike?

It is really only suited for big data, so if the user is looking for something other than storage of big data, I don't really recommend it. It is also not for users who are not advanced in their knowledge of data storage.

What business problems are you solving with the product? What benefits have you realized?

It provides the ability to handle big data storage. It is also a great choice if you are going to set up a distributed compute platform. EMR is especially useful to startups that are thinking of using open source and cost-efficient options.

Amazon EMR review by User
User
Validated Reviewer

"Ideal tool for Analytical Based Project that used Hadoop and Apache Spark."

What do you like best?

Amazon EMR is the best option for running resource-heavy tasks. It is secure and easy to set up with its intuitive setup wizard in both basic and advanced mode.

We do not need to worry about cluster formation and handling. Leaving that to Amazon EMR, we can concentrate on designing the model to run on it.

What do you dislike?

Not much variety to choose from. EMR clusters are not highly versatile. They come with a basic application setup like Hadoop, Spark, Hive, etc., but we need to rely on bootstrap actions to download and install the required applications.

What business problems are you solving with the product? What benefits have you realized?

Amazon EMR is used for information extracting from the history and logs of our other applications.

It is very cost effective when compared to other methods.

It made data processing and analysis easy.
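
For readers unfamiliar with the bootstrap actions mentioned in this review, here is a minimal sketch of how one is described when a cluster is created through boto3's `run_job_flow`. The helper name, the script location, and its arguments are hypothetical; only the `BootstrapActions` / `ScriptBootstrapAction` structure comes from the EMR API.

```python
def bootstrap_action(name, script_path, args=()):
    """Describe one bootstrap action in the shape run_job_flow expects.

    script_path points at a shell script (typically on S3) that EMR runs
    on each node before the cluster applications start.
    """
    return {
        "Name": name,
        "ScriptBootstrapAction": {"Path": script_path, "Args": list(args)},
    }


# Hypothetical script that installs extra Python packages on every node.
actions = [
    bootstrap_action(
        "install-python-libs",
        "s3://example-bucket/bootstrap/install_libs.sh",
        ["pandas", "pyarrow"],
    )
]

# The list would then be passed alongside the other cluster settings, e.g.
# emr.run_job_flow(Name=..., Instances=..., BootstrapActions=actions, ...)
```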

Amazon EMR review by sujeet k.
sujeet k.
Validated Reviewer

"IT made easy to process big data"

What do you like best?

It allows you to work on large amounts of data and on different frameworks. EMR offers an expandable, low-configuration service as an easier alternative to running in-house cluster computing.

What do you dislike?

It's complex to use initially, and I think there should be a proper user interface for real-time use.

Recommendations to others considering the product

Good option for startup.

open source and cost efficient

saves lots of time for setup

What business problems are you solving with the product? What benefits have you realized?

Using Hadoop and Spark frameworks. For processing and storage of large data

Amazon EMR review by Kapies V.
Kapies V.
Validated Reviewer

"EMR Review"

What do you like best?

We were able to easily crunch over 400 million events every few hours. We can use Hive, which makes the learning curve much easier.

What do you dislike?

If you forget to turn off your instance, you can expect a nice heavy bill. Sizing the cluster is not easy; we tend to just guess.

What business problems are you solving with the product? What benefits have you realized?

We were crunching a lot of data that were coming through our pipeline.
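
One way to guard against the forgotten-cluster bill described above is a periodic check for clusters sitting idle. The sketch below is an assumption-laden illustration, not an official mechanism: the function names are hypothetical, and the rule is deliberately crude (a cluster counts as idle if it is in the `WAITING` state and became ready longer ago than a chosen threshold, which does not account for steps that finished recently). The input mirrors the entries boto3 returns under `list_clusters()["Clusters"]`.

```python
from datetime import datetime, timedelta, timezone


def idle_cluster_ids(clusters, max_age, now=None):
    """Return IDs of WAITING clusters that became ready more than max_age ago.

    clusters: list of dicts shaped like boto3 list_clusters()["Clusters"].
    max_age:  a timedelta threshold chosen by the operator.
    """
    now = now or datetime.now(timezone.utc)
    ids = []
    for cluster in clusters:
        status = cluster["Status"]
        if status["State"] != "WAITING":
            continue  # still running steps, starting up, or already terminating
        ready = status["Timeline"].get("ReadyDateTime")
        if ready is not None and now - ready > max_age:
            ids.append(cluster["Id"])
    return ids


def terminate_idle(region="us-east-1", max_age=timedelta(hours=4)):
    """Look up WAITING clusters and terminate the idle ones (needs boto3)."""
    import boto3  # imported lazily; requires AWS credentials

    emr = boto3.client("emr", region_name=region)
    clusters = emr.list_clusters(ClusterStates=["WAITING"])["Clusters"]
    ids = idle_cluster_ids(clusters, max_age)
    if ids:
        emr.terminate_job_flows(JobFlowIds=ids)
    return ids
```

A check like this would typically run on a schedule; the four-hour default is only a placeholder.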

Amazon EMR review by john d.
john d.
Validated Reviewer

"EMR"

What do you like best?

the ability to spin up cluster size based upon the needs of each job

What do you dislike?

error logging is sometimes hard to track down root cause

Recommendations to others considering the product

start with small jobs, take advantage of new features in newer releases

What business problems are you solving with the product? What benefits have you realized?

we are building data assets for multiple business units to share via S3 and Redshift

Amazon EMR review by Monish K.
Monish K.
Validated Reviewer

"Easy way to setup and use hadoop with flexible configurations"

What do you like best?

the configuration and node managements are flexible and the performance is good

What do you dislike?

the app logs are difficult to view, and we may need a way to terminate unused clusters

What business problems are you solving with the product? What benefits have you realized?

using MapReduce we do parallel processing, and the performance of our computation has increased

Amazon EMR review by Sivakumar D.
Sivakumar D.
Validated Reviewer
Business partner of the vendor or vendor's competitor, not included in G2 scores.

"EMR is great"

What do you like best?

I was able to quickly set up and run the workloads easily. The support team gave enough guidance to resolve issues.

What do you dislike?

Most of the features are good. One thing I felt is that it is less customizable.

What business problems are you solving with the product? What benefits have you realized?

We were able to run some ad hoc queries using Hive quickly with EMR.

Amazon EMR review by User
User
Validated Reviewer

"EMR: Easy Setup but Dispatching Work has Friction"

What do you like best?

It's very easy to set up and upgrade Spark clusters. Auto scaling with spot is also great, as it can save you lots of money on the correct workloads.

What do you dislike?

The step API does not allow you to run concurrent workloads, and it's unnecessarily difficult to set up a local client that is not located on the actual EMR cluster (i.e. dispatching work from a scheduler to a Spark cluster on EMR).

What business problems are you solving with the product? What benefits have you realized?

ETL workloads and building a data warehouse.
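
For context on the step API this reviewer mentions, the sketch below shows one common way an external scheduler might hand a Spark job to a running cluster: as a step that runs `spark-submit` through `command-runner.jar`, submitted with boto3's `add_job_flow_steps`. The helper names, script path, and arguments are hypothetical, and the sketch does not work around the concurrency limitation the review describes.

```python
def spark_step(name, script_path, script_args=(), on_failure="CONTINUE"):
    """Describe a Spark job as an EMR step that invokes spark-submit."""
    return {
        "Name": name,
        # What the cluster should do if this step fails.
        "ActionOnFailure": on_failure,
        "HadoopJarStep": {
            "Jar": "command-runner.jar",
            "Args": ["spark-submit", "--deploy-mode", "cluster",
                     script_path, *script_args],
        },
    }


def submit_step(cluster_id, step, region="us-east-1"):
    """Send one step to an existing cluster (needs boto3 and credentials)."""
    import boto3  # imported lazily so spark_step works without boto3

    emr = boto3.client("emr", region_name=region)
    response = emr.add_job_flow_steps(JobFlowId=cluster_id, Steps=[step])
    return response["StepIds"][0]


# Hypothetical ETL job stored on S3.
etl = spark_step(
    "daily-etl",
    "s3://example-bucket/jobs/etl.py",
    ["--date", "2024-01-01"],
)
```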

Amazon EMR review by User
User
Validated Reviewer

"Decent program"

What do you like best?

The program had a lot of capabilities, and options for customization. Those who are familiar with these times of

What do you dislike?

Big learning curve for someone who hasn't used a program like this before, I found it quite frustrating and bit of a headache until I got really comfortable with it.

What business problems are you solving with the product? What benefits have you realized?

From a fairly basic understanding, Amazon EMR allowed us to streamline our processes and see an in-depth scope of our business, both under a microscope and birds eye.

Amazon EMR review by Michael A.
Michael A.
Validated Reviewer

"Amazon AWS EMR"

What do you like best?

The simplicity of creating an EMR Cluster

What do you dislike?

Sometimes the time it takes to create a cluster in a specific region.

What business problems are you solving with the product? What benefits have you realized?

It allows my analytics team to troubleshoot customer issues as they arise and query a customer's current database.

Amazon EMR review by Musharaf S.
Musharaf S.
Validated Reviewer
Business partner of the vendor or vendor's competitor, not included in G2 scores.

"EMR Presto"

What do you like best?

It's better than Cloudera Impala, but the service goes down due to many errors.

What do you dislike?

The service goes down frequently, due to hitting the maximum thread count and losing contact with workers.

What business problems are you solving with the product? What benefits have you realized?

replacing Impala with EMR Presto

Amazon EMR review by Sushanth R.
Sushanth R.
Validated Reviewer

"Quick and easy setup"

What do you like best?

Simplicity of EMR in usability and scalability. Seamless access to S3.

What do you dislike?

Can be expensive to start with. You need to have some understanding of other AWS features to start using it.

What business problems are you solving with the product? What benefits have you realized?

Data analytics

Amazon EMR review by aditya b.
aditya b.
Validated Reviewer

"Amazon EMR"

What do you like best?

make the organization of medical records very easy to process.

What do you dislike?

the user interface can be kind of tricky to use.

Recommendations to others considering the product

use it with other software as a complement

What business problems are you solving with the product? What benefits have you realized?

makes the organization easier to process

Amazon EMR review by User
User
Validated Reviewer
Business partner of the vendor or vendor's competitor, not included in G2 scores.

"EMR"

What do you like best?

The ease of starting up a new cluster and picking up where the old one left off.

What do you dislike?

No security on some of the web interfaces. Enabling developers to troubleshoot in production is very difficult.

What business problems are you solving with the product? What benefits have you realized?

We are running HBase and Flink to extract data from legacy sources, convert it to Avro, and push it on. We are building a single source of data for the company.

Amazon EMR review by Administrator
Administrator
Validated Reviewer

"Amazon EMR"

What do you like best?

It is easy to use with lots of documentation available online.

Also, with Amazon EMR we can provision one, hundreds, or thousands of compute instances to process data at any scale. It is extremely low cost as well.

What do you dislike?

Not much to dislike. We faced issues initially going through the documentation to set up EMR clusters.

What business problems are you solving with the product? What benefits have you realized?

We are using Amazon EMR to process huge datasets in no time.

Amazon EMR review by Venkatasai K.
Venkatasai K.
Validated Reviewer
Business partner of the vendor or vendor's competitor, not included in G2 scores.

"Very good experience "

What do you like best?

Performance and auto scaling compared to other vendors in the market

What do you dislike?

Cost is more compared to other vendors in the market

What business problems are you solving with the product? What benefits have you realized?

Insurance analytics

Amazon EMR review by User
User
Validated Reviewer

"Best last resort."

What do you like best?

Allows us to handle massive data sets when our local hardware won't allow it.

What do you dislike?

Takes a very long time to spin up a cluster, and is relatively expensive.

Recommendations to others considering the product

Requires a decent amount of technical knowledge to get started.

What business problems are you solving with the product? What benefits have you realized?

We are solving the problem of handling large amounts of biological data. We are able to more quickly analyze our data using EMR.

Amazon EMR review by David L.
David L.
Validated Reviewer

"Great Hadoop Environment in the cloud"

What do you like best?

Flexibility and quick setup. Ease of installing custom tools

What do you dislike?

Security features are not simple to configure

What business problems are you solving with the product? What benefits have you realized?

Replacing Hadoop on premise

Amazon EMR review by Consultant in Non-Profit Organization Management
Consultant in Non-Profit Organization Management
Validated Reviewer

"Principal Engineer"

What do you like best?

It's a fully managed service in the cloud to handle and support a big data platform. It is amazing that there is no need to handle the complex configuration of a big data platform.

What do you dislike?

There is none so far, and I am expecting to have more features added to the EMR platform in terms of supporting machine learning.

What business problems are you solving with the product? What benefits have you realized?

Big Data analytics

Amazon EMR review by Sridhar R.
Sridhar R.
Validated Reviewer

"Best with Autoscaling"

What do you like best?

Best for autoscaling, helps to spin up clusters very fast

What do you dislike?

Unable to use custom AMIs (enterprise images)

What business problems are you solving with the product? What benefits have you realized?

Analytics procurement of hardware

Amazon EMR review by User in Computer Software
User in Computer Software
Validated Reviewer

"Simple way to get into Hadoop"

What do you like best?

I like that bootstrapping a cluster is so painless and quick

What do you dislike?

Auto-scaling can be painful, especially when running Spark jobs that run the application master on core nodes and get killed.

Recommendations to others considering the product

Good for getting acquainted with Hadoop but not the most optimal or cost-effective environment to run Hadoop workloads on

What business problems are you solving with the product? What benefits have you realized?

Batch processing large volumes of data

Amazon EMR review by Internal Consultant in Hospital & Health Care
Internal Consultant in Hospital & Health Care
Validated Reviewer

"Great for ETL"

What do you like best?

Best way of transforming large volumes of non structured data, especially from S3

What do you dislike?

Still miss something like Impala to do ad-hoc queries (although I know that I can use Athena for it)

What business problems are you solving with the product? What benefits have you realized?

Moving data from landing area to EDL (S3 to S3) with low level of data transformation (e.g. merge multiple individual JSON files)

Amazon EMR review by Industry Analyst / Tech Writer
Industry Analyst / Tech Writer
Validated Reviewer

"Dipping toes in the water -- and drinking the water"

What do you like best?

quick performance with the right configuration

What do you dislike?

time to spin up and cost to keep up, even while not in use

Recommendations to others considering the product

I do recommend it, though I have had only limited use, and I can't wait until additional services and applications are added.

What business problems are you solving with the product? What benefits have you realized?

extracting data from S3 storage and pairing it with other data to generate performance and scoring

Amazon EMR review by Administrator in Financial Services
Administrator in Financial Services
Validated Reviewer

"Great product for batch processing usecases"

What do you like best?

Great for batch usecases, quick and easy to learn, deploy. Reporting, monitoring, alerting ... everything is out of box.

What do you dislike?

Not cost efficient, monolithic, requires thorough testing; you have to bake in fault tolerance, resiliency ...

What business problems are you solving with the product? What benefits have you realized?

Mostly for processing huge loads of batch files and cleansing and streaming the data, data enrichment,

Amazon EMR review by User in Financial Services
User in Financial Services
Validated Reviewer

"EMR a great service for analytics"

What do you like best?

EMR is easy to use and helps you get started quickly with your ETL jobs

What do you dislike?

nothing. I actually enjoy what EMR offers

What business problems are you solving with the product? What benefits have you realized?

We are taking unstructured data, or data that can't be used to make business decisions, and transforming it into something our business can use.

Amazon EMR review by Administrator
Administrator
Validated Reviewer

"performance"

What do you like best?

good performance, ease of use and easy to spin cluster in EMR. Easy to integrate and build with terraform.

What do you dislike?

Version constraints on components. It is really confusing to add components to the cluster because of version constraints, and it's really hard to find good documentation.

What business problems are you solving with the product? What benefits have you realized?

Analytics

Amazon EMR review by Consultant
Consultant
Validated Reviewer

"EMR Review"

What do you like best?

The best part of EMR is the auto scaling option. This is highly cost effective.

What do you dislike?

Sometimes the scaling is delayed, whether it's upscaling or downscaling. The jobs wait for resources and don't get them.

Recommendations to others considering the product

Please enhance the autoscaling timing

What business problems are you solving with the product? What benefits have you realized?

Big Data Processing

Amazon EMR review by Consultant
Consultant
Validated Reviewer
Business partner of the vendor or vendor's competitor, not included in G2 scores.

"Easy Morning Relaxation"

What do you like best?

I love the transient nature of the solution. Easy to spin up, bootstrap with additional services and configurations, and get going.

What do you dislike?

No big complaints. Failover of the master could be better.

What business problems are you solving with the product? What benefits have you realized?

Batch analysis and transformation for ingest into data lakes and Redshift.

Amazon EMR review by User
User
Validated Reviewer

"Good as long as you aren't in a hurry"

What do you like best?

Great for not in memory querying and data access. Powerful for very big data.

What do you dislike?

In-memory solutions like Presto are very limited in data size.

Recommendations to others considering the product

Make sure you know your requirements for speed and size.

What business problems are you solving with the product? What benefits have you realized?

Joining and filtering internal and external data sets.

Amazon EMR review by User
User
Validated Reviewer
Business partner of the vendor or vendor's competitor, not included in G2 scores.

"Emergency review"

What do you like best?

I like the way it is organized and easy to use

What do you dislike?

Very complex. Felt like the instruction could’ve been explained better

Recommendations to others considering the product

Learn carefully. Group learning

What business problems are you solving with the product? What benefits have you realized?

That platform is very organized and easy to use. Learning the gist of this program was easy to follow.

Amazon EMR review by User
User
Validated Reviewer

"Pretty good for cloud based Hadoop"

What do you like best?

It's really good for handling fast generating, unstructured data.

What do you dislike?

It is not very user friendly and requires massive amounts of memory.

What business problems are you solving with the product? What benefits have you realized?

We run Spark jobs on EMR, which are used by our machine learning applications for better user recommendations.

Amazon EMR review by User
User
Validated Reviewer

"Hadoop cluster deployment made easy"

What do you like best?

Auto scaling feature, integration of Jupyter notebooks, easy deployment of Hadoop clusters

What do you dislike?

One management node, if the management node goes down, then cluster goes down and the work is lost

What business problems are you solving with the product? What benefits have you realized?

Data processing of unstructured data

Amazon EMR review by Administrator in Computer Software
Administrator in Computer Software
Validated Reviewer

"Using Amazon EMR to evaluate the data on Spark networks"

What do you like best?

Very user friendly interface, no need to maintain any libraries

What do you dislike?

Does not provide 'on premises' options. It's all setup at launch time. The bootstrapping feature is useful.

What business problems are you solving with the product? What benefits have you realized?

Evaluating data on Hadoop networks. Useful statistics generated

Amazon EMR review by User
User
Validated Reviewer

"Ease of use"

What do you like best?

It is easy to use EMR for me as I come from an on-prem background.

What do you dislike?

Not sure of which instance to use. Given so many options in the hardware selection.

What business problems are you solving with the product? What benefits have you realized?

Machine Learning, Training

Amazon EMR review by Internal Consultant
Internal Consultant
Validated Reviewer

"Easy to Scale Hard to Migrate"

What do you like best?

Easy to scale based on the latest config

What do you dislike?

It's hard to migrate to a new cluster type, as the master is still stuck at the initial config.

What business problems are you solving with the product? What benefits have you realized?

Real Time Reporting

Amazon EMR review by Internal Consultant
Internal Consultant
Validated Reviewer

"EMR"

What do you like best?

Flexibility, ease of learning, ease of use

What do you dislike?

There really isn't much to dislike yet. We haven't really challenged the product.

What business problems are you solving with the product? What benefits have you realized?

Data Ingestion from on prem to AWS S3 and query

Amazon EMR review by Consultant
Consultant
Validated Reviewer

"Sqoop and spark operations in EMR"

What do you like best?

Ability to spin up hadoop clusters and scale them up or down as needed

What do you dislike?

Long running EMR cluster ends up with bad nodes

What business problems are you solving with the product? What benefits have you realized?

Incremental data pipeline, ETL process using pyspark

Amazon EMR review by User
User
Validated Reviewer

"EMR"

What do you like best?

The auto scaling option to evaluate, transform and load the data without need to set up new settings.

What do you dislike?

The hard understanding of the user interface.

What business problems are you solving with the product? What benefits have you realized?

ETL and process logs

Amazon EMR review by User
User
Validated Reviewer

"emr"

What do you like best?

I like that it is fast and pretty easy to use

What do you dislike?

I don't like how hard it is to start Jupyter notebooks from it

What business problems are you solving with the product? What benefits have you realized?

write machine learning models

Amazon EMR review by User in Computer Software
User in Computer Software
Validated Reviewer

"Review for AWS EMR"

What do you like best?

Managed Hadoop cluster clears a lot of operational hazards and optimizes cost.

What do you dislike?

Nothing as such. It has been really nice using Amazon EMR.

What business problems are you solving with the product? What benefits have you realized?

Analytics

Amazon EMR review by Administrator in Computer & Network Security
Administrator in Computer & Network Security
Validated Reviewer

"Cloud is everything."

What do you like best?

Working environment, all in one platform.

What do you dislike?

Complexity and if anything fails whole system goes down

What business problems are you solving with the product? What benefits have you realized?

Instances and cloud data. User friendly.

* We monitor all Amazon EMR reviews to prevent fraudulent reviews and keep review quality high. We do not post reviews by company employees or direct competitors. Validated reviews require the user to submit a screenshot of the product containing their user ID, in order to verify a user is an actual user of the product.