Amazon EMR Reviews & Product Details


What is Amazon EMR?

Amazon EMR is a web-based service that simplifies big data processing, providing a managed Hadoop framework that makes it easy, fast, and cost-effective to distribute and process vast amounts of data across dynamically scalable Amazon EC2 instances.

Write a Review

Amazon EMR Profile Details

Amazon EMR Profile Details

Vendor
AWS
Description
By giving customers more of what they want - low prices, vast selection, and convenience - Amazon continues to grow and evolve as a world-class e-commerce platform.
Company Website
Year Founded
2006
Total Revenue (USD mm)
177,866
HQ Location
Seattle, WA
Ownership
NASDAQ: AMZN
LinkedIn® Page
www.linkedin.com
Employees on LinkedIn®
38,313
Twitter
@awscloud
Twitter Followers
1,754,279
Show moreShow fewer

Amazon EMR Reviews

Filter Reviews
Filter Reviews
Sort by
Ratings
Company Size
User Role
All Industries
Write a Review
1-45 of 45 total Amazon EMR reviews

Amazon EMR Reviews

Write a Review
Filter By
Connections
Show reviews that mention
1-45 of 45 total Amazon EMR reviews
Copy Review URL
UL
Small-Business
(2-10 employees)
Validated Reviewer
Review Source
Copy Review URL

"AWS EMR at a glance"

What do you like best?

EMR does well in managing the cost as it uses the task node cores to process the data and these instances are cheaper when the data is stored on s3. It is really cost-efficient. No need to maintain any libraries to connect to AWS resources.

What do you dislike?

No UI client for saving the workbooks or code snippets. Everything has to go through submitting process. Not really convenient for tracking the job as well.

What problems are you solving with the product? What benefits have you realized?

EMR is suited if the jobs are long running and doesn't really need much monitoring. EMR is really flexible in processing the data on s3 as a developer doesn't need to spend time on debugging the connections to s3 from a big data framework as most of the configuration is taken care of by Amazon. Very cheap when compared to most of the solutions on the market and the ready to go configuration at the launch time reduces the amount of time required for admin tasks. So, considering the cheap cost, processing options on s3 and scalability via adding task nodes, EMR serves a better purpose for startups considering open source and cost-efficient options.

Copy Review URL
Test Lead
Biotechnology
Enterprise
(5001-10,000 employees)
Validated Reviewer
Verified Current User
Review Source
Copy Review URL

"We've moved our hadoop processing here"

What do you like best?

Ease of spinning up clusters on-demand to save $$$

What do you dislike?

Not much! We had a fairly easy tiime getting Spark on EMR running. Was a little troublesome at first, but once we learned the platform, we love it!

What problems are you solving with the product? What benefits have you realized?

We have moved our hadoop/sqoop data processing here. This allows us to offload the MV refresh process we used to have in place inside Oracle, thus saving us valuable throughput and DB load.

Copy Review URL
UC
Enterprise
(1001-5000 employees)
Validated Reviewer
Review Source
Copy Review URL

"Easy way to run Big data applications in the cloud"

What do you like best?

Amazon EMR is easy to setup and use. There are plenty of ways to get started, such as the AWS EMR console, or you can automate the whole thing using AWS command line tool awscli, or the boto3 api for Python. We use a combination of awscli and boto3 for automation. It provides best in class tools built in, and integration with other amazon services such as s3 for data as well as log aggregation, etc. It also provides a way to use AWS EC2 Spot instances, which reduce the cost to run Map Reduce jobs by 50-80% on average.

What do you dislike?

There are no major dislikes about EMR, but in general, they could provide more options to monitor the cluster, and also provide ways to troubleshoot the failed jobs and provide ways to recover failed jobs. But Amazon is going in the right direction and hope they address those things as well.

Recommendations to others considering the product:

If you are on AWS and looking to run Hadoop/Spark, look no further than Amazon EMR, and it might save a lot of time in setting up your cluster, and helps you focus more on the application business logic than worrying about the infrastructure configuration and setup.

What problems are you solving with the product? What benefits have you realized?

We use EMR primarily for data analytics and big data processing using Spark, Hadoop and we also use S3 for storing the output.

Copy Review URL
A
Enterprise
(1001-5000 employees)
Validated Reviewer
Review Source
Copy Review URL

"Mapreduce for the Cloud"

What do you like best?

EMR is simple to use, configure, and maintain. The number of components available in EMR is very good and serves almost all the usecases and rarely you will have to install a software on your own. The ability to mix instance lifecycles (spot, on-demand, reserved) is very helpful to optimize the costs

What do you dislike?

There isn't an easy way for setting up authorization and authentication in EMR. Cloudera does a better job in maintaining that.

Recommendations to others considering the product:

EMR is the way to go if you're going to run big data workloads in AWS.

What problems are you solving with the product? What benefits have you realized?

We run analytics workload for processing large volumes of data. Primarily the data is stored in S3. We have both transient and permanent clusters, and it is really helpful to setup and teardown the clusters using

Copy Review URL
U
Mid-Market
(51-200 employees)
Validated Reviewer
Review Source
Copy Review URL

"Useful cloud-based platform"

What do you like best?

The best part is its ease of use for the user. We use it for big data storage. The platform is very useful in regards to its processing and storage of big data.

What do you dislike?

It is really only suited for big data, so if you the user is looking for something else than storage of big data, then I don't really recommend something like this for the user. Also not for users that are not advanced with knowledge of data storage

What problems are you solving with the product? What benefits have you realized?

It provides the ability to handle big data storage. It also is a great choice if you are going to set up a distributed compute platform. EMR's benefit is a great usefulness to startups that a thinking of using open source and cost efficient options

Copy Review URL
U
Small-Business
(11-50 employees)
Validated Reviewer
Review Source
Copy Review URL

"Ideal tool for Analytical Based Project that used Hadoop and Apache Spark."

What do you like best?

Amazon EMR is best option for running resource heavy task. It is secure and easy to setup with its intuitive setup wizard in both basic and advanced mode.

We need to to worry about the cluster formation and handling, Leaving it to Amazon EMR we can concentrate on designing model to run on it.

What do you dislike?

Not much variety to chose from. EMR Clusters are not highly versatility. It comes with basic application setup like hadoop, spark, Hive etc. But we need to relive on Bootstrap action to download and install require applications.

What problems are you solving with the product? What benefits have you realized?

Amazon EMR is used for information extracting from the history and logs of our other applications.

It is very cost effective when compare to other methods.

It made data processing and analysis easy.

Copy Review URL
Information Technology Infrastructure
Telecommunications
Mid-Market
(201-500 employees)
Validated Reviewer
Review Source
Copy Review URL

"IT made easy to process big data"

What do you like best?

.It allows to work on large amount of data an on different framework. EMR offers the expandable low-configuration service as an easier alternative to running in-house cluster computing

What do you dislike?

Its complex to use in initially and i think there should be proper Userinterface for real time.

Recommendations to others considering the product:

Good option for startup.

open source and cost efficient

saves lots of time for setup

What problems are you solving with the product? What benefits have you realized?

Using Hadoop and Spark frameworks. For processing and storage of large data

Copy Review URL
Kapies Vallipuram
Mid-Market
(51-200 employees)
Validated Reviewer
Review Source
Copy Review URL

"EMR Review"

What do you like best?

We were able to easily crunch over 400million events every few hours. We can use HIVE which makes the learning curve much easier.

What do you dislike?

IF you forget to turn off your instance.. you can expect a nice heavy bill. The sizing of the cluster is not easy to use, we tend to just guess.

What problems are you solving with the product? What benefits have you realized?

We were crunching a lot of data that were coming through our pipeline.

Copy Review URL
Sr. Big Data Engineer
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"EMR"

What do you like best?

the ability to spin up cluster size based upon the needs of each job

What do you dislike?

error logging is sometimes hard to track down root cause

Recommendations to others considering the product:

start with small jobs, take adavantage of new freatures in newer releases

What problems are you solving with the product? What benefits have you realized?

we are building data assets for multiple business units to share via S3 and Redshift

Copy Review URL
Snowflake busted up our performance
Mid-Market
(51-200 employees)
Validated Reviewer
Review Source
Copy Review URL

"Easy way to setup and use hadoop with flexible configurations"

What do you like best?

the configuration and node managements are flexible and the performance is good

What do you dislike?

the app logs are difficult to view , we may need a way to terminate unused clusters

What problems are you solving with the product? What benefits have you realized?

using map reduce we do parallel processing and the performance of our computation has incresed

Copy Review URL
Sr. Engineer
Enterprise
(5001-10,000 employees)
Validated Reviewer
Review Source
Copy Review URL
Business partner of the vendor or vendor's competitor, not included in G2 scores.

"EMR is great"

What do you like best?

I was able to quickly setup and run the workloads easily. Support team gave enough guidence to resolve issues

What do you dislike?

Most of the featurestes are good,. one thing i felt is, it is less custmizable.

What problems are you solving with the product? What benefits have you realized?

We we able to run some adhoc queries using Hive quickly with EMR

Copy Review URL
U
Mid-Market
(51-200 employees)
Validated Reviewer
Review Source
Copy Review URL

"EMR: Easy Setup but Dispatching Work has Friction"

What do you like best?

Its very easy to set up and upgrade Spark clusters. Auto scaling with spot is also great, as it can save you lots of money on the correct workloads.

What do you dislike?

The step API does not allow you to run concurrent workloads and its unecessaraly difficult to set up a local client that is not located on the actual EMR cluster (i.e. dispatching work from a scheduler to a Spark cluster on EMR).

What problems are you solving with the product? What benefits have you realized?

ETL workloads and building a data warehouse.

Copy Review URL
U
Small-Business
(11-50 employees)
Validated Reviewer
Review Source
Copy Review URL

"Decent program"

What do you like best?

The program had a lot of capabilities, and options for customization. Those who are familiar with these times of

What do you dislike?

Big learning curve for someone who hasn't used a program like this before, I found it quite frustrating and bit of a headache until I got really comfortable with it.

What problems are you solving with the product? What benefits have you realized?

From a fairly basic understanding, Amazon EMR allowed us to streamline our processes and see an in-depth scope of our business, both under a microscope and birds eye.

Copy Review URL
Cloud System Architect, Robotics and Workforce Intelligence
Computer Software
Mid-Market
(51-200 employees)
Validated Reviewer
Review Source
Copy Review URL

"Amazon AWS EMR"

What do you like best?

The simplicity of creating an EMR Cluster

What do you dislike?

Sometimes the time it takes to create a cluster in a specific region.

What problems are you solving with the product? What benefits have you realized?

It allows my analytics team to troubleshoot customer issues as they arise and query a customer's current database.

Copy Review URL
Senior Hadoop Administrator
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL
Business partner of the vendor or vendor's competitor, not included in G2 scores.

"EMR Presto"

What do you like best?

Its better than cloudera Impala.but the servcie goes down due to many errors

What do you dislike?

Service goes down frequntly .due to maximum thread count and out of contact with wokrers

What problems are you solving with the product? What benefits have you realized?

replacing Impala with EMR Presto

Copy Review URL
Mr
Validated Reviewer
Review Source
Copy Review URL

"Quick and easy setup"

What do you like best?

SImplicity of emr in usability and scalability. Seamless access to s3

What do you dislike?

Can be expensive to start with. Need to have some understanding of other aws features to start using it.

What problems are you solving with the product? What benefits have you realized?

Data analytics

Copy Review URL
Mid-Market
(51-200 employees)
Validated Reviewer
Review Source
Copy Review URL

"Amazon EMR"

What do you like best?

make the organization of medical records very easy to process.

What do you dislike?

the user interface can be kind of tricky to use.

Recommendations to others considering the product:

use it with other software as a complement

What problems are you solving with the product? What benefits have you realized?

makes the organization easier to process

Copy Review URL
U
Validated Reviewer
Review Source
Copy Review URL
Business partner of the vendor or vendor's competitor, not included in G2 scores.

"EMR"

What do you like best?

The ease of starting up a new cluster and picking up where the old one left off.

What do you dislike?

No security on some of the web interfaces. Enabling developers to troubleshoot in production is very difficult.

What problems are you solving with the product? What benefits have you realized?

We are running hbase and flink to extract data for legacy sources, convert to avro, and push on. We are solving one source of data for the company

Copy Review URL
A
Small-Business
(11-50 employees)
Validated Reviewer
Review Source
Copy Review URL

"Amazon EMR"

What do you like best?

It is easy to use with lots of documentation available online.

Also, with Amazon EMR we can provision one, hundreds, or thousands of compute instances to process data at any scale. It is extremely low cost as well.

What do you dislike?

Not much to dislike. We faced issues initially going through the documentation to set up EM clusters.

What problems are you solving with the product? What benefits have you realized?

We are using Amazon EMR t process huge dataset withing no time

Copy Review URL
Director
Small-Business
(11-50 employees)
Validated Reviewer
Review Source
Copy Review URL
Business partner of the vendor or vendor's competitor, not included in G2 scores.

"Very good experience "

What do you like best?

Performance and auto scaling compared to other vendors in the market

What do you dislike?

Cost is more compared to other vendors in the market

What problems are you solving with the product? What benefits have you realized?

Insurance analytics

Copy Review URL
U
Small-Business
(2-10 employees)
Validated Reviewer
Review Source
Copy Review URL

"Best last resort."

What do you like best?

Allows us to handle massive data sets when our local hardware won't allow it.

What do you dislike?

Takes a very long time to spin up a cluster, and is relatively expensive.

Recommendations to others considering the product:

Requires a decent amount of technical knowledge to get started.

What problems are you solving with the product? What benefits have you realized?

We are solving the problem of handling large amounts of biological data. We are able to more quickly analyze our data using EMR.

Copy Review URL
Technologist
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"Great Hadoop Environment in the cloud"

What do you like best?

Flexibility and quick setup. Ease of installing custom tools

What do you dislike?

Security features are not simple to configure

What problems are you solving with the product? What benefits have you realized?

Replacing Hadoop on premise

Copy Review URL
CN
Enterprise
(1001-5000 employees)
Validated Reviewer
Review Source
Copy Review URL

"Principle Engineer"

What do you like best?

Its a fully managed service in Cloud to handle and Support Big data platform. Which is amazing that there is no need to handle complex configuration of Big data platform.

What do you dislike?

There is none so far, and i am expecting to have more featuers added on this EMR Platform in terms of supporting machine learning.

What problems are you solving with the product? What benefits have you realized?

Big Data analytics

Copy Review URL
Lead BigData Platform Engineer
Financial Services
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"Best with Autoscaling"

What do you like best?

Best for autoscaling, helps to spin clusers very fast

What do you dislike?

Unable to use custom AMI(Enterpise Images)

What problems are you solving with the product? What benefits have you realized?

Analytics procurment of Hardware

Copy Review URL
UC
Enterprise
(1001-5000 employees)
Validated Reviewer
Review Source
Copy Review URL

"Simple way to get into Hadoop"

What do you like best?

I like that bootstrapping a cluster is so painless and quick

What do you dislike?

Auto-scaling can be painful, especially when running Spark jobs that run the application master on core nodes and get killed.

Recommendations to others considering the product:

Good for getting acquainted with Hadoop but not the most optimal or cost-effective environment to run Hadoop workloads on

What problems are you solving with the product? What benefits have you realized?

Batch processing large volumes of data

Copy Review URL
IH
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"Great for ETL"

What do you like best?

Best way of transforming large volumes of non structured data, especially from S3

What do you dislike?

Still miss something like Impala to do ad-hoc queries (although I know that I can use Athena for it)

What problems are you solving with the product? What benefits have you realized?

Moving data from landing area to EDL (S3 to S3) with low level of data transformation (e.g. merge multiple individual JSON files)

Copy Review URL
I
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"Dipping toes in the water -- and drinking the water"

What do you like best?

quick performance with the right configuration

What do you dislike?

time to spin up and cost to keep up, even while not in use

Recommendations to others considering the product:

Do reccomend but have had only limited use but can't wait until additional services and applications are added

What problems are you solving with the product? What benefits have you realized?

extri performance and scoringetscting data from s3 storages and pairing with other data to generate

Copy Review URL
AF
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"Great product for batch processing usecases"

What do you like best?

Great for batch usecases, quick and easy to learn, deploy. Reporting, monitoring, alerting ... everything is out of box.

What do you dislike?

Not cost effiecient, monolithic, requires thorough testing, bake in fault tolerance, reseliency ...

What problems are you solving with the product? What benefits have you realized?

Mostly for processing huge loads of batch files and cleansing and streaming the data, data enrichment,

Copy Review URL
UF
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"EMR a great service for analytics"

What do you like best?

EMR is easy to use and helps you get started quickly with your ETL jobs

What do you dislike?

nothing. I actually enjoy what EMR offers

What problems are you solving with the product? What benefits have you realized?

We are taking unstructured data or data that can't be used to make business decisons and transforming it into something in which our business can use.

Copy Review URL
A
Enterprise
(1001-5000 employees)
Validated Reviewer
Review Source
Copy Review URL

"performance"

What do you like best?

good performance, ease of use and easy to spin cluster in EMR. Easy to integrate and build with terraform.

What do you dislike?

version constraints on components. It is really confusing to add components to the cluster coz of version constraints and its really hard to find good documentation.

What problems are you solving with the product? What benefits have you realized?

analyics

Copy Review URL
C
Enterprise
(1001-5000 employees)
Validated Reviewer
Review Source
Copy Review URL

"EMR Review"

What do you like best?

The best part of EMR is the auto scaling option. This is highly cost effective.

What do you dislike?

Sometimes the scaling is delyed. Whether its upscaling and douwnscaling. The jobs wait for resources and they dont get it

Recommendations to others considering the product:

Please enhance the autoscaling timing

What problems are you solving with the product? What benefits have you realized?

Big Data Processing

Copy Review URL
C
Enterprise
(1001-5000 employees)
Validated Reviewer
Review Source
Copy Review URL
Business partner of the vendor or vendor's competitor, not included in G2 scores.

"Easy Morning Relaxation"

What do you like best?

I love the transient nature of the solution. Easy to spin up, bootstrap with additional services and configurations, and get going.

What do you dislike?

No big complaints. Failover of the master could be better.

What problems are you solving with the product? What benefits have you realized?

Batch analysis and transformation for ingest into data lakes and Redshift.

Copy Review URL
U
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"Good as long as you aren't in a hurry"

What do you like best?

Great for not in memory querying and data access. Powerful for very big data.

What do you dislike?

In-memory solutions like Presto are very limited in data size.

Recommendations to others considering the product:

Make sure you know your requirements for speed and size.

What problems are you solving with the product? What benefits have you realized?

Joining and filtering internal and external data sets.

Copy Review URL
U
Mid-Market
(51-200 employees)
Validated Reviewer
Review Source
Copy Review URL
Business partner of the vendor or vendor's competitor, not included in G2 scores.

"Emergency review"

What do you like best?

I like the way it is organized and easy to use

What do you dislike?

Very complex. Felt like the instruction could’ve been explained better

Recommendations to others considering the product:

Learn carefully. Group learning

What problems are you solving with the product? What benefits have you realized?

That platform is very organized and easy to use. Learning the gist of this program was easy to follow.

Copy Review URL
U
Enterprise
(5001-10,000 employees)
Validated Reviewer
Review Source
Copy Review URL

"Pretty good for cloud based Hadoop"

What do you like best?

It's really good for handling fast generating, unstructured data.

What do you dislike?

It is not very user friendly and requires massive amounts of memory.

What problems are you solving with the product? What benefits have you realized?

We run Spark jobs on EMR, which are used our machine learning applications for better user recommendations.

Copy Review URL
U
Validated Reviewer
Review Source
Copy Review URL

"Hadoop cluster deployment made easy"

What do you like best?

auto scaling feature, integration of jupyter notebooks, Easy deployment of haoop cluster

What do you dislike?

One management node, if the management node goes down, then cluster goes down and the work is lost

What problems are you solving with the product? What benefits have you realized?

Data processing of unstructured data

Copy Review URL
AC
Small-Business
(11-50 employees)
Validated Reviewer
Review Source
Copy Review URL

"Using Amazon EMR to evaluate the data on Spark networks"

What do you like best?

Very user friendly interface, no need to maintain any libraries

What do you dislike?

Does not provide 'on premises' options. It's all setup at launch time. The bootstrapping feature is useful.

What problems are you solving with the product? What benefits have you realized?

Evaluating data on Hadoop networks. Useful statistics generated

Copy Review URL
U
Enterprise
(5001-10,000 employees)
Validated Reviewer
Review Source
Copy Review URL

"Ease of use"

What do you like best?

It is easy to use EMR for me as I come from an on-prem background.

What do you dislike?

Not sure of which instance to use. Given so many options in the hardware selection.

What problems are you solving with the product? What benefits have you realized?

Machine Learning, Training