AWS Glue Reviews & Product Details


What is AWS Glue?

AWS Glue is a fully managed extract, transform, and load (ETL) service designed to make it easy for customers to prepare and load their data for analytics.

Write a Review

AWS Glue Profile Details

AWS Glue Profile Details

Vendor
AWS
Description
By giving customers more of what they want - low prices, vast selection, and convenience - Amazon continues to grow and evolve as a world-class e-commerce platform.
Company Website
Year Founded
2006
Total Revenue (USD mm)
177,866
HQ Location
Seattle, WA
Ownership
NASDAQ: AMZN
LinkedIn® Page
www.linkedin.com
Employees on LinkedIn®
38,313
Twitter
@awscloud
Twitter Followers
1,747,898
Show moreShow fewer

AWS Glue Reviews

Filter Reviews
Filter Reviews
Sort by
Ratings
Company Size
User Role
For Category
All Industries
Write a Review
1-31 of 31 total AWS Glue reviews

AWS Glue Reviews

Write a Review
Filter By
Connections
Show reviews that mention
1-31 of 31 total AWS Glue reviews
Copy Review URL
Cloud Architect
Information Technology and Services
Enterprise
(10,001+ employees)
Validated Reviewer
Verified Current User
Review Source
Copy Review URL

"Cloud based ETL service!!"

What do you like best?

i have been using it for 1-2 years , the best thing about AWS glue is it's a serverless solution , it works by just pointing AWs glue to all other kinds of ETL jobs and hit run , it basically an service that makes it simple and cost effective to categorize data , clean the data , enrich the data , and it makes the job moving data reliably btwn various data stores very easy and efficient, we can also connect it with oracle database !!

What do you dislike?

it gives very high level of automation and less customization !!

Recommendations to others considering the product:

Very cost effective and reliable service to clean data, enrich data and to perform all kind of ETL (Extract Transform Load ) operations on the data!!

What problems are you solving with the product? What benefits have you realized?

we use it to perform ETL jobs like preparing data for Machine Learning Model , to do analytics and auditing!!

Copy Review URL
Cloud Architect
Information Technology and Services
Mid-Market
(201-500 employees)
Validated Reviewer
Verified Current User
Review Source
Copy Review URL

"The ETL (Extract Tranform And Load) service for AWS!"

What do you like best?

i have been working with AWS Glue for 2-3 years , it allows you to locate , move and transform all your data sets across your business , the most interesting thing about AWS Glue is ,it's server less you can run your all ETL jobs by just pointing Glue to them, you don't need to configure , provision or spinup servers , and you don't need to manage their life cycle , it customizes your task by 80-85%!!

What do you dislike?

it's not that easy to learn and implement AWS Glue because it contains concepts like Crawlers , ETL scripts etc.

Recommendations to others considering the product:

one of the reliable and heavily used server less software for performing all kinds of ETL tasks on data sets!!

What problems are you solving with the product? What benefits have you realized?

we're using it for 2-3 years for performing all the ETL tasks on our Structured as well as Semi-Structure data sets!!

Copy Review URL
Member Of Technical Staff (Data Platform)
Information Technology and Services
Enterprise
(1001-5000 employees)
Validated Reviewer
Review Source
Copy Review URL
Business partner of the vendor or vendor's competitor, not included in G2 scores.

"Better alternative for hive metastore"

What do you like best?

Seems to be great option if you wwant to test out another metastore instead of the default one which comes with hive

What do you dislike?

Its too new and not many tutorials or use cases are mentioned on the web so it will take some time to use this in prod. right now we are still doing poc on it

What problems are you solving with the product? What benefits have you realized?

we can reduce pressure on hive metastore. sometimes the data can be too much and can get spikes in hive metastore so we need something better solution which wont have the same issues we already deal with. glue can be good option here which will solve the problem

Copy Review URL
U
Small-Business
(11-50 employees)
Validated Reviewer
Review Source
Copy Review URL

"Easiest Cloud Platform (according to multiple clients we work with)"

What do you like best?

Ease, simplicity, intuitive-ness. Hands down the easiest platform to administer, maintain, and scale. This is coming from multiple clients we work with who have swtiched to the product from a variety of competitors.

What do you dislike?

Price can be an issue for some, depending on the company size and budgetary restrictions. Certainly it's not anything related to simplicity or ease.

Recommendations to others considering the product:

Everything seems to be wonderful in terms of ease of use and simplicity

What problems are you solving with the product? What benefits have you realized?

As a growing company who supports a number of clients, we've found it's easier in terms of the platform vs. the other options out there. Time is a commodity we never get enough of, so to reduce training and administering time is KEY.

Copy Review URL
AWS Glue Review
Validated Reviewer
Review Source
Copy Review URL

"AWS Glue review"

What do you like best?

It is easy to load and analyze data. Running an ETL (Extract, Transform, Load) job is quite easy and intuitive. Once the data is processed by AWS Glue, the data can be queried and extracted using simple SQL like commands.

What do you dislike?

Integration with other AWS components is not easy even if all the components are in the same environment.

Recommendations to others considering the product:

Best tool for ETL jobs and data analysis

What problems are you solving with the product? What benefits have you realized?

Cleaning up and extracting data from large datasets

Copy Review URL
U
Enterprise
(5001-10,000 employees)
Validated Reviewer
Review Source
Copy Review URL

"Very specific use cases only"

What do you like best?

Schedule ETL operations is very cool, I think it could be a great thing.

What do you dislike?

UI needs work. Very slow, HOWEVER, my first experience with this was at re:invent, so resources were probably stretched thin. I did talk to another person in the training with me who said it does tend to be slow, but he just schedules jobs to run over night and it works great for him.

Recommendations to others considering the product:

Try it out with a POC. I've heard from it works very well in production.

What problems are you solving with the product? What benefits have you realized?

I'm not currently solving a business problem with it. I would like to use it for automated feedback processing.

Copy Review URL
U
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"Easy ETL with a few clicks"

What do you like best?

The one thing I love about AWS Glue is that we are using to transform our dynamodb application data into a relation database that reporting tools like tableau and looker can use easily.

What do you dislike?

The documentation and sample code around it is horrible. Usually, I raise an support ticket to resolve my issues.

Recommendations to others considering the product:

If you want to use an ETL in AWS then Glue is your answer. We have used data pipeline and it was too much but with glue it meets our needs.

What problems are you solving with the product? What benefits have you realized?

Taking unstructured data and making it an structured format for data virtulizations.

Copy Review URL
UC
Enterprise
(5001-10,000 employees)
Validated Reviewer
Review Source
Copy Review URL

"AWS Glue is promising but needs improvements"

What do you like best?

1.Serverless. Fully managed.

2. Easy to write a script.

3. Connects with Relational databases (JDBC)

4. Data crawler and integration with Athena.

What do you dislike?

1. No IDE supported for development. Delay in identifying syntax errors or other tyop errors as we have to wait for servers to start.

2. Current DPU configuration is small.

3. Development end point option is available but no way to pause it. We have to delete it or will be charged as long as it is up. Setting up Dev end point can be painful considering organization's security policies.

What problems are you solving with the product? What benefits have you realized?

ETL pipeline.

Copy Review URL
Software Engineering Manager 1
Mid-Market
(501-1000 employees)
Validated Reviewer
Review Source
Copy Review URL

"The Service is good, can become better"

What do you like best?

The service is good in a servless architecture. The glue perfomes the etl spark jobs perfectly well.

What do you dislike?

the service takes a lot of time to provision the servers. the insights in the glue-assembly-jar is not there. We face lot of dependency conflicts.

What problems are you solving with the product? What benefits have you realized?

we wanted to built a serverless data processing pipeline. Glue fits in well there.

Copy Review URL
E
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"It is coming along"

What do you like best?

We began implementing in Jan 18 for etl. It filled our need, but was hard to manage. It management has since improved greatly, and we have several pipelines (with step functions) in production. We continue to evolve our patterns, and glue continues to be a key service within those patterns.

What do you dislike?

It continues to be difficult to understand processing states. It was difficult to orchestrate (we tried triggers, but those worked poorly, hence step functions),

What problems are you solving with the product? What benefits have you realized?

ETL - cleansing raw data, transforming per business rules, publishing to S3 buckets.

Copy Review URL
Senior Systems Engineer
Mid-Market
(51-200 employees)
Validated Reviewer
Review Source
Copy Review URL

"Easy Schema Creation for Athena"

What do you like best?

Glue Crawlers allow me to easily modify and create Athena tables based on ever changing JSON objects.

What do you dislike?

Lack of adequate documentation, (ESPECIALLY examples!) of Glue Crawler Classifiers.

What problems are you solving with the product? What benefits have you realized?

Simplified deployment and maintainance of Athena tables with Glue Crawlers. Removed need to schedule a dedicated Lambda cron job to create partitions.

Copy Review URL
A
Mid-Market
(501-1000 employees)
Validated Reviewer
Verified Current User
Review Source
Copy Review URL

"Good Tool to Index S3 and Athena"

What do you like best?

Extremely easy to create and keep updated as new data sources are added to S3.

What do you dislike?

Limited number of file types are supported.

What problems are you solving with the product? What benefits have you realized?

Indexing our S3 buckets for easy querying the objects stored within our web application

Copy Review URL
Software Engineer
Enterprise
(1001-5000 employees)
Validated Reviewer
Review Source
Copy Review URL

"Good but not enough integration with open source data processing tools"

What do you like best?

Wide selection of built-in process to ETL and schema extraction. integration with EMR, Athena,

What do you dislike?

Harder to integrate with non-EMR based big data tools, such as spark streaming, Flink, etc.

What problems are you solving with the product? What benefits have you realized?

Data catalog of various data sources and sinks

Copy Review URL
Engineering Manager
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"we are first to go live using Glue"

What do you like best?

its fast reliable and easy to code.and i would definitely recommend to other customers

What do you dislike?

the DPUs allow and running the same script parallely with diffrent scripts

Recommendations to others considering the product:

yes, ofcourse

What problems are you solving with the product? What benefits have you realized?

building sales Analytics data lake

Copy Review URL
Principal Engineer
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"Next Gen ETL "

What do you like best?

Crawler & how you can integrate with other AWS services

What do you dislike?

It is still evolving; performance can still be improved

What problems are you solving with the product? What benefits have you realized?

We built the Sales Analytics domain using Glue

Copy Review URL
Architect - Big Data and Cloud
Mid-Market
(501-1000 employees)
Validated Reviewer
Review Source
Copy Review URL

"Glue empowers some of our ETL pipelines"

What do you like best?

Glue Data Catalog

Catalog as a service for microservices

What do you dislike?

Not being able to gather a slice of a data

What problems are you solving with the product? What benefits have you realized?

Serverless ETL pipelines

Data Democracy

Copy Review URL
Director, Architecture
Financial Services
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"Building Data lake made easy"

What do you like best?

Product needs maturity but a good offering

What do you dislike?

Release cycles are slow. More functonality on Data cataloging features

What problems are you solving with the product? What benefits have you realized?

Building Data Lake and enhancing data analytics

Copy Review URL
VP Software Dev
Enterprise
(1001-5000 employees)
Validated Reviewer
Review Source
Copy Review URL

"Great simple etl tool"

What do you like best?

simplicity of use - ability to build simple transformations quickly

What do you dislike?

network connectivity to on-prem servers was a challenge

What problems are you solving with the product? What benefits have you realized?

Data ingestion

Copy Review URL
Architect
Enterprise
(5001-10,000 employees)
Validated Reviewer
Review Source
Copy Review URL

"Nice ELT Tool"

What do you like best?

Transformation was easy and data manipulation

What do you dislike?

Complex Transformation not possible and unable to make large data processing

What problems are you solving with the product? What benefits have you realized?

Data Ingestion

Copy Review URL
U
Enterprise
(1001-5000 employees)
Validated Reviewer
Review Source
Copy Review URL

"AWS Glue makes it easy to ETL between S3 and databases with just a little code"

What do you like best?

Feature rich, layers on top of Spark (PySpark), ability to run arbitrary PySpark scripts

What do you dislike?

No GUI for Scala (not a big deal but a noticied absence), slow to spin up clusters, limited number of simultaneous DPUs across an account

What problems are you solving with the product? What benefits have you realized?

Extract data from SQL databases in RDS and in EC2 into S3 for customer access.

Copy Review URL
U
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"Glue may be the future"

What do you like best?

Glue seems to be able toi replace lots of annoying ETL stuff and give power to developers.

What do you dislike?

You still have to edit the code manually when there is something more complex stuff going on with the data.

What problems are you solving with the product? What benefits have you realized?

ETL stuff can be done by data scientist instead of outsourcing everything to long projects.

Copy Review URL
A
Enterprise
(1001-5000 employees)
Validated Reviewer
Review Source
Copy Review URL

"Awesome serverless ETL processing "

What do you like best?

Available Spark Engine

Data crawlers to automatically identify schema structure

No need to maintain EMR/Hadoop cluster.

What do you dislike?

Some support for datatypes missing(Map)

Auto scaling is missing.

Recommendations to others considering the product:

FIle types, compression and data types. Size of data

What problems are you solving with the product? What benefits have you realized?

Serverless ETL. reduces cost of maintaining infrastructure and operational cost.

Copy Review URL
UC
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"AWS Glue Review"

What do you like best?

Its Serverless and it provides the catalog, orchestration all together apart from the compute

What do you dislike?

It seems to run on very cheap machines at the backend, which does not provide the right performance all the time

What problems are you solving with the product? What benefits have you realized?

Data Ingestion, transformation for the data lake

Copy Review URL
U
Enterprise
(5001-10,000 employees)
Validated Reviewer
Review Source
Copy Review URL

"analytics on GLue"

What do you like best?

its easy to use, scalable, integrates well with cloudwatch, sns, etc crawler helps with schema identification and user access through athena

What do you dislike?

limit on concurrency, no tagging on glue resources

What problems are you solving with the product? What benefits have you realized?

run analytics on data lake

Copy Review URL
U
Small-Business
(11-50 employees)
Validated Reviewer
Review Source
Copy Review URL

"Glue"

What do you like best?

Speed to deliver insigets to the business

What do you dislike?

Can improve the crawler capability. Should be able to understand datatypes better

What problems are you solving with the product? What benefits have you realized?

Creation of data lake - processing raw data to compressed columnar format

Copy Review URL
U
Mid-Market
(201-500 employees)
Validated Reviewer
Review Source
Copy Review URL

"Good"

What do you like best?

I hope there are more statistics from the console like job management

What do you dislike?

I’d like more meaningful job console to track status

What problems are you solving with the product? What benefits have you realized?

Piping data into data store

Copy Review URL
U
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"Glue is Awesome!"

What do you like best?

Ability to transform foundational company data.

What do you dislike?

Challenges with Optimization of derived entities and views created as part of data transformation.

What problems are you solving with the product? What benefits have you realized?

Transforming Company Data

Copy Review URL
U
Mid-Market
(501-1000 employees)
Validated Reviewer
Review Source
Copy Review URL

"ETL solution and data discovery "

What do you like best?

Data discovery capability is the best thing I liked about GLue

What do you dislike?

More complex ETL transformation options will be helpful

What problems are you solving with the product? What benefits have you realized?

ETL solution

Copy Review URL
U
Validated Reviewer
Review Source
Copy Review URL

"Would love to use when it is hardened"

What do you like best?

S3 data discoverability, flexible schema support

What do you dislike?

Unstable ETL, lack of integration with external data catalogs

What problems are you solving with the product? What benefits have you realized?

ETL and ad-hoc data analysis

Copy Review URL
U
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"Glue for spark pipeline"

What do you like best?

Glue automates spark pipeline in serverless manner

What do you dislike?

Glue catalog does not support free text search

What problems are you solving with the product? What benefits have you realized?

Data lake

Copy Review URL
U
Enterprise
(10,001+ employees)
Validated Reviewer
Review Source
Copy Review URL

"ETL workhorse"

What do you like best?

Code generation, customization, Sceduling

What do you dislike?

Limited languages in which code is generated

What problems are you solving with the product? What benefits have you realized?

ETL of streaming data

AWS Glue Features

  • Data Transformations
  • Real-Time Integration
  • Parallel Processing
  • Data Chunker
  • Data Masking
  • Proactive Monitoring

AWS Glue User Ratings

7.5
Ease of Use
Average: 8.4*
8.3
Quality of Support
Average: 8.5*
8.6
Ease of Setup
Average: 8.4*
* ETL Tools Category
Do you work for AWS Glue?