AWS Glue Reviews and details

Screenshots and images

Landing page //
2022-01-29

Badges & Trophies

Promote AWS Glue. You can add any of these badges on your website.

<a href='https://www.saashub.com/experts/rounds/314?utm_source=badge&utm_campaign=badge&utm_content=aws-glue&badge_variant=color&badge_kind=nominated' target='_blank'><img src="https://cdn-b.saashub.com/img/badges/nominated-color.png?v=1" alt="AWS Glue badge" style="max-width: 150px;"/></a>

Show embed code

<a href='https://www.saashub.com/aws-glue?utm_source=badge&utm_campaign=badge&utm_content=aws-glue&badge_variant=color&badge_kind=approved' target='_blank'><img src="https://cdn-b.saashub.com/img/badges/approved-color.png?v=1" alt="AWS Glue badge" style="max-width: 150px;"/></a>

Show embed code

Videos

Build ETL Processes for Data Lakes with AWS Glue - AWS Online Tech Talks

AWS re:Invent BDT 201: AWS Data Pipeline: A guided tour

Getting Started with AWS Glue Data Catalog

Social recommendations and mentions

We have tracked the following product recommendations or mentions on various public social media platforms and blogs. They can help you see what people think about AWS Glue and what they use it for.

Build Your Movie Recommendation System Using Amazon Personalize, MongoDB Atlas, and AWS Glue
AWS Glue is a fully managed extract, transform, and load (ETL) service that makes it easy to prepare and load data for analysis. It helps bridge the gap between our MongoDB Atlas data and the services we'll use for recommendation. - Source: dev.to / about 2 months ago
Using Snowflake data hosted in GCP with AWS Glue
AWS Glue is a fully managed extract, transform, and load (ETL) service provided by Amazon Web Services (AWS). It is designed to make it easy for users to prepare and load their data for analysis. AWS Glue simplifies the process of building and managing ETL workflows by providing a serverless environment for running ETL jobs. - Source: dev.to / 3 months ago
How to check for quality? Evaluate data with AWS Glue Data Quality
It is serverless data integration service to allow you to easily scale your workloads in preparing data and moving transformed data into a target location. - Source: dev.to / 10 months ago
Deploying a Data Warehouse with Pulumi and Amazon Redshift
So in the next post, we'll do that: We'll take what we've done here, add a few more components with Pulumi and AWS Glue, and wire it all up with a few magical lines of Python scripting. - Source: dev.to / over 1 year ago
Serverless Event Driven AI as a Service
Once it's in a Data Lake then you have different options depending on the analytics you need. For more advanced constant analytics then you could look into Amazon Kinesis Data Analytics instead of Firehose to S3, but for Ad-Hoc queries then this is where Glue and Athena come in. - Source: dev.to / over 1 year ago
Well-Architected Review - Part II - Operational Excellence
You will want to use metrics based on operations outcomes to gain useful insights. Now you want to do analytics on your logs and use Cloudwatch Logs Insights or store the logs in Amazon S3, which then triggers an AWS Glue crawler to create an AWS Glue Data Catalog that then can be queried using Amazon Athena using standard SQL. The results can be visualized in Amazon Quicksight. - Source: dev.to / over 1 year ago
AWS Lambda storage options
Storing data in S3 has an additional benefit, given how well it integrates with other AWS services. For instance, you can use Amazon Athena to query your S3 data, or Amazon Rekognition to analyze it. Additionally you can use AWS Glue to perform extract, transform, and loan (ETL) operations. To create ad hoc visualizations and business analysis reports, Amazon QuickSight can connect to your S3 buckets and produce... - Source: dev.to / over 1 year ago
Keep Athena up to date with only the most recent data from partitions
Not 100% if this is what you need, but look into integrating AWS Glue. It should be able to keep the data source that Athena uses up to date real time or close to it, from what I understand. Source: almost 2 years ago
What's New with AWS: AWS Glue Streaming ETL now supports auto-decompression
AWS Glue streaming ETL (Extract Transform and Load) can now detect compressed data streaming from Amazon Kinesis, Amazon Managed Streaming for Apache Kafka (Amazon MSK), and self managed Apache Kafka. It can then automatically decompresses this data without customers having to write code, saving them development hours. AWS Glue Streaming ETL jobs continuously consume data from streaming sources, cleans and... - Source: dev.to / almost 2 years ago
Query compressed logs that are stored in S3 using AWS Athena
Use some ETL service from AWS and push what has been processed to a Log Group: AWS Glue is the service that can be used for this purpose. So it's another option to make everything inside the cloud itself. - Source: dev.to / almost 2 years ago
Ingestion of live data
Also, if you're doing this for an employer, and they have some deeper pockets, there is also AWS Data Pipeline. Source: about 2 years ago
Easy and cost effective way to sync data from RDS Postgres to Redshift
Why aren't you looking at AWS Glue to load the data from Postgres to Redshift? It's relatively inexpensive and purpose built for such tasks. Source: about 2 years ago
Machine Learning Best Practices for Public Sector Organizations
AWS Glue is a fully managed ETL service that makes it simple and cost-effective to categorize, clean, enrich, and migrate data from a source system to a data store for ML. - Source: dev.to / over 2 years ago
Any data engineers familiar with building pipelines in AWS?
Unfortunately there's just so many options for data ingest. Any programming language could be used, and there's plenty of off-the-shelf software and SaaS solutions to do it too. For example it could be done with AWS Data Pipeline (https://aws.amazon.com/datapipeline) or maybe there's just a EC2 virual machine running some custom python code that is doing it. Source: about 3 years ago
Data Factory
Looks like that is a ETL system, so https://aws.amazon.com/glue/. Source: about 3 years ago

External sources with reviews and comparisons of AWS Glue

10 Best ETL Tools (October 2023)

AWS Glue is an end-to-end ETL offering intended to make ETL workloads easier and more integratable with the larger AWS ecosystem. One of the more unique aspects of the tool is that it is serverless, meaning Amazon automatically provisions a server and shuts it down following the completion of the workload.

Source: www.unite.ai

Top 14 ETL Tools for 2023

Notably, AWS Glue is serverless, which means that Amazon automatically provisions a server for users and shuts it down when the workload is complete. AWS Glue also includes features such as job scheduling and “developer endpoints” for testing AWS Glue scripts, improving the tool’s ease of use.

Source: www.integrate.io

A List of The 16 Best ETL Tools And Why To Choose Them

Better yet, when interacting with AWS Glue, practitioners can choose between a drag-and-down GUI, a Jupyter notebook, or Python/Scala code. AWS Glue also offers support for various data processing and workloads that meet different business needs, including ETL, ELT, batch, and streaming.

Source: www.datacamp.com

Top 10 AWS ETL Tools and How to Choose the Best One | Visual Flow

The AWS Glue Data Catalog contains table and job definitions, and other control information. It automatically generates statistics and registers partitions, so data queries can run more efficiently. The catalog also supports an extended history for schema versions, allowing you to see how data has changed over time.

Source: visual-flow.com

Top 5 AWS Glue Alternatives: Best ETL Tools

AWS Glue performs data processing functions like Data Extraction, Data Transformation, and Data Loading to organize enterprise data. This is helpful for organizations that manage large amounts of data. AWS Glue is specifically designed for companies that execute ETL jobs on a serverless platform based on Apache Spark.

Source: hevodata.com

Top 7 ETL Tools for 2021

Source: www.xplenty.com

Do you know an article comparing AWS Glue to other products?
Suggest a link to a post with product alternatives.

Suggest an article

Generic AWS Glue discussion

This is an informative page about AWS Glue. You can review and discuss the product here. The primary details have not been verified within the last quarter, and they might be outdated. If you think we are missing something, please use the means on this page to comment or suggest changes. All reviews and comments are highly encouranged and appreciated as they help everyone in the community to make an informed choice. Please always be kind and objective when evaluating a product and sharing your opinion.

AWS Glue

Fully managed extract, transform, and load (ETL) service subtitle