AWS Glue versus Others

General

How AWS Glue compares to other AWS services.

Glue versus Kinesis Data Analytics

AWS Glue

  • Spark-based;
  • ETL is its primary use case;
  • Ideal for transforming/loading data into the Data Lake.

Kinesis Data Analytics

  • Flink-based;
  • Primary use case is Analytics;
  • Ideal for real-time analytics.

Glue versus Kinesis Data Firehose

AWS Glue

  • Supports complex ETL, including joins across multiple streaming data sources.
  • Offers more targets than Kinesis Data Firehose.

Kinesis Data Firehose

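  • Supports simple ETL through Lambda functions, plus built-in conversion from JSON to Parquet.
  • Targets are more limited: mainly S3, Redshift, OpenSearch Service (formerly Elasticsearch Service), and Splunk, plus HTTP endpoints for some third-party providers.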
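
The Lambda-based transformation mentioned above can be sketched as follows. This is a minimal example, assuming the standard Firehose transformation contract (each incoming record carries a `recordId` and base64-encoded `data`, and each returned record carries the same `recordId`, a `result` of `Ok`, `Dropped`, or `ProcessingFailed`, and re-encoded `data`). The `level` and `processed` fields are hypothetical and exist only for illustration.

```python
import base64
import json


def lambda_handler(event, context):
    """Transform a batch of Firehose records and return one result per record."""
    output = []
    for record in event["records"]:
        try:
            payload = json.loads(base64.b64decode(record["data"]))
            if not isinstance(payload, dict):
                raise ValueError("expected a JSON object")
        except ValueError:
            # Undecodable or non-object payload: mark the record as failed
            # and hand the original data back unchanged.
            output.append({
                "recordId": record["recordId"],
                "result": "ProcessingFailed",
                "data": record["data"],
            })
            continue

        # Hypothetical filter: drop debug-level events.
        if payload.get("level") == "DEBUG":
            output.append({
                "recordId": record["recordId"],
                "result": "Dropped",
                "data": record["data"],
            })
            continue

        # Hypothetical enrichment: flag the record as processed.
        payload["processed"] = True
        # Trailing newline keeps delivered objects newline-delimited.
        encoded = base64.b64encode((json.dumps(payload) + "\n").encode("utf-8"))
        output.append({
            "recordId": record["recordId"],
            "result": "Ok",
            "data": encoded.decode("utf-8"),
        })
    return {"records": output}
```

Anything beyond this kind of per-record filtering or enrichment, such as joining two streams, is where Glue becomes the better fit.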

Glue versus Amazon EMR

AWS Glue

  • Runs jobs in Spark or Python.
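
As a rough illustration of the two job types, the sketch below builds the arguments that boto3's `glue.create_job` accepts, where the `Command` name `glueetl` selects a Spark job and `pythonshell` selects a Python shell job. The helper `glue_job_args` and all names, ARNs, and paths passed to it are hypothetical; the helper only builds a dictionary and does not call AWS.

```python
def glue_job_args(name, role_arn, script_s3_path, job_type="spark"):
    """Build keyword arguments for boto3's glue.create_job (no AWS call is made)."""
    # Map the two job types discussed here to Glue's command names.
    command_names = {"spark": "glueetl", "python": "pythonshell"}
    if job_type not in command_names:
        raise ValueError(f"unsupported job type: {job_type!r}")
    return {
        "Name": name,
        "Role": role_arn,
        "Command": {
            "Name": command_names[job_type],
            "ScriptLocation": script_s3_path,
            "PythonVersion": "3",
        },
    }


# Placeholder values, for illustration only.
spark_job = glue_job_args(
    "example-etl",
    "arn:aws:iam::123456789012:role/ExampleGlueRole",
    "s3://example-bucket/scripts/etl.py",
)
```

With boto3 installed and credentials configured, the result could be passed as `boto3.client("glue").create_job(**spark_job)`.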

Amazon EMR

  • Runs jobs on a Hadoop cluster, supporting a wide range of frameworks (for example Spark, Hive, Flink, and Presto).