AWS Glue x Others
Table of contents
General
AWS Glue compared to other AWS Services.
Glue versus Kinesis Data Analytics
AWS Glue
- Spark-based;
- ETL is its primary use case;
- Ideal for transforming/loading data into the Data Lake.
Kinesis Data Analytics
- Flink-based;
- Primary use case is Analytics;
- Ideal for real-time analytics.
Glue versus Kinesis Data Firehose
AWS Glue
- Complex ETLs with possibility to join different streaming data.
- Has more Targets available than Kinesis Data Firehose.
Kinesis Data Firehose
- Simple ETL with the help of Lambda Functions and simple conversion from JSON to Parquet.
- Target is limited to Splunk, S3, Redshift and Elastic Search.
Glue versus Amazon EMR
AWS Glue
Jobs Spark or Python
Amazon EMR
Jobs on a Hadoop environment, supporting any framework.