Module 15
Data engineering patterns


Three-pronged strategy to build data infrastructure
Elements of a data pipeline

Homogeneous ingestion pattern

Heterogeneous ingestion patterns
Extract, transform and load (ETL)
Extract, load, and transform (ELT)

Batch and Streaming processing patterns

AWS tools to ingest data
Amazon App Flow

AWS DataSync

AWS Data Exchange

Processing Data in AWS
Batch ingestion and processing

AWS Glue
AWS Glue Components

Example

AWS Glue Transformation types
.csv
.parquet
Convert .csv to .parquet
Last updated