Tagged Data Warehouse

The Twelve Days of AWS: Redshift

To talk about Redshift we first need to talk about data warehousing, as Redshift is a fully managed data warehouse service. A data warehouse is not a ‘digital warehouse’ where your data goes to gather virtual dust on the back of a virtual shelf somewhere; it is more akin to a single source of truth for the state of a business and its information. Whilst Redshift is based on PostgreSQL and can be queried with normal SQL, it is not meant to be a database per se.


The Twelve Days of AWS: Data Pipelines

Extract, Transform & Load (ETL) is the name of the game when it comes to Data Pipelines. 

The Extract portion acquires data from one or more sources. That data then passes through Transform, where any needed alterations are made, and finally ends up Loaded into another storage target, such as Redshift or S3, to name a couple.
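
To make the three stages concrete, here is a minimal sketch in Python. It is an illustration only, not how AWS Data Pipeline itself works: the sample CSV, the `orders` table, and the function names are all hypothetical, and an in-memory SQLite database stands in for a real target such as Redshift or S3.

```python
import csv
import io
import sqlite3

# Hypothetical raw input; a real pipeline would pull from a file, API, or database.
RAW_CSV = """order_id,customer,amount
1, alice ,10.50
2,BOB,3.25
3,carol,not-a-number
"""


def extract(source):
    """Extract: read the raw CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(source)))


def transform(rows):
    """Transform: normalise customer names and drop rows with a bad amount."""
    cleaned = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # skip rows whose amount cannot be parsed
        cleaned.append((int(row["order_id"]), row["customer"].strip().lower(), amount))
    return cleaned


def load(rows, conn):
    """Load: write the cleaned rows into the destination table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


def run_pipeline(source, conn):
    """Run Extract, Transform, and Load in order; return the number of rows loaded."""
    rows = transform(extract(source))
    load(rows, conn)
    return len(rows)


if __name__ == "__main__":
    # SQLite in memory is an assumption standing in for the real storage target.
    connection = sqlite3.connect(":memory:")
    print(run_pipeline(RAW_CSV, connection), "rows loaded")
```

Keeping each stage as its own function means the source or the destination can be swapped out without touching the transformation logic in between.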
