Very very excited that you're building content on multi-engine data stacks. Keep it up! Gets me 🧃 up
Is this still possible now that Snowflake discontinued version-hint.txt?
First of all, I love your posts. Your contribution to the data community is much appreciated! Considering this stack, what would you introduce to source data from a database such as SQL Server?
Thanks, Kent!
You would need CDC. Depending on your needs, you can opt for open source (Debezium), AWS DMS, or any other vendor in that space. Changes would then be written to S3 (possibly with compaction) before downstream consumption.
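For illustration, a minimal sketch of the compaction step with DuckDB, assuming the CDC tool lands small Parquet change files under a placeholder S3 prefix and that AWS credentials are available to httpfs:

```python
import duckdb

# Placeholder bucket/paths; assumes Debezium/DMS writes Parquet change files
# with an order_id primary key and a commit_ts change timestamp.
con = duckdb.connect()
con.execute("INSTALL httpfs; LOAD httpfs;")

# Keep only the latest change per primary key, then write one compacted file.
con.execute("""
    COPY (
        SELECT * EXCLUDE (rn)
        FROM (
            SELECT *,
                   row_number() OVER (PARTITION BY order_id
                                      ORDER BY commit_ts DESC) AS rn
            FROM read_parquet('s3://my-bucket/cdc/orders/*.parquet')
        )
        WHERE rn = 1
    )
    TO 's3://my-bucket/cdc/orders_compacted.parquet' (FORMAT PARQUET)
""")
```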
Embedding DuckDB in Lambdas as an API layer to serve Snowflake results is worth exploring. I wonder how well it scales compared to syncing data to Redis/PSQL or solutions like Airfold/Tinybird.
Hi Julian,
I conducted some experiments in this blog post: https://juhache.substack.com/p/exploring-duckdb-aws-lambda
The main drawback was the cold start of the Lambda function.
Indeed, benchmarking it with the tools you mentioned is a great idea.
Do you have any figures already at Airfold?
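For anyone curious, a minimal sketch of the pattern (not the exact code from the post; bucket and paths are placeholders):

```python
import json
import duckdb

# Assumes DuckDB is bundled with the Lambda package/layer. The connection is
# created outside the handler so warm invocations reuse it; only the first
# (cold) invocation pays the startup cost mentioned above.
con = duckdb.connect()
con.execute("INSTALL httpfs; LOAD httpfs;")

def handler(event, context):
    # Hypothetical API: aggregate a result set pre-exported from Snowflake to S3.
    rows = con.execute(
        "SELECT status, count(*) AS n "
        "FROM read_parquet('s3://my-bucket/exports/orders/*.parquet') "
        "GROUP BY status"
    ).fetchall()
    return {"statusCode": 200, "body": json.dumps(dict(rows))}
```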
> I have no idea how well PyIceberg can read/write large data but I will test it in a future post!
Pretty well! See iceberg-python/#428 and iceberg-python/#444
https://github.com/apache/iceberg-python/issues/428
https://github.com/apache/iceberg-python/pull/444
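A minimal sketch of that read/write path, assuming a configured catalog named "default" and a placeholder table:

```python
import pyarrow as pa
from pyiceberg.catalog import load_catalog

# Placeholder catalog/table names; catalog config comes from .pyiceberg.yaml
# or environment variables.
catalog = load_catalog("default")
table = catalog.load_table("demo.orders")

# Read: the scan pushes column selection down before materializing to Arrow.
arrow_df = table.scan(selected_fields=("order_id", "amount")).to_arrow()

# Write (the support added in apache/iceberg-python#444): append an Arrow
# table whose schema matches the Iceberg table's schema.
table.append(pa.table({"order_id": pa.array([1], pa.int64()),
                       "amount": pa.array([9.99], pa.float64())}))
```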
Thanks a lot, Kevin! Looks really good indeed!