Oscar Gullberg
04/09/2025, 10:28 AMstg
dataset after completion?
Use case: Ingesting data from multiple Shopify stores.
Right now, we run one Meltano pipeline per store, which:
• Extracts raw data into a shared raw_shopify
dataset in BigQuery
• Creates common views in a single stg_shopify
dataset
This setup causes some issues. Ideally, we want to:
1. Ingest each store's raw data into its own dataset (e.g. raw_shopify_store1
, raw_shopify_store2
, etc.) in parallel
2. Run per-store transforms into separate staging datasets (e.g. stg_shopify_store1
, etc.)
3. Run a final transform step that unions everything into a central stg_shopify
dataset
Is there a clean way to do this in Meltano? Any recommendations or patterns others are using?Oscar Gullberg
04/09/2025, 10:29 AMEdgar Ramírez (Arch.dev)
04/09/2025, 4:17 PMload_schema
setting:
• https://docs.meltano.com/concepts/project/#inheriting-plugin-definitions
• https://docs.meltano.com/concepts/plugins/#load_schema-extra