shy-article-27410
12/14/2020, 3:48 PMmany-farmer-43903
12/14/2020, 4:01 PMripe-musician-59933
12/14/2020, 4:38 PMshy-article-27410
12/14/2020, 4:47 PMripe-musician-59933
12/14/2020, 4:59 PMshy-article-27410
12/14/2020, 7:20 PMgray-cricket-92960
12/15/2020, 1:57 AMshy-article-27410
12/15/2020, 7:54 PMripe-musician-59933
12/15/2020, 8:41 PMThe initial use case of moving data from a csv file to Postgres was very straightforward and simple.@shy-article-27410 I'm glad to hear it!
What would be a recommended work flow for receiving/unzipping/staging the files to load only the most recent file? Is this something best done outside Meltano, or is it something Meltano can handle?If that workflow can be expressed in a generic enough way that it could be a setting on the tap, you could consider contributing it; https://github.com/ets/tap-spreadsheets-anywhere supports tons of sources and already does unzipping, so adding some logic to limit it to the most recent file (by creation or modification date) seems like it would be in scope 🙂 Otherwise this would live outside of Meltano, in a pipeline step ahead of
meltano elt
. If you're using Airflow, you could modify the default orchestrate/dags/meltano.py
to add some operators ahead of the BashOperator that runs meltano schedule run <schedule>
.shy-article-27410
12/15/2020, 9:56 PM