edward_smith
04/26/2021, 3:13 PM
douwe_maan
04/26/2021, 3:26 PM
meltano elt, which matches the schedule name if you used meltano schedule to define a scheduled pipeline. So state is tracked per scheduled pipeline, and multiple pipelines can use the same extractor.
In your case, it sounds like you have an extractor for each DB, and then 5 inheriting extractors for each of those with a specific subset of selected tables, and then one schedule for each of those inheriting extractors.
Since state is tracked per schedule, moving a stream from one schedule to another would require you to also move the state from one schedule to the other, which right now would require you to manually modify the job table in the system database: https://meltano.com/docs/project.html#system-database
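To make the manual state move a bit more concrete, here is a rough sketch in Python. It runs against a simplified, in-memory stand-in for the job table, not the real system database. The column names (`job_id`, `payload`), the `singer_state` / `bookmarks` layout of the payload, and the job IDs and stream name in the demo are all assumptions for illustration, so check them against your own system database (and back it up) before trying anything like this.

```python
import json
import sqlite3


def make_demo_db():
    """Build an in-memory stand-in for the job table (assumed, simplified schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE job (id INTEGER PRIMARY KEY, job_id TEXT, payload TEXT)")
    rows = [
        # Hypothetical schedule names and stream names.
        ("db1-group-a", {"singer_state": {"bookmarks": {
            "public-orders": {"replication_key_value": "2021-04-25"},
            "public-users": {"replication_key_value": "2021-04-26"},
        }}}),
        ("db1-group-b", {"singer_state": {"bookmarks": {}}}),
    ]
    for job_id, payload in rows:
        conn.execute(
            "INSERT INTO job (job_id, payload) VALUES (?, ?)",
            (job_id, json.dumps(payload)),
        )
    conn.commit()
    return conn


def latest_row(conn, job_id):
    """Return (row id, parsed payload) of the most recent row for a job ID."""
    row = conn.execute(
        "SELECT id, payload FROM job WHERE job_id = ? ORDER BY id DESC LIMIT 1",
        (job_id,),
    ).fetchone()
    if row is None:
        raise LookupError(f"no job rows found for job_id {job_id!r}")
    return row[0], json.loads(row[1])


def move_stream_state(conn, stream, from_job_id, to_job_id):
    """Move one stream's bookmark from one job ID's latest state to another's."""
    src_id, src = latest_row(conn, from_job_id)
    dst_id, dst = latest_row(conn, to_job_id)

    # Remove the bookmark from the source state and attach it to the target.
    bookmark = src["singer_state"]["bookmarks"].pop(stream)
    dst.setdefault("singer_state", {}).setdefault("bookmarks", {})[stream] = bookmark

    # Write both rows in a single transaction so the bookmark is never lost.
    with conn:
        conn.execute("UPDATE job SET payload = ? WHERE id = ?", (json.dumps(src), src_id))
        conn.execute("UPDATE job SET payload = ? WHERE id = ?", (json.dumps(dst), dst_id))
```

For example, `move_stream_state(make_demo_db(), "public-orders", "db1-group-a", "db1-group-b")` would leave the `public-orders` bookmark only under the second job ID, so the next run of that schedule would pick up where the first one left off.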
I'd be interested in exploring automatic parallelization over different tap processes using "subpipelines" so that we could track the state for each DB's tables in one place while the way the tables are balanced over different subprocesses can change over time. @ken_payne has suggested something like that before as well. Maybe it's time to create an issue, Ken?
edward_smith
04/26/2021, 3:53 PM
edward_smith
04/26/2021, 3:53 PM
edward_smith
04/26/2021, 3:58 PM
douwe_maan
04/26/2021, 3:59 PM
edward_ryan
04/26/2021, 4:00 PM
douwe_maan
04/26/2021, 4:00 PM
ken_payne
04/26/2021, 7:18 PM