gray-cricket-92960
02/02/2021, 1:58 PM
ripe-musician-59933
02/02/2021, 6:35 PM
`tap-covid-19` doesn't use `start_date` and the timestamp stored between runs to select only records created since that point, but rather to select only files that were changed on GitHub since then: https://github.com/singer-io/tap-covid-19/blob/5940583111b1978b0ef252d943f8fb5728bc90e7/tap_covid_19/sync.py#L134-L135
Out of those matching files, it looks like it still imports all records.
gray-cricket-92960
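To illustrate the distinction drawn about `tap-covid-19`, here is a minimal sketch of file-level versus record-level filtering against a stored timestamp. This is not the tap's code: the file paths, field names, and both helper functions are hypothetical and exist only for illustration.

```python
from datetime import datetime, timezone

# Hypothetical data: each file has a last-changed time and holds records
# with their own creation times. None of these names come from the tap.
FILES = [
    {
        "path": "reports/01-30.csv",
        "changed_at": datetime(2021, 1, 30, tzinfo=timezone.utc),
        "records": [
            {"id": 1, "created_at": datetime(2021, 1, 30, tzinfo=timezone.utc)},
        ],
    },
    {
        "path": "reports/all-time.csv",
        "changed_at": datetime(2021, 2, 2, tzinfo=timezone.utc),
        "records": [
            {"id": 2, "created_at": datetime(2020, 3, 1, tzinfo=timezone.utc)},
            {"id": 3, "created_at": datetime(2021, 2, 2, tzinfo=timezone.utc)},
        ],
    },
]


def sync_by_file(files, bookmark):
    """Emit every record from any file changed after the bookmark."""
    return [r for f in files if f["changed_at"] > bookmark for r in f["records"]]


def sync_by_record(files, bookmark):
    """Emit only records created after the bookmark, whatever file holds them."""
    return [r for f in files for r in f["records"] if r["created_at"] > bookmark]


bookmark = datetime(2021, 2, 1, tzinfo=timezone.utc)
file_level_ids = [r["id"] for r in sync_by_file(FILES, bookmark)]
record_level_ids = [r["id"] for r in sync_by_record(FILES, bookmark)]
print(file_level_ids)    # [2, 3]
print(record_level_ids)  # [3]
```

With file-level filtering, the old record (id 2) is imported again because the file that contains it changed after the bookmark.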
02/02/2021, 6:37 PM
ripe-musician-59933
02/02/2021, 6:45 PM
Handling `start_date` and bookmarks is always the responsibility of the tap itself rather than Meltano. You can usually find it by searching the repo for `start_date`, which is what I did here: https://github.com/singer-io/tap-covid-19/search?q=start_date
> If I haven't formally created a pipeline yet, does Meltano keep track of what I execute the job with (e.g. tap, target, job_id)?
Yep, the runs end up in the `job` table in the system database with their `job_id` either way. Using `meltano schedule` and the `schedules` list in `meltano.yml` is completely optional if you're happy constructing and running the appropriate `meltano elt` commands yourself.
gray-cricket-92960
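For anyone wanting to look at those runs, a minimal sketch of listing the rows for one `job_id` follows. It runs against an in-memory stand-in rather than a real system database, so the column set (`state`, `started_at`), the state values, and the job ID `covid-19-to-postgres` are assumptions made for illustration; check the actual `job` table in your project's system database before relying on any of them.

```python
import sqlite3

# In-memory stand-in for the system database. The schema is simplified
# and assumed; the real `job` table may have different or more columns.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE job ("
    "id INTEGER PRIMARY KEY, job_id TEXT, state TEXT, started_at TEXT)"
)
conn.executemany(
    "INSERT INTO job (job_id, state, started_at) VALUES (?, ?, ?)",
    [
        ("covid-19-to-postgres", "SUCCESS", "2021-02-01T10:00:00"),
        ("covid-19-to-postgres", "SUCCESS", "2021-02-02T10:00:00"),
        ("other-job", "FAIL", "2021-02-02T11:00:00"),
    ],
)


def runs_for(connection, job_id):
    """Return (state, started_at) for every run of job_id, newest first."""
    cursor = connection.execute(
        "SELECT state, started_at FROM job "
        "WHERE job_id = ? ORDER BY started_at DESC",
        (job_id,),
    )
    return cursor.fetchall()


runs = runs_for(conn, "covid-19-to-postgres")
print(runs)
```

Reusing the same `job_id` across runs is what groups them together in this lookup, whether or not a schedule was ever defined.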
02/07/2021, 1:49 AM