omar_abed
03/30/2023, 1:29 PMextractors:
- name: tap-mytap-account1
namespace: tap_mytap
executable: tap-mytap
config:
username: 'abc'
password: '123'
- name: tap-mytap-account2
namespace: tap_mytap
inherit_from: tap-mytap-account1
config:
username: 'xyz'
password: '789'
And I'm trying to load them into the same table using the bigquery loader, like so
schedules:
- name: mytap_account1_to_bigquery
extractor: tap-mytap-account1
loader: target-bigquery
transform: skip
...
- name: mytap_account2_to_bigquery
extractor: tap-mytap-account2
loader: target-bigquery
...
transform: skip
My question is: Is there a way to use the target-bigquery-truncate
loader instead to get these to truncate the destination table on each run? Currently, one job runs and then truncates the results of the other. Can inherited jobs "run together" essentially and truncate previous runs together, rather than truncating each other?
My current workaround is to append the table and dedupe later on, but this is causing my source table to grow unnecessarily large, when all I need is the latest results from both runs, combined.Sven Balnojan
03/30/2023, 2:00 PMomar_abed
03/30/2023, 2:08 PMomar_abed
03/30/2023, 2:11 PMomar_abed
03/30/2023, 2:12 PMtarget-bigquery-truncate
loader, and the next into the target-bigquery
loader, and offset them by a few minutes to make sure the first runs firstSven Balnojan
03/30/2023, 2:13 PMomar_abed
03/30/2023, 2:15 PMtarget-bigquery-truncate
loader I'm talking about is a custom inherited loader which just sets the replication_method
on the target-bigquery
loader to truncate
. So.. never mind. 🤦 Thanks for walking me through it though, this makes a lot of senseSven Balnojan
03/30/2023, 2:16 PM