We have a scenario where both the source and the target database are BigQuery: we extract data from BigQuery (source) and load it into BigQuery (target). In the loader we have set upsert: true and denormalized: true. When we insert new records in the source and run the pipeline, those records are populated correctly in the target. The issue is that when we update existing records in the source, the pipeline does not update them in the target; instead it inserts them again as new records, creating duplicates in the target. The meltano.yml used in this case is given below. Please share your thoughts on how to resolve this.
Sample YML:
extractors:
- name: tap-bigquery-load-bronze
  variant: anelendata
  pip_url: tap-bigquery
  config:
    streams:
    - name: **
      table: **
      columns:
      - '*'
      datetime_key: _sdc_extracted_at
    credentials_path: **.json
    start_datetime: 2019-03-26 15:33:39.217317 UTC
    start_always_inclusive: true
  inherit_from: tap-bigquery
loaders:
- name: target-bigquery-load-silver
  variant: z3z1ma
  pip_url: git+https://github.com/z3z1ma/target-bigquery.git
  config:
    location: us-central1
    project: **
    dataset: **
    denormalized: true
    upsert: true
    method: batch_job
    credentials_path: json
  inherit_from: target-bigquery
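
One guess on our side (unverified): upsert in target-bigquery seems to need key_properties on the incoming SCHEMA messages in order to MERGE, and the anelendata tap-bigquery may not be emitting any, so every run just appends. Would declaring the key through Meltano's metadata extra be the right fix? A rough sketch of what we mean, where my_stream and id are placeholders and not our real stream name or key column:

extractors:
- name: tap-bigquery-load-bronze
  inherit_from: tap-bigquery
  # Placeholder names: my_stream = the stream in question, id = its primary-key column
  metadata:
    my_stream:
      table-key-properties: [id]

We are also not sure whether this tap honors table-key-properties from the catalog, so please correct us if a different approach is needed.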