michael_harris
05/25/2021, 5:42 PM
tap-postgres to target-bigquery 🧵
michael_harris
05/25/2021, 5:47 PM
select:
- schem-foo.*
- schem-bar.*
- schem-buzz.id
- schem-buzz.created_at
- schem-buzz.updated_at
For one of my tables, I want to ignore some of the fields because they are giving me trouble when trying to load into BigQuery (see error below)
target-bigquery | CRITICAL 'type' or 'anyOf' are required fields in property: {'$ref': '#/definitions/sdc_recursive_string_array'}
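The error suggests the target hit a schema property that carries only a `$ref`, with neither `type` nor `anyOf`. A minimal Python sketch of that kind of check, which could help spot such properties in a discovered catalog, might look like this. The schema and the property name `some_array_column` are made up for illustration, and the function is not target-bigquery's actual code.

```python
def properties_missing_type(schema):
    """Return names of top-level properties that declare neither 'type' nor 'anyOf'."""
    return [
        name
        for name, spec in schema.get("properties", {}).items()
        if "type" not in spec and "anyOf" not in spec
    ]


# Hypothetical stream schema: one ordinary column and one column that is only a $ref.
example_schema = {
    "type": "object",
    "properties": {
        "id": {"type": ["null", "integer"]},
        "some_array_column": {"$ref": "#/definitions/sdc_recursive_string_array"},
    },
}

# Lists the properties that would trip a "'type' or 'anyOf' required" style check.
print(properties_missing_type(example_schema))
```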
But, looking at the logs, it still looks like tap-postgres is selecting all of the fields
tap-postgres | time=2021-05-25 13:33:59 name=tap_postgres level=INFO message=select statement: SELECT "_id" ,CASE WHEN "created_at" < '0001-01-01 00:00:00.000' OR "created_at" > '9999-12-31 23:59:59.999' THEN '9999-12-31 23:59:59.999' ELSE "created_at" END AS "created_at" , "creators" , "i_data" , "id" , "in_market" , "m_data" , "o_data" , "owners" , "tp_data" ,CASE WHEN "updated_at" < '0001-01-01 00:00:00.000' OR "updated_at" > '9999-12-31 23:59:59.999' THEN '9999-12-31 23:59:59.999' ELSE "updated_at" END AS "updated_at"
tap-postgres | FROM "schem"."buzz"
tap-postgres | ORDER BY "updated_at" ASC with itersize 20000
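To make the mismatch concrete, here is a rough Python sketch that compares the columns in the logged SELECT against the `select` rules. It assumes the rules behave as plain glob patterns on `<stream>.<column>` and uses Python's `fnmatch` module; Meltano's real selection logic may differ, and the helper name `is_selected` is hypothetical.

```python
from fnmatch import fnmatchcase

# Select rules copied from the tap-postgres config in this thread.
SELECT_RULES = [
    "schem-foo.*",
    "schem-bar.*",
    "schem-buzz.id",
    "schem-buzz.created_at",
    "schem-buzz.updated_at",
]

# Columns that appear in the logged SELECT statement for "schem"."buzz".
LOGGED_COLUMNS = [
    "_id", "created_at", "creators", "i_data", "id", "in_market",
    "m_data", "o_data", "owners", "tp_data", "updated_at",
]


def is_selected(stream, column, rules):
    """Hypothetical helper: True if any glob rule matches '<stream>.<column>'."""
    target = f"{stream}.{column}"
    return any(fnmatchcase(target, rule) for rule in rules)


# Columns the tap queried even though no rule covers them (under the assumption above).
unexpected = [c for c in LOGGED_COLUMNS if not is_selected("schem-buzz", c, SELECT_RULES)]
print(unexpected)
```

Under these assumptions, eight of the eleven logged columns are not covered by any rule.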
Is there a way for me to have the postgres tap completely ignore these fields in this select statement?
michael_harris
05/25/2021, 5:49 PM
michael_harris
05/25/2021, 5:53 PM
- name: tap-postgres
  variant: transferwise
  pip_url: pipelinewise-tap-postgres
  config:
    host: <HOST>
    port: <PORT>
    dbname: <DBNAME>
    user: <USER>
  select:
  - schem-foo.*
  - schem-bar.*
  - schem-buzz.id
  - schem-buzz.created_at
  - schem-buzz.updated_at
  metadata:
    schem-foo:
      replication-method: INCREMENTAL
      replication-key: updated_at
    schem-bar:
      replication-method: INCREMENTAL
      replication-key: updated_at
    schem-buzz:
      replication-method: INCREMENTAL
      replication-key: updated_at
target
- name: target-bigquery
  variant: adswerve
  pip_url: git+https://github.com/adswerve/target-bigquery.git@v0.10.2
  config:
    project_id: <PROJ_ID>
    dataset_id: <DATASET>
    credentials_path: <PATH>
    validate_records: true
    replication_method: append
aaronsteers
05/25/2021, 6:04 PM
aaronsteers
05/25/2021, 6:05 PM
aaronsteers
05/25/2021, 6:07 PM
--list --all should be effective for debugging the inclusion logic: https://meltano.com/docs/command-line-interface.html#select
michael_harris
05/25/2021, 6:27 PM
--list --all, it showed the fields as not being selected, but according to the logs they still were 🤷
aaronsteers
05/25/2021, 7:07 PM
aaronsteers
05/25/2021, 7:11 PM
sdc_recursive_string_array issue as well: https://meltano.slack.com/archives/CMN8HELB0/p1621875990047500
michael_harris
05/25/2021, 7:30 PM
aaronsteers
05/25/2021, 7:32 PM