Jonathan Cass
10/24/2024, 3:49 PM
visch
10/24/2024, 6:01 PM
visch
10/24/2024, 6:01 PM
Jonathan Cass
10/24/2024, 6:04 PM
version: 1
plugins:
  extractors:
  - name: tap-postgres
    variant: meltanolabs
    pip_url: git+https://github.com/MeltanoLabs/tap-postgres.git
    config:
      database: db
      host: [redacted]
      port: [redacted]
      user: [redacted]
      filter_schemas:
      - public
  loaders:
  - name: target-bigquery
    variant: z3z1ma
    pip_url: git+https://github.com/z3z1ma/target-bigquery.git
    config:
      project: [redacted]
      credentials_path: google_credentials.json
      denormalized: true
      method: batch_job
      upsert: true
      dedupe_before_upsert: true
  - name: target-bq-canvas
    inherit_from: target-bigquery
    config:
      dataset: lk_canvas
Jonathan Cass
10/24/2024, 6:04 PM
Jonathan Cass
10/24/2024, 6:07 PM
narrative_json field has type jsonb
Jonathan Cass
10/24/2024, 6:25 PM
Jonathan Cass
10/24/2024, 6:25 PM
Jonathan Cass
10/24/2024, 6:32 PM
2024-10-24T18:28:36.066315Z [info ] 2024-10-24 18:28:36,049 | ERROR | target-bigquery | Draining all sinks and terminating. cmd_type=elb consumer=True job_name=dev:tap-postgres-to-target-bq-canvas name=target-bq-canvas producer=False run_id=8c88d629-1686-4255-bf02-fbce0ac29643 stdio=stderr string_id=target-bq-canvas
2024-10-24T18:28:38.144763Z [info ] Exception in thread Thread-7: cmd_type=elb consumer=True job_name=dev:tap-postgres-to-target-bq-canvas name=target-bq-canvas producer=False run_id=8c88d629-1686-4255-bf02-fbce0ac29643 stdio=stderr string_id=target-bq-canvas
2024-10-24T18:28:38.145681Z [info ] Traceback (most recent call last): cmd_type=elb consumer=True job_name=dev:tap-postgres-to-target-bq-canvas name=target-bq-canvas producer=False run_id=8c88d629-1686-4255-bf02-fbce0ac29643 stdio=stderr string_id=target-bq-canvas
2024-10-24T18:28:38.146994Z [info ] File "/usr/local/lib/python3.9/threading.py", line 980, in _bootstrap_inner cmd_type=elb consumer=True job_name=dev:tap-postgres-to-target-bq-canvas name=target-bq-canvas producer=False run_id=8c88d629-1686-4255-bf02-fbce0ac29643 stdio=stderr string_id=target-bq-canvas
2024-10-24T18:28:38.147987Z [info ] self.run() cmd_type=elb consumer=True job_name=dev:tap-postgres-to-target-bq-canvas name=target-bq-canvas producer=False run_id=8c88d629-1686-4255-bf02-fbce0ac29643 stdio=stderr string_id=target-bq-canvas
2024-10-24T18:28:38.148836Z [info ] File "/project/.meltano/loaders/target-bigquery/venv/lib/python3.9/site-packages/target_bigquery/batch_job.py", line 63, in run cmd_type=elb consumer=True job_name=dev:tap-postgres-to-target-bq-canvas name=target-bq-canvas producer=False run_id=8c88d629-1686-4255-bf02-fbce0ac29643 stdio=stderr string_id=target-bq-canvas
2024-10-24T18:28:38.149286Z [info ] client.load_table_from_file( cmd_type=elb consumer=True job_name=dev:tap-postgres-to-target-bq-canvas name=target-bq-canvas producer=False run_id=8c88d629-1686-4255-bf02-fbce0ac29643 stdio=stderr string_id=target-bq-canvas
2024-10-24T18:28:38.149794Z [info ] File "/project/.meltano/loaders/target-bigquery/venv/lib/python3.9/site-packages/google/cloud/bigquery/job/base.py", line 966, in result cmd_type=elb consumer=True job_name=dev:tap-postgres-to-target-bq-canvas name=target-bq-canvas producer=False run_id=8c88d629-1686-4255-bf02-fbce0ac29643 stdio=stderr string_id=target-bq-canvas
2024-10-24T18:28:38.150739Z [info ] return super(_AsyncJob, self).result(timeout=timeout, **kwargs) cmd_type=elb consumer=True job_name=dev:tap-postgres-to-target-bq-canvas name=target-bq-canvas producer=False run_id=8c88d629-1686-4255-bf02-fbce0ac29643 stdio=stderr string_id=target-bq-canvas
2024-10-24T18:28:38.151180Z [info ] File "/project/.meltano/loaders/target-bigquery/venv/lib/python3.9/site-packages/google/api_core/future/polling.py", line 261, in result cmd_type=elb consumer=True job_name=dev:tap-postgres-to-target-bq-canvas name=target-bq-canvas producer=False run_id=8c88d629-1686-4255-bf02-fbce0ac29643 stdio=stderr string_id=target-bq-canvas
2024-10-24T18:28:38.152081Z [info ] raise self._exception cmd_type=elb consumer=True job_name=dev:tap-postgres-to-target-bq-canvas name=target-bq-canvas producer=False run_id=8c88d629-1686-4255-bf02-fbce0ac29643 stdio=stderr string_id=target-bq-canvas
2024-10-24T18:28:38.152554Z [info ] google.api_core.exceptions.BadRequest: 400 Error while reading data, error message: JSON table encountered too many errors, giving up. Rows: 1; errors: 1. Please look into the errors[] collection for more details.; reason: invalid, message: Error while reading data, error message: JSON table encountered too many errors, giving up. Rows: 1; errors: 1. Please look into the errors[] collection for more details.; reason: invalid, message: Error while reading data, error message: JSON processing encountered too many errors, giving up. Rows: 1; errors: 1; max bad: 0; error percent: 0; reason: invalid, message: Error while reading data, error message: JSON parsing error in row starting at position 0: Repeated field must be imported as a JSON array. Field: data. cmd_type=elb consumer=True job_name=dev:tap-postgres-to-target-bq-canvas name=target-bq-canvas producer=False run_id=8c88d629-1686-4255-bf02-fbce0ac29643 stdio=stderr string_id=target-bq-canvas
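The last line of the traceback carries the actual complaint: BigQuery treats the column `data` as a repeated field, so every row has to carry a JSON array there. A minimal sketch for spotting offending rows before a load, assuming newline-delimited JSON input and a hand-written set of repeated field names. The helper `find_non_array_values` and the sample rows are hypothetical and not part of target-bigquery:

```python
import json
from typing import Iterable, List, Set, Tuple


def find_non_array_values(
    lines: Iterable[str], repeated_fields: Set[str]
) -> List[Tuple[int, str, str]]:
    """Return (row_index, field, type_name) for each non-list value in a repeated field."""
    problems = []
    for index, line in enumerate(lines):
        if not line.strip():
            continue  # skip blank lines
        row = json.loads(line)
        for field in repeated_fields:
            value = row.get(field)
            # Assumption: a missing or null value is fine; anything else must be a list.
            if value is not None and not isinstance(value, list):
                problems.append((index, field, type(value).__name__))
    return problems


# Hypothetical sample rows: the second one holds an object where an array is expected.
sample = [
    '{"id": 1, "data": [{"k": "v"}]}',
    '{"id": 2, "data": {"k": "v"}}',
]
problems = find_non_array_values(sample, {"data"})
print(problems)
```

This only flags the shape mismatch; it does not say whether the fix belongs in the tap's schema or in the target's table definition.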
visch
10/24/2024, 6:40 PM
Jonathan Cass
10/24/2024, 6:41 PM
Jonathan Cass
10/24/2024, 6:42 PM
visch
10/24/2024, 6:42 PM
visch
10/24/2024, 6:42 PM
Jonathan Cass
10/24/2024, 6:42 PM
visch
10/24/2024, 6:42 PM
visch
10/24/2024, 6:43 PM
Jonathan Cass
10/25/2024, 7:00 PM
type: ["object", "null"] for all the json fields in postgres. unclear why that isn't how they were already being exposed 🤷
Jonathan Cass
10/25/2024, 7:00 PM
visch
10/25/2024, 7:14 PM
is_nullable to that query, and then be sure to either clear the cache or run with --refresh-catalog
visch
10/25/2024, 7:18 PM
rm -rf .meltano and then run meltano invoke --dump=catalog tap-json > catalog.json and shoot over that file. Might be a bug in tap-postgres; we could dump that all into an issue
Jonathan Cass
10/25/2024, 7:20 PM
is_nullable? and did you mean tap-postgres instead of tap-json in the invoke command at the end?
visch
10/25/2024, 7:42 PM
select
    column_name,
    data_type,
    is_nullable
from information_schema.columns
visch
10/25/2024, 7:50 PM
is_nullable
2. I'd like a dump of the discovery call from tap-postgres, so meltano invoke tap-postgres --discover > catalog.json
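A discovery dump can get long, so a small script helps to pull out just the suspicious columns. A minimal sketch, assuming the usual Singer catalog layout (`streams` → `schema` → `properties`); the helper `wide_union_properties`, its `min_types` threshold, and the stand-in catalog are hypothetical:

```python
import json
from typing import Dict, List


def wide_union_properties(catalog: dict, min_types: int = 3) -> Dict[str, List[str]]:
    """Map each stream id to the properties whose "type" lists at least `min_types` entries."""
    result: Dict[str, List[str]] = {}
    for stream in catalog.get("streams", []):
        properties = stream.get("schema", {}).get("properties", {})
        wide = [
            name
            for name, spec in properties.items()
            if isinstance(spec.get("type"), list) and len(spec["type"]) >= min_types
        ]
        if wide:
            stream_id = stream.get("tap_stream_id", stream.get("stream", "?"))
            result[stream_id] = wide
    return result


# Tiny stand-in for a real discovery dump. With a real file, load it instead:
#   with open("catalog.json") as f:
#       catalog = json.load(f)
catalog = json.loads("""
{"streams": [{"tap_stream_id": "public-example",
  "schema": {"properties": {
    "id": {"type": ["integer"]},
    "payload": {"type": ["string", "number", "integer", "array", "object", "boolean", "null"]}
  }}}]}
""")
report = wide_union_properties(catalog)
print(report)
```

The threshold of three types is arbitrary; it is only meant to skip the common nullable pairs such as a single type plus "null".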
Jonathan Cass
10/25/2024, 7:53 PM
is_nullable as YES are the ones that were causing me trouble
visch
10/25/2024, 7:54 PM
data_type in there
Jonathan Cass
10/25/2024, 7:56 PM
where data_type='jsonb'
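The rows from that query can be turned directly into the kind of `type: ["object", "null"]` override mentioned earlier in the thread. A minimal sketch, with several assumptions: every jsonb value is a JSON object, `table_name` is selected alongside the three columns in the query, and non-nullable columns drop "null" from the type list. The helper `jsonb_overrides` and the table name `stories` are hypothetical; `narrative_json` is the column from the thread:

```python
from typing import Dict, Iterable, List, Tuple

# (table_name, column_name, data_type, is_nullable) as returned by information_schema.columns
ColumnRow = Tuple[str, str, str, str]


def jsonb_overrides(rows: Iterable[ColumnRow]) -> Dict[str, Dict[str, Dict[str, List[str]]]]:
    """Build {table: {column: {"type": [...]}}} for jsonb columns only."""
    overrides: Dict[str, Dict[str, Dict[str, List[str]]]] = {}
    for table, column, data_type, is_nullable in rows:
        if data_type != "jsonb":
            continue
        # Assumption: jsonb columns hold objects; add "null" only when the column allows it.
        types = ["object", "null"] if is_nullable == "YES" else ["object"]
        overrides.setdefault(table, {})[column] = {"type": types}
    return overrides


# Hypothetical query results.
rows = [
    ("stories", "id", "integer", "NO"),
    ("stories", "narrative_json", "jsonb", "YES"),
    ("stories", "settings", "jsonb", "NO"),
]
overrides = jsonb_overrides(rows)
print(overrides)
```

The result is keyed by table name; mapping that to the tap's stream names is left out because the naming scheme is not stated in the thread.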
Jonathan Cass
10/25/2024, 8:06 PM
catalog.json is 85000 lines long ..... but the json columns say
"type": [
    "string",
    "number",
    "integer",
    "array",
    "object",
    "boolean",
    "null"
]
(both the is_nullable ones and not)
visch
10/25/2024, 8:07 PM
visch
10/25/2024, 8:11 PM
visch
10/25/2024, 8:11 PM
Jonathan Cass
10/25/2024, 8:12 PM
object and then there could be an option for this more expansive behavior? seems unexpected to me ...
visch
10/25/2024, 8:27 PM
Jonathan Cass
10/25/2024, 8:41 PM
dates_as_string called json_as_object?
visch
10/26/2024, 11:50 AM
Jonathan Cass
10/26/2024, 3:19 PM
Jonathan Cass
10/28/2024, 3:54 PM
tap-postgres tests require a locally running postgres server? if so, what's the best way to get that running?
i tried using docker: docker run --name my_postgres -e POSTGRES_USER=postgres -e POSTGRES_PASSWORD=postgres -e POSTGRES_DB=postgres -p 5432:5432 -d postgres
but am getting the error sqlalchemy.exc.OperationalError: (psycopg2.OperationalError) connection to server at "localhost" (::1), port 5432 failed: FATAL: role "postgres" does not exist
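The error shows `localhost` resolving to `::1`, the IPv6 loopback. One thing worth ruling out (an assumption at this point, not a diagnosis) is that something other than the container is answering on port 5432 on one of the loopback addresses. A minimal sketch for checking which loopback address accepts TCP connections on that port; `port_accepts_connections` is a hypothetical helper, and it cannot tell which server is behind the port:

```python
import socket


def port_accepts_connections(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to (host, port) succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and address resolution failures.
        return False


# Check both loopback addresses, since "localhost" may resolve to either one.
for host in ("::1", "127.0.0.1"):
    status = "open" if port_accepts_connections(host, 5432) else "closed"
    print(f"{host} port 5432: {status}")
```

If the port is open on an address while the container is stopped, some other server is holding it.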
Edgar Ramírez (Arch.dev)
10/28/2024, 4:20 PM
brew start services postgres at some point? Then the ports might be conflicting. I think you can check using brew services.
Jonathan Cass
10/28/2024, 4:24 PM
Jonathan Cass
11/01/2024, 1:41 PM
visch
11/01/2024, 2:18 PM
Jonathan Cass
11/01/2024, 2:23 PM
visch
11/01/2024, 2:30 PM
Jonathan Cass
11/01/2024, 2:30 PM
visch
11/01/2024, 2:30 PM
visch
11/01/2024, 2:34 PM
Jonathan Cass
11/01/2024, 2:35 PM
visch
11/01/2024, 2:36 PM
Jonathan Cass
11/04/2024, 5:22 PM
meltano lock --all --update isn't updating it. deleting the lockfile and removing/readding it didn't help. i noticed that the new option isn't up on meltano hub yet - maybe that is the problem? if so, when does that get updated?
Edgar Ramírez (Arch.dev)
11/04/2024, 10:13 PM
pip_url for tap-postgres in your meltano.yml?
Jonathan Cass
11/04/2024, 10:21 PM
git+https://github.com/MeltanoLabs/tap-postgres.git
Edgar Ramírez (Arch.dev)
11/04/2024, 11:07 PM
main branch. You can try running meltano install --clean to refresh the plugin installation, or changing it to meltanolabs-tap-postgres (and also doing a meltano install)?
Jonathan Cass
11/05/2024, 12:09 AM
Jonathan Cass
11/05/2024, 12:15 AM
Jonathan Cass
11/05/2024, 4:10 PM
visch
11/05/2024, 4:18 PM
settings attribute but meltanohub doesn't have the new settings available automatically, got it!
Edgar Ramírez (Arch.dev)
11/05/2024, 4:18 PM
Edgar Ramírez (Arch.dev)
11/05/2024, 4:18 PM
Jonathan Cass
11/05/2024, 4:18 PM
I thought you said you weren't getting the "latest version"
yes, that's what i said, because that's what i thought haha
Jonathan Cass
11/05/2024, 4:18 PM
Edgar Ramírez (Arch.dev)
11/05/2024, 5:18 PM
Edgar Ramírez (Arch.dev)
11/05/2024, 5:41 PM