# plugins-general
l
Non-urgent question about `target-postgres` (datamill-co variant): is the target schema expected to be empty? I tried using `public` on a db which has pre-existing tables, and it failed because the description of some tables was not valid JSON (here https://github.com/datamill-co/target-postgres/blob/9c095d91e215f932caa897a6587c6dc6278db8cf/target_postgres/postgres.py#L223 `json.loads` didn't like that it was fed text for humans)
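The failure mode is easy to reproduce in isolation; a minimal sketch of what `json.loads` does when handed a human-readable table description instead of the target's JSON metadata:

```python
import json

# target-postgres (datamill-co) stores its own table-path metadata as JSON
# in the Postgres table comment. A human-written comment, like the ones on
# the pre-existing tables, is not valid JSON, so json.loads raises right away:
try:
    json.loads("the list of our clients")
except json.JSONDecodeError as exc:
    print(exc)  # Expecting value: line 1 column 1 (char 0)
```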
d
Can you share the error or other failure symptoms you're seeing?
l
sure, so I ran `meltano elt tap-csv target-postgres --job_id=xxxx` and it failed with the following stacktrace:
```
target-postgres | ERROR Exception writing records
target-postgres | Traceback (most recent call last):
target-postgres |   File "/.../ays-elt/.meltano/loaders/target-postgres/venv/lib/python3.8/site-packages/target_postgres/postgres.py", line 236, in write_batch
target-postgres |     self.setup_table_mapping_cache(cur)
target-postgres |   File "/.../ays-elt/.meltano/loaders/target-postgres/venv/lib/python3.8/site-packages/target_postgres/postgres.py", line 223, in setup_table_mapping_cache
target-postgres |     table_path = json.loads(raw_json).get('path', None)
target-postgres |   File "/home/laurent/.pyenv/versions/3.8.7/lib/python3.8/json/__init__.py", line 357, in loads
target-postgres |     return _default_decoder.decode(s)
target-postgres |   File "/home/laurent/.pyenv/versions/3.8.7/lib/python3.8/json/decoder.py", line 337, in decode
target-postgres |     obj, end = self.raw_decode(s, idx=_w(s, 0).end())
target-postgres |   File "/home/laurent/.pyenv/versions/3.8.7/lib/python3.8/json/decoder.py", line 355, in raw_decode
target-postgres |     raise JSONDecodeError("Expecting value", s, err.value) from None
target-postgres | json.decoder.JSONDecodeError: Expecting value: line 1 column 1 (char 0)
```
d
OK. Can you find out what is in that `raw_json` variable if it's not valid JSON?
l
so I added some extra logging in the postgres target, and on the line I mentioned, `raw_json` is the description of an existing table, something like "the list of our clients"
d
The value comes from `obj_description(c.oid, 'pg_class')` in the query on line 214, so it's odd that Postgres would return anything for that function other than a JSON object description
l
yep, my thoughts exactly
d
`the list of our clients` ... Does that value look familiar to you? Like, where does it actually occur in a table row?
l
yes, it's the description of a table
it's not IN the table, it's like metadata about the table
let me fish up the SQL that created it
d
OK, it sounds like this target has certain expectations of what the table description is used for, then
l
`COMMENT ON TABLE public.employees IS 'the comment I mentioned above'`
d
Right. Looks like it's fussy about table comments that don't follow its own idea of what should be there
I think that's worth filing an issue in https://github.com/datamill-co/target-postgres/issues
And then using a different schema for the time being if that's an option
If you don't set a `postgres_schema` at all, the default is actually to use a name derived from the tap you used: https://meltano.com/plugins/loaders/postgres.html#postgres-schema
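For the issue report, one possible fix sketch (my naming, not from the target's codebase): a defensive variant of the parse on `postgres.py` line 223 that treats a non-JSON table comment as "no metadata" rather than crashing. `parse_table_path` here is a hypothetical helper:

```python
import json

def parse_table_path(raw_comment):
    # Hypothetical defensive variant of target-postgres's
    # `json.loads(raw_json).get('path', None)`: a human-written table
    # comment (e.g. "the list of our clients") is simply ignored.
    if raw_comment is None:
        return None
    try:
        parsed = json.loads(raw_comment)
    except json.JSONDecodeError:
        return None  # not target-postgres metadata, just a description
    if isinstance(parsed, dict):
        return parsed.get('path', None)
    return None

print(parse_table_path('{"path": ["public", "employees"]}'))  # ['public', 'employees']
print(parse_table_path('the list of our clients'))            # None
```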
l
ok, I was thinking of it, just wanted to sense-check here first, seeing that you're pretty responsive 🙂
d
🙂
l
I tried with a different schema, but then it failed because it expected the schema to be there in the first place
d
Ah, interesting, I thought it auto-created schemas
l
let me try without specifying any schema, to see what happens
meta-question: is datamill the preferred variant for the postgres target?
d
Yep
l
alright, cool
confirmed, it does not auto-create the schema:
```
meltano         | Loading failed (1): target_postgres.exceptions.PostgresError: ('Exception writing records', InvalidSchemaName('schema "tap_csv" does not exist\nLINE 1: CREATE TABLE "tap_csv"."mytable" ();\n
```
d
OK, good to know!
l
as a workaround, there's a `before_run_sql` which could be used to create the schema, but it feels a little hackish. Maybe an option like `autocreate_schema` (true/false) would make sense?
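For reference, a sketch of that workaround in `meltano.yml`, assuming the schema name is the tap-derived `tap_csv`; the exact placement of `before_run_sql` depends on your config layout:

```yaml
loaders:
  - name: target-postgres
    config:
      postgres_schema: tap_csv
      # CREATE SCHEMA IF NOT EXISTS is idempotent, so safe on every run
      before_run_sql: CREATE SCHEMA IF NOT EXISTS tap_csv;
```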
d
`before_run_sql` sounds like a good option for now, but I think a dedicated option definitely wouldn't hurt. We'd probably set it to `true` by default in Meltano. Wanna create an issue on https://github.com/datamill-co/target-postgres?
l
yep, working on the issues now. I'll file 2, one for the table comments, and one for the autocreation, ok with you?
d
Yep that sounds good, but it's not my repo so what I think doesn't matter too much 😄
l
sure, but you're kind of a power-user I guess 🙂
d
Kind of
l
thanks for the quick response @douwe_maan much appreciated!
d
Happy to help as always, thanks for helping debug and improve the ecosystem!
l
haha, yeah 4 bugs in 2 days, good start 🙂
d
And some of those bugs already fixed, don't forget that part 😉
l
I know, that's amazing!