# getting-started
a
Hello hello! I have two questions which hopefully someone here can help me out with 🙂 First: is it possible to have Meltano automatically update a destination Postgres target schema if the inbound data includes new columns (no need to worry about removals)? Second: is there any way to configure the Meltano backend to use Postgres? I'm unable to find any documentation on the systemdb option and whether the URI it accepts can include a Postgres connection string...
e
> Is it possible to have Meltano automatically update a destination Postgres target schema if the inbound data includes new columns (no need to worry about removals)?
Depends on the target implementation, but for example MeltanoLabs/target-postgres does create the new columns when the incoming SCHEMA message includes them.
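To make that concrete, here's a minimal sketch (plain Python with simplified Singer messages; the stream and column names are made up for illustration) of how a later run's SCHEMA message can grow a property. A loader like MeltanoLabs/target-postgres compares the incoming schema's properties against the existing table and adds any missing columns:

```python
import json

# Simplified Singer SCHEMA message from an earlier run (hypothetical stream).
first_run = {
    "type": "SCHEMA",
    "stream": "test",
    "schema": {
        "type": "object",
        "properties": {
            "id": {"type": ["integer", "null"]},
            "name": {"type": ["string", "null"]},
        },
    },
    "key_properties": ["id"],
}

# A later run's SCHEMA message includes a new "school" property.
second_run = json.loads(json.dumps(first_run))  # cheap deep copy
second_run["schema"]["properties"]["school"] = {"type": ["string", "null"]}

# Diffing the property sets yields the columns the loader would ADD.
new_cols = sorted(
    set(second_run["schema"]["properties"]) - set(first_run["schema"]["properties"])
)
print(new_cols)  # ['school']
```

The key point from the answer above: the new column only materializes if the tap actually emits it in its SCHEMA message, which is where catalog caching comes into play later in this thread.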
> Is there any way to configure the Meltano backend to use Postgres?
Yes! See https://docs.meltano.com/concepts/project/#support-for-other-database-types. The URI should look like this:

```
postgresql+psycopg://user:password@host:port/dbname
```
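For reference, a sketch of what that looks like in practice (hypothetical credentials; this assumes the `MELTANO_DATABASE_URI` environment variable, which is the documented way to override the default SQLite system database):

```
# .env — hypothetical credentials, adjust to your environment.
# Points Meltano's system database at Postgres instead of SQLite.
MELTANO_DATABASE_URI=postgresql+psycopg://meltano:s3cret@db.example.com:5432/meltano_system
```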
a
Ah cool! Okay, so I'm running `tap-csv` and `target-postgres`. It notices that there are new columns, but they ultimately don't get created.
Is there a special flag I need to provide to trigger it?
e
> It notices that there are new columns
So where do you get this info from? The logs?
a
Yessir
e
And what does the log say?
a
```
2024-12-16 19:17:27,339 | WARNING  | target-postgres.test | No schema for record field 'school' cmd_type=elb consumer=True job_name=dev:tap-csv-to-target-postgres name=target-postgres producer=False run_id=c31951b7-8c8f-413f-bdfa-5cf6efdcc413 stdio=stderr string_id=target-postgres
```
e
Ah, can you try running with the `--refresh-catalog` flag?

```
meltano run --refresh-catalog ...
```
a
Trying... 🙂
Aah, yep. That did it!
Any consequences to that flag I should be mindful of?
e
Nothing of significance for most use cases. The long explanation is that Meltano caches the tap's catalog because discovery can return a lot of data and take a long time, especially for database sources. For most taps, including tap-csv, the discovery process is nearly instantaneous. We're still in the process of figuring out the best cache-invalidation heuristics; maybe enabling catalog caching for SQL, Salesforce, and similar dynamic sources and disabling it otherwise.
a
Ahhh neat, okay. That's easy enough to remember 🙂
Oh, one last thing... Is there any way to ask Meltano to make a "best effort" guess at types other than text for columns when loading to Postgres? I'm actually hoping to have it manage the destination schema for me like this.
So I definitely don't want to have to explicitly manage the schema. But if it has to be an "after the fact" thing, that's fine too. Just trying to fine tune a few last details.
(just noticing now, I can't change the type of destination columns in the DB after they're made without meltano complaining)
```
Altering columns is not supported. Could not convert column 'tap_csv.test.id' from 'INTEGER' to 'TEXT'.
```
e
> (just noticing now, I can't change the type of destination columns in the DB after they're made without meltano complaining)

Ah yeah, it's not safe to manually alter the column types on tables created by the loader. The best way to customize the types of certain columns would be using the `schema` extra:
```yaml
extractors:
- name: tap-csv
  schema:
    test:
      id:
        type: "integer"
```
We do have an issue to make column type alteration configurable: https://github.com/meltano/sdk/issues/1781. But the point above about not changing them manually would still hold: the loader would handle type alterations for you.
a
If I specify the schema manually after running once, and then run again, will the loader update it?
Added a comment to the ticket 🙂