# Is there any way to specify a target config env variable differently on a tap-by-tap basis? (#troubleshooting)
a
Is there any way to specify a target config env variable differently on a tap-by-tap basis? I need to do this in `meltano.yml` if possible. I would like to alter the `target-postgres` setting `BATCH_SIZE_ROWS` depending on the tap run. I tried to do this:

```yaml
- name: tap-dynamics
  inherit_from: tap-spreadsheets-anywhere
  config:
    env:
      TARGET_POSTGRES_BATCH_SIZE_ROWS: 1000
```
But the setting in the tap wasn't getting picked up by the target. If I set `TARGET_POSTGRES_BATCH_SIZE_ROWS` in `.env` it works fine, but that doesn't give me any flexibility to change it on a per-tap basis. Suggestions welcome!

Edit: I'm not using a schedule or job, otherwise the `env:` section there would solve my issue exactly! 😞

Edit 2: I guess one solution is to create a second inherited `target-postgres` and alter its config, then run `meltano run my-tap target-postgres-small-batch` (see the sketch below), but it feels a bit hacky.
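A minimal sketch of that inherited-loader workaround from Edit 2, assuming the transferwise variant's `batch_size_rows` setting; the `-small-batch` name is illustrative:

```yaml
loaders:
- name: target-postgres-small-batch  # hypothetical inherited loader
  inherit_from: target-postgres
  config:
    batch_size_rows: 1000            # transferwise variant's batch size setting
```

Then `meltano run my-tap target-postgres-small-batch` would use the smaller batch size, at the cost of a duplicate plugin definition (and, as noted later in the thread, a fresh state ID).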
q
There is an `env:` property on `job` definitions?
a
Oh sorry, there is on schedules, but job-level `env:` is still a proposal.
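For anyone who does run via a schedule, a minimal sketch of that schedule-level `env:`, with an illustrative schedule name and interval:

```yaml
schedules:
- name: dynamics-to-postgres   # hypothetical schedule name
  extractor: tap-dynamics
  loader: target-postgres
  transform: skip
  interval: '@daily'
  env:
    TARGET_POSTGRES_BATCH_SIZE_ROWS: 1000  # applies only to this schedule's runs
```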
q
That's what I've seen; I'm facing the same issue as you, then. I thought inheritance would help, but I'm stuck duplicating the same target plugin 10 times just to change the target schema in the configuration, so I'm not sure option 2 would work either.
a
I think `job`-level env would be the ideal solution; from the env resolution order, there is no overlap between tap and target as I understand it. And I get it, `job` is the right place because that is the confluence of tap and target, and my env var is specific to the combination.
How are you changing the target schema? Could you use `${MELTANO_EXTRACTOR_NAMESPACE}` instead?
```yaml
- name: target-postgres
  variant: transferwise
  pip_url: pipelinewise-target-postgres
  config:
    default_target_schema: raw__${MELTANO_EXTRACTOR_NAMESPACE}
```
So this gives me `raw__salesforce`, `raw__zendesk` as my target schemas.
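If the parent plugin's default namespace doesn't give the schema names you want, my understanding is that an inherited extractor can override `namespace` per plugin, which is what `${MELTANO_EXTRACTOR_NAMESPACE}` resolves to; a sketch with illustrative names:

```yaml
extractors:
- name: tap-spreadsheets-anywhere--salesforce  # hypothetical inherited tap
  inherit_from: tap-spreadsheets-anywhere
  namespace: salesforce                        # feeds raw__${MELTANO_EXTRACTOR_NAMESPACE} above
```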
q
I don't know how I fucked up the configuration of the inherited plugins, but it actually works. Just like you, I have several taps but a single target that I'd like to configure per-tap. In order to have something readable, I've split the `meltano.yml` configuration into as many files as there are plugins in my project (see the sketch below). Having the `env:` key available for `jobs` would save me from having a file per bigquery-inherited target, but it's not that bad in the end.
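A minimal sketch of that multi-file split, assuming Meltano's `include_paths` feature and a hypothetical directory layout:

```yaml
# meltano.yml
include_paths:
- ./plugins/extractors/*.meltano.yml  # one file per tap (illustrative layout)
- ./plugins/loaders/*.meltano.yml     # one file per inherited target
```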
e
Hey guys 👋 @Andy Carter @quentin_gaborit, would either of you be willing to create an issue in the Meltano repo to request support for a job-level `env` mapping?
q
@edgar_ramirez_mondragon Looks like @Andy Carter found the one that's been open for quite some time now -> https://meltano.slack.com/archives/C01TCRBBJD7/p1690896190096249?thread_ts=1690894749.896989&cid=C01TCRBBJD7
e
Oh OK, thanks @quentin_gaborit! I see it was marked as accepting PRs and was deprioritized internally.
a
OK, I just tried implementing the 'define a new inherited target' approach to set my target config, but only when I went to run the job did I realise the state would be blank, because the pipeline is essentially 'new' to Meltano, so a full sync ensued. Just to try to customize the batch size on a per-tap basis 🤯
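One possible escape hatch, assuming Meltano's `meltano state` subcommands and the default `<environment>:<tap>-to-<target>` state ID format (the IDs below are illustrative), is to copy the old pipeline's state over to the new one before the first run:

```bash
# list existing state IDs to find the exact names
meltano state list

# copy state from the original pipeline to the inherited-target pipeline
meltano state copy dev:tap-dynamics-to-target-postgres \
    dev:tap-dynamics-to-target-postgres-small-batch
```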
s
How I do this: I pick up the environment variables I need via chamber from a secret store in AWS. It picks up all the environment variables for the tap and target and then calls Meltano. The trick is to store the target variables that change alongside the tap variables. For us, loading into Snowflake, we want different batch sizes and target schemas, so we keep them with the tap parameters. You could also pass in different variables from a scheduler when calling Meltano. In summary: manage environment variables outside the `meltano.yml` file, and then you have control.
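A minimal sketch of that wrapper pattern, assuming chamber backed by AWS SSM Parameter Store and a hypothetical service name per tap:

```bash
# chamber exports every key stored under the named service as an
# environment variable, then exec's the wrapped command with that environment
chamber exec my-project/tap-dynamics -- \
    meltano run tap-dynamics target-postgres
```

Because the variables are injected per invocation, the same `target-postgres` definition can receive a different `TARGET_POSTGRES_BATCH_SIZE_ROWS` for each tap.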