Andy Carter
05/16/2024, 11:01 AM
joshua_janicas
06/04/2024, 8:37 PM
joshua_janicas
06/24/2024, 2:29 PM
Container App, running meltano commands in shell, and then seeing dagster load in azure.
• Meltano state backend? I don't want to lose state every time I make a new change to my docker container
What I am trying to figure out right now is the best way to host the backend state/SQLite db so that we don't lose state/data when we have to rebuild the container. Is there any advice or setup recommendations to take care of this?
Edgar Ramírez (Arch.dev)
06/24/2024, 2:32 PM
joshua_janicas
06/24/2024, 2:32 PM
joshua_janicas
06/24/2024, 2:34 PM
AZURE_STORAGE_CONNECTION_STRING
(or is it MELTANO_STATE_BACKEND_AZURE_STORAGE_ACCOUNT_URL?)
• MELTANO_STATE_BACKEND_URI
Edgar Ramírez (Arch.dev)
06/24/2024, 2:36 PM
• MELTANO_STATE_BACKEND_URI
• MELTANO_STATE_BACKEND_AZURE_STORAGE_ACCOUNT_URL
or MELTANO_STATE_BACKEND_AZURE_CONNECTION_STRING
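For reference, the variables named above could be assembled for the container with a small helper. This is only a minimal sketch, assuming the `azure://<container>/<prefix>` URI form from the Meltano state backend docs and connection-string auth. The helper name, the container name `meltano-state`, and the prefix `prod` are hypothetical.

```python
import os


def azure_state_backend_env(container: str, prefix: str, connection_string: str) -> dict[str, str]:
    """Return env vars that point Meltano's state backend at Azure Blob Storage."""
    if not container:
        raise ValueError("container name must not be empty")
    # Assumed URI form: azure://<container name>/<prefix for state blobs>
    uri = f"azure://{container}/{prefix.strip('/')}"
    return {
        "MELTANO_STATE_BACKEND_URI": uri,
        "MELTANO_STATE_BACKEND_AZURE_CONNECTION_STRING": connection_string,
    }


if __name__ == "__main__":
    # Hypothetical names; the connection string is read from the environment
    # so it never has to be written into the image.
    env = azure_state_backend_env(
        "meltano-state",
        "prod",
        os.environ.get("AZURE_STORAGE_CONNECTION_STRING", ""),
    )
    print(env["MELTANO_STATE_BACKEND_URI"])
```

With state stored in blob storage rather than in the container's filesystem, rebuilding the image should not reset it.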
Edgar Ramírez (Arch.dev)
06/24/2024, 2:36 PM
joshua_janicas
06/24/2024, 2:39 PM
Edgar Ramírez (Arch.dev)
06/24/2024, 2:39 PM
Andy Carter
06/24/2024, 2:53 PM
Andy Carter
06/24/2024, 2:54 PM
Andy Carter
06/24/2024, 2:55 PM
Andy Carter
06/24/2024, 3:08 PM
joshua_janicas
06/24/2024, 7:42 PM
Regarding Container Apps (ACA), I used that for a while, but eventually I found the 4GB memory limit too low to use dagster to orchestrate meltano and dbt. I was running meltano in the same container as dagster, though.
How much memory does your container need, may I ask?
I can give you some code examples for your meltano assets and linking them to dbt assets. There is quite a bit of boilerplate, though, and it needs some knowledge of dbt asset groups.
I'd be more than happy to compare notes!!!
Regarding the production db, yes, I use Postgres Flexible Server, which works well for our smaller data scale. For network access I restrict access to certain site IPs and also have to permit all Azure-related IPs. I think I pre-generate some logins and pass them as env variables to the container for meltano to use. You could probably put it in a VNet if you like, but that's a bit above my expertise.
Hi @joshua_janicas, I have the Azure state backend working well. I believe you can now use a Managed Identity if you (and your ops sec team) prefer that to a plaintext connection string. I haven't tried that, though.
Good to know, thank you!
Andy Carter
06/25/2024, 7:13 AM
Andy Carter
06/25/2024, 7:14 AM
Andy Carter
06/25/2024, 1:32 PM
joshua_janicas
06/25/2024, 1:42 PM
joshua_janicas
06/25/2024, 1:43 PM
joshua_janicas
06/25/2024, 1:45 PM
Andy Carter
06/25/2024, 1:53 PM
Andy Carter
06/25/2024, 1:53 PM
joshua_janicas
06/25/2024, 1:54 PM
joshua_janicas
06/25/2024, 1:54 PM
Andy Carter
06/25/2024, 2:03 PM
Andy Carter
06/25/2024, 2:04 PM
salesforce in meltano that does meltano run tap-salesforce target-bigquery
then you have a dbt group tagged with salesforce
this also does dbt run --select tag:salesforce
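The pairing described above (one meltano job per source, plus a dbt tag with the same name) could be sketched as a helper that builds both commands from the source name. This is a minimal sketch, assuming a `tap-<source>` naming convention and dbt's `--select tag:<name>` selector syntax. The function name and the default target are illustrative, not taken from anyone's actual setup.

```python
import shlex


def pipeline_commands(source: str, target: str = "target-bigquery") -> list[list[str]]:
    """Build the extract/load command and the matching dbt command for one source."""
    if not source or any(ch.isspace() for ch in source):
        raise ValueError("source must be a non-empty name without whitespace")
    return [
        # Extract and load with Meltano, e.g. tap-salesforce into the target.
        ["meltano", "run", f"tap-{source}", target],
        # Then run only the dbt models tagged with the same source name.
        ["dbt", "run", "--select", f"tag:{source}"],
    ]


if __name__ == "__main__":
    for cmd in pipeline_commands("salesforce"):
        print(shlex.join(cmd))
```

Keeping the meltano job name and the dbt tag identical means an orchestrator only needs the source name to trigger both halves.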
Andy Carter
06/25/2024, 2:05 PM
joshua_janicas
06/25/2024, 2:08 PM
tags in mind, haven't thought about that. Thank you!
Andy Carter
06/25/2024, 2:13 PM
staging, e.g. staging/freshdesk/*.sql, so I can dbt tag them by folder path.
https://docs.getdbt.com/reference/resource-configs/tags
Andy Carter
06/25/2024, 2:15 PM