Hello Everyone I want to read CSV file from an Azure Blob St Meltano #troubleshooting


Bappi Patra

11/27/2024, 12:15 PM

Hello everyone, I want to read a CSV file from Azure Blob Storage and load it into Snowflake, but I found there is no Azure Blob extractor available. We have also tried tap-spreadsheets-anywhere, but we are getting a connection string error. Please suggest how I can do this integration, especially if someone has tried a similar integration from Azure to Snowflake or from Azure to any third party.

visch

11/27/2024, 2:36 PM

"tried tap-spreadsheets-anywhere but we are getting connection string error."

Could you share the error message and/or open an issue in the repo?

Andy Carter

11/27/2024, 3:46 PM

I am using CSV from Azure Blob Storage via tap-spreadsheets-anywhere in production, so this is likely a configuration issue. @Bappi Patra can you share your meltano.yml and the error stack trace? Worth checking the auth options for the underlying smart_open library for Azure: https://github.com/piskvorky/smart_open?tab=readme-ov-file#azure-credentials

You will need to set the AZURE_STORAGE_CONNECTION_STRING env variable, with a value like:

    DefaultEndpointsProtocol=https;AccountName=mystorageacccountname;AccountKey=12345.....
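A quick way to sanity-check the variable before running the tap is sketched below. It uses only the standard library and assumes key-based auth in the `Key=Value;Key=Value` shape shown above. The helper names `parse_connection_string` and `missing_keys` are hypothetical and are not part of Meltano, smart_open, or the Azure SDK.

```python
import os

# Assumption: key-based auth, so both of these fields must be present.
REQUIRED_KEYS = ("AccountName", "AccountKey")


def parse_connection_string(conn_str: str) -> dict:
    """Split 'Key=Value;Key=Value' into a dict.

    Each segment is split on the first '=' only, because account keys
    usually end in '=' padding characters.
    """
    parts = {}
    for segment in conn_str.strip().split(";"):
        if not segment:
            continue  # tolerate a trailing ';'
        key, sep, value = segment.partition("=")
        if not sep:
            raise ValueError(f"Malformed segment (no '='): {segment!r}")
        parts[key.strip()] = value.strip()
    return parts


def missing_keys(conn_str: str) -> list:
    """Return the required fields that are absent or empty."""
    parts = parse_connection_string(conn_str)
    return [key for key in REQUIRED_KEYS if not parts.get(key)]


if __name__ == "__main__":
    conn_str = os.environ.get("AZURE_STORAGE_CONNECTION_STRING", "")
    if not conn_str:
        print("AZURE_STORAGE_CONNECTION_STRING is not set")
    else:
        print("Missing keys:", missing_keys(conn_str) or "none")
```

Running this in the same shell that launches Meltano shows whether the variable is visible to the process and whether it has the expected fields, without printing the key itself.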

👍 1

Bappi Patra

11/29/2024, 6:00 AM

Thanks @visch @Andy Carter. Can we use an Azure service principal to build the connection string instead of a SAS token or access key? If yes, could you please share the steps or an example for building the connection?

Andy Carter

11/29/2024, 9:32 AM

Within the Azure portal, you need to enable 'allow storage account key access' for the storage account, and then from the Access Keys section copy your connection string. tap-spreadsheets-anywhere does not support DefaultAzureCredential afaik.

Bappi Patra

11/29/2024, 11:52 AM

I have tried with a SAS token but I am getting the error below:

    Plugin configuration is invalid
    Catalog discovery failed: command ['/mnt/c/Users/E092561/Documents/Project/Meltano/my-meltano-project/.meltano/extractors/tap-spreadsheets-anywhere/venv/bin/tap-spreadsheets-anywhere', '--config', '/mnt/c/Users/E092561/Documents/Project/Meltano/my-meltano-project/.meltano/run/tap-spreadsheets-anywhere/tap.5b253916-038f-4897-9449-5c4b918557ee.config.json', '--discover'] returned 1 with stderr:
    ERROR Unable to write Catalog entry for 'target_table_name' - it will be skipped due to error Incorrect padding
    CRITICAL Incorrect padding
    File "/mnt/c/Users/E092561/Documents/Project/Meltano/my-meltano-project/.meltano/extractors/tap-spreadsheets-anywhere/venv/lib/python3.10/site-packages/azure/storage/blob/_shared/authentication.py", line 192, in _add_authorization_header
    raise _wrap_exception(ex, AzureSigningError) from ex
    azure.storage.blob._shared.authentication.AzureSigningError: Incorrect padding

Here is our meltano.yml content:

    version: 1
    default_environment: dev
    project_id: c6750c09-214e-4641-bf13-8e02e7d89f30
    environments:
    - name: dev
    - name: staging
    - name: prod
    plugins:
      extractors:
      - name: tap-spreadsheets-anywhere
        variant: ets
        pip_url: git+https://github.com/ets/tap-spreadsheets-anywhere.git
        config:
          tables:
          *- path: <azure://X>*YZ
            name: target_table_name
            pattern: '.csv'*
            start_date: '2017-01-17T110253Z'
            key_properties: []
            format: csv

Please suggest if I missed something. Is there any other connector available that can take data from Azure storage to Snowflake, so that we can try that as well? Thanks

Andy Carter

11/29/2024, 1:08 PM

A SAS token is not the same thing as a storage account connection string; only the connection string will work here. See https://learn.microsoft.com/en-us/azure/storage/common/storage-account-keys-manage?tabs=azure-portal#view-account-access-keys
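The "Incorrect padding" text in the traceback above is the message Python's base64 decoder raises, which suggests (as an assumption, not something confirmed in this thread) that the value supplied as the account key could not be base64-decoded during request signing. The sketch below shows one hedged way to tell the two credential kinds apart before configuring the tap. The function names are hypothetical, and the `sig=` check is a heuristic rather than an official rule.

```python
import base64
import binascii


def is_base64_key(value: str) -> bool:
    """True if the value decodes cleanly as strict base64, as an account key should."""
    if not value:
        return False
    try:
        base64.b64decode(value, validate=True)
    except (binascii.Error, ValueError):
        # Wrong padding or characters outside the base64 alphabet.
        return False
    return True


def classify_credential(value: str) -> str:
    """Heuristically label a credential as 'sas_token', 'account_key', or 'unknown'."""
    # Heuristic: SAS tokens are URL query strings that carry a 'sig=' parameter.
    if "sig=" in value:
        return "sas_token"
    if is_base64_key(value):
        return "account_key"
    return "unknown"


if __name__ == "__main__":
    # Placeholder values for illustration only; not real credentials.
    fake_key = base64.b64encode(b"x" * 64).decode()
    print(classify_credential(fake_key))
    print(classify_credential("?sv=2022-11-02&ss=b&sig=abc%2Fdef%3D"))
```

If the value placed after AccountKey= is classified as anything other than an account key, copying the full connection string from the Access Keys section is the fix described above.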

➕ 2
