Hello Everyone, I want to read CSV file from an A...
# troubleshooting
b
Hello Everyone, I want to read CSV file from an Azure Blob Storage and load it in Snowflake, and i found there is no azure blob exactor available. even we have tried Tab-spreadsheet anywhere but we are getting connection string error. Please suggest how can i do the integration. If someone tried similar kind of integration from azure to snowflake or azure to any 3rd party.
v
tried Tab-spreadsheet anywhere but we are getting connection string error.
Could you share this and/or make an issue in the repo?
a
I am using csv from azure blob storage via tap-spreadsheets-anywhere in production so likely this is a configuration issue. @Bappi Patra can you share your meltano.yml and the error stack trace Worth checking the auth options for the underlying
smart_open
library for Azure https://github.com/piskvorky/smart_open?tab=readme-ov-file#azure-credentials You will need to set
AZURE_STORAGE_CONNECTION_STRING
env, with a value like
DefaultEndpointsProtocol=https;AccountName=mystorageacccountname;AccountKey=12345.....
👍 1
b
Thanks @visch @Andy Carter, can we use azure service principle to create connection string instead of SAS or Access key, if yes could you please share the steps/example to build the connection.
a
Within the Azure portal, you need to enable 'allow storage account key access' for the storage account, and then from the Access Keys section copy your connection string. tap-spreadsheets-anywhere does not support
DefaultAzureCredential
afaik.
b
i have tried with SAS token but i am getting error (see the below) "Plugin configuration is invalid Catalog discovery failed: command ['/mnt/c/Users/E092561/Documents/Project/Meltano/my-meltano-project/.meltano/extractors/tap-spreadsheets-anywhere/venv/bin/tap-spreadsheets-anywhere', '--config', '/mnt/c/Users/E092561/Documents/Project/Meltano/my-meltano-project/.meltano/run/tap-spreadsheets-anywhere/tap.5b253916-038f-4897-9449-5c4b918557ee.config.json', '--discover'] returned 1 with stderr: ERROR Unable to write Catalog entry for 'target_table_name' - it will be skipped due to error Incorrect padding CRITICAL Incorrect padding File "/mnt/c/Users/E092561/Documents/Project/Meltano/my-meltano-project/.meltano/extractors/tap-spreadsheets-anywhere/venv/lib/python3.10/site-packages/azure/storage/blob/_shared/authentication.py", line 192, in _add_authorization_header raise _wrap_exception(ex, AzureSigningError) from ex azure.storage.blob._shared.authentication.AzureSigningError: Incorrect padding" here is our meltano.yaml content "version: 1 default_environment: dev project_id: c6750c09-214e-4641-bf13-8e02e7d89f30 environments: - name: dev - name: staging - name: prod plugins: extractors: - name: tap-spreadsheets-anywhere variant: ets pip_url: git+https://github.com/ets/tap-spreadsheets-anywhere.git config: tables: *- path: <azure://X>*YZ name: target_table_name pattern: '.csv'* start_date: '2017-01-17T110253Z' key_properties: [] format: csv" please suggest if i missed something. is there any other connector available which can take the data from azure storage to snowflake, so that we can also try. Thanks
a
The SAS token is not the same thing as storage account connection string, only the connection string will work here. https://learn.microsoft.com/en-us/azure/storage/common/storage-account-keys-manage?tabs=azure-portal#view-account-access-keys
2