Hi all, I few questios around "meltano run tap-ms...
# troubleshooting
c
Hi all, I few questios around "meltano run tap-mssql target-parquet" that I can't find the answer to and hoping for some help. 1. Is there a way to overwrite the previous parquet file? Currently it's creating a new dbo-t_Employee-<timestamp>-0-0.gz.parquet each run. 2. The metadata isn't being written to the parquet. What am I missing in my .yml to make that happen?
Copy code
version: 1
default_environment: dev
project_id: 84c22fe3-ab82-40a2-afac-8f2c9059e8f4
environments:
- name: dev
- name: staging
- name: prod
plugins:
  extractors:
  - name: tap-mssql
    variant: buzzcutnorman
    pip_url: git+<https://github.com/BuzzCutNorman/tap-mssql.git>
    config:
      dialect: mssql
      driver_type: pyodbc
      port: 1433
      add_record_metadata: true
      sqlalchemy_url_query:
        driver: ODBC Driver 18 for SQL Server
        TrustServerCertificate: yes
        authentication: ActiveDirectoryIntegrated
      start_date: 2020-01-01
      sqlalchemy_eng_params:
        fast_executemany: 'true'
    select:
    - dbo-t_employee.*
    metadata:
      dbo-t_employee.*:
        replication-method: INCREMENTAL
        replication-key: EmployeeKey
  loaders:
  - name: target-parquet
    variant: automattic
    pip_url: git+<https://github.com/Automattic/target-parquet.git>
    config:
      destination_path: ../data
Thanks in advance
b
I believe
add_record_metadata
is a target config option from what I am seeing. I would try removing
add_record_metadata: true
from the
tap-mssql
config and adding it to the
target-parquet
config and see if the missing meta data appears. 😀
👀 1