Hi all, new meltano user here. I am running into a...
# getting-started
j
Hi all, new meltano user here. I am running into a "BrokenPipeError", and im not sure why. I have a csv file in my desktop and I trying to store in a parquet file. Here is also my meltano.yml. I thought it could be file size but right now my file has 10k rows. Thank you!
Copy code
version: 1
default_environment: dev
project_id: 2fc4aa94-ed4d-49cd-9b6b-c1644bf4608e
environments:
- name: dev
- name: staging
- name: prod
plugins:
  extractors:
  - name: tap-spreadsheets-anywhere
    variant: ets
    pip_url: git+<https://github.com/ets/tap-spreadsheets-anywhere.git>
    config:
      tables:
      - path: 'file:///Users/juanherrera/Desktop/subway-monthly-data'
        name: 'subway_monthly_data'
        pattern: 'MTA_Subway_Hourly_Ridership_small.csv'
        start_date: '2025-03-12T15:30:00Z'
        prefer_schema_as_string: true
        key_properties: ['id']
        format: csv

  loaders:
  - name: target-parquet
    variant: automattic
    pip_url: git+<https://github.com/Automattic/target-parquet.git>
    config:
        destination_path: data/subway_data
        compression_method: snappy
        logging_level: info
        disable_collection: true
1
e
Hi @Juan Pablo Herrera! Can you try the following debugging steps?
Copy code
meltano invoke tap-spreadsheets-anywhere > singer.jsonl
cat singer.jsonl | meltano invoke target-parquet
🙌 1
j
Thank you Edgar!
np 1
e
@Juan Pablo Herrera did you end up finding the cause?
j
I ended up switching to tap-csv for the extractor
👍 1