Juan Pablo Herrera
03/12/2025, 10:41 PMversion: 1
default_environment: dev
project_id: 2fc4aa94-ed4d-49cd-9b6b-c1644bf4608e
environments:
- name: dev
- name: staging
- name: prod
plugins:
extractors:
- name: tap-spreadsheets-anywhere
variant: ets
pip_url: git+<https://github.com/ets/tap-spreadsheets-anywhere.git>
config:
tables:
- path: 'file:///Users/juanherrera/Desktop/subway-monthly-data'
name: 'subway_monthly_data'
pattern: 'MTA_Subway_Hourly_Ridership_small.csv'
start_date: '2025-03-12T15:30:00Z'
prefer_schema_as_string: true
key_properties: ['id']
format: csv
loaders:
- name: target-parquet
variant: automattic
pip_url: git+<https://github.com/Automattic/target-parquet.git>
config:
destination_path: data/subway_data
compression_method: snappy
logging_level: info
disable_collection: true
Edgar Ramírez (Arch.dev)
03/12/2025, 11:39 PMmeltano invoke tap-spreadsheets-anywhere > singer.jsonl
cat singer.jsonl | meltano invoke target-parquet
Juan Pablo Herrera
03/17/2025, 6:02 AMEdgar Ramírez (Arch.dev)
03/18/2025, 10:44 PMJuan Pablo Herrera
03/22/2025, 2:04 AM