craig_astill
05/23/2023, 12:06 PM
Running tap-spreadsheets-anywhere locally with an externally generated .xlsx file.
• Using skip_initial to skip some human-readable text and start on the table header.
• Using field_names to list each of the expected header titles.
• File contains ~100 lines of table data.
• There is a Totals row at the bottom of the table data.
• A few blank lines, then a human-readable notes footer, and then blank lines until the end of the sheet at row 1000.
• meltano invoke tap-spreadsheets-anywhere has the line: ... INFO Wrote 995 records for stream "<table_name>" ...
Expected to see only my ~100 rows of data in the stream.
Doing a: meltano run tap-spreadsheets-anywhere target-postgres results in: 995 rows written to the table instead of ~100.
Any ideas?
craig_astill
05/23/2023, 12:13 PM
field_names has only been implemented in the CSV Handler. It's missing from the Excel Handler.
Search results: https://github.com/search?q=repo%3Aets%2Ftap-spreadsheets-anywhere+field_names&type=code.
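For reference, a rough sketch of the kind of row trimming I'd want here. This is not the tap's code: is_blank, table_rows and the totals_label parameter are hypothetical names, and it assumes the sheet has already been read into plain row sequences. It skips the leading text, takes the next row as the header, and stops at the first blank row or Totals row.

```python
from typing import Any, Iterable, Iterator, Sequence


def is_blank(row: Sequence[Any]) -> bool:
    """True when every cell is None or whitespace-only text."""
    return all(
        cell is None or (isinstance(cell, str) and not cell.strip())
        for cell in row
    )


def table_rows(
    rows: Iterable[Sequence[Any]],
    skip_initial: int = 0,
    totals_label: str = "Totals",  # hypothetical: label in the first column
) -> Iterator[dict]:
    """Yield one dict per data row, ending at a blank or Totals row."""
    it = iter(rows)
    for _ in range(skip_initial):  # drop the human-readable preamble
        next(it, None)
    header = next(it, None)
    if header is None:
        return
    names = [str(h).strip() for h in header]
    for row in it:
        if is_blank(row):
            break  # blank padding or footer follows, so stop here
        first = row[0] if row else None
        if isinstance(first, str) and first.strip() == totals_label:
            break  # summary row, not table data
        yield dict(zip(names, row))


# Made-up sheet shaped like the one described above.
sheet = [
    ["Report generated for humans", None, None],
    [None, None, None],
    ["id", "name", "amount"],
    [1, "a", 10],
    [2, "b", 20],
    ["Totals", None, 30],
    [None, None, None],
    ["Notes: ...", None, None],
    [None, None, None],
]
records = list(table_rows(sheet, skip_initial=2))
```

The rows could come from something like openpyxl's worksheet.iter_rows(values_only=True), but that part is an assumption about how the file gets read, not something checked against the tap.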
Will raise an issue and see if I have time to hack around this myself.
craig_astill
05/23/2023, 12:41 PM