I'm playing around with: `tap-spreadsheets-anywher...
# troubleshooting
c
I'm playing around with:
tap-spreadsheets-anywhere
locally with an externally generated
.xlsx
file. • Using
skip_initial
to get to skip some human readable text and start on the table header. • Using
field_names
to list each of the expected header titles. • File contains ~100 lines of table data. • There is a
Totals
row at the bottom of the table data. • A few blank lines, then a human readable notes footer and then blank lines until the end of the sheet at row 1000 •
meltano invoke tap-spreadsheets-anywhere
has the line:
... INFO Wrote 995 records for stream "<table_name>" ...
. Expected to only see my ~100 rows of data in the stream. Doing a:
meltano run tap-spreadsheets-anywhere target-postgres
results in: 995 rows written to the table instead ~100. Any ideas?
Forgot, from an early code dive...
field_names
has only been implemented in the CSV Handler. It's missing from the Excel Handler. Search results: https://github.com/search?q=repo%3Aets%2Ftap-spreadsheets-anywhere+field_names&amp;type=code. Will raise an issue and see if I have time to hack around this myself.