laurent
02/17/2021, 10:41 PMtap-spreadsheets-anywhere
to read an excel file on my local filesystem and load it in postgres. I've been following the getting started page twice, once with tap-csv
which worked fine, but now with the new tap, I'm not seeing anything being loaded. When I run meltano --log-level=debug elt...
I see:
tap-spreadsheets-anywhere | INFO Found 6 files.
tap-spreadsheets-anywhere | INFO Wrote 0 records for stream "my_stream_name".
There are indeed 6 files in that folder, so that seems correct, but I'm seeing no errors or warnings, so I'm not sure where to look next. I went through the plugin docs, but nothing obvious there. I've tried all sorts of regex patterns including ".*"
which I'd assume would catch any file in there, but no luck. Any suggestions where to look?douwe_maan
02/18/2021, 4:59 PMSyncing file "<name>"
log messages originating from https://github.com/ets/tap-spreadsheets-anywhere/blob/208190734440b57b4445f94e5a0d43a43d20df99/tap_spreadsheets_anywhere/file_utils.py#L21?douwe_maan
02/18/2021, 5:00 PMmeltano.yml
for good measure?laurent
02/18/2021, 5:23 PMSyncing file...
in the logslaurent
02/18/2021, 5:25 PMtap-csv
and now I'm getting a confusing error ELT could not be completed: Cannot start extractor: Catalog discovery failed: invalid catalog: Expecting value: line 1 column 1 (char 0)
Let me see if I can go back to the previous config and see the initial problemdouwe_maan
02/18/2021, 5:25 PMlaurent
02/18/2021, 5:26 PMdouwe_maan
02/18/2021, 5:26 PMlaurent
02/18/2021, 6:09 PMlaurent
02/18/2021, 6:13 PMtap-spreadsheets-anywhere
we just found a couple of issues in it:
• there's a bug in how paths are handled for local files, which results in files not being discovered properly
• is looks like xlrd cannot load xlsx files at all
I'm going to file issues on github about these, and try to suggest fixes.eric_simmerman
02/19/2021, 3:00 AMdouwe_maan
02/19/2021, 4:16 PMThis library will no longer read anything other thanThat's for version 2.0.1. The description for 1.2.0 (https://pypi.org/project/xlrd/1.2.0/) stated:files. For alternatives that read newer file formats, please see http://www.python-excel.org/..xls
Extract data from Excel spreadsheets (.xls and .xlsx, versions 2.0 onwards) on any platform.This suggests that the tap should either pin version 1.2.0, or use another library for xlsx as suggested in http://www.python-excel.org/
eric_simmerman
02/19/2021, 5:02 PMdouwe_maan
02/19/2021, 5:03 PMlaurent
02/19/2021, 5:04 PMeric_simmerman
02/19/2021, 5:47 PMdouwe_maan
02/19/2021, 5:48 PMlaurent
02/19/2021, 6:03 PM