jessica_fosler
06/29/2023, 12:32 AM
meltano add loader tap-parquet
I get this error - how do I configure the tap-parquet virtual environment to fulfill this requirement?
ERROR: No matching distribution found for singer-sdk<0.2.0,>=0.1.0
visch
06/29/2023, 10:42 AM
visch
06/29/2023, 10:43 AM
user
06/29/2023, 2:39 PM
jessica_fosler
06/29/2023, 10:35 PM
^ - because CSVs can have anything. But then the next tap I picked up didn't support a custom delimiter correctly and wrote garbage tables out.
I'm like OK, it's got JSON data in there, maybe I should write to a more structured format. Tried DuckDB. It crashed writing out the file.
I'm like OK, the file is fundamentally JSON and I'm converting it, so let's stop doing that. So I wrote out to JSONL. Except I couldn't install tap-singer-jsonl because it wants a whole other version of Python.
I'm like OK, DuckDB is all new and experimental, maybe I should go for an older standard, like Parquet. So I write the file out with target-parquet.
And then I try to install tap-parquet, but I can't.
So then I'm like fine, Snowflake has instructions for manually loading a Parquet file from a stage. So I write a bit of SQL to PUT the Parquet file to an internal stage, and when I go to COPY INTO, it tells me target-parquet wrote out a file without magic bytes.
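For context on that last error: an unencrypted Parquet file begins and ends with the 4-byte marker PAR1, so a file missing either marker was often truncated or never finalized by whatever wrote it. A minimal Python sketch for checking a file locally before uploading it to a stage. The function name has_parquet_magic is made up for illustration and is not part of any tap or target.

```python
from pathlib import Path

PARQUET_MAGIC = b"PAR1"  # 4-byte marker at both the start and the end of a Parquet file


def has_parquet_magic(path):
    """Return True if the file starts and ends with the Parquet magic bytes."""
    data_path = Path(path)
    # A valid file holds at least the header magic, a 4-byte footer length,
    # and the trailing magic, so anything under 12 bytes cannot be Parquet.
    if data_path.stat().st_size < 12:
        return False
    with data_path.open("rb") as fh:
        head = fh.read(4)
        fh.seek(-4, 2)  # 2 means "relative to end of file"
        tail = fh.read(4)
    return head == PARQUET_MAGIC and tail == PARQUET_MAGIC
```

If this returns False for the file target-parquet produced, the problem is in the file itself rather than in the stage or the COPY INTO statement.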
So, what do you think my next option is? Local Postgres instance? Save the tap-csv and manually set the delimiter in Snowflake to be ^?
Am I trying too hard?
jessica_fosler
07/01/2023, 9:42 AM
user
07/05/2023, 6:51 PM
jessica_fosler
07/08/2023, 3:25 AM
jessica_fosler
07/08/2023, 3:26 AM
jessica_fosler
07/08/2023, 3:27 AM
jessica_fosler
07/08/2023, 3:28 AM
jessica_fosler
07/08/2023, 3:28 AM
user
07/10/2023, 6:53 PM
might be worth double listing the tap in hub.meltano.com and/or updating the readme so it shows up in search
@jessica_fosler can you say more about what you mean by this? Most users of Meltano and target-snowflake should never have to even consider how all of this works to get it functioning. Unfortunately, I think your use case with Mailchimp made it more complicated due to bugs. The MeltanoLabs variant of target-snowflake writes JSONL data to an internal stage and loads it into tables for you, so you shouldn't ever have to create stages, file formats, etc. manually.
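To make the JSONL part concrete: JSON Lines is just one JSON object per line, which is why nested fields and odd delimiter characters stop being a problem. A rough Python sketch of the format only, not the target's actual code. The function names and sample records are invented for illustration.

```python
import json


def to_jsonl(records):
    """Serialize an iterable of dicts as JSON Lines: one JSON object per line."""
    return "".join(json.dumps(record) + "\n" for record in records)


def from_jsonl(text):
    """Parse JSON Lines text back into a list of dicts, skipping blank lines."""
    return [json.loads(line) for line in text.split("\n") if line.strip()]


# Hypothetical records: nested JSON and delimiter-like characters survive unchanged.
records = [
    {"id": 1, "email": "a@example.com", "merge_fields": {"FNAME": "Ann"}},
    {"id": 2, "email": "b@example.com", "note": "contains ^ and , characters"},
]
payload = to_jsonl(records)
```

Reading payload back with from_jsonl gives the original records, with no delimiter or quoting settings to get wrong along the way.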
jessica_fosler
07/10/2023, 10:56 PM
user
07/11/2023, 1:45 AM
meltano invoke tap-mailchimp > output.json
then later
cat output.json | meltano invoke target-snowflake
It's not necessarily a long-term solution since you can't use invoke for schedules, but it could be good for a one-time backfill.
jessica_fosler
07/11/2023, 3:20 AM
jessica_fosler
07/11/2023, 3:21 AM