blake_blackshear
07/13/2023, 5:52 PMtap-csv
, tap-s3
, tap-s3-csv
) and there is quite a bit of duplication related to CSV parsing. i am considering making some improvements to tap-sftp
, but trying to avoid reinventing that part of the wheel. most taps seem to orient around parsing a file format, but some seem to be more oriented around the transfer mechanism for data files that could be in many different formats. s3, sftp, http are transfer protocols that could all have the same set of file formats that need to be parsed. how is the community thinking about this?visch
07/13/2023, 5:53 PMvisch
07/13/2023, 5:54 PMsftp
support explicitly added but the library supports it and it shouldn't be a huge liftblake_blackshear
07/13/2023, 5:59 PMblake_blackshear
07/13/2023, 6:01 PMparamiko
?blake_blackshear
07/13/2023, 6:03 PMgpg
decryption support? and/or chaining gpg -> decompress
?visch
07/13/2023, 6:04 PMgpg
yes
library - https://filesystem-spec.readthedocs.io/en/latest/api.html , see https://github.com/MeltanoLabs/tap-universal-file
Chaining not sure what you mean exactlyblake_blackshear
07/13/2023, 6:04 PMvisch
07/13/2023, 6:05 PMgpg
is a no right now, but definitely could get added
zip
is a yes right now.
No reason we couldn't add thatblake_blackshear
07/13/2023, 6:08 PMfsspec
to create a file system mount that is then accessed via standard python file open calls?visch
07/13/2023, 6:09 PMvisch
07/13/2023, 6:09 PMblake_blackshear
07/13/2023, 6:10 PMvisch
07/13/2023, 6:17 PMvisch
07/13/2023, 6:19 PMblake_blackshear
07/13/2023, 6:20 PMblake_blackshear
07/13/2023, 6:21 PMsshfs
in python though?visch
07/13/2023, 6:21 PMvisch
07/13/2023, 6:21 PMblake_blackshear
07/13/2023, 6:21 PMvisch
07/13/2023, 6:21 PMblake_blackshear
07/13/2023, 6:22 PMblake_blackshear
07/13/2023, 6:23 PMblake_blackshear
07/13/2023, 6:23 PMThis project has been archived by its developers and is no longer developed. Alternatives include the mount feature of rclone.
visch
07/13/2023, 6:25 PMvisch
07/13/2023, 6:26 PMvisch
07/13/2023, 6:26 PMblake_blackshear
07/13/2023, 6:29 PMcurl sftp://… | gpg -d | gunzip -c | aws s3 cp - <s3://bucket/>…
is pretty simple toovisch
07/13/2023, 6:30 PMvisch
07/13/2023, 6:30 PMblake_blackshear
07/13/2023, 6:31 PMblake_blackshear
07/13/2023, 6:31 PMblake_blackshear
07/13/2023, 6:31 PMblake_blackshear
07/13/2023, 6:31 PMvisch
07/13/2023, 6:32 PMgpg
would be a bit more invovledblake_blackshear
07/13/2023, 6:32 PMvisch
07/13/2023, 6:32 PMblake_blackshear
07/13/2023, 6:33 PMpython-gnupg
is dependent on the gnupg version in the environment anyway, and its a PITA to script the decrypt if you cant control the envblake_blackshear
07/13/2023, 6:33 PM