Hi! I’m trying to export millions of rows from a p...
# best-practices
d
Hi! I’m trying to export millions of rows from a Postgres database with good performance, and I was wondering if any of the Postgres tap variants make use of the COPY command to speed up the export process? I tried the datamill-co variant and it does not look like it uses it.
a
I don't know off the top of my head but I would check the pipelinewise variant.
Their fastsync implementation uses it, I'm sure, but I don't know if it is built directly into the plugin or if that is a fully separate codebase.
d
I looked through the code a bit and it appears most of the variants use the COPY command to load CSV data into Postgres, but unfortunately not in the other direction.
I did not know about fastsync. I’ll look into it, but their doc states that it “is bypassing the Singer Specification for optimised performance”. That may mean this functionality is not available to Meltano...
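For context, here is a rough sketch of what a COPY-based export could look like in Python. It is not taken from any of the tap variants or from fastsync. It assumes a psycopg2-style cursor, whose `copy_expert(sql, file)` method streams `COPY ... TO STDOUT` output into a file-like object. The helper names (`quote_ident`, `build_copy_sql`, `export_table`, `csv_to_singer_records`) are hypothetical, and the table name is assumed to be unqualified (no schema prefix). The last helper shows one possible way to wrap the exported CSV rows as Singer RECORD messages. Every value comes back as a string, so a real tap would still need to convert types according to the stream's schema.

```python
import csv
import io
import json


def quote_ident(name):
    """Double-quote a PostgreSQL identifier, escaping embedded quotes."""
    return '"' + name.replace('"', '""') + '"'


def build_copy_sql(table, columns):
    """Build a COPY ... TO STDOUT statement that emits CSV with a header row.

    Assumption: `table` is an unqualified table name.
    """
    cols = ", ".join(quote_ident(c) for c in columns)
    return (
        f"COPY (SELECT {cols} FROM {quote_ident(table)}) "
        "TO STDOUT WITH (FORMAT csv, HEADER true)"
    )


def export_table(cursor, table, columns, out):
    """Stream the table as CSV into the file-like object `out`.

    Assumption: `cursor` is a psycopg2 cursor (or anything exposing
    the same copy_expert(sql, file) method).
    """
    cursor.copy_expert(build_copy_sql(table, columns), out)


def csv_to_singer_records(csv_text, stream):
    """Yield one Singer RECORD message (as a JSON string) per CSV row.

    CSV carries no type information, so all values stay strings here.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    for row in reader:
        yield json.dumps({"type": "RECORD", "stream": stream, "record": row})
```

With a real database, the cursor would come from something like `psycopg2.connect(...).cursor()`, and `out` could be an open file or `sys.stdout` rather than an in-memory buffer, so that millions of rows are not held in memory at once.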
Thanks for confirming though. I wasn't sure about the implementation of the tap behavior but it sounds like it's not yet in a place where it could be used. @damien_coueron - Is this something you'd be interested in contributing for the Postgres tap? The spec extension is part of our roadmap this month.
d
Unfortunately, I don’t have the skills to contribute!
a
No problem. Postgres is a heavy-use tap so I do think we may get community contribution, hopefully soon after that spec goes live.