guisong_fu
11/01/2023, 7:56 PMplugins:
extractors:
- name: tap-mysql
variant: transferwise
pip_url: pipelinewise-tap-mysql
metadata:
'*':
replication-method: LOG_BASED
loaders:
- name: target-snowflake
variant: meltanolabs
pip_url: meltanolabs-target-snowflake
config:
hard_delete: false
However, the performance is a bit slow, 1 million rows would take 0.5 hours to load, meaning it will take around 30 days to finish loading the entire table.
Is there any way that I can speed it up? We need to reduce to at most a couple of days to be able to load this amount of data.
Thanks in advance!mark_estey
11/01/2023, 8:36 PMmarcin_wojciechowski
11/01/2023, 9:24 PMguisong_fu
11/02/2023, 8:03 AMLOG_BASED , the tap first checks if there is already such table or if it has synced before, if not, it will start with a full load and remember the bin_log position, once the initial load finishes, it will start syncing data with bin_logs.
And I have indeed tried with full_table , it does not improve the performance..mark_estey
11/02/2023, 2:02 PMguisong_fu
11/02/2023, 3:26 PMLOG_BASEDmark_estey
11/02/2023, 3:33 PMjoshua_janicas
11/02/2023, 5:23 PMmark_estey
11/02/2023, 5:29 PMmark_estey
11/03/2023, 2:34 PM