Sven Balnojan
01/27/2023, 9:40 AMvisch
01/27/2023, 1:52 PMI don't want to have full table later onI do this but I do full tables syncs for all the tables we need right now. Problem with incremental streams and db->db is you miss deletes. Some handle this by using a log based approach instead
Have whatever happens completely versioned.https://learn.microsoft.com/en-us/sql/relational-databases/tables/temporal-tables?view=sql-server-ver16 works pretty well for me
Ability to retry initial loads that didn't work out.Orchestrator, and this ties in with #3
Some way of parallelization for myself.Run each table individually
Define N full table syncs on the same table, partition them by hand in a way I know makes sense (creation date e.g.).I'd automate it all personally, not sure why you wouldn't just full table sync it. Tables are >500M records or something? Other questions: 1. How important is the type mapping to you between the DBs? how accurate does the type mapping need to be, after you get data moving I'd look at this 2. How important is speed? 3. How big is the data?
Sven Balnojan
01/27/2023, 1:56 PMpat_nadolny
01/27/2023, 2:47 PMthomas_briggs
01/27/2023, 2:48 PMthomas_briggs
01/27/2023, 2:50 PMvisch
01/27/2023, 2:51 PMthomas_briggs
01/27/2023, 2:51 PMpat_nadolny
01/27/2023, 2:55 PMvisch
01/27/2023, 2:55 PMpat_nadolny
01/27/2023, 2:57 PMpat_nadolny
01/27/2023, 2:58 PMvisch
01/27/2023, 2:58 PMthomas_briggs
01/27/2023, 2:59 PMpat_nadolny
01/27/2023, 3:09 PMpat_nadolny
01/27/2023, 3:11 PMstate get/set
to update your bookmarks. This could also be helpful if someone is switching from their home grown scripts to Meltano, they dont need to start over if they start their incremental sync from the right point.Sven Balnojan
01/27/2023, 3:24 PMaaronsteers
01/28/2023, 5:29 AMaaronsteers
01/28/2023, 5:52 AMSven Balnojan
01/30/2023, 11:50 AMtaylor
01/30/2023, 3:42 PM