christopher
08/03/2023, 12:29 PM{"type":"SCHEMA","stream":"issues","schema":{"selected":true,"properties":{"url":{"type":["string","null"]}},"type":["object","null"]},"key_properties":["id"]}
v.s.
{"type":"SCHEMA","stream":"issues","schema":{"selected":true,"properties":{"url":{"type":["null","string"]}},"type":["null","object"]},"key_properties":["id"]}
The problem with that is during the loader execution, the loader tells that the schema changed, and ends up writing only the records after the last SCHEMA found in stdout.
root@210c2ae8fa43:/meltano# cat foo.json | poetry run meltano invoke target-s3
2023-08-03T12:26:31.457374Z [info ] Environment 'dev' is active
2023-08-03 12:26:33,786 Target 'target-s3' is listening for input from tap.
2023-08-03 12:26:33,786 Initializing 'target-s3' target sink...
2023-08-03 12:26:33,786 Initializing target sink for stream 'issues'...
2023-08-03 12:26:34,474 Schema has changed for stream 'issues'. Mapping definitions will be reset.
2023-08-03 12:26:34,474 Schema or key properties for 'issues' stream have changed. Initializing a new 'issues' sink...
2023-08-03 12:26:34,474 Initializing 'target-s3' target sink...
2023-08-03 12:26:34,474 Initializing target sink for stream 'issues'...
Does this ring a bell to anyone, and would you have a clue how to fix this stupid list ordering issues?visch
08/03/2023, 1:17 PMuser
08/03/2023, 2:57 PMchristopher
08/03/2023, 3:05 PMedgar_ramirez_mondragon
08/03/2023, 3:32 PMShould I open an issue on the tap repo to ask them?Yeah, I think it's worth asking folks over there.
Yeah the SDK should probably handle better this situation, definitely!Worth logging an issue. I like Pat's proposed solution since it seems to be able to ignore order and duplicates in lists, but I'd worry about performance (probably not a big deal!)
visch
08/03/2023, 3:33 PMedgar_ramirez_mondragon
08/03/2023, 3:33 PMuser
08/03/2023, 3:34 PMuser
08/03/2023, 3:34 PMvisch
08/03/2023, 3:59 PM