fred_reimer
04/08/2022, 9:31 PMaaronsteers
04/08/2022, 10:40 PMpartitions
if context is not set. The partitions
list is seeded from the last STATE
message, so the behavior you describe would make sense but only if context
is missing/empty. That said, as long as you still have the parent-child relationship in tact, I don't know why context
from the parent would not be used.
context_list = [context] if context is not None else self.partitions
aaronsteers
04/08/2022, 10:45 PMignore_parent_replication_keys
and/or state_partitioning_keys
?
I would not expect the old STATE
partitions to be cleaned out but I also would not expect the parent's children to be continually queried when the parent does not exist.aaronsteers
04/08/2022, 10:47 PMn
of parent count is not very large, you can avoid partition-level bookmarks by setting state_partitioning_keys
to a higher-level granularity or to []
to track just a single stream per key.fred_reimer
04/08/2022, 11:07 PMn
is not very large now, accounts is maybe a dozen or two, but it will grow (not to thousands). We are not doing anything fancy here. Just when accounts no longer processes a record for a deleted account_id, then child stream is still trying to do an incremental and sync. That is, until we manually edited the job record in the DB and updated the payload for the last id for the job_id, which worked. But this can't be a manual process. This has to work automatically....fred_reimer
04/08/2022, 11:15 PMfred_reimer
04/08/2022, 11:18 PMaaronsteers
04/08/2022, 11:49 PMstate_partitioning_keys = []
on the account_info child stream. Do you mind testing this if feasible to do so? And also, could you open an issue so we can look into the root cause?aaronsteers
04/08/2022, 11:51 PMaaronsteers
04/08/2022, 11:52 PMfred_reimer
04/09/2022, 12:56 AM