when you're running multiple jobs at a time this can get tricky as you have to pick which source state you want to merge into.
I'd like to know how you're orchestrating the jobs, why select_filter was the choice you went with, how it's being used, and why you need/want state to exist later. Lots of options here, but nothing super easy with state and parallelization and independent streams with select_filter