have other folks encountered OOM errors We containerize our Meltano #singer-taps

haleemur_ali

05/05/2023, 3:33 PM

have other folks encountered OOM errors? We containerize our meltano workloads and encountered an OOM error when doing a full load of marketo activities. what strategies have you employed to work around this error? would we benefit if meltano provided some sort of flushing & state tracking so that memory is cleared before the machine crashes?

haleemur_ali

05/05/2023, 3:33 PM

do you need a volunteer to implement said functionality?

05/05/2023, 3:46 PM

An easy solution we came up with is to have different catalog configs per stream, and to deploy streams with huge data to containers with bigger specs.

thomas_briggs

05/05/2023, 4:04 PM

What target are you loading to? Some of the targets I've used (target-postgres, for example) batch data for all streams and flush them at the end, so if you have a lot of streams, and especially if you have a lot of data, you can use a lot of memory.
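
To illustrate the batching behaviour described above, here is a minimal sketch of the alternative: a per-stream buffer that flushes as soon as a stream reaches a record limit, rather than holding everything until the end of the run. It is not taken from target-postgres or any other real target; `BoundedBuffer`, `flush_fn`, and `max_records` are hypothetical names chosen for illustration.

```python
from collections import defaultdict
from typing import Callable, Dict, List


class BoundedBuffer:
    """Hypothetical per-stream buffer that flushes once a stream holds max_records."""

    def __init__(
        self,
        flush_fn: Callable[[str, List[dict]], None],
        max_records: int = 1000,
    ) -> None:
        self.flush_fn = flush_fn        # assumed callback that writes one batch to the destination
        self.max_records = max_records  # upper bound on records held in memory per stream
        self.batches: Dict[str, List[dict]] = defaultdict(list)

    def add(self, stream: str, record: dict) -> None:
        """Buffer one record and flush the stream if it has reached the limit."""
        batch = self.batches[stream]
        batch.append(record)
        if len(batch) >= self.max_records:
            self.flush(stream)

    def flush(self, stream: str) -> None:
        """Write out and drop the buffered records for a single stream."""
        batch = self.batches.pop(stream, [])
        if batch:
            self.flush_fn(stream, batch)

    def flush_all(self) -> None:
        """Flush whatever is left, e.g. at the end of the run."""
        for stream in list(self.batches):
            self.flush(stream)
```

With a buffer like this, peak memory is roughly bounded by `max_records` times the number of active streams, instead of growing with the full size of the load.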

haleemur_ali

05/05/2023, 4:05 PM

yeah, that's what we ended up doing. run the big streams on the big machines. we're doing target-s3 for this case, but for a different one, we'll likely do target-redshift.

haleemur_ali

05/05/2023, 4:06 PM

but, the solution is still sub-optimal & inelegant in my view

edgar_ramirez_mondragon

05/05/2023, 4:08 PM

I agree that taps should be able to receive an OOM signal and stop gracefully (i.e. emit a final state message). @haleemur_ali would you like to log an issue in the meltano repo?
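
A rough sketch of what "stop gracefully and emit a final state message" could look like, under some loud assumptions. The Linux OOM killer terminates processes with SIGKILL, which cannot be caught, so this sketch assumes something else (an orchestrator or a memory watchdog) sends SIGTERM before the limit is reached. `request_stop`, `install_handler`, `sync`, and the `last_id` bookmark are hypothetical names, not part of Meltano or any existing tap.

```python
import json
import signal
import sys
from typing import Iterable, Optional, TextIO

stop_requested = False


def request_stop(signum, frame) -> None:
    """Signal handler: only set a flag, so the sync loop can exit at a safe point."""
    global stop_requested
    stop_requested = True


def install_handler() -> None:
    """Register the handler for SIGTERM (assumed to arrive before the OOM kill)."""
    signal.signal(signal.SIGTERM, request_stop)


def sync(records: Iterable[dict], stream: str, out: TextIO = sys.stdout) -> Optional[dict]:
    """Emit Singer RECORD messages, then a final STATE message, even if interrupted."""
    bookmark = None
    for record in records:
        if stop_requested:
            break  # stop before emitting this record; the bookmark still points at the last one sent
        out.write(json.dumps({"type": "RECORD", "stream": stream, "record": record}) + "\n")
        bookmark = {"bookmarks": {stream: {"last_id": record["id"]}}}
    if bookmark is not None:
        out.write(json.dumps({"type": "STATE", "value": bookmark}) + "\n")
    out.flush()
    return bookmark
```

A tap would call `install_handler()` once at startup and then run `sync(...)` per stream; the next run could resume from the emitted bookmark instead of restarting the full load.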

haleemur_ali

05/05/2023, 4:08 PM

yes. i'll log it & we can continue the discussion there.
