@txt_file In my case, each chunk takes 2-60 minutes (80% finish in under 15 minutes) and needs 1 CPU core and <8GB RAM. Random crashes are fine (any in-progress chunks of work would just have to be restarted).
I think many SaaS companies have tasks like this (if only they looked). E.g. Cassandra compactions could be done this way (with a bit of patching, though).
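To make "crashes are fine, just restart the chunk" concrete, here is a rough sketch of the kind of loop I mean. It is only an illustration: `WorkerCrashed`, `run_chunks`, and `max_attempts` are names I made up for this example, chunks are assumed to be hashable and safe to redo from scratch, and a real setup would dispatch to separate machines rather than call a function in-process.

```python
from collections import deque


class WorkerCrashed(Exception):
    """Hypothetical signal that the machine running a chunk disappeared."""


def run_chunks(chunks, process, max_attempts=5):
    """Run every chunk to completion, redoing a chunk from scratch on a crash.

    `process` is any callable that takes a chunk and returns its result.
    A crashed chunk goes to the back of the queue, so other chunks keep
    making progress in the meantime.
    """
    pending = deque((chunk, 1) for chunk in chunks)
    results = {}
    while pending:
        chunk, attempt = pending.popleft()
        try:
            results[chunk] = process(chunk)
        except WorkerCrashed:
            if attempt >= max_attempts:
                raise  # give up: this chunk keeps killing its worker
            pending.append((chunk, attempt + 1))
    return results
```

The only real requirement is that a chunk is idempotent, i.e. running it twice gives the same result as running it once. With chunks this short, the work lost to a crash is bounded by one chunk's runtime.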
