
Move storage part to a parallel process? #402

Open
yannikschaelte opened this issue Jan 28, 2021 · 6 comments
@yannikschaelte (Member) commented Jan 28, 2021

Meaning: run store_population in a parallel process, not in the main program. This way, we could get rid of a big part of the "between-generations" time. Downside: some servers might not like the creation of this additional parallel process. @FelipeR888 @EmadAlamoudi what do you think?
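A minimal sketch of the idea, assuming a hypothetical `store_population` stand-in for the real storage routine (the actual pyABC function writes the accepted population to the SQL database): the main process hands the population to a child process and could immediately continue with the next generation.

```python
import multiprocessing as mp


def store_population(population, done):
    # Hypothetical stand-in for the real storage routine: in pyABC
    # this would write the accepted population to the SQL database.
    done.put(len(population))


done = mp.Queue()
# Fire-and-forget: storage runs in a child process while the main
# loop could already continue with the next generation.
proc = mp.Process(target=store_population, args=([0.1, 0.2, 0.3], done))
proc.start()
# ... the main process would continue sampling here ...
proc.join()  # eventually wait for the write to finish
n_stored = done.get()
print(n_stored)
```

This assumes the fork start method (the Linux default), where the child inherits the parent's state cheaply; with spawn, the arguments are pickled and sent over, which is where the "copy the history object" cost would come in.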

@EmadAlamoudi (Collaborator)

Sounds good. However, what is the expected overhead of this additional process? Also, would we need to copy the entire history object to the parallel process? If so, we might need to check the RAM available per core.

@yannikschaelte (Member, Author)

The history object itself should be cheap to copy, as it (I think) at no point holds the SQL database in memory, only pointers to it, and queries it dynamically when needed. Therefore, I would expect next to no overhead in the main process (except for copying this object). File accessors might be a problem, tbd.

@yannikschaelte (Member, Author)

And for fast-running simulations we need to make sure that iteration 3 is not written at the same time as iteration 2, so one would somehow need to lock access there. So it is not trivial to implement. Another problem could be that the main program is canceled by the user while the writing process has not finished writing yet.

@FelipeR888 (Contributor)

It would probably also be sort of Redis-specific then, wouldn't it? But at least for us it does seem reasonable.

@EmadAlamoudi (Collaborator)

This is what comes to my mind too. However, it seems that SQLite handles that on its own, since all of its operations are atomic: https://stackoverflow.com/questions/25700759/avoiding-race-conditions-when-an-update-is-based-on-the-count-of-prior-select
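To illustrate the point about SQLite's atomicity (a small demo, not pyABC's actual schema): several processes can insert into the same database file concurrently, because SQLite's file locking serializes the transactions; writers briefly block each other rather than corrupting the file.

```python
import multiprocessing as mp
import os
import sqlite3
import tempfile


def insert(path, value):
    # Each process opens its own connection. SQLite's file locking
    # makes every transaction atomic, so concurrent writers cannot
    # corrupt the database; with a timeout they just wait for the lock.
    con = sqlite3.connect(path, timeout=30)
    with con:  # commits (or rolls back) the transaction
        con.execute("INSERT INTO results(value) VALUES (?)", (value,))
    con.close()


db_path = os.path.join(tempfile.mkdtemp(), "abc.db")
con = sqlite3.connect(db_path)
con.execute("CREATE TABLE results(value INTEGER)")
con.commit()
con.close()

# Five processes write to the same database file at once.
procs = [mp.Process(target=insert, args=(db_path, v)) for v in range(5)]
for p in procs:
    p.start()
for p in procs:
    p.join()

con = sqlite3.connect(db_path)
n_rows = con.execute("SELECT COUNT(*) FROM results").fetchone()[0]
con.close()
print(n_rows)
```

Note that atomicity prevents corruption, but it does not by itself guarantee the write *order* across generations; that would still need the single-writer design discussed above.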

@yannikschaelte (Member, Author)

Good to know. Maybe one could just try writing a simple parallel process that is started at the beginning of run() and then waits on a queue for results to write to the database. No big algorithmic improvement, but it might make things faster for fast models.

Mid-term, moving to hdf5 (or making the SQL handling faster) will probably still be necessary.
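A sketch of that queue-based design (the `storage_worker` name and the `written` queue are illustrative, not pyABC API). A single consumer process drains the queue in FIFO order, which also resolves the earlier locking concern, and a sentinel plus join addresses the unfinished-write-on-exit problem:

```python
import multiprocessing as mp


def storage_worker(queue, written):
    # Started once at the beginning of run(); waits on the queue for
    # populations to write. A single consumer writes strictly in the
    # order items were enqueued, so generation t+1 can never be
    # written before generation t, and no extra locking is needed.
    while True:
        item = queue.get()
        if item is None:  # sentinel: run() is finished
            break
        t, population = item
        written.put(t)  # stand-in for the actual database write


queue = mp.Queue()
written = mp.Queue()  # only used here to observe the write order
worker = mp.Process(target=storage_worker, args=(queue, written))
worker.start()

for t in range(3):  # the main loop just enqueues and moves on
    queue.put((t, [f"particle-{t}"]))

queue.put(None)  # signal shutdown ...
worker.join()    # ... and wait until all pending writes are done
order = [written.get() for _ in range(3)]
print(order)
```

The `join()` at the end is what guards against the cancellation problem: run() (or a signal handler) would block there until the worker has flushed every pending write.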

4 participants