Replies: 4 comments 1 reply
-
I think it is a good idea. |
Beta Was this translation helpful? Give feedback.
-
I also think it would be beneficial. Now I would maybe still have a warning, e.g. when it gets to more than X jobs or flows, to tell users that they are maybe using something not tailored to high-throughput production. |
Beta Was this translation helpful? Give feedback.
-
In my case, the issue with these file based mongoDB (MontyDB, mongita, tinymongo) is that they typically do not support On the other hand, if used with the fireworks manager there could be multiple fireworks trying to update the DB at the same time. I quickly checked but I could not find details about how MontyDB handles concurrency. In any case I expect that the Having looked into file and DB locking I should also add that, since several cluster use NFS, handling concurrency may be tricky. SQLite DBs may be corrupted for concurrent access on an NFS. I used flufl.lock in jobflow-remote for the lock of one file. It is defined as "NFS-safe", but I made a test trying to push it with high number of very fast access on a cluster where the FS is not very fast and it sometimes failed. I was fine with that, since I think it was an unlikley situation for my use case, but still to be taken into account as a source of potential issue when using file based DBs. |
Beta Was this translation helpful? Give feedback.
-
I think this issue might also be relevant here: materialsproject/maggma#832 |
Beta Was this translation helpful? Give feedback.
-
Currently jobflow uses a memory store as the default store. This means that any outputs from workflows are lost at the end of the script or when a notebook is restarted.
What people think about changing the default store to be a
MontyStore
written to the~/.jobflow
folder? This is closer to whatcovalent
does. It would mean that creating a basic atomate tutorial would also be very simple.Beta Was this translation helpful? Give feedback.
All reactions