This repository was archived by the owner on Feb 22, 2024. It is now read-only.
Currently, each parsing command issued for a HistogramDataFile spawns a new thread (from within Django) to parse the file.
Several issues with this approach:
Django carries the burden of parsing files, which it should not.
The pod becomes overloaded, which has even led to pod restarts.
There is no way to prevent multiple threads from being spawned for the same file, which wastes resources.
Next steps:
Consider implementing a separate process that parses files, possibly deployed on its own pod with its own API.
Create a new DB model that implements a queue, which the separate process checks for files to parse. This has the added advantage that file parsing can resume after a crash, because the file's entry in the queue will not have been cleared.
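To make the queue idea concrete, here is a minimal sketch using only the standard-library `sqlite3` module rather than a Django model, so it runs on its own. The table name `parse_queue`, the `status` values, and every function name are hypothetical, chosen for illustration. It assumes a single worker process; several workers would need a locking strategy for claiming entries.

```python
import sqlite3

# Hypothetical schema: one row per file waiting to be parsed.
SCHEMA = """
CREATE TABLE IF NOT EXISTS parse_queue (
    file_id INTEGER PRIMARY KEY,
    status  TEXT NOT NULL DEFAULT 'pending'
)
"""


def open_queue(path=":memory:"):
    """Open (and create if needed) the queue database."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    conn.commit()
    return conn


def enqueue(conn, file_id):
    """Queue a file; return False if it is already queued or being parsed."""
    with conn:
        cur = conn.execute(
            "INSERT OR IGNORE INTO parse_queue (file_id) VALUES (?)", (file_id,)
        )
    # The primary key rejects a second row for the same file.
    return cur.rowcount == 1


def claim_next(conn):
    """Mark the oldest pending entry as running and return its file id."""
    with conn:
        row = conn.execute(
            "SELECT file_id FROM parse_queue "
            "WHERE status = 'pending' ORDER BY file_id LIMIT 1"
        ).fetchone()
        if row is None:
            return None
        conn.execute(
            "UPDATE parse_queue SET status = 'running' WHERE file_id = ?",
            (row[0],),
        )
    return row[0]


def mark_done(conn, file_id):
    """Clear the entry only once parsing has finished."""
    with conn:
        conn.execute("DELETE FROM parse_queue WHERE file_id = ?", (file_id,))


def recover_interrupted(conn):
    """On worker startup, put entries left 'running' by a crash back in line."""
    with conn:
        cur = conn.execute(
            "UPDATE parse_queue SET status = 'pending' WHERE status = 'running'"
        )
    return cur.rowcount


def run_worker(conn, parse):
    """Drain the queue, calling parse(file_id) for each entry."""
    recover_interrupted(conn)
    while (file_id := claim_next(conn)) is not None:
        parse(file_id)
        mark_done(conn, file_id)
```

In this sketch the Django side would only call something like `enqueue`, while the separate process runs `run_worker` with the actual parsing function. Because an entry is deleted only after `parse` returns, a crash mid-parse leaves the row in place, and the next worker start picks it up again.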