The UI becomes unresponsive. Some kind of feedback would help, perhaps a comment in the script that says "wait around X minutes".
from grid.model.perception.vlm.llava import LLaVA
vlm_llava_0 = LLaVA()
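One possible way to give that feedback, as a rough sketch only: run the slow constructor in a worker thread and print progress while it loads. The helper name `load_with_feedback` and its parameters are hypothetical and not part of the library. This reports progress on the console, and whether it also keeps a particular UI responsive depends on how that UI schedules its work.

```python
import threading
import time


def load_with_feedback(loader, label="model", poll_seconds=1.0, report=print):
    """Run loader() in a worker thread and report elapsed time until it finishes.

    `loader` is any zero-argument callable, for example a model class.
    """
    result = {}

    def worker():
        try:
            result["value"] = loader()
        except Exception as exc:  # pass loader failures back to the caller
            result["error"] = exc

    thread = threading.Thread(target=worker, daemon=True)
    start = time.monotonic()
    thread.start()
    report(f"Loading {label}; this may take a while...")

    # Wake up every poll_seconds to print how long loading has taken so far.
    while thread.is_alive():
        thread.join(timeout=poll_seconds)
        if thread.is_alive():
            elapsed = time.monotonic() - start
            report(f"  still loading {label} ({elapsed:.0f}s elapsed)")

    if "error" in result:
        raise result["error"]
    report(f"{label} ready after {time.monotonic() - start:.0f}s")
    return result["value"]


# Hypothetical usage with the snippet above:
# vlm_llava_0 = load_with_feedback(LLaVA, label="LLaVA")
```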
Could LLaVA be turned into a microservice that is always running on one of the cloud machines?
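To make the microservice idea concrete, here is a minimal sketch using only the Python standard library: the model is loaded once when the service starts, and every request reuses it. Everything here is an assumption for illustration. `PlaceholderModel`, its `run` method, `make_server`, and `query` are hypothetical names, and the placeholder stands in for the real model because its inference API is not shown above.

```python
import json
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


class PlaceholderModel:
    """Hypothetical stand-in for the real model (its inference API is not shown)."""

    def run(self, prompt):
        return f"echo: {prompt}"


def make_server(model, host="127.0.0.1", port=0):
    """Build an HTTP server that reuses one already-loaded model for every request.

    port=0 asks the operating system for any free port.
    """

    class Handler(BaseHTTPRequestHandler):
        def do_POST(self):
            # Read the JSON request body, e.g. {"prompt": "..."}.
            length = int(self.headers.get("Content-Length", 0))
            payload = json.loads(self.rfile.read(length) or b"{}")
            output = model.run(payload.get("prompt", ""))
            body = json.dumps({"output": output}).encode()
            self.send_response(200)
            self.send_header("Content-Type", "application/json")
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

        def log_message(self, *args):
            # Suppress the default per-request console logging.
            pass

    return ThreadingHTTPServer((host, port), Handler)


def query(url, prompt):
    """Client helper: send a prompt to the running service and return its output."""
    data = json.dumps({"prompt": prompt}).encode()
    request = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["output"]
```

On the cloud machine, the service would call `make_server(model, port=...)` once followed by `serve_forever()`, so the slow load happens a single time; scripts would then call `query(...)` instead of constructing the model themselves.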