[FEEDBACK]  vlm_llava_0 = LLaVA() takes very long, not clear how long

The UI becomes unresponsive. Some kind of feedback, perhaps a comment in the script, that says "wait around X minutes" 

```
from grid.model.perception.vlm.llava import LLaVA
vlm_llava_0 = LLaVA()
```

You could turn LLAVA into a microservice, that is always running on one of the cloud machines?