Hi! Great work on USO. Quick question:
Is it feasible to create a style image library that stores pooled SigCLIP features (from your hierarchical projector) to achieve more generalizable style transfer for specific art styles?
The idea is to pre-compute and pool features from multiple images of the same style, then use these pooled features for better style generalization.
Any obvious technical barriers I should be aware of?
Thanks!