Dear Sir,
Thanks for your great job and open source your code.
And i want to ask whether we can directly finetune other multi-view imgaes version based on existing ckpt? For example, we finetune 3 views on 2 views version. Or we only trainning 3-view from scratch?
If we train with 3 views, how many scenes data entries would be sufficient, what size of GPU is needed, and how long does it usually take to complete the training?
We look forward to your reply, as it is crucial for ours. Thank you!
Best