Hi, thanks for your interesting work. It seems to start the interference on a sequence, you need the first N=3 frames with their masks to form the support set. However, you just have the first frame and its mask at the interference time. I think that you need an auxiliary method to segment the second and third frame before using your method. How did you handle it? Thanks