Question about wan_va_server.py #50
Description
I noticed that the infer() function in wan_va_server.py contains three logic branches: reset, compute KV cache, and infer one chunk.
I understand that the reset branch is triggered at the very beginning of a task when the client sends a dictionary where the reset key is set to True and the prompt key contains the text description.
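For reference, my current understanding of the reset message can be sketched as below. This is only a minimal sketch: the reset and prompt keys are the ones described above, while the helper name build_reset_message and the example prompt text are hypothetical and not taken from the repository. How the dictionary is serialized and sent to the server is left out.

```python
def build_reset_message(prompt: str) -> dict:
    """Hypothetical helper: build the first message of a task.

    Assumption: the reset branch only needs these two keys.
    """
    return {
        "reset": True,     # tells infer() to take the reset branch
        "prompt": prompt,  # text description of the task
    }


# Example prompt text is made up for illustration.
msg = build_reset_message("pick up the red block")
print(msg)
```

If the reset branch expects any additional keys, please correct this as well.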
Regarding the other two branches, I have the following questions:
Dictionary Structure: For the compute KV cache and infer one chunk branches, which key-value pairs must the dictionary sent by the client contain, and what is the exact format of each value?
Timing: At which stages of a task, or under what conditions, should the client send the dictionary for compute KV cache, and when should it send the one for infer one chunk?