Skip to content
Discussion options

You must be logged in to vote

我们没有探索multi-agent部分,有ROLL的用户自行开发过,两个思路都可以实现:

  • 由actor_infer作为llm,靠prompt切换不同的agent,这里自定义env manager调整rollout过程即可,当然训练部分需要按需设计一下。
  • 由不同的模型作为不同agent后的llm,这种创建定义额外的actor_infer,如actor_infer_aux,都能满足

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by Ericnano
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants