generated from fastai/nbdev_template
-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Pull requests: huggingface/trl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
refactor: Move judges to experimental submodule
#4439
opened Nov 3, 2025 by
behroozazarkhalili
Loading…
refactor: Move Mergekit integration to experimental submodule
#4438
opened Nov 3, 2025 by
behroozazarkhalili
Loading…
fix: Remove chat template setting from non-SFT trainer scripts
#4437
opened Nov 3, 2025 by
behroozazarkhalili
Loading…
docs: Move Multi-Adapter RL section to PEFT integration
#4436
opened Nov 3, 2025 by
behroozazarkhalili
Loading…
docs: List all trainers that support Liger Kernel
#4432
opened Nov 2, 2025 by
behroozazarkhalili
Loading…
docs: Unify model examples to use trl-lib namespace
#4431
opened Nov 2, 2025 by
behroozazarkhalili
Loading…
docs: Add PEFT subsection to reducing memory usage guide
#4430
opened Nov 2, 2025 by
behroozazarkhalili
Loading…
docs: Expand speeding up training guide with acceleration methods
#4428
opened Nov 2, 2025 by
behroozazarkhalili
Loading…
docs: Expand training customization examples
#4427
opened Nov 2, 2025 by
behroozazarkhalili
Loading…
4 tasks done
Replace flash attention2 with kernels-community/flash-attn2
#4426
opened Nov 2, 2025 by
tamoghnokandar
Loading…
4 of 5 tasks
docs: Extend CLI basic usage examples to all supported CLIs
#4425
opened Nov 2, 2025 by
behroozazarkhalili
Loading…
Removed outdated warning about batch contamination
#4423
opened Nov 2, 2025 by
Harras3
Loading…
2 tasks done
docs: Remove outdated conversational dataset conversion guidance
#4422
opened Nov 2, 2025 by
behroozazarkhalili
Loading…
docs: Rewrite PEFT integration guide with comprehensive examples
#4421
opened Nov 2, 2025 by
behroozazarkhalili
Loading…
Add On-Policy Distillation from thinking labs to paper index.
#4410
opened Oct 30, 2025 by
pramodith
Loading…
4 of 5 tasks
Add tip for logging evaluation metrics during regular evaluations
#4367
opened Oct 29, 2025 by
cam1llynha
Loading…
[OpenENV] Openenv rollout_func signature proposal
#4344
opened Oct 27, 2025 by
kashif
Loading…
5 tasks
Use explicit tiny-Qwen2ForCausalLM-2.5 model_id param in CI tests
#4331
opened Oct 23, 2025 by
albertvillanova
Loading…
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.