Post-training Reading Group - Mountain View

A monthly reading group focused on the latest research in post-training techniques for large language models, including RFT, RLHF, preference learning, synthetic preference data, and related topics.

📚 Papers by Category

[Aug 7, 2025] Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy (Liu et al., 2025)

📅 Meeting Information

Frequency: Monthly
Location: Collinear HQ in Mountain View, CA
Format: In-person discussion of selected papers
Duration: 1-2 hours per session

🎯 Focus Areas

Our reading group covers various aspects of post-training research:

Reinforcement Learning from Human Feedback (RLHF)
Direct Preference Optimization (DPO) and variants
Preference learning and reward modeling
Alignment and safety techniques
Evaluation and benchmarking
Human-AI collaboration in preference data

✋ How to Participate

Sign up for our mailing list
Show up to our events
Suggest papers for discussion
Co-organize sessions

📞 Contact

For questions about the reading group or to suggest papers, please open an issue in this repository or contact research@collinear.ai

Last updated: July 2025

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Aug 2025		Aug 2025
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Post-training Reading Group - Mountain View

📚 Papers by Category

📅 Meeting Information

🎯 Focus Areas

✋ How to Participate

📞 Contact

About

Uh oh!

Releases

Packages

collinear-ai/post-training-reading

Folders and files

Latest commit

History

Repository files navigation

Post-training Reading Group - Mountain View

📚 Papers by Category

📅 Meeting Information

🎯 Focus Areas

✋ How to Participate

📞 Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages