collinear-ai/post-training-reading


Post-training Reading Group - Mountain View

A monthly reading group focused on the latest research in post-training techniques for large language models, including RFT, RLHF, preference learning, synthetic preference data, and related topics.

📚 Papers by Category

📅 Meeting Information

  • Frequency: Monthly
  • Location: Collinear HQ in Mountain View, CA
  • Format: In-person discussion of selected papers
  • Duration: 1-2 hours per session

🎯 Focus Areas

Our reading group covers various aspects of post-training research:

  • Reinforcement Learning from Human Feedback (RLHF)
  • Direct Preference Optimization (DPO) and variants
  • Preference learning and reward modeling
  • Alignment and safety techniques
  • Evaluation and benchmarking
  • Human-AI collaboration in preference data

✋ How to Participate

  • Sign up for our mailing list
  • Show up to our events
  • Suggest papers for discussion
  • Co-organize sessions

📞 Contact

For questions about the reading group, or to suggest papers, please open an issue in this repository or email research@collinear.ai.


Last updated: July 2025
