Navigation
Breadcrumb

Journal Club

Welcome to our Journal Club! Here, we discuss papers that we find interesting and relevant to our research interests and use-cases. The papers are selected by the team and are presented by members each week! Additionally, a different team member records notes on what was discussed.

📊 Weekly Tracking

DatePaperPresenterNote TakerDiscussion Notes
2025-08-22GLM-4.5V and GLM-4.1V-ThinkingYoshBennotes
2025-08-15On the Generalization of SFTHunterBennotes
2025-08-08Generative Verifiers: Reward Modeling as Next-Token PredictionHunterBennotes
2025-07-11SingLoRAHunterBennotes
2025-06-20Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement LearningHunterBennotes
2025-06-13The Illusion of ThinkingHunterBennotes
2025-05-30The Leaderboard IllusionHunterBennotes
2025-05-23Beyond the Last Answer: Your Reasoning Trace Uncovers More than You ThinkHunterBennotes
2025-05-16xRAG: Extreme Context Compression for Retrieval-augmented Generation with One TokenOliviaHunternotes
2025-05-09Absolute ZeroYoshBen
2025-05-02Tina: Tiny Reasoning Models via LoRAHunternotes
2025-04-25Byte Latent Transformer: Patches Scale Better Than TokensHunterYoshnotes
2025-04-18Unintentional Unalignment: Likelihood Displacement in DPONikhilYoshnotes
2025-04-11PaperBench: Evaluating AI’s Ability to Replicate AI ResearchHunternotes
2025-04-04DAPO: An Open-Source LLM Reinforcement Learning System at ScaleYoshHunternotes
2025-03-28SimpleRL-ZooHunterBennotes
2025-03-21LADDERHunterYoshnotes
2025-03-14InftyThinkYoshBennotes
2025-03-07Visual-RFTYoshHunternotes
2025-02-28SFT Memorizes, RL GeneralizesHunterYoshnotes
2025-02-21Scaling up Test-Time Compute with Latent ReasoningNikhil
2025-02-07DeepSeek-R1Nikhil
2025-01-24rStar-mathHunter
2025-01-17DeepSeek-V2Hunter
2025-01-10DeepSeekMathHunter
2025-01-03ModernBERTHunternotes

📶 Presentation Stats

Team MemberPresentationsNotesSum
Hunter17320
Nikhil303
Yosh549
Olivia101
Ben033