Journal Club
Welcome to our Journal Club! Here, we discuss papers that we find interesting and relevant to our research interests and use-cases. The papers are selected by the team and are presented by members each week! Additionally, a different team member records notes on what was discussed.
📊 Weekly Tracking
📶 Presentation Stats
| Team Member | Presentations | Notes | Sum |
|---|---|---|---|
| Hunter | 17 | 3 | 20 |
| Nikhil | 3 | 0 | 3 |
| Yosh | 5 | 4 | 9 |
| Olivia | 1 | 0 | 1 |
| Ben | 0 | 3 | 3 |
beyond-the-last-answer
Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think
Paper: Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think
Presenter: Hunter
Note Taker:
Date: May 23, 2025
Discussion Notes
[Notes to be added]
byte-latent-transformer
Byte Latent Transformer: Patches Scale Better Than Tokens
Paper: Byte Latent Transformer: Patches Scale Better Than Tokens
Presenter: Hunter
Note Taker: Yosh
Date: April 25, 2025
Overview
Replaces fixed-vocabulary tokenization with entropy-based grouping of bytes into variable-length patches. Patch boundaries are chosen by a small byte-level autoregressive model using either a global entropy constraint or an approximate monotonic constraint; an incremental patcher ensures that patching on streaming input matches patching on the full sequence. The model stack uses two small transformers (local encoder/decoder) plus a larger latent transformer operating over patches. Reported performance is comparable to Llama 3 on language benchmarks and stronger in settings like character permutation. A wrapper path is described: initialize from an existing model (e.g., Llama 3) and add a BLT wrapper.
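A minimal sketch of the entropy-threshold patching rule described above, assuming a stand-in for the small byte-level model (the `next_byte_probs` stub and the threshold value are illustrative, not from the paper):

```python
import numpy as np

def next_byte_probs(prefix: bytes) -> np.ndarray:
    """Hypothetical stand-in for the small byte-level LM: returns a
    distribution over the 256 possible next bytes given the prefix."""
    logits = np.random.default_rng(len(prefix)).normal(size=256)
    return np.exp(logits) / np.exp(logits).sum()

def entropy_patches(data: bytes, threshold: float = 4.0) -> list[bytes]:
    """Group bytes into variable-length patches: start a new patch whenever
    the predicted next-byte entropy exceeds a global threshold (the paper
    also describes an approximate monotonic variant)."""
    patches, start = [], 0
    for i in range(1, len(data)):
        p = next_byte_probs(data[:i])
        h = -(p * np.log2(p + 1e-12)).sum()   # next-byte entropy in bits
        if h > threshold:                      # high uncertainty -> patch boundary
            patches.append(data[start:i])
            start = i
    patches.append(data[start:])
    return patches

print(entropy_patches(b"Byte Latent Transformer", threshold=7.9))
```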
Novelty
- Dynamic, vocabulary-free byte patching driven by predicted next-byte entropy (vs. static tokenizers, BPE, or space-based rules).
- Incremental patcher property for streaming/online encoding consistency.
- Architecture combining local transformers for patch formation with a latent transformer consuming patch sequences.
- Weight-initialization strategy that reuses pretrained weights and layers them under a BLT wrapper.
Learnings
- Tokenizers’ brittleness (noise sensitivity, orthographic gaps, multilingual inequity) is a motivating factor for patching.
- BLT matches or exceeds Llama 3 on several evaluations; particularly robust to character-level perturbations.
- Edge behavior observed: patch lengths can expand on repeated or similarly structured phrases (e.g., multiple-choice options).
- The wrapper approach suggests compatibility with existing model ecosystems while moving toward tokenizer-free inputs.
Notes for Application
- Patch-based byte encodings could be feasible for noisy or multilingual document text where tokenizers struggle.
- The incremental patcher may support streaming OCR ingestion while keeping encodings consistent across chunks.
- A wrapper initialized from existing Llama-based checkpoints offers a migration path for current document models; worth testing alongside standard tokenizers.
- Monitor repeated, formulaic sections in forms (e.g., option lists): similar structure may trigger longer patches; evaluate effects on latency and memory.
dapo
DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Paper: DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Presenter: Yosh
Note Taker: Hunter
Date: April 4, 2025
Overview
System-scale RL recipe that modifies GRPO-style training with changes to clipping, sampling, token-level credit assignment, and length handling. Also removes the KL term and uses an answer-equality reward.
Novelty
- Clip-Higher: decouples upper and lower clip bounds; relaxes the upper bound to avoid restricting generations while keeping lower-bound clipping (see the sketch after this list).
- Dynamic sampling: builds batches that are not all-correct or all-incorrect to keep advantages informative.
- Token-level policy-gradient loss: replaces “mean of means” with a single averaging step so token contributions are credited directly.
- Overlong reward shaping: masks truncated regions instead of penalizing them; optional soft ramp as outputs approach max length.
- Additional choices: KL term removed; reward defined by abstract equality of answers; revised gradient averaging.
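A minimal sketch combining the decoupled clipping, token-level averaging, and overlong masking ideas, assuming per-token log-probs and advantages are already computed; the function name and default bounds are our choices, not DAPO's reference implementation:

```python
import torch

def dapo_token_loss(logp_new, logp_old, advantages, loss_mask,
                    eps_low=0.2, eps_high=0.28):
    """Token-level policy-gradient loss with decoupled (Clip-Higher) bounds.

    logp_new, logp_old : (batch, seq) per-token log-probs under the new/old policy
    advantages         : (batch, seq) per-token advantages (broadcast per sequence in GRPO)
    loss_mask          : (batch, seq) 1 for tokens that count; truncated/overlong
                         regions are simply masked out rather than penalized
    """
    ratio = torch.exp(logp_new - logp_old)
    clipped = torch.clamp(ratio, 1.0 - eps_low, 1.0 + eps_high)   # asymmetric clip
    per_token = -torch.minimum(ratio * advantages, clipped * advantages)
    # Single averaging step over all unmasked tokens ("token-level" credit),
    # rather than a mean of per-sequence means.
    return (per_token * loss_mask).sum() / loss_mask.sum().clamp(min=1)
```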
Learnings
- Decoupled clipping provides control over how much updates constrain outputs.
- Ensuring batch outcome diversity avoids zero-advantage collapses.
- Token-level credit sharpens feedback to helpful and unhelpful tokens.
- Treating truncation separately from verbosity avoids punishing context-limit artifacts.
- KL-free training shifts reliance to the reward and clipping scheme.
Notes for Application
- Dynamic sampling could prevent degenerate batches in extraction RL (e.g., all fields correct or all incorrect).
- Overlong shaping can help when context limits cause truncation in long documents.
dft
On the Generalization of SFT
Paper: On the Generalization of SFT
Presenter: Hunter
Note Taker: Ben
Date: August 15, 2025
Overview
This paper analyzes why supervised fine-tuning (SFT) often underperforms reinforcement learning (RL) in terms of generalization. The authors show mathematically that SFT gradients can be seen as a special case of policy gradients, but with an implicit, ill-posed reward structure that creates instability and overfitting. They propose Dynamic Fine-Tuning (DFT), a simple reweighting of the SFT loss by token probability. DFT is presented as a one-line code change that stabilizes training and substantially improves generalization across tasks and models.
Novelty
- Provides a formal equivalence between SFT and policy gradient methods, identifying an implicit inverse-probability reward that destabilizes SFT.
- Introduces Dynamic Fine-Tuning (DFT), which rescales the objective with token probabilities to eliminate this instability. The modified loss function is
  \[ L_{\text{DFT}}(\theta) = \mathbb{E}_{(x,y^\star)\sim D}\; -\sum_{t=1}^{|y^\star|} \text{sg}\big(\pi_\theta(y^\star_t \mid y^\star_{<t}, x)\big)\, \log \pi_\theta(y^\star_t \mid y^\star_{<t}, x) \]
  where sg(·) is the stop-gradient operator (a code sketch of this loss follows this list).
- Shows that DFT achieves stronger gains than both standard SFT and importance-weighted SFT (iw-SFT).
- Extends evaluation beyond SFT settings into offline RL, where DFT outperforms both offline (RFT, DPO) and online methods (PPO, GRPO) despite its simplicity.
- Observes that DFT changes token probability distributions in a polarized way—boosting some tokens while downweighting others—unlike SFT’s uniform probability increase.
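A minimal PyTorch sketch of the reweighted loss, assuming teacher-forced logits and target token ids; the function name and padding handling are our choices:

```python
import torch
import torch.nn.functional as F

def dft_loss(logits, targets, pad_mask):
    """Dynamic Fine-Tuning loss: token-level cross-entropy reweighted by the
    (stop-gradient) probability the model assigns to each target token.

    logits   : (batch, seq, vocab) model outputs under teacher forcing
    targets  : (batch, seq) gold next-token ids
    pad_mask : (batch, seq) 1 for real tokens, 0 for padding
    """
    logp = F.log_softmax(logits, dim=-1)
    tok_logp = logp.gather(-1, targets.unsqueeze(-1)).squeeze(-1)  # log pi(y*_t | ...)
    weight = tok_logp.detach().exp()        # sg(pi(y*_t | ...)): no gradient flows here
    return -(weight * tok_logp * pad_mask).sum() / pad_mask.sum().clamp(min=1)
```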
Learnings
- The poor generalization of SFT comes from its implicit reward weighting, which overemphasizes low-probability expert tokens and increases gradient variance.
- DFT corrects this by giving uniform rewards to expert tokens, leading to more stable updates.
- Empirically, DFT consistently outperforms SFT across multiple benchmarks (Math500, Minerva, Olympiad Bench, AIME, AMC) and models (Qwen2.5, LLaMA, DeepSeek). Gains are often several times larger than those achieved by standard SFT.
- DFT converges faster, achieves better early-stage performance, and avoids the plateau behavior seen in SFT.
- In offline RL experiments, DFT not only surpasses SFT-based baselines but also outperforms online methods like PPO and GRPO, highlighting its efficiency.
- Token distribution analysis suggests that not all tokens should be fit equally; deprioritizing connective or low-value tokens may improve robustness.
- Ablation studies confirm that DFT’s advantage is not due to hyperparameter choices—improvements hold across learning rates and batch sizes.
Notes for Application
- DFT offers a practical improvement to SFT with minimal implementation cost—a single modification to the loss.
- Particularly useful in settings where RL is too resource-intensive or reward signals are unavailable.
- Highlights the importance of examining implicit reward structures in fine-tuning objectives, which may influence broader post-training strategies.
genrm
Generative Verifiers: Reward Modeling as Next-Token Prediction
Paper: Generative Verifiers: Reward Modeling as Next-Token Prediction
Presenter: Hunter
Note Taker: Ben
Date: August 8, 2025
Overview
Recasts verification/reward modeling as next-token prediction: given a problem and a candidate solution, the verifier answers “Is the answer correct (Yes/No)?” and uses the probability of Yes as the score. A CoT variant first generates a verification rationale, then emits Yes/No; multiple rationales can be sampled and averaged at inference. Reported gains over discriminative reward models and DPO-style verifiers on GSM8K and algorithmic tasks, with transfer to MATH.
Novelty
- Verification as token probability (direct and CoT modes; a scoring sketch follows this list):
  ( r_{\text{Direct}}(x,y) = p_\theta(\text{Yes}\mid x,y,I) )
  ( r_{\text{CoT}}(x,y) = p_\theta(\text{Yes}\mid x,y,I_{\text{CoT}},v_{\text{CoT}},I) ), with ( v_{\text{CoT}}\sim p_\theta(\cdot\mid x,y,I_{\text{CoT}}) )
  Majority vote over (K) rationales: ( r_{\text{MajV@K}}=\frac{1}{K}\sum_{i=1}^K p_\theta(\text{Yes}\mid\cdot,v^{(i)}_{\text{CoT}}) )
- Unified objective for verification + solution generation:
  ( L_{\text{GenRM}} = L_{\text{SFT}}(D_{\text{verify}}) + \lambda\,L_{\text{SFT}}(D_{\text{correct}}) )
- Synthetic verification rationales for CoT training (reference-guided variants and multiple rationales per solution explored).
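A minimal sketch of direct-mode scoring with a Hugging Face causal LM; the prompt template, placeholder model name, and Yes/No token handling are assumptions rather than the paper's exact setup:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; in practice a causal LM fine-tuned as a verifier
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

def yes_probability(question: str, candidate: str) -> float:
    """Score a candidate solution by the probability of 'Yes' to the verification
    instruction (direct mode; the CoT mode would first sample a verification
    rationale and append it to the prompt before asking Yes/No)."""
    prompt = f"{question}\nProposed answer: {candidate}\nIs the answer correct (Yes/No)? "
    inputs = tok(prompt, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0, -1]          # next-token logits
    probs = torch.softmax(logits, dim=-1)
    yes_id = tok.encode(" Yes", add_special_tokens=False)[0]  # tokenizer-specific
    no_id = tok.encode(" No", add_special_tokens=False)[0]
    # Normalize over the Yes/No pair so the score behaves like a binary confidence.
    return (probs[yes_id] / (probs[yes_id] + probs[no_id])).item()

# Best-of-N re-ranking: score each candidate and keep the highest Yes-probability.
```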
Learnings
- Best-of-N re-ranking improves when scoring candidates with direct/CoT verifiers; CoT with multiple verification votes increases accuracy.
- Joint training on verification plus correct-solution SFT improves verification; overly large generation mix can reduce verifier quality.
- Performance scales with verifier size and with the number of verification votes (K).
- Quality and quantity of CoT rationales both matter; reference-guided rationales and multiple rationales per solution tend to help.
Notes for Application
- Generative verification appears compatible with loss-run extraction: a verifier can score candidate parses by emitting Yes/No and using (p(\text{Yes})) as the selection signal.
- The Yes-token probability can serve as a confidence score for downstream API calls where logprobs are not returned.
glm
GLM Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Paper: GLM-4.5V and GLM-4.1V-Thinking
Presenter: Yosh
Note Taker: Ben
Date: August 22, 2025
Overview
Presents a multimodal reinforcement learning approach aimed at broad coverage across coding, math, GUI agents, visual grounding, and OCR. Reports that GLM-4.1V-9B-Thinking outperforms larger models such as Qwen2.5-VL-72B. The training recipe combines curriculum-based RL with a vision–language stack updated for long sequences and varied image aspect ratios.
Novelty
- Curriculum RL: training data organized by difficulty, estimated via pretrained models’ pass@k; the intent is to keep examples neither trivial nor infeasible as the model strengthens (a sampling sketch follows this list).
- Architecture:
- 3D convolutions to enable temporal downsampling.
- 2D-RoPE integrated into ViT self-attention.
- Aspect-ratio handling by normalizing images to the ViT’s absolute-position grid via bicubic interpolation.
- Data packing: concatenates variable-length samples to approach maximum context length.
- Training pipeline:
- Cold start with long chain-of-thought SFT before RL.
- RL phase includes RLVR and RLHF.
- KL term removed from the loss.
- Rewards provided either by extracting answers from structured responses or by using an LLM evaluator.
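A minimal sketch of pass@k-based difficulty binning with a sampling schedule that shifts toward harder items over training; the bin thresholds and schedule are illustrative choices, not the reported recipe:

```python
import random

def difficulty_bins(examples, pass_at_k, easy=0.8, hard=0.1):
    """Bucket training examples by a pretrained model's pass@k: items the model
    almost always solves give little signal, items it never solves are (for now)
    infeasible, and the middle band is where RL learns most."""
    bins = {"easy": [], "medium": [], "hard": []}
    for ex in examples:
        p = pass_at_k[ex["id"]]
        if p >= easy:
            bins["easy"].append(ex)
        elif p <= hard:
            bins["hard"].append(ex)
        else:
            bins["medium"].append(ex)
    return bins

def curriculum_batch(bins, step, total_steps, batch_size=32):
    """Shift sampling weight from medium toward hard examples as training
    progresses (one possible schedule, not the paper's exact one)."""
    frac = step / max(total_steps, 1)
    weights = {"easy": 0.1, "medium": 0.9 - 0.6 * frac, "hard": 0.6 * frac}
    pool, w = zip(*[(b, weights[name]) for name, b in bins.items() if b])
    chosen = random.choices(pool, weights=w, k=batch_size)
    return [random.choice(b) for b in chosen]
```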
Learnings
- Curriculum sampling is used to address the observation that easy items late in training give little signal, while hard items too early impede learning.
- Chain-of-thought SFT is used for initialization, then RL refines behavior.
- Removing the KL term is part of the reported setup; alignment proceeds without an explicit reference-model regularizer.
- Vision-side changes (3D conv, 2D-RoPE, aspect-ratio normalization) are presented as enabling more stable multimodal processing.
- Pretraining covers captioning, interleaved image–text, OCR, visual grounding, and instruction-tuning sources; long-sequence packing is used to maximize utilization.
illusion
The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
Paper: The Illusion of Thinking
Presenter: Hunter
Note Taker: Ben
Date: June 13, 2025
Overview
Uses controllable puzzle environments (Tower of Hanoi, Checker Jumping, River Crossing, Blocks World) to probe Large Reasoning Models (LRMs) versus standard LLMs under matched inference compute. Finds three regimes: (i) at low complexity, non-thinking models are more accurate and token-efficient; (ii) at medium complexity, LRMs gain an advantage; (iii) at high complexity, both collapse to near-zero accuracy. Near the collapse point, LRMs allocate fewer thinking tokens despite ample budget, suggesting an inference-time scaling limit. The setup enables analysis of final answers and internal traces, revealing complexity-dependent failure patterns.
Novelty
- Controllable complexity via puzzle families enables clean comparisons and avoids benchmark contamination; results reported across matched thinking/no-thinking model pairs (e.g., Claude 3.7 Sonnet, DeepSeek R1/V3).
- Three-regime characterization of performance with complexity (low → non-thinking better; medium → thinking advantage; high → collapse), with visualizations across multiple puzzles.
- Reasoning-effort dynamics: thinking tokens initially rise with complexity, then decline near the collapse threshold.
- Thought-position analysis: correctness vs. position within the reasoning trace changes with complexity—early correct then overthinking at low N; later-arriving correct solutions at medium N; fixation on early errors at high N.
- Algorithm-execution check: even when given explicit solution algorithms to execute, models still fail at similar complexity points.
Learnings
- Low/medium/high regimes: non-thinking > thinking at low complexity; thinking helps at medium; both collapse at high.
- Collapse behavior: accuracy falls to near zero beyond a model-specific threshold; simultaneously, thinking tokens decrease instead of scaling with difficulty.
- Overthinking vs. fixation: at low complexity, models often find a correct plan early but continue exploring incorrect alternatives; at medium, correct solutions appear later; at high, models fixate on early wrong paths and waste budget.
- Exact algorithm execution remains brittle: providing step-by-step algorithms does not prevent failure, indicating limits in following and verifying logical procedures.
- Failure location: first failure moves occur far earlier than solution length and vary non-monotonically with complexity; error-free spans differ widely across puzzles.
- Controls: context limits and sampling do not explain collapse.
Notes for Application
- For document-reasoning pipelines, consider controlled difficulty sweeps (synthetic tasks with adjustable compositional depth) and track both final accuracy and where failures occur within multi-step outputs.
- Monitor inference-time effort (e.g., reasoning tokens or steps) versus task difficulty; a decline in effort near hard cases may indicate impending failure modes rather than efficiency.
- When adding “thinking” to models, compare against non-thinking baselines under equal compute; improvements may concentrate only in a medium-difficulty band.
inftythink
InftyThink
Paper: InftyThink
Presenter: Yosh
Note Taker: Ben
Date: March 14, 2025
Overview
Addresses “overthinking” and circular reasoning by constraining early-stage reasoning and restructuring the generation process. Proposes multi-step, chunked reasoning: produce a short reasoning segment, summarize it, append the summary to the prompt, and iterate until an answer or an iteration cap is reached. This creates a sawtooth context pattern (vs. steadily growing contexts), reduces memory/compute pressure from long-context attention, and aims to avoid truncation issues.
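A minimal sketch of the think → summarize → continue loop, assuming `generate` and `summarize` are stand-ins for model calls and the answer marker is our convention:

```python
def inftythink(question: str, generate, summarize, max_iters: int = 4) -> str:
    """Iterative reasoning in the InftyThink style: produce a short reasoning
    segment, summarize it, carry only the summary forward, and stop early if
    an answer appears. `generate` and `summarize` are placeholder model calls."""
    summary = ""
    for _ in range(max_iters):
        prompt = (f"Question: {question}\n"
                  f"Summary of reasoning so far: {summary or '(none)'}\n"
                  "Continue reasoning. If you are confident, end with 'ANSWER: ...'.")
        segment = generate(prompt)
        if "ANSWER:" in segment:
            return segment.split("ANSWER:", 1)[1].strip()
        # Summaries cover the reasoning only; the question (or source document)
        # stays in the prompt at every iteration, giving the sawtooth context pattern.
        summary = summarize(summary + "\n" + segment)
    return "no answer within iteration cap"
```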
Novelty
- Early-stage capacity restraint (“Occam’s Razor”) to counter the overthinking optimization problem.
- Iterative “think → summarize → continue” procedure with an iteration hyperparameter (n) as a ceiling (model may answer earlier).
- Incremental summaries are over reasoning steps (not document content); the source document remains present as the query at every step.
- SFT data construction: chunk r1 traces on paragraph/sentence boundaries, then summarize segments (e.g., with a larger model) to create supervision for the iterative procedure.
Learnings
- Reported AIME24 gains of ~5–10% after SFT; larger base models see smaller gains within that range.
- Shorter-context models benefit more, consistent with the generation algorithm’s memory profile.
- Even with a single iteration, accuracy at fixed completion length exceeds that of a standard reasoning model of the same length.
- Practical note: useful when required “thinking” exceeds available GPU memory.
Notes for Application
- Multi-step chunked reasoning may be feasible for long or multi-extraction document tasks: set an iteration cap (n), keep the full document in context, and summarize only the reasoning.
- Training data can mirror the paper’s approach: split existing CoT traces into segments and attach short summaries for SFT.
- Evaluate at equal completion lengths and on shorter-context deployments where the method is expected to help most.
ladder
LADDER
Paper: LADDER
Presenter: Hunter
Note Taker: Yosh
Date: March 21, 2025
Discussion Notes
Recursive generation and solving of progressively simpler variants of the original problem drives improved problem-solving capabilities
First takeaway: for RL to be effective, there must be a gradient of difficulty in the dataset samples; otherwise the model will catastrophically collapse (especially when the gap between simple and hard samples is large)
- We can do this manually, e.g. the chunking that’s already implemented in the Form reader project
LADDER consists of 3 components:
- Variant Generation
- Solution Verification
- Reinforcement Learning
Used the base model to generate mathematical transformations of integrals, categorized by their impact on problem difficulty. Then, for each integral, randomly sample 3–5 transformations and provide them as explicit suggestions to the variant generation model.
- Also utilized temperature cycling and persona-based prompting (e.g. “think like Euler or Gauss”), which improved performance
Quality control is essential in the variant generation process: small perturbations can make integrals intractable, or variants can end up harder than the original
Test-Time Reinforcement Learning (TTRL): perform LADDER at test time, retaining the post-trained model but discarding the test-time-tuned weights
Train on 10 problems and test on 100, generating 500 variants per problem
A best-of-N approach with N=1 and N=10 yielded 1% and 2% accuracy on the test set, respectively. RL without variants yielded 3%, while LADDER RL with variants reached 82%.
LADDER + TTRL attained 90% on the MIT Integration Bee
leaderboard-illusion
The Leaderboard Illusion
Paper: The Leaderboard Illusion
Presenter: Hunter
Note Taker: Ben
Date: May 30, 2025
Overview
Study of Chatbot Arena’s evaluation mechanics and policies. Finds that private variant testing by companies such as Meta, combined with selective disclosure, unequal sampling, and uneven deprecations, creates data-access asymmetries and biases Arena rankings. Shows that training on Arena-style data substantially improves performance on an Arena-derived test set while not improving (and sometimes reducing) performance on out-of-distribution benchmarks.
Novelty
- Documents an undisclosed practice where select providers privately test many model variants and publish only the best-scoring one; argues this violates unbiased sampling assumptions behind Bradley–Terry scoring.
- Quantifies data-access disparities: proprietary models receive a larger share of Arena traffic and feedback; estimates include ~20.4% of all data for OpenAI and ~19.2% for Google, while open-weight/open-source models collectively receive far less.
- Shows that uneven model deprecations and sampling can fragment the comparison graph and undermine Bradley–Terry reliability.
- Provides controlled simulations and real-world ablations (including identical checkpoints) showing that best-of-N private testing can inflate public Arena scores (a toy simulation follows this list).
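A toy simulation (not the paper's setup) of how publishing only the best of N privately tested variants inflates the apparent score relative to honest reporting:

```python
import random, statistics

random.seed(0)

def arena_score(true_skill: float, noise: float = 30.0) -> float:
    """One noisy leaderboard measurement of a variant with a given true skill."""
    return random.gauss(true_skill, noise)

true_skill = 1200.0
honest = [arena_score(true_skill) for _ in range(1000)]             # publish every run
best_of_10 = [max(arena_score(true_skill) for _ in range(10))       # privately test 10,
              for _ in range(1000)]                                 # publish only the best

print(f"honest mean     : {statistics.mean(honest):.1f}")
print(f"best-of-10 mean : {statistics.mean(best_of_10):.1f}")       # noticeably inflated
```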
Learnings
- Private best-of-N testing plus the option to retract or hide scores can raise a provider’s public Arena rating above its average variant quality; even identical checkpoints can land at meaningfully different scores due to sampling variance.
- Sampling and deprecation policies create persistent exposure gaps; over time, proprietary providers accumulate substantially more prompts/battles than open-weight/open-source entrants.
- Fine-tuning with increasing proportions of Arena-style data yields large relative win-rate gains on an Arena-derived set (e.g., moving from 0% to 70% Arena-mix roughly doubles win-rate), but generalization to external benchmarks (e.g., MMLU) does not improve in tandem.
- Leaderboard ranks can become unstable or unreliable when task distributions shift over time and previously compared models are removed from evaluation.
Notes for Application
- For internal evaluations, avoid best-of-N selection bias: publish scores for all tested variants or fix a pre-registered selection procedure.
- Track and cap model exposure in live A/Bs; ensure balanced sampling across providers/variants to preserve comparability.
- Maintain a connected and stable comparison graph over time; when deprecating models, do so proportionally across categories and preserve overlap for transitivity.
- When using live feedback data to fine-tune, validate gains on independent, out-of-distribution test suites to detect leaderboard-specific overfitting.
modernbert
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Presenter: Hunter
Note Taker:
Date: January 3, 2025
Discussion Notes
[Notes to be added]
paperbench
PaperBench: Evaluating AI’s Ability to Replicate AI Research
Paper: PaperBench: Evaluating AI’s Ability to Replicate AI Research
Presenter: Hunter
Note Taker:
Date: April 11, 2025
Discussion Notes
[Notes to be added]
reflect-retry-reward
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper: Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Presenter: Hunter
Note Taker: Ben
Date: June 20, 2025
Overview
Two-stage procedure: on failure, the model writes a brief self-reflection; the self-reflection is then added to context for a second attempt. If the second attempt succeeds, only the self-reflection tokens receive reward via GRPO. Requires a binary validator (success/fail). Reported gains include up to +34.7% on equation writing and +18.1% on function calling, with trained 1.5–7B models surpassing much larger baselines in some cases.
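A minimal sketch of restricting credit to the reflection tokens, assuming the reflection span's token indices are tracked during generation; GRPO normalization and the rest of the update are omitted:

```python
import torch

def reflection_only_advantages(seq_len, reflection_span, reward):
    """Spread a scalar reward over a generated sequence so that only the
    self-reflection tokens carry credit (the masked multi-step GRPO idea);
    the first and second attempts receive zero advantage.

    reflection_span : (start, end) token indices of the reflection segment
    reward          : 1.0 if the second attempt passed the validator, else 0.0
    """
    mask = torch.zeros(seq_len)
    start, end = reflection_span
    mask[start:end] = 1.0
    return mask * reward   # per-token advantages before GRPO normalization

# Example: 120 generated tokens, reflection occupies tokens 40..70,
# and the retry passed the binary validator.
adv = reflection_only_advantages(120, (40, 70), reward=1.0)
```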
Novelty
- Reward assigned only to reflection tokens when a retry fixes a failure (masked GRPO over multi-step generations).
- Multi-step GRPO implementation: first attempt → reflection → second attempt, with masking to isolate reflection tokens.
- “Dataset of failures” constructed by sampling multiple completions and keeping only verified failures for efficiency and clearer learning signals.
- Task-agnostic setup: relies on validators (function-call exact match; equation evaluates to target) rather than teacher models or synthetic data.
- Empirical comparisons include first/second-try performance pre/post training and an analysis of catastrophic forgetting (minimal changes on MMLU-Pro, GSM8K, HellaSwag, MATH).
Learnings
- Self-reflection becomes shorter and clearer after training; models improve even on first-try accuracy (without generating explicit reflections).
- On APIGen and Countdown, trained small models can outperform untrained much larger ones under two-attempt evaluation.
- Average improvements after training are substantial on both tasks; error profiles shift (e.g., fewer tool-choice errors in function calling; better number usage in equations).
- Minimal catastrophic forgetting relative to vanilla checkpoints under standard eval suites.
Notes for Application
- Feasible where binary validators exist, such as the loss runs verifier.
- Reflection-only reward offers a way to improve retry behavior without altering task-specific outputs directly.
- A “failures-only” training set could reduce compute and target the most informative examples in extraction or function-calling pipelines.
sft-memorizes-rl-generalizes
SFT Memorizes, RL Generalizes
Paper: SFT Memorizes, RL Generalizes
Presenter: Hunter
Note Taker: Yosh
Date: February 28, 2025
Discussion Notes
- Comparative study of SFT vs RL performance on a toy arithmetic reasoning task and a visual navigation task (textual and visual modalities).
- SFT quickly achieves good performance in-distribution due to memorization but generalizes poorly out-of-distribution. RL, on the other hand, generalizes well out-of-distribution while following a smoother learning curve in-distribution, indicating true progressive learning.
- Out-of-distribution performance of RL is very applicable to our use cases
- Demand forms classification and/or extractions, and loss runs extractions
- E.g., a form extraction model trained on a subset of form types and then tested on unseen form types (ACORDs)
- RL is more sample efficient than SFT, which is compatible with some of our lower sample count real datasets.
- We can use RL to develop other in-house models to complement our production models
- Verifier model to confirm annotations/catch potential errors
simplerlzoo
SimpleRL-Zoo
Paper: SimpleRL-Zoo
Presenter: Hunter
Note Taker: Ben
Date: March 28, 2025
Overview
The paper explores r1-style RL training and how base-model choice, reward design, and training curriculum affect downstream performance. It also examines the impact of supervised fine-tuning (SFT) versus RL-only training and proposes evaluation metrics that better capture reasoning behaviors.
Novelty
- Reports that pre-RL SFT limits exploration and caps performance.
- Emphasizes progressive difficulty scaling of training data rather than rigid supervision.
- Uses metrics beyond accuracy and response length:
- reasoning behavior ratio (backtracking, verification, subgoal setting, enumeration)
- clip ratio and average stopped length
- pass@k accuracy as an indicator of exploration (a minimal estimator appears after this list)
- Notes that strict formatting rewards and complex prompts can suppress exploration and induce “overthinking.”
- Shows strong effects from exploration-related hyperparameters (temperature, num_generations).
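Since pass@k is one of the listed metrics, here is a minimal implementation of the standard unbiased estimator from the code-generation literature, assuming n sampled completions per problem with c verified correct:

```python
from math import comb

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate given n samples with c correct:
    1 - C(n - c, k) / C(n, k)."""
    if n - c < k:
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)

# e.g., 16 samples per problem, 3 correct, report pass@8
print(pass_at_k(16, 3, 8))
```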
Learnings
- Rigid formatting disincentivizes exploration; flexibility supports richer reasoning.
- Response length is not a proxy for reasoning quality.
- Base-model choice matters; some models (e.g., Qwen) already exhibit strong math reasoning, reducing RL gains.
- SFT before RL can reduce accuracy and reasoning sophistication, leading to early plateaus.
- RL-only training adapts more freely and can produce the reported “aha” behaviors.
- If SFT is used, it should be framed to encourage exploration, not constrain it.
Notes for Application
- Reward design: simple correctness-based rewards are viable; extensions could be explored.
- Training strategy: construct datasets where difficulty ramps with model capability.
- Hyperparameters: increasing num_generations and using higher temperatures (>1) promote exploration.
singlora
SingLoRA: Low Rank Adaptation Using a Single Matrix
Paper: SingLoRA
Presenter: Hunter
Note Taker: Ben
Date: July 11, 2025
Overview
Reformulates LoRA by replacing the two-matrix update (W_0 + BA) with a single low-rank, symmetric update (W_0 + \tfrac{\alpha}{r}\,u(t)\,AA^\top). The goal is to remove inter-matrix scale conflicts, improve training stability in the infinite-width analysis, and cut adapter parameters by about half. The work extends to non-square weight matrices, argues that attention expressiveness is preserved despite symmetric updates, and reports gains on GLUE (RoBERTa, GPT-2) and MNLI with LLaMA-7B, plus diffusion fine-tuning on DreamBooth (Tables 1–3; Figure 2).
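A minimal PyTorch sketch of the single-matrix adapter for a square layer; the step bookkeeping and initialization scale are our simplifications, and the rectangular extension is omitted:

```python
import torch
import torch.nn as nn

class SingLoRALinear(nn.Module):
    """Square linear layer with a single-matrix low-rank adapter:
    W = W0 + (alpha / r) * u(t) * A @ A.T, with ramp-up u(t) = min(t / T, 1).
    Since u(0) = 0, the base model is unchanged at step 0."""

    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0, ramp_steps: int = 1000):
        super().__init__()
        assert base.in_features == base.out_features, "sketch covers the square case only"
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)                    # freeze pretrained weights
        self.A = nn.Parameter(torch.randn(base.in_features, r) * 0.01)
        self.alpha, self.r, self.ramp_steps = alpha, r, ramp_steps
        self.register_buffer("step", torch.zeros(()))  # incremented per training forward pass

    def forward(self, x):
        u = torch.clamp(self.step / self.ramp_steps, max=1.0)   # ramp-up scalar u(t)
        delta = (self.alpha / self.r) * u * (self.A @ self.A.T)
        if self.training:
            self.step += 1
        return self.base(x) + x @ delta                # delta is symmetric
```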
Novelty
- Single-matrix low-rank adapter with ramp-up scalar (u(t)=\min(t/T,1)); initialization keeps the base model unchanged at step 0.
- Stability analysis: shows stable feature learning with a learning-rate scale of (O(n^{-1/2})) in a toy model; avoids the scale-mismatch seen in LoRA’s (A,B) updates.
- Transformation-invariance for the parameterization is proved under standard first-order optimizers.
- Extension to rectangular layers via truncated (A^*) so (W_0 + A^*A^\top) applies to common transformer blocks.
- Attention expressiveness: although updates are symmetric per matrix, (QK^\top) interactions are not constrained to be symmetric, so general patterns are learnable (Figure 1).
Learnings
- Reported GLUE results: mean accuracy improves over LoRA and matches/exceeds LoRA+ and DoRA with roughly half the adapter parameters (Table 1).
- LLaMA-7B on MNLI: 91.3% vs. LoRA 89.1, LoRA+ 90.2, DoRA 90.6, with ~40% fewer trainable params (Table 2).
- Learning-rate robustness: accuracy varies ~1% across LR sweeps vs. ~4.8% for LoRA (Figure 2).
- DreamBooth: higher DINO similarity at the same or lower parameter budgets; qualitative examples retain subject details (Table 3; Figure 3).
- Practical read: fewer hyperparameter sensitivities (no split LR for (A,B)), reduced parameter budgets, and stable convergence with standard optimizers.
Notes for Application
- For document-processing finetunes, a single-matrix adapter could reduce trainable parameters and lower sensitivity to learning-rate choice while retaining model capacity in attention blocks.
- When PEFT stability is a concern (e.g., smaller batches or limited LR sweeps), the symmetric (AA^\top) update may help avoid scale-mismatch issues seen in two-matrix LoRA.
- If adapters are deployed across multiple transformer layers, the rectangular extension suggests the approach is compatible with common projection shapes in extraction models.
tina
Tina: Tiny Reasoning Models via LoRA
Paper: Tina: Tiny Reasoning Models via LoRA
Presenter: Hunter
Note Taker:
Date: May 2, 2025
Discussion Notes
[Notes to be added]
unintentional-unalignment
Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
Paper: Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
Presenter: Nikhil
Note Taker: Yosh
Date: April 18, 2025
Discussion Notes
- During training, it can be observed that models unintentionally align themselves toward undesirable responses when using DPO
- During DPO, the algorithm attempts to increase the gap between desired and undesired responses, which was ostensibly believed to occur by increasing and decreasing their respective probabilities
- It was found that both probabilities actually decrease; one just decreases more sharply than the other, still expanding the delta (a logging sketch for tracking this appears after these notes)
- This can yield ‘catastrophic responses’, where the model then shifts probability onto other, potentially dispreferred tokens
- This phenomenon emerges in very simple settings
- At large scales this can cause the entire model to become unintentionally misaligned
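A minimal sketch for observing the displacement during training: the standard DPO loss computed from summed sequence log-probs, returning the raw chosen/rejected values so both can be logged over time (tensor shapes and names are assumptions):

```python
import torch
import torch.nn.functional as F

def dpo_loss_with_logging(chosen_logps, rejected_logps,
                          ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Standard DPO loss over summed sequence log-probs, plus the raw
    chosen/rejected log-probs so likelihood displacement can be tracked:
    during training both may decrease even as their gap widens."""
    margin = beta * ((chosen_logps - ref_chosen_logps)
                     - (rejected_logps - ref_rejected_logps))
    loss = -F.logsigmoid(margin).mean()
    stats = {
        "chosen_logp": chosen_logps.mean().item(),      # watch whether this trends down
        "rejected_logp": rejected_logps.mean().item(),  # ...and whether this falls faster
        "margin": margin.mean().item(),
    }
    return loss, stats
```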
visual-rft
Visual-RFT
Paper: Visual-RFT
Presenter: Yosh
Note Taker: Hunter
Date: March 7, 2025
Discussion Notes
- DeepSeek GRPO style RL applied to VLMs (open vocab classification, object detection).
- Long tail / rare object detections are learned more easily with RFT. This should apply to our use cases with generalization to new forms (ACORDs and more generally). Also, this approach would help with our checkbox detection use cases, even when checkboxes are funky looking.
- They have the model produce confidence scores as numbers in its generations and then parse those numbers, incorporating them into the reward. A cool idea to try, beyond just visual applications (a parsing/reward sketch appears after these notes).
- We talked about having a smaller discretization than in this paper. Seems to be a bit extreme to do 100 bins. However, the idea makes practical and intuitive sense. You can’t do this with SFT, but you can with RFT.
- No need for differentiable rewards, so can easily incorporate confidence. In fact, one can have an “episode” where many predictions are done and ECE is minimized. Would require a batch based reward possibly? But GRPO is already configured in groups, so may be straightforward.
- Seems a straightforward way to get better document-level and table-level confidence scores. For tables, we could also analyze things row-wise and column-wise.
- Stark differences in performance between RFT and SFT. This is a good sign for us. The 2B parameter model makes massive gains with RFT. Smaller models can become quite capable with RFT. When scaling up model size, the gap between SFT and RFT shrinks but is still significant.
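A minimal sketch of parsing a generated confidence value and folding it into a verifiable reward; the output format and Brier-style penalty are our choices, not the paper's exact reward:

```python
import re
from typing import Optional

def parse_confidence(generation: str) -> Optional[float]:
    """Pull a 'confidence: 0.87'-style number out of the model's output."""
    m = re.search(r"confidence\s*[:=]\s*([01](?:\.\d+)?)", generation, re.IGNORECASE)
    return float(m.group(1)) if m else None

def confidence_reward(generation: str, is_correct: bool) -> float:
    """Correctness reward plus a calibration term: confident-and-right is best,
    confident-and-wrong is penalized (a Brier-style penalty, our choice)."""
    conf = parse_confidence(generation)
    base = 1.0 if is_correct else 0.0
    if conf is None:
        return base - 0.1            # small format penalty when confidence is missing
    return base - (conf - base) ** 2

print(confidence_reward("answer: B, confidence: 0.9", is_correct=True))   # 0.99
print(confidence_reward("answer: B, confidence: 0.9", is_correct=False))  # -0.81
```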
xrag
xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token
Paper: xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token
Presenter: Olivia
Note Taker: Hunter
Date: May 16, 2025
Overview
Compresses each retrieved document into a single token and conditions the LLM on that token instead of the full text. Training is staged: (1) paraphrase-style pretraining to associate an embedding with its source document, then (2) context-aware instruction tuning using only the embedding, including a self-distillation step against a standard RAG setup. Reported results are near or above regular RAG on several QA datasets; authors argue full-text RAG can be distracted by extra snippet text.
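A minimal sketch of the one-token injection idea: project a retriever embedding into the LLM's token-embedding space and prepend it to the question embeddings; the projector shape and dimensions are assumptions, not the paper's exact architecture:

```python
import torch
import torch.nn as nn

class XragProjector(nn.Module):
    """Map a retriever embedding into the LLM's token-embedding space so the
    retrieved document is represented by a single soft token."""

    def __init__(self, retriever_dim: int = 768, llm_dim: int = 4096):
        super().__init__()
        self.proj = nn.Sequential(
            nn.Linear(retriever_dim, llm_dim), nn.GELU(), nn.Linear(llm_dim, llm_dim)
        )

    def forward(self, doc_embedding: torch.Tensor) -> torch.Tensor:
        return self.proj(doc_embedding)                 # (batch, llm_dim)

def build_inputs(question_embeds: torch.Tensor, doc_token: torch.Tensor) -> torch.Tensor:
    """Prepend the one-token document representation to the question's token
    embeddings; the LLM is then run on inputs_embeds instead of full passages."""
    return torch.cat([doc_token.unsqueeze(1), question_embeds], dim=1)
```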
Novelty
- One-token document representation injected into the LLM in place of full passages.
- Paraphrase pretraining to align vectors with documents.
- Context-only instruction tuning plus self-distillation from a full-text RAG teacher.
Learnings
- Can roughly match or surpass standard RAG on key QA benchmarks despite using only a single token per document.
- Claimed robustness when retrieved text contains distracting or irrelevant content.
- Open questions remain for multi-hop reasoning and detail-critical, novel documents.
Notes for Application
- Could be relevant for very long documents or table-heavy pages that often exceed context budgets.
- Likely unnecessary for small documents and short outputs.
- Worth comparing against other RAG-compression approaches if fine-grained details must be preserved.