Navigation
Recent Posts
Ben Elliott

Post-train a Model to Fish

We demonstrate how a specialized 25B parameter Mistral model, post-trained on domain-specific data, can outperform Google's Gemini 2.5 Flash by double-digit margins on insurance loss run extraction tasks.
12 min read
Hunter Heidenreich

LLM Calibration and Confidence Estimation

Explore the critical challenge of uncertainty quantification in large language models. Learn about confidence estimation techniques, calibration metrics like ECE and MCE, and practical methods to improve model reliability from logit-based approaches to ensemble methods and post-hoc calibration.
15 min read

Pagination