Explorer

Root / papers / text_metadata / 2026Q2

.. (Parent Directory)
0406 Zhang - Embarrassingly Simple Self-Distillation Improves Code Generation.txt
0414 Goyal - Distilled Pretraining- A modern lens of Data, In-Context Learning and Test-Time Scaling.txt
0418 Yuksekgonul - Learning to Discover at Test Time.txt
0426 Morosini - Too Sharp, Too Sure- When Calibration Follows Curvature.txt
0520 Mohri - A Bitter Lesson for Data Filtering.txt
0525 Lu - Strong Teacher Not Needed? On Distillation in LLM Pretraining.txt
0601 Huang - Why Larger Models Learn More- Effects of Capacity, Interference, and Rare-Task Retention.txt
0614 Hotsko - Code2LoRA- Hypernetwork-Generated Adapters for Code Language Models under Software Evolution.txt