Machine Learning Papers

Last 14 Days (May 20 – June 02, 2026)

🏆 Top Papers This Week

#1 TOP PAPER (Score: 92)
Shihao Wang, Shilong Liu, Yuanguo Kuang ... · arXiv
Vision-language models (VLMs) commonly formulate visual grounding and detection as a coordinate-token generation problem, serializing each 2D box into multiple 1D tokens that are learned and decoded largely independently. This token-by-token decoding mismatches the coupled struct...
#2 TOP PAPER (Score: 90)
Hidir Yesiltepe, Jiazhen Hu, Tuna Han Salih Meral ... · arXiv
Long-rollout causal video diffusion has converged on a fixed-size sliding-window KV cache, with recent progress innovating within this layout by changing which tokens occupy the window or how their positions are encoded. The per-head KV layout itself, a dominant contributor to st...
#3 TOP PAPER (Score: 89)
Dayal Singh Kalra, Maissam Barkeshli · arXiv
Hyperparameter transfer allows extrapolating optimal optimization hyperparameters from small to large scales, making it critical for training large language models (LLMs). This is done either by fitting a scaling law to the hyperparameters or by a judicious choice of parameteriza...