Machine Learning Papers

Last 7 Days (March 31 – April 06, 2026)

🏆 Top Papers This Week

#1 TOP PAPER (Score: 69)
David Ilić, Kostadin Cvejoski, David Stanojević ... · arXiv
All prior membership inference attacks for fine-tuned language models use hand-crafted heuristics (e.g., loss thresholding, Min-K%, reference calibration), each bounded by the designer's intuition. We introduce the first transferable learned attack, enabled by the observation th...
#2 TOP PAPER (Score: 68)
Aengus Lynch · arXiv
Autonomous AI agents are being deployed with filesystem access, email control, and multi-step planning. This thesis contributes to four open problems in AI safety: understanding dangerous internal computations, removing dangerous behaviors once embedded, testing for vulnerabiliti...
#3 TOP PAPER (Score: 68)
Aleksandar Cvejic, Rameen Abdal, Abdelrahman Eldesokey ... · arXiv
When evaluating identity-focused tasks such as personalized generation and image editing, existing vision encoders entangle object identity with background context, leading to unreliable representations and metrics. We introduce the first principled framework to address this vuln...