All Publications
2026
-
-
Efficient Test-Time Scaling via Self-Calibration
International Conference on Learning Representations (ICLR)
2025
-
Reliable and Responsible Foundation Models
Transactions on Machine Learning Research (TMLR)
-
-
-
CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation
Second Conference on Language Modeling (COLM)
-
Taming Overconfidence in LLMs: Reward Calibration in RLHF
International Conference on Learning Representations (ICLR)
2024
-
S2FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity
Conference on Neural Information Processing Systems (NeurIPS)
-
-
Choosing Wisely and Learning Deeply: Selective Cross-Modality Distillation via CLIP for Domain Generalization
Transactions on Machine Learning Research (TMLR)