publications
publications by category in reverse chronological order. generated by jekyll-scholar.
2026
- Preprint. On the Complexity of Offline Reinforcement Learning with Q*-Approximation and Partial Coverage. 2026
- ICLR. An Improved Model-Free Decision-Estimation Coefficient with Applications in Adversarial MDPs. ICLR, 2026
- ICLR. CDE: Curiosity-driven Exploration for Efficient Reinforcement Learning in Large Language Models. ICLR, 2026
2025
- MATH-AI. Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation. NeurIPS 2025 MATH-AI Workshop, 2025
- MATH-AI
- Preprint
- COLT
- AAAI
2024
- NeurIPS
- NeurIPS. Corruption-Robust Linear Bandits: Minimax Optimality and Gap-Dependent Misspecification. NeurIPS, 2024
- ICLR
2023
- NeurIPS
- AAMAS/ECAI. Diffusion multi-unit auctions with diminishing marginal utility buyers. AAMAS (also in ECAI), 2023