Publications

(2025). When Reasoning Meets Compression: Understanding the Effects of LLMs Compression on Large Reasoning Models. Preprint, 2025.

PDF Code

(2025). Generalizable Process Reward Models via Formally Verified Training Data. Preprint, 2025.

PDF Code

(2025). HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?. ICCV, 2025.

PDF Code Dataset Project

(2025). TaxoAdapt: Aligning LLM-Based Multidimensional Taxonomy Construction to Evolving Research Corpora. ACL, 2025.

PDF

(2025). SiReRAG: Indexing Similar and Related Information for Multihop Reasoning. ICLR, 2025.

PDF Code

(2024). LLMs assist NLP Researchers: Critique Paper (Meta-) Reviewing. EMNLP, 2024.

PDF

(2024). When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs. TACL, 2024.

PDF

(2024). Evaluating LLMs at Detecting Errors in LLM Responses. COLM, 2024.

PDF Code Dataset

(2024). Pruning as a Domain-specific LLM Extractor. NAACL Findings, 2024.

PDF Code

(2024). Fair Abstractive Summarization of Diverse Perspectives. NAACL, 2024.

PDF Code

(2024). PEaCE: A Chemistry-Oriented Dataset for Optical Character Recognition on Scientific Documents. LREC-COLING, 2024.

PDF Code

(2024). Beyond Efficiency: A Systematic Survey of Resource-Efficient Large Language Models. Preprint, 2024.

PDF

(2023). FaMeSumm: Investigating and Improving Faithfulness of Medical Summarization. EMNLP, 2023.

PDF Code