Publications
Full list of research publications, ordered by year.
2026
-
China's AI HeistForeign Affairs 2026
-
TRACE: Capability-Targeted Agentic TrainingPreprint 2026
-
OpenJarvis: Personal AI, On Personal DevicesPreprint 2026
2025
-
Intelligence per Watt: Measuring Intelligence Efficiency of Local AIPreprint 2025
-
Weaver: Shrinking the Generation-Verification Gap with Weak VerifiersNeurIPS 2025
-
LMUnit: Fine-grained Evaluation with Natural Language Unit TestsEMNLP 2025
-
Archon: An Architecture Search Framework for Inference-Time TechniquesICML 2025ICLR 2025 Scaling Self-Improving Foundation Models without Human Supervision (SSI FM) Oral Presentation
2024
-
PDFTriage: Question Answering over Long, Structured DocumentsEMNLP Industry 2024
-
Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERTICML 2024
-
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation SystemsNAACL 2024 Oral Presentation
2023
-
UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of RerankersEMNLP 2023
-
Moving Beyond Downstream Task Accuracy for Information Retrieval BenchmarkingACL Findings 2023
-
Embedding Recycling for Language ModelsEACL 2023
2022
-
ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics 2022
-
A Search Engine for Discovery of Scientific Challenges and DirectionsAAAI Conference on Artificial Intelligence 2022 Oral Presentation
2021
-
Argo Scholar: Interactive Visual Exploration of Literature in BrowsersIEEE Visualization Conference 2021 Best Poster, Honorable Mention
-
Large-Scale Analysis of Career Transitions: The Impact of Human Capital, Job History, and Language FactorsPre-print 2021
-
EnergyVis: Interactively Tracking and Exploring Energy Consumption for ML ModelsCHI '21 Extended Abstracts 2021
2020
-
Examining the Ordering of Rhetorical Strategies in Persuasive RequestsFindings of the Association for Computational Linguistics: Empirical Methods in Natural Language Processing 2020
-
Mapping Researchers with PeopleMapIEEE Visualization Conference 2020 Best Poster, Honorable Mention