Publications

Publications by category in reverse chronological order. An asterisk (*) denotes equal contribution.

2025

  1. ICLR
    Harnessing Webpage UIs for Text-Rich Visual Understanding
    Junpeng Liu, Tianyue Ou*, Yifan Song*, Yuxiao Qu*, Wai Lam, Chenyan Xiong, Wenhu Chen, Graham Neubig, and Xiang Yue
    ICLR, 2025
  2. preprint
    VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge
    Tianyue Ou*, Yueqi Song*, Yibo Kong, Zecheng Li, Graham Neubig, and Xiang Yue
    ArXiv, 2025

2024

  1. NeurIPS
    Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at Scale
    Tianyue Ou, Frank F. Xu, Aman Madaan, Jiarui Liu, Robert Lo, Abishek Sridhar, Sudipta Sengupta, Dan Roth, Graham Neubig, and Shuyan Zhou
    NeurIPS, 2024
  2. NAACL Demo Track
    CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation
    Faria Huq, Zora Zhiruo Wang, Frank F. Xu, Tianyue Ou, Shuyan Zhou, Jeffrey P. Bigham, and Graham Neubig
    2024

2023

  1. preprint
    An In-depth Look at Gemini’s Language Abilities
    Syeda Nahida Akter*, Zichun Yu*, Aashiq Muhamed*, Tianyue Ou*, Alex Bauerle, Alexander Cabrera, Krish Dholakia, Chenyan Xiong, and Graham Neubig
    ArXiv, 2023
  2. ICLR
    WebArena: A Realistic Web Environment for Building Autonomous Agents
    Shuyan Zhou, Frank F. Xu, Hao Zhu, Xuhui Zhou, Robert Lo, Abishek Sridhar, Xianyi Cheng, Tianyue Ou, Yonatan Bisk, Daniel Fried, and 2 more authors
    ICLR, 2023