Shulin Tian
shulin002 [at] ntu.edu.sg
I am a PhD student at Nanyang Technological University (NTU), Singapore, supervised by Prof. Ziwei Liu and Dr. Hongyuan Zhu.
Previously, I obtained my Bachelor's degree from NTU. I also had a wonderful time working with Prof. Ranjay Krishna at the University of Washington on vision-language model reasoning, and with Prof. Bihan Wen at NTU on low-light image enhancement.
News
[06/2025] Evaluation Agent was selected for an oral presentation at ACL 2025. Congrats to all coauthors!
[06/2025] We release Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning. Code and data can be found here.
[05/2025] The MMInA leaderboard is now live on the MMInA project page.
[05/2025] Two papers are accepted to ACL 2025 (one main and one findings).
[03/2025] I was recognized as an outstanding reviewer for ICLR 2025 [SCOPE Workshop].
[01/2025] Our paper "AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation" is accepted to ICLR 2025.
[12/2024] Our paper "Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models" is released.
[08/2024] Starting my PhD at MMLab@NTU.
[04/2024] Our paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" is released.
[06/2023] Our paper "Enhancing Low-Light Images Using Infrared-Encoded Images" is accepted to ICIP 2023.
Publications
(* equal contributions)
Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning
arXiv, 2025
Paper / Project Page / Code / Data
Area: Agentic tool-use, long video reasoning, egocentric
MMInA: Benchmarking Multihop Multimodal Internet Agents
ACL Findings, 2025
Paper / Project Page / Code / Data
Area: Multimodal agent benchmark on long-horizon reasoning
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models
ACL Main, 2025 (Oral)
Paper / Project Page / Code
Area: Agent, GenAI
AHA: A Vision-Language-Model for Detecting and Reasoning Over Failures in Robotic Manipulation
ICLR, 2025
Paper / Project Page
Area: Robotics, VLM
Enhancing Low-Light Images Using Infrared-Encoded Images
ICIP, 2023
Paper / Code / Data
Area: Low-light image enhancement
Education
Nanyang Technological University
PhD in Computer Science
Aug. 2024 - Present
Nanyang Technological University
BEng in Electrical & Electronic Engineering (Highest Distinction)
Aug. 2020 - May 2024
Miscellanea
When it comes to music, I do:
When it comes to sports, I always try new things and do:
🤿 Diving: PADI-certified Open Water Diver (2022) and Advanced Open Water Diver (2024)
🧷 Others: badminton, hiking...