Popular repositories Loading
-
GUI-Agents-Paper-List
GUI-Agents-Paper-List PublicBuilding a comprehensive and handy list of papers for GUI agents
-
TravelPlanner
TravelPlanner Public[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
-
MagicBrush
MagicBrush Public[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".
Repositories
- GUI-Drag Public
OSU-NLP-Group/GUI-Drag’s past year of commit activity - Explorer Public
[ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
OSU-NLP-Group/Explorer’s past year of commit activity - LLM-IOAA Public
Code and data for the paper "Large Language Models Achieve Gold Medal Performance at the International Olympiad on Astronomy & Astrophysics (IOAA)" (https://arxiv.org/abs/2510.05016).
OSU-NLP-Group/LLM-IOAA’s past year of commit activity - TravelPlanner Public
[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
OSU-NLP-Group/TravelPlanner’s past year of commit activity - WebDreamer Public
[TMLR'25] "Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents"
OSU-NLP-Group/WebDreamer’s past year of commit activity - Mind2Web-2 Public
[NeurIPS 2025] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge
OSU-NLP-Group/Mind2Web-2’s past year of commit activity - HippoRAG Public
[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.
OSU-NLP-Group/HippoRAG’s past year of commit activity - ScienceAgentBench Public
[ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
OSU-NLP-Group/ScienceAgentBench’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…