NavSpace

How Navigation Agents Follow Spatial Intelligence Instructions

Visual Representation Icon
NavSpace: We introduce the first spatial intelligence benchmark for instruction-based navigation: NavSpace.
Connector Design Icon
Evaluation: we comprehensively evaluate 22 navigation agents in total, containing models from open-source MLLM, proprietary MLLM, lightweight navigation models to navigation large models.
Speech Logo
Discussions: Based on the evaluation results, we conducted a detailed analysis of the limitations of existing methods and distilled four key insights.
Vision Logo
Baseline Model: SNav: We propose SNav, a spatially intelligent navigation model, that surpasses existing models and establishes a strong baseline for NavSpace and real robot tests.
Teaser Image

1CFCS, School of Computer Science, Peking University    2PKU-Agibot Lab    3Shanghai AI Laboratory

*Equal contribution Corresponding author
Vision Logo Overall Introduction
Video
Visual Representation Logo NavSpace
Introduction
Evaluation Logo Evaluation
Protocol
Leaderboard Logo NavSpace
Leaderboard

Click to jump to each section.


Overall Introduction Video

NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions

Visual-Spatial Intelligence Teaser
Figure 1: (Left) NavSpace tasks. (Right) Evaluation results compare various models with baseline SNav.