Benchmark Tracker

Leaderboard

Data sourced from the AllenAI VLA Evaluation Harness.

Evaluation protocols are not fully standardized across all benchmarks. Scores may not always be directly comparable; check benchmark notes and source-table metadata before drawing conclusions.

trend

Benchmark Trend

[Chart: benchmark trend. Series shown: comparable scored models; state-of-the-art frontier.]

scores

Leaderboard Table
