Extending Embodied World Model Benchmarking on Modality, Functionality and Platform
ModalityFunctionalityPlatform
WorldArena 2.0 Leaderboard evaluates embodied world
models across simulator video quality, interactive RL environments,
visuo-tactile manipulation, and real-robot action planning.
Track 1: Simulator video-quality evaluation using the WorldArena perceptual metrics, including the benchmark's difficulty and out-of-distribution corrections.