WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation
Paper β’ 2605.25874 β’ Published β’ 40
Computer Vision
RIVER: A Real-Time Interaction Benchmark for Video LLMs
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision