Agentic-MME: What Agentic Capability Really Brings to Multimodal Intelligence? Paper • 2604.03016 • Published 3 days ago • 15
AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents Paper • 2604.02947 • Published 3 days ago • 1
Xpertbench: Expert Level Tasks with Rubrics-Based Evaluation Paper • 2604.02368 • Published 10 days ago • 2
CoME-VL: Scaling Complementary Multi-Encoder Vision-Language Learning Paper • 2604.03231 • Published 3 days ago
InCoder-32B-Thinking: Industrial Code World Model for Thinking Paper • 2604.03144 • Published 3 days ago • 1
Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning Paper • 2604.02007 • Published 4 days ago • 6
Paper Reconstruction Evaluation: Evaluating Presentation and Hallucination in AI-written Papers Paper • 2604.01128 • Published 5 days ago • 12
Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants Paper • 2604.00842 • Published 5 days ago • 10
Embarrassingly Simple Self-Distillation Improves Code Generation Paper • 2604.01193 • Published 5 days ago • 25
HippoCamp: Benchmarking Contextual Agents on Personal Computers Paper • 2604.01221 • Published 5 days ago • 26
GaussianGPT: Towards Autoregressive 3D Gaussian Scene Generation Paper • 2603.26661 • Published 10 days ago • 21
Meta-Harness: End-to-End Optimization of Model Harnesses Paper • 2603.28052 • Published 7 days ago • 12
VectorGym: A Multitask Benchmark for SVG Code Generation, Sketching, and Editing Paper • 2603.29852 • Published Feb 22 • 6
Learn2Fold: Structured Origami Generation with World Model Planning Paper • 2603.29585 • Published Feb 2 • 15
MolmoPoint: Better Pointing for VLMs with Grounding Tokens Paper • 2603.28069 • Published 7 days ago • 7
HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention Paper • 2603.28458 • Published 7 days ago • 39
DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing Paper • 2603.28713 • Published 7 days ago • 18
Gen-Searcher: Reinforcing Agentic Search for Image Generation Paper • 2603.28767 • Published 7 days ago • 55