Training GUI agents with augmented reasoning data and a tailored post-training recipe
Rui Yang
Ray2333
AI & ML interests
Deep Reinforcement Learning
Recent Activity
updated a model about 16 hours ago: OpenWebRL/OpenWebRL-4B-SFT
published a model about 16 hours ago: OpenWebRL/OpenWebRL-4B-SFT
updated a dataset about 17 hours ago: Ray2333/Judge_data_plus