These are the checkpoints and dataset for: From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning
AI & ML interests
Large Language Models
Papers
From Trainee to Trainer: LLM-Designed Training Environment for RL with Multi-Agent Reasoning
Attention Amnesia in Hybrid LLMs: When CoT Fine-Tuning Breaks Long-Range Recall, and How to Fix It
These are the checkpoints and dataset for: EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL.
EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL
Paper • 2605.18703 • Published • 50
LARK-Lab/EnvFactory-1.7B
Text Generation • 2B • Updated • 92
LARK-Lab/EnvFactory-4B
Text Generation • 4B • Updated • 6
LARK-Lab/EnvFactory-8B
Text Generation • 8B • Updated • 3
models 8
LARK-Lab/Trainee2Trainer
Text Generation • 4B • Updated • 15 • 1
LARK-Lab/SWITCH-Phase3-GRPO-LoRA-Qwen3-8B
Text Generation • Updated • 15
LARK-Lab/EnvFactory-8B
Text Generation • 8B • Updated • 3
LARK-Lab/EnvFactory-4B
Text Generation • 4B • Updated • 6
LARK-Lab/EnvFactory-1.7B
Text Generation • 2B • Updated • 92
LARK-Lab/CodeScaler-8B
Text Classification • 8B • Updated • 38
LARK-Lab/CodeScaler-4B
Text Classification • 4B • Updated • 6
LARK-Lab/CodeScaler-1.7B
Text Classification • 2B • Updated • 7
datasets 9
LARK-Lab/MAPF-FrozenLake-Benchmark
Updated • 34 • 1
LARK-Lab/EnvFactory-SFT-DeepSeekV4Flash-OpenAI
Viewer • Updated • 3.27k • 50
LARK-Lab/EnvFactory-SFT-DeepSeekV4Flash
Updated • 38
LARK-Lab/SWITCH-Math-Train
Viewer • Updated • 45.8k • 43
LARK-Lab/EnvFactory-RL
Viewer • Updated • 3.09k • 68
LARK-Lab/EnvFactory-SFT-FILTERED
Viewer • Updated • 26.5k • 123
LARK-Lab/EnvFactory-SFT-ALL
Viewer • Updated • 53.4k • 70
LARK-Lab/FormalRx-Test
Viewer • Updated • 7.03k • 717 • 2
LARK-Lab/CodeScalerPair-51K
Viewer • Updated • 51.1k • 28 • 1