Text Generation
Safetensors
English
Chinese
qwen3
reward-model
rlhf
principle-following
qwen
conversational
Instructions to use WisdomShell/RewardAnything-8B-v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Inference
Will you publish larger models, or training dataset?
1
#3 opened 11 months ago
by
zhanghaoie
Is RewardAnything also effective for model evaluation and comparison? Like llm as a judge?
6
#2 opened 11 months ago
by
BILL-SUN318
Chinese support
1
#1 opened 11 months ago
by
noahYHyy