Towards Fine-Grained Multi-Dimensional Speech Understanding: Data Pipeline, Benchmark, and Model
ASLP-lab
AI & ML interests
None yet
Recent Activity
updated a dataset about 24 hours ago: ASLP-lab/MSU-Benchmark
published a dataset 1 day ago: ASLP-lab/MSU-Benchmark
updated a Space 4 days ago: ASLP-lab/YingMusic-Singer-Plus
Organizations
None yet
spaces 8
YingMusic-Singer-Plus 🎤
Paused • 8 • Edit lyrics, keep the melody
WenetSpeech Yue 🔥
Runtime error • 12 • Large-Scale Cantonese Speech Corpus
VoiceSculptor 📚
Runtime error • 2
DiffRhythm2 🎵
Running on Zero • 44 • Generate a full song from lyrics and style prompts
SongFormer 🎵
Configuration error • 22 • State-of-the-art music analysis with multi-scale datasets
Di♪♪Rhythm 🎶
Running on Zero • Featured • 688 • Blazingly Fast and Embarrassingly Simple Song Generation
models 35
ASLP-lab/FM-Speech
Audio Classification • 35B • Updated • 22 • 2
ASLP-lab/SongFormer
0.7B • Updated • 461 • 17
ASLP-lab/Speaker-Reasoner
32B • Updated • 38 • 2
ASLP-lab/Speaker-Reasoner-4194h
32B • Updated • 50 • 1
ASLP-lab/YingMusic-Singer-Plus
0.7B • Updated • 2.8k • 7
ASLP-lab/OmniCodec
Feature Extraction • Updated • 2
ASLP-lab/OSUM-Pangu
Audio-to-Audio • Updated • 2
ASLP-lab/VoiceSculptor-VD
Text-to-Speech • 4B • Updated • 24 • 18
ASLP-lab/WenetSpeech-Wu-Speech-Understanding
Updated • 2
ASLP-lab/WenetSpeech-Wu-Speech-Generation
Text-to-Speech • Updated • 3
datasets 22
ASLP-lab/MSU-Benchmark
Viewer • Updated • 2.85k • 105
ASLP-lab/FMSU-Bench
Viewer • Updated • 24.1k • 3.8k • 1
ASLP-lab/UrduSpeech
Viewer • Updated • 73.5k • 2.21k • 1
ASLP-lab/HumDial-EIBench
Viewer • Updated • 1 • 467 • 2
ASLP-lab/FastTurn-Testset
Updated • 191
ASLP-lab/SongFormDB
Updated • 54.2k • 9
ASLP-lab/SongFormBench
Viewer • Updated • 3.82k • 425 • 3
ASLP-lab/HumDial-FDBench
Updated • 175 • 3
ASLP-lab/WSC-Train
Preview • Updated • 404 • 132
ASLP-lab/LyricEditBench
Viewer • Updated • 7.2k • 206 • 2