haoheliu's picture

haoheliu

haoheliu

·

haoheliu

AI & ML interests

None yet

Recent Activity

liked a Space about 2 months ago

mimbres/YourMT3

liked a Space 3 months ago

erastorgueva-nv/NeMo-Forced-Aligner

liked a model 3 months ago

distil-whisper/distil-large-v3.5

View all activity

Organizations

authored 2 papers 10 months ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11, 2025 • 71

Audio-FLAN: A Preliminary Release

Paper • 2502.16584 • Published Feb 23, 2025 • 36

authored 3 papers over 1 year ago

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Paper • 2407.14329 • Published Jul 19, 2024 • 5

SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound

Paper • 2405.00233 • Published Apr 30, 2024 • 17

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23, 2024 • 32

authored 5 papers over 2 years ago

Retrieval-Augmented Text-to-Audio Generation

Paper • 2309.08051 • Published Sep 14, 2023 • 7

AudioSR: Versatile Audio Super-resolution at Scale

Paper • 2309.07314 • Published Sep 13, 2023 • 28

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Paper • 2308.05734 • Published Aug 10, 2023 • 37

MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies

Paper • 2308.01546 • Published Aug 3, 2023 • 18

WavJourney: Compositional Audio Creation with Large Language Models

Paper • 2307.14335 • Published Jul 26, 2023 • 44

authored a paper almost 3 years ago

AudioLDM: Text-to-Audio Generation with Latent Diffusion Models

Paper • 2301.12503 • Published Jan 29, 2023 • 1