1 14 13

Jason Liu

liuzhan22

liuzhan22

AI & ML interests

None yet

Recent Activity

liked a dataset about 1 month ago

MRSAudio/MRSAudio

upvoted a paper 2 months ago

video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models

authored a paper 3 months ago

Audio-Conditioned Diffusion LLMs for ASR and Deliberation Processing

View all activity

Organizations

None yet

liked a dataset about 1 month ago

MRSAudio/MRSAudio

Viewer • Updated Nov 1 • 246k • 137k • 3

upvoted a paper 2 months ago

video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models

Paper • 2406.15704 • Published Jun 22, 2024 • 6

authored a paper 3 months ago

Audio-Conditioned Diffusion LLMs for ASR and Deliberation Processing

Paper • 2509.16622 • Published Sep 20 • 1

commented a paper 3 months ago

Audio-Conditioned Diffusion LLMs for ASR and Deliberation Processing

Paper • 2509.16622 • Published Sep 20 • 1 •

upvoted a paper 3 months ago

Audio-Conditioned Diffusion LLMs for ASR and Deliberation Processing

Paper • 2509.16622 • Published Sep 20 • 1

liked a model 3 months ago

mispeech/midashenglm-7b-0804-fp32

Audio-Text-to-Text • 8B • Updated Oct 31 • 33.3k • 76

authored a paper 3 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256

upvoted a paper 4 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256

liked a model 4 months ago

internlm/Intern-S1

Image-Text-to-Text • 241B • Updated Oct 31 • 56.5k • 249

upvoted a collection 4 months ago

audio

Collection

110 items • Updated 10 days ago • 6

liked a model 4 months ago

zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated Oct 25 • 48.5k • • 694

upvoted a changelog 4 months ago

Changelog

Trending Papers

Jul 28

• 104

authored a paper 4 months ago

Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement

Paper • 2409.09642 • Published Sep 15, 2024 • 1

liked a dataset 4 months ago

fka/awesome-chatgpt-prompts

Viewer • Updated Jan 6 • 203 • 33.2k • 9.43k

upvoted a paper 4 months ago

Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement

Paper • 2409.09642 • Published Sep 15, 2024 • 1

liked a model 4 months ago

bosonai/higgs-audio-v2-tokenizer

Updated Jul 22 • 5.55k • 40

upvoted a paper 4 months ago

Step-Audio 2 Technical Report

Paper • 2507.16632 • Published Jul 22 • 73

upvoted a paper 5 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 159

liked a dataset 6 months ago

tsinghua-ee/QualiSpeech

Viewer • Updated Aug 4 • 14.6k • 1.49k • 19

upvoted a paper 6 months ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 166

Jason Liu

AI & ML interests

Recent Activity

Organizations

liuzhan22's activity

Trending Papers