AnyFlow Collection Any-Step Video Diffusion Model with On-Policy Flow Map Distillation • 4 items • Updated 1 day ago • 8
MusicGen Stereo Collection A collection of stereo music generation models as part of the v2 MusicGen release. • 4 items • Updated Apr 24, 2024 • 18
HDR Video Generation via Latent Alignment with Logarithmic Encoding Paper • 2604.11788 • Published Apr 13 • 10
ERNIE-Image Collection The serieas of image generation models, including text2img、img2img. • 2 items • Updated about 1 month ago • 23
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published Apr 13 • 28
OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation Paper • 2604.11804 • Published Apr 13 • 72
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 89 items • Updated 17 days ago • 606
EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated 22 days ago • 42
MOSS-Audio Collection An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 7 items • Updated 13 days ago • 55