stepfun-ai/GOT-OCR2_0
Image-Text-to-Text
β’
0.7B
β’
Updated
β’
15.5k
β’
1.53k
Generate MIDI music from prompts
Segment and track objects in a video
Demo for multimodal understanding and generation