ZehongMa's picture

1 9 2

ZehongMa

zehongma

·

https://zehong-ma.github.io/

zehong-ma

AI & ML interests

MLLMs, Image/Video Generation, Multi-modal Representation Learning

Recent Activity

upvoted a paper 2 days ago

BabyVision: Visual Reasoning Beyond Language

upvoted a paper about 1 month ago

From Next-Token to Next-Block: A Principled Adaptation Path for Diffusion LLMs

upvoted a paper about 1 month ago

EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture

View all activity

Organizations

None yet

Papers 2

arxiv:2511.19365

arxiv:2506.09045

spaces 1

DeCo

Embed and display a remote webpage

models 2

zehongma/DeCo

Updated Nov 25, 2025 • 2

zehongma/OVMR

Updated Jun 16, 2025

datasets 1

zehongma/ImageNet21k_OVR

Viewer • Updated Oct 10, 2024 • 20.3k • 10