Ivy Zhang's picture

Ivy Zhang

Ivy1997

·

AI & ML interests

None yet

Recent Activity

liked a model 8 days ago

nvidia/parakeet-tdt-0.6b-v3

updated a model 21 days ago

AI-Safeguard/Ivy-Fake

liked a model about 1 month ago

AI-Safeguard/Ivy-Fake

View all activity

Organizations

upvoted an article 7 months ago

Article

Supercharge your OCR Pipelines with Open Models

+5

Oct 21, 2025

•

309

upvoted 2 papers 7 months ago

SpineBench: A Clinically Salient, Level-Aware Benchmark Powered by the SpineMed-450k Corpus

Paper • 2510.03160 • Published Oct 3, 2025 • 4

Visual Jigsaw Post-Training Improves MLLMs

Paper • 2509.25190 • Published Sep 29, 2025 • 37

upvoted a paper 9 months ago

A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

Paper • 2508.07407 • Published Aug 10, 2025 • 99

upvoted 3 papers 10 months ago

Arch-Router: Aligning LLM Routing with Human Preferences

Paper • 2506.16655 • Published Jun 19, 2025 • 18

WorldVLA: Towards Autoregressive Action World Model

Paper • 2506.21539 • Published Jun 26, 2025 • 40

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25, 2025 • 64

upvoted 11 papers 11 months ago

Sekai: A Video Dataset towards World Exploration

Paper • 2506.15675 • Published Jun 18, 2025 • 66

AutoMind: Adaptive Knowledgeable Agent for Automated Data Science

Paper • 2506.10974 • Published Jun 12, 2025 • 19

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Paper • 2506.10521 • Published Jun 12, 2025 • 73

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16, 2025 • 275

Reward Reasoning Model

Paper • 2505.14674 • Published May 20, 2025 • 37

Surfer-H Meets Holo1: Cost-Efficient Web Agent Powered by Open Weights

Paper • 2506.02865 • Published Jun 3, 2025 • 34

FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion

Paper • 2506.01111 • Published Jun 1, 2025 • 31

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4, 2025 • 80

IVY-FAKE: A Unified Explainable Framework and Benchmark for Image and Video AIGC Detection

Paper • 2506.00979 • Published Jun 1, 2025 • 12

Zero-Shot Vision Encoder Grafting via LLM Surrogates

Paper • 2505.22664 • Published May 28, 2025 • 7

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Paper • 2505.19147 • Published May 25, 2025 • 145

upvoted a paper 12 months ago

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5, 2025 • 82

upvoted a paper about 1 year ago

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Paper • 2504.10449 • Published Apr 14, 2025 • 15