13 16 20

Bin Wang

wanderkid

https://wangbindl.github.io/

wangbinDL

AI & ML interests

Computer Vision, Multimodal Large Language Model

Recent Activity

upvoted a paper about 11 hours ago

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition

liked a Space 3 days ago

opendatalab/TRivia-3B

liked a dataset 8 days ago

opendatalab/AICC

View all activity

Organizations

upvoted a paper about 11 hours ago

TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition

Paper • 2512.01248 • Published 7 days ago • 9

liked a Space 3 days ago

TRivia-3B

⭐

Convert table images into HTML tags with TRivia-3B

liked a dataset 8 days ago

opendatalab/AICC

Viewer • Updated 6 days ago • 4.84B • 52.3k • 76

authored a paper 2 months ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 136

upvoted a paper 2 months ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 136

liked a model 3 months ago

opendatalab/MinerU2.5-2509-1.2B

Image-Text-to-Text • 1B • Updated Sep 29 • 1.39M • 291

updated a model 3 months ago

opendatalab/MinerU2.5-2509-1.2B

Image-Text-to-Text • 1B • Updated Sep 29 • 1.39M • 291

published a model 3 months ago

opendatalab/MinerU2.5-2509-1.2B

Image-Text-to-Text • 1B • Updated Sep 29 • 1.39M • 291

liked a dataset 3 months ago

HuggingFaceFW/finepdfs

Viewer • Updated 6 days ago • 476M • 36.7k • 682

upvoted a paper 4 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256

liked a model 6 months ago

opendatalab/MinerU2.0-2505-0.9B

1B • Updated Jun 12 • 2.44k • 41

New activity in wanderkid/UniMER_Dataset 9 months ago

Add task category and link to CDM paper

#2 opened 9 months ago by

nielsr

upvoted a collection 9 months ago

olmOCR

Collection

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 12 items • Updated 9 days ago • 140

liked a model 10 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 1.21M • • 12.9k

liked a dataset 11 months ago

lmms-lab/LLaVA-OneVision-Data

Viewer • Updated May 24 • 3.94M • 23.4k • 222

liked 2 datasets 12 months ago

opendatalab/OHR-Bench

Viewer • Updated Aug 28 • 8.56k • 1.01k • 16

opendatalab/OmniDocBench

Viewer • Updated Sep 26 • 1.36k • 20.8k • 57

commented a paper 12 months ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 28 •

authored a paper 12 months ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 28

upvoted a paper 12 months ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 28

Bin Wang

AI & ML interests

Recent Activity

Organizations

wanderkid's activity

TRivia-3B

Add task category and link to CDM paper