BinghengWu's picture

3 6 10

BinghengWu

wubingheng

·

https://github.com/wubingheng111

AI & ML interests

I like to fine-tune the small models of the Doge series.

Organizations

upvoted an article 4 months ago

Article

Trainable Dynamic Mask Sparse Attention: Bridging Efficiency and Effectiveness in Long-Context Language Models

Aug 5

•

7

upvoted a collection 4 months ago

🧐Small-Papers

Technical support for the SmallDoges series models. • 2 items • Updated Aug 5 • 2

upvoted a paper 4 months ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published Aug 4 • 17

upvoted a collection 5 months ago

🧠 SmolLM3

Smol, multilingual, long-context reasoner • 14 items • Updated Oct 9 • 85

upvoted a collection 11 months ago

Doge

Doge family of small language models. • 12 items • Updated Mar 28 • 6

upvoted a paper 12 months ago

Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture

Paper • 2412.11834 • Published Dec 16, 2024 • 8