
Lee Jung Bang

bangbang

AI & ML interests

NLP, chatbot, RL

Recent Activity

upvoted a paper about 10 hours ago

Lessons from the Trenches on Reproducible Evaluation of Language Models

liked a Space about 10 hours ago

HuggingFaceFW/blogpost-fine-tasks

upvoted a paper about 10 hours ago

OLMES: A Standard for Language Model Evaluations


Organizations

None yet

upvoted 6 papers about 10 hours ago

Does your data spark joy? Performance gains from domain upsampling at the end of training

Paper • 2406.03476 • Published Jun 5, 2024 • 4

upvoted 6 papers about 11 hours ago

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 35

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 250

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 138

To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 45

DataComp-LM: In search of the next generation of training sets for language models

Paper • 2406.11794 • Published Jun 17, 2024 • 55

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 152

upvoted 2 papers 7 days ago

PretrainZero: Reinforcement Active Pretraining

Paper • 2512.03442 • Published 9 days ago • 44

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 15 days ago • 120

upvoted a collection 22 days ago

Nemotron-Personas

Collection

A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 3 items • Updated 8 days ago • 14

upvoted an article 22 days ago

Article

🛡️ Nemotron PII: Synthesized Data for Privacy-Preserving AI

Oct 28


upvoted a collection 23 days ago

📄 FinePDFs

Collection

81 items • Updated 30 days ago • 24

upvoted a paper 25 days ago

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7 • 54

upvoted 2 papers 3 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 193

NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings

Paper • 2509.04011 • Published Sep 4 • 28
