Building on HF

Croc-Prog-HF

Croc-Prog-github

AI & ML interests

High-temperature Text-Generation(Creativity of the model), Feature extractor models, Models training optimization(such as: VRAM, Disk space),

Recent Activity

liked a dataset about 10 hours ago

Dampfinchen/Creative_Writing_Multiturn

updated a model about 12 hours ago

Croc-Prog-HF/LoreWeaver-2-LoRA

updated a model about 12 hours ago

Croc-Prog-HF/LoreWeaver-2

View all activity

Organizations

None yet

upvoted a paper about 12 hours ago

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Paper • 2204.05862 • Published Apr 12, 2022 • 3

upvoted an article about 18 hours ago

Article

Introducing OptiMind, a research model designed for optimization

19 days ago

•

upvoted a collection 1 day ago

Sparse Auto-Encoders (SAEs) for Mechanistic Interpretability

Collection

A compilation of sparse auto-encoders trained on large language models. • 37 items • Updated Dec 16, 2025 • 23

upvoted 4 papers 4 days ago

DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints

Paper • 2601.18137 • Published 9 days ago • 24

upvoted a collection 9 days ago

World events

Collection

Dataset containing real world events from 2023 till present • 3 items • Updated 9 days ago • 5

upvoted 2 collections 10 days ago

DeepSeek-Coder

Collection

DeepSeek Coder series • 9 items • Updated Nov 27, 2025 • 75

DeepSeek-Math

Collection

DeepSeek Math series • 6 items • Updated Nov 27, 2025 • 47

upvoted a collection 14 days ago

TranslateGemma

Collection

3 items • Updated 20 days ago • 204

upvoted a changelog 15 days ago

Changelog

HuggingChat for Papers

28 days ago

• 101

upvoted 6 papers 16 days ago

Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits

Paper • 2512.20578 • Published Dec 23, 2025 • 85

A^3-Bench: Benchmarking Memory-Driven Scientific Reasoning via Anchor and Attractor Activation

Paper • 2601.09274 • Published 21 days ago • 84

NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos

Paper • 2601.00393 • Published Jan 1 • 130

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published 28 days ago • 146

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 26 days ago • 218

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published Dec 31, 2025 • 295

upvoted a paper 24 days ago

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Paper • 2401.14196 • Published Jan 25, 2024 • 70

upvoted a paper 26 days ago

Perplexed by Quality: A Perplexity-based Method for Adult and Harmful Content Detection in Multilingual Heterogeneous Web Data

Paper • 2212.10440 • Published Dec 20, 2022 • 1

Croc-Prog-HF

AI & ML interests

Recent Activity

Organizations

Croc-Prog-HF's activity

Introducing OptiMind, a research model designed for optimization

HuggingChat for Papers