This is great news for all the users with GTX 10XX, P40...
A Flash Attention implementation for older NVIDIA GPUs without Tensor Cores has landed in llama.cpp in the last few days and should be merged into the next version of KoboldCpp; you can already try it with another fork or by building it yourself.
You should expect lower VRAM usage for the same context size, letting you run larger contexts on your current GPU.
There have also been reports of improved tokens/second during inference, so that's also grand!
If you have tried it, I'd like to hear your experiences with --flashattention so far, especially with this implementation on the many Pascal cards out there (GTX 10XX, P40...).
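For anyone who wants to test, here's a rough sketch of how to enable it from the command line. The model path, layer count, and context size are placeholders; flag names can change between builds, so check --help on your version:

```
# KoboldCpp (a fork or build that includes the llama.cpp change):
python koboldcpp.py --model mymodel.gguf --usecublas --gpulayers 99 --contextsize 8192 --flashattention

# Plain llama.cpp (already merged there; binary may be ./main on older builds):
./llama-cli -m mymodel.gguf -ngl 99 -c 8192 -fa
```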
Discussion linked below, with more links to relevant information:
Just wanted to shout out a massive thank you to all 2000 of you who've followed me on Hugging Face! It's incredible to have such an awesome crew backing me up as I dive into all these LLM experiments.
Even though not all my models turn out perfect, I've found some real gems and methods along the way. It's like digging for treasure: sometimes you find nothing, sometimes you find a pearl, and sometimes you find a new method to try.
Your support and encouragement mean the world to me, and I'm really stoked to keep experimenting and learning. If you had told me a few years ago that I would have so many people following me for what I do, I wouldn't have believed it. Here's to more discoveries and adventures ahead!
Also, big thanks once again, and a huge shoutout to @IkariDev for being there through this journey and supporting me. I'm excited for our future work together and hope we will continue to make people happy!
I want to thank @Gryphe too, since my early work was heavily inspired by MythoMax and its RP/ERP vibe. If I'm here today, it's probably because of you.
I almost forgot @chargoddard and his amazing tool too! What would we do without mergekit in our lives? Thank you!