gmongaras's Collections
2Mamba2Furious: Linear in Complexity...

updated 1 day ago

Pretrained models for the paper 2Mamba2Furious: Linear in Complexity, Competitive in Accuracy (https://arxiv.org/abs/2602.17363)

  • 2Mamba2Furious: Linear in Complexity, Competitive in Accuracy

    Paper • 2602.17363 • Published 5 days ago • 7

  • gmongaras/medium_8192sl_gpu_64bs__squared__sm_norm__A_mask_type_neg_softplus__in_conv_k_2__att2

    3B • Updated 1 day ago • 13

  • gmongaras/medium_8192sl_gpu_64bs__softmax

    0.7B • Updated 1 day ago • 20

  • gmongaras/medium_8192sl_gpu_64bs__mamba

    0.7B • Updated 1 day ago • 23