
Supreeth Rao

Supreeth
https://supreethrao.com
  • SupreethRao99
  • SupreethRao99
  • supreeth-rao

AI & ML interests

Reinforcement Learning, Large Language Models, Distributed Computing

Recent Activity

updated a model 20 days ago
Supreeth/verirl-rlvr-qwen3-4b-thinking
updated a model 20 days ago
Supreeth/verirl-sft-qwen3-4b-tooluse-merged
published a model 20 days ago
Supreeth/verirl-sft-qwen3-4b-tooluse-merged

Organizations

None yet

Spaces 1

Running
RL

VeriRL — Verilog RTL Design Environment

🔬

Step through a VeriRL environment by sending actions and viewing the results

20 days ago

Models 9

Supreeth/verirl-rlvr-qwen3-4b-thinking

Updated 20 days ago

Supreeth/verirl-sft-qwen3-4b-tooluse-merged

Text Generation • 4B • Updated 20 days ago • 332

Supreeth/verirl-sft-qwen3-4b-tooluse

Updated 20 days ago

Supreeth/verirl-sft-qwen3-4b-thinking-merged

Text Generation • 4B • Updated 21 days ago • 701

Supreeth/verirl-sft-qwen3-4b-thinking

Updated 22 days ago

Supreeth/verirl-rlvr-qwen3.5-4b

Updated 22 days ago

Supreeth/searchlm-qwen2.5-3b-rlhf

Text Generation • 3B • Updated Jan 31 • 3

Supreeth/DeBERTa-Twitter-Emotion-Classification

Text Classification • 0.1B • Updated Sep 1, 2023 • 32

Supreeth/Toxic-XLM_RoBERTa

Text Classification • Updated Jun 20, 2022 • 9

Datasets 0

None public yet