Supreeth Rao

Supreeth

https://supreethrao.com

AI & ML interests

Reinforcement Learning, Large Language Models, Distributed Computing

Recent Activity

updated a model 20 days ago

Supreeth/verirl-rlvr-qwen3-4b-thinking

updated a model 20 days ago

Supreeth/verirl-sft-qwen3-4b-tooluse-merged

published a model 20 days ago

Supreeth/verirl-sft-qwen3-4b-tooluse-merged

Organizations

None yet

spaces 1

VeriRL — Verilog RTL Design Environment

Step through a VeriRL environment by sending actions and viewing the results

models 9

Supreeth/verirl-rlvr-qwen3-4b-thinking

Updated 20 days ago

Supreeth/verirl-sft-qwen3-4b-tooluse-merged

Text Generation • 4B • Updated 20 days ago • 332

Supreeth/verirl-sft-qwen3-4b-tooluse

Updated 20 days ago

Supreeth/verirl-sft-qwen3-4b-thinking-merged

Text Generation • 4B • Updated 21 days ago • 701

Supreeth/verirl-sft-qwen3-4b-thinking

Updated 22 days ago

Supreeth/verirl-rlvr-qwen3.5-4b

Updated 22 days ago

Supreeth/searchlm-qwen2.5-3b-rlhf

Text Generation • 3B • Updated Jan 31 • 3

Supreeth/DeBERTa-Twitter-Emotion-Classification

Text Classification • 0.1B • Updated Sep 1, 2023 • 32

Supreeth/Toxic-XLM_RoBERTa

Text Classification • Updated Jun 20, 2022 • 9

datasets 0

None public yet