Hugging Face
Alex Shaw
alexgshaw
12 followers · 7 following
https://www.tbench.ai/
AI & ML interests
None yet
Recent Activity
1 day ago · harborframework/terminal-bench-2-leaderboard: Add Codex CLI GPT-5.5 Terminal-Bench 2.0 submission metadata
1 day ago · harborframework/terminal-bench-2-leaderboard: Add Codex CLI GPT-5.5 Terminal-Bench 2.0 submission
3 days ago · harborframework/terminal-bench-2-leaderboard: Add clnkr GPT-5.5 results
alexgshaw's activity
New activity in harborframework/terminal-bench-2-leaderboard (1 day ago)
Add Codex CLI GPT-5.5 Terminal-Bench 2.0 submission metadata
#183 opened 1 day ago by Wuji2000 · 2
Add Codex CLI GPT-5.5 Terminal-Bench 2.0 submission
#182 opened 1 day ago by Wuji2000 · 2
New activity in harborframework/terminal-bench-2-leaderboard (3 days ago)
Add clnkr GPT-5.5 results
#180 opened 3 days ago by cosgroveb · 2
Add JJAgent (Multi-Models) submissions
#179 opened 3 days ago by bddppq · 2
Add Polaris Terminal-Bench 2.0 submission
#178 opened 3 days ago by imcynic · 1
Add JJAgent(Multiple Models) submissions - 87.1%
#177 opened 3 days ago by bddppq · 2
New activity in harborframework/terminal-bench-2-leaderboard (4 days ago)
Add NexAU-AHE x GPT-5.5(84.5%) submission (ATIF v1.6 + Meerkat self-audit)
#176 opened 4 days ago by kiki842940 · 3
NexAU-AHE: Meerkat reward-hacking audit + official ATIF v1.6 trajectories
#175 opened 4 days ago by kiki842940 · 6
Add NexAU-AHE x GPT-5.5 submission
#174 opened 4 days ago by kiki842940 · 2
Add NexAU-AHE x GPT-5.5 submission
#173 opened 4 days ago by kiki842940 · 3
Add Harness Agent (MiniMax-M2.7-highspeed) submission
#172 opened 4 days ago by lazyfrog · 1
New activity in harborframework/terminal-bench-2-leaderboard (7 days ago)
Add NexAU x GPT-5.5 submission
#171 opened 7 days ago by kiki842940 · 4
New activity in harborframework/terminal-bench-2-leaderboard (9 days ago)
Add spoox-o-m GPT 5.3-codex and 5-nano submissions
#169 opened 9 days ago by plaume8 · 3
vix__claude-opus-4-7 — 89.9% mean / 97.75% pass@5
#170 opened 9 days ago by kirby88 · 13
Add spoox-o-m GPT-5 submissions
#168 opened 9 days ago by plaume8 · 2
New activity in harborframework/terminal-bench-2-leaderboard (10 days ago)
Add spoox-o-m submissions
#167 opened 10 days ago by plaume8 · 2
Add Wecode GPT-5.5 Terminal-Bench 2.0 submission
#164 opened 12 days ago by JakeCYFan · 4
New activity in harborframework/terminal-bench-2-leaderboard (11 days ago)
Add LemonHarness(GPT-5.3-Codex) submission - 84.5%
#166 opened 11 days ago by yzmmomo · 1
New activity in harborframework/terminal-bench-2-leaderboard (12 days ago)
Add Capy GPT-5.5 submission
#165 opened 12 days ago by justinsunyt · 1
New activity in harborframework/terminal-bench-2-leaderboard (13 days ago)
Add little-coder__qwen3.5-9b submission (9.2%)
#163 opened 13 days ago by itayinbar · 1