bethgelab's Collections
Delta Belief RL
Updated 26 days ago
Collection of the models for our paper "Intrinsic Credit Assignment for Long Horizon Interaction".
iaa01/CIA-1.7B • 2B • Updated 26 days ago • 13 • 1
iaa01/CIA-4B • 4B • Updated 26 days ago • 77 • 2
Klingspor/Qwen3-1.7B-SFT • Text Generation • 2B • Updated 26 days ago • 4.94k • 1
Klingspor/Qwen3-4B-SFT • Text Generation • 4B • Updated 26 days ago • 105
Klingspor/StarPO-1.7B • Text Generation • 2B • Updated 26 days ago • 28
Klingspor/StarPO-4B • Text Generation • 4B • Updated 26 days ago • 88 • 2