Sylvest/OpenVLA-AC-PD-1traj-libero-object
8B
•
Updated
•
15
Official Collections for SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models, including SFT and RL models.