Open X-Embodiment datasets in LeRobot format with standard transfomation (https://github.com/Tavish9/any4lerobot)
AI & ML interests
Embodied AI
Recent Activity
EmbodiedOneVision is a unified framework for multimodal embodied reasoning and robot control, featuring interleaved vision-text-action pretraining.
-
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
Paper β’ 2508.21112 β’ Published β’ 77 -
IPEC-COMMUNITY/EO-1-3B
Robotics β’ Updated β’ 12 -
IPEC-COMMUNITY/EO-Data1.5M
Viewer β’ Updated β’ 739k β’ 3.64k β’ 12 -
IPEC-COMMUNITY/demos25
Viewer β’ Updated β’ 75 β’ 261
-
OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation
Paper β’ 2502.18041 β’ Published β’ 1 -
IPEC-COMMUNITY/openfly-agent-7b
Image-Text-to-Text β’ 8B β’ Updated β’ 112 -
IPEC-COMMUNITY/OpenFly_DataGen
Updated β’ 398 β’ 1 -
IPEC-COMMUNITY/OpenFly-rlds
Updated β’ 3.12k
-
SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model
Paper β’ 2501.15830 β’ Published β’ 13 -
IPEC-COMMUNITY/spatialvla-4b-224-pt
Image-Text-to-Text β’ 4B β’ Updated β’ 14.1k β’ 11 -
IPEC-COMMUNITY/spatialvla-4b-mix-224-pt
Image-Text-to-Text β’ 4B β’ Updated β’ 512 β’ 4 -
IPEC-COMMUNITY/spatialvla-4b-224-sft-bridge
Robotics β’ 4B β’ Updated β’ 142 β’ 1
-
IPEC-COMMUNITY/libero_spatial_no_noops_1.0.0_lerobot
Viewer β’ Updated β’ 53k β’ 3.2k β’ 1 -
IPEC-COMMUNITY/libero_goal_no_noops_1.0.0_lerobot
Viewer β’ Updated β’ 52k β’ 2.78k -
IPEC-COMMUNITY/libero_object_no_noops_1.0.0_lerobot
Viewer β’ Updated β’ 67k β’ 2.79k -
IPEC-COMMUNITY/libero_10_no_noops_1.0.0_lerobot
Viewer β’ Updated β’ 101k β’ 3.22k β’ 1
Open X-Embodiment datasets in LeRobot format with standard transfomation (https://github.com/Tavish9/any4lerobot)
-
SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model
Paper β’ 2501.15830 β’ Published β’ 13 -
IPEC-COMMUNITY/spatialvla-4b-224-pt
Image-Text-to-Text β’ 4B β’ Updated β’ 14.1k β’ 11 -
IPEC-COMMUNITY/spatialvla-4b-mix-224-pt
Image-Text-to-Text β’ 4B β’ Updated β’ 512 β’ 4 -
IPEC-COMMUNITY/spatialvla-4b-224-sft-bridge
Robotics β’ 4B β’ Updated β’ 142 β’ 1
EmbodiedOneVision is a unified framework for multimodal embodied reasoning and robot control, featuring interleaved vision-text-action pretraining.
-
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
Paper β’ 2508.21112 β’ Published β’ 77 -
IPEC-COMMUNITY/EO-1-3B
Robotics β’ Updated β’ 12 -
IPEC-COMMUNITY/EO-Data1.5M
Viewer β’ Updated β’ 739k β’ 3.64k β’ 12 -
IPEC-COMMUNITY/demos25
Viewer β’ Updated β’ 75 β’ 261
-
IPEC-COMMUNITY/libero_spatial_no_noops_1.0.0_lerobot
Viewer β’ Updated β’ 53k β’ 3.2k β’ 1 -
IPEC-COMMUNITY/libero_goal_no_noops_1.0.0_lerobot
Viewer β’ Updated β’ 52k β’ 2.78k -
IPEC-COMMUNITY/libero_object_no_noops_1.0.0_lerobot
Viewer β’ Updated β’ 67k β’ 2.79k -
IPEC-COMMUNITY/libero_10_no_noops_1.0.0_lerobot
Viewer β’ Updated β’ 101k β’ 3.22k β’ 1
-
OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation
Paper β’ 2502.18041 β’ Published β’ 1 -
IPEC-COMMUNITY/openfly-agent-7b
Image-Text-to-Text β’ 8B β’ Updated β’ 112 -
IPEC-COMMUNITY/OpenFly_DataGen
Updated β’ 398 β’ 1 -
IPEC-COMMUNITY/OpenFly-rlds
Updated β’ 3.12k