FAST: Efficient Action Tokenization for Vision-Language-Action Models
Paper
•
2501.09747
•
Published
•
27
A FAST tokenizer, trained on the DROID dataset only. For a universal FAST tokenizer, applicable to any robot dataset, see here.
For details about FAST tokenizers, see our paper on efficient action tokenization for VLA training:
FAST: Efficient Action Tokenization for Vision-Language-Action Models