ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer
Paper
• 2603.03583 • Published
• 1
Scalable Artificial Intelligence
ByteFlow: Language Modeling through Adaptive Byte Compression without a Tokenizer
DDiT: Dynamic Patch Scheduling for Efficient Diffusion Transformers