view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 6 days ago • 223
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published 13 days ago • 238
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling Paper • 2511.20785 • Published 11 days ago • 148
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation Paper • 2511.14993 • Published 18 days ago • 222
TiDAR: Think in Diffusion, Talk in Autoregression Paper • 2511.08923 • Published 25 days ago • 110
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published 26 days ago • 104
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published about 1 month ago • 208
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30 • 114
JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence Paper • 2510.23538 • Published Oct 27 • 96
DeepAgent: A General Reasoning Agent with Scalable Toolsets Paper • 2510.21618 • Published Oct 24 • 99
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders Paper • 2510.19779 • Published Oct 22 • 59
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published Oct 21 • 110
A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning Paper • 2510.15444 • Published Oct 17 • 147
Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples Paper • 2510.07192 • Published Oct 8 • 5
AI for Service: Proactive Assistance with AI Glasses Paper • 2510.14359 • Published Oct 16 • 73
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published Oct 13 • 165
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation Paper • 2510.08673 • Published Oct 9 • 125