Ben Pope
realbenpope
AI & ML interests
None yet
Organizations
None yet
Diffusion Language Models
Visual reasoning
-
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
Paper • 2406.09403 • Published • 23 -
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Paper • 2406.09411 • Published • 19 -
Transformers meet Neural Algorithmic Reasoners
Paper • 2406.09308 • Published • 44
Memory Tokens
Recurrent architecture
Small LMs
Embeddings
Diffusion Language Models
Steady state model
Visual reasoning
-
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
Paper • 2406.09403 • Published • 23 -
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Paper • 2406.09411 • Published • 19 -
Transformers meet Neural Algorithmic Reasoners
Paper • 2406.09308 • Published • 44
Reasoning
Memory Tokens
In context learning
Recurrent architecture
MoE
Small LMs