hyungu lee
gurigoo
AI & ML interests
None yet
Organizations
None yet
robot
NLP
- Differential Transformer
  Paper • 2410.05258 • Published • 179
- AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions
  Paper • 2410.20424 • Published • 40
- Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization
  Paper • 2412.17739 • Published • 41
- The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
  Paper • 2505.22617 • Published • 131
cv gen
- The GAN is dead; long live the GAN! A Modern GAN Baseline
  Paper • 2501.05441 • Published • 95
- PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity
  Paper • 2503.07677 • Published • 86
- OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting
  Paper • 2503.08677 • Published • 29
- Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space
  Paper • 2503.09419 • Published • 6
about Transformer
- What Matters in Transformers? Not All Attention is Needed
  Paper • 2406.15786 • Published • 31
- Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss
  Paper • 2410.17243 • Published • 93
- Forgetting Transformer: Softmax Attention with a Forget Gate
  Paper • 2503.02130 • Published • 32
- Transformers without Normalization
  Paper • 2503.10622 • Published • 171
3D
- RoCoTex: A Robust Method for Consistent Texture Synthesis with Diffusion Models
  Paper • 2409.19989 • Published • 18
- 3D Scene Generation: A Survey
  Paper • 2505.05474 • Published • 21
- What Makes for Text to 360-degree Panorama Generation with Stable Diffusion?
  Paper • 2505.22129 • Published • 15
- Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment
  Paper • 2505.18600 • Published • 48
OCR