Owen

Owen777

https://owen718.github.io

Owen718

AI & ML interests

None yet

Recent Activity

updated a model 8 days ago

Owen777/results01

published a model 8 days ago

Owen777/results01

updated a model 10 days ago

Owen777/results03

upvoted a paper 14 days ago

UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios

Paper • 2511.18050 • Published 16 days ago • 37

upvoted an article 21 days ago

We’re open-sourcing our text-to-image model and the process behind it

Article • 26 days ago

upvoted 2 papers about 2 months ago

Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback

Paper • 2510.16888 • Published Oct 19 • 21

SwarmSys: Decentralized Swarm-Inspired Agents for Scalable and Adaptive Reasoning

Paper • 2510.10047 • Published Oct 11 • 13

upvoted a collection 2 months ago

DeepSeek-V3.2

Collection • 4 items • Updated 7 days ago • 504

upvoted a paper 2 months ago

LucidFlux: Caption-Free Universal Image Restoration via a Large-Scale Diffusion Transformer

Paper • 2509.22414 • Published Sep 26 • 20

upvoted a paper 4 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256

upvoted a paper 8 months ago

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8 • 64

upvoted 8 papers 9 months ago

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Paper • 2503.07703 • Published Mar 10 • 37

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 71

MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice

Paper • 2503.05978 • Published Mar 7 • 36

Effective and Efficient Masked Image Generation Models

Paper • 2503.07197 • Published Mar 10 • 11

upvoted a paper 10 months ago

VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing

Paper • 2502.17258 • Published Feb 24 • 79

upvoted an article 10 months ago

What is test-time compute and how to scale it?

Article • Feb 6 • 109

upvoted 2 papers about 1 year ago

MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models

Paper • 2410.13370 • Published Oct 17, 2024 • 37

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10, 2024 • 52