HanSaem Kim

kensaem

AI & ML interests

None yet

Organizations

None yet

kensaem's activity

upvoted 2 papers 1 day ago

PixelHacker: Image Inpainting with Structural and Semantic Consistency

Paper • 2504.20438 • Published 9 days ago • 38

In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer

Paper • 2504.20690 • Published 9 days ago • 17

upvoted 9 papers 9 days ago

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Paper • 2504.15271 • Published 17 days ago • 65

DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning

Paper • 2504.14509 • Published 18 days ago • 50

Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published 14 days ago • 86

Subject-driven Video Generation via Disentangled Identity and Motion

Paper • 2504.17816 • Published 15 days ago • 11

Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published 17 days ago • 155

RepText: Rendering Visual Text via Replicating

Paper • 2504.19724 • Published 10 days ago • 30

upvoted 8 papers 15 days ago

InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework

Paper • 2504.12395 • Published 22 days ago • 17

Cobra: Efficient Line Art COlorization with BRoAder References

Paper • 2504.12240 • Published 22 days ago • 27

BitNet b1.58 2B4T Technical Report

Paper • 2504.12285 • Published 22 days ago • 70

Seedream 3.0 Technical Report

Paper • 2504.11346 • Published 23 days ago • 54

FlexIP: Dynamic Control of Preservation and Personality for Customized Image Generation

Paper • 2504.07405 • Published 28 days ago • 12

PixelFlow: Pixel-Space Generative Models with Flow

Paper • 2504.07963 • Published 28 days ago • 19

Compass Control: Multi Object Orientation Control for Text-to-Image Generation

Paper • 2504.06752 • Published 29 days ago • 10

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Paper • 2504.07960 • Published 28 days ago • 46

upvoted a paper 23 days ago

VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model

Paper • 2504.07615 • Published 28 days ago • 31