HUST Vision Lab

university

https://github.com/hustvl

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

xinggangw authored a paper 4 days ago

PixelHacker: Image Inpainting with Structural and Semantic Consistency

Doctor-James updated a model about 2 months ago

hustvl/OmniMamba

Doctor-James published a model about 2 months ago

hustvl/OmniMamba

View all activity

hustvl's activity

xinggangw

authored a paper 4 days ago

PixelHacker: Image Inpainting with Structural and Semantic Consistency

Paper • 2504.20438 • Published 10 days ago • 38

Doctor-James

updated a model about 2 months ago

hustvl/OmniMamba

Any-to-Any • Updated Mar 20 • 5

Doctor-James

published a model about 2 months ago

hustvl/OmniMamba

Any-to-Any • Updated Mar 20 • 5

lyy001

updated a model about 2 months ago

hustvl/MaTVLM_0_25_Mamba2

Image-Text-to-Text • Updated Mar 18 • 11 • 1

lyy001

published a model about 2 months ago

hustvl/MaTVLM_0_25_Mamba2

Image-Text-to-Text • Updated Mar 18 • 11 • 1

wondervictor

updated a dataset about 2 months ago

hustvl/GSEval

Viewer • Updated Mar 15 • 3.72k • 377 • 2

LianghuiZhu

authored a paper about 2 months ago

GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding

Paper • 2503.10596 • Published Mar 13 • 18

wondervictor

authored a paper about 2 months ago

GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding

Paper • 2503.10596 • Published Mar 13 • 18

xinggangw

authored a paper about 2 months ago

GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding

Paper • 2503.10596 • Published Mar 13 • 18

finne

updated a dataset about 2 months ago

hustvl/GSEval

Viewer • Updated Mar 15 • 3.72k • 377 • 2

wondervictor

published a dataset about 2 months ago

hustvl/GSEval

Viewer • Updated Mar 15 • 3.72k • 377 • 2

xinggangw

authored 2 papers about 2 months ago

OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models

Paper • 2503.08686 • Published Mar 11 • 19

AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Paper • 2503.07608 • Published Mar 10 • 23

HongyuanTao

updated 2 models 2 months ago

hustvl/mmMamba-hybrid

Image-Text-to-Text • Updated Feb 26 • 8 • 1

hustvl/mmMamba-linear

Image-Text-to-Text • Updated Feb 26 • 26 • 3

MapleF9

updated a model 3 months ago

hustvl/va-vae-imagenet256-experimental-variants

Updated Feb 21 • 3

xinggangw

authored a paper 3 months ago

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

Paper • 2502.13144 • Published Feb 18 • 40

xinggangw

updated a model 3 months ago

hustvl/mmMamba-linear

Image-Text-to-Text • Updated Feb 26 • 26 • 3

xinggangw

in hustvl/mmMamba-linear 3 months ago

Add metadata tags and link to code

#1 opened 3 months ago by

nielsr

HongyuanTao

authored a paper 3 months ago

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation

Paper • 2502.13145 • Published Feb 18 • 38

AI & ML interests

Recent Activity

Team members 10

hustvl's activity

Add metadata tags and link to code