Zesen Cheng
ClownRat
AI & ML interests
multi-modal foundation model; Segmentation, Detection, and Tracking;
Recent Activity
upvoted
a
paper
15 days ago
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers