Zesen Cheng
ClownRat
AI & ML interests
multi-modal foundation model; Segmentation, Detection, and Tracking;
Recent Activity
upvoted
a
paper
5 days ago
Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers
upvoted
a
paper
5 days ago
API Agents vs. GUI Agents: Divergence and Convergence