AuroraCap - a wchai Collection

AuroraCap

updated Mar 19

Efficient, Performant Video Detailed Captioning and a New Benchmark

AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Paper • 2410.03051 • Published Oct 4, 2024 • 6
wchai/AuroraCap-7B-VID-xtuner

Video-Text-to-Text • Updated Oct 7, 2024 • 28 • 5
wchai/AuroraCap-7B-IMG-xtuner

Image-Text-to-Text • Updated Oct 7, 2024 • 6 • 2
wchai/Video-Detailed-Caption

Viewer • Updated Oct 7, 2024 • 1.03k • 879 • 9

Note The VDC benchmark contains 1,027 videos with captions averaging over 500 words.
wchai/lmms_VDC_test

Viewer • Updated Oct 19, 2024 • 5.14k • 504 • 2

Note VDC benchmark in lmms-eval format.
wchai/AuroraCap-trainset

Preview • Updated Oct 13, 2024 • 904 • 8

Note Over 20M image and video samples collected for AuroraCap training, with Vicuna and Llama-3 pre-tokenization.
wchai/AuroraCap-recaption

Viewer • Updated Oct 7, 2024 • 22.4k • 52 • 5

Note Video data recaptioned by AuroraCap.