Yushi Bai

bys0318

https://bys0318.github.io/

Recent Activity

liked a model 2 days ago

THUDM/GLM-4-32B-0414

authored a paper 16 days ago

An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes

bys0318's activity

upvoted a paper 16 days ago

An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes

Paper • 2504.15270 • Published 17 days ago • 10

upvoted a collection 23 days ago

GLM-4-0414

Collection

GLM-4-0414 series model • 8 items • Updated 23 days ago • 123

upvoted 3 papers about 2 months ago

Shifting Long-Context LLMs Research from Input to Output

Paper • 2503.04723 • Published Mar 6 • 22

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Paper • 2412.21059 • Published Dec 30, 2024 • 19

GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training

Paper • 2503.08525 • Published Mar 11 • 17

upvoted a paper 3 months ago

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Paper • 2502.14834 • Published Feb 20 • 24

upvoted a paper 4 months ago

Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament

Paper • 2501.13007 • Published Jan 22 • 20

upvoted 2 papers 5 months ago

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published Dec 19, 2024 • 38

AlphaTablets: A Generic Plane Representation for 3D Planar Reconstruction from Monocular Videos

Paper • 2411.19950 • Published Nov 29, 2024 • 6

upvoted a paper 6 months ago

LongReward: Improving Long-context Large Language Models with AI Feedback

Paper • 2410.21252 • Published Oct 28, 2024 • 18

upvoted a paper 7 months ago

Pre-training Distillation for Large Language Models: A Design Space Exploration

Paper • 2410.16215 • Published Oct 21, 2024 • 16

upvoted a paper 8 months ago

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

Paper • 2409.02897 • Published Sep 4, 2024 • 48

upvoted 2 papers 9 months ago

LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

Paper • 2408.07055 • Published Aug 13, 2024 • 67

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Paper • 2408.06072 • Published Aug 12, 2024 • 40

upvoted 3 papers 10 months ago

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Paper • 2407.04620 • Published Jul 5, 2024 • 32

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Paper • 2407.02855 • Published Jul 3, 2024 • 13

Simulating Classroom Education with LLM-Empowered Agents

Paper • 2406.19226 • Published Jun 27, 2024 • 32

upvoted a paper 11 months ago

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

Paper • 2406.12793 • Published Jun 18, 2024 • 33

upvoted a collection 11 months ago

GLM-4

Collection

GLM-4 Open Models • 14 items • Updated 25 days ago • 118

upvoted a paper about 1 year ago

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

Paper • 2403.05121 • Published Mar 8, 2024 • 25