IoDmitri's Collections
Dmitri’s papers

updated Mar 19

  • ReLearn: Unlearning via Learning for Large Language Models

    Paper • 2502.11190 • Published Feb 16 • 29

  • Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

    Paper • 2502.11089 • Published Feb 16 • 156

  • Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents

    Paper • 2502.11357 • Published Feb 17 • 10

  • DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

    Paper • 2503.12797 • Published Mar 17 • 30

  • Towards Self-Improving Systematic Cognition for Next-Generation Foundation MLLMs

    Paper • 2503.12303 • Published Mar 16 • 7

  • R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

    Paper • 2503.12937 • Published Mar 17 • 29

  • MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research

    Paper • 2503.13399 • Published Mar 17 • 21