JM-Brun's Collections

Interpretability XAI

updated Mar 4

  • ReAGent: Towards A Model-agnostic Feature Attribution Method for Generative Language Models

    Paper • 2402.00794 • Published Feb 1, 2024 • 1

  • Rethinking Interpretability in the Era of Large Language Models

    Paper • 2402.01761 • Published Jan 30, 2024 • 24

  • Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

    Paper • 2502.03032 • Published Feb 5 • 60

  • Tell me why: Visual foundation models as self-explainable classifiers

    Paper • 2502.19577 • Published Feb 26 • 11