GAN Dissection: Visualizing and Understanding Generative Adversarial Networks Paper • 1811.10597 • Published Nov 26, 2018
Semantic Photo Manipulation with a Generative Image Prior Paper • 2005.07727 • Published May 15, 2020
Understanding the Role of Individual Units in a Deep Neural Network Paper • 2009.05041 • Published Sep 10, 2020
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics Paper • 2102.01672 • Published Feb 2, 2021
LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity Paper • 2404.03214 • Published Apr 4, 2024 • 2
Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations Paper • 2403.06009 • Published Mar 9, 2024
Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence Paper • 2502.09927 • Published Feb 14
Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping Paper • 2501.06589 • Published Jan 11
Selective Self-to-Supervised Fine-Tuning for Generalization in Large Language Models Paper • 2502.08130 • Published Feb 12 • 9
Selective Self-Rehearsal: A Fine-Tuning Approach to Improve Generalization in Large Language Models Paper • 2409.04787 • Published Sep 7, 2024 • 1