omarcevi's Collections
Papers4Reading

updated Nov 14, 2024

  • CLEAR: Character Unlearning in Textual and Visual Modalities

    Paper • 2410.18057 • Published Oct 23, 2024 • 210

  • CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation

    Paper • 2410.23090 • Published Oct 30, 2024 • 56

  • What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

    Paper • 2410.23743 • Published Oct 31, 2024 • 64

  • "Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

    Paper • 2411.02355 • Published Nov 4, 2024 • 51

  • Benchmarking and Dissecting the Nvidia Hopper GPU Architecture

    Paper • 2402.13499 • Published Feb 21, 2024

  • Balancing Pipeline Parallelism with Vocabulary Parallelism

    Paper • 2411.05288 • Published Nov 8, 2024 • 20

  • OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

    Paper • 2411.04905 • Published Nov 7, 2024 • 125

  • Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

    Paper • 2411.07232 • Published Nov 11, 2024 • 67

  • BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

    Paper • 1810.04805 • Published Oct 11, 2018 • 18

  • Mixtral of Experts

    Paper • 2401.04088 • Published Jan 8, 2024 • 159