Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
pgarbacki 's Collections
RL
data
retrieval
tool use
image
multimodal
optimizers
video
finetuning
foundational models
routing
reasoning
computer use

finetuning

updated Feb 12
Upvote
-

  • Badllama 3: removing safety finetuning from Llama 3 in minutes

    Paper • 2407.01376 • Published Jul 1, 2024

  • Weighted-Reward Preference Optimization for Implicit Model Fusion

    Paper • 2412.03187 • Published Dec 4, 2024 • 12

  • Sparse Matrix in Large Language Model Fine-tuning

    Paper • 2405.15525 • Published May 24, 2024
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs