Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mguirao 's Collections
Adversarial-LLM

Adversarial-LLM

updated Oct 3, 2024
Upvote
-

  • Adversarial Attacks on Multimodal Agents

    Paper • 2406.12814 • Published Jun 18, 2024 • 4

  • SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

    Paper • 2405.08317 • Published May 14, 2024 • 13

  • AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

    Paper • 2404.16873 • Published Apr 21, 2024 • 30

  • Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

    Paper • 2402.16822 • Published Feb 26, 2024 • 18

  • Emerging Vulnerabilities in Frontier Models: Multi-Turn Jailbreak Attacks

    Paper • 2409.00137 • Published Aug 29, 2024

  • GenTel-Safe: A Unified Benchmark and Shielding Framework for Defending Against Prompt Injection Attacks

    Paper • 2409.19521 • Published Sep 29, 2024
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs