Example of LLM inference using FlashAttention

An example script that uses FlashAttention for inference is coming soon.