A newer version of the Gradio SDK is available: 5.29.0
5.29.0
Example script of using FlashAttention for inference coming soon.