torch
transformers
accelerate
bitsandbytes
fastapi
uvicorn
peft
auto-gptq