streamlit
transformers
torch
PyPDF2
python-docx
pdfplumber
sentencepiece