PyPDF2
pandas
scikit-learn
gliner
plotly