Clelia Astra Bertelli

as-cle-bert

https://www.cleliasportfolio.xyz

AI & ML interests

Recent Activity

replied to their post about 2 hours ago

Ever dreamt of ingesting into a vector DB that pile of CSVs, Word documents and presentations laying in some remote folders on your PC?🗂️ What if I told you that you can do it within three to six lines of code?🤯 Well, with my latest open-source project, 𝐢𝐧𝐠𝐞𝐬𝐭-𝐚𝐧𝐲𝐭𝐡𝐢𝐧𝐠 (https://github.com/AstraBert/ingest-anything), you can take all your non-PDF files, convert them to PDF, extract their text, chunk, embed and load them into a vector database, all in one go!🚀 How? It's pretty simple! 📁 The input files are converted into PDF by PdfItDown (https://github.com/AstraBert/PdfItDown) 📑 The PDF text is extracted using LlamaIndex readers 🦛 The text is chunked exploiting Chonkie 🧮 The chunks are embedded thanks to Sentence Transformers models 🗄️ The embeddings are loaded into a Qdrant vector database And you're done!✅ Curious of trying it? Install it by running: 𝘱𝘪𝘱 𝘪𝘯𝘴𝘵𝘢𝘭𝘭 𝘪𝘯𝘨𝘦𝘴𝘵-𝘢𝘯𝘺𝘵𝘩𝘪𝘯𝘨 And you can start using it in your python scripts!🐍 Don't forget to star it on GitHub and let me know if you have any feedback! ➡️ https://github.com/AstraBert/ingest-anything

replied to their post 2 days ago

posted an update 3 days ago

View all activity

Organizations

as-cle-bert's activity

New activity in as-cle-bert/pdfitdown about 2 months ago

Update requirements.txt

#1 opened about 2 months ago by

not-lain

New activity in greenfit-ai/synthetic-sport-products-sustainability 3 months ago

Librarian Bot: Add language metadata for dataset

#2 opened 3 months ago by

librarian-bot

New activity in bluesky-community/README 5 months ago

Ideas!

#1 opened 5 months ago by

davanstrien

New activity in as-cle-bert/Llama-3.1-405B-FP8 9 months ago

why

#1 opened 9 months ago by

YaserDS-777

New activity in as-cle-bert/saccaromyces-cerevisiae-base about 1 year ago

Librarian Bot: Add language metadata for dataset

#2 opened about 1 year ago by

librarian-bot

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter

New activity in huggingchat/chat-ui about 1 year ago

[ASSISTANTS] Community thread

189

#356 opened about 1 year ago by

victor

New activity in as-cle-bert/plastic-enzymes about 1 year ago

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter

New activity in as-cle-bert/scerevisiae-transcripts-biotypes about 1 year ago

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter

New activity in as-cle-bert/breastcancer-auto-objdetect about 1 year ago

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter

New activity in as-cle-bert/genetics-arxiv-wiki about 1 year ago

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter

New activity in as-cle-bert/VirBiCla-training about 1 year ago

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter