model

#1
by rakmik - opened

Please specify a llm model and a embading model so they can give the best answer, or maybe from anyone who has tried it.

Please specify a llm model and a embading model so they can give the best answer, or maybe from anyone who has tried it.

  1. Update: I’ve switched the LLM to google_gemma-3-1b-it-Q8_0.gguf and the embedding model to Alibaba-NLP/gte-multilingual-base.
  2. I opted for smaller models due to limited CPU resources.
  3. Note: Since the original repository broke, I’ve created a new one here: https://huggingface.co/spaces/sergey21000/chatbot-rag.
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment