Visual Document Retrieval
Transformers
Safetensors
ColPali
English
pretraining
tonywu71 commited on
Commit
56b5c73
Β·
verified Β·
1 Parent(s): 8df7ecd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -0
README.md CHANGED
@@ -94,6 +94,13 @@ with torch.no_grad():
94
  scores = processor.score_retrieval(query_embeddings.embeddings, image_embeddings.embeddings)
95
  ```
96
 
 
 
 
 
 
 
 
97
  ## Limitations
98
 
99
  - **Focus**: The model primarily focuses on PDF-type documents and high-ressources languages, potentially limiting its generalization to other document types or less represented languages.
 
94
  scores = processor.score_retrieval(query_embeddings.embeddings, image_embeddings.embeddings)
95
  ```
96
 
97
+ ## Resources
98
+
99
+ - The *ColPali* arXiv paper can be found [here](https://doi.org/10.48550/arXiv.2407.01449). πŸ“„
100
+ - The official blog post detailing ColPali can be found [here](https://huggingface.co/blog/manu/colpali). πŸ“
101
+ - The original model implementation code for the ColPali model and for the `colpali-engine` package can be found [here](https://github.com/illuin-tech/colpali). 🌎
102
+ - Cookbooks for learning to use the transformers-native version of *ColPali*, fine-tuning, and similarity maps generation can be found [here](https://github.com/tonywu71/colpali-cookbooks). πŸ“š
103
+
104
  ## Limitations
105
 
106
  - **Focus**: The model primarily focuses on PDF-type documents and high-ressources languages, potentially limiting its generalization to other document types or less represented languages.