Commit
·
094fcb6
1
Parent(s):
6cd60e2
Changed dataset name, added sentence transformer references back in
Browse files
README.md
CHANGED
@@ -39,9 +39,9 @@ model-index:
|
|
39 |
name: Cosine Ap
|
40 |
---
|
41 |
|
42 |
-
#
|
43 |
|
44 |
-
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [Alibaba-NLP/gte-modernbert-base](https://huggingface.co/Alibaba-NLP/gte-modernbert-base) on the Quora
|
45 |
|
46 |
## Model Details
|
47 |
|
@@ -52,7 +52,7 @@ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [A
|
|
52 |
- **Output Dimensionality:** 768 dimensions
|
53 |
- **Similarity Function:** Cosine Similarity
|
54 |
- **Training Dataset:**
|
55 |
-
- Quora
|
56 |
<!-- - **Language:** Unknown -->
|
57 |
<!-- - **License:** Unknown -->
|
58 |
|
@@ -115,17 +115,17 @@ print(similarities.shape)
|
|
115 |
|
116 |
### Training Dataset
|
117 |
|
118 |
-
####
|
119 |
|
120 |
-
* Dataset:
|
121 |
* Size: training samples
|
122 |
* Columns: <code>question_1</code>, <code>question_2</code>, and <code>label</code>
|
123 |
|
124 |
### Evaluation Dataset
|
125 |
|
126 |
-
####
|
127 |
|
128 |
-
* Dataset:
|
129 |
* Size: evaluation samples
|
130 |
* Columns: <code>question_1</code>, <code>question_2</code>, and <code>label</code>
|
131 |
|
@@ -133,15 +133,18 @@ print(similarities.shape)
|
|
133 |
|
134 |
### BibTeX
|
135 |
|
|
|
|
|
136 |
#### Sentence Transformers
|
137 |
```bibtex
|
138 |
-
@inproceedings{
|
139 |
-
title = "",
|
140 |
-
author = "",
|
141 |
-
|
142 |
-
|
143 |
-
|
144 |
-
|
|
|
145 |
}
|
146 |
```
|
147 |
|
|
|
39 |
name: Cosine Ap
|
40 |
---
|
41 |
|
42 |
+
# Redis Semantic Caching embedding model based on Alibaba-NLP/gte-modernbert-base
|
43 |
|
44 |
+
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [Alibaba-NLP/gte-modernbert-base](https://huggingface.co/Alibaba-NLP/gte-modernbert-base) on the [Quora](https://www.kaggle.com/datasets/quora/question-pairs-dataset) dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity for the purpose of semantic caching.
|
45 |
|
46 |
## Model Details
|
47 |
|
|
|
52 |
- **Output Dimensionality:** 768 dimensions
|
53 |
- **Similarity Function:** Cosine Similarity
|
54 |
- **Training Dataset:**
|
55 |
+
- [Quora](https://www.kaggle.com/datasets/quora/question-pairs-dataset)
|
56 |
<!-- - **Language:** Unknown -->
|
57 |
<!-- - **License:** Unknown -->
|
58 |
|
|
|
115 |
|
116 |
### Training Dataset
|
117 |
|
118 |
+
#### Quora
|
119 |
|
120 |
+
* Dataset: [Quora](https://www.kaggle.com/datasets/quora/question-pairs-dataset)
|
121 |
* Size: training samples
|
122 |
* Columns: <code>question_1</code>, <code>question_2</code>, and <code>label</code>
|
123 |
|
124 |
### Evaluation Dataset
|
125 |
|
126 |
+
#### Quora
|
127 |
|
128 |
+
* Dataset: [Quora](https://www.kaggle.com/datasets/quora/question-pairs-dataset)
|
129 |
* Size: evaluation samples
|
130 |
* Columns: <code>question_1</code>, <code>question_2</code>, and <code>label</code>
|
131 |
|
|
|
133 |
|
134 |
### BibTeX
|
135 |
|
136 |
+
#### Redis Langcache-embed Models
|
137 |
+
|
138 |
#### Sentence Transformers
|
139 |
```bibtex
|
140 |
+
@inproceedings{reimers-2019-sentence-bert,
|
141 |
+
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
|
142 |
+
author = "Reimers, Nils and Gurevych, Iryna",
|
143 |
+
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
|
144 |
+
month = "11",
|
145 |
+
year = "2019",
|
146 |
+
publisher = "Association for Computational Linguistics",
|
147 |
+
url = "https://arxiv.org/abs/1908.10084",
|
148 |
}
|
149 |
```
|
150 |
|