Update README.md
README.md
CHANGED
@@ -25,7 +25,7 @@ license: apache-2.0
 
 ## Intented Usage & Model Info
 
-`jina-embedding-t-en-v1` is a tiny
+`jina-embedding-t-en-v1` is a tiny language model that has been trained using Jina AI's Linnaeus-Clean dataset.
 This dataset consists of 380 million pairs of sentences, which include both query-document pairs.
 These pairs were obtained from various domains and were carefully selected through a thorough cleaning process.
 The Linnaeus-Full dataset, from which the Linnaeus-Clean dataset is derived, originally contained 1.6 billion sentence pairs.