Update README.md
Browse files
README.md
CHANGED
@@ -428,7 +428,7 @@ model-index:
|
|
428 |
DEk21_hcmute_embedding is a Vietnamese text embedding focused on RAG and production efficiency:
|
429 |
|
430 |
📚 **Trained Dataset**:
|
431 |
-
The model was trained on an in-house dataset consisting of approximately **
|
432 |
|
433 |
🪆 **Efficiency**:
|
434 |
Trained with a **Matryoshka loss**, allowing embeddings to be truncated with minimal performance loss. This ensures that smaller embeddings are faster to compare, making the model efficient for real-world production use.
|
|
|
428 |
DEk21_hcmute_embedding is a Vietnamese text embedding focused on RAG and production efficiency:
|
429 |
|
430 |
📚 **Trained Dataset**:
|
431 |
+
The model was trained on an in-house dataset consisting of approximately **100,000 examples** of legal questions and their related contexts.
|
432 |
|
433 |
🪆 **Efficiency**:
|
434 |
Trained with a **Matryoshka loss**, allowing embeddings to be truncated with minimal performance loss. This ensures that smaller embeddings are faster to compare, making the model efficient for real-world production use.
|