A benchmark for evaluating sentence/document embeddings of Scandinavian language models.
You can install the Scandinavian Embedding Benchmark (seb) via pip from PyPI:
pip install sebTo see more examples, see the documentation.
| Documentation | |
|---|---|
| 🔧 Installation | Installation instructions on how to install this package |
| 👩💻 Usage | Introduction on how to use the package |
| 📖 Documentation | A minimal and developing documentation |
| Type | |
|---|---|
| 🚨 Bug Reports | GitHub Issue Tracker |
| 🎁 Feature Requests & Ideas | GitHub Issue Tracker |
| 👩💻 Usage Questions | GitHub Discussions |
| 🗯 General Discussion | GitHub Discussions |
To cite this work please refer to the following work accepted at neurips:
Enevoldsen, K., Kardos, M., Muennighoff, N., & Nielbo, K. (2024). The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding. In Advances in Neural Information Processing Systems
or use the following BibTeX:
@inproceedings{enevoldsen2024scandinavian,
title={The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding},
author={Enevoldsen, Kenneth and Kardos, M{\'a}rton and Muennighoff, Niklas and Nielbo, Kristoffer},
booktitle={Advances in Neural Information Processing Systems},
year={2024},
url={https://nips.cc/virtual/2024/poster/97869}
}
