Spaces:
Sleeping
Sleeping
File size: 1,922 Bytes
05d3571 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 |
# LASER: application to cross-lingual natural language inference
This codes shows how to use the multilingual sentence embedding for
cross-lingual NLI, using the XNLI corpus.
We train a NLI classifier on the English MultiNLI corpus, optimizing
the meta-parameters on the English XNLI development corpus.
We then apply that classifier to the test set for all 14 transfer languages.
The foreign languages development set is not used.
## Installation
Just run `bash ./xnli.sh`
which install XNLI and MultiNLI corpora,
calculates the multilingual sentence embeddings,
trains the classifier and displays results.
The XNLI corpus is available [here](https://www.nyu.edu/projects/bowman/xnli/).
## Results
You should get the following results for zero-short cross-lingual transfer.
They slightly differ from those published in the initial version of the paper [2]
due to the change to PyTorch 1.0 and variations in random number generation, new optimization of meta-parameters, etc.
| en | fr | es | de | el | bg | ru | tr | ar | vi | th | zh | hi | sw | ur |
|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|
| 74.65 | 72.26 | 73.15 | 72.48 | 72.73 | 73.35 | 71.08 | 69.84 | 70.48 | 71.94 | 69.20 | 71.38 | 65.95 | 62.14 | 61.82 |
All numbers are accuracies on the test set
## References
Details on the corpus are described in this paper:
[1] Alexis Conneau, Guillaume Lample, Ruty Rinott, Adina Williams, Samuel R. Bowman, Holger Schwenk and Veselin Stoyanov,
[*XNLI: Cross-lingual Sentence Understanding through Inference*](https://aclweb.org/anthology/D18-1269),
EMNLP, 2018.
Detailed system description:
[2] Mikel Artetxe and Holger Schwenk,
[*Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond*](https://arxiv.org/pdf/1812.10464),
arXiv, Dec 26 2018.
|