Diffstat (limited to 'README.md'):
-rw-r--r-- README.md | 29
1 file changed, 29 insertions, 0 deletions
diff --git a/README.md b/README.md
index 4c5073e..8de096e 100644
--- a/README.md
+++ b/README.md
@@ -66,6 +66,28 @@ Then;
 This will create two directories; `dictionaries` and `wordnets`.
 Linewise aligned definition files are in `wordnets/ready`.
 
+<details><summary>Language pairs and number of available aligned glosses</summary>
+<p>
+Source Language | Target Language | # of Pairs
+--- | --- | ---:
+en | bg | 4959
+en | el | 18136
+en | it | 12688
+en | ro | 58754
+en | sl | 3144
+en | sq | 4681
+bg | el | 2817
+bg | it | 2115
+bg | ro | 4701
+el | it | 4801
+el | ro | 2144
+el | sq | 4681
+it | ro | 10353
+ro | sl | 2085
+ro | sq | 4646
+</p>
+</details>
+
 ## Acquiring The Embeddings
 
 We use [VecMap](https://github.com/artetxem/vecmap) on [fastText](https://fasttext.cc/) embeddings. You can skip this step if you are providing your own polylingual embeddings.
@@ -88,3 +110,10 @@ git submodule init && git submodule update
 
 Bear in mind that this will require around 50 GB free space.
 
+## Quick Demo
+
+The included `demo.sh` downloads data for two languages.
+
+```bash
+./demo.sh
+```
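
The README text in this diff says that linewise aligned definition files live in `wordnets/ready`, which suggests that line *i* of one file corresponds to line *i* of its partner. Below is a minimal sketch for viewing such a pair side by side. The function name `show_aligned` and the file names in the usage comment are hypothetical; the actual naming scheme inside `wordnets/ready` is not shown here.

```bash
#!/usr/bin/env bash
# Sketch only: show_aligned and the example paths are assumptions,
# not part of the repository.

# Print the first N aligned gloss pairs from two linewise aligned files,
# one pair per line, separated by a tab.
show_aligned() {
    local src="$1" tgt="$2" n="${3:-5}"

    # Linewise aligned files should have the same number of lines.
    if (( $(wc -l < "$src") != $(wc -l < "$tgt") )); then
        echo "line counts differ: $src vs $tgt" >&2
        return 1
    fi

    # paste joins line i of src with line i of tgt.
    paste "$src" "$tgt" | head -n "$n"
}

# Hypothetical usage:
#   show_aligned wordnets/ready/en.txt wordnets/ready/ro.txt 3
```

Checking the line counts first is a cheap guard: if the two files ever drift out of step, every pair after the drift would be silently misaligned.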