Machine Translation

Covid-19 MLIA Data

Training

Language          Size                       Sentences   Download
Round 1
English-French    107.9 Mbyte (compressed)   1,004,715   en-fr.zip
English-German    98.6 Mbyte (compressed)    926,147     en-de.zip
English-Greek     100.2 Mbyte (compressed)   834,240     en-el.zip
English-Italian   91.3 Mbyte (compressed)    900,472     en-it.zip
English-Spanish   107.3 Mbyte (compressed)   1,028,287   en-es.zip
English-Swedish   78.1 Mbyte (compressed)    806,925     en-sv.zip

Validation

Language          Size                       Sentences   Download
Round 1
English-French    282 Kbyte (compressed)     728         en-fr.zip
English-German    153 Kbyte (compressed)     528         en-de.zip
English-Greek     1,500 Kbyte (compressed)   3,878       en-el.zip
English-Italian   1,400 Kbyte (compressed)   3,745       en-it.zip
English-Spanish   855 Kbyte (compressed)     2,473       en-es.zip
English-Swedish   186 Kbyte (compressed)     723         en-sv.zip

Test

Language          Size        Sentences   Download
Round 1
English-French    244 Kbyte   2,000       test-enfr-src.en.sgm
English-German    263 Kbyte   2,000       test-ende-src.en.sgm
English-Greek     310 Kbyte   2,000       test-enel-src.en.sgm
English-Italian   254 Kbyte   2,000       test-enit-src.en.sgm
English-Spanish   235 Kbyte   2,000       test-enes-src.en.sgm
English-Swedish   266 Kbyte   2,000       test-ensv-src.en.sgm
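
The test sources are distributed as .sgm files. The following is a minimal sketch of how the English sentences might be pulled out of such a file, assuming the files use the common SGML layout in which each sentence sits in a `<seg id="N">` element; that layout is an assumption and is not stated on this page. The function name `extract_segments` and the sample text are made up for illustration.

```python
import html
import re

# Assumption: sentences are wrapped as <seg id="N">...</seg>.
# The actual test files may use a different layout.
SEG_RE = re.compile(r'<seg\s+id="(\d+)"\s*>(.*?)</seg>', re.DOTALL | re.IGNORECASE)


def extract_segments(sgm_text):
    """Return a list of (segment id, sentence) pairs found in SGM text."""
    return [
        (int(seg_id), html.unescape(body.strip()))
        for seg_id, body in SEG_RE.findall(sgm_text)
    ]


# Made-up sample, not taken from the real test data.
sample = """<srcset setid="demo" srclang="en">
<doc docid="d1">
<seg id="1">Wash your hands &amp; keep your distance.</seg>
<seg id="2">Stay home if you feel unwell.</seg>
</doc>
</srcset>"""

segments = extract_segments(sample)
for seg_id, sentence in segments:
    print(seg_id, sentence)
```

To apply this to a downloaded file, read it as text first, for example with `open("test-enfr-src.en.sgm", encoding="utf-8").read()`, and pass the result to `extract_segments`. The UTF-8 encoding is likewise an assumption.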

Runs and Rolling Reports

Runs and rolling reports for all rounds are available in the following Git repository.