Download the UN Parallel Corpus ! ^/ J4 {+ G" i- N我下载了。解压了大约有3G大的。不过不知道怎么打开用。有高手请教下# v* o$ g+ W: v2 k. i% E8 u) p
' ?5 A+ m. b+ RXML文件: / K$ ^' ^% e! g7 v2 u2 n }/ j0 B- H; xUNv1.0-TEI.zh.tar.gz.00, s$ B% \8 s! e% @+ J$ Y
1 @/ B/ K) W2 F( |
Plain-text bitexts 6 v0 S5 i5 l9 Q$ Z/ B" fWe also make available plain-text bitexts that span all documents for a specific language pair and can be used more readily with SMT training pipelines. Inside a language-pair specific archive consists of a plain-text file for each language and one file with ids. 5 s& _" O0 ^' h$ U7 T1 ^链接) D) E' b7 K0 x0 P6 w+ c