掌上百科 - PDAWIKI

 找回密码
 免费注册

QQ登录

只需一步,快速开始

查看: 276|回复: 0

[词典讯] 小语种的全量变形表

[复制链接]
  • TA的每日心情

    7 天前
  • 签到天数: 184 天

    [LV.7]常住居民III

    发表于 2018-3-5 16:44:40 | 显示全部楼层 |阅读模式
    本帖最后由 孤影 于 2018-3-6 23:27 编辑 ) J6 ~* B0 b! X# z) n% Q

    3 Y! R4 F4 ~& \+ K5 j% Lhttp://www.lexiconista.com/datasets/lemmatization/3 f/ p' f5 X: `# b
    & d3 ^. E9 m1 B; t- r/ K
    These are large-coverage, machine-readable lemma/token pairs in several languages which I have collected (legally) from various sources, mostly as part of my work on the Global Glossary project. I use these for query expansion during fulltext searches: if a user searches for the lemma walk, the query is expanded to also search for the tokens walking, walked etc.& a, `8 M5 c! |* x/ ]- N

    7 t$ S  G5 i/ \  M6 r3 XThese are plain text files (zipped). Each line contains one lemma/token pair separated by a tab character in this sequence: lemma, tab, token. The files are encoded in UTF-8 with Windows-style line breaks.
    您需要登录后才可以回帖 登录 | 免费注册

    本版积分规则

    顶部qrcode底部
    关注掌上百科官方微信公众号送积分
    关注掌上百科官方微信公众号
    关注微信公众号 pdawiki,获取邀请码,看文抢积分,抽奖得浮云! Follow our Wechat official account "pdawiki", get invitation codes, and play the lottery to earn points (积分)!

    小黑屋|手机版|Archiver|PDAWIKI |网站地图

    GMT+8, 2020-11-30 06:38 , Processed in 0.020910 second(s), 9 queries , MemCache On.

    Powered by Discuz! X3.4

    Copyright © 2001-2020, Tencent Cloud.

    快速回复 返回顶部 返回列表