cod 9th词汇量考
本帖最后由 zhuode 于 2019-1-22 17:57 编辑以下资料来自Google,分别为不同网站介绍COD9光盘或书籍的内容。
https://www.amazon.co.uk/Oxford-Concise-Dictionary-9th-Edition/dp/B00004UA9J
Oxford Concise Dictionary: 9th Editionby Focus Multimedia LtdPlatform : Windows 98, Windows 95Be the first to review this itemAvailable from these sellers.CD ROM version of Oxford's best selling single volume paper based dictionaryContains over 9,000 added new words and phrases.The definitive resource for scholars, professional and general readers alike.Contains the most up to date spellings, with improved coverage based on a 'computer' snapshot of today's language.150,000 entries and 230,000 definitions
https://www.strandbooks.com/dictionaries/concise-oxford-dictionary-9th-edition
PRODUCT DESCRIPTIONThumb-indexed edition. This new and updated edition of Oxford's best-selling one-volume reference has fully 25% more content than the previous edition: 150,000 entries; 230,000 definitions. Added features include clearer etymologies, improved coverage of scientific/technical terms, and an up-to-date pronunciation system. 1,648p.
https://www.amazon.com/Concise-Oxford-Dictionary-Current-English/dp/0198613199
Grunge, nip and tuck, pesto, and chill out--these are but a sampling of the over nine thousand new words and phrases to be found in The Concise Oxford Dictionary, Ninth Edition. This new and updated edition of Oxford's best selling single-volume dictionary brings together the latest and most comprehensive research in current English, based on the massive and constantly expanding database of the Oxford English Dictionary. Containing over 150,000 entries and 230,000 definitions, the Concise boasts up to 25% more content than the previous edition to give users the tools they need to stay current with our ever-changing language. Each entry is written in a clear, straightforward style to facilitate ease of use.New features of the ninth edition of the Concise Oxford English Dictionary include:· Over 25% new material· Over 9,000 new words and phrases· 150,000 entries and 230,000 definitions· The most up-to-date spellings, with improved coverage of meaning, usage and coverage based on a computer 'snapshot' of today's language· Over 300 new boxed usage notes with guidance on good English by a team of expert lexicographers· New, clearer etymologies with extensive coverage of idioms, phrases, and abbreviations· Improved coverage of scientific and technical terms including the new fields of environmental studies and information services· New words and phrases including: pan-fry, post-traumatic stress, body wrap, multiculturalism, Cajun, heli-skiing, and thousands more· New, more up-to-date pronunciation system, representing today's received pronunciation· Thorough international coverage, making the Concise a bestseller worldwideThe Concise Oxford Dictionary, Ninth Edition, a standard within its field, is the definitive resource for scholars, professionals, and general readers alike.
CD ROM或Hardcover,一致显示为150000词条,230000释义。我提取该CD ROM数据前先行统计了下,该光盘软件有索引词条数目为85375,与150000差一小半儿。第一时间认为自己统计不全,但一分析又觉得可能并非为此,原因如后。以150000词条,230000释义,一个词条平均1.5个释义,有词典使用经历的人都会觉得不真实,明显偏少。那为什么会是该数据呢?原因是这150000词条可分为三部分:主词条、子词条、词组。常用词典的会有同感,主词条常有多个释义,子词条(或名变形词条,如:happily为happy的子词条)常不含释义,词组常含一个偶有含多个释义。这样一来考虑到会有0个或1个释义的词条的存在,该数据也就可以理解了,那为什么我统计的原盘词条数会少一小半呢?因为原盘只给主词条和子词条䢖了索引,词组部分没有䢖,至于加上词组部分能不能到150000,那就要看数据处理之后全建索引再统计了。
发此帖的另一个原因是知会大家COD9索引抓取工作大部头已经完成了,感兴趣尤其是打算参与COD9软件mdx化的童鞋可前往了解(原帖:https://www.pdawiki.com/forum/fo ... ad&tid=32390&extra=)。至于剩余部分,其实就是USAGE,现在本人还在寻找合适的提取手段。另外问一下谁知道有什么方法或软件可以将85375个pdf文件合并为一个pdf?
请问怎么参与?帮忙制作Mac版本然后分享给参与的人可以咩 {:4_96:}
jonah_w 发表于 2019-1-22 18:04
请问怎么参与?帮忙制作Mac版本然后分享给参与的人可以咩
当然,不过要等大神mdx化之后喽 zhuode 发表于 2019-1-22 18:06
当然,不过要等大神mdx化之后喽
好的,非常荣幸。 已经有12版了,为什么还要做第9版?
wsagun 发表于 2019-1-22 18:18
已经有12版了,为什么还要做第9版?
参考这儿:https://www.pdawiki.com/forum/forum.php?mod=viewthread&tid=32380&highlight=COD9 合并PDF可以用adobe acrobat dc duancj 发表于 2019-1-22 21:10
合并PDF可以用adobe acrobat dc
我试过,得一部分一部分合并,无法一次性合并所有,要不会崩溃 本帖最后由 GL_n 于 2019-1-22 22:30 编辑
用python合并,很方便。可能的话,我可以帮你完成合并。 GL_n 发表于 2019-1-22 22:25
用python合并,很方便。
不会,- -!,我现在在用acrobat一千个一组来合并 楼主辛苦了!
页:
[1]