掌上百科 - PDAWIKI

 找回密码
 免费注册

QQ登录

只需一步,快速开始

查看: 1123|回复: 10

[词典考据] cod 9th词汇量考

[复制链接]
  • TA的每日心情
    开心
    2019-1-23 08:36
  • 签到天数: 180 天

    [LV.7]常住居民III

    发表于 2019-1-22 17:57:39 | 显示全部楼层 |阅读模式
    本帖最后由 zhuode 于 2019-1-22 17:57 编辑 3 d, @) c0 ~- f' R& E7 n( r

    ! [8 N. P) O8 R- ~8 T
    以下资料来自Google,分别为不同网站介绍COD9光盘或书籍的内容。

    7 l" A' a, e/ @6 {/ ^' z9 T
    7 S" S* t% |; X! [0 u( M. F
    https://www.amazon.co.uk/Oxford-Concise-Dictionary-9th-Edition/dp/B00004UA9J

    # S2 G8 W( d8 J  p7 \6 X, ^2 N
    Oxford Concise Dictionary: 9th Edition
    by Focus Multimedia Ltd
    Platform : Windows 98, Windows 95
    Be the first to review this item
    Available from these sellers.
    CD ROM version of Oxford's best selling single volume paper based dictionary
    Contains over 9,000 added new words and phrases.
    The definitive resource for scholars, professional and general readers alike.
    Contains the most up to date spellings, with improved coverage based on a 'computer' snapshot of today's language.
    150,000 entries and 230,000 definitions
      ~' O1 h/ U" w% [# m& I
    https://www.strandbooks.com/dictionaries/concise-oxford-dictionary-9th-edition

    : a. Z: D3 G, r0 F* d
    PRODUCT DESCRIPTION
    Thumb-indexed edition. This new and updated edition of Oxford's best-selling one-volume reference has fully 25% more content than the previous edition: 150,000 entries; 230,000 definitions. Added features include clearer etymologies, improved coverage of scientific/technical terms, and an up-to-date pronunciation system. 1,648p.
    / N$ j5 z9 I: ~$ y8 m
    https://www.amazon.com/Concise-Oxford-Dictionary-Current-English/dp/0198613199
    + B4 Q# |( n" J& i" Y
    Grunge, nip and tuck, pesto, and chill out--these are but a sampling of the over nine thousand new words and phrases to be found in The Concise Oxford Dictionary, Ninth Edition. This new and updated edition of Oxford's best selling single-volume dictionary brings together the latest and most comprehensive research in current English, based on the massive and constantly expanding database of the Oxford English Dictionary. Containing over 150,000 entries and 230,000 definitions, the Concise boasts up to 25% more content than the previous edition to give users the tools they need to stay current with our ever-changing language. Each entry is written in a clear, straightforward style to facilitate ease of use.
    New features of the ninth edition of the Concise Oxford English Dictionary include:
    · Over 25% new material
    · Over 9,000 new words and phrases
    · 150,000 entries and 230,000 definitions
    · The most up-to-date spellings, with improved coverage of meaning, usage and coverage based on a computer 'snapshot' of today's language
    · Over 300 new boxed usage notes with guidance on good English by a team of expert lexicographers
    · New, clearer etymologies with extensive coverage of idioms, phrases, and abbreviations
    · Improved coverage of scientific and technical terms including the new fields of environmental studies and information services
    · New words and phrases including: pan-fry, post-traumatic stress, body wrap, multiculturalism, Cajun, heli-skiing, and thousands more
    · New, more up-to-date pronunciation system, representing today's received pronunciation
    · Thorough international coverage, making the Concise a bestseller worldwide
    The Concise Oxford Dictionary, Ninth Edition, a standard within its field, is the definitive resource for scholars, professionals, and general readers alike.

    / C4 G$ H8 C  v2 k! i. N1 \0 }$ ^6 `5 A

    9 ~" g) t  C! p, |  h
    CD ROM或Hardcover,一致显示为150000词条,230000释义。我提取该CD ROM数据前先行统计了下,该光盘软件有索引词条数目为85375,与150000差一小半儿。第一时间认为自己统计不全,但一分析又觉得可能并非为此,原因如后。以150000词条,230000释义,一个词条平均1.5个释义,有词典使用经历的人都会觉得不真实,明显偏少。那为什么会是该数据呢?原因是这150000词条可分为三部分:主词条、子词条、词组。常用词典的会有同感,主词条常有多个释义,子词条(或名变形词条,如:happily为happy的子词条)常不含释义,词组常含一个偶有含多个释义。这样一来考虑到会有0个或1个释义的词条的存在,该数据也就可以理解了,那为什么我统计的原盘词条数会少一小半呢?因为原盘只给主词条和子词条䢖了索引,词组部分没有䢖,至于加上词组部分能不能到150000,那就要看数据处理之后全建索引再统计了。

    9 j- R( u; {, ?' y& R
    发此帖的另一个原因是知会大家COD9索引抓取工作大部头已经完成了,感兴趣尤其是打算参与COD9软件mdx化的童鞋可前往了解(原帖:https://www.pdawiki.com/forum/fo ... ad&tid=32390&extra=)。至于剩余部分,其实就是USAGE,现在本人还在寻找合适的提取手段。另外问一下谁知道有什么方法或软件可以将85375个pdf文件合并为一个pdf?

    + n) [0 n, u( v# [
  • TA的每日心情
    奋斗
    2019-4-14 02:12
  • 签到天数: 93 天

    [LV.6]常住居民II

    发表于 2019-1-22 18:04:47 | 显示全部楼层
    请问怎么参与?帮忙制作Mac版本然后分享给参与的人可以咩 4 X, f1 R% Q, p7 K; A
  • TA的每日心情
    开心
    2019-1-23 08:36
  • 签到天数: 180 天

    [LV.7]常住居民III

     楼主| 发表于 2019-1-22 18:06:56 | 显示全部楼层
    jonah_w 发表于 2019-1-22 18:04/ n  s3 ^; b* D5 H
    请问怎么参与?帮忙制作Mac版本然后分享给参与的人可以咩

    6 b9 x' \  W* {" K: d" H当然,不过要等大神mdx化之后喽
  • TA的每日心情
    奋斗
    2019-4-14 02:12
  • 签到天数: 93 天

    [LV.6]常住居民II

    发表于 2019-1-22 18:07:38 | 显示全部楼层
    zhuode 发表于 2019-1-22 18:061 @+ x* w0 Q' k
    当然,不过要等大神mdx化之后喽
    2 ^5 @$ ~4 x+ n  h0 w* u
    好的,非常荣幸。
  • TA的每日心情
    开心
    2019-9-26 22:12
  • 签到天数: 42 天

    [LV.5]常住居民I

    发表于 2019-1-22 18:18:55 | 显示全部楼层
    已经有12版了,为什么还要做第9版?
    5 W+ R8 H! ~) P, }: `2 a$ S1 ^% S
  • TA的每日心情
    开心
    2019-1-23 08:36
  • 签到天数: 180 天

    [LV.7]常住居民III

     楼主| 发表于 2019-1-22 18:34:31 | 显示全部楼层
    wsagun 发表于 2019-1-22 18:18: p4 h) H' k0 W0 e
    已经有12版了,为什么还要做第9版?

    ! ^1 A2 O( {) d参考这儿:https://www.pdawiki.com/forum/fo ... &highlight=COD9
  • TA的每日心情
    慵懒
    2023-1-10 21:48
  • 签到天数: 802 天

    [LV.10]以坛为家III

    发表于 2019-1-22 21:10:24 | 显示全部楼层
    合并PDF可以用adobe acrobat dc
  • TA的每日心情
    开心
    2019-1-23 08:36
  • 签到天数: 180 天

    [LV.7]常住居民III

     楼主| 发表于 2019-1-22 21:23:39 | 显示全部楼层
    duancj 发表于 2019-1-22 21:10
    $ k6 ^& V2 ~+ }. y合并PDF可以用adobe acrobat dc
      I- n' o2 V) I7 y, h
    我试过,得一部分一部分合并,无法一次性合并所有,要不会崩溃
  • TA的每日心情
    奋斗
    2018-5-18 05:02
  • 签到天数: 27 天

    [LV.4]偶尔看看III

    发表于 2019-1-22 22:25:10 | 显示全部楼层
    本帖最后由 GL_n 于 2019-1-22 22:30 编辑 $ x8 A& i5 @4 k! W1 z8 }
    , \9 Q5 f% @8 }) y, G4 K3 @# p
    用python合并,很方便。可能的话,我可以帮你完成合并。
  • TA的每日心情
    开心
    2019-1-23 08:36
  • 签到天数: 180 天

    [LV.7]常住居民III

     楼主| 发表于 2019-1-22 22:30:04 | 显示全部楼层
    GL_n 发表于 2019-1-22 22:25
    9 ?- @- m4 A/ ~9 q8 d4 a3 I用python合并,很方便。
    . f4 ]) V3 j' b' b; B) a
    不会,- -!,我现在在用acrobat一千个一组来合并
  • TA的每日心情

    2019-2-16 21:57
  • 签到天数: 9 天

    [LV.3]偶尔看看II

    发表于 2019-1-26 00:31:27 | 显示全部楼层
    楼主辛苦了!
    您需要登录后才可以回帖 登录 | 免费注册

    本版积分规则

    小黑屋|手机版|Archiver|PDAWIKI |网站地图

    GMT+8, 2024-3-29 17:08 , Processed in 0.031273 second(s), 8 queries , MemCache On.

    Powered by Discuz! X3.4

    Copyright © 2001-2023, Tencent Cloud.

    快速回复 返回顶部 返回列表