CJK Lexical Resources


The CJK Dictionary Institute (CJKI), which specializes in CJK computational lexicography, continuously expands its comprehensive CJK lexical databases. These databases currently cover about 24 million entries in Japanese, Chinese (both Simplified and Traditional), Korean and Arabic, and include a rich set of grammatical and semantic attributes. Major portals and software developers use these resources in a wide variety of applications, including named entity recognition, machine translation, anti-money laundering and risk management, and morphological analysis.



  • Arabic lexical resources
    Proper nouns -- especially personal names -- and their variants.

  • Chinese lexical resources
    Both Simplified Chinese (SC) and Traditional Chinese (TC) data, covering general vocabulary, technical terms, and proper nouns.

  • Japanese lexical resources
    Over seven million entries, including technical terminology, proper nouns and variants, a phonetic database, and lexical and morphological databases.

  • Korean lexical resources
    Extensive proper noun data (both personal and place names), as well as a Korean Lexical Database.

  • Other lexical resources
    CJKI also maintains resources for Persian, Spanish and Vietnamese.