
The CJKI Chinese lexical database currently contains over four million Simplified Chinese (SC) and Traditional Chinese (TC) headwords covering general vocabulary, important technical terms, and proper nouns. Each lexeme is accompanied by a pinyin reading or readings, and various other attributes (see chinword.htm for details).
What is especially noteworthy is that the pinyin readings take into account the differences in pronunciation between Taiwan and the People's Republic of China, as shown in the table below. Even highly educated native Chinese speakers are often surprised to discover that such differences exist.
Our pinyin readings have been thoroughly proofread for accuracy, and explicitly indicate the neutral tone, which is often ignored by conventional dictionaries. This data, which can be provided in all the major transcription systems such as Yale, Wade-Giles, Zhuyin and IPA, is especially useful for speech technology applications, such as TTS (text-to-speech) software.
| SC Pinyin | TC Pinyin
| qi3 | qi4
| wei1 | wei2
| wei1 | wei2
| fa4 | fa3
| lin2-qi1 | lin2-qi2 |
| qi1-dai4 | qi2-dai4 |
| qi3-ye4 | qi4-ye4 |
| xian3-wei1-jing4 | xian3-wei2-jing4 |
| wei1-xiao4 | wei2-xiao4 |
| li3-fa4 | li3-fa3
| wei1-xian3 | wei2-xian3
| |
|---|
| Hanzi | POS | Pinyin | Rank |
|---|---|---|---|
| 抱拢 | V | bao4-long3 | C |
| 报录 | NC | bao4-lu4 | C |
| 暴露 | V | bao4-lu4 | A |
| 暴乱 | NC | bao4-luan4 | A |
| 鲍螺 | NC | bao4-luo2 | C |
| 抱锣 | V | bao4-luo2 | C |
| 暴落 | NC | bao4-luo4 | C |
| 豹略 | NC | bao4-lve4 | C |
| 暴掠 | V | bao4-lve4 | C |
| 报马 | V | bao4-ma3 | C |
| 爆满 | V | bao4-man3 | C |
| 豹猫 | NC | bao4-mao1 | C |
| 暴民 | NC | bao4-min2 | C |
| 报名 | V | bao4-ming2 | A |
| 爆鸣 | V | bao4-ming2 | C |
| 报命 | NC | bao4-ming4 | C |
| 报幕 | V | bao4-mu4 | C |