Note
Updates in this Fork: This fork includes small fixes and improvements. The main change is the addition of a GitHub action that automatically converts the latest version of CC-CEDICT on a monthly basis. This ensures that the dictionary is always up-to-date with the latest entries from CC-CEDICT.
A Python script that converts the CC-CEDICT file to a Yomichan-supported dictionary file.
Yomitan is a browser extension, mainly used for Japanese, which allows opening dictionaries when mouse is over a word.
Although there already exists a CC-CEDICT dictionary from shoui, it is not simplistic enough, for example the searched word and its pinyin can be seen twice. Thus, I created my own version for it.
The new dictionary with 课 opened:
The dictionary has new bullet points after classifiers (CL) because there usually is a new meaning after them. Other dictionaries usually lump everything into one line, which is not desirable.
It is also possible to use bullet points as the seperator instead of commas and express tones with numbers. Bullet points make sense with words which don't have classifiers. Tone numbers are useful because ā and ǎ look the same when the font is small.
The new dictionary with 课 opened with bullet points as the seperator and pinyin with number tones:
Download links for
- CC-CEDICT dictionary
- CC-CEDICT dictionary with number tones
- CC-CEDICT dictionary with bullet points
- CC-CEDICT dictionary with number tones and bullet points
https://github.com/lonlazer/cc-cedict-for-yomichan/releases/latest
- You need to have Yomitan.
- Download the dictionary (zip file) from the Download section and don't unzip it.
- Yomichan > Settings > Dictionaries > Configure installed and enabled dictionaries... > Import.
- Choose the dictionary (zip file).
CC-CEDICT can be downloaded from here. https://www.mdbg.net/chinese/dictionary?page=cc-cedict
Run by typing
python main.py <unpacked cc-cedict filepath>
If you want bullet point seperators, use --seperate
flag. Pinyin numbers can be enabled with --pinyin-numbers
flag.