View Source Unicode.Unihan.Utils (Unicode v0.3.0)
Functions to parse the Unicode Unihan database files.
Link to this section Summary
Functions
Parse the jyutping_index.csv file.
Parse one Unicode Unihan file and return a mapping from codepoint to a map of metadata for that codepoint.
Parse all Unicode Unihan files and return a mapping from codepoint to a map of metadata for that codepoint.
Parse the cjk_radicals.txt file.
Returns a map of the field definitions for a Unihan codepoint.
Link to this section Functions
Parse the jyutping_index.csv file.
Parse one Unicode Unihan file and return a mapping from codepoint to a map of metadata for that codepoint.
Parse all Unicode Unihan files and return a mapping from codepoint to a map of metadata for that codepoint.
Parse the cjk_radicals.txt file.
There is one line per CJK radical number. Each line contains three fields, separated by a semicolon (';'). The first field is the CJK radical number. The second field is the CJK radical character. The third field is the CJK unified ideograph.
Simplified radicals are represented by radical numbers with a
trailing apostrophe '
.
Returns a map of the field definitions for a Unihan codepoint.