Baxter-Sagart Old Chinese reconstruction (Version 1.00, 20 Feb. 2011)

‼ An up-to-date version of the Baxter-Sagart reconstruction is in preparation and will be available soon

Middle Chinese readings in Baxter’s notation for 9000 characters:

  • NEW! For an 89-page PDF document (two pages per PDF page) giving Middle Chinese readings for 9000 Guangyun characters in Baxter’s notation, sorted in pinyin alphabetical order, click here.


All files use Unicode (UTF-8) encoding for Chinese characters and phonetic symbols. Every Chinese character in the present file belongs to one of the following sets:

Unicode CJK Unified Ideographs (U+4E00 to U+9FFF) (4018 entries)

Unicode CJK Unified Ideographs, Extension A (U+3400 to U+4DBF) (33 entries)

Unicode CJK Unified Ideographs, Extension B (U+20000 to U+2A6DF) (34 entries)
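
As a rough illustration, the three code point ranges listed above can be checked programmatically, for example to confirm that a font or a text-processing tool covers every character in the file. The Python sketch below is not part of the original distribution; the names CJK_BLOCKS, cjk_block, and count_by_block are hypothetical, and only the ranges themselves come from the list above.

```python
from collections import Counter

# Ranges copied from the list above; the names are illustrative only.
CJK_BLOCKS = [
    ("CJK Unified Ideographs", 0x4E00, 0x9FFF),
    ("CJK Unified Ideographs Extension A", 0x3400, 0x4DBF),
    ("CJK Unified Ideographs Extension B", 0x20000, 0x2A6DF),
]


def cjk_block(char):
    """Return the name of the listed block containing char, or None."""
    code_point = ord(char)
    for name, start, end in CJK_BLOCKS:
        if start <= code_point <= end:
            return name
    return None


def count_by_block(text):
    """Count how many characters of text fall in each listed block.

    Characters outside all three ranges are ignored.
    """
    counts = Counter()
    for char in text:
        name = cjk_block(char)
        if name is not None:
            counts[name] += 1
    return counts
```

Characters outside the three ranges, such as Latin letters or phonetic symbols, return None from cjk_block and are skipped by count_by_block. A file would need to be decoded as UTF-8 first (for example with open(path, encoding="utf-8")) before its text is passed to these functions.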

We use the font "SimSun Founder Extended" (file name: SURSONG.ttf), but any Unicode font that includes the character sets listed above should be adequate.

A free Unicode font for phonetic symbols is available at