Our goal is to develop a natural curriculum for Chinese, starting from the fundamentals. We ask two questions: How was Chinese created? How can we use that to teach people Chinese?
Using the logographic/symbolic nature of Chinese, we aim to create a "character net" that groups together characters sharing the same basic elements. This will help educators develop an intuitive curriculum for teaching.
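To make the idea of a character net concrete, here is a minimal sketch in Python. The decompositions in `COMPONENTS` are written by hand purely for illustration, and the function names `build_character_net` and `related` are hypothetical; this is not the project's actual data or code.

```python
from collections import defaultdict

# Hand-written, illustrative decompositions (an assumption for this sketch).
COMPONENTS = {
    "河": ["氵", "可"],        # river
    "湖": ["氵", "胡"],        # lake
    "海": ["氵", "每"],        # sea
    "林": ["木", "木"],        # grove
    "森": ["木", "木", "木"],  # forest
    "桥": ["木", "乔"],        # bridge
}


def build_character_net(components):
    """Map each basic element to the characters that contain it."""
    net = defaultdict(set)
    for char, parts in components.items():
        for part in parts:
            net[part].add(char)
    return {part: sorted(chars) for part, chars in net.items()}


def related(char, components):
    """Return the characters that share at least one element with `char`."""
    net = build_character_net(components)
    linked = set()
    for part in components[char]:
        linked.update(net[part])
    linked.discard(char)  # a character is not its own neighbour
    return sorted(linked)


net = build_character_net(COMPONENTS)
print(net["氵"])                  # characters built on the water element
print(related("林", COMPONENTS))  # characters sharing an element with 林
```

A curriculum built on such a net could introduce an element like 氵 once and then teach every character that contains it as a group.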
How can machine learning help?
How do we find linkages between characters through these simple elements? Luckily, we don't have to draw the connections by hand: we can use modern machine learning techniques!
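As a rough illustration of finding such linkages without any hand-drawn connections, the sketch below groups toy glyphs using only their pixels. Everything here is an assumption made for the example: the glyphs are tiny made-up bitmaps, and plain pixel overlap (Jaccard similarity) with threshold-based single-linkage grouping stands in for whatever learned features a real system would use on actual character images.

```python
from itertools import combinations

# Toy "radicals" (left half) and "remainders" (right half) as 4x3 bitmaps.
LEFT = {
    "A": ["#..", "#..", "#..", "###"],
    "B": [".##", ".##", ".##", "..."],
}
RIGHT = {
    "1": ["#..", "#..", "...", "..."],
    "2": ["...", "...", ".#.", ".#."],
    "3": ["..#", "...", "...", "..#"],
}


def pixels(rows):
    """Return the set of (row, col) positions that are switched on."""
    return {(r, c) for r, row in enumerate(rows)
            for c, ch in enumerate(row) if ch == "#"}


def make_glyph(left, right):
    """Join a left and a right half into one glyph's pixel set."""
    return pixels([a + b for a, b in zip(LEFT[left], RIGHT[right])])


def jaccard(a, b):
    """Shared pixels divided by total pixels covered by either glyph."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster(glyphs, threshold=0.5):
    """Link glyphs whose similarity meets the threshold, then return groups."""
    parent = {name: name for name in glyphs}

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for a, b in combinations(glyphs, 2):
        if jaccard(glyphs[a], glyphs[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for name in glyphs:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(group) for group in groups.values())


GLYPHS = {l + r: make_glyph(l, r)
          for l, r in [("A", "1"), ("A", "2"), ("A", "3"), ("B", "1"), ("B", "2")]}
clusters = cluster(GLYPHS)
print(clusters)
```

The glyphs that share a left half end up in the same group even though no decomposition table was supplied, which is the property that makes a dictionary-free approach appealing.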
What else can we do?
Our character net has the potential to dramatically decrease the amount of memorization needed to learn Chinese. The same process can be applied not only to Chinese but also to other logogram-based languages. Check out this map from Wikipedia showing where logogram-based languages are used around the world:
Since the clustering process does not rely on a dictionary, these machine learning techniques could even help decipher ancient scripts such as Mayan!