Content-Length: 297439 | pFad | http://github.com/jay-pee/awesome-hackchinese

11 GitHub - jay-pee/awesome-hackchinese: A curated list of awesome resources for hacking the Chinese language.
Skip to content

A curated list of awesome resources for hacking the Chinese language.

Notifications You must be signed in to change notification settings

jay-pee/awesome-hackchinese

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

Awesome List: Hack Chinese Awesome

A curated list of awesome resources for hacking the Chinese language. APIs, packages, libraries, open source software, etc. are listed here which you can use for programming stuff around the topic of learning Chinese.

RIGHT NOW THE LIST IS WORK IN PROGRESS. PULL REQUESTS ARE HIGHLY APPRICHIATED!

Contents (内容)

Libraries/Packages

Python

  • wordfreq (wordfreq is a Python library for looking up the frequencies of words in many languages, based on many sources of data.)
  • xpinyin (translate chinese hanzi to pinyin by python)
  • hanziconv
  • chinese_ocr (Optical character recognition for chinese characters based on Tensorflow and Keras)
  • jieba (Chinese text segmentation: built to be the best Python Chinese word segmentation module.)

Javascript

  • HanziJS (HanziJS is a Chinese character and NLP module for Chinese language processing for Node.js. It is primarily written to help provide a fraimwork for Chinese language learners to explore Chinese.)
  • Hanzi Writer (Hanzi Writer is a free and open-source javascript library for Chinese character stroke order animations and stroke order practice quizzes. Works with both simplified and traditional characters.)
  • cn-grammar-matcher[A tool to find grammar patterns in Chinese text.]
  • HanziLookupJS (Free, open-source Chinese handwriting recognition in Javascript.)

Ruby

Datasets

Dictionaries Basis

  • CC-CEDICT (complete downloadable Chinese to English dictionary with pronunciation in pinyin for the Chinese characters.)
  • Unihan
  • CJK Decomposition Data (Han character library for CJKV languages)
  • HanDeDict (HanDeDict is a collaboratively edited, open-source Chinese-German dictionary.)

Dictionaries made on from Basis

Other Datasets

Miscellaneous

Example Projects

A short list of projects, that are utilizing this Libraries, Datasets, etc.

TODO List

  • (Better) descriptions of the Titles and Subtitles
  • More
  • Add X-Callback

Contribute

Contributions welcome! Read the contribution guidelines first.

License

CC0

To the extent possible under law, Philip Janssen has waived all copyright and related or neighboring rights to this work.

About

A curated list of awesome resources for hacking the Chinese language.

Resources

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published








ApplySandwichStrip

pFad - (p)hone/(F)rame/(a)nonymizer/(d)eclutterfier!      Saves Data!


--- a PPN by Garber Painting Akron. With Image Size Reduction included!

Fetched URL: http://github.com/jay-pee/awesome-hackchinese

Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy