2012-06-20 01:41:36 +00:00
|
|
|
# cChardet
|
|
|
|
This library is high speed universal character encoding detector. - binding to libcharsetdetect
|
|
|
|
|
|
|
|
# Requires
|
|
|
|
Cython: [http://www.cython.org/](http://www.cython.org/)
|
|
|
|
|
|
|
|
uchardet-enhanced: [https://bitbucket.org/medoc/uchardet-enhanced/overview](https://bitbucket.org/medoc/uchardet-enhanced/overview)
|
|
|
|
|
|
|
|
pip install or easy_install -U cython
|
|
|
|
|
2012-06-20 02:29:50 +00:00
|
|
|
# Benchmark
|
|
|
|
see tests.TestCchardetSpeed
|
|
|
|
Sample(shift_jis): testdata/wikipediaJa_One_Thousand_and_One_Nights.txt
|
|
|
|
chardet: 4.009999990463257s, shift_jis
|
|
|
|
cchardet: 0.0009999275207519531s shift_jis
|
|
|
|
|
2012-06-20 01:41:36 +00:00
|
|
|
# Contact
|
|
|
|
[My blog](http://blog.remu.biz)
|
|
|
|
|
|
|
|
Sorry for my poor English :)
|