diff --git a/README.markdown b/README.markdown index 621361f..effe6da 100644 --- a/README.markdown +++ b/README.markdown @@ -3,46 +3,42 @@ cChardet cChardet is high speed universal character encoding detector. - binding to [charsetdetect](https://bitbucket.org/medoc/uchardet-enhanced/overview). ## Support codecs -* Big5 -* EUC-JP -* EUC-KR -* GB18030 -* HZ-GB-2312 -* IBM855 -* IBM866 -* ISO-2022-CN -* ISO-2022-JP -* ISO-2022-KR -* ISO-8859-2 -* ISO-8859-5 -* ISO-8859-7 -* ISO-8859-8 -* KOI8-R -* Shift_JIS -* TIS-620 -* UTF-8 -* UTF-16BE -* UTF-16LE -* UTF-32BE -* UTF-32LE -* WINDOWS-1250 -* WINDOWS-1251 -* WINDOWS-1252 -* WINDOWS-1253 -* WINDOWS-1255 -* EUC-TW -* X-ISO-10646-UCS-4-2143 -* X-ISO-10646-UCS-4-3412 -* x-mac-cyrillic + +- Big5 +- EUC-JP +- EUC-KR +- GB18030 +- HZ-GB-2312 +- IBM855 +- IBM866 +- ISO-2022-CN +- ISO-2022-JP +- ISO-2022-KR +- ISO-8859-2 +- ISO-8859-5 +- ISO-8859-7 +- ISO-8859-8 +- KOI8-R +- Shift_JIS +- TIS-620 +- UTF-8 +- UTF-16BE +- UTF-16LE +- UTF-32BE +- UTF-32LE +- WINDOWS-1250 +- WINDOWS-1251 +- WINDOWS-1252 +- WINDOWS-1253 +- WINDOWS-1255 +- EUC-TW +- X-ISO-10646-UCS-4-2143 +- X-ISO-10646-UCS-4-3412 +- x-mac-cyrillic ## Requires -* Cython: [http://www.cython.org/](http://www.cython.org/) -e.g.) Ubuntu 12.04 - -```bash -$ sudo apt-get install build-essential python-dev cython -``` +- Cython: [http://www.cython.org/](http://www.cython.org/) ## Installation @@ -50,7 +46,6 @@ $ sudo apt-get install build-essential python-dev cython $ cd /tmp $ git clone git://github.com/PyYoshi/cChardet.git $ cd cChardet -$ python setup.py build $ python setup.py install ``` @@ -65,35 +60,53 @@ $ pip install -U cchardet ```python # -*- coding: utf-8 -*- import cchardet as chardet -with open(r"tests/testdata/wikipediaJa_One_Thousand_and_One_Nights_SJIS.txt", "rb") as f: +with open(r"src/tests/testdata/wikipediaJa_One_Thousand_and_One_Nights_SJIS.txt", "rb") as f: msg = f.read() -result = chardet.detect(msg) -print(result) + result = chardet.detect(msg) + print(result) ``` ## Benchmark -code: [tests.TestCchardetSpeed](https://github.com/PyYoshi/cChardet/blob/master/src/tests/bench.py) -sample: [tests/testdata/wikipediaJa_One_Thousand_and_One_Nights_SJIS.txt](https://github.com/PyYoshi/cChardet/blob/master/src/tests/testdata/wikipediaJa_One_Thousand_and_One_Nights_SJIS.txt) +```bash +$ cd src/ +$ pip install chardet +$ python tests/bench.py +``` -### Performance: -CPU: Intel Core i7 860 2.8GHz +### Performance -RAM: DDR3-1333 16GB +CPU: Intel(R) Core(TM) i3-4170 CPU @ 3.70GHz -Platform: Kubuntu 12.04 amd64, Python 2.7.3 64-bit +RAM: DDR3 1600Mhz 16GB -### Result: +Platform: Ubuntu 16.04 amd64 + +#### Python 2.7.12 - + - + + +
Request (call/s)
chardet0.32chardet0.26
cchardet975.46cchardet1408.73
+ +#### Python 3.5.2 + + + + + + + + + +
Request (call/s)
chardet0.28
cchardet1380.40