Usage

Basic Usage

The easiest way to get started is to use the detect method.

>>> from chardetng_py import detect
>>> detect(b'Jakby r\xeaka Boga')
'windows-1254'

There is also a detect method available for compatability with chardet, but it will always report None for the language. The confidence value will either be 0.99 or 0.01 depending on whether chardetng returns a “high” or “low” confidence flag.

>>> from chardetng_py.compat import detect
>>> detect(b'Jakby r\xeaka Boga')
{'encoding': 'windows-1254', 'confidence': 0.99, 'language': None}

Advanced Usage

It is also possible to use the EncodingDetector class directly.

>>> from chardetng_py import EncodingDetector
>>> detector = EncodingDetector()
>>> detector.feed(b'Jakby r\xeaka Boga', last=True)
>>> detector.guess(tld=None, allow_utf8=True)
'windows-1254'