Text_LanguageDetect

Tuesday, February 14, 2006

Stuck in encoding

So I started working on encoding detection... but I ran into some conceptual problems. Specifically, some languages can be written in more than one script, and each script may in turn be stored in several different encodings. For that matter, virtually any language can be written in any phonetic script. What to do about that?
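To make the problem concrete, here is a small sketch in Python (chosen purely for illustration). It encodes one Russian word under three single-byte Cyrillic encodings and shows that the bytes differ each time, and that decoding with the wrong table still "succeeds" but yields different text. The sample word and the list of encodings are assumptions picked for the example.

```python
# Illustration only: the sample word and the encoding list are assumptions.
word = "привет"
encodings = ["koi8_r", "cp1251", "iso8859_5"]

# One script, three encodings: the byte sequences all differ.
encoded = {name: word.encode(name) for name in encodings}
for name, raw in encoded.items():
    print(f"{name:10} {raw.hex(' ')}")

# Decoding with the wrong table raises no error; it just gives other text,
# so the bytes alone do not say which encoding produced them.
misread = encoded["cp1251"].decode("koi8_r")
print(misread)
```

The second half is the part that makes detection awkward: with single-byte encodings nearly every byte sequence is "valid" in every candidate, so validity checks alone cannot settle the question.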

Thursday, February 02, 2006

Encoding detection

I've figured out how I want to do encoding detection... I've decided to take an incremental approach and support only single-byte encodings at first.
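The post doesn't describe the method itself, so the following is only one possible shape for a single-byte-only first pass, sketched in Python for illustration. It decodes the input under each candidate encoding and keeps the candidate whose decoded text contains the most common lowercase Russian letters. The function names, the candidate list, and the letter set are all hypothetical, and a real detector would more likely use proper frequency statistics such as n-grams.

```python
# Hypothetical sketch: none of these names come from Text_LanguageDetect.
CANDIDATES = ["koi8_r", "cp1251", "iso8859_5"]

# Assumed set of frequent lowercase Russian letters, used as a crude score.
COMMON_LETTERS = set("оеаинтсрвл")


def score(text):
    """Count characters that belong to the assumed frequent-letter set."""
    return sum(1 for ch in text if ch in COMMON_LETTERS)


def guess_encoding(raw, candidates=CANDIDATES):
    """Return the candidate whose decoding of `raw` scores highest."""
    return max(
        candidates,
        key=lambda name: score(raw.decode(name, errors="replace")),
    )


sample = "привет, как дела"
for name in CANDIDATES:
    print(name, "->", guess_encoding(sample.encode(name)))
```

Restricting the candidates to single-byte encodings keeps this simple, since each byte maps to exactly one character and there are no multi-byte sequences to resynchronise. The heuristic assumes mostly lowercase text in a known language, which is exactly the circularity a combined language and encoding detector has to deal with.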

Hopefully I'll have some time to work on this over the weekend.