Links and more
- Run a test with your own text file, or select one of the languages listed on the right.
- The basic n-gram functions plus a demo script to implement your own language detection in PHP.
- As an OOP alternative, use the PHP class LangDetect together with an example.php.
- To implement your own language detection, you need to download the fingerprints of 70+ languages. With the LangDetect PHP class you can generate further fingerprints yourself.
- Weighted World Maps on Wealth, Population, Greenhouse emissions and Internet usage.
- Read more about how n-grams work at Wikipedia.
- Also see the recommended Python n-gram script (TextCat) by Thomas Mangin.
- Inspired by the Language Identifier from Xerox and by Gertjan van Noord's Textcat, written in Perl, and his demo.
- There are two publications worth reading about different methods of Textual Analysis and Language Identification.
- Another publication about Evaluation of Language Identification Methods by Simon Kranig at the University of Tübingen.
- Read the very instructive blog post by "The Nameless One" titled "How to detect which language a text is written in? Or when science meets human!", which includes several links.
- The Ajax pop-ups used on this website were done with Greybox, "a pop-up window that doesn't suck".
- The JavaScript libraries used on this website are from script.aculo.us.
- There are approximately 6900 languages currently spoken around the world. Top 30 languages of the world and much more at World Languages & Cultures.
- Visit OpenSubtitles.org, the biggest database for DivX subtitles on earth.