This task is an idea that I got though I was reading through the N-gram text Categorization. Using n-grams (a n-gram is a sequence of n letters) as a substitute of phrases had numerous rewards: Language independent: it is not essential to use stemming algorithm, to use words root (i.e. Also, it doesnt function only for English, seeing that the algorithm learns about prior article Simple tokenization: it has a exceptional way to parse and extract functions, because it is not operating with words it doesnt care about the language. One other excellent notion is to develop manifeste (and no cost) know-how databases of popular things, like as languages texts, that consumers will be ready to download|install and install on their WP. Ajax, ajax, ajax.. you may perhaps head so considerably about ajax, and you could be questioning, whats ajax?, is it a new programming language? Nicely, this is not a programming language, this is only the new pattern which is download|install a website-web page or a part of it when you will need, steering clear of reloading the hole web page. For averting this trouble, there is a new methodology of grow web sites, named AJAX (Asynchronous Javascript And XML), this is a combination concerning client facet (javascript) and server facet (commonly PHP, but could be publish in any language, even statics files). To keep clear of this difficulty, we will need to have two variations of the internet site, Ajax and traditional, considering that ajax is a lot better to surf for people who have a new browser, and we won't be able to ignore world wide web-robots considering that theyll give rewards.