Stemming

The Porter stemming algorithm (or ‘Porter stemmer’) is a process for removing the commoner morphological and inflexional endings from words in English. Its main use is as part of a term normalisation process that is usually done when setting up Information Retrieval systems (i.e., indexing).

The algorithm is available pre-coded in several languages, including , , and (of all things) . There are also notes on the Portuguese version, plus a more limited set of implementations.

Genuinely Useful: