natural
General natural language facilities for Node.js: tokenizing, stemming (English, Russian, Spanish), part-of-speech tagging, sentiment analysis, classification, inflection, phonetics, tf-idf, WordNet, Jaro-Winkler, Levenshtein distance, and Dice's coefficient.
wuzzy
Library for similarity identification
trigram-similarity
Determining the similarity of alphanumeric text based on trigram matching
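
To illustrate what trigram matching means, here is a minimal sketch in plain JavaScript. It is not the API of the trigram-similarity package. The function names `trigrams` and `trigramSimilarity`, the word-padding scheme, and the choice of the Jaccard index as the score are all assumptions made for this example.

```javascript
// Illustrative sketch only; function names and scoring are assumptions,
// not the interface of any package listed above.

// Collect the set of three-character substrings (trigrams) in a text.
function trigrams(text) {
  // Lowercase and turn every run of non-alphanumeric characters into one space.
  const normalized = text.toLowerCase().replace(/[^a-z0-9]+/g, ' ').trim();
  const grams = new Set();
  for (const word of normalized.split(' ')) {
    if (word === '') continue;
    // Assumed padding: two leading spaces and one trailing space per word,
    // so that word boundaries also contribute trigrams.
    const padded = `  ${word} `;
    for (let i = 0; i + 3 <= padded.length; i++) {
      grams.add(padded.slice(i, i + 3));
    }
  }
  return grams;
}

// Score two texts as shared trigrams divided by all distinct trigrams
// (the Jaccard index of the two trigram sets), a value between 0 and 1.
function trigramSimilarity(a, b) {
  const ta = trigrams(a);
  const tb = trigrams(b);
  let shared = 0;
  for (const g of ta) {
    if (tb.has(g)) shared++;
  }
  const union = ta.size + tb.size - shared;
  // Two inputs with no alphanumeric content are treated as dissimilar here.
  return union === 0 ? 0 : shared / union;
}
```

With these assumptions, identical strings score 1, strings with no trigram in common score 0, and `trigramSimilarity('night', 'nacht')` gives 0.2, because the two words share 2 of their 10 distinct trigrams.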