natural
General natural language facilities for Node.js: tokenizing, stemming (English, Russian, Spanish), part-of-speech tagging, sentiment analysis, classification, inflection, phonetics, tf-idf, WordNet, Jaro-Winkler, Levenshtein distance, and Dice's coefficient.
methodius-cli
Run Methodius from the command line. Analyze text for n-grams and their frequencies with ease.
simplengrams
The easiest way to get n-gram chunks from strings or token arrays!
ngrammy
N-gram search index that is character based and supports Unicode. Useful for implementing autocomplete in functional programming style.
ngram-diff
A tiny package to visualize n-gram similarity in reasonably sized chunks of text.
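
For readers unfamiliar with the term, the sketch below shows what extracting and counting word n-grams can look like. It is not the API of any package listed above: the names `ngrams` and `ngramFrequencies` are hypothetical, and whitespace tokenization with lowercasing is an assumption made to keep the example short.

```typescript
// Hypothetical helpers for illustration only; none of the packages
// listed above is known to expose these names or signatures.

// Return every contiguous run of n items from a token array.
function ngrams<T>(tokens: T[], n: number): T[][] {
  if (!Number.isInteger(n) || n < 1) {
    throw new RangeError("n must be a positive integer");
  }
  const out: T[][] = [];
  for (let i = 0; i + n <= tokens.length; i++) {
    out.push(tokens.slice(i, i + n));
  }
  return out;
}

// Count how often each word n-gram occurs in a string.
// Assumption: tokens are lowercased and split on whitespace.
function ngramFrequencies(text: string, n: number): Map<string, number> {
  const tokens = text
    .toLowerCase()
    .split(/\s+/)
    .filter((t) => t.length > 0);
  const counts = new Map<string, number>();
  for (const gram of ngrams(tokens, n)) {
    const key = gram.join(" ");
    counts.set(key, (counts.get(key) ?? 0) + 1);
  }
  return counts;
}
```

Calling `ngrams(["a", "b", "c"], 2)` yields the two bigrams `["a", "b"]` and `["b", "c"]`. An input shorter than `n` yields an empty array instead of an error, which keeps the function easy to use on short strings.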