Small library that provides functions to tokenize a string into an array of words with or without punctuation