stophtml

Tokenizes an HTML string, extracting the plain text and ignoring HTML tags.
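
To make the behavior described above concrete, here is a minimal sketch in Python built on the standard library's `html.parser`. It is not stophtml's own code or API: the names `tokenize_html` and `_TextExtractor` are hypothetical, and the rule for what counts as a token (runs of word characters) is an assumption made for illustration.

```python
import re
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Hypothetical helper: collects text nodes and discards every tag."""

    def __init__(self):
        # convert_charrefs=True decodes entities such as &amp; in text nodes.
        super().__init__(convert_charrefs=True)
        self.chunks = []

    def handle_data(self, data):
        # Called for text between tags; tags themselves never reach here.
        self.chunks.append(data)


def tokenize_html(html):
    """Return the word tokens found in the text content of an HTML string.

    Assumption: a token is a maximal run of word characters (\\w+).
    """
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    # Join chunks with a space so text on either side of a tag stays separate.
    text = " ".join(parser.chunks)
    return re.findall(r"\w+", text)


tokens = tokenize_html("<p>Hello <b>world</b>!</p>")
```

Joining the text chunks with a space means that a tag always acts as a token boundary, so `foo<b>bar</b>` yields two tokens rather than one. This sketch also keeps the contents of `<script>` and `<style>` elements, since the parser reports them as text; a real tokenizer would likely filter those out.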