@dvdagames/pgn-tokenizer

TypeScript version of PGN Tokenizer, a Byte Pair Encoding (BPE) tokenizer for Chess Portable Game Notiation (PGN).

kitoken

Fast and versatile tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization