A parser and converter for Unicode Characters, including (standard and extended) Numerical Character References