Is the byte-order mark (BOM) supported?
Yes. The parser detects a BOM at the beginning of the file and uses it to set the encoding type to UTF-8.
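
As an illustration of what BOM detection involves, here is a minimal Python sketch. It is not the parser's own code; the function name `detect_utf8_bom` is hypothetical, and the sketch only assumes that a UTF-8 BOM is the three bytes EF BB BF at the very start of the input.

```python
import codecs


def detect_utf8_bom(data: bytes):
    """Return (encoding, payload) for raw input bytes.

    If the input starts with the UTF-8 BOM (EF BB BF), the encoding is
    reported as "utf-8" and the BOM is stripped from the payload.
    Otherwise the encoding is None and the bytes are returned unchanged.
    """
    if data.startswith(codecs.BOM_UTF8):
        return "utf-8", data[len(codecs.BOM_UTF8):]
    return None, data


if __name__ == "__main__":
    encoding, payload = detect_utf8_bom(b"\xef\xbb\xbf<p>hi</p>")
    print(encoding, payload)
```

When no BOM is present, a caller would have to fall back on some other way of choosing the encoding; the sketch leaves that decision open.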
Which encodings are supported?
ASCII, UTF-8, ISO-8859-1, Windows-1251, and Windows-1252.
Is XHTML or XML supported?
Neither XHTML nor XML is currently supported. If you would like us to add support for either, please contact us and let us know.
Is the script element parsed?
The script element is not parsed. Its contents are placed in a text node, which you can manipulate with any string functions. To prepend or append new scripts, add text nodes containing them before or after the original text node.
Can it be used to validate the HTML?
The parser cannot be used as a validator. It was designed to match the error tolerance of browsers as closely as possible.
Can I use this to convert an encoding type?
The DOM cannot be used to convert from one encoding type to another.
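
If a different encoding is needed, one option is to convert the raw bytes outside the parser, before parsing or after serializing. The Python sketch below shows that idea only; it does not use the parser's API, and the function name `transcode` is hypothetical. It assumes the source encoding is already known.

```python
def transcode(data: bytes, source: str, target: str) -> bytes:
    """Re-encode raw bytes from the source encoding to the target encoding.

    Raises UnicodeDecodeError if the bytes are not valid in the source
    encoding, and UnicodeEncodeError if a character cannot be represented
    in the target encoding.
    """
    return data.decode(source).encode(target)


if __name__ == "__main__":
    # "café" stored as Windows-1252, converted to UTF-8.
    windows_bytes = b"caf\xe9"
    print(transcode(windows_bytes, "cp1252", "utf-8"))
```

Note that any charset declaration inside the document itself is not updated by a byte-level conversion like this and would need to be changed separately.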