|
Xport's parser
type alias, (and other aliases typed
from xhtml_parser), parses xhtml and html from files and input streams.
Parsed input is saved to a document object. The parsed input can be either a full well formed document, or can be a partial document. Parsing partial documents is made possible in the case where a valid xhtml document may be comprised of other partial xhtml documents.
If the input being parsed is valid, well formed xhtml, parser
will parse the input correctly with no issues. If the input is invalid xhtml or
html, parser
will parse the input as well as possible, and the parsed output will be valid well formed xhtml. Depending on badly mal-formed
the input is, the parsed output may not contain all of the content of the input.
The parser allows logging capabilities to monitor or review the parsing operation. The level of logging detail can be set, as well as the type of logging output.
The type alias that you use for your parser
depends on two factors. The document type you wish to parse the input to, and the character type of the input and
resulting document. The list below reveals the available type aliases for Xport's
parser
.
There are only a few operations availabe for parser
, which are listed below.