org.basex.build.file
Class HTMLParser

java.lang.Object
  extended by org.basex.core.Proc
      extended by org.basex.build.Parser
          extended by org.basex.build.SingleParser
              extended by org.basex.build.xml.XMLParser
                  extended by org.basex.build.file.HTMLParser

public final class HTMLParser
extends XMLParser

This class uses TagSoup to convert HTML input to well-formed XML. If TagSoup is not found in the classpath, the original document is passed on. TagSoup was written by John Cowan and is based on the Apache 2.0 License: http://home.ccil.org/~cowan/XML/tagsoup/.

Author:
BaseX Team 2005-12, BSD License, Christian Gruen

Field Summary
static java.lang.String NAME
          Name of HTML Parser.
 
Fields inherited from class org.basex.build.SingleParser
builder
 
Fields inherited from class org.basex.build.Parser
atts, nsp, prop, src
 
Fields inherited from class org.basex.core.Proc
listen, updating
 
Constructor Summary
HTMLParser(IO source, Prop pr)
          Constructor.
HTMLParser(IO source, java.lang.String options, Prop pr)
          Constructor.
 
Method Summary
static boolean available()
          Checks if a CatalogResolver is available.
static java.lang.String parser()
          Returns the name of the parser, or an empty string.
 
Methods inherited from class org.basex.build.xml.XMLParser
close, det, parse, prog
 
Methods inherited from class org.basex.build.SingleParser
builder, parse
 
Methods inherited from class org.basex.build.Parser
emptyParser, info, singleParser, target, xmlParser
 
Methods inherited from class org.basex.core.Proc
abort, checkStop, databases, detail, listen, proc, progress, registered, registered, startTimeout, stop, stopTimeout, tit, title
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

NAME

public static final java.lang.String NAME
Name of HTML Parser.

See Also:
Constant Field Values
Constructor Detail

HTMLParser

public HTMLParser(IO source,
                  Prop pr)
           throws java.io.IOException
Constructor.

Parameters:
source - document source
pr - database properties
Throws:
java.io.IOException - I/O exception

HTMLParser

public HTMLParser(IO source,
                  java.lang.String options,
                  Prop pr)
           throws java.io.IOException
Constructor.

Parameters:
source - document source
options - options
pr - database properties
Throws:
java.io.IOException - I/O exception
Method Detail

available

public static boolean available()
Checks if a CatalogResolver is available.

Returns:
result of check

parser

public static java.lang.String parser()
Returns the name of the parser, or an empty string.

Returns:
name of parser