Package org.htmlcleaner
Class descriptions

Base token interface.
Base class for all tokens.
Browser compact XML serializer - creates resulting XML by stripping whitespace wherever possible, but preserving a single whitespace character where at least one exists.
Properties defining the cleaner's behaviour.
Contains the transformation collection.
Command line usage class.
HTML comment token.
Compact HTML serializer - creates resulting HTML by stripping whitespace wherever possible.
Compact XML serializer - creates resulting XML by stripping whitespace.
Configuration file tag provider - reads an XML file in the specified format and creates a tag provider.
HTML text token.
The default tag provider for HTML Cleaner. Note that this is no longer generated from XML - see https://sourceforge.net/p/htmlcleaner/bugs/81/
Most HTML 4 elements permitted within the BODY are classified as either block-level elements or inline elements.
HTML doctype token.
DOM serializer - creates an XML DOM.
HTML tag end token.
Main HtmlCleaner class.
General HtmlCleaner runtime exception.
Support for ANT.
Marker interface denoting nodes of the document tree.
Abstract HTML serializer - contains common logic for descendants.
Main HTML tokenizer.
Provides a set of TagInfo instances.
JDom serializer - creates an XML JDom instance out of the TagNode.
Contains the MathML tags to use with HTML5 tags.
Pretty HTML serializer - creates resulting HTML with indented lines.
Pretty XML serializer - creates resulting XML with indented lines.
Basic abstract serializer - contains common logic for descendants (methods writeXXX()).
Simple HTML serializer - creates resulting HTML without indenting and/or compacting.
Simple XML serializer - creates resulting XML without indenting lines.
Contains a map of the special entities used in HTML and their Unicode values.
Contains information about a single HTML tag. It also contains rules for tag balancing.
XML node tag - basic node of the cleaned HTML tree.
Defines an action to be performed on TagNodes.
HTML tag token - descendants are the start token (TagNode) and the end token (EndTagToken).
Describes how a specified tag is transformed into another one, or is ignored during parsing.
A traversal-based serializer for DOM; used to avoid recursion and stack overflow for large HTML documents.
Common utilities.
Abstract XML serializer - contains common logic for descendants.
Depth-first node traverser.
Node visitor interface.
Utility for searching the cleaned document tree with XPath expressions.
Exception that could occur during XPather evaluation.
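
The whitespace rule given for the browser compact XML serializer (strip whitespace wherever possible, but keep a single whitespace character where at least one exists) can be illustrated with a small standalone sketch. This is not the library's implementation: the class name `WhitespaceSketch` and the method `collapseWhitespace` are hypothetical, and the sketch assumes that "whitespace" means whatever Java's `\s` regex class matches.

```java
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of org.htmlcleaner.
class WhitespaceSketch {
    // One or more consecutive whitespace characters (spaces, tabs, newlines).
    private static final Pattern WHITESPACE_RUN = Pattern.compile("\\s+");

    // Replaces every run of whitespace with exactly one space, so that
    // a separator survives wherever at least one whitespace character existed.
    static String collapseWhitespace(String text) {
        return WHITESPACE_RUN.matcher(text).replaceAll(" ");
    }

    public static void main(String[] args) {
        String input = "<b>Hello,</b>\n\t  <i>world</i>";
        // prints: <b>Hello,</b> <i>world</i>
        System.out.println(collapseWhitespace(input));
    }
}
```

Keeping one space, rather than removing the run entirely, matters for inline content: dropping all whitespace between the two elements in the example would join the words when a browser renders the result.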