The JDK has a built-in HTML parser that supports HTML 1.0 or so. It should support parsing of basic tags and text formatting forms.
The reason for using other third-party parsers is the need to support "real" HTML pages DHTML, JavaScript, etc.
JSoup is one of the popular parsers that can do this work. For more information about other implementations, please take a look at the following discussion:
Pure Java HTML viewer / renderer for use in scrolling
source share