There are two different approaches to writing a parser with KDE:

It is important to understand that you cannot write an HTML parser using QXmlInputSource unless the input is strict XHTML. A line like

<body lang=DE link=blue vlink=purple bgcolor=#eeeeff>

stops a QXmlInputSource-based parser completely because the quotation marks around the attribute values are missing. To convert an HTML file into an XHTML file, use

  • tidy
  • QTextEdit
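
The failure described above is not specific to Qt; any strict XML parser rejects unquoted attribute values. The following is a minimal sketch that uses Python's standard xml.etree.ElementTree module as a stand-in for a QXmlInputSource-based reader. It is an illustration only and not the Qt API. The helper name is_well_formed and the closing </body> tag added to the sample line are assumptions made so that the unquoted attributes are the only problem in the input.

```python
import xml.etree.ElementTree as ET

# The line from the text, closed with </body> (an addition for this sketch)
# so that the unquoted attribute values are the only fault.
UNQUOTED = "<body lang=DE link=blue vlink=purple bgcolor=#eeeeff></body>"

# The same element as XHTML requires it: every attribute value is quoted.
QUOTED = '<body lang="DE" link="blue" vlink="purple" bgcolor="#eeeeff"></body>'


def is_well_formed(markup):
    """Return True if a strict XML parser accepts the markup (hypothetical helper)."""
    try:
        ET.fromstring(markup)
    except ET.ParseError:
        return False
    return True


print(is_well_formed(UNQUOTED))  # the strict parser rejects this input
print(is_well_formed(QUOTED))    # the quoted form is accepted
```

A check of this kind can be run before and after the conversion to confirm that the converted file is acceptable to an XML parser.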

You can use a parser for

  • converting HTML syntax into MediaWiki syntax, see html2mediawiki
  • programming a spider that follows all links in an HTML file, see spider
  • creating an automated table of contents for an HTML file, see add_toc
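
As a rough illustration of the spider use case, the sketch below collects the link targets found in a piece of HTML. It is not the spider program referenced above and it does not use Qt or KDE classes. Instead it assumes Python's standard html.parser module, a tolerant HTML parser that also accepts unquoted attribute values. The names LinkCollector, extract_links and SAMPLE are hypothetical.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag it encounters."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # Tag and attribute names arrive lower-cased from HTMLParser.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                self.links.append(value)


def extract_links(html_text):
    """Return all link targets in document order (hypothetical helper)."""
    collector = LinkCollector()
    collector.feed(html_text)
    collector.close()
    return collector.links


# Sample input mixing unquoted and quoted attribute values.
SAMPLE = (
    "<body lang=DE link=blue>"
    "<a href=index.html>Home</a> "
    '<a href="faq.html">FAQ</a>'
    "</body>"
)

print(extract_links(SAMPLE))  # prints ['index.html', 'faq.html']
```

A real spider would additionally resolve each collected target against the URL of the page it came from and then fetch and parse that page in turn.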