VB.NET
nodes with the intent to make it easy for programs to identify and extract the text parts.
VB.NET
Convert HTML to XML with Auto-Correction
See more HTML-to-XML/Text Examples
Simple HTML to XML conversion. Demonstrates how the HTML is auto-corrected to create well-formed XML. In this example, the closing is missing. Also, text is encapsulated inChilkat VB.NET Downloads
' This example assumes the Chilkat API to have been previously unlocked.
' See Global Unlock Sample for sample code.
Dim htmlToXml As New Chilkat.HtmlToXml
' Indicate the charset of the output XML we'll want.
htmlToXml.XmlCharset = "utf-8"
' Set the HTML:
htmlToXml.Html = "<html><body><p>This is a test <a href=""http://www.chilkatsoft.com/"">Chilkat Software</a></body></html>"
' Get the XML:
Debug.WriteLine(htmlToXml.ToXml())
' This is the output:
' <?xml version="1.0" encoding="utf-8" ?>
'
' <root>
' <html>
' <body>
' <p>
' <text>This is a test </text>
' <a href="http://www.chilkatsoft.com/">
' <text>Chilkat Software</text>
' </a>
' </p>
' </body>
' </html>
' </root