Sample code for 30+ languages & platforms
DataFlex

Scrub Xml

See more XML Examples

Demonstrates the Scrub method for some common XML cleaning needs.

Note: This example requires Chilkat v9.5.0.77 or greater.

Chilkat DataFlex Downloads

DataFlex
Use ChilkatAx-win32.pkg

Procedure Test
    Boolean iSuccess
    String s
    Handle hoXml
    String sTemp1

    Move False To iSuccess

    // The Scrub method has options for trimming whitespace from the ends of attributes and content,
    // trimming inside whitespace, and for case conversion for tags and attribute names.

    // Trimming inside whitespace is defined as replacing CR, LF, and tab chars with SPACE chars, and 
    // removing extra SPACE chars so that at most one SPACE separates non-SPACE chars.

    // For example, load the following XML:

    Move '<xyz><td class=" colHoldingPercent col7" style="border-bottom-color: rgb(221, 221, 221);               border-image-outset: 0;  "> abc' + (character(13)) + (character(10)) + '123    456</td></xyz>' To s

    Get Create (RefClass(cComChilkatXml)) To hoXml
    If (Not(IsComObjectCreated(hoXml))) Begin
        Send CreateComObject of hoXml
    End
    Get ComLoadXml Of hoXml s To iSuccess

    Get ComGetXml Of hoXml To sTemp1
    Showln sTemp1

    // The XML before scrubbing is:

    // <xyz>
    //     <td class=" colHoldingPercent col7" style="border-bottom-color: rgb(221, 221, 221);               border-image-outset: 0;  ">abc
    // 123    456</td>
    // </xyz>

    Send ComScrub To hoXml "AttrTrimEnds,AttrTrimInside,ContentTrimEnds,ContentTrimInside"
    Get ComGetXml Of hoXml To sTemp1
    Showln sTemp1

    // The XML after scrubbing is:

    // <?xml version="1.0" encoding="utf-8"?>
    // <xyz>
    //     <td class="colHoldingPercent col7" style="border-bottom-color: rgb(221, 221, 221); border-image-outset: 0;">abc 123 456</td>
    // </xyz>


End_Procedure