Chilkat HOME .NET Core C# Android™ AutoIt C C# C++ Chilkat2-Python CkPython Classic ASP DataFlex Delphi ActiveX Delphi DLL Go Java Lianja Mono C# Node.js Objective-C PHP ActiveX PHP Extension Perl PowerBuilder PowerShell PureBasic Ruby SQL Server Swift 2 Swift 3,4,5... Tcl Unicode C Unicode C++ VB.NET VBScript Visual Basic 6.0 Visual FoxPro Xojo Plugin
(Chilkat2-Python) Extract all HTML Objects from a Web PageSee more MHT / HTML Email ExamplesDemonstrates how to download a Web page (at a URL) and extract all HTML objects. Eg. images, links, CSS files, JavaScript files, etc.
import sys import chilkat2 # This example assumes the Chilkat API to have been previously unlocked. # See Global Unlock Sample for sample code. mht = chilkat2.Mht() # Download a URL into an in-memory MHT web archive contained # in a string variable. # The following URL is randomly picked and was valid at the time of writing this example: mhtDoc = mht.GetMHT("https://www.tetonlodge.com/") if (mht.LastMethodSuccess != True): print(mht.LastErrorText) sys.exit() # Extract the HTML and embedded objects: unpackDir = "C:/AAWorkarea/mhtTesting/" htmlFilename = "lodge.html" partsSubdir = "objects" # Extract to C:/AAWorkarea/mhtTesting/lodge.html. # images and other embedded objects are placed in # C:/AAWorkarea/mhtTesting/objects. Directories are automatically # created if they don't already exist. success = mht.UnpackMHTString(mhtDoc,unpackDir,htmlFilename,partsSubdir) if (success != True): print(mht.LastErrorText) else: print("Unpacked!") |
© 2000-2024 Chilkat Software, Inc. All Rights Reserved.