(Chilkat2-Python) Extract all HTML Objects from a Web Page

Demonstrates how to download a web page from a URL and extract all of its HTML objects, e.g. images, links, CSS files, and JavaScript files.
    import sys
    import chilkat2

    # This example assumes the Chilkat API to have been previously unlocked.
    # See Global Unlock Sample for sample code.

    mht = chilkat2.Mht()

    # Download a URL into an in-memory MHT web archive contained
    # in a string variable.
    # The following URL is randomly picked and was valid at the time of writing this example:
    mhtDoc = mht.GetMHT("https://www.tetonlodge.com/")
    if (mht.LastMethodSuccess != True):
        print(mht.LastErrorText)
        sys.exit()

    # Extract the HTML and embedded objects:
    unpackDir = "C:/AAWorkarea/mhtTesting/"
    htmlFilename = "lodge.html"
    partsSubdir = "objects"

    # Extract to C:/AAWorkarea/mhtTesting/lodge.html.
    # Images and other embedded objects are placed in
    # C:/AAWorkarea/mhtTesting/objects. Directories are automatically
    # created if they don't already exist.
    success = mht.UnpackMHTString(mhtDoc,unpackDir,htmlFilename,partsSubdir)
    if (success != True):
        print(mht.LastErrorText)
    else:
        print("Unpacked!")
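To make "HTML objects" concrete, here is a minimal sketch that lists the images, scripts, stylesheets, and links an HTML string refers to. It uses only Python's standard library (`html.parser` and `urllib.parse`), not Chilkat, and does not reflect how `GetMHT` or `UnpackMHTString` work internally. The names `ObjectCollector` and `collect_objects`, the sample markup, and the `https://example.com/` base URL are hypothetical, chosen for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ObjectCollector(HTMLParser):
    """Hypothetical helper (not part of Chilkat): records URLs a page references."""

    # tag name -> (attribute that holds the URL, category to file it under)
    TAG_ATTRS = {
        "img": ("src", "images"),
        "script": ("src", "scripts"),
        "a": ("href", "links"),
    }

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.found = {"images": [], "scripts": [], "stylesheets": [], "links": []}

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        if tag == "link":
            # Only <link rel="stylesheet" href="..."> counts as a CSS file here.
            rel = (attr_map.get("rel") or "").lower().split()
            href = attr_map.get("href")
            if "stylesheet" in rel and href:
                self.found["stylesheets"].append(urljoin(self.base_url, href))
            return
        if tag in self.TAG_ATTRS:
            attr, category = self.TAG_ATTRS[tag]
            value = attr_map.get(attr)
            if value:
                # Resolve relative references against the page URL.
                self.found[category].append(urljoin(self.base_url, value))


def collect_objects(html, base_url):
    """Return a dict mapping each category to a list of absolute URLs."""
    parser = ObjectCollector(base_url)
    parser.feed(html)
    parser.close()
    return parser.found


if __name__ == "__main__":
    # Made-up sample markup; a real page would be downloaded first.
    sample = (
        '<html><head><link rel="stylesheet" href="css/site.css">'
        '<script src="/js/app.js"></script></head>'
        '<body><img src="img/logo.png">'
        '<a href="https://example.com/about">About</a></body></html>'
    )
    for category, urls in collect_objects(sample, "https://example.com/").items():
        print(category, urls)
```

This sketch only lists the references; it does not download or save the objects, which is the part the Chilkat example above delegates to `UnpackMHTString`.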
© 2000-2025 Chilkat Software, Inc. All Rights Reserved.