Chilkat2-Python
Chilkat2-Python
Download Web Page to MHT and Extract Images (all in memory)
See more MHT / HTML Email Examples
Downloads a web page to an MHT archive (in memory) and then extracts each image to a byte array in memory.Chilkat Chilkat2-Python Downloads
import sys
import chilkat2
success = False
# This example assumes the Chilkat API to have been previously unlocked.
# See Global Unlock Sample for sample code.
# Note: This URL exists at the time of writing and testing this example (on 12-June-2020)
# However, it will surely not continue to exist for very long.
# You should choose a different URL. (Any web page with images will do.)
url = "https://www.fendi.com/it/abbigliamento-uomo/cravatta-fxc160a3nwf0qg2"
mht = chilkat2.Mht()
# Downloads to an MHT string.
# MHT is just MIME, which is the same format as an email but with different semantics.
mhtStr = mht.GetMHT(url)
if (mht.LastMethodSuccess == False):
print(mht.LastErrorText)
sys.exit()
# We can still treat the MHT MIME as an email and iterate over the "related items".
email = chilkat2.Email()
success = email.SetFromMimeText(mhtStr)
if (success == False):
print(email.LastErrorText)
sys.exit()
numRelatedItems = email.NumRelatedItems
i = 0
sbContentType = chilkat2.StringBuilder()
while i < numRelatedItems :
sbContentType.SetString(email.GetRelatedContentType(i))
print("Content-Type: " + sbContentType.GetAsString())
if (sbContentType.StartsWith("image/",False) == True):
# We have an image.
# Get the image data.
# imageData is a memoryview
imageData = email.GetRelatedData(i)
# Do what you need with the image data..
i = i + 1