Chilkat HOME .NET Core C# Android™ AutoIt C C# C++ Chilkat2-Python CkPython Classic ASP DataFlex Delphi ActiveX Delphi DLL Go Java Lianja Mono C# Node.js Objective-C PHP ActiveX PHP Extension Perl PowerBuilder PowerShell PureBasic Ruby SQL Server Swift 2 Swift 3,4,5... Tcl Unicode C Unicode C++ VB.NET VBScript Visual Basic 6.0 Visual FoxPro Xojo Plugin
(Chilkat2-Python) GetBaseDomainThe GetBaseDomain method is a utility function that converts a domain into a "domain base", which is useful for grouping URLs. For example: abc.chilkatsoft.com, xyz.chilkatsoft.com, and blog.chilkatsoft.com all have the same base domain: chilkatsoft.com. Things get more complicated when considering country domains (.au, .uk, .se, .cn, etc.) and government, state, and .us domains. Also, domains such as blogspot, wordpress, etc, are treated specially so that "xyz.blogspot.com" has a base domain of "xyz.blogspot.com". Note: If you find other domains that should be treated similarly to blogspot.com, send a request to support@chilkatsoft.com.
import chilkat2 spider = chilkat2.Spider() print(spider.GetBaseDomain("www.chilkatsoft.com")) print(spider.GetBaseDomain("blog.chilkatsoft.com")) print(spider.GetBaseDomain("www.news.com.au")) print(spider.GetBaseDomain("blogs.bbc.co.uk")) print(spider.GetBaseDomain("xyz.blogspot.com")) print(spider.GetBaseDomain("www.heaids.org.za")) print(spider.GetBaseDomain("www.hec.gov.pk")) print(spider.GetBaseDomain("www.e-mrs.org")) print(spider.GetBaseDomain("cra.curtin.edu.au")) # Prints: # chilkatsoft.com # chilkatsoft.com # news.com.au # bbc.co.uk # xyz.blogspot.com # heaids.org.za # hec.gov.pk # e-mrs.org # curtin.edu.a |
© 2000-2024 Chilkat Software, Inc. All Rights Reserved.