HCE Project Python language Distributed Tasks Manager Application, Distributed Crawler Application and client API bindings.
2.0.0-chaika
Hierarchical Cluster Engine Python language binding
|
Public Member Functions | |
def | open (self, url, kwargs) |
Public Member Functions inherited from dc_crawler.Fetcher.BaseFetcher | |
def | __init__ (self) |
def | open (self, url, method='get', headers=None, timeout=100, allow_redirects=True, proxies=None, auth=None, data=None, log=None, allowed_content_types=None, max_resource_size=None, max_redirects=CONSTS.MAX_HTTP_REDIRECTS_LIMIT, filters=None, executable_path=None, depth=None, macro=None) |
def | should_have_meta_res (self) |
def | getDomainNameFromURL (self, url, default='') |
Additional Inherited Members | |
Static Public Member Functions inherited from dc_crawler.Fetcher.BaseFetcher | |
def | init (dbWrapper=None, siteId=None) |
def | get_fetcher (typ, dbWrapper=None, siteId=None) |
Public Attributes inherited from dc_crawler.Fetcher.BaseFetcher | |
connectionTimeout | |
logger | |
Static Public Attributes inherited from dc_crawler.Fetcher.BaseFetcher | |
fetchers = None | |
int | TYP_NORMAL = 1 |
int | TYP_DYNAMIC = 2 |
int | TYP_URLLIB = 5 |
int | TYP_CONTENT = 6 |
int | TYP_AUTO = 7 |
float | CONNECTION_TIMEOUT = 1.0 |
Definition at line 1511 of file Fetcher.py.
def dc_crawler.Fetcher.URLLibFetcher.open | ( | self, | |
url, | |||
kwargs | |||
) |
Definition at line 1523 of file Fetcher.py.