HCE Project Python language Distributed Tasks Manager Application, Distributed Crawler Application and client API bindings.
2.0.0-chaika
Hierarchical Cluster Engine Python language binding
- g -
genDBFields() :
dc_db.URLContentTask.URLContentTask
generateBatchitemsByURLSchema() :
dc_crawler.CrawlerTask.CrawlerTask
generateCriterionSQL() :
dc_db.BaseTask.BaseTask
generateDomainUrl() :
app.Utils.UrlParser
generateEmptyResponse() :
dcc.DCC.DCC, dtma.DTMA.DTMA, dtmc.DTMC.DTMC
generateETagsString() :
dc_crawler.DetectModified.DetectModified
generateGetObjects() :
dtma.DTMAObjectsFiller.DTMAObjectsFiller
generateObjectsList() :
dtma.DTMAObjectsFiller.DTMAObjectsFiller
generateResource() :
dc_crawler.ResourceProcess.ResourceProcess
generateSetObjects() :
dtma.DTMAObjectsFiller.DTMAObjectsFiller
generateSQLCustomObject() :
dtma.DTMAObjectsFiller.DTMAObjectsFiller
generateStatObjects() :
dtma.DTMAObjectsFiller.DTMAObjectsFiller
generateStopObjects() :
dtma.DTMAObjectsFiller.DTMAObjectsFiller
generateSuspendObject() :
dtma.DTMAObjectsFiller.DTMAObjectsFiller
generateSystemObject() :
dtma.DTMAObjectsFiller.DTMAObjectsFiller
generateTemplatesFromRowTemplates() :
dc_processor.scrapy_extractor.ScrapyExtractor
generateUrlSchema() :
dc_crawler.UrlSchema.UrlSchema
get() :
dc_processor.scraper_result.Result
get_body() :
transport.Request.Request, transport.Response.Response
get_connection_uid() :
transport.IDGenerator.IDGenerator
get_crawl_delay() :
dc_crawler.OwnRobots.RobotExclusionRulesParser
get_data() :
app.Utils.MLStripper
get_fetcher() :
dc_crawler.Fetcher.BaseFetcher
get_uid() :
transport.Request.Request, transport.Response.Response, transport.UIDGenerator.UIDGenerator
getAdditionPurges() :
dc_db.URLPurgeTask.URLPurgeTask
getAdminHandler() :
admin.Command.Command
getAllDocs() :
dc_processor.ScraperMultiItemsTask.ScraperResultDocuments
getAllLogsAsString() :
dc_crawler.Fetcher.SeleniumFetcher
getAllTags() :
dc_processor.ScraperMultiItemsTask.ScraperResultDocuments
getBatchFromInput() :
dc_crawler.RTCFinalizer.RTCFinalizer, dc_crawler.RTCPreprocessor.RTCPreprocessor
getBatchTaskIdByURL() :
dc.BatchTasksManager.BatchTasksManager
getBatchTasksCount() :
dc.BatchTasksManager.BatchTasksManager
getBehaviour() :
dc_crawler.DetectModified.DetectModified
getBestDatatimeData() :
dc_processor.Scraper.Scraper
getBestValue() :
dc_processor.scraper_result.Result
getBody() :
admin.NodeManagerResponse.NodeManagerResponse
getCodec() :
dc_crawler.ResourceProcess.ResourceProcess
getCommandName() :
admin.Command.Command
getCommonPath() :
dc_processor.ScraperMultiItemsTask.ScraperResultDocuments
getConfigOption() :
dc_postprocessor.PostProcessingApplicationClass.PostProcessingApplicationClass
getConfigVarsFields() :
app.BaseServerManager.BaseServerManager
getConnectedNodesFromSchema() :
dtm.ResourcesStateMonitor.ResourcesStateMonitor
getConnectionIdentityByEvent() :
app.AdminInterfaceServer.AdminInterfaceServer
getCookie() :
dc_crawler.HTTPCookieResolver.HTTPCookieResolver
getData() :
dc_crawler.HTTPCookieResolver.HTTPCookieResolver.Cookie, dc_crawler.ProxyJsonWrapper.ProxyJsonWrapper
getDataBuffer() :
dc_processor.ProcessorFeedParser.ProcessorFeedParser
getDefaultItem() :
app.ResponseExtractor.ResponseExtractor
getDefaultProperties() :
dc_processor.AuthorType.AuthorType
getDeletedTask() :
dtm.TasksManager.TasksManager
getDepthFromUrl() :
dc_crawler.URLProcess.URLProcess
getDeserialize() :
dtmc.DTMCObjectsSerializator.DTMCObjectsSerializator
getDir() :
app.Utils.PathMaker, dc_crawler.CrawlerTask.CrawlerTask
getDomain() :
app.Utils.UrlParser
getDomainNameFromURL() :
dc_crawler.Fetcher.BaseFetcher
getDomainsForUrlSourcesRules() :
dc_processor.Scraper.Scraper
getDRCEConnectionParamsFromPool() :
dc.SitesManager.SitesManager
getDTMTaskState() :
dc.BatchTasksManager.BatchTasksManager, dc.BatchTasksManagerProcess.BatchTasksManagerProcess
getElapsedTime() :
admin.Node.Node
getEmptyTags() :
dc_processor.scraper_result.Result
getEnaibledProxies() :
dc_crawler.DBProxyWrapper.DBProxyWrapper
getEnvVars() :
dc_crawler.RTCPreprocessor.RTCPreprocessor
getErrorCode() :
admin.NodeManager.NodeManager, admin.NodeManagerRequest.NodeManagerRequest, admin.NodeManagerResponse.NodeManagerResponse, dbi.dbi.DBI, ftests_db_in_memory.DBI
getErrorMsg() :
dbi.dbi.DBI, ftests_db_in_memory.DBI
getExitCode() :
dc_processor.ProcessorFeedParser.ProcessorFeedParser, dc_processor.ProcessorStoreContentKVDB.ProcessorStoreContentKVDB, dc_processor.ProcessorTask.ProcessorTask, dc_processor.Scraper.Scraper
getExtractorByName() :
dc_processor.Scraper.Scraper
getFieldParams() :
dc_crawler.CollectURLs.CollectURLs
getFilePath() :
dc_crawler.UserProxyJsonWrapper.UserProxyJsonWrapper
getFilledTags() :
dc_processor.scraper_result.Result
getFreeProcInstanceNumber() :
app.Utils.LoggerFileName
getFromProperties() :
dc.EventObjects.Site
getFromProperty() :
dc_crawler.FetcherType.FetcherType
getGmtTime() :
app.Filters.Filters
getHeaderContent() :
dc_processor.Scraper.Scraper
getHost() :
admin.Node.Node
getIndexNumberOfPath() :
dc_processor.ScraperMultiItemsTask.ScraperResultDocuments
getInputPickle() :
app.ContentUpdater.ContentUpdater, app.UrlsToBatchTask.UrlsToBatchTask
getInt() :
app.DateTimeType.DateTimeType
getLang() :
app.DateTimeType.DateTimeType
getLangTags() :
dc_processor.ScraperLangDetector.ScraperLangDetector
getLangTagsNames() :
dc_processor.ScraperLangDetector.ScraperLangDetector
getListOfUniqueURLs() :
app.UrlsToBatchTask.UrlsToBatchTask
getLocalOffset() :
dc_processor.PDateTimezonesHandler.PDateTimezonesHandler
getLogger() :
app.Utils.MPLogger
getLogLevel() :
app.BaseServerManager.BaseServerManager
getMaxCount() :
dc_processor.ScraperMultiItemsTask.ScraperResultDocuments
getMaxCountParameters() :
dc_crawler.UrlSchema.UrlSchema
getMonthNumber() :
app.DateTimeType.DateTimeType
getNewSchedulerTask() :
tests.test_dtm_SchedulerTaskScheme.TestSchedulerTaskScheme
getNewsItem() :
app.ResponseExtractor.ResponseExtractor
getNewTaskLog() :
tests.test_dtm_TaskLogScheme.TestTaskLogScheme
getNextBestExtractor() :
dc_processor.Scraper.Scraper
getNormalized() :
app.Url.Url
getNormalizeMask() :
app.UrlNormalize.UrlNormalize
getObject() :
ftests.ftest_FieldsSQLExpressionEvaluator.Test
getObjectFromJsonFile() :
dtm.ResourcesStateMonitor.ResourcesStateMonitor
getOptions() :
dc_crawler.Fetcher.SeleniumFetcher
getPairNames() :
dc_processor.AuthorType.AuthorType
getParams() :
admin.Command.Command
getPlannedRunTime() :
dtm.Scheduler.Scheduler
getPort() :
admin.Node.Node
getProcessDirs() :
dc_crawler.Fetcher.SeleniumFetcher
getProcessedContent() :
dc_processor.ProcessorTask.ProcessorTask, dc_processor.Scraper.Scraper
getProcessorCmd() :
dc_processor.ProcessorTask.ProcessorTask
getPropValueFromSiteProperties() :
dc_processor.ProcessorTask.ProcessorTask
getProxies() :
dc_crawler.UserProxyJsonWrapper.UserProxyJsonWrapper
getProxy() :
dc_crawler.HTTPProxyResolver.HTTPProxyResolver, dc_crawler.ProxyResolver.ProxyResolver
getProxyData() :
dc_crawler.ProxyJsonWrapper.ProxyJsonWrapper, dc_crawler.UserProxyJsonWrapper.UserProxyJsonWrapper
getProxyList() :
dc_crawler.ProxyJsonWrapper.ProxyJsonWrapper, dc_crawler.UserProxyJsonWrapper.UserProxyJsonWrapper
getProxyName() :
dc_crawler.CrawlerTask.CrawlerTask
getPubdateUseSourceMask() :
dc_crawler.CrawlerTask.CrawlerTask
getQueryPrefix() :
dc_processor.ProcessorFeedParser.ProcessorFeedParser
getRawContent() :
dc_processor.ProcessorTask.ProcessorTask
getRawContentCheck() :
dc_crawler.UserProxyJsonWrapper.UserProxyJsonWrapper
getRawContentCheckFaults() :
dc_crawler.UserProxyJsonWrapper.UserProxyJsonWrapper
getRawContentCheckPatterns() :
dc_crawler.UserProxyJsonWrapper.UserProxyJsonWrapper
getRawContentCheckRotate() :
dc_crawler.UserProxyJsonWrapper.UserProxyJsonWrapper
getRawContentFromFS() :
dc_processor.ProcessorTask.ProcessorTask
getRealUrl() :
dc_crawler.URLProcess.URLProcess
getRedirectProperty() :
dc_crawler.HTTPRedirectResolver.HTTPRedirectResolver
getRequestEvent() :
dc.ClientInterfaceService.ClientInterfaceService, dtm.ClientInterfaceService.ClientInterfaceService
getRequestTimeout() :
admin.Command.Command
getResourceFromTaskResponse() :
dtm.ResourcesStateMonitor.ResourcesStateMonitor
getResourcesAVG() :
dtm.ResourcesRecalculating.ResourcesRecalculating
getResponceEventType() :
dtm.TasksDataManager.TasksDataManager
getResponses() :
admin.NodeManager.NodeManager
getResponsesDicts() :
admin.NodeManager.NodeManager
getSiteFields() :
dc_db.URLCleanupTask.URLCleanUpTask
getSiteProperties() :
ftests.ftest_FieldsSQLExpressionEvaluator.Test
getSitesFromClientResponseItems() :
dc.SitesManager.SitesManager
getSource() :
dc_crawler.UserProxyJsonWrapper.UserProxyJsonWrapper
getStatDataFields() :
app.BaseServerManager.BaseServerManager
GetStats() :
app.Url.Url
getStatusUpdateEmptyProxyList() :
dc_crawler.UserProxyJsonWrapper.UserProxyJsonWrapper
getStatusUpdateNoAvailableProxy() :
dc_crawler.UserProxyJsonWrapper.UserProxyJsonWrapper
getStatusUpdateTriesLimits() :
dc_crawler.UserProxyJsonWrapper.UserProxyJsonWrapper
getStatVarsDumpFileName() :
app.BaseServerManager.BaseServerManager
getString() :
app.DateTimeType.DateTimeType
getSummaryLang() :
dc_processor.ScraperLangDetector.ScraperLangDetector
getSystemStat() :
app.BaseServerManager.BaseServerManager
getTagNamesExistAllDocs() :
dc_processor.ScraperMultiItemsTask.ScraperResultDocuments
getTagValueByName() :
app.ResponseExtractor.ResponseExtractor
getTasksDeserialize() :
dtmc.DTMCObjectsSerializator.DTMCObjectsSerializator
getTemplate() :
dc_processor.Scraper.Scraper
getTimeout() :
admin.Node.Node
getTimeSinceEpoch() :
dtm.Scheduler.Scheduler
getTimezone() :
app.DateTimeType.DateTimeType
getTriesCount() :
dc_crawler.HTTPProxyResolver.HTTPProxyResolver, dc_crawler.ProxyResolver.ProxyResolver, dc_crawler.UserProxyJsonWrapper.UserProxyJsonWrapper
getUnixTimeFromString() :
dc.ClientInterfaceService.ClientInterfaceService
getURL() :
dc.EventObjects.URL, dc_db.URLStatusTask.URLStatusTask
getURLContent() :
dc_crawler.RTCFinalizer.RTCFinalizer, dc_db.URLContentTask.URLContentTask
getURLContentFromBatch() :
dc_crawler.RTCFinalizer.RTCFinalizer
getURLFetchJson() :
app.UrlFetchJsonToDBTaskConvertor.UrlFetchToJsonDBTaskConvertor
getURLFromURLTable() :
dc_db.URLFetchTask.URLFetchTask
getURLsCountFromClientResponseItems() :
dc.BatchTasksManager.BatchTasksManager
getVariableFromHeaderContent() :
dc_processor.Scraper.Scraper
getXPathFromContent() :
dc_processor.ml_extractor.MLExtractor
getXpathValueForDTime() :
dc_processor.TemplateExtractorXPathPreparing.TemplateExtractorXPathPreparing
Generated on Fri Nov 24 2017 18:55:20 for HCE Project Python language Distributed Tasks Manager Application, Distributed Crawler Application and client API bindings. by Doxygen 1.8.13