HCE Project Python language Distributed Tasks Manager Application, Distributed Crawler Application and client API bindings.
2.0.0-chaika
Hierarchical Cluster Engine Python language binding
Main Page
Related Pages
+
Namespaces
Namespace List
+
Namespace Members
+
All
_
a
b
c
d
e
f
g
h
i
j
k
l
m
n
o
p
q
r
s
t
u
v
w
x
+
Functions
_
a
b
c
d
e
f
g
i
j
l
m
o
p
r
s
t
u
v
w
+
Variables
_
a
b
c
d
e
f
g
h
i
j
k
l
m
n
o
p
q
r
s
t
u
v
w
x
+
Classes
Class List
Class Index
Class Hierarchy
+
Class Members
+
All
_
a
b
c
d
e
f
g
h
i
j
k
l
m
n
o
p
q
r
s
t
u
v
w
x
z
+
Functions
_
a
b
c
d
e
f
g
h
i
j
k
l
m
n
o
p
r
s
t
u
v
w
x
+
Variables
_
a
b
c
d
e
f
g
h
i
j
k
l
m
n
o
p
q
r
s
t
u
v
w
x
z
+
Files
File List
▼
HCE Project Python language Distributed Tasks Manager Application, Distributed Crawler Application and client API bindings.
►
HCE project, Python bindings, Distributed Tasks Manager application, Distributed Crawler service.
Todo List
►
Namespaces
▼
Classes
►
Class List
Class Index
►
Class Hierarchy
▼
Class Members
►
All
►
Functions
►
Variables
►
Files
•
All
Classes
Namespaces
Files
Functions
Variables
Pages
Here is a list of all class members with links to the classes they belong to:
- i -
id :
app.UrlsToBatchTask.UrlsToBatchTask
,
dbi_sql_test.TestObj
,
dc.EventObjects.Batch
,
dc.EventObjects.ClientResponseItem
,
dc.EventObjects.Proxy
,
dc.EventObjects.ProxyUpdate
,
dc.EventObjects.Site
,
dc.EventObjects.SiteCleanup
,
dc.EventObjects.SiteDelete
,
dc.EventObjects.SiteStatus
,
dc.EventObjects.SiteUpdate
,
demoTask.DemoBackLogTask
,
drce.Commands.BaseRequest
,
drce.Commands.ResponseItem
,
dtm.EEResponsesTable.EEResponsesTable
,
dtm.EventObjects.CheckTaskState
,
dtm.EventObjects.DeleteEEResponseData
,
dtm.EventObjects.DeleteTask
,
dtm.EventObjects.DeleteTaskData
,
dtm.EventObjects.DeleteTaskResults
,
dtm.EventObjects.EEResponseData
,
dtm.EventObjects.ExecuteTask
,
dtm.EventObjects.FetchEEResponseData
,
dtm.EventObjects.FetchTaskData
,
dtm.EventObjects.FetchTasksResults
,
dtm.EventObjects.GetTaskManagerFields
,
dtm.EventObjects.NewTask
,
dtm.EventObjects.ScheduledTask
,
dtm.EventObjects.Task
,
dtm.EventObjects.TaskManagerFields
,
dtm.EventObjects.UpdateTask
,
dtm.EventObjects.UpdateTaskFields
,
dtm.SchedulerTask.SchedulerTask
,
dtm.TaskLog.TaskLog
,
ftests.ftest_DCC.ConnectionStub
,
ftests.ftest_DTMC.ConnectionStub
,
ftests_db_in_memory.TasksDataTable
,
task.Task
id_generator :
drce.CommandExecutor.CommandExecutor
,
tests.test_transport_ConnectionBuilder.TestConnectionBuilder
,
transport.ConnectionBuilder.ConnectionBuilder
idGenerator :
tests.test_transport_IDGenerator.TestIDGenerator
ids :
dtm.EventObjects.AvailableTaskIds
,
dtm.EventObjects.FetchTasksResultsFromCache
,
dtm.EventObjects.GetScheduledTasksResponse
,
dtm.EventObjects.GetTasksStatus
imageExtraction() :
dc_processor.alchemyapi.AlchemyAPI
imagesProcessing() :
dc_processor.newspaper_extractor.NewspaperExtractor
imageTagging() :
dc_processor.alchemyapi.AlchemyAPI
imgDelimiter :
dc_processor.base_extractor.BaseExtractor
includeFieldsNames :
dbi.EventObjects.CustomRequest
incrementLimits() :
dc_crawler.ProxyResolver.ProxyResolver
INDEX_FILE_EXTENTION :
dc_crawler.HTTPProxyResolver.HTTPProxyResolver
indexFileName :
dc_crawler.ProxyResolver.ProxyResolver
,
dc_crawler.UrlSchema.UrlSchema
indexStruct :
dc_crawler.UrlSchema.UrlSchema
init() :
dc_crawler.Fetcher.BaseFetcher
,
dc_postprocessor.LinkResolver.LinkResolver
,
dc_postprocessor.PostProcessingModuleClass.PostProcessingModuleClass
,
dc_postprocessor.PostprocessorTask.PostprocessorTask
,
dc_postprocessor.SocialModule.SocialModule
,
ftests.ftest_PostProcessingApplicationClass.TestApplication
inited :
tests.test_dtm_TasksStateUpdateService.TestTasksStateUpdateService
initFiends() :
dc_crawler.RobotsParser.RobotsParser
initHTTPHeaders() :
dc_crawler.CrawlerTask.CrawlerTask
initializeTmpDirs() :
dc_crawler.Fetcher.SeleniumFetcher
initLogger() :
app.ResponseExtractor.ResponseExtractor
initStatFields() :
app.BaseServerManager.BaseServerManager
initTagsLimitsConfig :
app.ResponseExtractor.ResponseExtractor
initTagsUniqueHashConfig :
app.ResponseExtractor.ResponseExtractor
inlineURLMacroDelimiter :
dc_crawler.Fetcher.SeleniumFetcher
innerDelimiter :
app.ExtendInnerText.ExtendInnerText
innerText() :
app.ExtendInnerText.ExtendInnerText
innerTextTagReplacers :
dc_processor.scrapy_extractor.ScrapyExtractor
,
dc_processor.TemplateExtractorXPathPreparing.TemplateExtractorXPathPreparing
innerTextToList() :
app.ExtendInnerText.ExtendInnerText
input :
drce.Commands.TaskExecuteStruct
,
dtm.EventObjects.NewTask
,
dtm.EventObjects.Task
,
task.Task
input_batch :
dc_processor.ProcessorTask.ProcessorTask
input_data :
dc_processor.ProcessorFeedParser.ProcessorFeedParser
,
dc_processor.ProcessorStoreContentKVDB.ProcessorStoreContentKVDB
,
dc_processor.Scraper.Scraper
inputBatch() :
dc_postprocessor.PostProcessingApplicationClass.PostProcessingApplicationClass
inputFile :
dc_postprocessor.PostProcessingApplicationClass.PostProcessingApplicationClass
insert() :
dbi.dbi.DBI
,
ftests.ftest_TaskDataManager.TestTaskDataManager
,
ftests.ftest_TasksDataManager.TestTasksDataManager
,
ftests_db_in_memory.DBI
INSERT_EE_DATA :
dtm.Constants.EVENT_TYPES
INSERT_EE_DATA_RESPONSE :
dtm.Constants.EVENT_TYPES
insertFSFunc() :
dc_db.DBDataTask.DBDataTask
insertFunc :
dc_db.DBDataTask.DBDataTask
insertKVDBSpecificFunct() :
dc_db.DBDataTask.DBDataTask
insertNewSiteProperties() :
dc_crawler.CollectURLs.CollectURLs
,
ftests.ftest_DBTasksWrapper.Test
insertOnUpdate() :
dbi.dbi.DBI
,
ftests_db_in_memory.DBI
insertProxy() :
dc_db.ProxyNewTask.ProxyNewTask
instantiateModules() :
dc_postprocessor.PostprocessorTask.PostprocessorTask
intelligentExtractor() :
app.DateTimeType.DateTimeType
internalCalculating() :
algorithms.MetricContentSize.MetricContentSize
,
algorithms.MetricWCount.MetricWCount
internalIndexes :
dc_crawler.ProxyResolver.ProxyResolver
io :
dtm.EventObjects.Resource
,
dtm.EventObjects.ResourcesAVG
,
Resources.Resources
is_allowed() :
dc_crawler.OwnRobots.RobotExclusionRulesParser
is_closed() :
transport.Connection.Connection
is_connection_registered() :
app.BaseServerManager.BaseServerManager
is_default() :
dc_crawler.OwnRobots._Ruleset
is_expired() :
dc_crawler.OwnRobots.RobotExclusionRulesParser
is_not_empty() :
dc_crawler.OwnRobots._Ruleset
is_send :
ftests.ftest_test_async_MsgSend.Client
is_url_allowed() :
dc_crawler.OwnRobots._Ruleset
isAbortedByTTL :
dc_crawler.CollectURLs.CollectURLs
,
dc_crawler.CrawlerTask.CrawlerTask
isAllowedInputString() :
app.DateTimeType.DateTimeType
isAllowedLimits() :
dc_processor.MediaLimitsHandler.MediaLimitsHandler
isAllowedReplaceMimeType() :
dc_crawler.ResourceProcess.ResourceProcess
isAllowedSiteLimits() :
dc_processor.ProcessorTask.ProcessorTask
isAllowedUrl() :
dc_crawler.HTTPRedirectResolver.HTTPRedirectResolver
isAvailableProxy() :
dc_crawler.CrawlerTask.CrawlerTask
isAvailableUrl() :
dc_crawler.CrawlerTask.CrawlerTask
isAvailableUrls() :
dc_db.URLPurgeTask.URLPurgeTask
isDeleteTableExist() :
dc_db.URLPurgeTask.URLPurgeTask
isDisabledSite() :
dc_processor.ProcessorTask.ProcessorTask
isDTMTaskDead() :
dc.BatchTasksManager.BatchTasksManager
,
dc.BatchTasksManagerProcess.BatchTasksManagerProcess
isEmptyProxiesList() :
dc_crawler.ProxyResolver.ProxyResolver
isEqualOffset() :
dc_processor.PDateTimezonesHandler.PDateTimezonesHandler
isError :
app.DateTimeType.DateTimeType
isExcludeNode() :
app.ExtendInnerText.ExtendInnerText
isExist() :
app.Filters.Filters
isExistInActions() :
app.Filters.Filters
isExistStage() :
app.Filters.Filters
isExtract :
dc_processor.ScraperMultiItemsTask.ScraperResultDocuments
isGoodWord() :
dc_processor.AuthorType.AuthorType
isHostAvailable() :
dc_crawler.CrawlerTask.CrawlerTask
isInProperties() :
dc.EventObjects.Site
isIsoFormatDate() :
dc_db.SiteTask.SiteTask
isLoadUrls :
dc_processor.NewspaperWrapper.NewspaperWrapper
isLocking :
dc.EventObjects.URLFetch
isNeedRotateProxy() :
dc_crawler.CrawlerTask.CrawlerTask
,
dc_crawler.HTTPProxyResolver.HTTPProxyResolver
isNegative :
app.DateTimeType.OffsetTzInfo
isNormalUrl() :
app.Utils.UrlNormalizator
isNotModified() :
dc_crawler.DetectModified.DetectModified
ISO_SEP :
app.DateTimeType.DateTimeType
isOverlimitMaxResources() :
dc_processor.ProcessorTask.ProcessorTask
isPossibleToRun() :
dtm.Scheduler.Scheduler
isReqSended :
dtm.TasksExecutor.TasksExecutor
isResourceNotChanged :
dc_crawler.DetectModified.DetectModified
isRootURL() :
dc_crawler.CrawlerTask.CrawlerTask
isSiteExist() :
dc_db.BaseTask.BaseTask
isStarted :
app.Profiler.Profiler
isStatisticEnabled() :
dc_db.StatisticLogManager.StatisticLogManager
isSuspend() :
dtm.EventObjects.AdminSuspend
isTagFilled() :
dc_processor.scraper_result.Result
isTagNotFilled() :
dc_processor.base_extractor.BaseExtractor
isTagValueNotEmpty() :
dc_processor.base_extractor.BaseExtractor
isUpdateCollection :
dc_crawler.URLProcess.URLProcess
isUrlExist() :
dc_crawler.URLProcess.URLProcess
isUtf8CodePage() :
app.DateTimeType.DateTimeType
isValid() :
app.Url.Url
isValidURL() :
app.Utils.UrlParser
isValueIn() :
app.Utils.PropertiesValidator
ITEM_BREAK :
app.HostRequestStorage.HostRequestStorage
ITEM_PROCESS :
app.HostRequestStorage.HostRequestStorage
itemDelimiter :
app.ResponseExtractor.ResponseExtractor
itemObject :
dc.EventObjects.ClientResponseItem
itemProperties :
dc.EventObjects.URLContentResponse
items :
dc.EventObjects.Batch
,
dc_crawler.RTCFinalizer.RTCFinalizer
,
drce.Commands.TaskResponse
itemsList :
dc.EventObjects.ClientResponse
iterations :
dc.EventObjects.Site
,
dc.EventObjects.SiteUpdate
iterNumber :
ftests.ftest_test_async_MsgSend.Client
itr :
dc_processor.Scraper.Scraper
Generated on Fri Nov 24 2017 18:55:20 for HCE Project Python language Distributed Tasks Manager Application, Distributed Crawler Application and client API bindings. by
1.8.13