HCE Project Python language Distributed Tasks Manager Application, Distributed Crawler Application and client API bindings.  2.0.0-chaika
Hierarchical Cluster Engine Python language binding
dc_db.FieldRecalculatorDefaultCriterions Namespace Reference

Functions

def getDefaultCriterions (criterionName, siteId, queryCallback)
 

Variables

 logger = Utils.MPLogger().getLogger()
 
string CRIT_RESOURCES = "COUNTER_CRIT_RESOURCES"
 
string CRIT_CONTENTS = "COUNTER_CRIT_CONTENTS"
 
string CRIT_CLURLS = "COUNTER_CRIT_CLURLS"
 
string CRIT_NURLS = "COUNTER_CRIT_NURLS"
 
string CRIT_DURLS = "COUNTER_CRIT_DURLS"
 
string CRIT_CRURLS = "COUNTER_CRIT_CRURLS"
 
string CRIT_PURLS = "COUNTER_CRIT_PURLS"
 
string CRIT_ERRORS = "COUNTER_CRIT_ERRORS"
 
string CRIT_CRAWLED_THIS_NODE = '( NOT (`Status`=' + str(dc.EventObjects.URL.STATUS_CRAWLED) + ' AND `Crawled`=0 ))'
 
dictionary DefaultRecalculatorCriterions
 

Detailed Description

@package: dc
@author scorp
@link: http://hierarchical-cluster-engine.com/
@copyright: Copyright © 2013-2014 IOIX Ukraine
@license: http://hierarchical-cluster-engine.com/license/
@since: 0.1

Function Documentation

◆ getDefaultCriterions()

def dc_db.FieldRecalculatorDefaultCriterions.getDefaultCriterions (   criterionName,
  siteId,
  queryCallback 
)

Definition at line 42 of file FieldRecalculatorDefaultCriterions.py.

42 def getDefaultCriterions(criterionName, siteId, queryCallback):
43  ret = ""
44  SQL_SELECT_TEMPLATE = "SELECT `Value` FROM `sites_properties` WHERE `Name`='%s' AND `Site_Id`='%s'"
45  criterionDict = {}
46  query = SQL_SELECT_TEMPLATE % (criterionName, siteId)
47  res = queryCallback(query, Constants.PRIMARY_DB_ID)
48  if res is not None and len(res) > 0 and res[0] is not None:
49  criterionDict[app.SQLCriterions.CRITERION_WHERE] = res[0][0]
50  elif criterionName in DefaultRecalculatorCriterions:
51  criterionDict = DefaultRecalculatorCriterions[criterionName]
52  ret = app.SQLCriterions.generateCriterionSQL(criterionDict, None, siteId)
53  logger.debug(">>> Recalculate Def Ret = " + ret)
54  return ret
55 
def generateCriterionSQL(criterions, additionWhere=None, siteId=None)
def getDefaultCriterions(criterionName, siteId, queryCallback)
Here is the call graph for this function:

Variable Documentation

◆ CRIT_CLURLS

string dc_db.FieldRecalculatorDefaultCriterions.CRIT_CLURLS = "COUNTER_CRIT_CLURLS"

Definition at line 21 of file FieldRecalculatorDefaultCriterions.py.

◆ CRIT_CONTENTS

string dc_db.FieldRecalculatorDefaultCriterions.CRIT_CONTENTS = "COUNTER_CRIT_CONTENTS"

Definition at line 20 of file FieldRecalculatorDefaultCriterions.py.

◆ CRIT_CRAWLED_THIS_NODE

string dc_db.FieldRecalculatorDefaultCriterions.CRIT_CRAWLED_THIS_NODE = '( NOT (`Status`=' + str(dc.EventObjects.URL.STATUS_CRAWLED) + ' AND `Crawled`=0 ))'

Definition at line 28 of file FieldRecalculatorDefaultCriterions.py.

◆ CRIT_CRURLS

string dc_db.FieldRecalculatorDefaultCriterions.CRIT_CRURLS = "COUNTER_CRIT_CRURLS"

Definition at line 24 of file FieldRecalculatorDefaultCriterions.py.

◆ CRIT_DURLS

string dc_db.FieldRecalculatorDefaultCriterions.CRIT_DURLS = "COUNTER_CRIT_DURLS"

Definition at line 23 of file FieldRecalculatorDefaultCriterions.py.

◆ CRIT_ERRORS

string dc_db.FieldRecalculatorDefaultCriterions.CRIT_ERRORS = "COUNTER_CRIT_ERRORS"

Definition at line 26 of file FieldRecalculatorDefaultCriterions.py.

◆ CRIT_NURLS

string dc_db.FieldRecalculatorDefaultCriterions.CRIT_NURLS = "COUNTER_CRIT_NURLS"

Definition at line 22 of file FieldRecalculatorDefaultCriterions.py.

◆ CRIT_PURLS

string dc_db.FieldRecalculatorDefaultCriterions.CRIT_PURLS = "COUNTER_CRIT_PURLS"

Definition at line 25 of file FieldRecalculatorDefaultCriterions.py.

◆ CRIT_RESOURCES

string dc_db.FieldRecalculatorDefaultCriterions.CRIT_RESOURCES = "COUNTER_CRIT_RESOURCES"

Definition at line 19 of file FieldRecalculatorDefaultCriterions.py.

◆ DefaultRecalculatorCriterions

dictionary dc_db.FieldRecalculatorDefaultCriterions.DefaultRecalculatorCriterions
Initial value:
1 = {CRIT_RESOURCES: {"WHERE": "`Status`=" + \
2  str(dc.EventObjects.URL.STATUS_CRAWLED) + " AND `Crawled`>0 AND `Size`>0 " + \
3  "AND ((`ErrorMask` & 4198399) = 0) AND `ContentType`='text/html'"},
4  CRIT_CONTENTS: {"WHERE": " `Status`=7 AND `TagsCount`>0 AND `Processed`>0"},
5  CRIT_CLURLS: {'WHERE': "`ParentMd5`<>'' AND " + CRIT_CRAWLED_THIS_NODE},
6  CRIT_NURLS: {"WHERE": "`Status`=" + str(dc.EventObjects.URL.STATUS_NEW)},
7  CRIT_DURLS: {},
8  CRIT_CRURLS: {},
9  CRIT_PURLS: {},
10  CRIT_ERRORS: {'WHERE': "`ErrorMask`>0 AND " + CRIT_CRAWLED_THIS_NODE}}

Definition at line 30 of file FieldRecalculatorDefaultCriterions.py.

◆ logger

dc_db.FieldRecalculatorDefaultCriterions.logger = Utils.MPLogger().getLogger()

Definition at line 16 of file FieldRecalculatorDefaultCriterions.py.