HCE project C++ developers source code library  1.1.1
HCE project developer library
HCE::component Namespace Reference

Classes

class  ComponentBase
class  ComponentManager
class  ComponentManagerPool
class  ComponentManagerPoolIf
class  ComponentManagerPoolSimple
class  ComponentsFactory
class  BoostPlugin
class  CldPlugin
class  FakeHCRCPlugin
class  IcuPlugin
class  LCore
struct  LCoreData
class  MecabPlugin
struct  ProcessInfo
class  Refine
class  SnowballPlugin
class  SymbolInfo
class  TagReducePlugin
class  TPlugin
 Pure virtual methods cannot be used here because of partial inheritance.
class  TurglemPlugin

Typedefs

typedef unsigned int LanguageMask
typedef unsigned char SymbolType
typedef unsigned char AlgorithmType
typedef unsigned short Language

Enumerations

enum  MethodType {
  MT_NONE, SPLIT, LDET, NORM,
  POS, TAGS, HCRC, WTYPE
}
enum  PluginType {
  PT_NONE, DEFAULT, SNOWBALL_NORM, ICU_BOUNDARIES,
  BOOST_SPLIT, BOOST_TOKENIZER, POCO_TOKENIZER, CHROMIUM_LDET,
  TAG_REDUCE, MECAB_NORM, TURGLEM_POS, FAKE_HCRC,
  MECAB_WTYPE, MECAB_TOKENIZE
}
enum  Language_enum {
  ENGLISH = 0, JAPANESE, RUSSIAN, UKRAINIAN,
  BYELORUSSIAN = UKRAINIAN, GERMAN, FRENCH, THAI
}
enum  LanguageMask_enum {
  LM_NONE = 0, LM_ENGLISH = 1 << ENGLISH, LM_JAPANESE = 1 << JAPANESE, LM_RUSSIAN = 1 << RUSSIAN,
  LM_UKRAINIAN = 1 << UKRAINIAN, LM_BYELORUSSIAN = 1 << BYELORUSSIAN, LM_GERMAN = 1 << GERMAN, LM_FRENCH = 1 << FRENCH,
  LM_THAI = 1 << THAI, LM_ALL = (LanguageMask) (-1)
}
enum  Status { METHOD_FOUND, METHOD_NOT_FOUND, PLUGIN_NOT_FOUND }
enum  SymbolType_enum { SI_COMMON = 0, SI_DELIMITER = 1, SI_DIGIT = 2 }
enum  part_of_speech {
  NOUN = 0, ADJ_FULL = 1, VERB = 2, VBE = 3,
  MOD = 4, NUMERAL = 5, CONJ = 6, INTERJ = 7,
  PREP = 8, PARTICLE = 9, ART = 10, ADV = 11,
  PN = 12, ORDNUM = 13, PRON = 14, POSS = 15,
  PN_ADJ = 16
}
enum  grammem {
  Plural = 0, Singular = 1, Masculinum = 2, Feminum = 3,
  Animative = 4, Perfective = 5, Nominative = 6, ObjectCase = 7,
  Narrative = 8, Geographics = 9, Proper = 10, PersonalPronoun = 11,
  Possessive = 12, Predicative = 13, Uncountable = 14, ReflexivePronoun = 15,
  DemonstrativePronoun = 16, Mass = 17, Comparativ = 18, Supremum = 19,
  FirstPerson = 20, SecondPerson = 21, ThirdPerson = 22, PresentIndef = 23,
  Infinitive = 24, PastIndef = 25, PastParticiple = 26, Gerund = 27,
  Futurum = 28, Conditional = 29, ApostropheS = 30, Apostrophe = 31,
  Names = 32, Organisation = 33
}

Functions

template<class T >
const Language detectLanguage (const T &token)

Typedef Documentation

typedef unsigned char HCE::component::AlgorithmType

Definition at line 23 of file SymbolInfo.hpp.

typedef unsigned short HCE::component::Language

Definition at line 24 of file SymbolInfo.hpp.

typedef unsigned int HCE::component::LanguageMask

Definition at line 93 of file ProcessInfo.hpp.

typedef unsigned char HCE::component::SymbolType

Definition at line 21 of file SymbolInfo.hpp.

Enumeration Type Documentation

enum HCE::component::grammem

Enumerator:
Plural 
Singular 
Masculinum 
Feminum 
Animative 
Perfective 
Nominative 
ObjectCase 
Narrative 
Geographics 
Proper 
PersonalPronoun 
Possessive 
Predicative 
Uncountable 
ReflexivePronoun 
DemonstrativePronoun 
Mass 
Comparativ 
Supremum 
FirstPerson 
SecondPerson 
ThirdPerson 
PresentIndef 
Infinitive 
PastIndef 
PastParticiple 
Gerund 
Futurum 
Conditional 
ApostropheS 
Apostrophe 
Names 
Organisation 

Definition at line 61 of file TurglemPlugin.hpp.

enum HCE::component::Language_enum

Language constants.

Enumerator:
ENGLISH 
JAPANESE 
RUSSIAN 
UKRAINIAN 
BYELORUSSIAN 
GERMAN 
FRENCH 
THAI 

Definition at line 99 of file ProcessInfo.hpp.

enum HCE::component::LanguageMask_enum

Enumerator:
LM_NONE 
LM_ENGLISH 
LM_JAPANESE 
LM_RUSSIAN 
LM_UKRAINIAN 
LM_BYELORUSSIAN 
LM_GERMAN 
LM_FRENCH 
LM_THAI 
LM_ALL 

Definition at line 112 of file ProcessInfo.hpp.
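
Because each LM_* value is defined as 1 << the matching Language_enum value, a LanguageMask can be tested with plain bit operations. The following is a minimal sketch, not library code: the enumerations are mirrored locally inside a hypothetical `sketch` namespace so the block compiles on its own, and `maskContains` is an assumed helper name. Note that BYELORUSSIAN is declared equal to UKRAINIAN, so both share one bit and GERMAN continues from 4.

```cpp
// Local mirror of the enumerations listed above, declared here only so the
// sketch compiles standalone; real code would include ProcessInfo.hpp.
namespace sketch {

typedef unsigned int LanguageMask;

enum Language_enum {
  ENGLISH = 0, JAPANESE, RUSSIAN, UKRAINIAN,
  BYELORUSSIAN = UKRAINIAN, GERMAN, FRENCH, THAI
};

enum LanguageMask_enum {
  LM_NONE = 0,
  LM_ENGLISH = 1 << ENGLISH,
  LM_JAPANESE = 1 << JAPANESE,
  LM_RUSSIAN = 1 << RUSSIAN,
  LM_UKRAINIAN = 1 << UKRAINIAN,
  LM_BYELORUSSIAN = 1 << BYELORUSSIAN,
  LM_GERMAN = 1 << GERMAN,
  LM_FRENCH = 1 << FRENCH,
  LM_THAI = 1 << THAI,
  LM_ALL = (LanguageMask)(-1)
};

// Hypothetical helper (not part of the library): true when the bit for
// `lang` is set in `mask`.
inline bool maskContains(LanguageMask mask, Language_enum lang) {
  return (mask & (1u << lang)) != 0;
}

}  // namespace sketch
```

A mask such as `sketch::LM_RUSSIAN | sketch::LM_GERMAN` can then be passed to `maskContains` together with a single language value.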

enum HCE::component::MethodType

Method type.

Enumerator:
MT_NONE 
SPLIT 
LDET 
NORM 
POS 
TAGS 
HCRC 
WTYPE 

Definition at line 27 of file ProcessInfo.hpp.

enum HCE::component::part_of_speech

Enumerator:
NOUN 
ADJ_FULL 
VERB 
VBE 
MOD 
NUMERAL 
CONJ 
INTERJ 
PREP 
PARTICLE 
ART 
ADV 
PN 
ORDNUM 
PRON 
POSS 
PN_ADJ 

Definition at line 39 of file TurglemPlugin.hpp.

enum HCE::component::PluginType

Plugin type.

Enumerator:
PT_NONE 
DEFAULT 
SNOWBALL_NORM 
ICU_BOUNDARIES 
BOOST_SPLIT 
BOOST_TOKENIZER 
POCO_TOKENIZER 
CHROMIUM_LDET 
TAG_REDUCE 
MECAB_NORM 
TURGLEM_POS 
FAKE_HCRC 
MECAB_WTYPE 
MECAB_TOKENIZE 

Definition at line 52 of file ProcessInfo.hpp.

enum HCE::component::Status

Enumerator:
METHOD_FOUND 
METHOD_NOT_FOUND 
PLUGIN_NOT_FOUND 

Definition at line 183 of file ProcessInfo.hpp.
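
A caller that receives a Status will typically branch on its three values. The sketch below shows one way to do that with a switch. It mirrors the enumeration inside a hypothetical `sketch` namespace so it compiles standalone, and `describeStatus` is an assumed helper name, not a library function.

```cpp
#include <string>

namespace sketch {

// Local mirror of HCE::component::Status for a standalone example.
enum Status { METHOD_FOUND, METHOD_NOT_FOUND, PLUGIN_NOT_FOUND };

// Hypothetical helper (not part of the library): maps a Status value to a
// short message suitable for a log line.
inline std::string describeStatus(Status status) {
  switch (status) {
    case METHOD_FOUND:     return "method found";
    case METHOD_NOT_FOUND: return "method not found";
    case PLUGIN_NOT_FOUND: return "plugin not found";
  }
  return "unknown status";  // Reached only for values outside the enum.
}

}  // namespace sketch
```

Listing every enumerator without a `default` label lets most compilers warn when a new Status value is added and left unhandled.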

enum HCE::component::SymbolType_enum

Enumerator:
SI_COMMON 
SI_DELIMITER 
SI_DIGIT 

Definition at line 58 of file SymbolInfo.hpp.
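
The SymbolType typedef (unsigned char) is wide enough to store any SymbolType_enum value. As an illustration only, the sketch below classifies a single ASCII character into the three categories. The classification rule (digits, then whitespace or punctuation as delimiters, everything else common) is an assumption made for this example and is not taken from SymbolInfo.hpp; `classifyAscii` and the `sketch` namespace are hypothetical names.

```cpp
#include <cctype>

namespace sketch {

// Local mirror of the typedef and enumeration for a standalone example.
typedef unsigned char SymbolType;
enum SymbolType_enum { SI_COMMON = 0, SI_DELIMITER = 1, SI_DIGIT = 2 };

// Hypothetical classifier (assumed rule, not the library's implementation).
inline SymbolType classifyAscii(char c) {
  // <cctype> functions require a value representable as unsigned char.
  const unsigned char uc = static_cast<unsigned char>(c);
  if (std::isdigit(uc)) return SI_DIGIT;
  if (std::isspace(uc) || std::ispunct(uc)) return SI_DELIMITER;
  return SI_COMMON;
}

}  // namespace sketch
```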

Function Documentation

template<class T >
const Language HCE::component::detectLanguage ( const T &  token)
inline

Definition at line 108 of file CldPlugin.cpp.
