BoW Entity Extractor

Intended audience: ANALYSTS DEVELOPERS ADMINISTRATORS

AO Platform: 4.3

Overview

This topic contains the Parameters configuration section of the BoW (Bag of Words) Entity Extractor Strategy.

Display Name Label	Description
Dependency (DEP)	Syntactic dependency relation.
Head of token (HEAD)	Syntactic head of the token.
Lemma (LEMMA)	Base form of the token, with no inflectional suffixes.
Lexeme (LEX)	The underlying lexeme.
Lowercase (LOWER)	Lowercase form of the token.
Morphological Annotation (MORPH)	Morphological Annotation.
Normalized token (NORM)	The token’s norm, i.e. a normalized form of the token text.
ID of Token (ORTH)	ID of the verbatim text content.
Part-of-Speech (POS)	Coarse-grained part-of-speech from the Universal POS tag set.
Fine-grained part-of-speech (TAG)	Fine-grained part-of-speech.
Text (TEXT)	Verbatim text content.