Solution Development
Breadcrumbs

BoW Entity Extractor

Intended audience: analysts developers administrators

AO Platform: 4.3

Overview

This topic contains the Parameters configuration section of the BoW (Bag of Words) Entity Extractor Strategy.

image-20220506-092228.png

Properties

Label

Description

Fuzzy


Match Word Type

See table below.

Word Providers


Match Word Types

Display Name Label

Description

Dependency (DEP)

Syntactic dependency relation.

Head of token (HEAD)

Syntactic head of the token.

Lemma (LEMMA)

Base form of the token, with no inflectional suffixes.

Lexeme (LEX)

The underlying lexeme.

Lowercase (LOWER)

Lowercase form of the token.

Morphological Annotation (MORPH)

Morphological Annotation.

Normalized token (NORM)

The token’s norm, i.e. a normalized form of the token text.

ID of Token (ORTH)

ID of the verbatim text content.

Part-of-Speech (POS)

Coarse-grained part-of-speech from the Universal POS tag set.

Fine-grained part-of-speech (TAG)

Fine-grained part-of-speech.

Text (TEXT)

Verbatim text content.


Also see Testing Strategies.






Contact App Orchid | Disclaimer