Skip to main content
Skip table of contents

BoW Entity Extractor

Intended audience: ANALYSTS DEVELOPERS ADMINISTRATORS

AO Platform: 4.3

Overview

This topic contains the Parameters configuration section of the BoW (Bag of Words) Entity Extractor Strategy.

Properties

Label

Description

Fuzzy

Match Word Type

See table below.

Word Providers

Match Word Types

Display Name Label

Description

Dependency (DEP)

Syntactic dependency relation.

Head of token (HEAD)

Syntactic head of the token.

Lemma (LEMMA)

Base form of the token, with no inflectional suffixes.

Lexeme (LEX)

The underlying lexeme.

Lowercase (LOWER)

Lowercase form of the token.

Morphological Annotation (MORPH)

Morphological Annotation.

Normalized token (NORM)

The token’s norm, i.e. a normalized form of the token text.

ID of Token (ORTH)

ID of the verbatim text content.

Part-of-Speech (POS)

Coarse-grained part-of-speech from the Universal POS tag set.

Fine-grained part-of-speech (TAG)

Fine-grained part-of-speech.

Text (TEXT)

Verbatim text content.

Also see Testing Strategies.


Contact App Orchid | Disclaimer

JavaScript errors detected

Please note, these errors can depend on your browser setup.

If this problem persists, please contact our support.