Skip to main content

Train Classifier - Features Tab

Abstract

Train Classifier - Features Tab

The Features tab will be visible inside the Status & Settings tab and you can select it from there (1).

26411132.png

In its Features section you see the three options with sliders, Terms, Concepts and Shadow Concepts.

They determine details to be taken into account by the classifier once you added documents and selected an algorithm to use.

  • Terms is the default option turned on, as it refers to any terms occurring in your documents and related to your categories.

  • Concepts refers to the project's thesaurus. If you set this option to On, the concepts of your thesaurus will be taken into account during classification.

  • Shadow Concepts is the function PoolParty offers as a special feature, for the classifier too. It means that terms that frequently occur in the vicinity of your actual concepts are taken into special account as possible concepts, so-called 'shadow concepts'.

Note

Because of the machine learning algorithms' nature to learn and train as well as the complexity of possible results you have to find out in training your classifiers, which settings to use for best possible results.

The Corpora section (2) will display corpora and their quality if you have created any and also have let them be calculated in the Corpus Management.

The table columns refer to the values that relate to the corpus as such.

The corpora can be additionally part of the calculation as documents for training the classifier.