About the concept

The general idea is to classify text into labels that the model was never explicitly trained on. In other words, we can assign an appropriate label to a piece of text without having provided any training examples beforehand. This approach is called zero-shot learning.


Advantages

There are multiple advantages, including:

No need for training data. You can classify text quickly without spending time annotating additional examples, which solves the “cold start” problem.

Unlimited possibilities for labels. You can define any label you are interested in finding in your data, such as emotions, topics, sentiment, or personality traits.

High-quality results. The data set can be classified rapidly without prior training, with results similar to those of a deep learning model trained on large amounts of data.


Key elements

To create any model using Symanto Brain, two key elements are required:

  • Classes, also called labels, between which the text is classified, e.g. NEGATIVE, POSITIVE

  • Patterns, also known as label descriptors, which enable semantic matching between the analysed text and the different labels, e.g. This text is {}

 

Input: The product has no issues but the packaging causes so much extra to squirt out and you can't stop it. For how expensive it is it's such a waste.

Result: THIS TEXT IS NEGATIVE

With these two elements, the zero-shot model can rapidly classify a data set without prior training, with results similar to those of a deep learning model trained on large amounts of data.
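
The way labels and a pattern combine can be illustrated with a short Python sketch. This is not the Symanto Brain API: the function names (build_hypotheses, classify, toy_score) and the cue-word table are hypothetical, and the scorer is a toy keyword counter standing in for the semantic matching that a real zero-shot model performs. The sketch only shows the mechanics: the {} placeholder in the pattern is filled with each label, each filled-in sentence is scored against the input text, and the best-scoring label is returned.

```python
import re
from typing import Callable, Dict, List


def build_hypotheses(labels: List[str], pattern: str) -> Dict[str, str]:
    """Fill the pattern's {} placeholder with each label."""
    return {label: pattern.format(label) for label in labels}


def classify(text: str, labels: List[str], pattern: str,
             score: Callable[[str, str], float]) -> str:
    """Return the label whose filled-in pattern scores highest against the text."""
    hypotheses = build_hypotheses(labels, pattern)
    scores = {label: score(text, hyp) for label, hyp in hypotheses.items()}
    return max(scores, key=scores.get)


# Hypothetical cue words, used only so the sketch runs without a model.
TOY_CUES = {
    "NEGATIVE": {"waste", "expensive"},
    "POSITIVE": {"great", "love"},
}


def toy_score(text: str, hypothesis: str) -> int:
    """Toy stand-in for semantic matching: count cue words for the hypothesis label."""
    label = hypothesis.split()[-1]  # the label is the last word of the filled pattern
    words = set(re.findall(r"[a-z']+", text.lower()))
    return len(words & TOY_CUES.get(label, set()))


text = ("The product has no issues but the packaging causes so much extra "
        "to squirt out and you can't stop it. For how expensive it is "
        "it's such a waste.")
labels = ["NEGATIVE", "POSITIVE"]
pattern = "This text is {}"

best = classify(text, labels, pattern, toy_score)
print(pattern.format(best))
```

In a real setup, toy_score would be replaced by a model that judges how well each filled-in pattern matches the text. The labels and the pattern stay the only things you need to supply.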


Examples

Please head over to Use Case Examples to view some common use cases.



Symanto Brain Knowledge Base 2022