TL;DR: we need to load the labelled data and train a model that predicts a label from text input.

We can start with a simple model using Support Vector Machines (SVMs), then figure out how to do the same thing by fine-tuning a transformer language model.

Notes on Transformers

Notes on SVMs

Data

LABELS

Pipeline yay

learn from features

SVMs work by finding a hyperplane in a multi-dimensional space that separates samples of different classes.

→ Example: each row of the matrix above is one sample, i.e. a point in that multi-dimensional space, with one dimension per feature.
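To make the "text → feature matrix → hyperplane" idea concrete, here is a minimal sketch using scikit-learn. It is an illustration, not the actual pipeline from these notes: the toy `texts` and `labels` are invented, and TF-IDF features with `LinearSVC` are assumptions about how the features and the SVM might be set up.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Toy data invented for illustration only (NOT the real labelled dataset).
texts = [
    "refund my order please",
    "I want my money back",
    "the app crashes on startup",
    "login page shows an error",
]
labels = ["billing", "billing", "bug", "bug"]

# Step 1: turn each text into one row of numeric features.
# Assumption: TF-IDF features; the notes do not say which features are used.
vectorizer = TfidfVectorizer()
X = vectorizer.fit_transform(texts)
print(X.shape)  # (number of texts, number of distinct terms)

# Step 2: fit a linear SVM. With two classes it learns a single hyperplane,
# described by one weight per feature (coef_) plus an offset (intercept_).
svm = LinearSVC()
svm.fit(X, labels)
print(svm.coef_.shape, svm.intercept_.shape)

# Both steps chained into one pipeline, so raw text goes in and a label comes out.
model = make_pipeline(TfidfVectorizer(), LinearSVC())
model.fit(texts, labels)
prediction = model.predict(["please refund my money"])
print(prediction)
```

Each row of `X` is one text placed in a space with one dimension per term, and `svm.coef_` holds the orientation of the separating hyperplane in that same space. Wrapping both steps in a pipeline keeps the vectorizer and the classifier fitted together on the same data.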