Some Terminologies:

NLP Task:

  1. Dataset
  2. Text Preprocessing
    1. Tokenization, Lowering the words
    2. Stemming, Lemmatization, Stop Words
  3. Words to Vectors (BOW | TFIDF | Word2Vec)

One Hot Encoding:

BOW (Bag Of Words):

D1 → He is a good boy

D2 → She is a good girl

D3 → Boy and girl are good