TFIDF

Term Frequency and Inverse Document Frequency

⇒ Come to solve the problem of Semantic meaning between words that is not captured using BOW

⇒ The rare words must have higher weighted when we are creating the vectors

EX:

D1 → He is a good boy

D2 → She is a good girl

D3 → Boy and girl are good

image.png

The weight of word good is zero because it’s found at every sentence