Unsupervised Learning

REGRESSION

Why we need activation function

Hyperparameter Tuning

Why do we need non-linear activation functions

Weight Initialisation

Gradient Checking

Different Batch Gradients - BGD, SBGD

Vanishing Gradients

RNN

LSTM VS GRU VS RNN

Word2Vec vs Glove

Word Embeddings - SkipGram vs CBOW

Negative Sampling

Embeddings -

Given three sentences: