Attention is all you need

ONENET: JOINT DOMAIN, INTENT, SLOT PREDICTION FOR SPOKEN LANGUAGE UNDERSTANDING

Neural Architectures for Named Entity Recognition

SEMI-SUPERVISED CLASSFICATION WITH GRAPH CONVOLUTIONAL NETWORKS

Bidirectional LSTM-CRF Models for Sequence Tagging

Language Models are Few-Shot Learners

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

StackPropagationSLU

ERNIE: Enhanced Language Representation with Informative Entities

A Simple Framework for Contrastive Learning of Visual Representation