Paper Paper Reading Notes #01: Attention Is All You Need This is a new series of my notes on paper reading, covering various areas in computer architecture, algorithms, and machine learning. The first paper is Attention Is All You Need from Google that
Tensorflow Tensorflow workflow Explanation Data Dataset Create dataset Prepossess dataset Mapping features and label to dataset Shuffle the dataset Create batched data Create an iterator for loading data Use the iterator for feeding data or in a