Vocabulary

The vocabulary is the set of unique tokens in the corpus.
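
As a minimal sketch of this definition, the vocabulary can be collected by putting every token of every document into a set. The function name build_vocabulary, the toy corpus, and the lowercase-and-split-on-whitespace tokenizer are all assumptions made for illustration; the text does not say how tokens are produced.

```python
def build_vocabulary(corpus):
    """Return the set of unique tokens across all documents in the corpus."""
    vocabulary = set()
    for document in corpus:
        # Assumption: lowercasing and whitespace splitting stand in for
        # whatever tokenizer is actually used.
        tokens = document.lower().split()
        vocabulary.update(tokens)
    return vocabulary


# Hypothetical toy corpus: two short documents.
corpus = ["the cat sat", "the dog sat down"]
vocab = build_vocabulary(corpus)

# Repeated tokens ("the", "sat") are counted once, so the vocabulary
# has 5 entries even though the corpus contains 7 tokens in total.
vocab_size = len(vocab)

# Optional: assign each token an integer id, sorted for reproducibility.
token_to_id = {token: index for index, token in enumerate(sorted(vocab))}
```

The choice of tokenizer changes the result: a different splitting or normalization rule would yield a different vocabulary for the same corpus.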