By themselves, BigramTagger and TrigramTagger perform quite poorly. This is
partly because they cannot learn context from the first word(s) in a sentence. Since
a UnigramTagger class doesn't care about the previous context, it is able to have
higher baseline accuracy by simply guessing the most common tag for each word.