As an example of the accuracy of this prediction, if the first 10,879,522 words of the AP89 collection are scanned, Heaps’ law predicts that the number of unique words will be 100,151, whereas the actual value is 100,024.
Predictions are much less accurate for small numbers of words (< 1,000).