With that many words, it seems likely that the number of new words would eventually drop to near zero and Heaps’ law would not be applicable.
Heaps’ law provides a good fit for this data, although the parameter values are very different than those for other TREC collections and outside the boundaries established as typical with these and other smaller collections.