Using web pages, create a small test collection in a language other than English.
Carry out the basic text processing steps of tokenizing, stemming, and stopping (stopword removal), using tools from the book website and from other websites.
Show examples of the index term representation of the documents.
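
As a rough illustration of what such a pipeline might look like, the sketch below runs tokenizing, stopping, and stemming over two made-up Spanish sentences. Everything in it is an assumption made for the example: Spanish is an arbitrary choice of language, the stopword list and suffix list are tiny hypothetical stand-ins, and the stemmer is a crude suffix stripper rather than any of the tools from the book website. A real solution would substitute a proper stopword list and stemmer for the chosen language.

```python
import re
import unicodedata

# Hypothetical, deliberately tiny Spanish stopword list (illustration only).
STOPWORDS = {"el", "la", "los", "las", "de", "en", "y", "un", "una", "que"}

# Hypothetical suffixes for a crude suffix-stripping stemmer, longest first.
SUFFIXES = ("aciones", "ación", "mente", "ando", "iendo", "es", "s")


def tokenize(text):
    """Normalize to NFC, lowercase, and split into runs of letters."""
    text = unicodedata.normalize("NFC", text).lower()
    # [^\W\d_]+ matches Unicode letters only (no digits, no underscore).
    return re.findall(r"[^\W\d_]+", text)


def remove_stopwords(tokens):
    """Drop tokens that appear in the stopword list."""
    return [t for t in tokens if t not in STOPWORDS]


def stem(token):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def index_terms(text):
    """Tokenize, stop, then stem: the index term representation of a text."""
    return [stem(t) for t in remove_stopwords(tokenize(text))]


# Two made-up documents standing in for text extracted from web pages.
docs = {
    "d1": "Los gatos duermen en la casa.",
    "d2": "El gato y los perros corren rápidamente.",
}

representation = {doc_id: index_terms(text) for doc_id, text in docs.items()}

for doc_id, terms in representation.items():
    print(doc_id, terms)
```

With these toy lists, d1 is represented by the terms gato, duermen, casa and d2 by gato, perro, corren, rápida. The sketch stems only the tokens that survive stopping, so the stopword list is matched against unstemmed words; reversing the order would require a stemmed stopword list.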