The corpus is organized into 15 files, where each file contains several hundred posts collected on a given date, for an age-specific chatroom (teens, 20s, 30s, 40s, plus a generic adults chatroom).
The filename contains the date, chatroom, and number of posts; e.g., The Brown Corpus was the first million-word electronic corpus of English, created in 1961 at Brown University.
Our video sections: Nude Beach, Beach Cabin, Locker Room, Upskirt, Spy Camera, WC, Shower Room.
Our gallery sections: Amateur Pictures, Voyeur Sneaks, Hardcore Pictures.
If you have a credit card (VISA, Discover, JCB) or if you can pay by online check (only US customers), you can use our traditional safe and secure payment provider CCBill.com!
You will provide credit card and personal information only to CCBill's secure site.
We can ask for the topics covered by one or more documents, or for the documents included in one or more categories.
For convenience, the corpus methods accept a single fileid or a list of fileids.
The documents have been classified into 90 topics, and grouped into two sets, called "training" and "test"; thus, the text with fileid Unlike the Brown Corpus, categories in the Reuters corpus overlap with each other, simply because a news story often covers multiple topics.The previous example also showed how we can access the "raw" text of the book Although Project Gutenberg contains thousands of books, it represents established literature.It is important to consider less formal language as well.Your information is transmitted via encryption between you and payment system.We never see your credit card or personal information.