Similar to the small web. A user curated list of known spam blogs to use as a corpora for classification purposes.
This is probably naive but frequently highly ranking domains can be deranked if they have a certain percentage of content that matches the classifier.