Recall in search, we could “manipulate” the algorithm by including the same word count lots of times or using many synonyms to increase the chance of getting a search term.
Thus, we use the link text (and surrounding text) as terms instead of the actual page contents.
Forum/Comment Spam
The above approach has the issue that you can go to pages that allow posting and link your webpage (i.e. YouTube, Facebook, CBC News, etc.). As these pages have high rank, you also have high rank ⇒ you can choose terms that your web page gets.
Spam Farming
Recall that “spider traps” will accumulate rank. Random jumps prevent them from accumulating all rank, but it is still boosted by the topology.
Spam Farming involves the following technique:
As a start to solving this issue, we ignore links tagged as “nofollow”, and convince forums, new sites, etc. to insert “nofollow” to all links in comments.
Trying to identify the “farm pages” is difficult (i.e. small pages not allowed ⇒ can be made large enough to count, force small pages to include more).
Personalized Page Rank