AntiSpam Techniques
The XWiki project offers several solutions to help fight spam. These are especially useful when XWiki is used as a public site.
Correct Permissions
The first thing to do is to ensure you have not left comments open to anyone, since this will get you a lot of spam.
You should allow the Comment right only to registered users, or at least ask for Captcha for anonymous users (see below).
Here are some scripts to help you remove comment spam:
CAPTCHA
You can enable Captchas both on the registration screen and for comments.
You can also use the XWiki Catpcha module if you need a tighter integration in your own wiki pages / wiki application.
Notifications
You can choose to be notified (even in real time) about changes done to your wiki, and thus be able to quickly revert spam.
Delayed Indexing
You can implement Delayed Indexing to discourage spammers. It tells search engines not to index pages that were recently updated. On an active wiki, spam will be removed very quickly by community members. Once a page has not been edited for a while, you can assume the community approves of the page's content and thus is spam-free.
This can be achieved by editing the "HTTP META INFO" section of the "Presentation" Administration section. For example, you could decide to wait 5 hours after a page has been created before allowing it to be indexed by using:
#if ($datetool.getDate().time - $doc.getCreationDate().time < 5*24*60*60*1000)
<meta name=“robots” content=“noindex,nofollow” />
#end
AntiSpam Application
Last but not least, you could install the AntiSpam Application, which provides 2 main features:
- Ability to prevent spam from being entered (by filtering on keywords)
- UI to easily remove spam content and all content entered by a given user