Reader George let me know about this interesting tool which acts as a search engine spam detector, analyzing your HTML and letting you know which of your practices might be interpreted by a search engine spider as black hat SEO. The tool tries to detect keyword stuffing, doorway pages and hidden text.
On my site, it flagged my blogroll as hidden text, because of the nifty tool I use to collapse 250+ links into a tiny space. It also didn’t like a list I made of the languages Google Talk supports, since it looked like keyword stuffing. Other than that, I was completely clean. Very nice.
How does your site do?
For fun, I’ve entered some sites and checked them out:
- Google Blogoscoped - perfectly clean
- Miel’s blog - also clean
- Matt Cutts’ blog - indeed, clean
- Microsoft.com - invisible links
- Yahoo.com - four instances of invisible links
- Ask.com - two instances of invisible links
- Jason’s blog - 12 links to doorway pages
- Boing Boing - their writing style is too similar to keyword stuffing. Hilarious!
- The official Google blog - lots of invisible text that reads “< whitespace >” and the same keyword stuffing it detected on my site, since I was just quoting them.
If you find anything interesting, post it below. It seems like the majority of the problems detected are honest mistakes. My advice: if it detects nothing that could be considered obviously sinister (like invisible links to online poker/phentermine sites), don’t worry, but if something looks suspicious, think about tweaking your HTML to remove that code.
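If you want to hunt for invisible links in your own HTML before tweaking it, here is a rough Python sketch. I have no idea how the tool itself does its checks, so this is purely my guess at one simple heuristic: it only looks at inline `style` attributes containing `display:none` or `visibility:hidden`, and the names `HiddenLinkFinder` and `find_hidden_links` are made up for this example. It ignores external stylesheets, JavaScript and text colored to match the background.

```python
from html.parser import HTMLParser

# Inline-style fragments treated as "hidden" (an assumption for this sketch).
HIDDEN_MARKERS = ("display:none", "visibility:hidden")

# Elements that never have a closing tag, so they must not go on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class HiddenLinkFinder(HTMLParser):
    """Collects href values of <a> tags that are hidden by an inline style,
    either on the link itself or on any element that contains it."""

    def __init__(self):
        super().__init__()
        self.stack = []         # (tag, hidden_by_own_style) for each open element
        self.hidden_links = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        # Normalize the style so "display: none" and "DISPLAY:NONE" both match.
        style = (attr_map.get("style") or "").replace(" ", "").lower()
        hidden = any(marker in style for marker in HIDDEN_MARKERS)
        inside_hidden = hidden or any(h for _, h in self.stack)
        if tag == "a" and inside_hidden and attr_map.get("href"):
            self.hidden_links.append(attr_map["href"])
        if tag not in VOID_TAGS:
            self.stack.append((tag, hidden))

    def handle_endtag(self, tag):
        # Pop back to the most recent matching open tag, tolerating sloppy markup.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break


def find_hidden_links(html):
    finder = HiddenLinkFinder()
    finder.feed(html)
    finder.close()
    return finder.hidden_links


sample = (
    '<div style="display: none">'
    '<a href="http://example.com/poker">poker</a></div>'
    '<p><a href="/about">About</a></p>'
)
print(find_hidden_links(sample))
```

A check this naive would flag a collapsed blogroll just as happily as a real spam link, so treat anything it prints as a list of things to look at, not a verdict.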