DISQUS

Andy Beard - Internet Business Systems Discussion: Akismet – the Danger of Collective Intelligence (and why I don't use it)

  • Tim Nash · 2 years ago
    About a week ago I noticed that almost no comments I made were getting through on many blogs, including my own a quick email had it sorted out, though when I asked how I triggered the black listing, I got no answer, though I think it could be down to regularly changing the name field I post comments under various names Tim Nash, Venture Skills, Vskills Team depending on the blog and subject, normally always with the same URL but sometimes a non personal email address if I'm unsure of the blog. These variants I suspect is what put me in a sin bin rather then URL based spam. Still its rather amusing to be blocked from commenting on your own blogs (well once its sorted it is)
  • Tim Nash · 2 years ago
    About a week ago I noticed that almost no comments I made were getting through on many blogs, including my own a quick email had it sorted out, though when I asked how I triggered the black listing, I got no answer, though I think it could be down to regularly changing the name field I post comments under various names Tim Nash, Venture Skills, Vskills Team depending on the blog and subject, normally always with the same URL but sometimes a non personal email address if I'm unsure of the blog. These variants I suspect is what put me in a sin bin rather then URL based spam. Still its rather amusing to be blocked from commenting on your own blogs (well once its sorted it is)
  • Matt · 2 years ago
    Bloggers which report things as spam which aren't, which is not uncommon, are devalued in the Akismet system. That's how it's been from day 1, and numerous attempts to poison Akismet have failed. The advantage of so many people using Akismet is it becomes very difficult for any one person to influence the system.

    However what you don't mention, and I think is far more likely scenario, is that a spam filter gives people plausible deniability for deleting a comment from their site. It's awkward to say "I didn't like your comment" so people just blame it on the computer third-party. This used to really confuse me, as people who had been "blocked" would contact Akismet and we deeply investigate every reported FP, but the logs were showing that they weren't ever marked as spam.
  • Matt · 2 years ago
    Bloggers which report things as spam which aren't, which is not uncommon, are devalued in the Akismet system. That's how it's been from day 1, and numerous attempts to poison Akismet have failed. The advantage of so many people using Akismet is it becomes very difficult for any one person to influence the system.

    However what you don't mention, and I think is far more likely scenario, is that a spam filter gives people plausible deniability for deleting a comment from their site. It's awkward to say "I didn't like your comment" so people just blame it on the computer third-party. This used to really confuse me, as people who had been "blocked" would contact Akismet and we deeply investigate every reported FP, but the logs were showing that they weren't ever marked as spam.
  • AndyBeard · 2 years ago
    That doesn't explain why I would notice an increase in comments being held for moderation on sites where I make comments on a fairly regular basis, even though those comments are pulled out of moderation.

    Why would I notice specific things such as providing a deeper link through to more relevant content, or making a longer comment affect my ability to escape the moderation queue?

    There are advantages to collective intelligence, and I did query how much weighting is transferred.

    Since the beginning of December I have tracked approximately 450 comments I have made in various places. I would estimate 100 to 150 of those are on Wordpress installations running Akismet.

    I would estimate that of those comments less than 5 have not appeared after a period of time, yet I still have comments that hit the moderation queue on blogs where I have previously commented without problems, and it seems extremely erratic.
    I am referring to fresh posts, no links in the body of the comment, and comments of a reasonable length that as far as I am aware didn't contain words that might trip up filters.

    With that amount of comment history I would expect the collective intelligence to now be working in my favor, allowing me to include a link in a comment even on sites such as Darren's, or to be able to write a 500+ word comment without it going in the moderation queue.

    Please forgive my lack of faith in filters that rely on global databases, but having spent a lot of time trying to train Gmail into allowing the email I want to get to me, false positives are a major sticking point.

    I expect problems on a first comment, or maybe a comment to an older post, but not to new posts on blogs where my comments are always accepted.
  • AndyBeard · 2 years ago
    That doesn't explain why I would notice an increase in comments being held for moderation on sites where I make comments on a fairly regular basis, even though those comments are pulled out of moderation.

    Why would I notice specific things such as providing a deeper link through to more relevant content, or making a longer comment affect my ability to escape the moderation queue?

    There are advantages to collective intelligence, and I did query how much weighting is transferred.

    Since the beginning of December I have tracked approximately 450 comments I have made in various places. I would estimate 100 to 150 of those are on Wordpress installations running Akismet.

    I would estimate that of those comments less than 5 have not appeared after a period of time, yet I still have comments that hit the moderation queue on blogs where I have previously commented without problems, and it seems extremely erratic.
    I am referring to fresh posts, no links in the body of the comment, and comments of a reasonable length that as far as I am aware didn't contain words that might trip up filters.

    With that amount of comment history I would expect the collective intelligence to now be working in my favor, allowing me to include a link in a comment even on sites such as Darren's, or to be able to write a 500+ word comment without it going in the moderation queue.

    Please forgive my lack of faith in filters that rely on global databases, but having spent a lot of time trying to train Gmail into allowing the email I want to get to me, false positives are a major sticking point.

    I expect problems on a first comment, or maybe a comment to an older post, but not to new posts on blogs where my comments are always accepted.
  • Matt · 2 years ago
    Moderation has nothing to do with Akismet, that's other settings inside of WP. For example on your Options > Discussion page you have things like "Moderate comments with more than X links."
  • Matt · 2 years ago
    Moderation has nothing to do with Akismet, that's other settings inside of WP. For example on your Options > Discussion page you have things like "Moderate comments with more than X links."
  • AndyBeard · 2 years ago
    So are you stating that number of links in the body, length of comment, whether you supply an optional URL and how deep it goes are not factors within the Akismet algorithms?

    From what I remember, on comments on Problogger they just don't appear if they have hit a hurdle.

    With Spam Karma, comments that almost pass end up in the normal moderation queue, and I thought that was the same with Akismet
  • AndyBeard · 2 years ago
    So are you stating that number of links in the body, length of comment, whether you supply an optional URL and how deep it goes are not factors within the Akismet algorithms?

    From what I remember, on comments on Problogger they just don't appear if they have hit a hurdle.

    With Spam Karma, comments that almost pass end up in the normal moderation queue, and I thought that was the same with Akismet
  • Matt · 2 years ago
    Akismet is binary, it either says something is spam or it isn't. The only time it may send something to the moderation queue is if the service is down.
  • Matt · 2 years ago
    Akismet is binary, it either says something is spam or it isn't. The only time it may send something to the moderation queue is if the service is down.
  • Adrian Sutton · 2 years ago
    Matt,
    Suggesting that Akismet isn't to blame for the false positives and that the comments were just deleted seems pretty suspicious to me considering that just recently I was being blocked from Scoble's blog - where he doesn't delete comments that criticise him, nor were my comments critical - and you investigated the problem and resolved it. Further, a collegue of mine who uses Akismet confirmed that it marked all of my comments on his blog as spam. See http://www.symphonious.net/2007/02/02/scoble-yo...

    Also, Akismet may be separate to moderation, but if Akismet marks something as spam and later the site owner recovers the comment, it looks as if the comment has been held in moderation, which is what Andy has most likely been seeing.

    Finally, Andy the "Post" button for comments does nothing in NetNewsWire (uses the Safari rendering engine).
  • Adrian Sutton · 2 years ago
    Matt,
    Suggesting that Akismet isn't to blame for the false positives and that the comments were just deleted seems pretty suspicious to me considering that just recently I was being blocked from Scoble's blog - where he doesn't delete comments that criticise him, nor were my comments critical - and you investigated the problem and resolved it. Further, a collegue of mine who uses Akismet confirmed that it marked all of my comments on his blog as spam. See http://www.symphonious.net/2007/02/02/scoble-yo...

    Also, Akismet may be separate to moderation, but if Akismet marks something as spam and later the site owner recovers the comment, it looks as if the comment has been held in moderation, which is what Andy has most likely been seeing.

    Finally, Andy the "Post" button for comments does nothing in NetNewsWire (uses the Safari rendering engine).
  • AndyBeard · 2 years ago
    Adrian,

    Thanks for stopping by - Robert's is another blog that most of the time my comments appear without problem, yet sometimes they don't.

    I don't chase Robert over it, because he has enough to deal with. Robert definitely doesn't delete critical comments, and I have racked up a couple.

    In fact one fairly serious comment didn't get through Akismet, even without a link, and I just made sure I included the warning in a later followup comment. Who knows the potential harm to the thread of discussion that missed comment made.

    Maybe coComment integration is causing a problem, I will try switching off - note you may have to visit my comments policy sometime soon to place comments here - just working out the code to enforce people at least view it once.
  • AndyBeard · 2 years ago
    Adrian,

    Thanks for stopping by - Robert's is another blog that most of the time my comments appear without problem, yet sometimes they don't.

    I don't chase Robert over it, because he has enough to deal with. Robert definitely doesn't delete critical comments, and I have racked up a couple.

    In fact one fairly serious comment didn't get through Akismet, even without a link, and I just made sure I included the warning in a later followup comment. Who knows the potential harm to the thread of discussion that missed comment made.

    Maybe coComment integration is causing a problem, I will try switching off - note you may have to visit my comments policy sometime soon to place comments here - just working out the code to enforce people at least view it once.
  • Matt · 2 years ago
    I would never suggest that Akismet never makes mistakes. In fact, more than anyone else in the world I know exactly how many mistakes it makes every day. What's different about it isn't that it doesn't mess up (even five nines of accuracy would be a few dozen mistakes a day, with the amount we process) but that it adapts and learns quickly.

    If you think a comment has been caught, just email the person and ask them to submit it as a false positive if it was caught. It's not personal, and it probably has very little to do with what you do (number of links, etc) and far more to do with what spammers are doing. (Remember only 6% of the comments we process are legit.)
  • Matt · 2 years ago
    I would never suggest that Akismet never makes mistakes. In fact, more than anyone else in the world I know exactly how many mistakes it makes every day. What's different about it isn't that it doesn't mess up (even five nines of accuracy would be a few dozen mistakes a day, with the amount we process) but that it adapts and learns quickly.

    If you think a comment has been caught, just email the person and ask them to submit it as a false positive if it was caught. It's not personal, and it probably has very little to do with what you do (number of links, etc) and far more to do with what spammers are doing. (Remember only 6% of the comments we process are legit.)
  • Mark Jaquith · 2 years ago
    My main problem is that I get far too many spam comments to peruse the log and rescue the FNs. 400+ a day. I'd imagine Scoble gets thousands a day.

    My first step toward a solution to that problem is this plugin, which notifies users when their comments are moderated or put into the spam bin. When something goes into the spam bin, I immediately shoot them a note that encourages them to contact me and ask me to rescue their comment.

    The next thing I'm considering is a plugin that goes through all comments marked as spam and deletes the really obvious spam. Akismet is an imperial "thumbs up" or "thumbs down" decision, so there's no way to separate the SPAM from the maybe-spam. If I could eliminate 80% of the spam log on sight, I might be able to skim the Akismet spam log for false positives.

    Even a simple blacklist (perhaps making the built-in WP blacklist delete caught comments instead of putting them in the spam bin) would likely help.
  • Mark Jaquith · 2 years ago
    My main problem is that I get far too many spam comments to peruse the log and rescue the FNs. 400+ a day. I'd imagine Scoble gets thousands a day.

    My first step toward a solution to that problem is this plugin, which notifies users when their comments are moderated or put into the spam bin. When something goes into the spam bin, I immediately shoot them a note that encourages them to contact me and ask me to rescue their comment.

    The next thing I'm considering is a plugin that goes through all comments marked as spam and deletes the really obvious spam. Akismet is an imperial "thumbs up" or "thumbs down" decision, so there's no way to separate the SPAM from the maybe-spam. If I could eliminate 80% of the spam log on sight, I might be able to skim the Akismet spam log for false positives.

    Even a simple blacklist (perhaps making the built-in WP blacklist delete caught comments instead of putting them in the spam bin) would likely help.
  • AndyBeard · 2 years ago
    I will have to give that a little test run. Maybe an option is to provide a get out of jail card that will either send an email with confirmation click, or move the comment to the normal moderation queue.

    Even when people fill in my captcha they often still end up in the moderation queue, but it is very rare that I have to pull comments from the black list in Spam Karma. The worst is when someone pre-prepares a comment, pastes it in, it gets caught because they have not been on the page long enough, and then they write another comment saying "I left a comment but it was caught by your filter" - I have had some weird database bugs when people have managed to get their personal domain banned.
  • AndyBeard · 2 years ago
    I will have to give that a little test run. Maybe an option is to provide a get out of jail card that will either send an email with confirmation click, or move the comment to the normal moderation queue.

    Even when people fill in my captcha they often still end up in the moderation queue, but it is very rare that I have to pull comments from the black list in Spam Karma. The worst is when someone pre-prepares a comment, pastes it in, it gets caught because they have not been on the page long enough, and then they write another comment saying "I left a comment but it was caught by your filter" - I have had some weird database bugs when people have managed to get their personal domain banned.
  • DianeV · 2 years ago
    Before trying Akismet, I'd implemented a little spam comment word blocker, which was an update of a little code I found in the WordPress.org forums. It did on occasion make posting comments difficult, and so I was thrilled to find how effective Akismet is.

    However, after a while I tired of scanning the Akismet queue, so I partially re-enabled it. Since I don't want to be snagged for links, here's the unlinked URL:

    http://developedtraffic.com/2007/02/11/wordpres...

    There are links to the full post about the word blocker in the post.
  • DianeV · 2 years ago
    Before trying Akismet, I'd implemented a little spam comment word blocker, which was an update of a little code I found in the WordPress.org forums. It did on occasion make posting comments difficult, and so I was thrilled to find how effective Akismet is.

    However, after a while I tired of scanning the Akismet queue, so I partially re-enabled it. Since I don't want to be snagged for links, here's the unlinked URL:

    http://developedtraffic.com/2007/02/11/wordpres...

    There are links to the full post about the word blocker in the post.
  • AndyBeard · 2 years ago
    Just edited it to make the link live.

    The content of comments here doesn't affect what gets through unless you go crazy. I think I set the penalty low for links, and only if there are more than 4 links in the content.

    There are actually plugins available that use Akismet as a variable in their overall calculations, rather than as a sole judge.

    That little hack might be useful to block some of the spam but I like being able to write
    [code]a little bit of code in my comments[/code]
  • AndyBeard · 2 years ago
    Just edited it to make the link live.

    The content of comments here doesn't affect what gets through unless you go crazy. I think I set the penalty low for links, and only if there are more than 4 links in the content.

    There are actually plugins available that use Akismet as a variable in their overall calculations, rather than as a sole judge.

    That little hack might be useful to block some of the spam but I like being able to write
    [code]a little bit of code in my comments[/code]
  • Tim Linden · 2 years ago
    I agree about the requiring signup policy. I don't bother with blogs that require a signup or block comments altogether (unless its very important).
  • Tim Linden · 2 years ago
    I agree about the requiring signup policy. I don't bother with blogs that require a signup or block comments altogether (unless its very important).
  • DianeV · 2 years ago
    Andy, I know what you mean.

    I'd noticed that much comment spam contains both normal links and BB code links, I just thought blocking the BB code for URLs would stop an awful lot of it from being posted in the first place. But that "word blocker" doesn't block normal HTML links.

    I hadn't considered wanting to write BB code in comments, since those don't work on my blogs.
  • DianeV · 2 years ago
    Andy, I know what you mean.

    I'd noticed that much comment spam contains both normal links and BB code links, I just thought blocking the BB code for URLs would stop an awful lot of it from being posted in the first place. But that "word blocker" doesn't block normal HTML links.

    I hadn't considered wanting to write BB code in comments, since those don't work on my blogs.
  • Dawud Miracle · 2 years ago
    Andy. I just wrote a post that I'll publish in the next day or two about a comment I left at a corporate blog that was shut down because I had a differing opinion of what their company holds. Sorry it's not posted yet to see. But before the weekend's end.
  • Dawud Miracle · 2 years ago
    Andy. I just wrote a post that I'll publish in the next day or two about a comment I left at a corporate blog that was shut down because I had a differing opinion of what their company holds. Sorry it's not posted yet to see. But before the weekend's end.
  • Lars Koudal · 2 years ago
    I have been very afraid of Akismet providing false positives, especially since I was running it with Bad Behaviour. It might have been a bit excessive, but I was getting a lot of referral spam.

    I found the Comment Guard WP plugin by accident, and have it running on my blog right now. I am very positive with my experience so far, and have had great support from Angsuman with some technical problems. Read about it here:

    http://blog.taragana.com/index.php/archive/word...

    It is now in beta, but I think it is quite easy to get in to the program.
  • Lars Koudal · 2 years ago
    I have been very afraid of Akismet providing false positives, especially since I was running it with Bad Behaviour. It might have been a bit excessive, but I was getting a lot of referral spam.

    I found the Comment Guard WP plugin by accident, and have it running on my blog right now. I am very positive with my experience so far, and have had great support from Angsuman with some technical problems. Read about it here:

    http://blog.taragana.com/index.php/archive/word...

    It is now in beta, but I think it is quite easy to get in to the program.
  • engtech · 2 years ago
    Wow, for a blog post about commenting I had a really hard time finding out how to comment. There was two screen worths between the post and the comments, and there is another screen worth AFTER where the comment box should be. It made if very hard to see the sections where it says you have to read the comment policy to enable the form.

    Just a though, do a "fake" comment form that is read only input where the real form should be that clearly indicates the comment policy info -- that will make it jump out.

    ---

    on to my comment!

    ---

    One thing I'd *really* like Akismet to start doing is be "less binary". I'd like to more easily ignore the stuff that is guaranteed spam and focus on reviewing the stuff that may or may not be spam.
  • engtech · 2 years ago
    Wow, for a blog post about commenting I had a really hard time finding out how to comment. There was two screen worths between the post and the comments, and there is another screen worth AFTER where the comment box should be. It made if very hard to see the sections where it says you have to read the comment policy to enable the form.

    Just a though, do a "fake" comment form that is read only input where the real form should be that clearly indicates the comment policy info -- that will make it jump out.

    ---

    on to my comment!

    ---

    One thing I'd *really* like Akismet to start doing is be "less binary". I'd like to more easily ignore the stuff that is guaranteed spam and focus on reviewing the stuff that may or may not be spam.
  • AndyBeard · 2 years ago
    Apologies that you found it a little difficult to find - the cookie based system was only hacked in yesterday and certainly needs some tweaking.
    I didn't intend to implement it as quickly, but some manual spam managed to get emailed to some comment subscribers. It will become slightly more streamlined, especially for posts that already have quite a few comments.
  • AndyBeard · 2 years ago
    Apologies that you found it a little difficult to find - the cookie based system was only hacked in yesterday and certainly needs some tweaking.
    I didn't intend to implement it as quickly, but some manual spam managed to get emailed to some comment subscribers. It will become slightly more streamlined, especially for posts that already have quite a few comments.
  • Artem · 2 years ago
    Well, what is the alternative of Akismet, Andy? My blog receive more than 20 spam comments per day. I was tired to delete it until Akismet was installed. What is the alternative?
  • AndyBeard · 2 years ago
    What I currently use is Spam Karma, and optionally you can use Akismet as an additional filter within Spam Karma.
  • Artem · 2 years ago
    Well, what is the alternative of Akismet, Andy? My blog receive more than 20 spam comments per day. I was tired to delete it until Akismet was installed. What is the alternative?
  • AndyBeard · 2 years ago
    What I currently use is Spam Karma, and optionally you can use Akismet as an additional filter within Spam Karma.
  • Artem · 2 years ago
    Thank you for this, Andy.
  • Artem · 2 years ago
    Thank you for this, Andy.
  • engtech · 2 years ago
    I wonder if Akismet takes the amount of time on the page into consideration like how Spam Karma does?

    That might be the source of my problems... I'm a very fast commenter :)

    I wrote a piece for Lorelle On WordPress where I talked about two things you can do if Akismet is always marking your own comments as spam.

    http://lorelle.wordpress.com/2007/09/25/help-ak...
  • AndyBeard · 2 years ago
    Engtech

    I am actually quite sick of Akismet, because I haven't got time to monitor every blog I link to to see if a trackback was pulled out of the sin bin.

    It happened recently on Scobleizer again, and he didn't know I was part of the "conversation"

    Hmm... is Akismet sending email addresses to Akismet.com?
  • engtech · 2 years ago
    I wonder if Akismet takes the amount of time on the page into consideration like how Spam Karma does?

    That might be the source of my problems... I'm a very fast commenter :)

    I wrote a piece for Lorelle On WordPress where I talked about two things you can do if Akismet is always marking your own comments as spam.

    http://lorelle.wordpress.com/2007/09/25/help-ak...
  • AndyBeard · 2 years ago
    Engtech

    I am actually quite sick of Akismet, because I haven't got time to monitor every blog I link to to see if a trackback was pulled out of the sin bin.

    It happened recently on Scobleizer again, and he didn't know I was part of the "conversation"

    Hmm... is Akismet sending email addresses to Akismet.com?
  • DianeV · 2 years ago
    Thanks, Engtech. That happened to me on my own blog, so this will come in handy!
  • DianeV · 2 years ago
    Thanks, Engtech. That happened to me on my own blog, so this will come in handy!
  • MikeT · 1 year ago
    All I know is that my comments keep getting blocked on the wordpress.com blogs I visit...
  • MikeT · 1 year ago
    All I know is that my comments keep getting blocked on the wordpress.com blogs I visit...