A while back I said that I had made a breakthrough in spam filtering and that I filed a provisional patent on my new method. I’m now ready to reveal how it works. I’m calling it the Evolution Filter. You can read a detailed explanation here.
Basically, if you are talking about things that are said in ordinary email and it’s things that spammers never say, it’s good email. And if you are talking about things that only spammers say, it’s spam.
Most spam filters are based on matching things. Bayesian filters compare the message to known ham and spam. Or you are matching rules. This filter is based on NOT MATCHING. We match ham and not match spam to test for ham. We match spam and not match ham to test for spam.
The advantage to not matching is that I’m comparing to an infinite set rather than a finite set. Or comparing to the unknown rather than the known.
I’m getting really close to 100% accuracy and there’s room to improve it. This doesn’t just stop spam, it decimates it. Read my article and your see how I do it.
Also – this is NOT BAYESIAN. I know what Bayesian is. I have Bayesian with SpamAssassin. This is not the same thing. In fact – it’s not even similar.