From: Christopher Browne <cbbrowne@acm.org>
Subject: Re: GC, existed? the foreigner
Date: 17 Jan 2004 14:52:51 GMT
Message-ID: <bubi82$euq6c$1@ID-125932.news.uni-berlin.de>
In-Reply-To: 100glfb4e45iof0@corp.supernews.com
In the last exciting episode, "Randy Brukardt" <randy@rrsoftware.com> wrote:
> "Christopher Browne" <cbbrowne@acm.org> wrote in message
> news:bu7i55$ema5s$1@ID-125932.news.uni-berlin.de...
>> If none of your _real_ email contains words like "egret," "beseech,"
>> or "shibboleth," then it certainly won't look like "ham."
>
> The initial description of Bayesian filters included a rule that
> anything unrecognized was considered 10% chance of being spam. In
> that case, sticking any garbage into a message will help get it
> passed. I doubt that current filters work that way, but I don't know
> for sure.
Which paper provided evidence of the efficacy of that? It seems
hard to find an "initial description"; the major papers seemed to
emerge in about 1998, and even at that point, they were primarily
writing about document _classification_, not "spam detection."
I contributed to the work on Ifile back in 1996/1997 (before 1998!),
and have been using Naive Bayesian filtering ever since; there is NO
such rule in the code I use, and I have never seen such a rule in the
scientific literature.
Actually, it doesn't even make sense to suggest such a rule. Naive
Bayesian filters don't use random number generators to decide what to
do with mail; that "rule" can be of _no_ help in what is an entirely
deterministic classification process.
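
To make the determinism point concrete, here is a minimal sketch of a Naive Bayes text classifier in Python. It is not the Ifile code. The names train and classify, the whitespace tokenizer, and the add-one smoothing applied to unseen words are all assumptions made for illustration. The sketch only shows that the same training data and the same message always yield the same scores, with no random number generator anywhere.

```python
import math
from collections import Counter


def train(docs):
    """Count words per label. docs is an iterable of (label, text) pairs."""
    word_counts = {}        # label -> Counter of word frequencies
    doc_counts = Counter()  # label -> number of training documents
    for label, text in docs:
        doc_counts[label] += 1
        word_counts.setdefault(label, Counter()).update(text.lower().split())
    return word_counts, doc_counts


def classify(text, word_counts, doc_counts):
    """Return (best_label, scores), where scores maps label -> log-probability score."""
    vocab = set()
    for counts in word_counts.values():
        vocab.update(counts)
    total_docs = sum(doc_counts.values())
    scores = {}
    for label, counts in word_counts.items():
        n = sum(counts.values())
        # Start from the log prior of the class.
        score = math.log(doc_counts[label] / total_docs)
        for word in text.lower().split():
            # Add-one smoothing (an assumption of this sketch): a word never
            # seen in this class still gets a small, fixed, nonzero probability.
            score += math.log((counts[word] + 1) / (n + len(vocab)))
        scores[label] = score
    # Sorting the labels first makes tie-breaking deterministic as well.
    best = max(sorted(scores), key=scores.get)
    return best, scores


if __name__ == "__main__":
    corpus = [
        ("spam", "buy cheap pills now"),
        ("spam", "cheap pills cheap"),
        ("ham", "meeting agenda for monday"),
        ("ham", "lunch on monday"),
    ]
    wc, dc = train(corpus)
    print(classify("cheap pills", wc, dc)[0])
    print(classify("cheap pills egret beseech shibboleth", wc, dc)[0])
```

In this toy corpus both classes happen to contain the same number of words, so each unseen word lowers both scores by the same amount and the decision does not move. That is a property of this particular smoothing choice and data set, not a claim about how any real filter treats unknown words.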
--
If this was helpful, <http://svcs.affero.net/rm.php?r=cbbrowne> rate me
http://www.ntlug.org/~cbbrowne/ifilter.html
"La Cicciolina [...] Electing her was an interesting contrast to the
situation in the UK: In Italy they elect a representative from the sex
industry. In the UK, they elect their clients." -- Peter Gutmann