[Spambayes] using spambayes with non-English languages !!

Andrea Medici andrea at credemtel.it
Thu Mar 18 03:47:24 EST 2004


hi all

First, I have no problem with the program. 
This message contains only informations.

"
I am using SpamBayes POP3 Proxy Version 0.4 (February 2004) (binary), 
with version 2.3.2+ (#49, Oct 23 2003, 15:50:06) [MSC v.1200 32 bit 
(Intel)] of Python; my operating system is Windows 5.1.2600.2 
(Service Pack 1).  I have trained 778 ham and 929 spam.
"

I only want to tell you, hoping this can improve your work, that in 
an Italian email environment the filter, probably, has to be trained 
a little more than usual.
Follow is my configuration file:
============
[html_ui]
display_received_time:True
display_adv_find:True
display_score:True
default_unsure_action:spam
[Headers]
notate_to:spam unsure
include_evidence:False
include_score:True
include_thermostat:False
header_score_logarithm:True
header_score_digits:3
[Storage]
messageinfo_storage_file:E:\Program 
Files\SpamBayes\proxy\spambayes.messageinfo.db
persistent_storage_file:E:\Program Files\SpamBayes\proxy\hammie.db
ham_cache:E:\Program Files\SpamBayes\proxy\pop3proxy-ham-cache
spam_cache:E:\Program Files\SpamBayes\proxy\pop3proxy-spam-cache
unknown_cache:E:\Program Files\SpamBayes\proxy\pop3proxy-unknown-
cache
[pop3proxy]
listen_ports:30000
remote_servers:172.20.13.99
[Tokenizer]
mine_received_headers:True
[Categorization]
spam_cutoff:0.25
ham_cutoff:0.2
[Categorization]
ham_cutoff:0.2
[Classifier]
max_discriminators:50
unknown_word_strength:0.4
[Classifier]
unknown_word_strength:0.4
====

As you can see I have change some default values and after
1 month and about 1800 emails, things seem to go better.
I have an hit/miss ratio of about 1 out of 100 messages as false 
negative, but NO false positive. 
I do not miss no more good messages and my PegasusMail filter out 
exactly only spam.

great!!

ciao
Andrea

--------------------------------------------------------------------
 andrea.medici at credemtel.it         CREDEMTEL S.p.A. - Gruppo CREDEM
            ph. +39 0522 271 540   fx. +39 0522 926 414
 Via R. Livatino, 9            42100 Reggio Emilia             Italy
FingerPrint   5F83 CB7C AAFF 6C44 FF4C  C5EF 7E18 3420 3CC7 067C
--------------------------------------------------------------------





More information about the Spambayes mailing list