[Spambayes-checkins] website faq.ht, NONE, 1.1 applications.ht, 1.2, 1.3 background.ht, 1.13, 1.14 developer.ht, 1.7, 1.8 docs.ht, 1.8, 1.9 download.ht, 1.2, 1.3 index.ht, 1.11, 1.12 links.h, 1.5, 1.6

Tony Meyer anadelonbrin at users.sourceforge.net
Wed Apr 16 19:52:20 EDT 2003


Update of /cvsroot/spambayes/website
In directory sc8-pr-cvs1:/tmp/cvs-serv1396

Modified Files:
	applications.ht background.ht developer.ht docs.ht download.ht 
	index.ht links.h 
Added Files:
	faq.ht 
Log Message:
General update to cover the new applications,
the alpha2 release, and an increase in
documentation.

--- NEW FILE: faq.ht ---
Title: SpamBayes: Frequently Asked Questions
Author-Email: spambayes at python.org
Author: spambayes

<h2>Frequently Asked Questions</h2>
<ol>
<li>Development</li>
<ol>
<li><a href="#tokentrick">Hey!  Why don't you implement cool tokenizer trick X?  I think it would really foil those spammers!</a></li>
<li><a href="#serverside">This software is great!  I want to implement it for all my users. Are there plans to develop a server-side spambayes solution?</a></li>
</ol>
<li>Using Spambayes</li>
<ol>
<li><a href="#unsure">I just got a spam, but the system said it was "unsure". Why couldn't it tell that it was spam - it's obvious?</a></li>
<li><a href="#stillunsure">OK, I trained on that message. But I just got *another* one, and the stupid system still thinks it's unsure. Why did it ignore me???</a></li>
<li><a href="#wipetraining">I've mucked up my training and I want to start all over again, but there isn't an option for this anywhere.  What do I do?</a></li>
</ol>
</ol>
<p>If you have any suggestions about other questions and answers that should be included
here, please mail <a href="mailto:spambayes at python.org">the list</a> with them.</p>
<h3><a name="#tokentrick">Hey!  Why don't you implement cool tokenizer trick X?  I think it
   would really foil those spammers!</a></h3>
<p>Have you run your tokenizer trick against a set of messages to see if
   it actually works?  Many times what seems like a good idea turns out
   not to help much, and sometimes even hurts.  If you have a good idea,
   you've run it against a batch of messages and can prove that it
   helps, paste the code for your technique and the proof to the mailing
   list.  If you're not a coder, but are really keen on your idea, post
   a feature request on the project page, and wait for someone else to
   code it for you (but make sure you do some testing when it's done).
   Otherwise, you will likely get a message from Tim Peters about
   why you need to test your idea :)</p>

<h3><a name="#serverside">This software is great!  I want to implement it for all my users.
   Are there plans to develop a server-side spambayes solution?</a></h3>
<p>The problem with a server-side solution is that everyone has a
   different idea of what is spam - that's the whole strength of the
   bayesian-style filtering concept.  If you are certain that *all*
   of your users would agree on what is spam and what is not, then
   this might work for you, but otherwise you really have to have
   individual databases for each user.  Either way, you should be
   able to modify spambayes easily enough to fit into your setup.
   Please let the list know if you do have success in this area, and
   we'll update this answer.</p>

<h3><a name="#unsure">I just got a spam, but the system said it was "unsure". Why
   couldn't it tell that it was spam - it's obvious?</a></h3>
<p>It may be obvious to you, but the classifier only works on
   the information it has been given. Maybe this is "new" (you've
   never seen this particular flavour of spam before), or maybe
   there aren't enough clues in the message which the system is
   aware of as strong spam clues.</p>

<h3><a name="#stillunsure">OK, I trained on that message. But I just got *another* one,
   and the stupid system still thinks it's unsure. Why did it
   ignore me???</a></h3>
<p>It didn't, but you may need to train on a few more of this type
   of message to get it classified as "spam". The classification
   algorithm weights its results based on the number of times it
   has seen a particular clue, so that clues unique to this type
   of message may need a few more instances to become "convincing".</p>

<h3><a name="#wipetraining">I've mucked up my training and I want to start all over again,
   but there isn't an option for this anywhere.  What do I do?</a></h3>
<p>Because training from scratch is a very rare occurance, and because
   deleting all your training information is something you don't want
   to do by accident, there isn't an option for this.  However, you
   can quite simply do this manually.  All the training data is stored
   in a file, usually called hammie.db, and if you delete (or rename)
   this, then you will start training from scratch.  If you are using
   the web interface for the POP3 proxy, the configuration page tells
   you what this file is called (and where it is) down towards the
   bottom of the page.</p>

Index: applications.ht
===================================================================
RCS file: /cvsroot/spambayes/website/applications.ht,v
retrieving revision 1.2
retrieving revision 1.3
diff -C2 -d -r1.2 -r1.3
*** applications.ht	3 Mar 2003 22:24:39 -0000	1.2
--- applications.ht	17 Apr 2003 01:52:17 -0000	1.3
***************
*** 8,18 ****
  getting there (and help is always appreciated).
  </p>
! <h3>Outlook2000</h3>
! <p>Sean True and Mark Hammond have developed an addin for Outlook2000 that
  adds support for the spambayes classifier. 
  <h4>Requirements</h4>
  <ul>
! <li>Python2.2 or later (2.2.2 recommended)
! <li>Outlook 2000 (<b>not</b> Outlook Express)
  <li>Python's <a href="http://starship.python.net/crew/mhammond">win32com</a>
  extensions (win32all-149 or later - currently ActivePython is not suitable)
--- 8,18 ----
  getting there (and help is always appreciated).
  </p>
! <h3>Outlook</h3>
! <p>Sean True and Mark Hammond have developed an addin for Outlook (2000 and XP) that
  adds support for the spambayes classifier. 
  <h4>Requirements</h4>
  <ul>
! <li>Python 2.2 or later (2.2.2 recommended)
! <li>Outlook 2000 or Outlook XP (<b>not</b> Outlook Express)
  <li>Python's <a href="http://starship.python.net/crew/mhammond">win32com</a>
  extensions (win32all-149 or later - currently ActivePython is not suitable)
***************
*** 22,33 ****
  </p>
  <h4>Availability</h4>
! <p>At the moment, you'll need to use CVS to get the code - go <a href="http://sourceforge.net/cvs/?group_id=61702">to the CVS page</a> on the project's sourceforge site for more.</p>
  
  <h3>hammie.py</h3>
! <p>hammie is a command line tool for marking mail as ham or spam. Skip Montanaro has started a <a href="http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambayes/INTEGRATION.txt?rev=HEAD&content-type=text/plain">guide to integrating hammie with your mailer</a> (Unix-only instructions at the moment - additions welcome!). 
  Currently it focusses on running hammie via procmail. </p>
  <h4>Requirements</h4>
  <ul>
! <li>Python2.2 or later (2.2.2 recommended)
  <li>Currently documentation focusses on Unix.
  </ul>
--- 22,37 ----
  </p>
  <h4>Availability</h4>
! <p>Mark has packaged together <a href="http://starship.python.net/crew/mhammond/downloads/SpamBayes-Outlook-Setup-002.exe">an installer for the plugin</a>.
! You can download it from <a href="http://starship.python.net/crew/mhammond/spambayes/">his website</a>.
! This is currently at version 002.</p>
! <p>Alternatively, you can use CVS to get the code - go <a href="http://sourceforge.net/cvs/?group_id=61702">to the CVS page</a> on the project's sourceforge site for more.</p>
  
  <h3>hammie.py</h3>
! <p>hammie is a command line tool for marking mail as ham or spam. Skip Montanaro has started a
! <a href="http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambayes/INTEGRATION.txt?rev=HEAD&content-type=text/plain">guide to integrating hammie with your mailer</a> (Unix-only instructions at the moment - additions welcome!). 
  Currently it focusses on running hammie via procmail. </p>
  <h4>Requirements</h4>
  <ul>
! <li>Python 2.2 or later (2.2.2 recommended)
  <li>Currently documentation focusses on Unix.
  </ul>
***************
*** 38,41 ****
--- 42,58 ----
  <p>pop3proxy sits between your mail client and your real POP3 server and marks
  mail as ham or spam as it passes through. See the <a href="http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambayes/pop3proxy.py?rev=HEAD&content-type=text/plain">docstring at the top of pop3proxy.py</a> for more.
+ <h4>Requirements</h4>
+ <ul>
+ <li>Python2.2 or later (2.2.2 recommended)
+ <li>Should work on windows/unix/whatever... ?
+ </ul>
+ </p>
+ <h4>Availability</h4>
+ <p>At the moment, you'll need to use CVS to get the code - go <a href="http://sourceforge.net/cvs/?group_id=61702">to the CVS page</a> on the project's sourceforge site for more.</p>
+ 
+ <h3>imapfilter.py</h3>
+ <p>imap filter connects to your imap server and marks mail as ham or spam,
+ moving it to appropriate folders as it arrives.
+ See the <a href="http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambayes/pop3proxy.py?rev=HEAD&content-type=text/plain">docstring at the top of imapfilter.py</a> for more.
  <h4>Requirements</h4>
  <ul>

Index: background.ht
===================================================================
RCS file: /cvsroot/spambayes/website/background.ht,v
retrieving revision 1.13
retrieving revision 1.14
diff -C2 -d -r1.13 -r1.14
*** background.ht	22 Jan 2003 08:30:05 -0000	1.13
--- background.ht	17 Apr 2003 01:52:17 -0000	1.14
***************
*** 97,101 ****
  them wrong, but it's completely confident in its (wrong) score. (Note that the difference isn't as apparent as it could be - it's a logarithmic scale graph!)
  </p>
! <p class="todo">Add more here - cancellation disease, fudge factors, &c</p>
  
  <p>Gary Robinson's 
--- 97,101 ----
  them wrong, but it's completely confident in its (wrong) score. (Note that the difference isn't as apparent as it could be - it's a logarithmic scale graph!)
  </p>
! <!--Add more here - cancellation disease, fudge factors, &c-->
  
  <p>Gary Robinson's 
***************
*** 230,235 ****
  <p>There's a lot of background on what's been tried available from
  the mailing list archives. Initially, the discussion started on 
! the <a href="">python-dev</a> list, but then moved to the 
! <a href="">spambayes</a> list. 
  
  <ul>
--- 230,235 ----
  <p>There's a lot of background on what's been tried available from
  the mailing list archives. Initially, the discussion started on 
! the <a href="http://mail.python.org/pipermail/python-dev/">python-dev</a> list, but then moved to the 
! <a href="http://mail.python.org/pipermail-21/spambayes/">spambayes</a> list. 
  
  <ul>
***************
*** 251,255 ****
  
  <ul>
! <li>The older CVS repository via <a href="http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/python/python/nondist/sandbox/spambayes/?hideattic=0">view CVS</a>, or the <a href="presfchangelog.html">entire changelog</a>. Development here stopped on the 6th of September 2002.
  <li>After that, work moved to this project's <a href="http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/spambayes/spambayes/">CVS tree</a>
  </ul>
--- 251,256 ----
  
  <ul>
! <li>The older CVS repository via <a href="http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/python/python/nondist/sandbox/spambayes/?hideattic=0">view CVS</a>, or the <a href="presfchangelog.html">entire changelog</a>.
! Development here stopped on the 6th of September 2002.
  <li>After that, work moved to this project's <a href="http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/spambayes/spambayes/">CVS tree</a>
  </ul>

Index: developer.ht
===================================================================
RCS file: /cvsroot/spambayes/website/developer.ht,v
retrieving revision 1.7
retrieving revision 1.8
diff -C2 -d -r1.7 -r1.8
*** developer.ht	17 Jan 2003 07:34:44 -0000	1.7
--- developer.ht	17 Apr 2003 01:52:18 -0000	1.8
***************
*** 6,10 ****
  <p>So you want to get involved?</p>
  <h2>Running the code</h2>
! <p>This project works with either Python 2.2.2, Python2.3 (currently in alpha), 
  or on the bleeding edge of python code,
  available from <a href="http://sourceforge.net/cvs/?group_id=5470">CVS on
--- 6,10 ----
  <p>So you want to get involved?</p>
  <h2>Running the code</h2>
! <p>This project works with either Python 2.2.2, Python 2.3 (currently in alpha), 
  or on the bleeding edge of python code,
  available from <a href="http://sourceforge.net/cvs/?group_id=5470">CVS on
***************
*** 30,39 ****
  
  <h3>So what needs to be done</h3>
! <p>Currently (early November) work is now being focussed on finding 
! additional things that are beneficial to the tokenizer. The combining
! scheme is now pretty solid and pretty amazing. The other big body of
! work at the moment is producing something that's useful to end-users -
! actually building the applications and the code so that Tim's sister
! &lt;wink&gt; can use the system.</p>
  
  <h2>Collecting training data</h2>
--- 30,42 ----
  
  <h3>So what needs to be done</h3>
! <p>Currently (April) work is now being focused on improving deployment
! of the system (actually building the applications and the code so that
! Tim's sister &lt;wink&gt; can use the system), with the aim of eventually
! putting out a beta (or even full!) release.  There is still a certain
! amount of interest in finding additional things that are beneficial to
! the tokenizer, although most people are pretty happy with how it works
! at the moment. The combining scheme is now pretty solid and pretty amazing.
! The other big body of work is monitoring the bug reports and feature requests
! that come in and trying to resolve those.</p>
  
  <h2>Collecting training data</h2>

Index: docs.ht
===================================================================
RCS file: /cvsroot/spambayes/website/docs.ht,v
retrieving revision 1.8
retrieving revision 1.9
diff -C2 -d -r1.8 -r1.9
*** docs.ht	17 Jan 2003 06:26:50 -0000	1.8
--- docs.ht	17 Apr 2003 01:52:18 -0000	1.9
***************
*** 8,12 ****
  <li>The <a href="http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambayes/TESTING.txt?rev=HEAD&content-type=text/plain">TESTING.txt</a> file -- Clues about the practice of statistical testing, adapted from Tim
   comments on python-dev.
! <li>There's also a vast number of clues and notes scattered as block comments through the code.
  </ul>
  
--- 8,14 ----
  <li>The <a href="http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambayes/TESTING.txt?rev=HEAD&content-type=text/plain">TESTING.txt</a> file -- Clues about the practice of statistical testing, adapted from Tim
   comments on python-dev.
! <li>A rudimentary set of <a href="faq.html">Frequently Asked Questions (FAQ)</a>.</li>
! <li>Instructions on <a href="http://cvs.sourceforge.net/cgi-bin/viewcvs.cgi/*checkout*/spambayes/spambayes/INTEGRATION.txt?rev=HEAD&content-type=text/plain">integrating Spambayes</a> into your mail system.</li>
! <li>There're also a vast number of clues and notes scattered as block comments through the code.
  </ul>
  

Index: download.ht
===================================================================
RCS file: /cvsroot/spambayes/website/download.ht,v
retrieving revision 1.2
retrieving revision 1.3
diff -C2 -d -r1.2 -r1.3
*** download.ht	17 Jan 2003 17:00:52 -0000	1.2
--- download.ht	17 Apr 2003 01:52:18 -0000	1.3
***************
*** 4,9 ****
  
  <h2>Source Releases</h2>
! <p>The first pre-release of version 1.0 of the SpamBayes project is available.
! Download version 1.0a1 from the sourceforge <a href="https://sourceforge.net/project/showfiles.php?group_id=61702">Files</a> page as either a gzipped tarball or a zip file of the source files.
  </p>
  
--- 4,9 ----
  
  <h2>Source Releases</h2>
! <p>The second pre-release of version 1.0 of the SpamBayes project is available.
! Download version 1.0a2 from the sourceforge <a href="https://sourceforge.net/project/showfiles.php?group_id=61702">Files</a> page as either a gzipped tarball or a zip file of the source files.
  </p>
  
***************
*** 22,25 ****
--- 22,30 ----
  
  <h2>Binary Releases</h2>
+ <h3>Outlook Plugin</h3>
+ <p>Mark has packaged together <a href="http://starship.python.net/crew/mhammond/downloads/SpamBayes-Outlook-Setup-002.exe">an installer for the plugin</a>.
+ You can download it from <a href="http://starship.python.net/crew/mhammond/spambayes/">his website</a>.
+ This is currently at version 002.</p>
+ <h3>Other</h3>
  <p>None as yet.</p>
  
***************
*** 29,45 ****
  more details. 
  </p>
- <p><b>Update:</b></p>
- <p class="note">
- (2003-01-14 14:04:19 - Project CVS Services)   As of 2003-01-14,
- pserver-based CVS repository access and ViewCVS (web-based) CVS repository
- access have been taken offline as to stabilize CVS server performance for
- developers. These services will be re-enabled as soon as the underlying
- scalability issues have been analyzed and resolved (as soon as 2003-01-15,
- if possible). Additional updates will be posted to the Site Status page as
- they become available. Your patience is appreciated.
- </p>
- 
- <a name="snapshot"><h2>Nightly snapshots</h2></a>
- <p>A nightly snapshot is available: <a href="download/spambayes-nightly.tar.gz">spambayes-nightly.tar.gz</a>. </p>
- <p class="note">Note that due to some Sourceforge issues, this is currently being built with a "manual cron" (i.e. when I remember). Once Sourceforge's CVS issues are resolved, this will be available automatically.</p>
- 
--- 34,35 ----

Index: index.ht
===================================================================
RCS file: /cvsroot/spambayes/website/index.ht,v
retrieving revision 1.11
retrieving revision 1.12
diff -C2 -d -r1.11 -r1.12
*** index.ht	20 Jan 2003 20:32:52 -0000	1.11
--- index.ht	17 Apr 2003 01:52:18 -0000	1.12
***************
*** 5,10 ****
  
  <h2>News</h2>
! <p>First pre-release available. See the <a href="download.html">download</a> page for more.</p>
! <H2>What is SpamBayes?</h2>
  <p>
  The SpamBayes 
--- 5,10 ----
  
  <h2>News</h2>
! <p>Second pre-release available. See the <a href="download.html">download</a> page for more.</p>
! <h2>What is SpamBayes?</h2>
  <p>
  The SpamBayes 
***************
*** 92,96 ****
  <p>The code is currently available from a variety of methods from the
  <a href="download.html">downloads</a> page. The current release is
! 1.0 prerelease 1.
  </p>
  
--- 92,96 ----
  <p>The code is currently available from a variety of methods from the
  <a href="download.html">downloads</a> page. The current release is
! 1.0 alpha 2.
  </p>
  

Index: links.h
===================================================================
RCS file: /cvsroot/spambayes/website/links.h,v
retrieving revision 1.5
retrieving revision 1.6
diff -C2 -d -r1.5 -r1.6
*** links.h	17 Jan 2003 07:34:44 -0000	1.5
--- links.h	17 Apr 2003 01:52:18 -0000	1.6
***************
*** 8,11 ****
  <h3>Getting the code</h3>
  <li><a href="download.html">Releases</a>
- <li><a href="download.html#snapshot">Snapshots</a>
  <li><a href="download.html#cvs">CVS access</a>
--- 8,10 ----





More information about the Spambayes-checkins mailing list