[spambayes-bugs] [ spambayes-Bugs-1166146 ] Tokenizer fails on bad URL
SourceForge.net
noreply at sourceforge.net
Fri Mar 18 19:27:13 CET 2005
Bugs item #1166146, was opened at 2005-03-18 10:27
Message generated for change (Tracker Item Submitted) made by Item Submitter
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=1166146&group_id=61702
Category: hammie
Group: None
Status: Open
Resolution: None
Priority: 5
Submitted By: Leonid (leobru)
Assigned to: Nobody/Anonymous (nobody)
Summary: Tokenizer fails on bad URL
Initial Comment:
The following line in the body of a message being scored or trained
    http://)
causes spambayes to die. The potentially relevant spambayesrc settings are
[Tokenizer]
x-fancy_url_recognition=True
x-pick_apart_urls=True
[URLRetriever]
x-slurp_urls=True
The relevant backtrace is
  File "/usr/home/leob/spambayes-2/spambayes/classifier.py", line 374, in _add_msg
    for word in Set(wordstream):
  File "/usr/local/lib/python2.3/sets.py", line 429, in __init__
    self._update(iterable)
  File "/usr/local/lib/python2.3/sets.py", line 383, in _update
    for element in iterable:
  File "/usr/home/leob/spambayes-2/spambayes/classifier.py", line 762, in _add_slurped
    slurped_tokens = self._generate_slurp()
  File "/usr/home/leob/spambayes-2/spambayes/classifier.py", line 556, in _generate_slurp
    tokens = self.slurp(*slurp_wordstream)
  File "/usr/home/leob/spambayes-2/spambayes/classifier.py", line 663, in slurp
    domain = mo.group(1)
AttributeError: 'NoneType' object has no attribute 'group'
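
The last frame suggests that a regular-expression match returned None for the malformed URL and that .group(1) was then called on it without a check. The sketch below reproduces that failure mode in isolation and shows one possible guard. The pattern DOMAIN_RE and the helper extract_domain are hypothetical names invented for illustration; the actual expression used in classifier.py is not shown in this report and may differ.

```python
import re

# Hypothetical pattern (an assumption, not the one in classifier.py):
# capture the host part that follows the scheme.
DOMAIN_RE = re.compile(r"^https?://([A-Za-z0-9.-]+)")


def extract_domain(url):
    """Return the host part of url, or None if no host can be found."""
    mo = DOMAIN_RE.match(url)
    if mo is None:
        # A string such as "http://)" has no host, so there is no match.
        # Calling mo.group(1) here would raise the AttributeError above.
        return None
    return mo.group(1)


if __name__ == "__main__":
    for candidate in ("http://)", "http://example.com/path"):
        print(repr(candidate), "->", extract_domain(candidate))
```

With a guard of this kind, a caller could skip URLs that yield None instead of aborting the whole scoring or training run.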