[spambayes-bugs] [ spambayes-Bugs-1187208 ] import into CDB chokes
on 8-bit chars
SourceForge.net
noreply at sourceforge.net
Fri Apr 22 06:08:42 CEST 2005
Bugs item #1187208, was opened at 2005-04-21 20:45
Message generated for change (Comment added) made by anadelonbrin
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=1187208&group_id=61702
>Category: None
Group: 1.1.x
>Status: Closed
>Resolution: Fixed
Priority: 5
Submitted By: Leonid (leobru)
>Assigned to: Tony Meyer (anadelonbrin)
Summary: import into CDB chokes on 8-bit chars
Initial Comment:
If the CSV file contains an iso-8859-1 character,
import into CDB fails:
file csv (2 lines, save as iso-8859-1):
1,1
fiancée,1,1
sb_dbexpimp.py -i -o
Storage:persistent_use_database:cdb -o
Storage:persistent_storage_file:cdb -v -f csv
Importing file csv into database /.../cdb
Storing database, please be patient. Even moderately sized
databases may take a very long time to store.
Traceback (most recent call last):
File "./sb_dbexpimp.py", line 248, in ?
runImport(dbFN, useDBM, newDBM, flatFN)
File "./sb_dbexpimp.py", line 200, in runImport
bayes.store()
File
"/usr/home/leob/spambayes-1.1a1/scripts/spambayes/storage.py",
line 649, in store
cdb.cdb_make(db, items)
File
"/usr/home/leob/spambayes-1.1a1/scripts/spambayes/cdb.py",
line 166, in cdb_make
outfile.write(key)
UnicodeEncodeError: 'ascii' codec can't encode
character u'\xe9' in position 5: ordinal not in range(128)
----------------------------------------------------------------------
>Comment By: Tony Meyer (anadelonbrin)
Date: 2005-04-22 16:08
Message:
Logged In: YES
user_id=552329
Thanks. Fixed in storage.py r1.51
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=498103&aid=1187208&group_id=61702
More information about the Spambayes-bugs
mailing list