[Mailman-Users] Chinese characters spam filter?
Yasuhito FUTATSUKI
futatuki at poem.co.jp
Sat Jul 9 09:07:48 EDT 2016
On 07/09/16 10:04, I wrote:
> How about using 'backslashreplace' instead of 'replace' to encode to
> list's preferred language in Mailman/Handlers/SpamDetect.py ?
>
> Then, desirable pattern in this case seems to be
>
> ~Subject.*(\\u[0-9a-f]{4}){4}
^Subject:.*(\\u[0-9a-f]{4,5}){4}
(CJK Unified Ideographs Extension uses code point scope >= u+20000)
Of course, you can specify code point scope more pricisely, not
to match other than Chinese characters.
--
Yasuhito FUTATSUKI <futatuki at poem.co.jp>
More information about the Mailman-Users
mailing list