Mailman 3 replace nbsp with space - lxml - The Python XML Toolkit

Aug. 22, 2019


I use the following code to replace non-breaking spaces (nbsp, U+00A0)
with regular spaces. Is this the best way to do so in lxml? Thanks.

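import sys

from lxml import html

# Python 3: read raw bytes from stdin and let the parser decode them as UTF-8.
doc = html.parse(sys.stdin.buffer, parser=html.HTMLParser(encoding='utf-8'))
for x in doc.iter():
    if x.text is not None:
        x.text = x.text.replace('\xa0', ' ')
    if x.tail is not None:
        x.tail = x.tail.replace('\xa0', ' ')
# tostring() returns bytes when given an encoding, so write to the binary stream.
sys.stdout.buffer.write(html.tostring(doc, encoding='utf-8'))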

-- 
Regards,
Peng
