[lxml] Parsing HTML files with HTML entities