[lxml-dev] HTML parser is in the trunk! Ready for 1.0 ?
data:image/s3,"s3://crabby-images/c6057/c6057bed8007c428c0e26b11fb68644c69f16b19" alt=""
Hi all, I just merged the HTML parser branch into the trunk. Paul reported that the latest branch version compiled cleanly on Mac-OS X Tiger (libxml 2.6.16) - and it even passed all tests there, including those on broken HTML. Newer versions of both libxml2 and libxslt are recommended, though. Another recent update on the trunk is the support for xml:id, which is currently available through an XMLDTDID function (XMLID was already in use by ET and is compatible in lxml). The new functionality is now directly based on the libxml2 ID hash table provided by the parser. This means that lxml now supports dictionary-like access to elements having an "xml:id" attribute or DTD-REF attributes. I think it is now the time to fix features for lxml 1.0. Expect it to be released next month (hopefully after Pyrex 0.9.4.1). If you think that lxml still misses something that should be in 1.0 or if you know about any remaining (or new) bugs, report back to the list. Please start a separate thread in that case instead of replying to this mail. Martijn and I are happy about any comment that helps us get lxml better. Have fun, Stefan
participants (1)
-
Stefan Behnel