[lxml-dev] Using Regex to search CSS selectors?
I am switching my code from BeautifulSoup to lxml for HTML parsing, but there is one area of my code that I was hoping someone could help with. BeautifulSoup allows compiled re patterns as arguments when searching for nodes, and I was hoping that lxml would allow something similar. Basically, I need to find all tags whose class name matches the regular expression r'post_?\d+', and all tags whose id matches the regular expression r'(user|profile)_\d+'. Thanks
Kyle Hanson, 18.06.2010 21:42:
I am switching my code from BeautifulSoup to lxml for HTML parsing, but there is one area of my code that I was hoping someone could help with.
BeautifulSoup allows compiled re patterns as arguments when searching for nodes, and I was hoping that lxml would allow something similar. Basically, I need to find all tags whose class name matches the regular expression r'post_?\d+', and all tags whose id matches the regular expression r'(user|profile)_\d+'.
The CSS selectors don't support this, but XPath does. Just use the EXSLT namespace for regular expressions.

http://codespeak.net/lxml/xpathxslt.html#the-xpath-class

Note that you can also use the cssselect module manually to convert a given CSS selector to an equivalent XPath expression.

Stefan
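
For readers of the archive, the EXSLT approach suggested above might look roughly like the following. This is only a sketch: the sample HTML, the variable names, and the anchoring added around the two patterns (so that they match a whole class token or a whole id rather than any substring) are assumptions for illustration, not part of the original question or reply.

```python
from lxml import etree, html

# Namespace URI of the EXSLT regular-expressions extension.
RE_NS = "http://exslt.org/regular-expressions"

# Hypothetical sample markup, made up for this illustration.
doc = html.fromstring("""
<div>
  <div class="post_12">first</div>
  <div class="post7 extra">second</div>
  <div class="poster">no match</div>
  <span id="user_42">u</span>
  <span id="profile_7">p</span>
  <span id="admin_1">a</span>
</div>
""")

# re:test() searches anywhere in the string, so the class pattern is
# wrapped in (^|\s)...(\s|$) to match one whole class token.
find_posts = etree.XPath(
    r"//*[re:test(@class, '(^|\s)post_?\d+(\s|$)')]",
    namespaces={"re": RE_NS},
)

# The id pattern is anchored with ^...$ to match the entire id value.
find_users = etree.XPath(
    r"//*[re:test(@id, '^(user|profile)_\d+$')]",
    namespaces={"re": RE_NS},
)

posts = find_posts(doc)
users = find_users(doc)

print([e.text for e in posts])       # ['first', 'second']
print([e.get("id") for e in users])  # ['user_42', 'profile_7']
```

Compiling the expressions once with etree.XPath and reusing them is a design choice here, in the spirit of compiled re patterns; passing the same expression and namespaces mapping to an element's xpath() method should work as well.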