Paul - thanks for the input, it's interesting to see how pyparser handles it. Anyhow, a simple regex took care of the issue in BS: for i in soup.findAll(re.compile('^p|^div'),align=re.compile('^center| ^left')): print i Thanks again! T