Is it possible to make a regular expression that will match: '<html blah>' or '<html>' without having to make it into two complete expressions seperated by a pipe: r'<html[ \t].+?>|<html>' I want it to require a space or tab and at least one character before the closing bracket, after 'html', or just the closing bracket. Scott