split and regexp on textfile

Flyzone flyzone at technologist.com
Fri Apr 13 03:59:52 EDT 2007


Hi,
I have a problem with the split function and regexps.
I have a file that I want to split, using the date as the delimiter.
Here is a sample:
-----
Mon Apr  9 22:30:18 2007
text
text
Mon Apr  9 22:31:10 2007
text
text
-----

I'm trying to put all the lines into one string and then split it
(it would be better not to delete the \n, if possible...):
  while 1:
     line = ftoparse.readline()
     if not line: break
     if line[-1]=='\n': line=line[:-1]
     file_str += line
  matchobj=re.compile('[A-Z][a-z][a-z][ ][A-Z][a-z][a-z][ ][0-9| ][0-9][ ][0-9][0-9][:]')
  matchobj=matchobj.split(file_str)
  print matchobj

I have also tried
   matchobj=re.split(r"^[A-Z][a-z][a-z][ ][A-Z][a-z][a-z][ ][0-9| ][0-9][ ][0-9][0-9][:]",file_str)
and reading everything in one go with:
   file_str=ftoparse.readlines()
but the split doesn't work... Where am I going wrong?
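
For reference, here is a minimal sketch of one way this kind of split could be done. It is written for Python 3 and is not the code from the attempts above. The names SAMPLE, DATE_LINE and split_by_date are made up for illustration, and the pattern assumes every date line looks exactly like the two in the sample. It relies on three facts about the re module: "^" only matches at the start of each line when re.MULTILINE is set, a "|" inside a character class is a literal pipe rather than an alternation, and re.split() needs a single string (as returned by read()), not the list that readlines() returns.

```python
import re

# Sample text in the same shape as the excerpt in the post (assumed format).
SAMPLE = (
    "Mon Apr  9 22:30:18 2007\n"
    "text\n"
    "text\n"
    "Mon Apr  9 22:31:10 2007\n"
    "text\n"
    "text\n"
)

# Matches one whole date line such as "Mon Apr  9 22:30:18 2007".
# re.MULTILINE makes ^ and $ match at every line boundary instead of only
# at the start and end of the whole string. The trailing \n? consumes the
# newline that ends the date line, so it does not leak into the body.
DATE_LINE = re.compile(
    r"^([A-Z][a-z]{2} [A-Z][a-z]{2} [ \d]\d \d{2}:\d{2}:\d{2} \d{4})$\n?",
    re.MULTILINE,
)


def split_by_date(text):
    """Return a list of (date, body) pairs; each body keeps its newlines."""
    parts = DATE_LINE.split(text)
    # parts[0] is whatever precedes the first date line (often empty).
    # After that, dates and bodies alternate, because the capture group
    # makes split() keep the matched date in the result.
    return list(zip(parts[1::2], parts[2::2]))


records = split_by_date(SAMPLE)
for date, body in records:
    print(repr(date), repr(body))
```

With a real file, the text would come from something like ftoparse.read(), which keeps the "\n" characters, so there is no need to strip them line by line first.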



