Scanning a file character by character

Duncan Booth duncan.booth at invalid.invalid
Tue Feb 10 07:06:06 EST 2009


Steven D'Aprano <steven at REMOVE.THIS.cybersource.com.au> wrote:

> On Mon, 09 Feb 2009 19:10:28 -0800, Spacebar265 wrote:
> 
>> How would I do separate lines into words without scanning one character
>> at a time?
> 
> Scan a line at a time, then split each line into words.
> 
> 
> for line in open('myfile.txt'):
>     words = line.split()
> 
> 
> should work for a particularly simple-minded idea of words.
> 
Or for a slightly less simple minded splitting you could try re.split:

>>> re.split("(\w+)", "The quick brown fox jumps, and falls over.")[1::2]
['The', 'quick', 'brown', 'fox', 'jumps', 'and', 'falls', 'over']


-- 
Duncan Booth http://kupuguy.blogspot.com



More information about the Python-list mailing list