[Tutor] thesaurus

Dave Angel davea at ieee.org
Sat Jul 11 23:38:04 CEST 2009


Pete Froslie wrote:
> thanks for the advice Alan.. I am wondering about the following:
>
> new_word = response3[2]
> old_word = response[word_number]
>
> #this works but adds carriage returns*********
> for line in fileinput.FileInput("journey_test.txt",inplace=1):
>    line = line.replace(old_word, new_word)
>    print line
>
> It seems that if I put it in at the difficulty location it works, but it
> also adds carriage returns to the entire text file. Basically it ends up
> double spacing a single-spaced document.  I'm also not sure if this is
> causing the trouble you suggested against (writing and reading at the same
> time).
>
> cheers
>
> On Sat, Jul 11, 2009 at 3:38 PM, ALAN GAULD <alan.gauld at btinternet.com>wrote:
>
>   
>>> I am having trouble with probably the most simple part:
>>> I cannot seem to go back into the 'txt' file and replace the word I just
>>> searched with the new word!
>>>       
>> Its not a good idea to try to read and write to the same file at the same
>> time. The normal approach is to weither ead the file into memory and
>> process it then write it back to the file or to open a second file and
>> write to that then copy the second file over the original.
>>
>>     
>>> One with re.sub, which I can't seem to get to work
>>>       
>> re.sub works on a text string it doesn't affect the file.
>>
>> read the content into a string, close the input file.
>> Use re.sub to make the changes (or even just the replace
>> method of strings) then write the changed string back out
>> to the file.
>>
>> HTH,
>>
>> Alan G
>>
>>     
fileinput.FileInput() finesses the problem of updating "in place" very 
nicely.  But it also replaces STDOUT, which can make it harder to debug 
your code, because if you put other print statements there, they'll go 
to the file, by default.  Anyway, here's your answers to the doublespacing:

If you put a trailing comma on the print statement, it won't append an 
extra newline, so you won't end up doublespacing.
Or you could rstrip() the 'line' variable, to remove the newline that's 
there before print.
Or you could write to stdout using write() instead of print.

I'd like to know if there's a reason you're putting this code all 
inline.  By breaking it into functions, you'd have a chance to work on 
the different aspects of it independently.  That also might mean that 
you could change different aspects of it independently, or even 
parameterize it from a command line to do slightly different things.

For example, what if you'd like to substitute more than one word for its 
synomym?  Do you really want to pass through the entire file each time?  
Or what if you want to get your synomyms from somewhere else, or from 
multiple places?

DaveA



More information about the Tutor mailing list