fileinput
Pascal
patatetom at gmail.com
Fri Oct 25 16:12:23 EDT 2019
I have a small python (3.7.4) script that should open a log file and
display its content but as you can see, an encoding error occurs :
-----------------------
import fileinput
import sys
try:
source = sys.argv[1:]
except IndexError:
source = None
for line in fileinput.input(source):
print(line.strip())
-----------------------
python3.7.4 myscript.py myfile.log
Traceback (most recent call last):
...
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe8 in position 799:
invalid continuation byte
python3.7.4 myscript.py < myfile.log
Traceback (most recent call last):
...
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe8 in position 799:
invalid continuation byte
-----------------------
I add the encoding hook to overcome the error but this time, the script
reacts differently depending on the input used :
-----------------------
import fileinput
import sys
try:
source = sys.argv[1:]
except IndexError:
source = None
for line in fileinput.input(source,
openhook=fileinput.hook_encoded("utf-8", "ignore")):
print(line.strip())
-----------------------
python3.7.4 myscript.py myfile.log
first line of myfile.log
...
last line of myfile.log
python3.7.4 myscript.py < myfile.log
Traceback (most recent call last):
...
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe8 in position 799:
invalid continuation byte
python3.7.4 myscript.py /dev/stdin < myfile.log
first line of myfile.log
...
last line of myfile.log
python3.7.4 myscript.py - < myfile.log
Traceback (most recent call last):
...
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe8 in position 799:
invalid continuation byte
-----------------------
does anyone have an explanation and/or solution ?
More information about the Python-list
mailing list