Better Regex and exception handling for this small code
MRAB
python at mrabarnett.plus.com
Wed Jul 12 20:49:31 EDT 2017
On 2017-07-12 23:49, Nick Mellor wrote:
> On Wednesday, 12 July 2017 02:32:29 UTC+10, Ganesh Pal wrote:
>> Dear Python friends
>>
>> I am trying to open a file and check if there is a pattern has changed
>> after the task got completed?
>>
>> file data:
>> ........................................................
>>
>> #tail -f /file.txt
>> ..........................................
>> Note: CRC:algo = 2, split_crc = 1, unused = 0, initiator_crc = b6b20a65,
>> journal_crc = d2097b00
>> Note: Task completed successfully.
>> Note: CRC:algo = 2, split_crc = 1, unused = 0, initiator_crc = d976d35e,
>> journal_crc = a176af10
>>
>>
>> I have the below piece of code but would like to make this better more
>> pythonic , I found regex pattern and exception handling poor here , any
>> quick suggestion in your spare time is welcome.
>>
>>
>> #open the existing file if the flag is set and check if there is a match
>>
>> log_file='/file.txt'
>> flag_is_on=1
>>
>> data = None
>> with open(log_file, 'r') as f:
>> data = f.readlines()
>>
>>
>> if flag_is_on:
>> logdata = '\n'.join(data)
>> reg = "initiator_crc =(?P<ini_crc>[\s\S]*?), journal_crc"
>> crc = re.findall(re.compile(reg), logdata)
>> if not crc:
>> raise Exception("Pattern not found in logfile")
>>
>> checksumbefore = crc[0].strip()
>> checksumafter = crc[1].strip()
>> logging.info("checksumbefore :%s and checksumafter:%s"
>> % (checksumbefore, checksumafter))
>>
>> if checksumbefore == checksumafter:
>> raise Exception("checksum not macthing")
>>
>> I am on Linux and Python 2.7
>>
>> Regards,
>> Ganesh
>
> There's not much need to compile regexes unless you've got *a lot* of them in your code. The first ones are automatically compiled and cached:
>
> https://stackoverflow.com/questions/452104/is-it-worth-using-pythons-re-compile
>
I think this is the first time that I've seen someone pass a compiled
pattern into re.findall.
The usual way is to pass the pattern as a string:
crc = re.findall(reg, logdata)
If you have a lot of them, or it's in a loop that'll iterate many times,
it'll be quicker if you compile it first (outside the loop):
pattern = re.compile(reg)
and then use the compiled pattern's .findall method:
crc = pattern.findall(logdata)
More information about the Python-list
mailing list