<div dir="ltr"><p class="MsoNormal" style="margin:0pt 0pt 0.0001pt;text-indent:0pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;font-family:Calibri"><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)"></span></p><p class="MsoNormal" style="margin:0pt 0pt 0.0001pt;text-indent:0pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;font-family:Calibri"><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)">Regular expressions</span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)"></span></p><p class="MsoNormal" style="margin:0pt 0pt 0.0001pt;text-indent:0pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;font-family:Calibri"><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)"> </span></p><p class="MsoNormal" style="margin:0pt 0pt 0.0001pt;text-indent:0pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial;font-family:Calibri"><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)">*?</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial">, </span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)">+?</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial">, </span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)">??</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt"></span></p><p class="gmail-p" align="justify" style="margin:0pt 0pt 5pt;text-align:justify;line-height:16.8pt;font-family:"Times New Roman";font-size:12pt"><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial">The </span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)">'*'</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial">, </span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)">'+'</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial">, and </span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)">'?'</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"> qualifiers are all </span><span class="gmail-16" style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial">greedy</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial">; they match as much text as possible. Sometimes this behaviour isn’t desired; if the RE </span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)"><.*></span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"> is matched against </span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)">'<a></span><span class="gmail-15" style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)"> </span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)">b</span><span class="gmail-15" style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)"> </span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)"><c>'</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial">, it will match the entire string, and not just </span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)">'<a>'</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial">. Adding </span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)">?</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"> after the qualifier makes it perform the match in </span><span class="gmail-16" style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial">non-greedy</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"> or </span><span class="gmail-16" style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial">minimal</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"> fashion; as </span><span class="gmail-16" style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial">few</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"> characters as possible will be matched. Using the RE </span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)"><.*?></span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial"> will match only </span><span style="font-family:sans-serif;letter-spacing:0pt;font-size:11.5pt;background:rgb(236,240,243)">'<a>'</span><span style="font-family:Arial;letter-spacing:0pt;font-size:12pt;background-image:initial;background-position:initial;background-size:initial;background-repeat:initial;background-origin:initial;background-clip:initial">.</span><span style="font-size:12pt"></span></p><p class="MsoNormal" style="margin:0pt 0pt 0.0001pt;font-family:Calibri">It says that the RE <.*?> matches only <a> but it also matches <c></p><p class="MsoNormal" style="margin:0pt 0pt 0.0001pt;font-family:Calibri"><br></p><p class="MsoNormal" style="margin:0pt 0pt 0.0001pt;font-family:Calibri"><br></p><p class="MsoNormal" style="margin:0pt 0pt 0.0001pt;font-family:Calibri"><br></p><p class="MsoNormal" style="margin:0pt 0pt 0.0001pt;font-family:Calibri"> I am also attaching the pdf document with the same content </p></div>