number of different lines in a file

Grant Edwards grante at
Fri May 19 21:04:02 CEST 2006

On 2006-05-19, Paul McGuire <ptmcg at> wrote:

>> If the log has a lot of repeated lines in its original state then
>> running uniq twice, once up front to reduce what needs to be sorted,
>> might be quicker?
>>  uniq log_file | sort| uniq | wc -l
>> - Pad.
> Why would the second running of uniq remove any additional lines that
> weren't removed in the first pass?

Because uniq only removes _adjacent_ identical lines.

> For that matter, if this is a log file, wont every line have a timestamp,
> making duplicates extremely unlikely?


Grant Edwards                   grante             Yow!  If our behavior is
                                  at               strict, we do not need fun!

More information about the Python-list mailing list