At present it is necessary to use both sort and uniq if you want a
tabulated list of unique items. As an example, I needed to do so for an
error log to determine what errors were most frequent:
sort .xsession-errors | uniq --count | sort -n
If I were interested only in the unique lines without counts I could
have used sort -u, but that does not give me any feel for how frequent a
line is in the file. The above works well as long as the file is not
too large, but obviously takes a lot of both time and temporary space if
If sort could count duplicates
while keeping only the unique lines there would be only one pass
through the file and no excessive temporary space would be needed.
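In the meantime a single-pass count is possible with awk, at the cost
of one in-memory entry per distinct line; this is just a sketch of a
workaround, not a substitute for having sort do it:
awk '{ c[$0]++ } END { for (l in c) print c[l], l }' .xsession-errors | sort -n
Unlike sort this writes no temporary files, but its memory use grows
with the number of distinct lines rather than staying bounded.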
My incentive for processing this particular log file was some sort of
runaway error logging that actually filled the partition the log file
was on, at which point the file was about 7G of data. I wound up using
tail to take the last million lines, getting the most frequent errors
from that, and assuming that this was typical of the whole file, but of
course it would have been nice to know for certain.
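The stopgap was roughly:
tail -n 1000000 .xsession-errors | sort | uniq --count | sort -n
which at least kept the data down to a size sort could cope with.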
Dave