[Trisquel-users] Re : Find the instances of each of a list of strings and print each set in a separate file

lcerf Sun, 19 Jul 2020 12:56:01 -0700

I executed that, out of curiosity. It essentially does the same as thecommand line I have given since the beginning of this thread:$ sort -u IPv4.May2020.37.nMapoG.txt | awk 'FILENAME == ARGV[1] { a[$1] = $2} FILENAME == ARGV[2] && $1 in a { print $2 >> "out/" a[$1] "," $1 }'PTRList.txt -


"Essentially" because:

Your solution outputs one file per PTR in PTR-files, which always containsone single line: the PTR (which is also in the file name); what is the point?The files output in CountsFiles contain duplicates, e.g., "low.lowe001.net96.125.160.252" is twice in CountsFiles/low.lowe001.net.2.txt; in youroriginal post, you used 'sort -u IPv4.May2020.37.nMapoG.txt' to removeduplicates: that is why I did the same in my solution;Every line has two fields, but the first one is always the same PTR (which isalso in the file name); what is the point?

Executing it takes 0.3s on my system, against 0.01s for mine;

'ls -v' would not sort the files in CountsFiles by "number of instances"; inyour original post, your use of 'sort -nrk 2' (the argument should have been2,2) suggests you want to sort by "number of instances"; that is why thenames of the output files in my solution start with the "number ofinstances".

Now, if you want the duplicates, if you insist on the names you chose and ifyou really want the repeated PTR in a first field (what only looks like awaste of disk space), it is trivial to adapt my solution:awk 'FILENAME == ARGV[1] { a[$1] = $2 } FILENAME == ARGV[2] && $1 in a {print $1, $2 >> "CountsFiles/" $1 "." a[$1] ".txt" }' PTRList.txtIPv4.May2020.37.nMapoG.txt

One single program. Against more than a dozen to create your Script.* filesthat then execute 252 other commands (more generally: 6 times the number ofPTRs).

[Trisquel-users] Re : Find the instances of each of a list of strings and print each set in a separate file

Reply via email to