Remember the Delta Process from Calculus 1.01 ?
https://www.brighthubeducation.com/homework-math-help/108376-finding-the-derivative-of-a-function-in-calculus/

That's where I am in Scripting 1.01 ...

Back to the problem at hand.

Step (1) selected the IPv6 addresses of the Type A & Type B rows in the cleansed File01.txt:
awk '{print $2}' 'File01.txt' | sort | uniq -c > TempTQ01.txt ;
awk '{print $2, $1}' 'TempTQ01.txt' | sort -nrk 2 > TempTQ02.txt

Step (2) selects and lists all the Type B entries in File01.txt (SS.IPv6-HN-GLU-MB-Domains-January2020-All.ods.txt):
awk '{print $1}' 'TempTQ02.txt' | sort > TempTQ10.txt ;
awk '{print $1}' 'TempTQ10.txt' - | grep - File01.txt | awk '{print $1,$2,$3,$4}' '-' > TempTQ11.txt

Never mind simplicity or efficiency; it took 0.006 second CPU time and 0.032 second real time. It did reveal a number of Type C rows that I had missed in my visual inspection ==> TempTQ13.txt

Next step: For each row in TempTQ11.txt, print $2,$3,$4 to cache, find $1 in File10.txt's $2 column, and print that $2 to Column $1 along with cache's $2,$3,$4 into a new file ...

Step (3) matches the Keys in Col.$2 of the Type A rows with the data in Col's $2,$3 & $4 of Type B rows: join -a 2 -1 1 -2 2

Reply via email to