I don't know how awk (or this particular implementation) works, but it could be ...

michaelmior · on May 29, 2019

Sure, you only need to compare when there's a hash collision, but you still need to keep all the lines in memory for later comparison.
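To make the memory point concrete, here is a minimal Python sketch of hash-based deduplication. It is an illustration under assumptions, not awk's actual implementation, and `dedupe` is a hypothetical name. The set stores the full text of every distinct line, because the stored text is what gets compared when two hashes match.

```python
def dedupe(lines):
    """Yield each distinct line once, in first-seen order."""
    seen = set()  # keeps the full text of every distinct line in memory
    for line in lines:
        # Membership test: hash lookup first, string comparison only
        # against entries whose hash matches.
        if line not in seen:
            seen.add(line)
            yield line


if __name__ == "__main__":
    for line in dedupe(["a", "b", "a", "c", "b"]):
        print(line)
```

Memory use therefore grows with the number of distinct lines, not with the total number of lines read.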

mannykannot · on May 29, 2019

Sure (though they could be in a compressed form, such as a suffix tree), but that wasn't the issue I was addressing.

chasil · on May 29, 2019

AWK was the first "scripting" language to implement associative arrays, which its authors say they took from SNOBOL4.

Since then, Perl and PHP have also implemented associative arrays. All three can loop over the text keys of such an array and produce the original strings, which a (non-bijective) hash alone cannot do.
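A rough Python sketch of that difference, using a dict as the associative array. The function names `count_by_text` and `count_by_hash` are hypothetical, and this says nothing about how any awk implementation lays out its tables. Keying by the text lets you read the original strings back; keying by the hash value alone leaves only integers.

```python
def count_by_text(lines):
    """Associative array keyed by the line text itself."""
    counts = {}
    for line in lines:
        counts[line] = counts.get(line, 0) + 1
    return counts


def count_by_hash(lines):
    """Table keyed only by each line's hash value; the text is discarded."""
    counts = {}
    for line in lines:
        h = hash(line)
        counts[h] = counts.get(h, 0) + 1
    return counts


if __name__ == "__main__":
    data = ["foo", "bar", "foo"]
    # Looping over the keys gives back the original strings.
    for text, n in count_by_text(data).items():
        print(text, n)
    # Here the keys are integers; the strings cannot be read back from them.
    for h, n in count_by_hash(data).items():
        print(h, n)
```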