Finding repeated lines
Posted: Sat Feb 16, 2002 1:42 pm
Hi. I need to find and eliminate duplicate lines in a file with more than 5000 lines. I know this can be partly achieved with the "sort" + "eliminate duplicate lines" commands. However, what I need to do is slightly different.
Wherever there is a duplicate line I need to eliminate both the duplicate AND the original line. Since the file is very large and the strings are quite long, doing this by hand is rather tedious.
I've tried to understand the RE syntax for "find and replace" but could not come up with a solution. Can anybody help?
Henrique Serra
serra@cpd.ufmt.br
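
For what it's worth, here is one possible way to get the behaviour described above (drop every line that occurs more than once, keeping only lines that appear exactly once). It is a minimal Python sketch rather than an RE "find and replace" solution. It assumes that "duplicate" means an exact, character-for-character match and that the file fits in memory; the function name drop_repeated_lines and the file names are made up for illustration.

```python
from collections import Counter


def drop_repeated_lines(lines):
    """Return only the lines that occur exactly once, in their original order."""
    counts = Counter(lines)  # how many times each distinct line appears
    return [line for line in lines if counts[line] == 1]


def filter_file(src_path, dst_path):
    """Hypothetical helper: read src_path, write the non-repeated lines to dst_path."""
    with open(src_path, encoding="utf-8") as src:
        lines = src.read().splitlines()
    with open(dst_path, "w", encoding="utf-8") as dst:
        for line in drop_repeated_lines(lines):
            dst.write(line + "\n")


if __name__ == "__main__":
    sample = ["alpha", "beta", "alpha", "gamma"]
    print(drop_repeated_lines(sample))  # ['beta', 'gamma']
```

Counting first and filtering second means the duplicates do not have to be adjacent, so no sort is needed and the surviving lines keep their original order. If the order does not matter, the standard Unix tools can do the same job: sort the file and pipe it through uniq -u, which prints only the lines that are not repeated.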