Filter a text file to get unique entries based on the value in column 3

Question

Filter a text file to get unique entries based on the value in column 3

I know a little bash, but I have to deal with the problem of filtering the file. I will explain an example:

For a text file such as the following (file1):

10.10.12 bib24 Avenger goodone 10.10.12 bib21 The_Dark_Knight_Rises betterone 10.10.12 bib53 Avenger goodone 10.10.12 bib35 Ice_Age wow 11.10.12 bib53 TheAmazingSpiderMan nice 11.10.12 bib54 TheAmazingSpiderMan nice 11.10.12 bib01 Avenger goodone 12.10.12 bib29 Avenger goodone 12.10.12 bib11 TheAmazingSpiderMan nice 12.10.12 bib03 Ice_Age wow 12.10.12 bib98 Ice_Age wow 14.10.12 bib12 Ice_Age wow

This is the result I want (file2):

 10.10.12 bib24 Avenger goodone 10.10.12 bib21 The_Dark_Knight_Rises betterone 10.10.12 bib35 Ice_Age wow 11.10.12 bib53 TheAmazingSpiderMan nice

So my question is: which command to use to get this result (file2)? (i.e. the first entry into the movie, not taking into account columns / fields 1, 2, and 4).

I hope this is clear enough.

+4

sorting bash filter

minutemaid Oct 18 '12 at 13:35

source share

3 answers

Try to do:

 sort -u -k3 file.txt

Output

 10.10.12 bib24 Avenger goodone 10.10.12 bib35 Ice_Age wow 11.10.12 bib53 TheAmazingSpiderMan nice 10.10.12 bib21 The_Dark_Knight_Rises betterone

+4

Gilles quenot Oct 18 '12 at 13:43

source share

For rusty csh users:

Use this:

 awk '{c[$3]++} {if (c[$3] == 1) print $0}' file.txt

Because with the original answer there will be an error “event not found” (it can also make “!” A normal character !, but it is easier to read and use)

0

Hemant verma Feb 24 '14 at 10:37

source share

Steve · Accepted Answer · 2012-10-18T13:50:28+0000

Here is one way: awk :

 awk '!a[$3]++' file.txt

Results:

 10.10.12 bib24 Avenger goodone 10.10.12 bib21 The_Dark_Knight_Rises betterone 10.10.12 bib35 Ice_Age wow 11.10.12 bib53 TheAmazingSpiderMan nice

Filter a text file to get unique entries based on the value in column 3

More articles: