Remove duplicate row using PowerShell

Question

I have a text file:

1 2 4 5 6 7 1 3 5 6 7 8 1 2 3 4 5 6 1 2 4 5 6 7

Here the first and last line are close. I have many double line files. I need to remove all duplicates.

+6

Suliman 30 sept '11 at 11:46

5 answers

Alan jackson · Answer 1 · 2014-02-12T20:35:38+0000

It all seems very complicated. It is as simple as:

 gc $filename | sort | get-unique > $output

Using actual file names instead of variables:

 gc test.txt| sort | get-unique > unique.txt

Shay levy · Answer 2 · 2011-09-30T13:29:02+0000

To get unique strings:

 PS > Get-Content test.txt | Select-Object -Unique 1 2 4 5 6 7 1 3 5 6 7 8 1 2 3 4 5 6

To remove a duplicate

 PS > Get-Content test.txt | group -noelement | ` where {$_.count -eq 1} | select -expand name 1 3 5 6 7 8 1 2 3 4 5 6

Roman kuzmin · Answer 3 · 2011-09-30T12:10:58+0000

If the order is not important:

 Get-Content test.txt | Sort-Object -Unique | Set-Content test-1.txt

If order is important:

 $set = @{} Get-Content test.txt | %{ if (!$set.Contains($_)) { $set.Add($_, $null) $_ } } | Set-Content test-2.txt

CB. · Answer 4 · 2011-09-30T12:07:04+0000

Try something like this:

 $a = @{} # declare an arraylist type gc .\mytextfile.txt | % { if (!$a.Contains($_)) { $a.add($_)}} | out-null $a #now contains no duplicate lines

To set the contents of $ a to mytextfile.txt:

 $a | out-file .\mytextfile.txt

Will · Answer 5 · 2017-03-27T00:53:40+0000

 $file = "C:\temp\filename.txt" (gc $file | Group-Object | %{$_.group | select -First 1}) | Set-Content $file

The source file now contains only unique lines

Already published options did not help me for some reason