I have been beating my head into the desk with this issue, and I don't think it's a simple 'uniq' or 'sort' issue.
I have a file, with many duplicate values in them.
File
dog
dog
cat
owl
owl
turkey
weasel
giraffe
giraffe
rooster
The output I am looking for would only have the following from the above file:
Output:
cat
turkey
weasel
rooster
Everything I've found so far either removes all the dupes, and keeping one copy of 'owl' or 'dog', which is not what I need. If it's duplicated, I don't want it at all in the output. The file I have is one I've merged from two other files, and they have nearly 50,000 lines in each one, so you can understand why it takes so long.