I have an array of hashes, and I want the unique values out of it. Calling Array.uniq
doesn't give me what I expect.
a = [{:a => 1},{:a => 2}, {:a => 1}]
a.uniq # => [{:a => 1}, {:a => 2}, {:a => 1}]
Where I expected:
[{:a => 1}, {:a => 2}]
In searching around on the net, I didn't come up with a solution that I was happy with. Folks recommended redefining Hash.eql?
and Hash.hash
, since that is what Array.uniq
is querying.
Edit: Where I ran into this in the real world, the hashes were slightly more complex. They were the result of parsed JSON that had multiple fields, some of which the values were hashes as well. I had an array of those results that I wanted to filter out the unique values.
I don't like the redefine Hash.eql?
and Hash.hash
solution, because I would either have to redefine Hash
globally, or redefine it for each entry in my array. Changing the definition of Hash
for each entry would be cumbersome, especially since there may be nested hashes inside of each entry.
Changing Hash
globally has some potential, especially if it were done temporarily. I'd want to build another class or helper function that wrapped saving off the old definitions, and restoring them, but I think this adds more complexity than is really needed.
Using inject
seems like a good alternative to redefining Hash
.
I've had a similar situation, but hashes had keys. I used sorting method.
What I mean:
you have an array:
you sort it (
#sort_by {|t| t[:x]}
) and get this:now a bit modified version of answer by Aaaron Hinni:
I've also tried:
but it's very slow. here is my benchmark:
results:
You can use (tested in ruby 1.9.3),
The pipe method on arrays (available since 1.8.6) performs set union (returning an array), so the following is another possible way to get unique elements of any array
a
:[] | a
I can get what I want by calling
inject
This will return:
The answer you give is similar to the one discussed here. It overrides the
hash
andeql?
methods on the hashes that are to appear in the array which then makesuniq
behave correctly.Assuming your hashes are always single key-value pairs, this will work:
Hash.to_a creates an array of key-value arrays, so the first map gets you:
uniq on Arrays does what you want, giving you:
and then the second map puts them back together as hashes again.