Best way to remove duplicate characters (words) in

2019-02-20 19:02发布

What would be the best way of removing any duplicate characters and sets of characters separated by spaces in string?

I think this example explains it better:

foo = 'h k k h2 h'

should become:

foo = 'h k h2' # order not important

Other example:

foo = 's s k'

becomes:

foo = 's k'

3条回答
在下西门庆
2楼-- · 2019-02-20 19:15
' '.join(set(foo.split()))

Note that split() by default will split on all whitespace characters. (e.g. tabs, newlines, spaces)

So if you want to split ONLY on a space then you have to use:

' '.join(set(foo.split(' ')))
查看更多
做自己的国王
3楼-- · 2019-02-20 19:27
out = []
for word in input.split():
    if not word in out:
        out.append(word)
output_string = " ".join(out)

Longer than using a set, but it keeps the order.

Edit: Nevermind. I missed the part in the question about order not being important. Using a set is better.

查看更多
你好瞎i
4楼-- · 2019-02-20 19:29

Do you mean?

' '.join( set( someString.split() ) )

That's the unique space-delimited words in no particular order.

查看更多
登录 后发表回答