Split on either a space or a hyphen?

2020-03-02 05:20发布

In Python, how do I split on either a space or a hyphen?

Input:

You think we did this un-thinkingly?

Desired output:

["You", "think", "we", "did", "this", "un", "thinkingly"]

I can get as far as

mystr.split(' ')

But I don't know how to split on hyphens as well as spaces and the Python definition of split only seems to specify a string. Do I need to use a regex?

3条回答
▲ chillily
2楼-- · 2020-03-02 05:55
>>> import re
>>> text = "You think we did this un-thinkingly?"
>>> re.split(r'\s|-', text)
['You', 'think', 'we', 'did', 'this', 'un', 'thinkingly?']

As @larsmans noted, to split by multiple spaces/hyphens (emulating .split() with no arguments) used [...] for readability:

>>> re.split(r'[\s-]+', text)
['You', 'think', 'we', 'did', 'this', 'un', 'thinkingly?']

Without regex (regex is the most straightforward option in this case):

>>> [y for x in text.split() for y in x.split('-')]
['You', 'think', 'we', 'did', 'this', 'un', 'thinkingly?']

Actually the answer by @Elazar without regex is quite straightforward as well (I would still vouch for regex though)

查看更多
Anthone
3楼-- · 2020-03-02 05:55

A regex is far easier and better, but if you're staunchly opposed to using one:

import itertools

itertools.chain.from_iterable((i.split(" ") for i in myStr.split("-")))
查看更多
做个烂人
4楼-- · 2020-03-02 05:57

If your pattern is simple enough for one (or maybe two) replace, use it:

mystr.replace('-', ' ').split(' ')

Otherwise, use RE as suggested by @jamylak.

查看更多
登录 后发表回答