Remove items from list that are substrings of othe

2019-09-02 01:52发布

This question already has an answer here:

python delete substrings from list of strings 1 answer
How can I count the occurrences of a list item? 22 answers

Python 3:

I have a list of words. I want to create a dict that contains unique words from the list as key and their frequency as value. Also I want to remove any words that are substrings of other words from the list.

For eg:

list  =   ['goon ', 'goonk ', 'goon ', 'goonj ', 'w ', 'wo ', 'wor ', 'world ', 'world ']

Dictionary should be :

dict  =  {'goonj': 1, 'world': 2, 'goonk':1}

I have tried the following methods but I do not get the desired dict.

Method 1: If the substring list for key is empty then I add that key to the dict.

for keyword in list:         
    if not [key for key in list if keyword in key and key != keyword]:
        if keyword in KeywordDict:
            KeywordDict[keyword] += 1
        else:
            KeywordDict[keyword] = 1

Method 2: Add the word to dict and remove all it's substring keys from dict.

if keyword in KeywordDict:
    KeywordDict[keyword] += 1
else:
    KeywordDict[keyword] = 1
for key in KeywordDict:
    if keyword.startswith(key) > -1:
        KeywordDict.pop(key)

I have tried other minor variations of the above methods but the list still contains words that are substrings.

The actual list has about 300 words.

I have also tried using list comprehension and dict comprehension with the same bug.

What am I doing wrong? Can someone suggest an alternative approach?

标签： python list dictionary substring

0条回答

Remove items from list that are substrings of othe

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间