How to pick a random english word from a list [clo

2019-01-18 00:22发布

问题:

What would be the best way to go about getting a function that returns a random English word (preferably a noun), without keeping a list of all possible words in a file before hand?

回答1:

Word lists need not take up all that much space.

Here's a JSON wordlist with over 5000 words, all nouns. It clocks in at under 50K, the size of a medium-sized jpeg image.

I'll leave choosing a random one as an exercise for the reader.



回答2:

You can't. There is no algorithm to generate meaningful words. You can only generate words that sound like English, but they won't have any meaning.



回答3:

You could have the function try and parse an online resource such as:

http://www.zokutou.co.uk/randomword/



回答4:

Another theoretical approach: you could scrape the random wikipedia article page and return the N-th word of the article.



回答5:

Just use setgetgo's random word api. It's free, it's easy, and it rocks.

http://randomword.setgetgo.com/



回答6:

There's a random word generator here - it's not English but it's English-ish, i.e. the words are similar enough to language that a user can read the words and store them in short-term memory.

Source code is in C# and a bit kludged, but you could use a similar approach in Python to generate lots of words without having to store a massive list.

Alternatively, you could call the web service on the demo page directly - it's hosted on GoDaddy though, so no guarantees it will work in production!



回答7:

You can download the "words common to SOWPODS and TWL" lists from http://www.math.toronto.edu/jjchew/scrabble/lists/ . I put all the words in those files together and the list weighed in at about 642k. Not huge by any standards. The lists do contain a whole lot of obscure words though, since they are meant for tournament Scrabble use. The good thing is that the lists form a substantial subset of the English language.



回答8:

Well, you have three options:

  • Hard-code the list of words and initialize an array with it.
  • Fetch the list from an internet location instead of a file.
  • Keep a list of possible words in a file.

The only way to avoid the above is if you're not concerned whether the word is real: you can just generate random-length strings of characters. (There's no way to programmatically generate words without a dictionary list to go from.)



标签: python random