What would be the best way to go about getting a function that returns a random English word (preferably a noun), without keeping a list of all possible words in a file before hand?
问题:
回答1:
Word lists need not take up all that much space.
Here's a JSON wordlist with over 5000 words, all nouns. It clocks in at under 50K, the size of a medium-sized jpeg image.
I'll leave choosing a random one as an exercise for the reader.
回答2:
You can't. There is no algorithm to generate meaningful words. You can only generate words that sound like English, but they won't have any meaning.
回答3:
You could have the function try and parse an online resource such as:
http://www.zokutou.co.uk/randomword/
回答4:
Another theoretical approach: you could scrape the random wikipedia article page and return the N-th word of the article.
回答5:
Just use setgetgo's random word api. It's free, it's easy, and it rocks.
http://randomword.setgetgo.com/
回答6:
There's a random word generator here - it's not English but it's English-ish, i.e. the words are similar enough to language that a user can read the words and store them in short-term memory.
Source code is in C# and a bit kludged, but you could use a similar approach in Python to generate lots of words without having to store a massive list.
Alternatively, you could call the web service on the demo page directly - it's hosted on GoDaddy though, so no guarantees it will work in production!
回答7:
You can download the "words common to SOWPODS and TWL" lists from http://www.math.toronto.edu/jjchew/scrabble/lists/ . I put all the words in those files together and the list weighed in at about 642k. Not huge by any standards. The lists do contain a whole lot of obscure words though, since they are meant for tournament Scrabble use. The good thing is that the lists form a substantial subset of the English language.
回答8:
Well, you have three options:
- Hard-code the list of words and initialize an array with it.
- Fetch the list from an internet location instead of a file.
- Keep a list of possible words in a file.
The only way to avoid the above is if you're not concerned whether the word is real: you can just generate random-length strings of characters. (There's no way to programmatically generate words without a dictionary list to go from.)