I'm trying to parse words out of a string and put them into an array. I've tried the following:
@string1 = "oriented design, decomposition, encapsulation, and testing. Uses "
puts @string1.scan(/\s([^\,\.\s]*)/)
It seems to do the trick, but it's a bit shaky (for example, I should include more special characters). Is there a better way to do this in Ruby?
Optional: I have a CS course description. I intend to extract all the words from it into an array of strings, remove the most common English words from that array, and then use the remaining words as tags that users can use to search for CS courses.
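The tagging idea described above could be sketched roughly as follows. The `STOP_WORDS` list and the `extract_tags` helper are hypothetical names made up for illustration, and the stop-word list is deliberately tiny; a real one would be much longer.

```ruby
require 'set'

# Hypothetical, deliberately short stop-word list (assumption, not a standard list).
STOP_WORDS = Set.new(%w[a an and the of to in uses])

# Hypothetical helper: lowercases the text, pulls out runs of ASCII letters,
# digits and apostrophes, drops stop words, and removes duplicates.
def extract_tags(text)
  text.downcase
      .scan(/[a-z0-9']+/)
      .reject { |word| STOP_WORDS.include?(word) }
      .uniq
end

description = "oriented design, decomposition, encapsulation, and testing. Uses "
tags = extract_tags(description)
p tags
```

Using a `Set` keeps the stop-word lookup cheap even when the list grows to a few hundred entries.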
For me, the best way to split sentences is:
It works well even with multilingual words and punctuation marks:
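One way to get this behaviour, offered as a sketch and not necessarily the exact pattern meant here, is to `scan` for runs of Unicode letters with `\p{L}`. The assumption that a word may contain an inner apostrophe or hyphen is mine.

```ruby
# Assumption: a "word" is a run of Unicode letters, optionally joined
# by an apostrophe or a hyphen (e.g. "s'il", "well-known").
WORD = /\p{L}+(?:['-]\p{L}+)*/

english = "oriented design, decomposition, encapsulation, and testing. Uses "
french  = "Voilà: un café, s'il vous plaît."

english_words = english.scan(WORD)
french_words  = french.scan(WORD)

p english_words
p french_words
```

Because `scan` collects what matches instead of splitting on delimiters, punctuation never has to be listed explicitly.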
Well, you could split the string on spaces if that's your delimiter of interest
Or split on word boundaries
Or on non-words
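The three options above can be compared side by side. This is only a sketch using the string from the question; the variable names are arbitrary.

```ruby
s = "oriented design, decomposition, encapsulation, and testing. Uses "

# 1. Split on whitespace: punctuation stays attached to the words.
by_space = s.split(' ')

# 2. Split on word boundaries: nothing is discarded, so the
#    punctuation and spaces come back as separate elements.
by_boundary = s.split(/\b/)

# 3. Split on runs of non-word characters: only the words remain.
by_non_word = s.split(/\W+/)

p by_space
p by_boundary
p by_non_word
```

For extracting plain words, the third form is usually the closest fit, since the first keeps trailing commas and the second keeps the separators themselves.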
Hint: try testing each of these on http://rubular.com
And note that Ruby 1.9 has some differences from 1.8
The split command.
will split the string into an array based on a regular expression. \W matches any "non-word" character, and the "+" means "one or more", so a run of consecutive delimiters (such as a comma followed by a space) is treated as a single delimiter.
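A small sketch of why the "+" matters, using a fragment of the question's string (the variable names are arbitrary):

```ruby
fragment = "design, decomposition"

# Without "+", the comma and the space are two separate delimiters,
# which leaves an empty string between them.
loose = fragment.split(/\W/)

# With "+", the comma and the space count as one delimiter.
tight = fragment.split(/\W+/)

# A string that starts with a delimiter still yields a leading empty
# string, which can be dropped with reject.
leading = ", design decomposition".split(/\W+/)
cleaned = leading.reject(&:empty?)

p loose
p tight
p leading
p cleaned
```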
For Rails you can use something like this: