I need to do Word by word comparison of two strings. Something like diff, but for words, not for lines.
Like it is done in wikipedia http://en.wikipedia.org/w/index.php?title=Horapollo&action=historysubmit&diff=21895647&oldid=21893459
In result I want return the two arrays of indexes of words, which are different in two string.
Are there any libraries/frameworks/standalone_methods for .NET which can do this?
P.S. I want to compare several kilobytes of text
You might try this, though I am not sure it's what you are looking for StringUtils.difference() (http://commons.apache.org/lang/api-release/org/apache/commons/lang/StringUtils.html#difference%28java.lang.String,%20java.lang.String%29)
Alternately, the Eclipse (eclipse.org) project has a diff comparison feature, which means they must also have code to determine the differences, you might browse through their API or source to see what you can find.
Good luck.