I'm looking for a high-performance Java library for fuzzy string search.
There are numerous algorithms for finding similar strings: Levenshtein distance, Daitch-Mokotoff Soundex, n-grams, etc.
What Java implementations exist, and what are their pros and cons? I'm aware of Lucene; is there any other solution, or is Lucene the best?
I found these, does anyone have experience with them?
You can try the Completely library. It relies on text preprocessing to build an in-memory index for efficiently answering (fuzzy) searches over large data sets. Unlike Lucene and other full-featured text search libraries, its API is small and easy to get started with.
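To illustrate the general idea of preprocessing text into an in-memory index, here is a minimal sketch using only the JDK. It is not the Completely API or its actual algorithm. The class `TrigramIndex` and its methods are hypothetical names made up for this example. It indexes words by their character trigrams, uses the index to collect candidates, and then verifies each candidate with Levenshtein distance.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration class, not part of any library mentioned above.
class TrigramIndex {
    // Maps each trigram to the set of indexed words that contain it.
    private final Map<String, Set<String>> postings = new HashMap<>();

    // Split a padded, lower-cased word into overlapping 3-character grams.
    public static Set<String> trigrams(String word) {
        String padded = "  " + word.toLowerCase() + " ";
        Set<String> grams = new HashSet<>();
        for (int i = 0; i + 3 <= padded.length(); i++) {
            grams.add(padded.substring(i, i + 3));
        }
        return grams;
    }

    // Preprocessing step: register the word under each of its trigrams.
    public void add(String word) {
        for (String gram : trigrams(word)) {
            postings.computeIfAbsent(gram, k -> new HashSet<>()).add(word);
        }
    }

    // Classic dynamic-programming Levenshtein distance using two rows.
    public static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Collect words sharing at least one trigram with the query,
    // then keep those within maxDistance edits.
    public List<String> search(String query, int maxDistance) {
        Set<String> candidates = new HashSet<>();
        for (String gram : trigrams(query)) {
            candidates.addAll(postings.getOrDefault(gram, Collections.emptySet()));
        }
        List<String> hits = new ArrayList<>();
        for (String candidate : candidates) {
            if (levenshtein(query.toLowerCase(), candidate.toLowerCase()) <= maxDistance) {
                hits.add(candidate);
            }
        }
        return hits;
    }
}
```

With this sketch, a lookup only runs the edit-distance check on words that share a trigram with the query instead of scanning the whole data set. The trade-off is that very short words, or matches that share no trigram with the query, can be missed, so a real library would likely need a more careful candidate filter.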
Apache Lucene is the only way, I think. I don't know of any better search library.