Fuzzy string search library in Java [closed]

Posted 2019-01-03 05:04

I'm looking for a high performance Java library for fuzzy string search.

There are numerous algorithms for finding similar strings: Levenshtein distance, Daitch-Mokotoff Soundex, n-grams, etc.
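For reference, Levenshtein distance is the smallest of these to write by hand. Below is a minimal sketch in plain Java, not taken from any particular library; the class name `LevenshteinSketch` is made up for illustration. It uses the standard two-row dynamic programming table, so it runs in O(n·m) time and is meant as a baseline rather than a high-performance solution.

```java
final class LevenshteinSketch {
    private LevenshteinSketch() {}

    /**
     * Returns the minimum number of single-character insertions,
     * deletions, and substitutions needed to turn a into b.
     */
    static int distance(String a, String b) {
        // prev holds the distances for the previous row of the DP table,
        // curr the row being filled in.
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];

        // Turning the empty prefix of a into the first j chars of b takes j insertions.
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }

        for (int i = 1; i <= a.length(); i++) {
            // Turning the first i chars of a into the empty string takes i deletions.
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                int insertion = curr[j - 1] + 1;
                int deletion = prev[j] + 1;
                int substitution = prev[j - 1] + cost;
                curr[j] = Math.min(Math.min(insertion, deletion), substitution);
            }
            // Swap the rows so curr becomes prev for the next iteration.
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }
}
```

Calling `LevenshteinSketch.distance("kitten", "sitting")` gives 3. Comparing a query against every stored string this way gets slow on large data sets, which is why an indexed approach is usually preferred.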

Which Java implementations exist, and what are their pros and cons? I'm aware of Lucene; is there any other solution, or is Lucene the best?

I found these, does anyone have experience with them?

8 answers
【Aperson】
#2 · 2019-01-03 05:29

You can try the Completely library. It relies on text preprocessing to build an in-memory index that answers (fuzzy) searches over large data sets efficiently. Unlike Lucene and other full-featured text search libraries, its API is small and easy to get started with.
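To illustrate the general idea of preprocessing text into an in-memory index, here is a rough sketch in plain Java. It is not Completely's API or its algorithm; the class `TrigramIndexSketch`, its methods, and the choice of character trigrams with Jaccard similarity are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;

final class TrigramIndexSketch {
    // Maps each trigram to the set of indexed entries that contain it.
    private final Map<String, Set<String>> postings = new HashMap<>();

    /** Splits a lower-cased, space-padded string into its distinct 3-character windows. */
    static Set<String> trigrams(String s) {
        String padded = "  " + s.toLowerCase(Locale.ROOT) + " ";
        Set<String> grams = new HashSet<>();
        for (int i = 0; i + 3 <= padded.length(); i++) {
            grams.add(padded.substring(i, i + 3));
        }
        return grams;
    }

    /** Preprocessing step: registers the entry under each of its trigrams. */
    void add(String entry) {
        for (String gram : trigrams(entry)) {
            postings.computeIfAbsent(gram, k -> new HashSet<>()).add(entry);
        }
    }

    /** Returns, in alphabetical order, entries whose trigram Jaccard similarity to the query is at least minSimilarity. */
    List<String> search(String query, double minSimilarity) {
        Set<String> queryGrams = trigrams(query);

        // Count shared trigrams per candidate; only entries sharing
        // at least one trigram with the query are ever touched.
        Map<String, Integer> shared = new HashMap<>();
        for (String gram : queryGrams) {
            for (String entry : postings.getOrDefault(gram, Collections.emptySet())) {
                shared.merge(entry, 1, Integer::sum);
            }
        }

        List<String> hits = new ArrayList<>();
        for (Map.Entry<String, Integer> candidate : shared.entrySet()) {
            int intersection = candidate.getValue();
            int union = queryGrams.size() + trigrams(candidate.getKey()).size() - intersection;
            if ((double) intersection / union >= minSimilarity) {
                hits.add(candidate.getKey());
            }
        }
        Collections.sort(hits);
        return hits;
    }
}
```

With "lucene", "levenshtein", and "soundex" indexed, `search("lucen", 0.4)` returns only "lucene". The cost is paid once at indexing time, and a query only inspects entries that share at least one trigram with it.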

一夜七次
#3 · 2019-01-03 05:36

I think Apache Lucene is the only way to go. I don't know of any better search library.

Apache Lucene(TM) is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform.
