This is an automated email from the ASF dual-hosted git repository.

paulk pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/groovy-website.git

commit d35dfb4c26cc6375716fe7910550d67672ffc7b2
Author: Paul King <[email protected]>
AuthorDate: Fri Jan 31 08:50:42 2025 +1000

    minor edits
---
 site/src/site/blog/groovy-text-similarity.adoc | 23 ++++++++++++++++++-----
 1 file changed, 18 insertions(+), 5 deletions(-)

diff --git a/site/src/site/blog/groovy-text-similarity.adoc b/site/src/site/blog/groovy-text-similarity.adoc
index e26360c..285fcb0 100644
--- a/site/src/site/blog/groovy-text-similarity.adoc
+++ b/site/src/site/blog/groovy-text-similarity.adoc
@@ -34,13 +34,20 @@ Handling cases explicitly like this soon becomes tedious.
 We'll look at some libraries which can help us handle comparisons
 in more general ways.
 
-First, we'll examine three libraries for performing similarity matching:
+First, we'll examine two libraries for performing similarity matching using string metrics:
 
-* info.debatty:java-string-similarity
-* org.apache.commons:commons-text Apache Commons Text
-* commons-codec:commons-codec Apache Commons Codec for Soundex
+* `info.debatty:java-string-similarity`
+* `org.apache.commons:commons-text` Apache Commons Text
 
-Then we'll look at some deep learning options.
+Then we'll look at some libraries for phonetic matching:
+
+* `commons-codec:commons-codec` Apache Commons Codec for Soundex and Metaphone
+* `org.openrefine:main` OpenRefine for Metaphone3
+
+Then we'll look at some deep learning options for increased semantic matching:
+
+* `org.deeplearning4j:deeplearning4j-nlp` for Glove and ConceptNet models
+* `ai.djl` with Pytorch for a universal-sentence-encoder model and Tensorflow with an Angle model
 
 == Simple String Metrics
 
@@ -309,3 +316,9 @@ Other referenced sites:
 * https://github.com/tdebatty/java-string-similarity
 * https://github.com/OpenRefine/OpenRefine
 * https://djl.ai/
+
+Related libraries and links:
+
+* https://github.com/EdDuarte/similarity-search-java
+* https://github.com/intuit/fuzzy-matcher
+* https://www.youtube.com/watch?v=AHlnGId-Y-0
