This is an automated email from the ASF dual-hosted git repository.
paulk pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/groovy-website.git
The following commit(s) were added to refs/heads/asf-site by this push:
new 6cc1992 blog post skeleton (draft)
6cc1992 is described below
commit 6cc1992d42bca283666891dcae0b3d038c759974
Author: Paul King <[email protected]>
AuthorDate: Sat Jan 18 16:53:31 2025 +1000
blog post skeleton (draft)
---
site/src/site/blog/groovy-text-similarity.adoc | 37 ++++++++++++++++++++++++++
1 file changed, 37 insertions(+)
diff --git a/site/src/site/blog/groovy-text-similarity.adoc
b/site/src/site/blog/groovy-text-similarity.adoc
new file mode 100644
index 0000000..6637f81
--- /dev/null
+++ b/site/src/site/blog/groovy-text-similarity.adoc
@@ -0,0 +1,37 @@
+= Groovy Text Similarity
+Paul King
+:revdate: 2025-01-25T19:22:00+00:00
+:draft: true
+:keywords: groovy, deep learning, apache commons
+:description: This blog looks at processing some algorithms for testing text
similarity.
+
+== Introduction
+
+String similarity helps answer questions like:
+
+* Are two Jira/GitHub issues duplicates of the same issue?
+* Are two (or more) customer records actually for the same customer?
+* Is some social media topic trending because multiple posts are really about
the same thing?
+* Can I understand some natural language customer request even when it
contains spelling mistakes?
+* As a doctor, can I find a medical journal paper discussing a patient's
medical diagnosis/symptoms/treatment?
+* As a programmer, can I find a solution to my coding problem?
+
+== Simple comparisons
+
+Examines three libraries for performing similarity matching:
+
+* info.debatty:java-string-similarity
+* org.apache.commons:commons-text Apache Commons Text
+* commons-codec:commons-codec Apache Commons Codec for Soundex
+
+== Further information
+
+Source code for this post:
+
+* https://github.com/paulk-asert/groovy-string-similarity
+
+Other referenced sites:
+
+* https://commons.apache.org/proper/commons-text/
+* https://commons.apache.org/proper/commons-codec/
+* https://github.com/tdebatty/java-string-similarity