https://bugzilla.wikimedia.org/show_bug.cgi?id=44350
Web browser: ---
Bug ID: 44350
Summary: Cannot search in Javanese script
Product: MediaWiki extensions
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: Lucene Search
Assignee: [email protected]
Reporter: [email protected]
CC: [email protected]
Classification: Unclassified
Mobile Platform: ---
Note: To display the font correctly, visit
http://jv.wikipedia.org/wiki/Pitulung:Aksara_Jawa#English
I can't search using the Javanese script on sites like Javanese Wikipedia
or Wiktionary. The words I'm using in this example are: ꦱꦸꦒꦼꦁ (transliterated:
"sugeng" or "sugêng") and ꦱꦸꦒꦼꦁꦮꦂꦱꦲꦺꦁꦒꦭ꧀ (transliterated: "sugeng warsa enggal" or
"sugêng warsa enggal"; the script form is written without spaces).
For example, jv.wikt has
http://jv.wiktionary.org/wiki/sugêng_warsa_enggal and its script form
http://jv.wiktionary.org/wiki/ꦱꦸꦒꦼꦁꦮꦂꦱꦲꦺꦁꦒꦭ꧀
I tried searching for "ꦱꦸꦒꦼꦁ" and "ꦱꦸꦒꦼꦁꦮꦂꦱꦲꦺꦁꦒꦭ꧀", but both searches return zero results
(other than a title match for the second search term):
* http://jv.wiktionary.org/w/index.php?title=Astamiwa:Pencarian&search=ꦱꦸꦒꦼꦁ&fulltext=1
* http://jv.wiktionary.org/w/index.php?title=Astamiwa:Pencarian&search=ꦱꦸꦒꦼꦁꦮꦂꦱꦲꦺꦁꦒꦭ꧀&fulltext=1
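To reproduce the two searches from a script rather than by pasting the links, the URLs can be rebuilt with the search term percent-encoded as UTF-8. This is only a sketch: the endpoint, page title and parameters are taken from the links above, while `build_search_url`, `WORD` and `PHRASE` are hypothetical names introduced for illustration. It builds the URLs and does not send any request.

```python
from urllib.parse import urlencode

# Endpoint and parameters copied from the search links in this report.
SEARCH_ENDPOINT = "http://jv.wiktionary.org/w/index.php"
SEARCH_TITLE = "Astamiwa:Pencarian"


def build_search_url(term: str) -> str:
    """Return the full-text search URL for `term` (hypothetical helper)."""
    query = urlencode(
        {"title": SEARCH_TITLE, "search": term, "fulltext": "1"},
        safe=":",  # keep the namespace colon readable, as in the original links
    )
    return f"{SEARCH_ENDPOINT}?{query}"


# The two search terms used in this report.
WORD = "ꦱꦸꦒꦼꦁ"
PHRASE = WORD + "ꦮꦂꦱꦲꦺꦁꦒꦭ꧀"

for term in (WORD, PHRASE):
    # Each Javanese code point becomes a %XX%XX%XX sequence in the output.
    print(build_search_url(term))
```

The encoded form is pure ASCII, which avoids depending on how a mail client or terminal handles the raw Javanese characters in a link.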
Expected result: the search returns pages that contain the terms, i.e.
[[sugêng]] and [[sugêng warsa enggal]].
Note: Javanese is a scriptio continua script (written without spaces
between words). I don't know whether that affects the Lucene search
(http://en.wikipedia.org/wiki/Scriptio_continua).
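As a rough illustration of why scriptio continua could matter, here is a small sketch of a search that splits text on whitespace and matches whole tokens. That the Lucene Search extension tokenizes this way is an assumption, not something confirmed in this report, and `whitespace_tokens` and `token_match` are hypothetical helpers. Note that this would only explain a missed word inside a longer phrase; it would not by itself explain why the single-word page titles are not found either.

```python
from typing import List


def whitespace_tokens(text: str) -> List[str]:
    """Split text on whitespace, as a naive tokenizer might (assumption)."""
    return text.split()


def token_match(term: str, document: str) -> bool:
    """Return True only if `term` equals one whole token of `document`."""
    return term in whitespace_tokens(document)


WORD = "ꦱꦸꦒꦼꦁ"                      # "sugêng"
PHRASE = WORD + "ꦮꦂꦱꦲꦺꦁꦒꦭ꧀"           # "sugêng warsa enggal", written without spaces
LATIN = "sugêng warsa enggal"      # the same phrase in Latin transliteration

# Latin text has spaces, so the word is a token of its own.
print(token_match("sugêng", LATIN))   # True

# The Javanese phrase is one unbroken token, so the word is not matched...
print(token_match(WORD, PHRASE))      # False

# ...even though it is literally present at the start of the phrase.
print(WORD in PHRASE)                 # True
```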
Another example in Wikipedia: http://jv.wikipedia.org/wiki/ꦠꦺꦃ Searching for
the title ("ꦠꦺꦃ" - "tèh") or any word in the content returns zero results.