[ https://issues.apache.org/jira/browse/LUCENE-1628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12721044#action_12721044 ]
Mark Miller commented on LUCENE-1628: ------------------------------------- bq. mark, on the same topic: if possible, at some time it would be great to know which licenses are OK, and which ones are not. Found it. No Problem: * Apache License 2.0 * ASL 1.1 * BSD * MIT/X11 * NCSA * W3C Software license * X.Net * zlib/libpng with some hassle: * CDDL 1.0 * CPL 1.0 * EPL 1.0 * IPL 1.0 * MPL 1.0 and MPL 1.1 * SPL 1.0 http://www.apache.org/legal/3party.html > Persian Analyzer > ---------------- > > Key: LUCENE-1628 > URL: https://issues.apache.org/jira/browse/LUCENE-1628 > Project: Lucene - Java > Issue Type: New Feature > Components: contrib/analyzers > Reporter: Robert Muir > Assignee: Mark Miller > Priority: Minor > Fix For: 2.9 > > Attachments: LUCENE-1628.patch, LUCENE-1628.patch > > > A simple persian analyzer. > i measured trec scores with the benchmark package below against > http://ece.ut.ac.ir/DBRG/Hamshahri/ : > SimpleAnalyzer: > SUMMARY > Search Seconds: 0.012 > DocName Seconds: 0.020 > Num Points: 981.015 > Num Good Points: 33.738 > Max Good Points: 36.185 > Average Precision: 0.374 > MRR: 0.667 > Recall: 0.905 > Precision At 1: 0.585 > Precision At 2: 0.531 > Precision At 3: 0.513 > Precision At 4: 0.496 > Precision At 5: 0.486 > Precision At 6: 0.487 > Precision At 7: 0.479 > Precision At 8: 0.465 > Precision At 9: 0.458 > Precision At 10: 0.460 > Precision At 11: 0.453 > Precision At 12: 0.453 > Precision At 13: 0.445 > Precision At 14: 0.438 > Precision At 15: 0.438 > Precision At 16: 0.438 > Precision At 17: 0.429 > Precision At 18: 0.429 > Precision At 19: 0.419 > Precision At 20: 0.415 > PersianAnalyzer: > SUMMARY > Search Seconds: 0.004 > DocName Seconds: 0.011 > Num Points: 987.692 > Num Good Points: 36.123 > Max Good Points: 36.185 > Average Precision: 0.481 > MRR: 0.833 > Recall: 0.998 > Precision At 1: 0.754 > Precision At 2: 0.715 > Precision At 3: 0.646 > Precision At 4: 0.646 > Precision At 5: 0.631 > Precision At 6: 0.621 > Precision At 7: 0.593 > Precision At 8: 0.577 > Precision At 9: 0.573 > Precision At 10: 0.566 > Precision At 11: 0.572 > Precision At 12: 0.562 > Precision At 13: 0.554 > Precision At 14: 0.549 > Precision At 15: 0.542 > Precision At 16: 0.538 > Precision At 17: 0.533 > Precision At 18: 0.527 > Precision At 19: 0.525 > Precision At 20: 0.518 -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e-mail: java-dev-h...@lucene.apache.org