Bugs item #2929885, was opened at 2010-01-11 17:01
Message generated for change (Settings changed) made by cornuz
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=482468&aid=2929885&group_id=56967
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: PFtijah
Group: Pathfinder CVS Head
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: Roberto Cornacchia (cornuz)
Assigned to: Henning Rode (hrode)
>Summary: PFTIJAH: whitelist in "_param" bat incomplete
Initial Comment:
I create the pftijah index with a whitelist, as in:
<TijahOptions stemmer="porter-english" whitelist="invention-title abstract
description claim"/>
The whitelist works well. The tag dictionary contains only the tags I wanted +
all the tags included.
mil>bat("tj_DFLT_FT_INDEX_tagdict").print();
#---------------------------------#
# h t # name
# void str # type
#---------------------------------#
[ 0...@0, "_DOCUMENT_ROOT" ]
[ 1...@0, "invention-title" ]
[ 2...@0, "abstract" ]
[ 3...@0, "p" ]
[ 4...@0, "description" ]
[ 5...@0, "ul" ]
[ 6...@0, "li" ]
[ 7...@0, "sup" ]
[ 8...@0, "sl" ]
[ 9...@0, "heading" ]
[ 1...@0, "u" ]
[ 1...@0, "tables" ]
[ 1...@0, "table" ]
[ 1...@0, "tgroup" ]
[ 1...@0, "tbody" ]
[ 1...@0, "row" ]
[ 1...@0, "entry" ]
[ 1...@0, "sub" ]
[ 1...@0, "img" ]
[ 1...@0, "b" ]
[ 2...@0, "claim" ]
[ 2...@0, "claim-text" ]
However, the "tj_DFLT_FT_INDEX_param" bat contains only the FIRST tag specified
in my whitelist:
mil>bat("tj_DFLT_FT_INDEX_param").print();;
#-------------------------------------------------#
# h t # name
# str str # type
#-------------------------------------------------#
[ "curFragment", "0" ]
[ "preExpansion", "4" ]
[ "_version", "1.1" ]
[ "stemmer", "porter-english" ]
[ "tokenizer", "flex" ]
[ "name", "DFLT_FT_INDEX" ]
[ "fragmentSize", "1073741823" ]
[ "delay_finalize", "0" ]
[ "whitelist", "invention-title" ]
[ "lastStopWord", "430" ]
[ "_last_tijahPre", "7662" ]
[ "status", "finalized" ]
[ "collectionSize", "7055" ]
[ "_last_finalizedPre", "7661" ]
----------------------------------------------------------------------
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=482468&aid=2929885&group_id=56967
------------------------------------------------------------------------------
This SF.Net email is sponsored by the Verizon Developer Community
Take advantage of Verizon's best-in-class app development support
A streamlined, 14 day to market process makes app distribution fast and easy
Join now and get one step closer to millions of Verizon customers
http://p.sf.net/sfu/verizon-dev2dev
_______________________________________________
Monetdb-bugs mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/monetdb-bugs