Bugs item #2934409, was opened at 2010-01-18 18:15
Message generated for change (Settings changed) made by jflokstra
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=482468&aid=2934409&group_id=56967

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: PFtijah
Group: Pathfinder "stable"
>Status: Closed
>Resolution: Fixed
Priority: 5
Private: No
Submitted By: Roberto Cornacchia (cornuz)
Assigned to: Jan Flokstra (jflokstra)
Summary: PFTIJAH: indexing whitelist gives inconsistent results

Initial Comment:
When using a whitelist for pftijah indexing, as in <TijahOptions 
whitelist="tag1 tag2 tag3"/>,
it seems that the order in which the tags are written does matter.
So, <TijahOptions whitelist="tag3 tag1 tag2"/> would produce a different index.

I couldn't figure out exactly what conditions cause the problem.
However, I have the impression it wouldn't happen when those tag names don't 
appear nested into each other.

The test in attachment uses 2 simple xml files (actually html).
When using only one (either) of them, I don't see the bug. 
When using both, one of the expected tag names in the dictionary disappears
(that is "span", which is in the subtree of one of the tags in whitelist)


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=482468&aid=2934409&group_id=56967

------------------------------------------------------------------------------
The Planet: dedicated and managed hosting, cloud storage, colocation
Stay online with enterprise data centers and the best network in the business
Choose flexible plans and management services without long-term contracts
Personal 24x7 support from experience hosting pros just a phone call away.
http://p.sf.net/sfu/theplanet-com
_______________________________________________
Monetdb-bugs mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/monetdb-bugs

Reply via email to