Or simply adding a method to get hold of the HitDetailer in NutchBean...
then one can reconstruct the summary and call getText on a Fragment
(instead of relying on toString() method).
I know it can be done with changes to Nutch, the thing is I didn't want to
do it on my local checkout ;) Developers -- is such change possible
(sensible) on the main trunk?
Dawid
AB> Dawid Weiss wrote:
>>Ok, the workaround gives me a raw text of the page, I don't need it. I
>>need the textual representation. So the previous question still holds:
>>
>>Can you show me how to get hold of a textual representation of a hit
>>(or its summary)?
>>
>>
AB> It looks like some modifications to Nutch will be necessary... I
AB> remember a similar discussion on Lucene highlighting.
AB> The places which require modifications, are
AB> Summary.Highlight.toString(), Summary.Ellipsis.toString(), and
AB> Summary.toString() - I think that adding a method
AB> Summary.toString(String hiStart, String hiEnd), which specifies the
AB> highlight string, and reimplementing Summary.toString() as
AB> Summary.toString("<b>", "</b>") would do the trick....
-------------------------------------------------------
The SF.Net email is sponsored by EclipseCon 2004
Premiere Conference on Open Tools Development and Integration
See the breadth of Eclipse activity. February 3-5 in Anaheim, CA.
http://www.eclipsecon.org/osdn
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers