Yeah, this sounds reasonable and works fine -- this is the way Leo G.
implemented it in Egothor. I will wait until this patch appears in the CVS
then, thanks Doug.

Dawid

DC> Dawid Weiss wrote:
>> DC> A longer term fix might be to add an option to construct summaries
>> DC> directly as plain text.  This might be done as follows:
>> DC>    - replace Summary.toString() with both toText() and toHtml() methods;
>> DC>    - replace HitSummarizer.getSummary() with getHtmlSummary() and
>> DC>      getTextSummary().  This would require changes to NutchBean,
>> DC>      DistributedSearch, etc.
>> 
>> I think the latter approach provides a cleaner solution. I will
>> implement it and provide a patch to this list.
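For reference, the two-method idea quoted above might look roughly like the
sketch below. This is only an illustration, not the actual Nutch Summary
class: the Fragment type, the add() method and the escaping helper are
assumptions made up for the example. Only the names toText() and toHtml()
come from the proposal.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical, simplified stand-in for a summary; NOT the real Nutch class.
class Summary {
    // One piece of summary text, optionally marked as a query-term highlight.
    private static final class Fragment {
        final String text;
        final boolean highlight;

        Fragment(String text, boolean highlight) {
            this.text = text;
            this.highlight = highlight;
        }
    }

    private final List<Fragment> fragments = new ArrayList<>();

    // Hypothetical builder method, returns this so calls can be chained.
    Summary add(String text, boolean highlight) {
        fragments.add(new Fragment(text, highlight));
        return this;
    }

    // Plain-text rendering: no markup and no entity escaping.
    String toText() {
        StringBuilder sb = new StringBuilder();
        for (Fragment f : fragments) {
            sb.append(f.text);
        }
        return sb.toString();
    }

    // HTML rendering: text is escaped and highlights are wrapped in <b>.
    String toHtml() {
        StringBuilder sb = new StringBuilder();
        for (Fragment f : fragments) {
            String escaped = escape(f.text);
            sb.append(f.highlight ? "<b>" + escaped + "</b>" : escaped);
        }
        return sb.toString();
    }

    // Minimal escaping; '&' must be replaced first.
    private static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }
}
```

The point of the split is that a plain-text consumer never has to strip
markup or decode entities that the HTML rendering adds.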

DC> I've thought a bit more about this.  Other folks have also wanted to 
DC> alter summaries in order to achieve different effects.  So whatever 
DC> solution we come up with should be easily extensible.  Adding new 
DC> methods each time probably isn't the best route.

DC> So I think a better approach is to make it easy to plug in a different 
DC> summarizer on the backend.  Then you just alter your configuration to 
DC> name a different summarizer, and, voila, you get the style of summary 
DC> you desire.

DC> Until we have a more elaborate plugin mechanism, I think the best 
DC> approach is to add a config file option which names a class which 
DC> implements HitSummarizer, then somehow install an instance of that on 
DC> NutchBean line 80, where the summarizer is specified.  It's a little 
DC> more complicated than that, but I think you get the general idea.
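A minimal sketch of the config-driven approach described above, under
stated assumptions: the HitSummarizer interface shown here is a simplified
stand-in (the real Nutch interface has a different signature), the property
name "searcher.summarizer.class" is invented for the example, and
java.util.Properties stands in for whatever configuration mechanism Nutch
actually uses. The two summarizer classes are toy implementations.

```java
import java.util.Properties;

// Simplified, hypothetical stand-in for Nutch's HitSummarizer interface.
interface HitSummarizer {
    String getSummary(String text, String query);
}

// Toy HTML summarizer: wraps each occurrence of the query in <b> tags.
class HtmlSummarizer implements HitSummarizer {
    public String getSummary(String text, String query) {
        return text.replace(query, "<b>" + query + "</b>");
    }
}

// Toy plain-text summarizer: returns the text without any markup.
class PlainTextSummarizer implements HitSummarizer {
    public String getSummary(String text, String query) {
        return text;
    }
}

class SummarizerFactory {
    // Hypothetical property name; not an actual Nutch configuration key.
    static final String KEY = "searcher.summarizer.class";

    // Instantiates the class named in the configuration, falling back to
    // HtmlSummarizer when the property is absent.
    static HitSummarizer create(Properties conf) {
        String name = conf.getProperty(KEY, HtmlSummarizer.class.getName());
        try {
            Class<? extends HitSummarizer> cls =
                Class.forName(name).asSubclass(HitSummarizer.class);
            return cls.getDeclaredConstructor().newInstance();
        } catch (ReflectiveOperationException | ClassCastException e) {
            throw new IllegalStateException("Cannot create summarizer: " + name, e);
        }
    }
}
```

With something like this in place, the code that currently hard-wires the
summarizer would call SummarizerFactory.create(conf) instead, and switching
summary styles would only need a configuration change.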

DC> Does that sound like a reasonable approach?

DC> Doug



DC> -------------------------------------------------------
DC> The SF.Net email is sponsored by EclipseCon 2004
DC> Premiere Conference on Open Tools Development and Integration
DC> See the breadth of Eclipse activity. February 3-5 in Anaheim, CA.
DC> http://www.eclipsecon.org/osdn
DC> _______________________________________________
DC> Nutch-developers mailing list
DC> [EMAIL PROTECTED]
DC> https://lists.sourceforge.net/lists/listinfo/nutch-developers


