Yeah, this sounds reasonable and works fine -- this is the way Leo G. implemented it in Egothor. I will wait until this patch appears in the CVS then, thanks Doug.
Dawid DC> Dawid Weiss wrote: >> DC> A longer term fix might be to add an option to construct summaries >> DC> directly as plain text. This might be done as follows: >> DC> - replace Summary.toString() with both toText() and toHtml() methods; >> DC> - replace HitSummarizer.getSummary() with getHtmlSummary() and >> DC> getTextSummary(). This would require changes to NutchBean, >> DC> DistributedSearch, etc. >> >> I think the latter approach provides a cleaner solution. I will >> implement it and provide a patch to this list. DC> I've thought a bit more about this. Other folks have also wanted to DC> alter summaries in order to achieve different effects. So whatever DC> solution we come up with should be easily extensible. Adding new DC> methods each time probably isn't the best route. DC> So I think a better approach is to make it easy to plug in different DC> summarizer on the backend. Then you just alter your configuration to DC> name a different summarizer, and, voila, you get the style of summary DC> you desire. DC> Until we have a more elaborate plugin mechanism, I think the best DC> approach is to add a config file option which names a class which DC> implements HitSummarizer, then somehow install an instance of that on DC> NutchBean line 80, where the summarizer is specified. It's a little DC> more complicated than that, but I think you get the general idea. DC> Does that sound like a reasonable approach? DC> Doug DC> ------------------------------------------------------- DC> The SF.Net email is sponsored by EclipseCon 2004 DC> Premiere Conference on Open Tools Development and Integration DC> See the breadth of Eclipse activity. February 3-5 in Anaheim, CA. DC> http://www.eclipsecon.org/osdn DC> _______________________________________________ DC> Nutch-developers mailing list DC> [EMAIL PROTECTED] DC> https://lists.sourceforge.net/lists/listinfo/nutch-developers ------------------------------------------------------- The SF.Net email is sponsored by EclipseCon 2004 Premiere Conference on Open Tools Development and Integration See the breadth of Eclipse activity. February 3-5 in Anaheim, CA. http://www.eclipsecon.org/osdn _______________________________________________ Nutch-developers mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/nutch-developers
