Bugs item #986894, was opened at 2004-07-07 23:55 Message generated for change (Tracker Item Submitted) made by Item Submitter You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=491356&aid=986894&group_id=59548
Category: searcher Group: None Status: Open Resolution: None Priority: 5 Submitted By: Andrzej Bialecki (abial) Assigned to: Doug Cutting (cutting) Summary: Patch to access ParseData of a search hit Initial Comment: The attached patch adds the ability to retrieve ParseData of a search hit. ParseData contains many valuable properties of a search hit. This is required (among others) to properly display the cached content because it's not possible to determine the character encoding from the output of the getContent() method (which returns byte[]). The symptoms are that for HTML pages using non-latin1 or non-UTF8 encodings the cached preview will almost certainly look broken. Using the attached patch it is possible to determine the character encoding from the ParseData (for HTTP: Content-Type metadata), and encode the content accordingly. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=491356&aid=986894&group_id=59548 ------------------------------------------------------- This SF.Net email sponsored by Black Hat Briefings & Training. Attend Black Hat Briefings & Training, Las Vegas July 24-29 - digital self defense, top technical experts, no vendor pitches, unmatched networking opportunities. Visit www.blackhat.com _______________________________________________ Nutch-developers mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/nutch-developers
