Bugs item #986894, was opened at 2004-07-07 14:55
Message generated for change (Comment added) made by cutting
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=491356&aid=986894&group_id=59548

Category: searcher
Group: None
>Status: Closed
>Resolution: Accepted
Priority: 5
Submitted By: Andrzej Bialecki (abial)
Assigned to: Doug Cutting (cutting)
Summary: Patch to access ParseData of a search hit

Initial Comment:
The attached patch adds the ability to retrieve
ParseData of a search hit. ParseData contains many
valuable properties of a search hit.

This is required (among others) to properly display the
cached content because it's not possible to determine
the character encoding from the output of the
getContent() method (which returns byte[]). The
symptoms are that for HTML pages using non-latin1 or
non-UTF8 encodings the cached preview will almost
certainly look broken. Using the attached patch it is
possible to determine the character encoding from the
ParseData (for HTTP: Content-Type metadata), and encode
the content accordingly.

----------------------------------------------------------------------

>Comment By: Doug Cutting (cutting)
Date: 2004-07-08 13:02

Message:
Logged In: YES 
user_id=21778

I applied this patch.  Thanks!

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=491356&aid=986894&group_id=59548


-------------------------------------------------------
This SF.Net email sponsored by Black Hat Briefings & Training.
Attend Black Hat Briefings & Training, Las Vegas July 24-29 - 
digital self defense, top technical experts, no vendor pitches, 
unmatched networking opportunities. Visit www.blackhat.com
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to