On 9/7/06, heack <[EMAIL PROTECTED]> wrote:
> I meet the same problem with you. I think if there exist a way to store a
> description to .mp3 .wmv or .avi .. files, and could be searched.

I believe the problem can't be solved by adding a new parse plugin to
parse "all other (binary) filetypes": this additional parser would
still get the complete (possibly very big) file from the remote host.
At which level are the http.content.limit and file.content.limit taken
into accont?
I'm thinking a new configuration setting (say,
(http|file).unsupported.extensions) set to "mp3|iso|psd" etc. could
guide the fetch algorithm so that it doesn't fetch the file contents
for these files, but simply fetches information *about* the files in
question. How does that sound?

t.n.a.

-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to