On 9/7/06, heack <[EMAIL PROTECTED]> wrote: > I meet the same problem with you. I think if there exist a way to store a > description to .mp3 .wmv or .avi .. files, and could be searched.
I believe the problem can't be solved by adding a new parse plugin to parse "all other (binary) filetypes": this additional parser would still get the complete (possibly very big) file from the remote host. At which level are the http.content.limit and file.content.limit taken into accont? I'm thinking a new configuration setting (say, (http|file).unsupported.extensions) set to "mp3|iso|psd" etc. could guide the fetch algorithm so that it doesn't fetch the file contents for these files, but simply fetches information *about* the files in question. How does that sound? t.n.a. ------------------------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
