Re: Help with parse-mp3?

2008-01-18 Thread Hasan Diwan
parse-mp3.jar. I see the source for it in the nutch distribution, but not the jar file. I'm a Java newbie so I'm not sure exactly what I need to build the jar file from the source. Any help or pointers would be appreciated. Rick -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: problem with mp3 parser

2007-12-12 Thread Hasan Diwan
and I'd love to take a (brief) look and let you know if I see anything. -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: problem with mp3 parser

2007-12-11 Thread Hasan Diwan
More new features than ever. Check out the new AIM(R) Mail ! - http://webmail.aim.com -- Sent from Gmail for mobile | mobile.google.com Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: Newbie questions about followed links

2007-03-08 Thread Hasan Diwan
the '?' and the links will be followed. -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: How can I setup an mp3 search engine?

2006-10-28 Thread Hasan Diwan
, the plugin (as written) can not pluck information about the file from thin air. -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: crawling a certain site

2006-08-02 Thread Hasan Diwan
, Hasan Diwan [EMAIL PROTECTED]

Re: Please Help.. recrawl script.. will send out to the list when finished for 0.8.0

2006-07-20 Thread Hasan Diwan
Mr Holt: On 7/20/06, Matthew Holt [EMAIL PROTECTED] wrote: there is a resource online that describes manually recrawling, that'd be great as well. Thanks. http://wiki.apache.org/nutch/NutchTutorial -- you're welcome. -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: Eclipse IDE

2006-07-11 Thread Hasan Diwan
under plugin/* and the libraries should contain all the jar files. If you want to keep things simple, just use the build file from eclipse. -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: PluginRuntimeException

2006-03-07 Thread Hasan Diwan
with httpclient. I'm using protocol-http myself. So noted, the change has been made. -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: NullPointerException

2006-03-06 Thread Hasan Diwan
)/value /property Thanks for the help! -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: NullPointerException

2006-03-05 Thread Hasan Diwan
hosts in any domain +^http://([a-z0-9]*\.)*/ # skip everything else -. So, why isn't it fetching anything, if that is indeed the case? -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: NullPointerException

2006-03-05 Thread Hasan Diwan
/nutch-0.7.1/build/plugins/ontology Total hits: 0 -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: NullPointerException

2006-03-05 Thread Hasan Diwan
Mr Tang: On 05/03/06, Jack Tang [EMAIL PROTECTED] wrote: Weird! You are running nutch on local file system or distributed file system? Local file system And can you find the same query hasan via luke? Nope -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: NullPointerException

2006-03-05 Thread Hasan Diwan
On 05/03/06, Jack Tang [EMAIL PROTECTED] wrote: I am not sure what's wrong in nutch-0.7.1 indexing, but now it is possible to upgrade to nutch 0.8(svn version)? It is possible, but I was under the assumption that 0.8 required NDFS? -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: NullPointerException

2006-03-05 Thread Hasan Diwan
On 05/03/06, Jack Tang [EMAIL PROTECTED] wrote: You can still build it on local file system:) Build, yes, but what of deployment? Can I use it in the same way? At present, I don't have enough resources to run a distributed crawl. -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: NullPointerException

2006-03-05 Thread Hasan Diwan
) at org.apache.nutch.crawl.Crawl.main(Crawl.java:104) I need to sleep now, so I'll check back tomorrow. Thanks for all the help! -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: nutch-extensionpoints 0.71

2006-02-27 Thread Hasan Diwan
be getting the line below? 060227 150626 Deleted 0 content duplicates. Thanks again for the kind assistance. -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Re: Duplicate urls in urls file

2006-02-15 Thread Hasan Diwan
plugin does, but I have not had the chance to look at it as yet. -- Cheers, Hasan Diwan [EMAIL PROTECTED]

Duplicate urls in urls file

2006-02-13 Thread Hasan Diwan
I've written a perl script to build up a urls file to crawl from RSS feeds. Will nutch handle duplicate URLs in the crawl file or would that logic need to be in my perl script? -- Cheers, Hasan Diwan [EMAIL PROTECTED]

extension point... does not exist

2006-02-13 Thread Hasan Diwan
... org/apache/nutch/protocol/Protocol.java does exist, as does org/apache/nutch/protocol/Protocol.class, jar tvf nutch-0.7.1.jar holds the class file. I could do further investigation, but would like some pointers as to where I should be looking first. Thanks! -- Cheers, Hasan Diwan [EMAIL PROTECTED] 1

Re: PDF indexing support?

2005-11-16 Thread Hasan Diwan
On Nov 15, 2005, at 2:46 PM, Håvard W. Kongsgård wrote: Don't have a conf/nutch-site.xml Create it and put the overrides in there, per the nutch tutorial. Cheers, Hasan Diwan [EMAIL PROTECTED] PGP.sig Description: This is a digitally signed message part