I think there is probably a sub text here (I'm putting words in Otis' mouth,
for which my apologies).

' Yes, you could rewrite Nutch in C++ and have that use CLucene.'  But you'd
be mad to do so!

I'm a bit out of date with Nutch, but it's large.  And Java to C++ is not an
easy conversion because of the different memory management systems.

And why?  I guess you may see some performance improvement, but it would be
a LOT cheaper to throw hardware at the problem (and you may not see much if
any).

So if you have a few months to spare ....


Iain

-----Original Message-----
From: Otis Gospodnetic [mailto:[email protected]] 
Sent: 04 August 2009 04:49
To: [email protected]
Subject: Re: Nutch in C++

CLucene is just like Lucene (except a few versions behind), but written in
C++.

Yes, you could rewrite Nutch in C++ and have that use CLucene.

Otis
--
Sematext is hiring -- http://sematext.com/about/jobs.html?mls
Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR



----- Original Message ----
> From: "[email protected]" <[email protected]>
> To: [email protected]
> Sent: Monday, August 3, 2009 2:29:40 PM
> Subject: Re: Nutch in C++
> 
> 
> 
> 
> 
> Hi,
> 
> I know nutch uses Lucene. But for what is Clucene then? Only for indexing
files 
> in a hard drive?
> 
> 
> I have knowledge of C++ and some experience. I wanted to code crawler of
Nutch 
> in C++ to get more experience and make it open source, only if it l be
useful 
> for the open source community.
> My goal is to get more experience in C++ and make? contribution to open
source. 
> If you know other projects that may be more useful, please let me know.
> 
> thanks.
> Alex.
> 
> 
> -----Original Message-----
> From: Otis Gospodnetic 
> To: [email protected]
> Sent: Sun, Aug 2, 2009 8:15 pm
> Subject: Re: Nutch in C++
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> Nutch uses Lucene (Java), not CLucene (C++).
> 
> Why are you looking to rewrite Nutch in C++ anyway?  Sounds scary.
> 
> Otis
> --
> Sematext is hiring -- http://sematext.com/about/jobs.html?mls
> Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR
> 
> 
> 
> ----- Original Message ----
> > From: "[email protected]" 
> > To: [email protected]
> > Sent: Thursday, July 30, 2009 3:13:16 PM
> > Subject: Nutch in C++
> > 
> > Hi,
> > 
> > As I understood only indexing part of nutch is in C++ as clucene.? I
want to 
> > code? nutch in C++, only in case if it is worth doing that.? I wondered
if is 
> > worth coding the remaining parts of nutch in C++, let say the crawler.
Can 
> > someone give me directions on what to start.
> > 
> > Thanks
> > Alex.

Reply via email to