Hi,

I'd like to see your presentation, but that server is down.

Otis


--- Chris Mattmann <[EMAIL PROTECTED]> wrote:

> Hi there Jay,
> 
>  Here are some numbers that a colleague and I presented in my graduate
> computer science seminar class on search engines in the Spring '05
> semester at USC. The numbers measure the efficiency and scalability of
> several of the plugin content extractors for Nutch (PDF, WORD, RSS,
> etc.). The tests were performed on a Red Hat Linux 7.3 box with 1.3 GB
> RAM, a 10 GB HD, and a Pentium III 500 MHz processor.
> 
>  The presentation is geared towards the parse-rss plugin that I wrote,
> although it should give you an idea of the other content extractors
> too.
> 
> Hope it helps. Here's the link to the presentation:
> 
> http://baron.pagemewhen.com:8080/~chris/RSS-Nutch-Eval.ppt
> 
> Cheers,
>   Chris
> 
> 
> ______________________________________________
> Chris A. Mattmann
> [EMAIL PROTECTED] 
> Staff Member
> Modeling and Data Management Systems Section (387)
> Data Management Systems and Technologies Group
> 
> _________________________________________________
> Jet Propulsion Laboratory               Pasadena, CA
> Office: 171-266B                        Mailstop:  171-246
> _______________________________________________________
> 
> Disclaimer:  The opinions presented within are my own and do not
> reflect those of either NASA, JPL, or the California Institute of
> Technology.
> 
> > -----Original Message-----
> > From: webmaster [mailto:[EMAIL PROTECTED]
> > Sent: Wednesday, July 20, 2005 8:02 PM
> > To: [email protected]
> > Subject: benchmarking
> > 
> > Hey, could some of you post your speeds (sorting, indexing, pages a
> > sec/documents a sec) and system specs? I'm trying to compile a
> > database of which of Nutch's functions are better suited to run on
> > what hardware. Also, if any of you have a Sun box, could you post its
> > specs and some of the info for database sorting speeds and indexing
> > speed, anything that uses full CPU?
> > What's everyone's pages-a-sec top score? E-mail me @
> > [EMAIL PROTECTED]
> > I'll post a webpage with the results.
> > Thanks,
> > -Jay Pound
> 
> 
> 
> _______________________________________________
> Nutch-general mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/nutch-general
> 
