Re: [Fwd: [PROPOSAL] index server project]

2006-10-19 Thread Stefan Groschupf
Hi Doug, we discussed the need of such a tool several times internally and developed some workarounds for nutch, so I would be definitely interested to contribute to such a project. Having a separated project that depends on hadoop would be the best case for our usecases. Best, Stefan

Re: [PROPOSAL] index server project

2006-10-20 Thread Stefan Groschupf
Hi, The major goal is scale, right? A distributed server provides more oomph than a single-node server can. Another important goal from my point of view would be index management, like index updates during production. Stefan

Re: [PROPOSAL] index server project

2006-11-06 Thread Stefan Groschupf
Hi, do people think we are already in a stage where we can setup some basic infrastructure like mailing list and wiki and move the discussion to the new mailing list. Maybe setup a incubator project? I would be happy to help with such basic tasks. Stefan Am 31.10.2006 um 22:03 schrieb Y

Re: Lucene-based Distributed Index Leveraging Hadoop

2008-04-03 Thread Stefan Groschupf
Hi All, we are also very much interested in such a system and actually have to realize such a system for an project within the next 3 month. I would prefer to work on a open source solution instead of doing another one behind closed doors, though we would need to start coding pretty soon. W

Re: Lucene-based Distributed Index Leveraging Hadoop

2008-04-03 Thread Stefan Groschupf
Should we start from scratch or with a code contribution? Someone still want to contribute its implementation? I just noticed - to late though - Ning already contributed the code to hadoop. So I guess my question should be rephrased what is the idea of moving this into a own project?

Re: Lucene Performance and usage alternatives

2008-08-05 Thread Stefan Groschupf
An alternative is always to distribute the index to a set of servers. If you need to scale I guess this is the only long term perspective. You can do your own home grown lucene distribution or look into existing one. I'm currently working on katta (http://katta.wiki.sourceforge.net/) - there

Re: Lucene-based Distributed Index Leveraging Hadoop

2008-08-21 Thread Stefan Groschupf
ng working now, though they might have different goals and focus than dlucene or katta. Stefan Groschupf

Re: Lucene-based Distributed Index Leveraging Hadoop

2008-08-22 Thread Stefan Groschupf
Hi, In terms of which project best fits my needs my gut feeling is that dlucene is pretty close. It supports incremental updates, and doesn't build in dependencies on systems like HDFS or Terracotta (I don't yet understand all the implications of those systems so would rather keep things simple

[ANN] katta-0.1.0 release - distribute lucene indexes in a grid

2008-09-17 Thread Stefan Groschupf
After 5 month work we are happy to announce the first developer preview release of katta. This release contains all functionality to serve a large, sharded lucene index on many servers. Katta is standing on the shoulders of the giants lucene, hadoop and zookeeper. Main features: + Plays wel

[ANNOUNCE] Katta 0.5 released

2009-04-09 Thread Stefan Groschupf
(...apologies for the cross posting...) Release 0.5 of Katta is now available. Katta - Lucene in the cloud. http://katta.sourceforge.net This release fixes bugs from 0.4, including one that sorted the results wrong under load. 0.5 also upgrades to Zookeeper to version 3.1., Lucene to version

Re: [ANNOUNCE] Katta 0.5 released

2009-04-09 Thread Stefan Groschupf
Oops, just saw on the wiki that "Gradle" (never heard of it before) is the build system (former build system, I gather from the release announcement) - I'm still interested in why the switch was made, though. - Steve On 4/9/2009 at 12:22 PM, Steven A Rowe wrote: On 4/9/200

ScaleCamp: get together the night before Hadoop Summit

2009-05-13 Thread Stefan Groschupf
Hi All, We are planing a community event the night before the Hadoop Summit. This "BarCamp" (http://en.wikipedia.org/wiki/BarCamp) event will be held at the same venue as the Summit (Santa Clara Marriott). Refreshments will be served to encourage socializing. To initialize conversations for t

Re: [APACHECON] Planning

2009-06-17 Thread Stefan Groschupf
Hi Grant, sorry I lost track here, is there a list of excepted presentations somewhere? Stefan ~~~ Hadoop training and consulting http://www.scaleunlimited.com http://www.101tec.com On Jun 17, 2009, at 8:42 AM, Grant Ingersoll wrote: Note, you may not have perm

Re: [ACUS09] IMPORTANT SPEAKER CONFIRMATION MESSAGE

2009-07-19 Thread Stefan Groschupf
Sorry I'm a day late, but I can confirm I can do a 20 min Katta Intro. On Jul 16, 2009, at 12:37 AM, Michael Busch wrote: I confirm I'm coming and that I'd like to give the talk below. Alternatively we could also split the talk up into two separate talks "Lucene Basics" and "New Features in L

Re: Organizing the Lucene meetup (Was: ApacheCon US)

2009-10-19 Thread Stefan Groschupf
There is an initial schedule online at: http://wiki.apache.org/lucene-java/LuceneAtApacheConUs2009 Isabel I still plan to do the Katta introduction. Is someone officially maintain the page or should I just go ahead and remove the question mark myself? Stefan