Hi Doug,
We discussed the need for such a tool several times internally and
developed some workarounds for Nutch, so I would definitely be
interested in contributing to such a project.
Having a separate project that depends on Hadoop would be the best
case for our use cases.
Best,
Stefan
Hi,
The major goal is scale, right? A distributed server provides more
oomph than a single-node server can.
Another important goal from my point of view would be index
management, like index updates during production.
Stefan
Hi,
Do people think we are already at a stage where we can set up some
basic infrastructure, like a mailing list and wiki, and move the
discussion to the new mailing list? Maybe set up an incubator project?
I would be happy to help with such basic tasks.
Stefan
On 31.10.2006, at 22:03, Y
Hi All,
We are also very much interested in such a system and actually have to
build one for a project within the next 3 months.
I would prefer to work on an open source solution instead of doing
another one behind closed doors, though we would need to start coding
pretty soon. W
Should we start from scratch or with a code contribution?
Does anyone still want to contribute their implementation?
I just noticed, too late though, that Ning already contributed the
code to Hadoop. So I guess my question should be rephrased: what is the
idea behind moving this into its own project?
An alternative is always to distribute the index across a set of servers.
If you need to scale, I guess this is the only long-term perspective.
You can build your own home-grown Lucene distribution or look into an
existing one.
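A home-grown distribution along these lines can be sketched roughly as
follows. This is a toy illustration in Python rather than Lucene code: the
routing rule, the term-frequency scoring, and all function names
(shard_for, build_shards, search_shard, search_all) are assumptions made
for the example and do not come from Katta, dlucene, or Hadoop.

```python
import heapq
from typing import Dict, List, Tuple

Hit = Tuple[float, str]  # (score, document id)


def shard_for(doc_id: str, num_shards: int) -> int:
    """Route a document to a shard with a stable hash (hypothetical rule)."""
    return sum(doc_id.encode("utf-8")) % num_shards


def build_shards(docs: Dict[str, str], num_shards: int) -> List[Dict[str, str]]:
    """Split the documents into num_shards in-memory 'indexes'."""
    shards: List[Dict[str, str]] = [{} for _ in range(num_shards)]
    for doc_id, text in docs.items():
        shards[shard_for(doc_id, num_shards)][doc_id] = text
    return shards


def search_shard(shard: Dict[str, str], term: str) -> List[Hit]:
    """Stand-in for a per-server search: the score is the term frequency."""
    hits: List[Hit] = []
    for doc_id, text in shard.items():
        count = text.lower().split().count(term.lower())
        if count:
            hits.append((float(count), doc_id))
    return hits


def search_all(shards: List[Dict[str, str]], term: str, k: int) -> List[Hit]:
    """Query every shard, keep each shard's top k, then merge to a global top k."""
    per_shard = [heapq.nlargest(k, search_shard(s, term)) for s in shards]
    return heapq.nlargest(k, (hit for hits in per_shard for hit in hits))


if __name__ == "__main__":
    docs = {"a": "lucene lucene hadoop", "b": "lucene", "c": "hadoop"}
    shards = build_shards(docs, num_shards=2)
    print(search_all(shards, "lucene", k=2))
```

In a real deployment each shard would live on its own server and be
queried in parallel; the merge step stays the same.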
I'm currently working on Katta (http://katta.wiki.sourceforge.net/) -
there [...]ng working now, though they
might have different goals and focus than dlucene or Katta.
Stefan Groschupf
Hi,
In terms of which project best fits my needs my gut feeling is that
dlucene is pretty close. It supports incremental updates, and doesn't
build in dependencies on systems like HDFS or Terracotta (I don't yet
understand all the implications of those systems so would rather keep
things simple
After 5 months of work we are happy to announce the first developer
preview release of Katta.
This release contains all the functionality needed to serve a large,
sharded Lucene index on many servers.
Katta stands on the shoulders of the giants Lucene, Hadoop and
ZooKeeper.
Main features:
+ Plays wel
(...apologies for the cross posting...)
Release 0.5 of Katta is now available.
Katta - Lucene in the cloud.
http://katta.sourceforge.net
This release fixes bugs from 0.4, including one that sorted the
results incorrectly under load.
0.5 also upgrades Zookeeper to version 3.1., Lucene to version
Oops, just saw on the wiki that "Gradle" (never heard of it before)
is the build system (former build system, I gather from the release
announcement) - I'm still interested in why the switch was made,
though. - Steve
On 4/9/2009 at 12:22 PM, Steven A Rowe wrote:
On 4/9/200
Hi All,
We are planning a community event the night before the Hadoop Summit.
This "BarCamp" (http://en.wikipedia.org/wiki/BarCamp) event will be
held at the same venue as the Summit (Santa Clara Marriott).
Refreshments will be served to encourage socializing.
To initialize conversations for t
Hi Grant,
Sorry, I lost track here. Is there a list of accepted presentations
somewhere?
Stefan
~~~
Hadoop training and consulting
http://www.scaleunlimited.com
http://www.101tec.com
On Jun 17, 2009, at 8:42 AM, Grant Ingersoll wrote:
Note, you may not have perm
Sorry I'm a day late, but I can confirm I can do a 20 min Katta Intro.
On Jul 16, 2009, at 12:37 AM, Michael Busch wrote:
I confirm I'm coming and that I'd like to give the talk below.
Alternatively we could also split the talk up into two separate
talks "Lucene Basics" and "New Features in L
There is an initial schedule online at:
http://wiki.apache.org/lucene-java/LuceneAtApacheConUs2009
Isabel
I still plan to do the Katta introduction. Is someone officially
maintaining the page, or should I just go ahead and remove the question
mark myself?
Stefan