please help update Lucene status page

2005-08-02 Thread Doug Cutting
I started a Lucene status page for July at: http://wiki.apache.org/jakarta-lucene/Report-2005-07 Please help populate this page. It should contain news related to the Lucene top-level project (not just the Java sub-project) since Lucene became a top-level project at the beginning of this year

Re: IndexWriter and IndexReader open at the same time

2005-08-09 Thread Doug Cutting
Greg Love wrote: In the TheServerSide case study of the book, page 375, they say that they close the IndexWriter and even point out that they did before openning the IndexReader and deleting. So that kinda makes me wonder if i'm safe having an IndexReader with deletions and and IndexWriter with i

Re: Performance problem

2005-08-24 Thread Doug Cutting
WolfgangTäger wrote: While the search is carried out in a reasonably short time (about 500..800ms) we have a performance problem with actually retrieving the documents by code like: for (int i = nrhits-1; i >=0; i--){ Document hitDoc = hits.doc(i); String code=hitDoc.get("code"

Re: Lucene issue tracker moved to JIRA; LUCENE-352 not patched; cannot create lucene issue.

2005-09-13 Thread Doug Cutting
Paul Elschot wrote: I'd like to raise an issue for lucene, but the lucene is not listed as a project in the 'addressing' list for creating issues. (lucene4c is listed there.) I just fixed this. Doug

ferret: lucene in Ruby

2005-10-23 Thread Doug Cutting
I just ran accross a Ruby port of Lucene: http://ferret.davebalmain.com/trac/ Doug

Re: lucene1.4.3 src

2006-05-24 Thread Doug Cutting
The 1.4.3 src distributions had been misplaced. Sorry. I just restored them. They'll be on all of the Apache mirrors tomorrow. Doug sdamjad wrote: But these one's dont contain src files If i download them, then in src directory there is no directory for java thanks = Original Messa

Yonik Seeley joins Lucene PMC

2006-08-30 Thread Doug Cutting
The Lucene PMC has voted to add Yonik Seeley to its ranks. Thanks, Yonik, for all you've done to advance the Lucene community! Doug

Re: apachecon

2006-09-28 Thread Doug Cutting
Chris Hostetter wrote: Spamming "general" so people on other subprojects besides java-user see this... http://wiki.apache.org/apachecon/BirdsOfaFeatherUs06 I'll be there! Doug

Re: Infrastructure for large Lucene index

2006-10-06 Thread Doug Cutting
James wrote: We use Remote ParalellMultiSearcher, with some custom modifications (and we are in the process of making more) to allow most robust handling of many processes at once and integration of the responses from various sub-indexes. Can you share these modifications with others? If so, p

Re: Infrastructure for large Lucene index

2006-10-06 Thread Doug Cutting
James wrote: Let me check with the powers that be here, and then get the code into a more polished form. We hope to have it really enterprise-ready over the next couple months. Great! Once you have permission, please post it sooner rather than later, then others can help with polishing, or a

Re: Infrastructure for large Lucene index

2006-10-10 Thread Doug Cutting
Chris Hostetter wrote: : 3. If you're worried about high availability, then one fairly simple : approach is to have two parallel set of search clusters, with a load : balancer in front. For each cluster, monitor both the front-end : server (where the results get combined) and each of the back-end

[PROPOSAL] index server project

2006-10-18 Thread Doug Cutting
It seems that Nutch and Solr would benefit from a shared index serving infrastructure. Other Lucene-based projects might also benefit from this. So perhaps we should start a new project to build such a thing. This could start either in java/contrib, or as a separate sub-project, depending on

Re: [PROPOSAL] index server project

2006-10-30 Thread Doug Cutting
Yonik Seeley wrote: On 10/18/06, Doug Cutting <[EMAIL PROTECTED]> wrote: We assume that, within an index, a file with a given name is written only once. Is this necessary, and will we need the lockless patch (that avoids renaming or rewriting *any* files), or is Lucene's cu

Re: non-committers & branches in SVN

2006-11-03 Thread Doug Cutting
Erik Hatcher wrote: It's done frequently. In Lucene-land we have some committers that only have rights to certain contrib sub-directories, for example. We could easily do this for particular branches as well. We actually have a set of contributors who only have access to the entire contrib

New Technology Seeks To Let Startups Build Their Own Googles - Yahoo! News

2006-11-22 Thread Doug Cutting
Some good press for several Lucene projects! http://news.yahoo.com/s/cmp/20061122/tc_cmp/195600041 Doug

Re: Solr graduates and joins Lucene as sub-project

2007-01-17 Thread Doug Cutting
Yonik Seeley wrote: Solr has just graduated from the Incubator, and has been accepted as a Lucene sub-project! Congratulations and welcome! Doug

Re: Looking For Lucene Job Opportunities

2007-04-26 Thread Doug Cutting
Please don't use Lucene's mailing lists for job search. There are many good job search engines out there (some powered by Lucene). The sanctioned place for this is: http://wiki.apache.org/jakarta-lucene/Support Please feel free to add a link to that page. Cheers, Doug

Re: Lucene-based Distributed Index Leveraging Hadoop

2008-02-07 Thread Doug Cutting
Ning, I am also interested in starting a new project in this area. The approach I have in mind is slightly different, but hopefully we can come to some agreement and collaborate. My current thinking is that the Solr search API is the appropriate model. Solr's facets are an important featur

Re: Lucene-based Distributed Index Leveraging Hadoop

2008-02-07 Thread Doug Cutting
[ No longer cross-posting to java-dev and solr-user. ] Andrzej Bialecki wrote: A particular client should be able to provide a consistent read/write view by bonding to particular replicas of a shard. Thus a user who makes a modification should be able to generally see that modification in res

Re: documentation for V1.0.2

2008-10-18 Thread Doug Cutting
Steven A Rowe wrote: Hmm, looking closer, I see that release 1.02 *was* referred to as 1.0.2 - see Lucene's home page as of July 2002: The last Sourceforge release was called "1.01b" and was created on 6/24/2001. http://www.mail-archive.com/[EMAIL PROTECTED]/msg00136.html FYI, Sourceforge

Re: Lucene Meetup in Amsterdam?

2009-02-26 Thread Doug Cutting
FWIW, I can make Tuesday but not Monday. Doug

Re: [DISCUSS] Archive Lucy

2009-03-09 Thread Doug Cutting
Grant Ingersoll wrote: Therefore, it is with some hesitation that I suggest we mothball Lucy. When committers are inactive for a year, we ask them if they'd like to be made emeritus. Usually they either don't respond or they say yes. If they say "no" we give them the benefit of the doubt for

Re: [DISCUSS] Archive Lucy

2009-03-09 Thread Doug Cutting
Grant Ingersoll wrote: I'd _suggest_ a few other things beyond just code commits, however: +1 for all these. Finally, six months or so does sound like the right time frame. So we can consider this fair warning: if there have only been a few token commits, and there isn't activity consisten

Re: Lucene Logo Vectorgraphic

2009-12-04 Thread Doug Cutting
Marvin Humphrey wrote: http://svn.apache.org/repos/asf/lucene/site/publish/images/logo.eps Or, more simply: http://lucene.apache.org/images/logo.eps Doug

Re: Factor out a standalone, shared analysis package for Nutch/Solr/Lucene?

2010-02-24 Thread Doug Cutting
Michael McCandless wrote: I think, in order to stop duplicating our analysis code across Nutch/Solr/Lucene, we should separate out the analyzers into a standalone package, and maybe as its own sub-project under the Lucene tlp? Is the goal to release these on a separate schedule from Lucene Java

Re: Factor out a standalone, shared analysis package for Nutch/Solr/Lucene?

2010-03-01 Thread Doug Cutting
Ted Dunning wrote: Hadoop is a strange beast. The Hadoop core itself has fractured into three projects that have independent mailing lists but which share release dates. But without any releases yet. Is that "shared nothing"? The rationale for the Hadoop split was that the single codebase wa

Re: Boosting on *unique* term matches without using MUST

2010-03-02 Thread Doug Cutting
This question probably belongs on java-user@, not gene...@. That said, coord() might be what you're looking for: http://lucene.apache.org/java/3_0_1/api/core/org/apache/lucene/search/Similarity.html#coord%28int,%20int%29 Doug tavi.nathanson wrote: Hey everyone, Let me start with an example q

Re: [VOTE] merge lucene/solr development

2010-03-04 Thread Doug Cutting
Uwe Schindler wrote: One idea: If we really make solr depend on the new lucene lib, solr should not have lucene jars in its lib folder, but instead the nightly build should fetch the jars from the lucene hudson build. If Solr trunk is meant to always be based on Lucene trunk, then shouldn't th

Re: Q re merging dev MLs

2010-03-04 Thread Doug Cutting
Otis Gospodnetic wrote: Didn't we see Hadoop do exactly the opposite? Doug described it as code-base being too big, but I'd say the original list was also too high traffic (was split into common-, hdfs-, and mapreduce- I believe). The intent is that eventually HDFS and MapReduce will evolve i

Re: [VOTE] merge lucene/solr development

2010-03-04 Thread Doug Cutting
Mattmann, Chris A (388J) wrote: Doesn't this move towards having a shared code base, Yes. The desire seems to be to have a shared code base, no? and more so the criteria for that of a TLP? Thoughts? A shared codebase with a single pool of committers are canonical TLP attributes, if that's

Re: Less drastic ways

2010-03-16 Thread Doug Cutting
Grant Ingersoll wrote: As you've seen by the Board's indication, they only view that there should be a single Lucene project. One committership, one project. Or, two committerships, two projects. So, if the existing committer structure were acceptable, then Solr would be split to a separate

Re: [PMC] [DISCUSS] Lucy

2010-06-15 Thread Doug Cutting
On 06/14/2010 05:16 PM, Chris Hostetter wrote: I'm having a hard time wrapping my head arround the idea of Lucy moving from the Lucene TLP to the Incubator TLP. It probably would have made sense for Lucy start in the Incubator years ago, but I'm not really sure what value that would add now give

Re: [PMC] [DISCUSS] Lucy

2010-06-15 Thread Doug Cutting
On 06/15/2010 10:48 AM, Mark Miller wrote: Sounds like Doug is saying we should get a rough consensus from the PMC on whether we want to put more effort into Lucy - not just abide its existence as seems to have been the case. Yes, but, to be clear, I don't think it's a huge effort, no more effo

Re: Number of Boolean Clauses (AND vs OR)

2011-04-11 Thread Doug Cutting
On 04/11/2011 01:25 PM, entdeveloper wrote: > Thanks Otis. And by your answer, does this mean that individual clauses in a > boolean query are executed sequentially? not in parallel? Clauses are executed in parallel. The execution of a conjunction is able to efficiently skip occurrences in ranges

Re: Number of Boolean Clauses (AND vs OR)

2011-04-12 Thread Doug Cutting
On 04/12/2011 08:53 AM, Yang wrote: > when you say "executed" in parallel, could you please elaborate more > on what "execute" refers to? I mean that all matches for a query clause are not generally enumerated before any results of its containing query are enumerated. There's generally a single t