Re: GSoC

2011-03-08 Thread Simon Willnauer
, David Nemeskey nemeskey.da...@sztaki.hu wrote: Please find the implementation plan attached. The word soon gets a new meaning when power outages are taken into account. :) As before, comments are welcome. David On Tuesday, February 22, 2011 15:22:57 Simon Willnauer wrote: I think that is good

Re: GSoC

2011-03-09 Thread Simon Willnauer
the Lucene ones. done on at least one ;) simon Also, feel free to label existing bugs. On Mar 9, 2011, at 2:11 AM, Simon Willnauer wrote: Hey David and all others who want to contribute to GSoC, the ASF has applied for GSoC 2011 as a mentoring organization. As a ASF project we don't need

Re: GSoC

2011-03-10 Thread Simon Willnauer
:53 Simon Willnauer wrote: On Wed, Mar 9, 2011 at 5:48 PM, Grant Ingersoll gsing...@apache.org wrote: I think we, Lucene committers, need to identify who is willing to mentor.    In my experience, it is less than 5 hours a week.  Most of the work is done as part of the community.  Sometimes

[GSoC] Apache Lucene @ Google Summer of Code 2011 [STUDENTS READ THIS]

2011-03-11 Thread Simon Willnauer
Hey folks, Google Summer of Code 2011 is very close and the Project Applications Period has started recently. Now it's time to get some excited students on board for this year's GSoC. I encourage students to submit an application to the Google Summer of Code web-application. Lucene Solr are

Re: I want to take part in Google Summer Code 2011

2011-03-11 Thread Simon Willnauer
Hey there, On Fri, Mar 11, 2011 at 10:02 PM, anurag.it.jo...@gmail.com wrote: I know Lucene , Solr and Nutch . I am also involved in such a project. Please guide me through any obstructions This is great! Did you read the GSoC WikiPage here:

Re: GPU acceleration

2011-03-12 Thread Simon Willnauer
On Sat, Mar 12, 2011 at 9:21 PM, Ken O'Brien k...@kenobrien.org wrote: Hi, Is anyone looking at GPU acceleration for Solr? If not, I'd like to contribute code which adds this functionality. As I'm not familiar with the codebase, does anyone know which areas of functionality could benefit

Re: Participating in GSoC'11 with Lucene

2011-03-12 Thread Simon Willnauer
Hey, On Sat, Mar 12, 2011 at 5:32 PM, Zhijie Shen zjshe...@gmail.com wrote: Hi developers, I'm a graduate student from National University of Singapore, majoring in Computer Science. The enthusiasm of open source and information retrieval drives me to participate in GSoC'11 with your

Re: Participating in GSoC'11 with Lucene

2011-03-13 Thread Simon Willnauer
On Sun, Mar 13, 2011 at 12:11 AM, Michael McCandless luc...@mikemccandless.com wrote: Simon these are great summaries -- can you post them on the issues too?   Thanks! done! simon On Sat, Mar 12, 2011 at 4:35 PM, Simon Willnauer simon.willna...@googlemail.com wrote: Hey, On Sat, Mar 12

Lucene Solr a one way street?

2011-03-13 Thread Simon Willnauer
Hey folks, I have recently tried to push some refactorings towards moving stuff from Solr to modules land to enable users of Lucene to benefit from the developments that have been made in Solr land during the past with very little success. Actually, it was a really disappointing experience

Lucene GSoC Mentors

2011-03-14 Thread Simon Willnauer
Hey folks, if you are planning to become a mentor in 2011s GSoC you should likely read this http://community.apache.org/guide-to-being-a-mentor.html as well as this: http://community.apache.org/mentee-ranking-process.html for now Mike and myself have signed up for mentoring, anybody else?

Re: Nested Document support in Lucene

2011-03-19 Thread Simon Willnauer
On Sat, Mar 19, 2011 at 9:39 AM, Kapil Charania lonestar.ka...@gmail.com wrote: Hi, I am a newbie to Lucene. I have already created indexes for my project. But now requirement is to go with Nested Document. I googled a lot but can not find much implementation of nested documents. My I know

HEADS UP - Rebuild any trunk indices

2011-03-22 Thread Simon Willnauer
Heads up -- LUCENE-2881, which I committed earlier today, changes the SegmentInfos file format and adds a new ${number}.fnx file storing global field numbers. If you have any indexes lying around built with revs of trunk before this commit, after you update you should completely reindex. Indexes

Re: [GSoC] Apache Lucene @ Google Summer of Code 2011 [STUDENTS READ THIS]

2011-03-23 Thread Simon Willnauer
it works. We will later rate the proposals from the GSoC website and decide which we choose. This is also when slots get assigned. simon Thanks, David On 2011 March 11, Friday 17:23:58 Simon Willnauer wrote: Hey folks, Google Summer of Code 2011 is very close and the Project Applications

BerlinBuzzwords 2011 Early Bird Ticket Period ends on April 7th.

2011-03-24 Thread Simon Willnauer
Hey folks, just a short notice for those who haven't noticed we have only a limited amount of Early-Bird tickets left and the Early-Bird period is ends on April 7th. If you want to get one of the 30 remaining tickets go and get one now here: http://berlinbuzzwords.de/content/tickets While we are

Re: boosting with standard search handler

2011-03-24 Thread Simon Willnauer
please do not cross post to dev list unless its a dev related question. simon On Thu, Mar 24, 2011 at 10:13 AM, Gastone Penzo gastone.pe...@gmail.com wrote: Hi, is possibile to boost fields like bf parameter of dismax in standard request handler? with or without funcions? thanx --

Re: Interested in GSOC

2011-03-25 Thread Simon Willnauer
Hey there, welcome to Lucene :), good to hear you are interested in Lucene and GSoC! On Fri, Mar 25, 2011 at 4:49 AM, Vinicius Paes de barros viniciuspaesdebar...@yahoo.com.br wrote: Hi there, I heard about GSOC from a friend of mine at college and I decide I want to participate this year.

Re: [VOTE] Lucene 3.1.0 RC3

2011-03-29 Thread Simon Willnauer
+1 I ran the tests checked signatures. Looks good to me. On Tue, Mar 29, 2011 at 9:27 PM, Dawid Weiss dawid.we...@cs.put.poznan.pl wrote: +1 Checked the clustering stuff again, works fine. Dawid On Tue, Mar 29, 2011 at 8:30 PM, Chris Hostetter hossman_luc...@fucit.org wrote: :

Re: [HUDSON] Lucene-Solr-tests-only-trunk - Build # 6565 - Failure

2011-03-31 Thread Simon Willnauer
This on is weird seems like there is a synchronized missing on FieldInfoBiMap#containsConsistent I try to reproduce first. simon On Thu, Mar 31, 2011 at 11:37 AM, Apache Hudson Server hud...@hudson.apache.org wrote: Build: https://hudson.apache.org/hudson/job/Lucene-Solr-tests-only-trunk/6565/

Re: [HUDSON] Lucene-Solr-tests-only-trunk - Build # 6565 - Failure

2011-03-31 Thread Simon Willnauer
I just committed a fix for this simon On Thu, Mar 31, 2011 at 5:28 PM, Simon Willnauer simon.willna...@googlemail.com wrote: This on is weird seems like there is a synchronized missing on FieldInfoBiMap#containsConsistent I try to reproduce first. simon On Thu, Mar 31, 2011 at 11:37 AM

Re: My GSOC proposal

2011-04-06 Thread Simon Willnauer
searching does not hurt search performance?  (Ie we should be able to reproduce the results from http://blog.mikemccandless.com/2010/06/lucene-and-fadvisemadvise.html). Thanks for the summary mike! I have spoken to Micheal McCandless and Simon Willnauer about undertaking these tasks. Micheal

Re: Google Summer Code 2011 participation

2011-04-08 Thread Simon Willnauer
On Fri, Apr 8, 2011 at 12:11 PM, Michael McCandless luc...@mikemccandless.com wrote: Anyone can participate in Lucene/Solr!  You don't need to be GSoC student to do so... Browse the issues in Jira (maybe focusing on the ones marked for GSoC and not already taken), or open your own issues,

TestIndexWriterDelete#testUpdatesOnDiskFull can false fail

2011-04-13 Thread Simon Willnauer
In TestIndexWriterDelete#testUpdatesOnDiskFull especially between line 538 and 553 we could get a random exception from the MockDirectoryWrapper which makes the test fail since we are not catching / expecting those exceptions. I can make this fail on trunk even in 1000 runs but on realtime it

Re: TestIndexWriterDelete#testUpdatesOnDiskFull can false fail

2011-04-14 Thread Simon Willnauer
just committed to trunk simon On Wed, Apr 13, 2011 at 5:06 PM, Michael McCandless luc...@mikemccandless.com wrote: +1 Mike http://blog.mikemccandless.com On Wed, Apr 13, 2011 at 5:58 AM, Simon Willnauer simon.willna...@googlemail.com wrote: In TestIndexWriterDelete#testUpdatesOnDiskFull

Re: [HUDSON] Lucene-Solr-tests-only-realtime_search-branch - Build # 2 - Still Failing

2011-04-14 Thread Simon Willnauer
I just committed a fix for this On Thu, Apr 14, 2011 at 4:47 PM, Apache Hudson Server hud...@hudson.apache.org wrote: Build: https://hudson.apache.org/hudson/job/Lucene-Solr-tests-only-realtime_search-branch/2/ 1 tests failed. REGRESSION:  

Re: Setting the max number of merge threads across IndexWriters

2011-04-14 Thread Simon Willnauer
On Thu, Apr 14, 2011 at 5:20 PM, Jason Rutherglen jason.rutherg...@gmail.com wrote: Today the ConcurrentMergeScheduler allows setting the max thread count and is bound to a single IndexWriter. However in the [common] case of multiple IndexWriters running in the same process, this disallows

Re: Setting the max number of merge threads across IndexWriters

2011-04-14 Thread Simon Willnauer
that into each IW's CMS? On Thu, Apr 14, 2011 at 8:25 AM, Simon Willnauer simon.willna...@googlemail.com wrote: On Thu, Apr 14, 2011 at 5:20 PM, Jason Rutherglen jason.rutherg...@gmail.com wrote: Today the ConcurrentMergeScheduler allows setting the max thread count and is bound to a single

Re: Lucene Merge failing on Open Files

2011-04-14 Thread Simon Willnauer
On Wed, Apr 6, 2011 at 8:44 PM, Grant Ingersoll gsing...@apache.org wrote: Begin forwarded message: From: Michael McCandless luc...@mikemccandless.com Date: April 5, 2011 5:46:13 AM EDT To: simon.willna...@gmail.com Cc: Simon Willnauer simon.willna...@googlemail.com, java-u

Re: [HUDSON] Lucene-Solr-tests-only-trunk - Build # 7260 - Still Failing

2011-04-19 Thread Simon Willnauer
uwe go ahead and disable it... :( any idea when we get more HW for our tests? Its a shame that we don't have enough HW to run our continuous tests all the time as we like simon On Tue, Apr 19, 2011 at 9:01 AM, Uwe Schindler u...@thetaphi.de wrote: Disk full again, deleting Maybe we

Re: [HUDSON] Lucene-trunk - Build # 1537 - Failure

2011-04-22 Thread Simon Willnauer
hey folks, this seems to happen more often in the near past... can we raise the limit for jenkins so we don't get these failures again. Once we land DWPT this is likely to happen even more often since we are writing more / smaller segments with DWPT. Uwe do you have karma to fix that? simon On

Re: [HUDSON] Lucene-trunk - Build # 1537 - Failure

2011-04-22 Thread Simon Willnauer
://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Simon Willnauer [mailto:simon.willna...@googlemail.com] Sent: Friday, April 22, 2011 9:55 AM To: dev@lucene.apache.org Subject: Re: [HUDSON] Lucene-trunk - Build # 1537 - Failure hey folks, this seems to happen more often

Re: Lucene Jenkins slave out of disk

2011-04-23 Thread Simon Willnauer
On Fri, Apr 22, 2011 at 5:13 PM, Robert Muir rcm...@gmail.com wrote: On Fri, Apr 22, 2011 at 9:13 AM, Uwe Schindler u...@thetaphi.de wrote: Hi Robert, Thanks for pointing to that issue. Indeed the leftover test files in Lucene take approx. 3 GB per build. With our 9 builds that’s 30 GB -

Re: [HUDSON] Lucene-trunk - Build # 1537 - Failure

2011-04-23 Thread Simon Willnauer
On Fri, Apr 22, 2011 at 2:44 PM, Robert Muir rcm...@gmail.com wrote: On Fri, Apr 22, 2011 at 8:41 AM, Thomas Matthijs thomas.matth...@actonomy.com wrote: On Fri, Apr 22, 2011 at 14:15, Uwe Schindler u...@thetaphi.de wrote: Hi Simon, I had no success to change anything. As root I can at

Re: Lucene Jenkins slave out of disk

2011-04-23 Thread Simon Willnauer
On Sat, Apr 23, 2011 at 10:11 AM, Uwe Schindler u...@thetaphi.de wrote: Hi, On Sat, Apr 23, 2011 at 9:47 AM, Uwe Schindler u...@thetaphi.de wrote: Hi, Can you also check that all new tests in realtime use the new _TestUtils API for getting an index dir? That would be nice.

Re: Lucene Jenkins slave out of disk

2011-04-24 Thread Simon Willnauer
On Sun, Apr 24, 2011 at 9:53 AM, Uwe Schindler u...@thetaphi.de wrote: Hi, JUHU! - Thanks. Now the test folder(s) (build/test, build/backwards/test) after running the builds only contain the test results and some empty dirs. Simon, if you merge that one we should be fine now! not until

Re: Lucene Jenkins slave out of disk

2011-04-24 Thread Simon Willnauer
On Sun, Apr 24, 2011 at 12:21 PM, Uwe Schindler u...@thetaphi.de wrote: I am merging realtime up to trunk now... thanks uwe! simon Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Simon

bug in LuceneTestCase#TEST_MIN_ITER

2011-04-26 Thread Simon Willnauer
hey I wonder how this TEST_MIN_ITER feature works though... I expect that if I set -Dtests.iter.min=1 -Dtests.iter=10 and I fail in any of those iterations that the the runner stops immediately and prints a failure. Is that correct? if so I don't understand this code: if (testsFailed) {

Re: bug in LuceneTestCase#TEST_MIN_ITER

2011-04-27 Thread Simon Willnauer
Fixed the behavior in Revision: 1097097 simon On Tue, Apr 26, 2011 at 6:14 PM, Shai Erera ser...@gmail.com wrote: I think you're right Simon ! Obviously I didn't test it with that scenario in mind :). Shai On Tue, Apr 26, 2011 at 6:15 PM, Simon Willnauer simon.willna...@googlemail.com

Code Freeze on realtime_search branch

2011-04-29 Thread Simon Willnauer
(boolean, boolean) doesn't synchronized on IndexWriter anymore. A dedicated flushLock has been introduced to prevent multiple full- flushes happening concurrently. - DocumentsWriter doesn't write shared doc stores anymore. (Mike McCandless, Michael Busch, Simon Willnauer

Re: Link to nightly build test reports on main Lucene site needs updating

2011-04-30 Thread Simon Willnauer
thanks tom, I cced dev@l.a.o simon On Fri, Apr 29, 2011 at 11:14 PM, Burton-West, Tom tburt...@umich.edu wrote: Hello, I went to look at the Hudson nightly builds and tried to follow the link from the main Lucene page http://lucene.apache.org/java/docs/developer-resources.html#Nightly

Re: Link to nightly build test reports on main Lucene site needs updating

2011-05-01 Thread Simon Willnauer
as most users would prefer that one to fix latest bugs but don’t want to have a backwards-incompatible version. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Simon Willnauer [mailto:simon.willna

33 Days left to Berlin Buzzwords 2011

2011-05-04 Thread Simon Willnauer
hey folks, BerlinBuzzwords 2011 is close only 33 days left until the big Search, Store and Scale opensource crowd is gathering in Berlin on June 6th/7th. The conference again focuses on the topics search, data analysis and NoSQL. It is to take place on June 6/7th 2011 in Berlin. We are looking

Re: modularization discussion

2011-05-04 Thread Simon Willnauer
On Wed, May 4, 2011 at 3:49 PM, Mark Miller markrmil...@gmail.com wrote: On May 4, 2011, at 9:42 AM, Uwe Schindler wrote: Solr has no performance testing framework, see the issue from today (SOLR-2493). Come to Berlin Buzzwords! I think I will come :) simon (I know you already are :) )

Re: modularization discussion

2011-05-05 Thread Simon Willnauer
Hey folks On Tue, May 3, 2011 at 6:49 PM, Michael McCandless luc...@mikemccandless.com wrote: Isn't our end goal here a bunch of well factored search modules?  Ie, fast forward a year or two and I think we should have modules like these: I think we have two camps here (10k feet view): 1.

Re: [JENKINS] Lucene-Solr-tests-only-docvalues-branch - Build # 1064 - Failure

2011-05-05 Thread Simon Willnauer
I removed the @Override annotation on that file! simon On Thu, May 5, 2011 at 11:03 AM, Apache Jenkins Server hud...@hudson.apache.org wrote: Build: https://builds.apache.org/hudson/job/Lucene-Solr-tests-only-docvalues-branch/1064/ No tests ran. Build Log (for compile errors):

Re: modularization discussion

2011-05-05 Thread Simon Willnauer
On Thu, May 5, 2011 at 4:41 PM, Mark Miller markrmil...@gmail.com wrote: On May 5, 2011, at 10:25 AM, Grant Ingersoll wrote: 3.  Those who think most should be modularized, but realize it's a ton of work for an unproven gain (although most admit it is a highly likely gain) and should be

Re: [JENKINS] Lucene-Solr-tests-only-trunk - Build # 7757 - Failure

2011-05-05 Thread Simon Willnauer
the actual exception we are tripping here is java.lang.RuntimeException: java.lang.AssertionError [junit] at org.apache.lucene.index.TestFlushByRamOrCountsPolicy$IndexThread.run(TestFlushByRamOrCountsPolicy.java:328) [junit] Caused by: java.lang.AssertionError [junit] at

Re: [JENKINS] Lucene-Solr-tests-only-trunk - Build # 7757 - Failure

2011-05-06 Thread Simon Willnauer
I committed a fix for this in revision 1100103 simon On Thu, May 5, 2011 at 10:24 PM, Simon Willnauer simon.willna...@googlemail.com wrote: the actual exception we are tripping here is  java.lang.RuntimeException: java.lang.AssertionError    [junit

Re: [JENKINS] Lucene-Solr-tests-only-3.x - Build # 7777 - Still Failing

2011-05-06 Thread Simon Willnauer
merged missing file in and committed revision 1100131. simon On Fri, May 6, 2011 at 10:40 AM, Simon Willnauer simon.willna...@googlemail.com wrote: seems like there is a file missing - I am working on it simon On Fri, May 6, 2011 at 9:40 AM, Apache Jenkins Server hud...@hudson.apache.org

Re: modularization discussion

2011-05-07 Thread Simon Willnauer
think this is also what the board report should contain but I will reply to this separately. simon Mike http://blog.mikemccandless.com On Thu, May 5, 2011 at 10:25 AM, Grant Ingersoll gsing...@apache.org wrote: On May 5, 2011, at 4:15 AM, Simon Willnauer wrote: Hey folks On Tue, May 3

Re: modularization discussion

2011-05-07 Thread Simon Willnauer
On Sat, May 7, 2011 at 1:02 PM, Michael McCandless luc...@mikemccandless.com wrote: OK I opened:    https://issues.apache.org/jira/browse/LUCENE-3079 awesome! +1 Mike http://blog.mikemccandless.com On Sat, May 7, 2011 at 6:46 AM, Michael McCandless luc...@mikemccandless.com wrote: I

Re: [JENKINS] Lucene-Solr-tests-only-trunk - Build # 7924 - Failure

2011-05-10 Thread Simon Willnauer
On Tue, May 10, 2011 at 8:02 PM, Michael McCandless luc...@mikemccandless.com wrote: I committed fix... false failure tickled by the cool new sneaky throttling MockDirWrapper now does! YAY! :) simon Mike http://blog.mikemccandless.com On Tue, May 10, 2011 at 1:57 PM, Apache Jenkins

Re: a optimizable point about solr

2011-05-12 Thread Simon Willnauer
Hey good catch :) we already addressed this issue AFAIK here: https://issues.apache.org/jira/browse/SOLR-2493 so the question is if we need to do another release since it seems serious. simon 2011/5/12 shuigen kang kangsg...@gmail.com: Hi all:       I was recently used solr to set up a

Re: Lucene PMC Welcomes 3 New Members

2011-05-12 Thread Simon Willnauer
On Thu, May 12, 2011 at 12:24 PM, Michael McCandless luc...@mikemccandless.com wrote: Welcome aboard!! +1 Mike http://blog.mikemccandless.com On Wed, May 11, 2011 at 10:08 PM, Grant Ingersoll gsing...@apache.org wrote: The Lucene PMC would like to announce the addition of Steve Rowe, Shai

Re: [JENKINS] Lucene-trunk - Build # 1559 - Still Failing

2011-05-12 Thread Simon Willnauer
I found the issue - created LUCENE-3090 for it... I am on it. On Thu, May 12, 2011 at 3:08 PM, Michael McCandless luc...@mikemccandless.com wrote: I dug some into this heap dump... Somehow we have a DWPT using WAY too much RAM (301 MB).  A 2nd DWPT has 105 MB tied up.  I'm getting these

Re: 3.2.0 (or 3.1.1)

2011-05-16 Thread Simon Willnauer
+1 for pushing 3.2!! There have been discussions about porting DWPT to 3.x but I think its a little premature now and I am still not sure if we should do it at all. The refactoring is pretty intense throughout all IndexWriter and it integrates with Flex / Codecs. I am not saying its impossible,

Re: 3.2.0 (or 3.1.1)

2011-05-16 Thread Simon Willnauer
On Mon, May 16, 2011 at 1:30 PM, Robert Muir rcm...@gmail.com wrote: On Mon, May 16, 2011 at 7:10 AM, Simon Willnauer simon.willna...@googlemail.com wrote: the question is if we should backport stuff like LUCENE-2881 to 3.2 or if we should hold off until 3.3, should we do it at all? I think

Moving towards Lucene 4.0

2011-05-16 Thread Simon Willnauer
Hey folks, we just started the discussion about Lucene 3.2 and releasing more often. Yet, I think we should also start planning for Lucene 4.0 soon. We have tons of stuff in trunk that people want to have and we can't just keep on talking about it - we need to push this out to our users. From my

Re: svn commit: r1103709 - in /lucene/java/site: docs/whoweare.html docs/whoweare.pdf src/documentation/content/xdocs/whoweare.xml

2011-05-16 Thread Simon Willnauer
stanislav you are a full committer afaik?! simon On Mon, May 16, 2011 at 2:11 PM, stanis...@apache.org wrote: Author: stanislaw Date: Mon May 16 12:11:57 2011 New Revision: 1103709 URL: http://svn.apache.org/viewvc?rev=1103709view=rev Log: Adding myself (Stanislaw Osinski) to the contrib

Re: svn commit: r1103711 - in /lucene/dev/trunk/lucene/src: java/org/apache/lucene/util/automaton/MinimizationOperations.java test/org/apache/lucene/util/automaton/TestMinimize.java

2011-05-16 Thread Simon Willnauer
On Mon, May 16, 2011 at 2:15 PM, uschind...@apache.org wrote: Author: uschindler Date: Mon May 16 12:15:45 2011 New Revision: 1103711 URL: http://svn.apache.org/viewvc?rev=1103711view=rev Log: LUCENE-3101: Fix n^2 memory usage in minimizeSchindler() ähm minimizeHopcroft() LOL ^ ^

Re: Moving towards Lucene 4.0

2011-05-16 Thread Simon Willnauer
On Mon, May 16, 2011 at 2:57 PM, Robert Muir rcm...@gmail.com wrote: On Mon, May 16, 2011 at 8:48 AM, Uwe Schindler u...@thetaphi.de wrote: Sorry to be negative, - BulkPostings (my +1 since I want to enable positional scoring on all queries) My problem is the really crappy and unusable API

Re: Moving towards Lucene 4.0

2011-05-17 Thread Simon Willnauer
: On Mon, May 16, 2011 at 9:12 AM, Simon Willnauer simon.willna...@googlemail.com wrote: I have to admit that branch is very rough and the API is super hard to use. For now! Lets not be dragged away into discussion how this API should look like there will be time for that. +1, this is what

Solr ByteUtils

2011-05-18 Thread Simon Willnauer
Hey there, I just ran into org.apache.solr.util.ByteUtils which seems pretty much like a duplication of UnicodeUtils in Lucene. I think we should get rid of it and merge what needs to be merged into UnicodeUtils. This utils class is really just doing unicode stuff. simon

Re: Lucene/Solr JIRA

2011-05-18 Thread Simon Willnauer
On Tue, May 17, 2011 at 9:23 PM, Steven A Rowe sar...@syr.edu wrote: On 5/17/2011 at 3:02 PM, Chris Hostetter wrote: If we were starting from scratch, i'd agree with you that having a single Jira project makes more sense, but given where we are today, i think we should probably keep them

Re: Lucene/Solr JIRA

2011-05-19 Thread Simon Willnauer
On Wed, May 18, 2011 at 10:53 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : just a few words. I disagree here with you hoss IMO the suggestion to : merge JIRA would help to move us closer together and help close the : gap between Solr and Lucene. I think we need to start identifying us

Re: Moving towards Lucene 4.0

2011-05-20 Thread Simon Willnauer
On Thu, May 19, 2011 at 7:44 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : I think we should focus on everything that's *infrastructure* in 4.0, so : that we can develop additional features in subsequent 4.x releases. If we : end up releasing 4.0 just to discover many things will need

Welcome Chris Male Andi Vajda as full Solr / Lucene Committers

2011-05-23 Thread Simon Willnauer
Hi folks, I am happy to announce that the Lucene PMC has accepted Chris Male and Andi Vajda as Lucene/Solr committers. Congratulations Welcome on board, Chris Andi!! Simon - To unsubscribe, e-mail:

Re: [JENKINS] Lucene-Solr-tests-only-trunk - Build # 8380 - Failure

2011-05-25 Thread Simon Willnauer
that assert is broken... I am working on fixing this... simon On Thu, May 26, 2011 at 1:25 AM, Apache Jenkins Server hud...@hudson.apache.org wrote: Build: https://builds.apache.org/hudson/job/Lucene-Solr-tests-only-trunk/8380/ 45 tests failed. REGRESSION:  

Re: [JENKINS] Lucene-Solr-tests-only-trunk - Build # 8380 - Failure

2011-05-25 Thread Simon Willnauer
just committed a fix for this simon On Thu, May 26, 2011 at 2:32 AM, Simon Willnauer simon.willna...@googlemail.com wrote: that assert is broken... I am working on fixing this... simon On Thu, May 26, 2011 at 1:25 AM, Apache Jenkins Server hud...@hudson.apache.org wrote: Build: https

Re: [VOTE] Release Lucene/Solr 3.2.0

2011-05-27 Thread Simon Willnauer
I think robert is right here. We want to do more frequent releases and to go that path we need to stop waiting for a week for feature / improvement X. We can spin another release in 4 weeks I think we should actually. If we do that and increment the version number by 1 each time we reach 3.9 by

Re: [VOTE] Release Lucene/Solr 3.2.0

2011-05-29 Thread Simon Willnauer
Hi David, On Sat, May 28, 2011 at 11:14 PM, David Smiley (@MITRE.org) dsmi...@mitre.org wrote: You're right; it shouldn't be shoved in at the last second -- I didn't mean to imply that. It should be committed and then we'll give it a comfortable amount of time.  When that time is up, and if

Re: [JENKINS] Lucene-Solr-tests-only-trunk - Build # 8497 - Failure

2011-05-29 Thread Simon Willnauer
it seems like there is a race in IW#closeInternal() that allows a incoming thread to index a document into a DWPT that was empty (0 docs) when the supposed to be last flush kicked in. I will commit a fix for it in a minute. I added an assert which I tripped immediately while I wasn't able to

Re: [JENKINS] Lucene-Solr-tests-only-docvalues-branch - Build # 1202 - Still Failing

2011-05-29 Thread Simon Willnauer
I committed a fix for those javadoc links simon On Sun, May 29, 2011 at 6:56 PM, Apache Jenkins Server hud...@hudson.apache.org wrote: Build: https://builds.apache.org/hudson/job/Lucene-Solr-tests-only-docvalues-branch/1202/ All tests passed Build Log (for compile errors): [...truncated

Re: [VOTE] Release Lucene/Solr 3.2.0

2011-05-31 Thread Simon Willnauer
sorry guys I won't have time to look at this candiates due to buzzwords. Thanks for all the work. simon On Mon, May 30, 2011 at 7:01 PM, Grant Ingersoll gsing...@apache.org wrote: On May 28, 2011, at 6:10 PM, Robert Muir wrote: On Sat, May 28, 2011 at 5:09 PM, Grant Ingersoll

Re: Welcome Martijn van Groningen as Lucene/Solr committer

2011-06-01 Thread Simon Willnauer
Welcome! :) On Wed, Jun 1, 2011 at 9:35 PM, Dawid Weiss dawid.we...@cs.put.poznan.pl wrote: Welcome, Martijn! Dawid On Wed, Jun 1, 2011 at 9:29 PM, Robert Muir rcm...@gmail.com wrote: Welcome Martijn! On Wed, Jun 1, 2011 at 3:01 PM, Michael McCandless luc...@mikemccandless.com wrote:

Re: Welcome Erick Erickson as Lucene/Solr committer

2011-06-01 Thread Simon Willnauer
Welcome! simon On Wed, Jun 1, 2011 at 8:16 PM, Erik Hatcher erik.hatc...@gmail.com wrote: Yay!   Welcome, Erick. On Jun 1, 2011, at 16:07 , Robert Muir wrote: I'm pleased to announce that the Lucene PMC has voted for Erick Erickson as a committer. Erick, its tradition that you introduce

Re: Storing and loading the FST directly from disk

2011-06-03 Thread Simon Willnauer
I agree with Robert and Dawid that once you go past the page fault border you will loose performance. The problem here is that you don't realize it immediately. IMO the byte[] representation is so compact that it doesn't really matter if you use FS Cache memory or JVM memory so I'd rather go for

Re: tentative! release notes drafts

2011-06-03 Thread Simon Willnauer
On Fri, Jun 3, 2011 at 5:00 PM, johnmu...@aol.com wrote: So, in my case, upgrading from Solr 1.2 to 3.2, I must re-index. OK, I got that, thanks. Btw, where can I learn more about the new IndexUpgrader tool?  Is there a doc/wiki for it? you can find it in trunk:

Travel Assistance applications now open for ApacheCon NA 2011

2011-06-06 Thread Simon Willnauer
The Apache Software Foundation (ASF)'s Travel Assistance Committee (TAC) is now accepting applications for ApacheCon North America 2011, 7-11 November in Vancouver BC, Canada. The TAC is seeking individuals from the Apache community at-large --users, developers, educators, students, Committers,

Re: [jira] [Commented] (SOLR-2564) Integrating grouping module into Solr 4.0

2011-06-08 Thread Simon Willnauer
On Wed, Jun 8, 2011 at 2:27 PM, Robert Muir (JIRA) j...@apache.org wrote:    [ https://issues.apache.org/jira/browse/SOLR-2564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13045924#comment-13045924 ] Robert Muir commented on SOLR-2564:

Remove @version tags from JDocs

2011-06-09 Thread Simon Willnauer
hey folks, in solr and some lucene classes we have @version tags with svn $Id stuff in there which we got rid of in lucene a while ago. I went through all classes and removed them. I just want to check with everybody if its ok to commit that. Note: I only changed javadocs all other usage of $Id

Re: Distributed search capability

2011-06-09 Thread Simon Willnauer
hey jason, you are talking about the RMI contrib/remote? It was dropped a while ago since everybody rolls its own mechanism and some queries / filters didn't work with it. simon On Thu, Jun 9, 2011 at 7:29 PM, Jason Rutherglen jason.rutherg...@gmail.com wrote: Hi, I am wondering what happened

Re: Heads Up - Index File Format Change on Trunk

2011-06-10 Thread Simon Willnauer
, 2011 at 10:01 AM, Simon Willnauer simon.willna...@googlemail.com wrote: Hey folks, I just committed LUCENE-3108 (Landing DocValues on Trunk) which adds a byte to FieldInfo. If you are running on trunk you must / should re-index any trunk indexes once you update to the latest trunk. its likely

Re: commit-check target for ant?

2011-06-11 Thread Simon Willnauer
On Sat, Jun 11, 2011 at 8:04 AM, Dawid Weiss dawid.we...@cs.put.poznan.pl wrote: -0 ... wouldn't that mean that users running *actual* 1.5 JVM installs couldn't compile from source?  I think it would be a bad idea to say that our compile JVM requirements are differnet then our run JVM

Re: A TwoPhaseCommit interface

2011-06-11 Thread Simon Willnauer
On Sat, Jun 11, 2011 at 2:00 PM, Michael McCandless luc...@mikemccandless.com wrote: This sounds great! +1 Mike McCandless http://blog.mikemccandless.com On Sat, Jun 11, 2011 at 7:28 AM, Shai Erera ser...@gmail.com wrote: Hi I wrote a TwoPhaseCommit interface which declares the methods

Re: Welcome Jan Høydahl as Lucene/Solr committer

2011-06-13 Thread Simon Willnauer
Welcome! On Mon, Jun 13, 2011 at 5:04 PM, Robert Muir rcm...@gmail.com wrote: Welcome Jan! On Mon, Jun 13, 2011 at 10:43 AM, Mark Miller markrmil...@gmail.com wrote: I'm happy to announce that the Lucene/Solr PMC has voted in Jan Høydahl as our newest committer. Jan, if you don't mind,

Re: Indexing slower in trunk

2011-06-13 Thread Simon Willnauer
On Mon, Jun 13, 2011 at 8:13 PM, Erick Erickson erickerick...@gmail.com wrote: I half remember that this has come up before, but I couldn't find the thread. I was running some tests over the weekend that involved indexing 1.9M documents from the English Wiki dump. I'm consistently seeing that

Re: Lucene Facet path

2011-06-13 Thread Simon Willnauer
I believe people are already looking into that but I am not sure. sounds reasonable to me but I think its going to be lots of work simon On Mon, Jun 13, 2011 at 11:34 PM, Jason Rutherglen jason.rutherg...@gmail.com wrote: Are we going the direction of creating full facet features outside of

Re: Indexing slower in trunk

2011-06-14 Thread Simon Willnauer
(seconds) = 309 About to commit, total time so far: 309 Total Time Taken- 309 seconds Total documents added- 1917728 Docs/sec- 6206 starting optimize optimizing took 270 seconds On Mon, Jun 13, 2011 at 4:50 PM, Simon Willnauer simon.willna...@googlemail.com wrote: On Mon, Jun 13, 2011 at 8

Re: [JENKINS] Lucene-Solr-tests-only-trunk - Build # 8823 - Failure

2011-06-14 Thread Simon Willnauer
ah this is tripping an assert I added a couple of weeks ago. We already have an issue for this here: https://issues.apache.org/jira/browse/LUCENE-3190 the problem here seems related to a very lowish rambuffer together with flushing by docCount. I was not able to reproduce it yet. each time this

Re: XmlCharFilter

2011-06-15 Thread Simon Willnauer
On Wed, Jun 15, 2011 at 2:24 PM, Erick Erickson erickerick...@gmail.com wrote: Yonik's law of patches states: A half-baked patch in Jira, with no documentation, no tests and no backwards compatibility is better than no patch at all. +1 simon and what you've described sounds wy better

Re: Indexing slower in trunk

2011-06-16 Thread Simon Willnauer
Docs/sec- 4855 Thanks, all Erick On Tue, Jun 14, 2011 at 4:39 AM, Simon Willnauer simon.willna...@googlemail.com wrote: Erick, it seems you need to adjust your settings for 4.0 a little. When you index with DWPT it builds thread private segments which are independently flushed to disk

Re: Related project link to ManifoldCF from Solr site?

2011-06-16 Thread Simon Willnauer
a link in the related projects section seems possible, what do other think? simon On Thu, Jun 16, 2011 at 7:46 PM, karl.wri...@nokia.com wrote: Hi folks, How hard would it be to get a link to ManifoldCF from the Solr site’s related-link section?  I’m seeing a lot of people who know Solr

Re: Related project link to ManifoldCF from Solr site?

2011-06-17 Thread Simon Willnauer
On Fri, Jun 17, 2011 at 9:21 AM, Tommaso Teofili tommaso.teof...@gmail.com wrote: I think also Tika (tika.apache.org) and UIMA (uima.apache.org) could be added. agreeed simon Tommaso 2011/6/16 Mark Miller markrmil...@gmail.com On Jun 16, 2011, at 2:00 PM, Simon Willnauer wrote: a link

Jenkins Build for flexscoring branch

2011-06-19 Thread Simon Willnauer
Hey folks, FYI, I created a Jenkins job for the flexscoring branch which runs the test only every 4h. https://builds.apache.org/job/Lucene-Solr-tests-only-flexscoring-branch/ Simon - To unsubscribe, e-mail:

Re: Lucene 3.3 release soon?

2011-06-20 Thread Simon Willnauer
I would say within the next 3 month. Thoughts? On Mon, Jun 20, 2011 at 10:56 AM, Lukáš Vlček lukas.vl...@gmail.com wrote: Hi, How soon can we expect official Lucene 3.3 release? Best regards, Lukas - To unsubscribe, e-mail:

Re: Lucene 3.3 release soon?

2011-06-21 Thread Simon Willnauer
to run on hudson. On Mon, Jun 20, 2011 at 4:59 AM, Simon Willnauer simon.willna...@googlemail.com wrote: I would say within the next 3 month. Thoughts? On Mon, Jun 20, 2011 at 10:56 AM, Lukáš Vlček lukas.vl...@gmail.com wrote: Hi, How soon can we expect official Lucene 3.3 release? Best

Re: Lucene 3.3 release soon?

2011-06-21 Thread Simon Willnauer
means more release specific questions.  Expect to see questions / issues reported and you must ask what version are you using? before you can answer. May I suggest a scheduled release, once a quarter, near the end of a quarter? -JM -Original Message- From: Simon Willnauer

Re: [JENKINS] Lucene-Solr-tests-only-trunk - Build # 8984 - Still Failing

2011-06-22 Thread Simon Willnauer
I just committed a fix for this simon On Wed, Jun 22, 2011 at 2:51 PM, Apache Jenkins Server jenk...@builds.apache.org wrote: Build: https://builds.apache.org/job/Lucene-Solr-tests-only-trunk/8984/ 1 tests failed. REGRESSION:  

Re: Need to create new version 3.4 in JIRA

2011-06-26 Thread Simon Willnauer
On Sun, Jun 26, 2011 at 1:00 PM, Jan Høydahl jan@cominvent.com wrote: Now that 3.3 is being shipped we need 3.4 version in JIRA. I seem not to have rights for this power granted :) you are an JIRA admin now on both solr lucene! simon -- Jan Høydahl, search solution architect

Updateing the website

2011-06-27 Thread Simon Willnauer
hey folks, I tried to update the website yesterday and run into some problems with permissions etc. I talked to the infra guys which helped me to fix it. Yet, the fact that we are relying on grants cron job bugs me a little. It seems that we are doing things not the apache way where you just go

  1   2   3   4   5   6   7   8   9   10   >