Re: (SOLR-1868) Cutover to flex APIs

2010-04-08 Thread Michael McCandless
Hi Koji,

Contributions are very welcome :)

Unfortunately that wiki page is somewhat stale -- it hasn't been
updated based on what's been done on flex.  But I think big change
for Solr  is to cutover to FieldsEnum, TermsEnum, DocsEnum,
DocsAndPositionsEnum instead of TermEnum, TermDocs, TermPositions.

At some point (lower priority I think) we should make it possible for
an app to pick the postings codec, probably per-field?  EG the pulsing
codec would be good for highly unique fields (eg the id field), but is
very experimental right now.

Mike

On Thu, Apr 8, 2010 at 12:17 PM, Koji Sekiguchi  wrote:
> Michael McCandless (JIRA) wrote:
>> Cutover to flex APIs
>> 
>>
>>                  Key: SOLR-1868
>>                  URL: https://issues.apache.org/jira/browse/SOLR-1868
>>              Project: Solr
>>           Issue Type: Bug
>>             Reporter: Michael McCandless
>>              Fix For: 3.1
>>
>>
>> We need to fix Solr to use flex APIs!
>>
>>
> Hello,
>
> I'm a latecomer on the issue of "flex", but I'd like to know it
> and if possible, contribute something. But I chicken out
> when I see LUCENE-1458 and its friend issues:
>
> https://issues.apache.org/jira/browse/LUCENE/fixforversion/12314439
>
> I think I should read FlexibleIndexing on wiki, but I'd appreciate
> it if someone recommend pointers for flex latecomers, if any.
>
> Thank you!
>
> Koji
>
> --
> http://www.rondhuit.com/en/
>
>


[jira] Created: (SOLR-1868) Cutover to flex APIs

2010-04-07 Thread Michael McCandless (JIRA)
Cutover to flex APIs


 Key: SOLR-1868
 URL: https://issues.apache.org/jira/browse/SOLR-1868
 Project: Solr
  Issue Type: Bug
Reporter: Michael McCandless
 Fix For: 3.1


We need to fix Solr to use flex APIs!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Re: Welcome Uwe Schindler to the Lucene PMC

2010-04-01 Thread Michael McCandless
Welcome Uwe!!

Mike

On Thu, Apr 1, 2010 at 7:05 AM, Grant Ingersoll  wrote:
> I'm pleased to announce that the Lucene PMC has voted to add Uwe Schindler to 
> the PMC.  Uwe has been doing a lot of work in Lucene and Solr, including 
> several of the last releases in Lucene.
>
> Please join me in extending congratulations to Uwe!
>
> -Grant Ingersoll
> PMC Chair
> -
> To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
> For additional commands, e-mail: java-dev-h...@lucene.apache.org
>
>


Re: lucene and solr trunk

2010-03-18 Thread Michael McCandless
All tests pass for me :)

Mike

On Thu, Mar 18, 2010 at 12:27 PM, Mark Miller  wrote:
> Alight, so we have implemented Hoss' suggestion here on the lucene/solr
> merged dev branch at lucene/solr/branches/newtrunk.
>
> Feel free to check it out and give some feedback.
>
> We also roughly have Solr running on Lucene trunk - eg compiling Solr will
> first compile lucene and run off those compiled class files. Running dist or
> example in Solr will grab Lucene's jars and put them in the war. This still
> needs further love, but it works.
>
> There is also a top level build.xml with two targets: clean, and test. Clean
> will clean both Lucene and Solr, and test will run tests for both Lucene and
> Solr.
>
> Thanks to everyone that contributed to getting all this working!
>
> --
> - Mark
>
> http://www.lucidimagination.com
>
>
>
> On 03/17/2010 12:40 PM, Mark Miller wrote:
>>
>> Okay, so this looks good to me (a few others seemed to like it - though
>> Lucene-Dev was somehow dropped earlier) - lets try this out on the branch?
>> (then we can get rid of that "horrible" branch name ;) )
>>
>> Anyone on the current branch object to having to do a quick svn switch?
>>
>> On 03/16/2010 06:46 PM, Chris Hostetter wrote:
>>>
>>> : Otis, yes, I think so, eventually.  But that's gonna take much more
>>> discussion.
>>> :
>>> : I don't think this initial cutover should try to "solve" how modules
>>> : will be organized, yet... we'll get there, eventually.
>>>
>>> But we should at least consider it, and not move in a direction that's
>>> distinct from the ultimate goal of better refactoring (especailly since
>>> that was one of the main goals of unifying development efforts)
>>>
>>> Here's my concrete suggestion that could be done today (for simplicity:
>>> $svn = https://svn.apache.org/repos/asf/lucene)...
>>>
>>>   svn mv $svn/java/trunk $svn/java/tmp-migration
>>>   svn mkdir $svn/java/trunk
>>>   svn mv $svn/solr/trunk $svn/java/trunk/solr
>>>   svn mv $svn/java/tmp-migration $svn/java/trunk/core
>>>
>>> At which point:
>>>
>>> 0. People who want to work only on "Lucene-Java" can start checking out
>>> $svn/java/trunk/core (i'm pretty sure existing checkouts will continue to
>>> work w/o any changes, the svn info should just update itself)
>>>
>>> 1. build files can be added to (the new) $svn/java/trunk to build ./core
>>> followed by ./solr
>>>
>>> 2. the build files in $svn/java/trunk/solr can be modified to look at
>>> ../core/ to find lucene jars
>>>
>>> 3. people who care about Solr (including all committers) should start
>>> checking out and building all of $svn/java/trunk
>>>
>>> 4. Long term, we could choose to branch all of $svn/java/trunk
>>> for releases ... AND/OR we could choose to branch specific modules
>>> (ie: solr) independently (with modifications to the build files on those
>>> branches to pull in their dependencies from alternate locations
>>>
>>> 5. Long term, we can start refactoring additional "modules" out of
>>> $svn/java/trunk/solr and $svn/java/trunk/core (like
>>> $svn/java/trunk/core/contrib) into their own directory in $svn/java/trunk
>>>
>>> 6. Long term, people who want to work on more then just "core" but don't
>>> care about certain "modules" (like solr) can do a simple non-recursive
>>> checkout of $svn/java/trunk and then do full checkouts of whatever
>>> modules
>>> they care about
>>>
>>>
>>> (Please note: I'm just trying to list things we *could* do if we go this
>>> route, i'm not advocating that we *should* do any of these things)
>>>
>>> I can't think of any objections people have raised to any of the previous
>>> suggestions which apply to this suggestion.  Is there anything people can
>>> think of that would be useful, but not possible, if we go this route?
>>>
>>>
>>> -Hoss
>>>
>>
>>
>
>
>
> -
> To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
> For additional commands, e-mail: java-dev-h...@lucene.apache.org
>
>


Re: rough outline of where Solr's going

2010-03-18 Thread Michael McCandless
Ahh, OK.

Meaning Solr will have to remove deprecated support, which means
Solr's next released version would be a major release?  Ie 2.0?

Mike

On Thu, Mar 18, 2010 at 11:26 AM, Robert Muir  wrote:
> On Thu, Mar 18, 2010 at 11:33 AM, Michael McCandless
>  wrote:
>> On version numbering... my inclination would be to let Solr and Lucene
>> use their own version numbers (don't sync them up).  I know it'd
>> simplify our lives to have the same version across the board, but
>> these numbers are really for our users, telling them when big changes
>> were made, back compat broken, etc.  I think that trumps dev
>> convenience.
>
> Be sure to consider the deprecations removal, its not possible for
> Solr to move to Lucene's trunk without this.
>
> Here are two examples of necessary deprecation removals in the branch
> so that Solr can use Lucene's trunk:
> https://issues.apache.org/jira/browse/SOLR-1820
> http://www.lucidimagination.com/search/document/f07da8e4d69f5bfe/removal_of_deprecated_htmlstrip_tokenizer_factories
>
> It seems to be the consensus that people want a major version change
> number when this is done.
>
> So this is an example where the version numbers of Solr really do
> relate to Lucene, if we want them to share the same trunk.
>
>
> --
> Robert Muir
> rcm...@gmail.com
>


Re: rough outline of where Solr's going

2010-03-18 Thread Michael McCandless
On Thu, Mar 18, 2010 at 8:20 AM, Marvin Humphrey  wrote:

> I think the concern is what happens if Solr migrates a bunch of stuff into
> Lucene, ceding control over crucial functionality, and then Solr wants to
> release but Lucene does not.  That would pose a problem for Solr, no?

But, I don't think we'd ever do this -- ie when we release Solr we'll
also release Lucene.

Think about it... if for some exotic reason Lucene is unreleasable,
then, presumably we would not up and release Solr until we fixed
whatever was broken with Lucene, since it'd also break Solr.

It is conceivable we could release only Lucene (yes, this was an
explicit part of the vote, take 2), but I expect in practice that'll be
the exception not the rule... it remains to be seen.

On version numbering... my inclination would be to let Solr and Lucene
use their own version numbers (don't sync them up).  I know it'd
simplify our lives to have the same version across the board, but
these numbers are really for our users, telling them when big changes
were made, back compat broken, etc.  I think that trumps dev
convenience.

Mike


Re: lucene and solr trunk

2010-03-17 Thread Michael McCandless
Git, Maven, Hg, etc., all sound great for the future, but let's focus
now on the baby step (how to re-org svn), today, so we can land the
Solr upgrade work now being done on a branch...

Hoss's side-by-side proposal sounds great... and his concrete steps
"that could be done today" look good (I'm hoping Uwe "the policeman
who wears many hats" volunteers ;).

Mike

On Wed, Mar 17, 2010 at 8:05 AM, Otis Gospodnetic
 wrote:
> +1 for this structure and this set of steps.
>  Otis
>
>
>
>
> - Original Message 
>> From: Chris Hostetter 
>> To: solr-dev@lucene.apache.org
>> Sent: Tue, March 16, 2010 6:46:19 PM
>> Subject: Re: lucene and solr trunk
>>
>> : Otis, yes, I think so, eventually.  But that's gonna take much more
>> discussion.
> :
> : I don't think this initial cutover should try to "solve"
>> how modules
> : will be organized, yet... we'll get there,
>> eventually.
>
> But we should at least consider it, and not move in a
>> direction that's
> distinct from the ultimate goal of better refactoring
>> (especailly since
> that was one of the main goals of unifying development
>> efforts)
>
> Here's my concrete suggestion that could be done today (for
>> simplicity:
> $svn =
>> target=_blank >https://svn.apache.org/repos/asf/lucene)...
>
>  svn
>> mv $svn/java/trunk $svn/java/tmp-migration
>  svn mkdir
>> $svn/java/trunk
>  svn mv $svn/solr/trunk $svn/java/trunk/solr
>
>> svn mv $svn/java/tmp-migration $svn/java/trunk/core
>
> At which
>> point:
>
> 0. People who want to work only on "Lucene-Java" can start
>> checking out
> $svn/java/trunk/core (i'm pretty sure existing checkouts will
>> continue to
> work w/o any changes, the svn info should just update
>> itself)
>
> 1. build files can be added to (the new) $svn/java/trunk to build
>> ./core
> followed by ./solr
>
> 2. the build files in $svn/java/trunk/solr
>> can be modified to look at
> ../core/ to find lucene jars
>
> 3. people who
>> care about Solr (including all committers) should start
> checking out and
>> building all of $svn/java/trunk
>
> 4. Long term, we could choose to branch
>> all of $svn/java/trunk
> for releases ... AND/OR we could choose to branch
>> specific modules
> (ie: solr) independently (with modifications to the build
>> files on those
> branches to pull in their dependencies from alternate
>> locations
>
> 5. Long term, we can start refactoring additional "modules" out
>> of
> $svn/java/trunk/solr and $svn/java/trunk/core (like
>>
> $svn/java/trunk/core/contrib) into their own directory in
>> $svn/java/trunk
>
> 6. Long term, people who want to work on more then just
>> "core" but don't
> care about certain "modules" (like solr) can do a simple
>> non-recursive
> checkout of $svn/java/trunk and then do full checkouts of
>> whatever modules
> they care about
>
>
> (Please note: I'm just trying to
>> list things we *could* do if we go this
> route, i'm not advocating that we
>> *should* do any of these things)
>
> I can't think of any objections people
>> have raised to any of the previous
> suggestions which apply to this
>> suggestion.  Is there anything people can
> think of that would be
>> useful, but not possible, if we go this route?
>
>
> -Hoss
>


Re: lucene and solr trunk

2010-03-16 Thread Michael McCandless
Otis, yes, I think so, eventually.  But that's gonna take much more discussion.

I don't think this initial cutover should try to "solve" how modules
will be organized, yet... we'll get there, eventually.

Mike

On Tue, Mar 16, 2010 at 4:57 PM, Otis Gospodnetic
 wrote:
> Hi,
>
> Check out the dir structure mentioned here: 
> http://markmail.org/message/gwpmaevw7tavqqge
>
> Isn't that what we want?
>
>  Otis
> 
> Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
> Hadoop ecosystem search :: http://search-hadoop.com/
>
>
>
> - Original Message 
>> From: Mark Miller 
>> To: solr-dev@lucene.apache.org
>> Sent: Mon, March 15, 2010 11:43:48 PM
>> Subject: Re: lucene and solr trunk
>>
>> On 03/15/2010 11:28 PM, Yonik Seeley wrote:
>> So, we have a few options on
>> where to put Solr's new trunk:
>>
>>
>> Solr moves to Lucene's
>> trunk:
>>    /java/trunk, /java/trunk/sol
> +1. With the goal of
>> merged dev, merged tests, this looks the best to
> me. Simple to do patches
>> that span both, simple to setup
> Solr to use Lucene trunk rather than jars.
>> Short paths. Simple. I like it.
>
> --
> - Mark
>
>
>> href="http://www.lucidimagination.com"; target=_blank
>> >http://www.lucidimagination.com
>


Free live video streaming of ApacheCon US 2009

2009-11-04 Thread Michael McCandless
Team,

For those Lucene fanatics not in Oakland this week for ApacheCon US,
don't miss the FREE live video streaming, starting today:

  http://streaming.linux-magazin.de/en/program-apachecon-us-2009.htm

Note that there are many talks available, covering Apache Hadoop,
Apache HTTPD, Lucene, as well as the Apache Pioneer's Panel and
keynote presentations.

Lucene's track is this Friday (NOTE these times are UTC -- use
http://www.timeanddate.com to map to your time zone):

 17:00 Implementing an Information Retrieval Framework for an
   Organizational Repository, Sithu D Sudarsan

 18:00 Apache Mahout - Going from raw data to information
   Isabel Drost

 19:15 MIME Magic with Apache Tika
   Jukka Zitting

 20:15 Keynote: How Open Source Developers Can (Still!) Save The World
   Brian Behlendorf

 22:00 Building Intelligent Search Applications with the Lucene
   Ecosystem, Ted Dunning

 23:00 Realtime Search
   Jason Rutherglen

Happy viewing,

Mike


[jira] Commented: (SOLR-940) TrieRange support

2009-06-25 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12724286#action_12724286
 ] 

Michael McCandless commented on SOLR-940:
-

OK try again?  Maybe 3rd time's the charm...

> TrieRange support
> -
>
> Key: SOLR-940
> URL: https://issues.apache.org/jira/browse/SOLR-940
> Project: Solr
>  Issue Type: New Feature
>Reporter: Yonik Seeley
>Assignee: Shalin Shekhar Mangar
> Fix For: 1.4
>
> Attachments: SOLR-940-LUCENE-1602.patch, SOLR-940-LUCENE-1602.patch, 
> SOLR-940-LUCENE-1701.patch, SOLR-940-newTrieAPI.patch, 
> SOLR-940-newTrieAPI.patch, SOLR-940-rangequery.patch, 
> SOLR-940-rangequery.patch, SOLR-940-test.patch, SOLR-940.patch, 
> SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, 
> SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, SOLR-940.patch
>
>
> We need support in Solr for the new TrieRange Lucene functionality.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-940) TrieRange support

2009-06-25 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12724255#action_12724255
 ] 

Michael McCandless commented on SOLR-940:
-

Sigh.  I'll go reopen LUCENE-1630!

> TrieRange support
> -
>
> Key: SOLR-940
> URL: https://issues.apache.org/jira/browse/SOLR-940
> Project: Solr
>  Issue Type: New Feature
>Reporter: Yonik Seeley
>Assignee: Shalin Shekhar Mangar
> Fix For: 1.4
>
> Attachments: SOLR-940-LUCENE-1602.patch, SOLR-940-LUCENE-1602.patch, 
> SOLR-940-LUCENE-1701.patch, SOLR-940-newTrieAPI.patch, 
> SOLR-940-newTrieAPI.patch, SOLR-940-rangequery.patch, 
> SOLR-940-rangequery.patch, SOLR-940-test.patch, SOLR-940.patch, 
> SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, 
> SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, SOLR-940.patch
>
>
> We need support in Solr for the new TrieRange Lucene functionality.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-940) TrieRange support

2009-06-25 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12724080#action_12724080
 ] 

Michael McCandless commented on SOLR-940:
-

OK I just committed a fix for LUCENE-1630 that should fix that exception.

> TrieRange support
> -
>
> Key: SOLR-940
> URL: https://issues.apache.org/jira/browse/SOLR-940
> Project: Solr
>  Issue Type: New Feature
>Reporter: Yonik Seeley
>Assignee: Shalin Shekhar Mangar
> Fix For: 1.4
>
> Attachments: SOLR-940-LUCENE-1602.patch, SOLR-940-LUCENE-1602.patch, 
> SOLR-940-LUCENE-1701.patch, SOLR-940-newTrieAPI.patch, 
> SOLR-940-newTrieAPI.patch, SOLR-940-rangequery.patch, 
> SOLR-940-rangequery.patch, SOLR-940-test.patch, SOLR-940.patch, 
> SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, 
> SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, SOLR-940.patch
>
>
> We need support in Solr for the new TrieRange Lucene functionality.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-940) TrieRange support

2009-06-25 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12723975#action_12723975
 ] 

Michael McCandless commented on SOLR-940:
-

Shalin I think that exception you got is a break in back-compat.  Sorry :(  I'm 
reopening LUCENE-1630 to fix it...

> TrieRange support
> -
>
> Key: SOLR-940
> URL: https://issues.apache.org/jira/browse/SOLR-940
> Project: Solr
>  Issue Type: New Feature
>Reporter: Yonik Seeley
>Assignee: Shalin Shekhar Mangar
> Fix For: 1.4
>
> Attachments: SOLR-940-LUCENE-1602.patch, SOLR-940-LUCENE-1602.patch, 
> SOLR-940-LUCENE-1701.patch, SOLR-940-newTrieAPI.patch, 
> SOLR-940-newTrieAPI.patch, SOLR-940-rangequery.patch, 
> SOLR-940-rangequery.patch, SOLR-940-test.patch, SOLR-940.patch, 
> SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, 
> SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, SOLR-940.patch, SOLR-940.patch
>
>
> We need support in Solr for the new TrieRange Lucene functionality.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-1221) Change Solr Highlighting to use the SpanScorer with MultiTerm expansion by default

2009-06-16 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12720285#action_12720285
 ] 

Michael McCandless commented on SOLR-1221:
--

+1 ;)  Solr's defaults are important, too.

> Change Solr Highlighting to use the SpanScorer with MultiTerm expansion by 
> default
> --
>
> Key: SOLR-1221
> URL: https://issues.apache.org/jira/browse/SOLR-1221
> Project: Solr
>  Issue Type: Improvement
>  Components: highlighter
>Reporter: Mark Miller
>Assignee: Mark Miller
> Fix For: 1.4
>
>
> To improve the out of the box experience of Solr 1.4, I really think we 
> should make this change. You will still be able to turn both off.
> Comments?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Re: Welcome new Solr committers Mark Miller and Noble Paul

2009-05-01 Thread Michael McCandless
Welcome aboard Mark and Noble!

Mike

On Thu, Apr 30, 2009 at 1:41 PM, Yonik Seeley  wrote:
> I'm pleased to announce that Mark Miller and Noble Paul have accepted
> invitations to become Solr committers!
> Welcome Mark & Noble, and thanks for all your great work on Solr!
>
> -Yonik
>
> ps: following recent custom, why don't you guys tell us a little about
> yourselves (in addition to adding yourself to
> http://lucene.apache.org/solr/who.html when you get the karma)
>


Re: lucene 2.9 migration issues

2009-04-24 Thread Michael McCandless
This is LUCENE-1573 (properly responding to Thread.interrupt()), which
is new in 2.9.

Do you know what would have interrupted the thread in your context?

Mike

On Fri, Apr 24, 2009 at 2:25 PM, Ryan McKinley  wrote:
> thanks!   I'm not sure this is related to 2.9, but I have not seen this
> before either
>
> 2009-04-24 14:19:53,024 ERROR org.apache.solr.core.SolrCore -
> java.lang.RuntimeException: java.lang.InterruptedException: sleep
> interrupted
>        at
> org.apache.lucene.index.SegmentInfos$FindSegmentsFile.run(SegmentInfos.java:620)
>        at org.apache.lucene.index.SegmentInfos.read(SegmentInfos.java:289)
>        at org.apache.lucene.index.IndexWriter.init(IndexWriter.java:1456)
>        at org.apache.lucene.index.IndexWriter.(IndexWriter.java:1295)
>        at
> org.apache.solr.update.SolrIndexWriter.(SolrIndexWriter.java:159)
>        at
> org.apache.solr.update.UpdateHandler.createMainIndexWriter(UpdateHandler.java:123)
>        at
> org.apache.solr.update.DirectUpdateHandler2.openWriter(DirectUpdateHandler2.java:170)
>        at
> org.apache.solr.update.DirectUpdateHandler2.addDoc(DirectUpdateHandler2.java:217)
>        at
> org.apache.solr.update.processor.RunUpdateProcessor.processAdd(RunUpdateProcessorFactory.java:60)
>        at
> org.apache.solr.handler.XMLLoader.processUpdate(XMLLoader.java:140)
>        at org.apache.solr.handler.XMLLoader.load(XMLLoader.java:69)
>        at
> org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:54)
>        at
> org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
>        at org.apache.solr.core.SolrCore.execute(SolrCore.java:1330)
>        at
> org.apache.solr.client.solrj.embedded.EmbeddedSolrServer.request(EmbeddedSolrServer.java:139)
>        at
> org.apache.solr.client.solrj.request.UpdateRequest.process(UpdateRequest.java:259)
>        at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:63)
>
> Any thoughts what could cause that?  This is using the embedded solr
> server...
>
>
> On Apr 24, 2009, at 12:10 PM, Shalin Shekhar Mangar wrote:
>
>> On Fri, Apr 24, 2009 at 9:07 PM, Ryan McKinley  wrote:
>>
>>> Yes, that would be great!  the changes we need are in rev 768275:
>>> http://svn.apache.org/viewvc?view=rev&revision=768275
>>>
>>
>> Done. I upgraded to r768336.
>>
>> --
>> Regards,
>> Shalin Shekhar Mangar.
>
>


Re: faster example schema

2009-03-07 Thread Michael McCandless


+1

I think appropriate selection of defaults, biasing for performance in  
this

case, is extremely important yet often overlooked.

The out-of-the-box first-impression experience that people see, with  
Solr

or Lucene, often decides whether Solr or Lucene should be considered
for an application.

Mike

Yonik Seeley wrote:


I've occasionally run across people going with another search engine
because it was faster at indexing.
The example schema that people may be using as a base to do their
benchmarking (with perhaps minimal modifications) is slow.
There are many people out there that check what's fastest first, and
*then* check if it is satisfactory to meet their needs in other areas.

With very simple synthetic test documents (just a few fields each) and
the CSV loader, I've personally seen the indexing rate go from
~330/sec to ~3000/sec, when I removed the default field values, term
vectors, copyFields, etc.  The default example schema should still be
able to show how something can be done, but that doesn't mean it needs
to be enabled by default.

So what do people think about speeding up the default/example schema  
before 1.4?


-Yonik
http://www.lucidimagination.com




Re: trunk test failure

2009-01-29 Thread Michael McCandless


Except... the only reason (I think) that the Lucene build failed was  
because I had switched the back-compat test to check out an https://  
URL instead of http:// (the build got hung asking whether you trust  
that host because its cert was suspect) so I switched it back...  
and I'm not sure why that Solr test failed!  Hmmm...


Mike

Yonik Seeley wrote:

On Thu, Jan 29, 2009 at 12:07 PM, Mark Miller  
 wrote:

Mark Miller wrote:


Anyone else seeing the function query test failing on trunk?

- Mark


Okay, I tried building trunk lucene myself, and after putting that  
in, the

function query test passed. Perhaps an odd Lucene build went in?


The previous lucene nightly had failed (on an svn tag lookup), so I
built lucene myself (and all tests did pass).
I've tried multiple times, and I can't get that same test to fail
(WinXP).  Very strange.

Anyway, since the latest lucene nightly did pass, I'll update to that.
Hopefully that should fix things.

-Yonik




[jira] Commented: (SOLR-342) Add support for Lucene's new Indexing and merge features (excluding Document/Field/Token reuse)

2008-02-19 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12570327#action_12570327
 ] 

Michael McCandless commented on SOLR-342:
-

Super, I'm glad to hear that!

> Add support for Lucene's new Indexing and merge features (excluding 
> Document/Field/Token reuse)
> ---
>
> Key: SOLR-342
> URL: https://issues.apache.org/jira/browse/SOLR-342
> Project: Solr
>  Issue Type: Improvement
>  Components: update
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
>Priority: Minor
> Attachments: copyLucene.sh, SOLR-342.patch, SOLR-342.patch, 
> SOLR-342.patch, SOLR-342.tar.gz
>
>
> LUCENE-843 adds support for new indexing capabilities using the 
> setRAMBufferSizeMB() method that should significantly speed up indexing for 
> many applications.  To fix this, we will need trunk version of Lucene (or 
> wait for the next official release of Lucene)
> Side effect of this is that Lucene's new, faster StandardTokenizer will also 
> be incorporated.  
> Also need to think about how we want to incorporate the new merge scheduling 
> functionality (new default in Lucene is to do merges in a background thread)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-342) Add support for Lucene's new Indexing and merge features (excluding Document/Field/Token reuse)

2008-02-18 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12569845#action_12569845
 ] 

Michael McCandless commented on SOLR-342:
-

Will, one more thing to try is to on assertions for org.apache.lucene.*; this 
may catch an issue sooner.

> Add support for Lucene's new Indexing and merge features (excluding 
> Document/Field/Token reuse)
> ---
>
> Key: SOLR-342
> URL: https://issues.apache.org/jira/browse/SOLR-342
> Project: Solr
>  Issue Type: Improvement
>  Components: update
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
>Priority: Minor
> Attachments: copyLucene.sh, SOLR-342.patch, SOLR-342.patch, 
> SOLR-342.patch, SOLR-342.tar.gz
>
>
> LUCENE-843 adds support for new indexing capabilities using the 
> setRAMBufferSizeMB() method that should significantly speed up indexing for 
> many applications.  To fix this, we will need trunk version of Lucene (or 
> wait for the next official release of Lucene)
> Side effect of this is that Lucene's new, faster StandardTokenizer will also 
> be incorporated.  
> Also need to think about how we want to incorporate the new merge scheduling 
> functionality (new default in Lucene is to do merges in a background thread)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-342) Add support for Lucene's new Indexing and merge features (excluding Document/Field/Token reuse)

2008-02-16 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12569525#action_12569525
 ] 

Michael McCandless commented on SOLR-342:
-

Will, did you create a new index in your test?

Also make sure you are using this URL to checkout the 2.3 branch sources:

  https://svn.apache.org/repos/asf/lucene/java/branches/lucene_2_3

You should see 7 issues listed in the CHANGES.txt under "Bug fixes"?

> Add support for Lucene's new Indexing and merge features (excluding 
> Document/Field/Token reuse)
> ---
>
> Key: SOLR-342
> URL: https://issues.apache.org/jira/browse/SOLR-342
> Project: Solr
>  Issue Type: Improvement
>  Components: update
>Reporter: Grant Ingersoll
>Assignee: Grant Ingersoll
>Priority: Minor
> Attachments: copyLucene.sh, SOLR-342.patch, SOLR-342.patch, 
> SOLR-342.patch, SOLR-342.tar.gz
>
>
> LUCENE-843 adds support for new indexing capabilities using the 
> setRAMBufferSizeMB() method that should significantly speed up indexing for 
> many applications.  To fix this, we will need trunk version of Lucene (or 
> wait for the next official release of Lucene)
> Side effect of this is that Lucene's new, faster StandardTokenizer will also 
> be incorporated.  
> Also need to think about how we want to incorporate the new merge scheduling 
> functionality (new default in Lucene is to do merges in a background thread)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Re: switch to native locks by default?

2007-01-15 Thread Michael McCandless

Chris Hostetter wrote:

: Ah, I hadn't realized that they might not be supported everywhere... I

I'm just trusting the javadoc for NativeFSLockFactory ... i have no idea
if it's accurate or not.


Hi!  I had added the caveat about native locks based on my dicey
experience getting them working over NFS (NFS locking was not "turned
on" by default in my setup; and, frustratingly, it would take ~35
seconds for a timeout to tell me this).

I don't have any specific evidence that other OS/filesystems are
problematic but then again I haven't done much research to understand
overall portability of Java's native lock interface.  It would not
surprise me if other OS/filesystems had issues.

I was hoping by getting the NativeFSLockFactory out there that it
would get some healthy testing first and then we could use that
feedback to decide whether benefits outweigh the risks of making it
the default.  It's not clear how many people have actually tested it
at this point, though!


: The current locking can also guard against mistakes though (multiple
: instances of Solr trying to write to the same dir, someone opening a
: Luke index on it, etc).

right ... but it's only useful if all of the potential clients are using
the same locking mechanism ... right now it's only safe to do any of those
things if all the apps use SimpleFSLockFactory.  all the more reason to
make the factory and the lockDir configurable in Solr i guess.


Mike


Re: Solr, new index format (segments_N)

2006-11-28 Thread Michael McCandless

Yonik Seeley wrote:

On 11/28/06, Manuel Albela Miranda <[EMAIL PROTECTED]> wrote:

Thanks for your answer. Yes, i'm using Sun's JDK 1.5.0 too. Can you tell
me the step you followed to get solr running with the lockless commits?.
I compile the latest java lucene and then put the jar files generated in
the lib directory of the solr distribution. Finally running 'ant' it
compiles without any problem. But when trying the example with the new
index format, it fails.


These are the steps I did (with Sun's JDK 1.5.0):

  * Checked out the current trunk of Solr (from svn)

  * Checked out the current trunk of Lucene (from svn)

  * Built the Lucene JARs ("ant jar-core")

  * Copied jar to Solr dir:

  "cp build/lucene-core-2.1-dev.jar 
../solr/lib/lucene-core-nightly.jar"


  * Then ran "ant example" in solr dir (which creates solr.war under
example/webapps).

  * Start jetty (cd example; java -jar start.jar)

  * Submit all the xml files (cd exampledocs; sh post.sh *.xml)

  * Verified that index (example/solr/data/index/*) shows segments_N
file (not segments), to confirm lockless is in fact running.

  * Ran some searches, did some more of the examples (using curl to
delete & commit, etc.), and they all seem to be working fine.

Mike


Re: Solr, new index format (segments_N)

2006-11-27 Thread Michael McCandless

Manuel Albela Miranda wrote:

Hello everybody,
I wanted to know what can i do to make solr manage the new index format 
that comes with last svn version of Lucene, i mean, now there are not a 
segments file but segments_1, segments_2 and so on, and a segments.gen 
file. I have tried to compile solr with the last .jar files generated by 
the compilation of the lucene sources but i'm having the following errors:


Hello!  I was able to recompile the Solr example (solr.war) using the
current trunk version of Lucene (with lockless commits).  I was also
able to walk through the initial steps of the example (update, delete,
searching) running under Jetty and everything seems to work correctly.

I'm using Sun's JDK 1.5 on Linux.

From your exception, it looks like your JRE can't find the "matches"
native method for a java.lang.String, which is very strange.  This
method has been available since 1.4.  I'm not sure why this would
happen.  Maybe double-check to make sure you're running >= JRE 1.4?

Mike


Re: [jira] Updated: (SOLR-52) Lazy Field loading

2006-11-17 Thread Michael McCandless

Yonik Seeley wrote:

On 11/15/06, Mike Klaas <[EMAIL PROTECTED]> wrote:

Any objections to sync'ing solr with lucene trunk?  It might be nice
from an impact perspective to do so before lockless commits are
committed.


+1, my thoughts as well.


Hi!  Should I hold off on committing lockless until you've done
a sync for Solr to the Lucene trunk?  Is this done now?

Mike