[Lucene.Net] Problem while creating index for the xml file

2011-05-16 Thread Lalitha siva jyothi V
Dear Lucene team, I would like to create index files for the below xml file using Lucene.Net dll v2.9. I used the below code, but its not working. Please guide me to create index files for the below xml file. Thanks in advance NewsHistory News Story eid=34151

RE: [Lucene.Net] Problem while creating index for the xml file

2011-05-16 Thread Prescott Nasser
What's the issue your having? Seems like you're indexing the entire XML document as one field, which likely isn't the best way to go ~P Date: Tue, 17 May 2011 11:04:30 +0530 From: vlalithasivajyo...@gmail.com To: lucene-net-dev@lucene.apache.org

[JENKINS] Lucene-Solr-tests-only-trunk - Build # 8078 - Still Failing

2011-05-16 Thread Apache Jenkins Server
Build: https://builds.apache.org/hudson/job/Lucene-Solr-tests-only-trunk/8078/ No tests ran. Build Log (for compile errors): [...truncated 47 lines...] - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For

[jira] [Updated] (LUCENE-3093) Build failed in the flexscoring branch because of Javadoc warnings

2011-05-16 Thread David Mark Nemeskey (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Mark Nemeskey updated LUCENE-3093: Affects Version/s: flexscoring branch Thanks Robert! I have added the flexscoring

[jira] [Commented] (SOLR-2448) Upgrade Carrot2 to version 3.5.0

2011-05-16 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13033919#comment-13033919 ] Stanislaw Osinski commented on SOLR-2448: - Hi, if there are no objections, I'd like

[jira] [Updated] (LUCENE-3101) TestMinimize.testAgainstBrzozowski reproducible seed OOM

2011-05-16 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-3101: Attachment: LUCENE-3101_test.patch an explicit test case TestMinimize.testAgainstBrzozowski

[jira] [Assigned] (LUCENE-3070) Enable DocValues by default for every Codec

2011-05-16 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer reassigned LUCENE-3070: --- Assignee: Simon Willnauer Enable DocValues by default for every Codec

[jira] [Commented] (LUCENE-3098) Grouped total count

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13033937#comment-13033937 ] Michael McCandless commented on LUCENE-3098: Patch looks great Martijn;

[jira] [Commented] (LUCENE-3097) Post grouping faceting

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13033939#comment-13033939 ] Michael McCandless commented on LUCENE-3097: Thanks for the example Bill --

[jira] [Commented] (LUCENE-3097) Post grouping faceting

2011-05-16 Thread Martijn van Groningen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13033940#comment-13033940 ] Martijn van Groningen commented on LUCENE-3097: --- bq. If I say,

[jira] [Commented] (LUCENE-3098) Grouped total count

2011-05-16 Thread Martijn van Groningen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13033943#comment-13033943 ] Martijn van Groningen commented on LUCENE-3098: --- I will update both patches

[jira] [Commented] (LUCENE-3098) Grouped total count

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13033946#comment-13033946 ] Michael McCandless commented on LUCENE-3098: One more idea: should we add a

[jira] [Commented] (LUCENE-3097) Post grouping faceting

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13033947#comment-13033947 ] Michael McCandless commented on LUCENE-3097: Right, gender in this example

[jira] [Commented] (LUCENE-3101) TestMinimize.testAgainstBrzozowski reproducible seed OOM

2011-05-16 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13033950#comment-13033950 ] Robert Muir commented on LUCENE-3101: - the problem appears to be splitblock[] and

[jira] [Commented] (LUCENE-3097) Post grouping faceting

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13033953#comment-13033953 ] Michael McCandless commented on LUCENE-3097: In fact, I think a very

[jira] [Updated] (LUCENE-3070) Enable DocValues by default for every Codec

2011-05-16 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-3070: Attachment: LUCENE-3070.patch This patch adds UOE to PreFlex codec and makes

[jira] [Updated] (LUCENE-3014) comparator API for segment versions

2011-05-16 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-3014: Attachment: LUCENE-3014.patch initial patch comparator API for segment versions

[jira] [Commented] (LUCENE-3070) Enable DocValues by default for every Codec

2011-05-16 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13033970#comment-13033970 ] Robert Muir commented on LUCENE-3070: - Seems like it might be a good idea in

Re: 3.2.0 (or 3.1.1)

2011-05-16 Thread Simon Willnauer
+1 for pushing 3.2!! There have been discussions about porting DWPT to 3.x but I think its a little premature now and I am still not sure if we should do it at all. The refactoring is pretty intense throughout all IndexWriter and it integrates with Flex / Codecs. I am not saying its impossible,

[jira] [Commented] (LUCENE-3070) Enable DocValues by default for every Codec

2011-05-16 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13033971#comment-13033971 ] Simon Willnauer commented on LUCENE-3070: - bq. Seems like it might be a good idea

[jira] [Updated] (LUCENE-3070) Enable DocValues by default for every Codec

2011-05-16 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-3070: Attachment: LUCENE-3070.patch new patch, I added random DocValues to updateDocument and

Re: 3.2.0 (or 3.1.1)

2011-05-16 Thread Simon Willnauer
On Mon, May 16, 2011 at 1:30 PM, Robert Muir rcm...@gmail.com wrote: On Mon, May 16, 2011 at 7:10 AM, Simon Willnauer simon.willna...@googlemail.com wrote: the question is if we should backport stuff like LUCENE-2881 to 3.2 or if we should hold off until 3.3, should we do it at all? I think

[jira] [Updated] (LUCENE-3101) TestMinimize.testAgainstBrzozowski reproducible seed OOM

2011-05-16 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3101: -- Attachment: LUCENE-3101.patch This patch reverts splitblock[], partition[] and reverse[][] to

[jira] [Commented] (LUCENE-3070) Enable DocValues by default for every Codec

2011-05-16 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13033977#comment-13033977 ] Robert Muir commented on LUCENE-3070: - looks good, i think this will help the test

[jira] [Updated] (LUCENE-3070) Enable DocValues by default for every Codec

2011-05-16 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-3070: Attachment: LUCENE-3070.patch fixed typo - I will commit in a second. Enable DocValues

Moving towards Lucene 4.0

2011-05-16 Thread Simon Willnauer
Hey folks, we just started the discussion about Lucene 3.2 and releasing more often. Yet, I think we should also start planning for Lucene 4.0 soon. We have tons of stuff in trunk that people want to have and we can't just keep on talking about it - we need to push this out to our users. From my

[jira] [Updated] (LUCENE-3101) TestMinimize.testAgainstBrzozowski reproducible seed OOM

2011-05-16 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3101: -- Attachment: LUCENE-3101.patch After some perf analysis, it showed, that replacing the

[jira] [Updated] (LUCENE-3102) Few issues with CachingCollector

2011-05-16 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-3102: --- Lucene Fields: [New, Patch Available] (was: [New]) Few issues with CachingCollector

[jira] [Updated] (LUCENE-3102) Few issues with CachingCollector

2011-05-16 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-3102: --- Attachment: LUCENE-3102.patch Patch includes the bug fixes + test. Still none of the items I listed

Re: Moving towards Lucene 4.0

2011-05-16 Thread Shai Erera
I think we should also start planning for Lucene 4.0 soon. +1 ! I think we should focus on everything that's *infrastructure* in 4.0, so that we can develop additional features in subsequent 4.x releases. If we end up releasing 4.0 just to discover many things will need to wait to 5.0, it'll

[jira] [Resolved] (LUCENE-3101) TestMinimize.testAgainstBrzozowski reproducible seed OOM

2011-05-16 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler resolved LUCENE-3101. --- Resolution: Fixed Fix Version/s: 4.0 Lucene Fields: [New, Patch Available] (was:

Re: svn commit: r1103709 - in /lucene/java/site: docs/whoweare.html docs/whoweare.pdf src/documentation/content/xdocs/whoweare.xml

2011-05-16 Thread Simon Willnauer
stanislav you are a full committer afaik?! simon On Mon, May 16, 2011 at 2:11 PM, stanis...@apache.org wrote: Author: stanislaw Date: Mon May 16 12:11:57 2011 New Revision: 1103709 URL: http://svn.apache.org/viewvc?rev=1103709view=rev Log: Adding myself (Stanislaw Osinski) to the contrib

Re: svn commit: r1103711 - in /lucene/dev/trunk/lucene/src: java/org/apache/lucene/util/automaton/MinimizationOperations.java test/org/apache/lucene/util/automaton/TestMinimize.java

2011-05-16 Thread Simon Willnauer
On Mon, May 16, 2011 at 2:15 PM, uschind...@apache.org wrote: Author: uschindler Date: Mon May 16 12:15:45 2011 New Revision: 1103711 URL: http://svn.apache.org/viewvc?rev=1103711view=rev Log: LUCENE-3101: Fix n^2 memory usage in minimizeSchindler() ähm minimizeHopcroft() LOL ^ ^

[jira] [Resolved] (LUCENE-3070) Enable DocValues by default for every Codec

2011-05-16 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer resolved LUCENE-3070. - Resolution: Fixed Lucene Fields: [New, Patch Available] (was: [New]) Enable

Re: Moving towards Lucene 4.0

2011-05-16 Thread Michael McCandless
+1 Mike http://blog.mikemccandless.com On Mon, May 16, 2011 at 7:52 AM, Simon Willnauer simon.willna...@googlemail.com wrote: Hey folks, we just started the discussion about Lucene 3.2 and releasing more often. Yet, I think we should also start planning for Lucene 4.0 soon. We have tons of

[jira] [Reopened] (LUCENE-1149) add XA transaction support

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-1149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reopened LUCENE-1149: Sorry, you're right this issue isn't really a dup (I've reopened it). I was just

Re: Moving towards Lucene 4.0

2011-05-16 Thread Robert Muir
On Mon, May 16, 2011 at 7:52 AM, Simon Willnauer simon.willna...@googlemail.com wrote: Hey folks, we just started the discussion about Lucene 3.2 and releasing more often. Yet, I think we should also start planning for Lucene 4.0 soon. We have tons of stuff in trunk that people want to have

RE: Moving towards Lucene 4.0

2011-05-16 Thread Uwe Schindler
Sorry to be negative, - BulkPostings (my +1 since I want to enable positional scoring on all queries) My problem is the really crappy and unusable API of BulkPostings (wait for my talk at Lucene Rev...). For anybody else than Mike, Yonik and yourself that’s unusable. I tried to understand

Re: Moving towards Lucene 4.0

2011-05-16 Thread Robert Muir
On Mon, May 16, 2011 at 8:48 AM, Uwe Schindler u...@thetaphi.de wrote: Sorry to be negative, - BulkPostings (my +1 since I want to enable positional scoring on all queries) My problem is the really crappy and unusable API of BulkPostings (wait for my talk at Lucene Rev...). For anybody

[jira] [Reopened] (SOLR-2383) Velocity: Generalize range and date facet display

2011-05-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-2383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jan Høydahl reopened SOLR-2383: --- Reopening to add patch for branch 3.2 Velocity: Generalize range and date facet display

[jira] [Updated] (SOLR-2383) Velocity: Generalize range and date facet display

2011-05-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-2383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jan Høydahl updated SOLR-2383: -- Fix Version/s: 3.2 Velocity: Generalize range and date facet display

[jira] [Updated] (SOLR-2383) Velocity: Generalize range and date facet display

2011-05-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-2383?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jan Høydahl updated SOLR-2383: -- Attachment: SOLR-2383-branch_32.patch This Velocity enhancement should make it to 3.2. In this patch I

[jira] [Commented] (LUCENE-3101) TestMinimize.testAgainstBrzozowski reproducible seed OOM

2011-05-16 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034004#comment-13034004 ] Robert Muir commented on LUCENE-3101: - Thanks for reporting this selckin, this is a

[jira] [Commented] (LUCENE-3101) TestMinimize.testAgainstBrzozowski reproducible seed OOM

2011-05-16 Thread Dawid Weiss (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034006#comment-13034006 ] Dawid Weiss commented on LUCENE-3101: - There is a lot of power in randomness, huh? :)

Re: Moving towards Lucene 4.0

2011-05-16 Thread Simon Willnauer
On Mon, May 16, 2011 at 2:57 PM, Robert Muir rcm...@gmail.com wrote: On Mon, May 16, 2011 at 8:48 AM, Uwe Schindler u...@thetaphi.de wrote: Sorry to be negative, - BulkPostings (my +1 since I want to enable positional scoring on all queries) My problem is the really crappy and unusable API

Re: Moving towards Lucene 4.0

2011-05-16 Thread Robert Muir
On Mon, May 16, 2011 at 9:12 AM, Simon Willnauer simon.willna...@googlemail.com wrote: I have to admit that branch is very rough and the API is super hard to use. For now! Lets not be dragged away into discussion how this API should look like there will be time for that. +1, this is what i

RE: svn commit: r1103709 - in /lucene/java/site: docs/whoweare.html docs/whoweare.pdf src/documentation/content/xdocs/whoweare.xml

2011-05-16 Thread Steven A Rowe
Hi Stanisław, You don’t need to be logged into people.apache.org to update the website. Have you seen these instructions? The “unversioned website” section is what you want, I think: http://wiki.apache.org/lucene-java/HowToUpdateTheWebsite Steve From: stac...@gmail.com

Re: svn commit: r1103709 - in /lucene/java/site: docs/whoweare.html docs/whoweare.pdf src/documentation/content/xdocs/whoweare.xml

2011-05-16 Thread Stanislaw Osinski
Hi Steve, That explains everything, thanks! I somehow failed to locate that wiki page and was looking at http://wiki.apache.org/solr/Website_Update_HOWTO instead. S. On Mon, May 16, 2011 at 15:25, Steven A Rowe sar...@syr.edu wrote: Hi Stanisław, You don’t need to be logged into

Re: svn commit: r1103709 - in /lucene/java/site: docs/whoweare.html docs/whoweare.pdf src/documentation/content/xdocs/whoweare.xml

2011-05-16 Thread Mark Miller
On May 16, 2011, at 8:55 AM, Stanislaw Osinski wrote: stanislav you are a full committer afaik?! I've been working mostly on the clustering plugin for now, so I'm not sure if it's right to move me to the core section right away :-) Incidentally, I tried to svn up on

[jira] [Commented] (SOLR-1942) Ability to select codec per field

2011-05-16 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034018#comment-13034018 ] Robert Muir commented on SOLR-1942: --- any update on this? Would be nice to be able to hook

[jira] [Commented] (LUCENE-3098) Grouped total count

2011-05-16 Thread Martijn van Groningen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034025#comment-13034025 ] Martijn van Groningen commented on LUCENE-3098: --- Hmmm... So you get a list

Re: svn commit: r1103709 - in /lucene/java/site: docs/whoweare.html docs/whoweare.pdf src/documentation/content/xdocs/whoweare.xml

2011-05-16 Thread Stanislaw Osinski
Hi Mark, Thanks for clarifying the difference between contrib and full committers, I was probably too shy to subscribe myself to the latter group right away :-) For the time being, I'll most likely stick with maintaining the clustering bit and will consult you guys if I have something to

[jira] [Commented] (LUCENE-3098) Grouped total count

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034040#comment-13034040 ] Michael McCandless commented on LUCENE-3098: Right, we'd make it clear the

[jira] [Commented] (LUCENE-3098) Grouped total count

2011-05-16 Thread Martijn van Groningen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034050#comment-13034050 ] Martijn van Groningen commented on LUCENE-3098: --- That is true. It is just a

[jira] [Commented] (SOLR-1942) Ability to select codec per field

2011-05-16 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034051#comment-13034051 ] Grant Ingersoll commented on SOLR-1942: --- I thought I would have time last week, but

[jira] [Commented] (LUCENE-3090) DWFlushControl does not take active DWPT out of the loop on fullFlush

2011-05-16 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034053#comment-13034053 ] Simon Willnauer commented on LUCENE-3090: - I did 150 runs for all Lucene Tests

[jira] [Commented] (SOLR-1942) Ability to select codec per field

2011-05-16 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1942?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034059#comment-13034059 ] Robert Muir commented on SOLR-1942: --- ok thanks Grant. I'll take a look thru the patch

Re: Moving towards Lucene 4.0

2011-05-16 Thread Shai Erera
We anyway seem to mark every new API as @lucene.experimental these days, so we shouldn't have too much problem when 4.0 is out :). Experimental API is subject to change at any time. We can consider that as an option as well (maybe it adds another option to Robert's?). Though personally, I'm not

Re: Field should accept BytesRef?

2011-05-16 Thread Jason Rutherglen
But when you create an untokenized field (or even a binary field, which is stored-only at the moment), you could theoretically index the bytes directly Right, if I already have a BytesRef of what needs to be indexed, then passing the BR into Field/able should reduce garbage collection of

[jira] [Updated] (LUCENE-3102) Few issues with CachingCollector

2011-05-16 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-3102: --- Attachment: LUCENE-3102.patch bq. Only thing is: I would be careful about directly setting those

Re: Field should accept BytesRef?

2011-05-16 Thread Robert Muir
On Mon, May 16, 2011 at 11:29 AM, Jason Rutherglen jason.rutherg...@gmail.com wrote: But when you create an untokenized field (or even a binary field, which is stored-only at the moment), you could theoretically index the bytes directly Right, if I already have a BytesRef of what needs to be

[jira] [Resolved] (SOLR-2450) Carrot2 clustering should use both its own and Solr's stop words

2011-05-16 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stanislaw Osinski resolved SOLR-2450. - Resolution: Fixed Committed to trunk and branch_3x. Carrot2 clustering should use both

[jira] [Resolved] (SOLR-2449) Loading of Carrot2 resources from Solr config directory

2011-05-16 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stanislaw Osinski resolved SOLR-2449. - Resolution: Fixed Committed to trunk and branch_3x. Loading of Carrot2 resources from

[jira] [Resolved] (SOLR-2448) Upgrade Carrot2 to version 3.5.0

2011-05-16 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stanislaw Osinski resolved SOLR-2448. - Resolution: Fixed Committed to trunk and branch_3x. Upgrade Carrot2 to version 3.5.0

[jira] [Resolved] (SOLR-2505) Output cluster scores

2011-05-16 Thread Stanislaw Osinski (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Stanislaw Osinski resolved SOLR-2505. - Resolution: Fixed Committed to trunk and branch_3x. Output cluster scores

[jira] [Updated] (LUCENE-3084) MergePolicy.OneMerge.segments should be ListSegmentInfo not SegmentInfos

2011-05-16 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3084: -- Attachment: LUCENE-3084-trunk-only.patch Here updated patch that removes some ListSI usage

[jira] [Commented] (LUCENE-3102) Few issues with CachingCollector

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034091#comment-13034091 ] Michael McCandless commented on LUCENE-3102: Patch looks great Shai -- +1 to

[jira] [Commented] (LUCENE-3084) MergePolicy.OneMerge.segments should be ListSegmentInfo not SegmentInfos

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034093#comment-13034093 ] Michael McCandless commented on LUCENE-3084: Uwe, this looks like a great

[jira] [Commented] (LUCENE-3090) DWFlushControl does not take active DWPT out of the loop on fullFlush

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034095#comment-13034095 ] Michael McCandless commented on LUCENE-3090: Patch looks good but hairy

[jira] [Created] (SOLR-2521) TestJoin.testRandom fails

2011-05-16 Thread Michael McCandless (JIRA)
TestJoin.testRandom fails - Key: SOLR-2521 URL: https://issues.apache.org/jira/browse/SOLR-2521 Project: Solr Issue Type: Bug Reporter: Michael McCandless Fix For: 4.0 Hit this random

[jira] [Assigned] (LUCENE-3100) IW.commit() writes but fails to fsync the N.fnx file

2011-05-16 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer reassigned LUCENE-3100: --- Assignee: Simon Willnauer IW.commit() writes but fails to fsync the N.fnx file

[jira] [Commented] (SOLR-2519) Improve the defaults for the text field type in default schema.xml

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034101#comment-13034101 ] Michael McCandless commented on SOLR-2519: -- I think the attached patch is a good

[jira] [Assigned] (LUCENE-2027) Deprecate Directory.touchFile

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned LUCENE-2027: -- Assignee: Michael McCandless Deprecate Directory.touchFile

[jira] [Updated] (LUCENE-2027) Deprecate Directory.touchFile

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-2027: --- Attachment: LUCENE-2027.patch Patch, removing Dir.touchFile from trunk. For 3.x

[jira] [Commented] (LUCENE-3090) DWFlushControl does not take active DWPT out of the loop on fullFlush

2011-05-16 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034103#comment-13034103 ] Simon Willnauer commented on LUCENE-3090: - Thanks mike for review and testing!!

[jira] [Updated] (LUCENE-3084) MergePolicy.OneMerge.segments should be ListSegmentInfo not SegmentInfos

2011-05-16 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Schindler updated LUCENE-3084: -- Attachment: LUCENE-3084-trunk-only.patch New patch that also has BalancedMergePolicy from

[jira] [Commented] (SOLR-2519) Improve the defaults for the text field type in default schema.xml

2011-05-16 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034120#comment-13034120 ] Yonik Seeley commented on SOLR-2519: I think maybe there's a misconception that the

[jira] [Commented] (SOLR-2520) Solr creates invalid jsonp strings

2011-05-16 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034151#comment-13034151 ] Hoss Man commented on SOLR-2520: I'm confused here: As far as i can tell, the

[jira] [Commented] (SOLR-2519) Improve the defaults for the text field type in default schema.xml

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034154#comment-13034154 ] Michael McCandless commented on SOLR-2519: -- bq. I think maybe there's a

[jira] [Commented] (SOLR-2519) Improve the defaults for the text field type in default schema.xml

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034158#comment-13034158 ] Michael McCandless commented on SOLR-2519: -- It's also spooky that text fieldType

[jira] [Commented] (SOLR-2520) Solr creates invalid jsonp strings

2011-05-16 Thread Benson Margulies (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034159#comment-13034159 ] Benson Margulies commented on SOLR-2520: Fun happens when you specify something in

[jira] [Updated] (LUCENE-3098) Grouped total count

2011-05-16 Thread Martijn van Groningen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Martijn van Groningen updated LUCENE-3098: -- Attachment: LUCENE-3098.patch Attached patch with the discussed changes. 3x

[jira] [Commented] (SOLR-2519) Improve the defaults for the text field type in default schema.xml

2011-05-16 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034172#comment-13034172 ] Hoss Man commented on SOLR-2519: I feel like we are convoluting two issues here: the

[jira] [Commented] (SOLR-2519) Improve the defaults for the text field type in default schema.xml

2011-05-16 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034176#comment-13034176 ] Hoss Man commented on SOLR-2519: bq. Also: existing users would be unaffected by this?

[jira] [Updated] (LUCENE-3102) Few issues with CachingCollector

2011-05-16 Thread Shai Erera (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shai Erera updated LUCENE-3102: --- Component/s: (was: contrib/*) modules/grouping Few issues with

[jira] [Updated] (SOLR-2520) JSONResponseWriter w/json.wrf can produce invalid javascript depending on unicode chars in response data

2011-05-16 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man updated SOLR-2520: --- Summary: JSONResponseWriter w/json.wrf can produce invalid javascript depending on unicode chars in response

[jira] [Commented] (SOLR-2519) Improve the defaults for the text field type in default schema.xml

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034185#comment-13034185 ] Michael McCandless commented on SOLR-2519: -- bq. Bottom line: it's less confusing

[jira] [Commented] (SOLR-2520) JSONResponseWriter w/json.wrf can produce invalid javascript depending on unicode chars in response data

2011-05-16 Thread Benson Margulies (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034187#comment-13034187 ] Benson Margulies commented on SOLR-2520: I'd vote for the later. I assume that

[jira] [Updated] (LUCENE-3103) create a simple test that indexes and searches byte[] terms

2011-05-16 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-3103: Attachment: LUCENE-3103.patch attached is a first patch... maybe Uwe won't be able to resist

[jira] [Commented] (SOLR-2520) JSONResponseWriter w/json.wrf can produce invalid javascript depending on unicode chars in response data

2011-05-16 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2520?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034197#comment-13034197 ] Yonik Seeley commented on SOLR-2520: It looks like we already escape \u2028 (see

[jira] [Updated] (LUCENE-3098) Grouped total count

2011-05-16 Thread Martijn van Groningen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Martijn van Groningen updated LUCENE-3098: -- Attachment: LUCENE-3098.patch Attached a new patch. * Renamed

[jira] [Commented] (SOLR-2519) Improve the defaults for the text field type in default schema.xml

2011-05-16 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034203#comment-13034203 ] Robert Muir commented on SOLR-2519: --- As someone frustrated by this (but who would

[jira] [Commented] (LUCENE-3098) Grouped total count

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034214#comment-13034214 ] Michael McCandless commented on LUCENE-3098: Looks great Martijn! I'll

[jira] [Assigned] (LUCENE-3098) Grouped total count

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless reassigned LUCENE-3098: -- Assignee: Michael McCandless Grouped total count ---

[jira] [Commented] (LUCENE-3103) create a simple test that indexes and searches byte[] terms

2011-05-16 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034217#comment-13034217 ] Robert Muir commented on LUCENE-3103: - one thing i did previously (seemed overkill

[jira] [Commented] (LUCENE-3103) create a simple test that indexes and searches byte[] terms

2011-05-16 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034220#comment-13034220 ] Uwe Schindler commented on LUCENE-3103: --- Reflection should work correct. No need to

[jira] [Commented] (LUCENE-3103) create a simple test that indexes and searches byte[] terms

2011-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034224#comment-13034224 ] Michael McCandless commented on LUCENE-3103: +1 -- this is a great test to

[jira] [Updated] (LUCENE-3098) Grouped total count

2011-05-16 Thread Martijn van Groningen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Martijn van Groningen updated LUCENE-3098: -- Attachment: LUCENE-3098-3x.patch Great! Attached the 3x backport. Grouped

[jira] [Updated] (LUCENE-3100) IW.commit() writes but fails to fsync the N.fnx file

2011-05-16 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-3100: Attachment: LUCENE-3100.patch here is a patch sync'ing the file on successful write

[jira] [Commented] (LUCENE-3092) NRTCachingDirectory, to buffer small segments in a RAMDir

2011-05-16 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13034242#comment-13034242 ] Simon Willnauer commented on LUCENE-3092: - mike I attached a patch to LUCENE-3100

  1   2   >