[jira] Updated: (SOLR-176) Add detailed timing data to query response output

2007-05-17 Thread Will Johnson (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Will Johnson updated SOLR-176:
--

Attachment: RequesthandlerBase.patch

added some average stats to RequestHandlerBase.  all of the same info can be 
obtained by parsing the log files but having it show up on the admin screens 
and jmx is simple and nice to have.  stats added: avgTimePerRequest and 
avgRequestsPerSecond.

 Add detailed timing data to query response output
 -

 Key: SOLR-176
 URL: https://issues.apache.org/jira/browse/SOLR-176
 Project: Solr
  Issue Type: New Feature
  Components: search
Affects Versions: 1.2
Reporter: Mike Klaas
 Assigned To: Mike Klaas
Priority: Minor
 Fix For: 1.2

 Attachments: dtiming.patch, dtiming.patch, RequesthandlerBase.patch


 see 
 http://www.nabble.com/%27accumulate%27-copyField-for-faceting-tf3329986.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-239) Read IndexSchema from InputStream instead of Config file

2007-05-17 Thread Otis Gospodnetic (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12496606
 ] 

Otis Gospodnetic commented on SOLR-239:
---

Minor comment: if you use the same name for the patch file, JIRA will 
automatically gray out the older one, thus making it easier to spot which 
patches are out of date, and which one is the fresh one to look at.

Thanks for the patch, this sounds useful.


 Read IndexSchema from InputStream instead of Config file
 

 Key: SOLR-239
 URL: https://issues.apache.org/jira/browse/SOLR-239
 Project: Solr
  Issue Type: Improvement
Affects Versions: 1.2
 Environment: all
Reporter: Will Johnson
Priority: Minor
 Fix For: 1.2

 Attachments: IndexSchemaStream.patch, IndexSchemaStream2.patch


 Soon to follow patch adds a constructor to IndexSchema to allow them to be 
 created directly from InputStreams.  The overall logic for the Core's use of 
 the IndexSchema creation/use does not change however this allows java clients 
 like those in SOLR-20 to be able to parse an IndexSchema.  Once a schema is 
 parsed, the client can inspect an index's capabilities which is useful for 
 building generic search UI's.  ie provide a drop down list of fields to 
 search/sort by.  

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-208) RSS feed XSL example

2007-05-17 Thread Otis Gospodnetic (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12496607
 ] 

Otis Gospodnetic commented on SOLR-208:
---

+1 for including this.  I imagine a lot of people will want this to provide 
subscribe to search results for a saved search type functionality.


 RSS feed XSL example
 

 Key: SOLR-208
 URL: https://issues.apache.org/jira/browse/SOLR-208
 Project: Solr
  Issue Type: New Feature
  Components: clients - java
Affects Versions: 1.2
Reporter: Brian Whitman
 Assigned To: Hoss Man
Priority: Trivial
 Attachments: rss.xsl


 A quick .xsl file for transforming solr queries into RSS feeds. To get the 
 date and time in properly you'll need an XSL 2.0 processor, as in 
 http://wiki.apache.org/solr/XsltResponseWriter .  Tested to work with the 
 example solr distribution in the nightly.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-208) RSS feed XSL example

2007-05-17 Thread Brian Whitman (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12496612
 ] 

Brian Whitman commented on SOLR-208:


I'm not involved or familiar with the RSS wars but I will say that this is a 
quick example of getting RSS out of Solr in the easiest possible way. It worked 
on every single browser and newsreader I tried it on. There's no reason we 
can't also include an atom_example.xsl as well. I don't understand why you 
would use GData for this at all, but i am probably out of my league there.

+1 for adding except remove the XSL2.0 references, not worth the  confusion, 
date handling is not necessary for the exampledocs test case.


 RSS feed XSL example
 

 Key: SOLR-208
 URL: https://issues.apache.org/jira/browse/SOLR-208
 Project: Solr
  Issue Type: New Feature
  Components: clients - java
Affects Versions: 1.2
Reporter: Brian Whitman
 Assigned To: Hoss Man
Priority: Trivial
 Attachments: rss.xsl


 A quick .xsl file for transforming solr queries into RSS feeds. To get the 
 date and time in properly you'll need an XSL 2.0 processor, as in 
 http://wiki.apache.org/solr/XsltResponseWriter .  Tested to work with the 
 example solr distribution in the nightly.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-236) Field collapsing

2007-05-17 Thread Otis Gospodnetic (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12496617
 ] 

Otis Gospodnetic commented on SOLR-236:
---

Question:
Do you need collapse=true when you can detect whether collapse.field has been 
specified or not?


 Field collapsing
 

 Key: SOLR-236
 URL: https://issues.apache.org/jira/browse/SOLR-236
 Project: Solr
  Issue Type: New Feature
  Components: search
Affects Versions: 1.2
Reporter: Emmanuel Keller
 Attachments: collapse_field.patch, collapse_field.patch, 
 field_collapsing.patch, field_collapsing.patch


 This patch include a new feature called Field collapsing.
 Used in order to collapse a group of results with similar value for a given 
 field to a single entry in the result set. Site collapsing is a special case 
 of this, where all results for a given web site is collapsed into one or two 
 entries in the result set, typically with an associated more documents from 
 this site link. See also Duplicate detection.
 http://www.fastsearch.com/glossary.aspx?m=48amid=299
 The implementation add 4 new query parameters (SolrParams):
 collapse set to true to enable collapsing.
 collapse.field to choose the field used to group results
 collapse.type normal (default value) or adjacent
 collapse.max to select how many continuous results are allowed before 
 collapsing
 TODO (in progress):
 - More documentation (on source code)
 - Test cases

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-208) RSS feed XSL example

2007-05-17 Thread Walter Underwood (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12496624
 ] 

Walter Underwood commented on SOLR-208:
---

I wasn't in the RSS wars, either, but I was on the Atom working group. That was 
a bunch of volunteers making a clean, testable spec for RSS functionality 
(http://www.ietf.org/rfc/rfc4287). RSS 2.0 has some bad ambiguities, especially 
around ampersand and entities in titles. The default has changed over the years 
and clients do different, incompatible things.

GData is just a way to do search result stuff that we would need anyway. It is 
standard set of URL parameters for query, start-index, and categories, and a 
few Atom extensions for total results, items per page, and next/previous.

http://code.google.com/apis/gdata/reference.html


 RSS feed XSL example
 

 Key: SOLR-208
 URL: https://issues.apache.org/jira/browse/SOLR-208
 Project: Solr
  Issue Type: New Feature
  Components: clients - java
Affects Versions: 1.2
Reporter: Brian Whitman
 Assigned To: Hoss Man
Priority: Trivial
 Attachments: rss.xsl


 A quick .xsl file for transforming solr queries into RSS feeds. To get the 
 date and time in properly you'll need an XSL 2.0 processor, as in 
 http://wiki.apache.org/solr/XsltResponseWriter .  Tested to work with the 
 example solr distribution in the nightly.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Code style

2007-05-17 Thread Otis Gospodnetic
Hi,

Touchy topic: code style

Should Solr be following the Lucene code formatting/style?  Lucene follows 
Sun's recommendation except for the 2-space indent, I believe.  I'm asking 
because Solr is full of variable and method names that look like abbrevs ;) - 
e.g. getDocListC - C?, and on top of that the code is rather 
dense,without,many,spaces, so it's hard to read, at least for me.  Over the 
years I must have learned to deal with a bunch of other stylistic differences, 
but the code density is still hard, I find.  Sparse braincells?

How do the rest of you feel?  I volunteer to tidy up the code, if others agree 
with following Lucene's formating.  I believe Nutch and Hadoop already follow 
it.

Thanks,
Otis
 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Simpy -- http://www.simpy.com/  -  Tag  -  Search  -  Share




[jira] Commented: (SOLR-208) RSS feed XSL example

2007-05-17 Thread Otis Gospodnetic (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12496649
 ] 

Otis Gospodnetic commented on SOLR-208:
---

Plus there is contrib/gdata-server under Lucene waiting to be used, so there 
already is gData-something in Luceneland.


 RSS feed XSL example
 

 Key: SOLR-208
 URL: https://issues.apache.org/jira/browse/SOLR-208
 Project: Solr
  Issue Type: New Feature
  Components: clients - java
Affects Versions: 1.2
Reporter: Brian Whitman
 Assigned To: Hoss Man
Priority: Trivial
 Attachments: rss.xsl


 A quick .xsl file for transforming solr queries into RSS feeds. To get the 
 date and time in properly you'll need an XSL 2.0 processor, as in 
 http://wiki.apache.org/solr/XsltResponseWriter .  Tested to work with the 
 example solr distribution in the nightly.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



Re: Code style

2007-05-17 Thread Yonik Seeley

On 5/17/07, Otis Gospodnetic [EMAIL PROTECTED] wrote:

Touchy topic: code style


Uh, oh... something everyone will have an opinion about ;-)


Should Solr be following the Lucene code formatting/style?  Lucene follows 
Sun's recommendation except for the 2-space indent, I believe.


Well, that's the general guidelines... there is a ton of lucene code
that doesn't follow that though.  One violation repeated everywhere
is

if (foo)
  bar();

Those don't bother me personally, I'm just pointing it out.


I'm asking because Solr is full of variable and method names that look like 
abbrevs ;)
- e.g. getDocListC - C?


Heh... I never realized abbreviations were off-limits.
In this particular case, I needed to refactor getDocList into a
caching version and a non caching version (C) and (NC).


, and on top of that the code is rather dense,without,many,spaces, so

it's hard to read, at least for me.

Spaces between lines, or spaces in a single line?

I tend to compress code where the logic is easy to understand...
sometimes spreading simple things out make it harder to see everything
in context.


How do the rest of you feel?  I volunteer to tidy up the code, if others agree 
with following Lucene's formating.  I believe Nutch and Hadoop already follow 
it.


Solr already has a policy that is the same as Lucene.
I'm fine with cleanups... just try to avoid breaking patches in JIRA.

-Yonik


Re: Code style

2007-05-17 Thread Chris Hostetter

:  How do the rest of you feel?  I volunteer to tidy up the code, if
:  others agree with following Lucene's formating.  I believe Nutch and
:  Hadoop already follow it.
:
: Solr already has a policy that is the same as Lucene.
: I'm fine with cleanups... just try to avoid breaking patches in JIRA.

Once upon a time i thought the idea of cleaning up the formatting of code
in commits that *only* changed formatting was a good idea ... but even if
hte message is clear that it's just a formatting commit, it still creates
noise, makes some patches harder to apply, and moves line numbers arround
reducing the number of situations where you can make educated guessees
about what went wrongwhen looking at a stack trace from someone without
knowing exactly which version they were using.

I'd certianly prefer if all new code and new changes met teh style
guidelines we have setup -- tidying up the lines that are being changed
anyway as part of the commit, but frankly i'd just as soon leave code that
works but isn't super pretty well enough alone.


-Hoss



Re: Code style

2007-05-17 Thread Otis Gospodnetic
Hi,



- Original Message 

From: Yonik Seeley [EMAIL PROTECTED]

To: solr-dev@lucene.apache.org

Sent: Thursday, May 17, 2007 2:05:41 PM

Subject: Re: Code style



On 5/17/07, Otis Gospodnetic [EMAIL PROTECTED] wrote:




 Should Solr be following the Lucene code formatting/style?  Lucene follows 
 Sun's recommendation except for the 2-space indent, I believe.



Well, that's the general guidelines... there is a ton of lucene code

that doesn't follow that though.  One violation repeated everywhere

is



if (foo)

   bar();



Those don't bother me personally, I'm just pointing it out.



OG: yes, that's one of those things my brain must have learned to read easily 
either way.  What's missing here?  Curly braces?



 I'm asking because Solr is full of variable and method names that look like 
 abbrevs ;)

 - e.g. getDocListC - C?



Heh... I never realized abbreviations were off-limits.

In this particular case, I needed to refactor getDocList into a

caching version and a non caching version (C) and (NC).



OG: Si, I realized that as I read the core more, but it wasn't obvious to me 
immediately.  How quickly things become obvious is important, I think.



, and on top of that the code is rather dense,without,many,spaces, so

it's hard to read, at least for me.



Spaces between lines, or spaces in a single line?



OG: Heh, good question.  Again, spaces or not between lines are easy for me, 
it's the lack of spaces in a single line that make things hard to read-kind of 
likethings should be made as simple as possible, but not any 
simpler.-A.Einstein would be hard to read.  Mental token parsing - 
WhiteSpaceBrainTokenizer, I guess.


I tend to compress code where the logic is easy to understand...

sometimes spreading simple things out make it harder to see everything

in context.



OG: I agree.  This made me learn to appreciate vertically tight code sometimes.

 How do the rest of you feel?  I volunteer to tidy up the code, if others 
 agree with following Lucene's formating.  I believe Nutch and Hadoop already 
 follow it.



Solr already has a policy that is the same as Lucene.

I'm fine with cleanups... just try to avoid breaking patches in JIRA.



OG: Right.  Right to what Hoss said in his reply, too.
OG: Luckily, man patch shows  -l  or  --ignore-whitespace, which might help.

Otis





Re: Code style

2007-05-17 Thread Otis Gospodnetic
Hi,

- Original Message 
From: Chris Hostetter [EMAIL PROTECTED]
I'd certianly prefer if all new code and new changes met teh style
guidelines we have setup -- tidying up the lines that are being changed
anyway as part of the commit, but frankly i'd just as soon leave code that
works but isn't super pretty well enough alone.

OG: That would be ideal.  However, sometimes you want to work on the existing 
code and the hard-to-parse style makes it harder for you to follow and 
understand the code.  I'm facing that now with SolrIndexSearcher, which is what 
promoted this thread.

Otis






[jira] Commented: (SOLR-230) make post.jar support better args for using tutorial

2007-05-17 Thread Carsten (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12496669
 ] 

Carsten commented on SOLR-230:
--

Being from the unix fraction: 
Why is there a need for -Ddata=stdin ?
Just make it read from stdin if no arguments are given. 
Good old unix tradition.

And by the way: Why would you use system properties to pass parameters instead 
of simply using a parameter like

java -jar post.jar -url http://localhost:8983/solr/update *.xml 
java -jar post.jar -data deletequeryname:DDR/query/delete 

Just my EUR 0.02.

 make post.jar support better args for using tutorial
 

 Key: SOLR-230
 URL: https://issues.apache.org/jira/browse/SOLR-230
 Project: Solr
  Issue Type: New Feature
  Components: update
Reporter: Hoss Man
 Assigned To: Hoss Man
 Attachments: SOLR-230.patch


 SOLR-86 create post.jar which eliminated the need for post.sh ... but as 
 noticed in 
 SOLR-164 there are still some cases in the tutorial that require direct use 
 of curl (deleting) and there are some nice things about post.sh that post.jar 
 doesn't support (defaulting the URL)
 this issue is to tackle some of the ideas Bertrand and I posted as a comment 
 in SOLR-86 after it was resolved
 Bertrand Delacretaz [19/Feb/07 12:35 AM] ...
 Considering the tutorial examples 
 (http://lucene.apache.org/solr/tutorial.html), it'd be useful to allow this 
 to POST its standard input, or the contents of a command-line parameter: ...
 Hoss Man [19/Feb/07 11:50 AM]
 yeah ... i think we should hardcode http://localhost:8983/solr/update with a 
 possible override by system prop, then add either a command line switch other 
 another system prop indicating to use the command line as filenames or as raw 
 data, and another op for stdin.
 java -jar -Ddata=files post.jar *.xml
 java -jar post.jar *.xml ... data=files being the default
 echo deletequeryname:DDR/query/delete | java -jar -Ddata=stdin 
 post.jar
 cat *.xml | java -jar -Ddata=stdin post.jar
 java -jar -Ddata=args post.jar deletequeryname:DDR/query/delete
 java -jar -Durl=http://localhost:8983/solr/update post.jar *.xml 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Commented: (SOLR-230) make post.jar support better args for using tutorial

2007-05-17 Thread Hoss Man (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12496674
 ] 

Hoss Man commented on SOLR-230:
---

To answer both questions: i did it that way just to try and keep the code 
simple and explicit.  i figured using system props would help make the 
execution examples in the tutorial self documenting, while keeping the simple 
uses cases very simple, and eliminating the need for any complex getopt style 
argv parsing.

 make post.jar support better args for using tutorial
 

 Key: SOLR-230
 URL: https://issues.apache.org/jira/browse/SOLR-230
 Project: Solr
  Issue Type: New Feature
  Components: update
Reporter: Hoss Man
 Assigned To: Hoss Man
 Attachments: SOLR-230.patch


 SOLR-86 create post.jar which eliminated the need for post.sh ... but as 
 noticed in 
 SOLR-164 there are still some cases in the tutorial that require direct use 
 of curl (deleting) and there are some nice things about post.sh that post.jar 
 doesn't support (defaulting the URL)
 this issue is to tackle some of the ideas Bertrand and I posted as a comment 
 in SOLR-86 after it was resolved
 Bertrand Delacretaz [19/Feb/07 12:35 AM] ...
 Considering the tutorial examples 
 (http://lucene.apache.org/solr/tutorial.html), it'd be useful to allow this 
 to POST its standard input, or the contents of a command-line parameter: ...
 Hoss Man [19/Feb/07 11:50 AM]
 yeah ... i think we should hardcode http://localhost:8983/solr/update with a 
 possible override by system prop, then add either a command line switch other 
 another system prop indicating to use the command line as filenames or as raw 
 data, and another op for stdin.
 java -jar -Ddata=files post.jar *.xml
 java -jar post.jar *.xml ... data=files being the default
 echo deletequeryname:DDR/query/delete | java -jar -Ddata=stdin 
 post.jar
 cat *.xml | java -jar -Ddata=stdin post.jar
 java -jar -Ddata=args post.jar deletequeryname:DDR/query/delete
 java -jar -Durl=http://localhost:8983/solr/update post.jar *.xml 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.