Re: Error with bin/optimize and multiple solr webapps

2007-03-07 Thread Bertrand Delacretaz

On 3/7/07, Jeff Rodenburg [EMAIL PROTECTED] wrote:

Oops, my bad I didn't see either 186 or 187 before entering 188.  :-)


I have closed SOLR-186 and SOLR-187 as duplicates, please add relevant
info to SOLR-188 if needed.

-Bertrand


Re: Hierarchical Facets

2007-03-07 Thread Andrew Nagy

Hmm ... I had a brain storm.

Could I do something like this:
collectionDir1/Subdir1/SubSubDir1/collection

Then query collection:Dir1/Subdir1 and get the facets on collection at 
that point to see all of the subsubdirectories?


Is their any better method?

Andrew

Andrew Nagy wrote:
I am running into a stumbling block and can only find a way to solve 
the problem with some sort of hierarchical faceting system.  I am in 
the process of moving my records from eXist (http://exist.sf.net) to 
Solr, but the problem is with the lack of a directory structure that 
exist has.  I figure I could easily solve this problem by setting a 
field called collection and then faceting on that field to get my 
list of directories however this will not allow me to do 
subdirectories.


Any ideas on how to implement such a thing in solr?

Thanks!
Andrew



Re: [2] SQL Update

2007-03-07 Thread Chris Hostetter

: It would be useful to have a solr setting for stripping the dynamic field
: suffix/prefix on index field name to get back the original field name. Does
: it make sense?

sorry, i'm really not following this talk of stripping the dynamic field
name to get back the original field name ... what does that mean?

it seems like what you are describing is having a schema consisting
entirely of dynamic fields where teh suffix determines hte datatype, but
you don't wnat to have to use that suffix when interacting with Solr ...
but if i've got two dynamicFields *_i and *_s and you refer to user
in your query, how is Solr suppose to know wether you want user_i or
user_s

If you want people do be able to refer to the field user and you want it
to be an integer field, why not definie it as an explicit field instead of
a dynamic field?



-Hoss



Re: improve performance after commit

2007-03-07 Thread Chris Hostetter
: back in just now. Here's an example trying to warm using a sort on
: field name subject. I tried query of
: allMessageContent:trying;subject+asc as well as
: allMessageContent:trying;subject (without +asc) - either way

when expressing params in XML (either as init params for a request
handler, or in a QuerySenderListener the params don't need to be URL
escaped ... they just need to be XML escaped, try something like...


 listener event=newSearcher class=solr.QuerySenderListener
   arr name=queries
 lst
 str name=qallMessageContent:test; subject asc/str
 str name=start0/str
 str name=rows10/str
 /lst
   /arr
 /listener

-Hoss



Re: Hierarchical Facets

2007-03-07 Thread Chris Hostetter

Assuming I'm understanding your question correctly, something like that
would probably work just fine -- just do prefix searches on your
directories when you want to find all files in that directory or any
decendent directory, and use facet counds to see the list of decendent
directories and the number of files under each that match.



: Date: Wed, 07 Mar 2007 10:11:54 -0500
: From: Andrew Nagy [EMAIL PROTECTED]
: Reply-To: solr-user@lucene.apache.org
: To: Andrew Nagy [EMAIL PROTECTED]
: Cc: solr-user@lucene.apache.org
: Subject: Re: Hierarchical Facets
:
: Hmm ... I had a brain storm.
:
: Could I do something like this:
: collectionDir1/Subdir1/SubSubDir1/collection
:
: Then query collection:Dir1/Subdir1 and get the facets on collection at
: that point to see all of the subsubdirectories?
:
: Is their any better method?
:
: Andrew
:
: Andrew Nagy wrote:
:  I am running into a stumbling block and can only find a way to solve
:  the problem with some sort of hierarchical faceting system.  I am in
:  the process of moving my records from eXist (http://exist.sf.net) to
:  Solr, but the problem is with the lack of a directory structure that
:  exist has.  I figure I could easily solve this problem by setting a
:  field called collection and then faceting on that field to get my
:  list of directories however this will not allow me to do
:  subdirectories.
: 
:  Any ideas on how to implement such a thing in solr?
: 
:  Thanks!
:  Andrew
: 
:



-Hoss



Re: [2] Highlighting problems with HTML tagged fields

2007-03-07 Thread Chris Hostetter

It is tracked in http://issues.apache.org/jira/browse/SOLR-42

...there are currently no patches.


: Date: Tue, 6 Mar 2007 15:04:25 -0800 (PST)
: From: nick19701 [EMAIL PROTECTED]
: Reply-To: solr-user@lucene.apache.org
: To: solr-user@lucene.apache.org
: Subject: Re: [2] Highlighting problems with HTML tagged fields
:
:
:
: Yonik Seeley wrote:
: 
:  HTMLStripWhitespaceTokenizerFactory works in two phases...
:  HTMLStripReader removes the HTML and passes the result to
:  WhitespaceTokenizer... at that point, Tokens are generated, but the
:  offsets will correspond to the text after HTML removal, not before.
: 
:  I did it this way so that HTMLStripReader  could go before any
:  tokenizer (like StandardTokenizer).
: 
:  Can you open a JIRA bug for this?  The fix would be a special version
:  of HTMLStripReader integrated with a WhitespaceTokenizer to keep
:  offsets correct.
: 
:  -Yonik
: 
: 
: Is there a fix for this problem?
:
: my solr is dated on 12/17/2006. HTMLStripWhitespaceTokenizerFactory +
: highlighting still
: doesn't work. All the wrong items are highlighted.
: --
: View this message in context: 
http://www.nabble.com/Highlighting-problems-with-HTML-tagged-fields-tf2017260.html#a9343253
: Sent from the Solr - User mailing list archive at Nabble.com.
:



-Hoss



Hierarchical Facets

2007-03-07 Thread Andrew Nagy
I am running into a stumbling block and can only find a way to solve the 
problem with some sort of hierarchical faceting system.  I am in the 
process of moving my records from eXist (http://exist.sf.net) to Solr, 
but the problem is with the lack of a directory structure that exist 
has.  I figure I could easily solve this problem by setting a field 
called collection and then faceting on that field to get my list of 
directories however this will not allow me to do subdirectories.


Any ideas on how to implement such a thing in solr?

Thanks!
Andrew


Re: dynamicField Scaling

2007-03-07 Thread mark angelillo

On Mar 7, 2007, at 2:17 PM, Mike Klaas wrote:


On 3/7/07, mark angelillo [EMAIL PROTECTED] wrote:

Hello,

I've got a Solr index running and I want to use a dynamicField to
store n different sorting fields. The field that is used to actually
sort the results will be determined by the application that is
querying the index.

I'm wondering if anyone has done something similar to this, or if
anyone has an idea of how Solr will perform as the number n of
sorting fields grows larger. Is there a way to make sure this doesn't
start to slow the index down? Is there any information out there
about the number of dynamicFields that can be declared in this way
before the entire index suffers? Is there such a limit?


It's not realy about the number of dynamic fields.  The key variable
is the number of sort fields.  To sort efficiently, solr needs to
maintain a cache of field values.  This consumes memory per-field on
the order of

D x S + U

where D is the document count, S is the the size of the data type (eg.
4bytes for ints, 8 bytes for doubles, 4/8 bytes for anything else
[pointers]), and U is the cumulative size of the unique field values
(if sorting on a non-primitive type, like Strings).

If you have sufficient memory to store this data for each field you
are sorting on, you shouldn't have any problems.

best,
-Mike



Okay, makes sense.

Thanks,
Mark



xml generation for SOLR

2007-03-07 Thread netaji . k
hai,

Yes the Xml formats is understood but there is an issue to generate these
xmls
from a data source. These XML feild tags doesnot contain the same start
tags and end tags.

like field name=catsoftware/field

and standerd xml writers have xml generated as the same start and end tags.

in SOLR xml
start tag = field name=cat
end tag =  /field

can you adivise anything on this please.

regards,
aditya



Installation woes

2007-03-07 Thread Andrew Nagy
Hello, I am trying to install another copy of solr on a server.  I have 
done this many times before, but am running into errors now and I am not 
sure what is causing them.


I unzipped a copy of 1.1.0 and placed the .war file into tomcat.  Then I 
created the solr directory with my bin, data, conf directories.
When I restart tomcat (from the solr directory) and go to the Solr admin 
website everything looks fine.  But, when I pull up the statistics page, 
I only have information in the Query Handlers box, the rest are empty.  
When I click on ping, I see a 500 error from tomcat.


What am I forgetting to do?

Thanks
Andrew


Re: Installation woes

2007-03-07 Thread Yonik Seeley

On 3/7/07, Andrew Nagy [EMAIL PROTECTED] wrote:

Hello, I am trying to install another copy of solr on a server.  I have
done this many times before, but am running into errors now and I am not
sure what is causing them.

I unzipped a copy of 1.1.0 and placed the .war file into tomcat.  Then I
created the solr directory with my bin, data, conf directories.
When I restart tomcat (from the solr directory) and go to the Solr admin
website everything looks fine.  But, when I pull up the statistics page,
I only have information in the Query Handlers box, the rest are empty.
When I click on ping, I see a 500 error from tomcat.

What am I forgetting to do?


Did you point solr to the solr_home (containing your bin, data, and
conf directories)?
Or do you rely on the default of $CWD/solr?

If that's not it, check all the tomcat log files for the first
exception you see.

-Yonik


RE: Installation woes

2007-03-07 Thread Binkley, Peter
I had a problem like that when I blew away an index by deleting the
index directory instead of its parent the data directory; it seemed that
if Solr saw the data directory, it assumed the index was there. Removing
the data directory and letting Solr create it seemed to fix the problem.
(Or maybe something else got fixed at the same time, hard to say).

Peter

-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, March 07, 2007 1:12 PM
To: solr-user@lucene.apache.org
Subject: Installation woes

Hello, I am trying to install another copy of solr on a server.  I have
done this many times before, but am running into errors now and I am not
sure what is causing them.

I unzipped a copy of 1.1.0 and placed the .war file into tomcat.  Then I
created the solr directory with my bin, data, conf directories.
When I restart tomcat (from the solr directory) and go to the Solr admin
website everything looks fine.  But, when I pull up the statistics page,
I only have information in the Query Handlers box, the rest are empty.  
When I click on ping, I see a 500 error from tomcat.

What am I forgetting to do?

Thanks
Andrew


Re: Installation woes

2007-03-07 Thread Andrew Nagy
Argh!  Thanks Yonik for pointing out the log files, duh!  I had a 
malformed line in my schema.xml.  Nice feature to add down the line, 
although I know there is a lot of work going into the admin interface so 
who knows if it is already thought of.  Schema Debugger?  Maybe one day 
I will dig into the code guts and try to contribute.


Andrew


Yonik Seeley wrote:

On 3/7/07, Andrew Nagy [EMAIL PROTECTED] wrote:

Hello, I am trying to install another copy of solr on a server.  I have
done this many times before, but am running into errors now and I am not
sure what is causing them.

I unzipped a copy of 1.1.0 and placed the .war file into tomcat.  Then I
created the solr directory with my bin, data, conf directories.
When I restart tomcat (from the solr directory) and go to the Solr admin
website everything looks fine.  But, when I pull up the statistics page,
I only have information in the Query Handlers box, the rest are empty.
When I click on ping, I see a 500 error from tomcat.

What am I forgetting to do?


Did you point solr to the solr_home (containing your bin, data, and
conf directories)?
Or do you rely on the default of $CWD/solr?

If that's not it, check all the tomcat log files for the first
exception you see.

-Yonik


Re: xml generation for SOLR

2007-03-07 Thread Chris Hostetter
: like field name=catsoftware/field
:
: and standerd xml writers have xml generated as the same start and end tags.
:
: in SOLR xml
: start tag = field name=cat
: end tag =  /field

that tag is the same, the tag is field ... the field tag has a
mandatory attribute which is name

While it's certianly true that some people discourage use of XML
attributes when designing XML schema,s it doesn't change the fact that
having attributes is a very legal part of XML.

a quick google search for xml attribute turns up quite a few pages that
you may find useful, here's just a few for starters...

http://java.sun.com/webservices/jaxp/dist/1.1/docs/tutorial/overview/1_xml.html
http://www.xmlnews.org/docs/xml-basics.html#attributes
http://www.expertrating.com/courseware/XMLCourse/XML-Elements-Attributes-4.asp


-Hoss



Solr on Tomcat 6.0.10?

2007-03-07 Thread Walter Underwood
Is anyone running Solr on Tomcat 6.0.10? Any issues?
I searched the archives and didn't see anything.

wunder
-- 
Walter Underwood
Search Guru, Netflix




Re: Installation woes

2007-03-07 Thread Ryan McKinley

On 3/7/07, Andrew Nagy [EMAIL PROTECTED] wrote:

Argh!  Thanks Yonik for pointing out the log files, duh!  I had a
malformed line in my schema.xml.  Nice feature to add down the line,
although I know there is a lot of work going into the admin interface so
who knows if it is already thought of.  Schema Debugger?  Maybe one day
I will dig into the code guts and try to contribute.



I recently posted:
 https://issues.apache.org/jira/browse/SOLR-179

It stops everything and displays the errors on every url if there are
configuration errors.


Solr and Multiple Index Partitions

2007-03-07 Thread Venkatesh Seetharam

Hello there,

Howdy. I was wondering if there is a way to configure one Solr instance to
search multiple Index partitions? I read the wiki and found the entry in
SolrConfig.xml:

dataDir/var/data/solr/dataDir

Can I have mutiple directories? Comma separated?

Any help is greatly appreciated.

--
Thanks,
Venkatesh

Perfection (in design) is achieved not when there is nothing more to add,
but rather when there is nothing more to take away.
- Antoine de Saint-Exupéry


Re: Solr and Multiple Index Partitions

2007-03-07 Thread Ryan McKinley

Solr looks at one index - If you want to look at multiple indexes, you
need multiple solr instances running.  Check the wiki for how to set
that up:

 http://wiki.apache.org/solr/SolrJetty

(the resin and tomcat pages have something similar)



On 3/7/07, Venkatesh Seetharam [EMAIL PROTECTED] wrote:

Hello there,

Howdy. I was wondering if there is a way to configure one Solr instance to
search multiple Index partitions? I read the wiki and found the entry in
SolrConfig.xml:

dataDir/var/data/solr/dataDir

Can I have mutiple directories? Comma separated?

Any help is greatly appreciated.

--
Thanks,
Venkatesh

Perfection (in design) is achieved not when there is nothing more to add,
but rather when there is nothing more to take away.
- Antoine de Saint-Exupéry



Re: Solr and Multiple Index Partitions

2007-03-07 Thread Venkatesh Seetharam

Yes, I'm implementing federated search. I do have N partitions of indexes
built and I'd like to have mutiple Solr instances in a cluster each serving
atleast 2 partitions. I was wondering if I could somehow find a way to make
Solr work with atleast 2 partitions. It looks like I need to have smaller
number of partitions.

Thanks,
Venkatesh

On 3/7/07, Bradley Burke [EMAIL PROTECTED] wrote:


Venkatesh,

Are you perhapes talking about Federated Searching (
http://wiki.apache.org/solr/FederatedSearch).  I am new to Solr but this
was
a feature I was looking far as well.  I do not think its built into Solr
at
the momment.

Regards,
Brad


On 3/7/07, Venkatesh Seetharam [EMAIL PROTECTED] wrote:

 Thanks Ryan for your inputs. If I'm not using Solr webapp but wrapping
 Solr
 in plain Java, is there any way that I could get Solr to work with
 multiple
 index partitions?

 Venkatesh

 On 3/7/07, Ryan McKinley [EMAIL PROTECTED] wrote:
 
  Solr looks at one index - If you want to look at multiple indexes, you
  need multiple solr instances running.  Check the wiki for how to set
  that up:
 
http://wiki.apache.org/solr/SolrJetty
 
  (the resin and tomcat pages have something similar)
 
 
 
  On 3/7/07, Venkatesh Seetharam [EMAIL PROTECTED] wrote:
   Hello there,
  
   Howdy. I was wondering if there is a way to configure one Solr
 instance
  to
   search multiple Index partitions? I read the wiki and found the
entry
 in
   SolrConfig.xml:
  
   dataDir/var/data/solr/dataDir
  
   Can I have mutiple directories? Comma separated?
  
   Any help is greatly appreciated.
  
   --
   Thanks,
   Venkatesh
  
   Perfection (in design) is achieved not when there is nothing more
to
  add,
   but rather when there is nothing more to take away.
   - Antoine de Saint-Exupéry
  
 




Re: Solr and Multiple Index Partitions

2007-03-07 Thread Venkatesh Seetharam

Thanks Ryan for your insight. I do not wish to change Solr.


Although it seems weird, just adding a type field makes it possible

I'm not working with a DB but since we are indexing a huge vault of XML
documents, I use Hadoop + Lucene for indexing which finally generates N
number of partitions, I was thinking to use Solr as the searcher and will
have a broker which merges results from individual Solr searchers.

I do not know if I can ask another question here or start a new thread.

I use a custom Analyzer which extends Lucene's StandardAnalyzer. When I
configured Solr to use this one, It throws an exception
RuntimeException(Can't set positionIncrementGap on custom analyzer  +
analyzer.getClass()).

Do I need to extend a specific Analyzer for it to work with Solr?

Venkatesh

On 3/7/07, Ryan McKinley [EMAIL PROTECTED] wrote:


As it is now... I don't think so.  SolrCore is a static singleton
class -- without some serious reworking, i think there is only one
instance per jvm.

I think getting rid of the static singleton should go on the long term
TODO list, but that doesn't help you now.

Although it seems weird, just adding a type field makes it possible
to put things that seem like they need their own database into the
same index - you select the 'type' you want by adding +type:mytype
to your query


On 3/7/07, Venkatesh Seetharam [EMAIL PROTECTED] wrote:
 Thanks Ryan for your inputs. If I'm not using Solr webapp but wrapping
Solr
 in plain Java, is there any way that I could get Solr to work with
multiple
 index partitions?

 Venkatesh

 On 3/7/07, Ryan McKinley [EMAIL PROTECTED] wrote:
 
  Solr looks at one index - If you want to look at multiple indexes, you
  need multiple solr instances running.  Check the wiki for how to set
  that up:
 
http://wiki.apache.org/solr/SolrJetty
 
  (the resin and tomcat pages have something similar)
 
 
 
  On 3/7/07, Venkatesh Seetharam [EMAIL PROTECTED] wrote:
   Hello there,
  
   Howdy. I was wondering if there is a way to configure one Solr
instance
  to
   search multiple Index partitions? I read the wiki and found the
entry in
   SolrConfig.xml:
  
   dataDir/var/data/solr/dataDir
  
   Can I have mutiple directories? Comma separated?
  
   Any help is greatly appreciated.
  
   --
   Thanks,
   Venkatesh
  
   Perfection (in design) is achieved not when there is nothing more
to
  add,
   but rather when there is nothing more to take away.
   - Antoine de Saint-Exupéry
  
 




Re: [2] Highlighting problems with HTML tagged fields

2007-03-07 Thread nick19701


Chris Hostetter wrote:
 
 
 It is tracked in http://issues.apache.org/jira/browse/SOLR-42
 
 ...there are currently no patches.
 
 

The suggested fix from Mirko seems very simple. Hopefull a patch will be
applied 
very soon. In the meantime, I'll use my backup solution: 
http://fucoder.com/code/se-hilite/ http://fucoder.com/code/se-hilite/ 


-- 
View this message in context: 
http://www.nabble.com/Highlighting-problems-with-HTML-tagged-fields-tf2017260.html#a9363720
Sent from the Solr - User mailing list archive at Nabble.com.



Re: [2] SQL Update

2007-03-07 Thread Chris Hostetter

: I wanted to add data from relational database tables.
: To avoid defining each and every table column name in the schema, I thought
: I'll append a suffix to the field name depending on it's type.

which is fine and dandy for when you index the data, mapping your string
database column user, and your int database column id to the solr
dynamic fields user_string and id_int ...and i suppose you could make
a customized ResponseWriter that when writing out documents striped off
any suffixes it could tell came from dynamicFields so the response docs
contained str name=user and int name=id ... but when parsing the
query string your clients send, and they ask for user:42 how would the
request handler know that it shoudl rewrite that to user_string:42 and not
user_int:42 ?



-Hoss



Re: [2] Highlighting problems with HTML tagged fields

2007-03-07 Thread nick19701


Chris Hostetter wrote:
 
 
 patches for issues can't be applied until someone who cares about them
 write them and contribute them for committers to consider/apply :)
 
 

it seems I'm one of the very few people who care about this feature :)

Unfortunately my daily languages are c++ and c#. I only know a little bit
Java. Otherwise I'll contribute.

-- 
View this message in context: 
http://www.nabble.com/Highlighting-problems-with-HTML-tagged-fields-tf2017260.html#a9365098
Sent from the Solr - User mailing list archive at Nabble.com.



synonym filter fix

2007-03-07 Thread nick19701

About this synonym filter fix:
http://issues.apache.org/jira/browse/SOLR-167
http://issues.apache.org/jira/browse/SOLR-167 

I tried today's solr build. It seems the verbose analysis of the index
analyzer still has the same symptom.
should I try another build? Or maybe my tomcat is not using the latest
build?
-- 
View this message in context: 
http://www.nabble.com/synonym-filter-fix-tf3365934.html#a9365249
Sent from the Solr - User mailing list archive at Nabble.com.



Re: synonym filter fix

2007-03-07 Thread Mike Klaas

On 3/7/07, nick19701 [EMAIL PROTECTED] wrote:


About this synonym filter fix:
http://issues.apache.org/jira/browse/SOLR-167
http://issues.apache.org/jira/browse/SOLR-167

I tried today's solr build. It seems the verbose analysis of the index
analyzer still has the same symptom.
should I try another build? Or maybe my tomcat is not using the latest
build?


Using the latest solr trunk, the error I reproduced before seems fixed
(see below).  Note that if you are using term vectors with
highlighting, you'll have to reindex.

org.apache.solr.analysis.SynonymFilterFactory {expand=true,
ignoreCase=true, synonyms=synonyms.txt}
term position   1   2   3   4   5   6   7   8   
9   10  11  12  13
term text   bestbuy -   AcerAspire  AS5610-2273 -   
$599.   Windows vista,  1   GB  RAM
bestbuy gib
bb  gigabyte
gigabytes
term type   wordwordwordwordwordwordwordword
wordwordwordwordword
wordword
wordword
word
source start,end
0,8 0,8 9,1011,15   16,22   23,34   35,36   37,42   43,50   
51,57   58,59   60,62   63,66
0,8 60,62
0,8 60,62
60,62


Re[2]: Solr and Multiple Index Partitions

2007-03-07 Thread Jack L
Selecting by type will do the job. But I suppose it sacrifice
performance because having multiple document types in the same
index will render a larger index. Is it bad?

-- 
Best regards,
Jack

Wednesday, March 7, 2007, 2:15:14 PM, you wrote:

 As it is now... I don't think so.  SolrCore is a static singleton
 class -- without some serious reworking, i think there is only one
 instance per jvm.

 I think getting rid of the static singleton should go on the long term
 TODO list, but that doesn't help you now.

 Although it seems weird, just adding a type field makes it possible
 to put things that seem like they need their own database into the
 same index - you select the 'type' you want by adding +type:mytype
 to your query


__
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


Re: making an in-order query

2007-03-07 Thread Brian Whitman

id:A id:B id:C id:D

*usually* works, but I have seen D appear first in the results for
certain queries.

Is there a query I can do or a better way to accomplish this?


It's a bit of a hack, but you could use boosts to order the docs:

id:A^4 id:B^3 id:C^2 id:D^1


Gorgeous! Does the job admirably. Thanks Yonik