RE: Faceted search not working?

2010-05-25 Thread Birger Lie
Hi, try http://localhost:8080/solr/select/?q=YOUR-QUERY&facet=true&facet.field=title I don't think the boolean fields are mapped to on and off :) -birger -Original Message- From: Ilya Sterin [mailto:ster...@gmail.com] Sent: 24. mai 2010 23:11 To: solr-user@lucene.apache.org Subject:

Tagging and excluding Filters

2010-05-25 Thread Lukas Kahwe Smith
Hi, I am using the following solution: http://wiki.apache.org/solr/SimpleFacetParameters#Tagging_and_excluding_Filters However when I implemented this on I found that I cannot combine different filter types: http://search.un-informed.org/search?q==t[23]=malariatm=anys=Search The above request

How well does Solr scale over large number of facet values?

2010-05-25 Thread Andy
I want to facet over a field group. Since group is created by users, potentially there can be a huge number of values for group. - Would Solr be able to handle a use case like this? Or is Solr not really appropriate for facet fields with a large number of values? - I understand that I can set

Re: Apache or Nginx In front of SOLR?

2010-05-25 Thread Paul Dhaliwal
It depends on what kind of load you are talking about and what your expertise is. NGINX does perform better than apache for most people, however fewer people know about NGINX than apache. If you have more than 100K searchers a day doing a few searches each, you will benefit from NGINX. If your

RE: Highlighting is not happening

2010-05-25 Thread Doddamani, Prakash
Hey, I thought the Highlights would happen in the field of the documents returned from SOLR J But it gives new list of Highlighting at below, sorry for the confusion I was wondering is there a way that the fields returned itself contains bold characters Eg : if searched for query doc

Re: Faceted search not working?

2010-05-25 Thread Sascha Szott
Hi Birger, Birger Lie wrote: I don't think the boolean fields are mapped to on and off :) You can use true and on interchangeably. -Sascha -birger -Original Message- From: Ilya Sterin [mailto:ster...@gmail.com] Sent: 24. mai 2010 23:11 To: solr-user@lucene.apache.org Subject:

Re: sort by field length

2010-05-25 Thread Sascha Szott
Hi Erick, Erick Erickson wrote: Are you sure you want to recompute the length when sorting? It's the classic time/space tradeoff, but I'd suggest that when your index is big enough to make taking up some more space a problem, it's far too big to spend the cycles calculating each term length for

Re: Highlighting is not happening

2010-05-25 Thread Sascha Szott
Hi, to accomplish that, use the highlighting parameters hl.simple.pre and hl.simple.post. By the way, there are plenty of other parameters that affect highlighting. Take a look at: http://wiki.apache.org/solr/HighlightingParameters -Sascha Doddamani, Prakash wrote: Hey, I thought the
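A minimal sketch of how the two parameters Sascha names might be sent along with a query. The base URL, the query `doc`, the field `title`, and the class and method names are assumptions made for illustration; `hl` and `hl.fl` are the standard switches that turn highlighting on and pick the field. The point of the example is that the `<b>` and `</b>` markers have to be URL-encoded like any other parameter value.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of Solr or SolrJ.
class HighlightQuery {

    // URL-encode one key or value as UTF-8.
    static String enc(String s) {
        try {
            return URLEncoder.encode(s, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException(e); // UTF-8 is always available
        }
    }

    // Build a select URL that asks Solr to wrap matches in <b>...</b>.
    static String build(String selectUrl, String query, String field) {
        Map<String, String> params = new LinkedHashMap<String, String>();
        params.put("q", query);
        params.put("hl", "true");
        params.put("hl.fl", field);
        params.put("hl.simple.pre", "<b>");
        params.put("hl.simple.post", "</b>");

        StringBuilder sb = new StringBuilder(selectUrl);
        char sep = '?';
        for (Map.Entry<String, String> e : params.entrySet()) {
            sb.append(sep).append(enc(e.getKey())).append('=').append(enc(e.getValue()));
            sep = '&';
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Assumed local Solr instance and field name.
        System.out.println(build("http://localhost:8983/solr/select", "doc", "title"));
    }
}
```

The remaining options on the wiki page linked above can be added to the same map.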

Re: Apache or Nginx In front of SOLR?

2010-05-25 Thread Kranti™ K K Parisa
Thanks Paul, I shall continue doing some more R&D with your inputs. Best Regards, Kranti K K Parisa On Tue, May 25, 2010 at 12:54 PM, Paul Dhaliwal subp...@gmail.com wrote: It depends on what kind of load you are talking about and what your expertise is. NGINX does perform better than

Re: How well does Solr scale over large number of facet values?

2010-05-25 Thread Marc Sturlese
With the uninverted algorithm it will be very fast whatever is the number of unique terms. But be careful with the memory because it uses quite a lot. Using the oldest facet algorithm, if you have a lot of different terms it will be slow. -- View this message in context:

Re: Problem with extended dismax, minus prefix (to mean NOT) and interaction with mm?

2010-05-25 Thread Erik Hatcher
This looks like a case where the extended dismax parser is creating a Lucene QueryParser parsed query rather than a disjunction maximum query. A case of too much magic maybe? Looks like this one should be parsed quite differently. Try dismax and see what you get, it'll be quite

Re: How well does Solr scale over large number of facet values?

2010-05-25 Thread Marc Sturlese
Since Solr 1.4 I think the uninverted method is on by default. Anyway, you can choose which to use with the method param: facet.method=fc/enum (where fc is the uninverted one) http://wiki.apache.org/solr/SimpleFacetParameters -- View this message in context:
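The method switch Marc describes could be exercised with a request like the one this sketch builds. The host, the query `solr`, the field name `group` (taken from Andy's question) and the class itself are assumptions; the query and field are assumed to be URL-safe already, so no encoding is done here.

```java
import java.util.Locale;

// Hypothetical helper, not part of Solr or SolrJ.
class FacetQuery {

    // The two facet.method values mentioned in the thread.
    enum Method { FC, ENUM }

    // Build a faceting request; query and facetField must already be URL-safe.
    static String build(String selectUrl, String query, String facetField, Method method) {
        return selectUrl
                + "?q=" + query
                + "&facet=true"
                + "&facet.field=" + facetField
                + "&facet.method=" + method.name().toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        String base = "http://localhost:8080/solr/select"; // assumed host and port
        System.out.println(build(base, "solr", "group", Method.FC));
        System.out.println(build(base, "solr", "group", Method.ENUM));
    }
}
```

Running the same query with both values and comparing QTime and heap use is probably the quickest way to see the trade-off for a given index.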

Using solrJ to get all fields in a particular schema/index

2010-05-25 Thread Rakhi Khatwani
Hi, Is there any way to get all the fields (irrespective of whether it contains a value or null) in solrDocument. or Is there any way to get all the fields in schema.xml of the url link ( http://localhost:8983/solr/core0/)?? Regards, Raakhi

Re: Using solrJ to get all fields in a particular schema/index

2010-05-25 Thread findbestopensource
To retrieve all documents, you need to use the query/filter *FieldName:*:** Regards Aditya www.findbestopensource.com On Tue, May 25, 2010 at 4:14 PM, Rakhi Khatwani rkhatw...@gmail.com wrote: Hi, Is there any way to get all the fields (irrespective of whether it contains a value

Re: How real-time are Solr/Lucene queries?

2010-05-25 Thread Grant Ingersoll
How many docs are in the batch you are pulling down? How many docs/second do you expect on the index size? How big are the docs? What do you expect in terms of queries per second? How fast do new documents need to be available on the local server? How much analysis do you have to do?

Re: Using solrJ to get all fields in a particular schema/index

2010-05-25 Thread findbestopensource
To retrieve all documents, you need to use the query/filter *FieldName:*:** Regards Aditya www.findbestopensource.com On Tue, May 25, 2010 at 4:14 PM, Rakhi Khatwani rkhatw...@gmail.com wrote: Hi, Is there any way to get all the fields (irrespective of whether it contains a value or

Re: Using solrJ to get all fields in a particular schema/index

2010-05-25 Thread findbestopensource
Resending it as there was a typo. To retrieve all documents, you need to use the query/filter FieldName:*:* . Regards Aditya www.findbestopensource.com On Tue, May 25, 2010 at 4:29 PM, findbestopensource findbestopensou...@gmail.com wrote: To retrieve all documents, you need to use

Re: Machine utilization while indexing

2010-05-25 Thread Thijs
Hi all, I did some further investigation and (after turning off some filters in yourkit) found that it was actually the machine sending the files to solr that was slowing things down. At first I couldn't find this as it turned out that yourkit hides org.apache.* classes. When I removed this

Re: Using solrJ to get all fields in a particular schema/index

2010-05-25 Thread Rakhi Khatwani
Hi Aditya, I can retrieve all documents, but I cannot retrieve all the fields in a document (if a field does not have any value). For example, I get a list of documents; some of the documents have a value for the title field, and others might not contain a value for the title field. In any case I need

Re: Using solrJ to get all fields in a particular schema/index

2010-05-25 Thread findbestopensource
If a field doesn't have a value, You will get NULL on retrieving it. How could you expect a value for a field which is not provided? You have two options, choose either one.. 1. If the fieldvalue is returned NULL then display a proper error / user defined message. Handle the error. 2. Add a

Re: Tagging and excluding Filters

2010-05-25 Thread Lukas Kahwe Smith
On 25.05.2010, at 08:55, Lukas Kahwe Smith wrote: Now when I deselect one of the checkboxes I add an fq parameters:

Re: sort by field length

2010-05-25 Thread Erick Erickson
Ah, I may have misunderstood, I somehow got it in my mind you were talking about the length of each term (as in string length). But if you're looking at the field length as the count of terms, that's another question, sorry for the confusion... I have to ask, though, why you want to sort this

Re: Faceted search not working?

2010-05-25 Thread Jean-Sebastien Vachon
Is the FacetComponent loaded at all? <requestHandler name="standard" class="solr.SearchHandler" default="true"> <arr name="components"> <str>query</str> <str>facet</str> </arr> </requestHandler> On 2010-05-25, at 3:32 AM, Sascha Szott wrote: Hi Birger, Birger Lie wrote: I don't think the boolean

question about indexing...

2010-05-25 Thread Jörg Agatz
I have a task: I must index a lot of e-mails, so I will create a script to generate an XML file from the mails. Now the question is, what happens when I create a field body and in this field comes a lot of or like this: Confidentiality Caution: This message and all its included content and

Re: IndexSearcher and Caches

2010-05-25 Thread Rahul R
Chris, I am using SolrIndexSearcher to get a handle to the total number of records in the index. I am doing it like this : int num = Integer.parseInt((String)solrSearcher.getStatistics().get("numDocs").toString()); Please let me know if there is a better way to do this. Mark, I can tell you what I

Re: Faceted search not working?

2010-05-25 Thread Sascha Szott
Hi, please note, that the FacetComponent is one of the six search components that are automatically associated with solr.SearchHandler (this holds also for the QueryComponent). Another note: By using name=components all default components will be replaced by the components you explicitly

Re: question about indexing...

2010-05-25 Thread Erik Hatcher
Well, you'll just have to create valid XML, either encoding some characters or using CDATA sections. Erik On May 25, 2010, at 10:06 AM, Jörg Agatz wrote: I have a task: I must index a lot of e-mails, so I will create a script to generate an XML file from the mails. Now the
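The two options Erik mentions (encoding the offending characters, or wrapping the text in a CDATA section) can be sketched as follows. The class name, the `body` field name and the sample text are assumptions for illustration, and the field name is assumed to be a plain identifier that needs no escaping of its own.

```java
// Hypothetical helper for generating Solr update XML from a script.
class XmlFieldText {

    // Option 1: replace the characters that are illegal in XML text content.
    static String escapeXml(String text) {
        StringBuilder sb = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            switch (c) {
                case '&': sb.append("&amp;"); break;
                case '<': sb.append("&lt;"); break;
                case '>': sb.append("&gt;"); break;
                default:  sb.append(c);
            }
        }
        return sb.toString();
    }

    // Option 2: wrap the text in CDATA. The sequence "]]>" cannot appear
    // inside a CDATA section, so it is split across two adjacent sections.
    static String cdata(String text) {
        return "<![CDATA[" + text.replace("]]>", "]]]]><![CDATA[>") + "]]>";
    }

    // Emit one <field> element using option 1.
    static String field(String name, String value) {
        return "<field name=\"" + name + "\">" + escapeXml(value) + "</field>";
    }

    public static void main(String[] args) {
        String body = "a < b & c > d"; // made-up mail text
        System.out.println(field("body", body));
        System.out.println("<field name=\"body\">" + cdata(body) + "</field>");
    }
}
```

Either form parses to the same text, so the choice does not affect how the field is indexed or searched.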

Re: question about indexing...

2010-05-25 Thread Jörg Agatz
OK, done. But now I can't find any word in the CDATA field. I use: <field name="P_CONTENT_ITEMS_COMMENT"><![CDATA[ Hallo leute. mein name ist dein name und wir wollen eigentlich nur unsere Ruhe haben. <b>ich du er sie es</b> Ha ha Ha ha ha ha ha ha ha ha ]]></field> It is a string field, multivalued..

Re: question about indexing...

2010-05-25 Thread Erik Hatcher
You have to provide more details than that. We need to know the field definition for that named field, the corresponding field type definition, and the exact request you're making to Solr that you think should find this document. And most importantly, did you <commit/> :) Erik On

Re: question about indexing...

2010-05-25 Thread Jörg Agatz
I created a new index, but nothing changed. <field name="COMMENT" type="string" indexed="true" stored="true" multiValued="true"/> <field name="COMMENT"><![CDATA[ Hallo leute. mein name ist dein name und wir wollen eigentlich nur unsere Ruhe haben. <b>ich du er sie es</b> Ha ha Ha ha ha ha ha ha ha ha ]]></field> I

Re: caching on unique queries

2010-05-25 Thread Chris Hostetter
: Pretty much every one of my queries is going to be unique. However, the : query is fairly complex and also contains both unique and non-unique : data. In the query, some fields will be unique (e.g description), but : other fields will be fairly common (e.g. category). If we could use :

Help me understand query syntax of subqueries

2010-05-25 Thread Tigi Scramble
Any idea why this query returns 0 records: sexual assault AND (-obama) while this one returns 1400 ? sexual assault AND -(obama) Some debug info: sexual assault AND (-obama), translates to: +text:sexual assault +(-text:obama), returns 0 records sexual assault AND -(obama), translates

Re: How real-time are Solr/Lucene queries?

2010-05-25 Thread Jason Rutherglen
The main issue is if you're using facets, which are currently inefficient for the realtime use case because they're created on the entire set of segment/readers. Field caches in Lucene are per segment and so don't have this problem. On Tue, May 25, 2010 at 4:09 AM, Grant Ingersoll

Re: Problem with extended dismax, minus prefix (to mean NOT) and interaction with mm?

2010-05-25 Thread Chris Hostetter
: I'm running edismax (on both a 1.4 with patch and a branch_3x version) and : I'm seeing something I don't expect. ... : <str name="rawquerystring">dog cat -trilogy</str> : <str name="querystring">dog cat -trilogy</str> : <str name="parsedquery">allfields:dog allfields:cat : -allfields:trilogi</str> :

Does SOLR provide a java class to perform url-encoding

2010-05-25 Thread JohnRodey
I would like to leverage on whatever SOLR provides to properly url-encode a search string. For example a user enters: mr. bill oh no The URL submitted by the admin page is:

Re: Does SOLR provide a java class to perform url-encoding

2010-05-25 Thread Sean Timm
Java provides one. You probably want to use utf-8 as the encoding scheme. http://java.sun.com/javase/6/docs/api/java/net/URLEncoder.html Note you will also want to strip or escape characters that are meaningful in the Solr/Lucene query syntax.
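Sean's two steps (escape what the query parser treats as syntax, then URL-encode the result as UTF-8) might look like the sketch below. The class and method names are made up, and the set of special characters is an assumption based on the Lucene query syntax documentation; SolrJ also ships a similar helper, `ClientUtils.escapeQueryChars`, which is probably the better choice when SolrJ is on the classpath.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

// Hypothetical helper, not part of Solr or SolrJ.
class SolrQueryEncoding {

    // Characters assumed to be meaningful to the Lucene query parser.
    private static final String SPECIAL = "\\+-!():^[]\"{}~*?|&";

    // Step 1: put a backslash in front of every special character.
    static String escapeQueryChars(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (SPECIAL.indexOf(c) >= 0) {
                sb.append('\\');
            }
            sb.append(c);
        }
        return sb.toString();
    }

    // Step 2: URL-encode the (possibly escaped) string as UTF-8.
    static String urlEncode(String s) {
        try {
            return URLEncoder.encode(s, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException(e); // UTF-8 is always available
        }
    }

    public static void main(String[] args) {
        // Plain user input: only URL encoding is needed.
        System.out.println(urlEncode("mr. bill oh no"));               // mr.+bill+oh+no
        // A literal question mark: escape first, then encode.
        System.out.println(urlEncode(escapeQueryChars("oh n?")));     // oh+n%5C%3F
        // A phrase query: the quotes are syntax, so they are not escaped.
        System.out.println(urlEncode("\"mr. bill\""));                // %22mr.+bill%22
    }
}
```

The order matters: the backslashes are part of the query string, so they must be added before URL encoding, which then turns each one into %5C.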

Re: How real-time are Solr/Lucene queries?

2010-05-25 Thread Thomas J. Buhr
My documents are all quite small if not down right tiny, there is not much analysis to do. I plan to mainly use Solr for indexing application configuration data which there is a lot of and I have all pre-formated. Since it is a music application there are many score templates, scale and rhythm

Re: Using solrJ to get all fields in a particular schema/index

2010-05-25 Thread Chris Hostetter
:Is there any way to get all the fields (irrespective of whether : it contains a value or null) in solrDocument. no. a document only has Field instances for the fields which it has values for. it's also not a feature that would even be theoretically possible to add, because of

Re: Faceted search not working?

2010-05-25 Thread Ilya Sterin
Sascha thanks for the response, here is the output... <?xml version="1.0" encoding="UTF-8"?> <response> <lst name="responseHeader"> <int name="status">0</int> <int name="QTime">0</int> <lst name="params"> <str name="wt">xml</str> <str name="q">title:*</str> <str name="fl">title</str> </lst> </lst> <result

Re: Faceted search not working? (RESOLVED)

2010-05-25 Thread Ilya Sterin
Ah, the issue was explicitly specifying components... <arr name="components"> <str>query</str> </arr> I don't remember changing this during default install, commenting this out enabled faceted search component. Thanks all for the help. Ilya On Tue, May 25, 2010 at 10:38 AM, Sascha Szott sz...@zib.de

Re: Does SOLR provide a java class to perform url-encoding

2010-05-25 Thread JohnRodey
Thanks Sean, that was exactly what I need. One question though... How to correctly retain the Solr specific characters. I tried adding escape chars but URLEncoder doesn't seem to care about that: Example: String s1 = "\"mr. bill\" oh n?"; String s2 = "\\\"mr. bill\\\" oh n\\?"; String encoded1 =

Re: SOLR-343 date facet mincount patch

2010-05-25 Thread Umesh_
Hoss, I was able to successfully apply the patch SOLR-343, and even after applying the patch, date facet minCount does not work. The relevant parts of the response are given below: [responseHeader] = object(SolrObject)#107 (3) { [status] = int(0) [QTime] = int(4) [params] =

Re: SOLR-343 date facet mincount patch

2010-05-25 Thread Umesh_
Chris, Please ignore the repeated response header due to typo in the previous message. ~Umesh -- View this message in context: http://lucene.472066.n3.nabble.com/Re-SOLR-343-date-facet-mincount-patch-tp789556p842863.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Indexing stalls reads

2010-05-25 Thread Lance Norskog
This sounds like you have the same solrconfig for the slave and the master? You should turn off autoCommit on the slave. Only the master should autoCommit. You should set up the ReplicationHandler. This moves index updates from the indexer to the query server.

Solr read-only core

2010-05-25 Thread Yao
Is there a way to open a Solr index/core in read-only mode? -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-read-only-core-tp843049p843049.html Sent from the Solr - User mailing list archive at Nabble.com.

RE: Solr read-only core

2010-05-25 Thread Markus Jelsma
Hi, I'd guess there are two ways of doing this, but I've never seen any solrconfig.xml file having any directives that explicitly do not allow for updates. You'd either have a proxy in front that simply won't allow any other HTTP method than GET and HEAD, or you could remove the update

Re: IndexSearcher and Caches

2010-05-25 Thread Lance Norskog
The stats.jsp page walks the internal JMX beans. It prints out the numbers of documents among other things. I would look at how that works instead of writing your own thing for the internal APIs. They may have changed from Solr 1.3 to 1.4 and will change further for 1.5 (4.0 is the new name?).

Re: question about indexing...

2010-05-25 Thread Lance Norskog
Change type="string" to type="text". This causes the field to be analyzed and then searching on words finds the document. On Tue, May 25, 2010 at 8:34 AM, Jörg Agatz joerg.ag...@googlemail.com wrote: I created a new index, but nothing changed. <field name="COMMENT" type="string" indexed="true"

Enhancing Solr relevance functions through predefined constants

2010-05-25 Thread Prasanna R
Hi all, I have a suggestion for improving relevance functions in Solr by way of providing access to a set of pre-defined constants in Solr queries. Specifically, the number of documents indexed, the number of unique terms in a field, the total number of terms in a field, etc. are some of the

Re: Debugging - DIH Delta Queries-

2010-05-25 Thread Chris Hostetter
: Subject: Debugging - DIH Delta Queries- : References: : 1659766275.5213.1274376509278.javamail.r...@vicenza.dmz.lexum.pri : In-Reply-To: : 1659766275.5213.1274376509278.javamail.r...@vicenza.dmz.lexum.pri http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing

Re: Solr Cell and encrypted pdf files

2010-05-25 Thread Chris Hostetter
: I can't seem to get solr cell to index password protected pdf files. : I can't figure out how to pass the password to tika and looking at : ExtractingDocumentLoader, : it doesn't seem to pass any pdf password related metadata to the tika parser. I suspect you are correct, i don't think anyone

Re: question about indexing...

2010-05-25 Thread Erick Erickson
Don't forget to re-index after you make the change Lance suggested... Erick On Tue, May 25, 2010 at 4:51 PM, Lance Norskog goks...@gmail.com wrote: Change type="string" to type="text". This causes the field to be analyzed and then searching on words finds the document. On Tue, May 25, 2010 at

Re: Solr Delta Queries

2010-05-25 Thread Chris Hostetter
: <field name="indexed_timestamp" type="date" indexed="true" stored="true" default="NOW" multiValued="false"/> : For some reason when doing delta indexing via DIH, this field is not being updated. : : Are timestamp fields updated during DELTA updates? timestamp fields aren't treated any differently than

Re: solr caches from external caching system like memcached

2010-05-25 Thread Chris Hostetter
: Is it possible to use solr caches such as query cache , filter cache : and document cache from external caching system like memcached as it : has several advantages such as centralized caching system and reducing the : pause time of JVM 's garbage collection as we can assign less

Re: Solr highlighter and custom queries?

2010-05-25 Thread Chris Hostetter
: Actually, its not as much a Solr problem as a Lucene one, as it turns : out, the WeightedSpanTermExtractor is in Lucene and not Solr. : : Why they decided to only highlight queries that are in Lucene I don't : know, but what I did to solve this problem was simply to make my queries :

Re: Full Import failed

2010-05-25 Thread Chris Hostetter
: yes i am running 1.5, Any idea how we can run Solr 1.4 using Java 1.5 Solr 1.4 works just fine with Java 1.5 -- even when Using the DataImportHandler. there are some features of DIH like the ScriptTransformer that requires java 1.6, but that's not your issue... : Last I encountered that

Re: Full Import failed

2010-05-25 Thread Mohamed Parvez
I am just using the solr.war file that came with the Solr 1.4 download on weblogic. I did not add or remove any jar. On Tue, May 25, 2010 at 9:54 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : yes i am running 1.5, Any idea how we can run Solr 1.4 using Java 1.5 Solr 1.4 works