Antwort: RE: Group.query
I think what you need is facetting, or is this another thing? http://searchhub.org/dev/2009/09/02/faceted-search-with-solr/ Peter Kirk p...@alpha-solutions.dk schrieb am 26.09.2012 12:18:32: Von: Peter Kirk p...@alpha-solutions.dk An: solr-user@lucene.apache.org solr-user@lucene.apache.org Datum: 26.09.2012 12:19 Betreff: RE: Group.query Thanks. Yes I can do this - but doesn't it mean I need to execute a query per group? What I really want to do (and I'm sorry I'm not so good at explaining) is to execute one query for products, and receive results grouped by the groups - but where a particular product may be found in several groups. For example, I'd like to execute a query for all products which match bucket. There are several products which are buckets, each of which can belong to several groups. Would it be possible to generate a query which would return the groups, each with a list of the buckets? Example result, with 3 groups, and several products (which may occur in several groups). Children_sand_toys Castle bucket Plain bucket Boys_toys Castle bucket Truck bucket Girls_toys Castle bucket Large Pony bucket Thanks, Peter -Original Message- From: Ingar Hov [mailto:ingar@gmail.com] Sent: 26. september 2012 11:57 To: solr-user@lucene.apache.org Subject: Re: Group.query I hope I understood the question, if so this may be a solution: Why don't you make the field group for product multiple? Example: field name=group type=string indexed=true stored=true multiValued=true/ If the product is a member of group1 and group2, just add both for the product document so that each product has an array of group. Then you can easily get all products for group1 by doing query: group:group1 Regards, Ingar On Wed, Sep 26, 2012 at 10:48 AM, Peter Kirk p...@alpha-solutions.dk wrote: Thanks. Yes, the only solution I could think of was to execute several queries. I would like it to be a single query if at all possible. If anyone has ideas I could look into that would be great. Thanks, Peter -Original Message- From: Aditya [mailto:findbestopensou...@gmail.com] Sent: 26. september 2012 10:41 To: solr-user@lucene.apache.org Subject: Re: Group.query Hi You are doing AND search, so you are getting results prod1 and prod2. I guess, you should query only for group1 and another query for group2. Regards Aditya www.findbestopensource.com On Wed, Sep 26, 2012 at 12:26 PM, Peter Kirk p...@alpha-solutions.dk wrote: Hi I have products which belong to one or more groups. Products are documents in Solr, while the groups are fields (eg. group_1_bool:true). For example: Prod1 = group1, group2 Prod2 = group1, group2 Prod3 = group1 Prod4 = group2 I would like to execute a query which results in the groups with their products. That is, the result should be something like: Group1 = Prod1, Prod2, Prod3 Group2 = Prod1, Prod2, Prod4 How can I do this? I've been looking at group.query, but I don't think this is what I want. For example, q=*:*group.query=group_1_bool:true+AND+group_2_bool:true Results in 1 group called group_1_bool:true AND group_2_bool:true, which contains prod1 and prod2. Thanks, Peter
Antwort: Re: Solr Cell Questions
Thank you Erick for your respone, I've already tried what you've suggested and got some out of memory exceptions. Because of this i like the solution with solr Cell where i can send the file directly to solr via stream and don't collect them in my memory. And another question that came to my mind, how many documents per minute, second, what ever can i put into solr. Say XML format and from 100kb to 100MB. Is there a number or is it to dependent from hardware and settings? Best Johannes Erick Erickson erickerick...@gmail.com schrieb am 25.09.2012 00:22:26: Von: Erick Erickson erickerick...@gmail.com An: solr-user@lucene.apache.org Datum: 25.09.2012 00:23 Betreff: Re: Solr Cell Questions If you're concerned about throughput, consider moving all the SolrCell (Tika) processing off the server. SolrCell is way cool for showing what can be done, but its downside is you're moving all the processing of the structured documents to the same machine doing the indexing. Pretty soon, especially with significant size files, you're spending all your CPU cycles parsing the files... Happens there's a blog about this: http://searchhub.org/dev/2012/02/14/indexing-with-solrj/ By moving the indexing to N clients, you can increase throughput until you make Solr work hard to do the indexing Best Erick On Mon, Sep 24, 2012 at 10:04 AM, johannes.schwendin...@blum.com wrote: Hi, Im currently experimenting with Solr Cell to index files to Solr. During this some questions came up. 1. Is it possible (and wise) to connect to Solr Cell with multiple Threads at the same time to index several documents at the same time? This question came up because my prrogramm takes about 6hours to index round 35000 docs. (no production environment, only example solr and a little desktop machine but I think its very slow, and I know solr isn't the bottleneck (yet)) 2. If 1 is possible, how many Threads should do this and how many memory Solr needs? I've tried it but i run into an out of memory exception. Thanks in advantage Best Regards Johannes
Antwort: Re: Re: Solr Cell Questions
The difference with solr cell is, that i'am sending every single document to solr cell and don't collect them until i have a couple of them in my memory. Using mainly the code form here: http://wiki.apache.org/solr/ExtractingRequestHandler#SolrJ Erick Erickson erickerick...@gmail.com schrieb am 25.09.2012 15:47:34: Von: Erick Erickson erickerick...@gmail.com An: solr-user@lucene.apache.org Datum: 25.09.2012 15:48 Betreff: Re: Re: Solr Cell Questions bq: how many documents per minute, second, what ever can i put into solr Too many variables to say. I've seen several thousand truly simple docs/sec. But since you're doing the Tika processing that's probably going to be your limiting factor. And it'll be many fewer... I don't understand your OOM issue when running Tika on the client. Or, rather, why you think using SolrCell makes this different. SolrCell also uses Tika. So my suspicion it that your client-side process simply isn't allocating much memory to the JVM, did you try bumping the memory on your client? Best Erick On Tue, Sep 25, 2012 at 5:23 AM, johannes.schwendin...@blum.com wrote: Thank you Erick for your respone, I've already tried what you've suggested and got some out of memory exceptions. Because of this i like the solution with solr Cell where i can send the file directly to solr via stream and don't collect them in my memory. And another question that came to my mind, how many documents per minute, second, what ever can i put into solr. Say XML format and from 100kb to 100MB. Is there a number or is it to dependent from hardware and settings? Best Johannes Erick Erickson erickerick...@gmail.com schrieb am 25.09.2012 00:22:26: Von: Erick Erickson erickerick...@gmail.com An: solr-user@lucene.apache.org Datum: 25.09.2012 00:23 Betreff: Re: Solr Cell Questions If you're concerned about throughput, consider moving all the SolrCell (Tika) processing off the server. SolrCell is way cool for showing what can be done, but its downside is you're moving all the processing of the structured documents to the same machine doing the indexing. Pretty soon, especially with significant size files, you're spending all your CPU cycles parsing the files... Happens there's a blog about this: http://searchhub.org/dev/2012/02/14/indexing-with-solrj/ By moving the indexing to N clients, you can increase throughput until you make Solr work hard to do the indexing Best Erick On Mon, Sep 24, 2012 at 10:04 AM, johannes.schwendin...@blum.com wrote: Hi, Im currently experimenting with Solr Cell to index files to Solr. During this some questions came up. 1. Is it possible (and wise) to connect to Solr Cell with multiple Threads at the same time to index several documents at the same time? This question came up because my prrogramm takes about 6hours to index round 35000 docs. (no production environment, only example solr and a little desktop machine but I think its very slow, and I know solr isn't the bottleneck (yet)) 2. If 1 is possible, how many Threads should do this and how many memory Solr needs? I've tried it but i run into an out of memory exception. Thanks in advantage Best Regards Johannes
Solr Cell Questions
Hi, Im currently experimenting with Solr Cell to index files to Solr. During this some questions came up. 1. Is it possible (and wise) to connect to Solr Cell with multiple Threads at the same time to index several documents at the same time? This question came up because my prrogramm takes about 6hours to index round 35000 docs. (no production environment, only example solr and a little desktop machine but I think its very slow, and I know solr isn't the bottleneck (yet)) 2. If 1 is possible, how many Threads should do this and how many memory Solr needs? I've tried it but i run into an out of memory exception. Thanks in advantage Best Regards Johannes
Antwort: Re: Antwort: Re: Query during a query
The problem is, that I don't know how to do this. :P My sequence: the user enters his search words. This is sent to solr. There I need to make another query first to get metadata from the index. with this metadata I have to connect to an external source to get some information about the user. With this information and the first search words I query then the solr index to get the search result. I hope its clear now wheres my problem and what I want to do Regards, Johannes Von: Jack Krupansky j...@basetechnology.com An: solr-user@lucene.apache.org Datum: 31.08.2012 15:03 Betreff: Re: Antwort: Re: Query during a query So, just do another query before doing the main query. What's the problem? Be more specific. Walk us through the sequence of processing that you need. -- Jack Krupansky -Original Message- From: johannes.schwendin...@blum.com Sent: Friday, August 31, 2012 1:52 AM To: solr-user@lucene.apache.org Subject: Antwort: Re: Query during a query Thanks for the answer, but I want to know how I can do a seperate query before the main query. And I only want this data in my programm. The user won't see it. I need the values from one field to get some information from an external source while the main query is executed. pravesh suyalprav...@yahoo.com schrieb am 31.08.2012 07:42:48: Von: pravesh suyalprav...@yahoo.com An: solr-user@lucene.apache.org Datum: 31.08.2012 07:43 Betreff: Re: Query during a query Did you checked SOLR Field Collapsing/Grouping. http://wiki.apache.org/solr/FieldCollapsing http://wiki.apache.org/solr/FieldCollapsing If this is what you are looking for. Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/ Query-during-a-query-tp4004624p4004631.html Sent from the Solr - User mailing list archive at Nabble.com.
Query during a query
Hi list, I want to get distinct data from a single solr field when ever a search query is started by an user. How can I do this? Regards, Johannes
Antwort: Re: Query during a query
Thanks for the answer, but I want to know how I can do a seperate query before the main query. And I only want this data in my programm. The user won't see it. I need the values from one field to get some information from an external source while the main query is executed. pravesh suyalprav...@yahoo.com schrieb am 31.08.2012 07:42:48: Von: pravesh suyalprav...@yahoo.com An: solr-user@lucene.apache.org Datum: 31.08.2012 07:43 Betreff: Re: Query during a query Did you checked SOLR Field Collapsing/Grouping. http://wiki.apache.org/solr/FieldCollapsing http://wiki.apache.org/solr/FieldCollapsing If this is what you are looking for. Thanx Pravesh -- View this message in context: http://lucene.472066.n3.nabble.com/ Query-during-a-query-tp4004624p4004631.html Sent from the Solr - User mailing list archive at Nabble.com.
Antwort: Re: Antwort: Re: refiltering search results
Von: Ahmet Arslan iori...@yahoo.com An: solr-user@lucene.apache.org Datum: 29.08.2012 10:50 Betreff: Re: Antwort: Re: refiltering search results Thanks for the answer. My next question is how can i filter the result or how to replace the old ResponseBuilder Result with a new one? --- On Wed, 8/29/12, johannes.schwendin...@blum.com johannes.schwendin...@blum.com wrote: From: johannes.schwendin...@blum.com johannes.schwendin...@blum.com Subject: Antwort: Re: refiltering search results To: solr-user@lucene.apache.org Date: Wednesday, August 29, 2012, 8:22 AM The main idea is to filter results as much as possible with solr an then check this result again. To do this I have to read some information from some fields of the documents in the result. At the moment I am trying to do this in the process method of a Search Component. But I even dont know how to get access to the search results or the index Fields of the documents. I have thought of ResponseBuilder.getResults() but after I have the DocListandSet Object I get stuck. You can read information from some fields using DocListandSet with org.apache.solr.util.SolrPluginUtils#docListToSolrDocumentList method.
LateBinding
Hello, Has anyone ever implementet the security feature called late-binding? I am trying this but I am very new to solr and I would be very glad if I would get some hints to this. Regards, Johannes
refiltering search results
Hello, Im trying to develop a search component to filter the search results agein with current data so that the user only sess results he is permitted to see. Can someone give me a hint where to start and how to do this? Is a Search Component the right place to do this? Regards Johannes
Antwort: Re: refiltering search results
The main idea is to filter results as much as possible with solr an then check this result again. To do this I have to read some information from some fields of the documents in the result. At the moment I am trying to do this in the process method of a Search Component. But I even dont know how to get access to the search results or the index Fields of the documents. I have thought of ResponseBuilder.getResults() but after I have the DocListandSet Object I get stuck. I know the time of the search will increase but security has priority Regards, Johannes Von: Alexandre Rafalovitch arafa...@gmail.com An: solr-user@lucene.apache.org Datum: 28.08.2012 16:48 Betreff: Re: refiltering search results I think there was a JOIN example (for version 4) somewhere with the permission restrictions. Or, if you have very broad categories, you can use different search handlers with restriction queries baked in. These might be enough. Otherwise, you have to send the list of IDs back and forth and it could be expensive. Regards, Alex. Personal blog: http://blog.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch - Time is the quality of nature that keeps events from happening all at once. Lately, it doesn't seem to be working. (Anonymous - via GTD book) On Tue, Aug 28, 2012 at 9:28 AM, johannes.schwendin...@blum.com wrote: Hello, Im trying to develop a search component to filter the search results agein with current data so that the user only sess results he is permitted to see. Can someone give me a hint where to start and how to do this? Is a Search Component the right place to do this? Regards Johannes