Re: Fastest way to import big amount of documents in SolrCloud

2014-05-02 Thread Alexander Kanarsky
If you build your index in Hadoop, read this (it is about the Cloudera Search but in my understanding also should work with Solr Hadoop contrib since 4.7)

Re: timeAllowed in not honoring

2014-05-02 Thread Toke Eskildsen
On Thu, 2014-05-01 at 23:38 +0200, Shawn Heisey wrote: I was surprised to read that fc uses less memory. I think that is an error in the documentation. Except for special cases, such as asking for all facet values on a high cardinality field, I would estimate that enum uses less memory than fc.

Re: timeAllowed in not honoring

2014-05-02 Thread Toke Eskildsen
On Thu, 2014-05-01 at 23:03 +0200, Aman Tandon wrote: So can you explain how enum is faster than default. The fundamental difference is than enum iterates terms and counts how many of the documents associated to the terms are in the hits, while fc iterates all hits and updates a counter for the

Re: Block Join Score Highlighting

2014-05-02 Thread StrW_dev
Mikhail Khludnev wrote Hello, Score support is addressed at https://issues.apache.org/jira/browse/SOLR-5882. Highlighting is another story. be aware of http://heliosearch.org/expand-block-join/ it might somehow useful for your problem. Thx for the reply! My score question is answered

Re: XSLT Caching Warning

2014-05-02 Thread Christopher Gross
I have a few transforms that I need to do, but I turned set the cache lifetime very high. I'm just trying to rectify error messages that pop up. If it's something that I can ignore, then that's OK, I just wanted to be sure. Thanks! -- Chris On Thu, May 1, 2014 at 10:32 PM, Alexandre

Export big extract from Solr to [My]SQL

2014-05-02 Thread Per Steffensen
Hi I want to make extracts from my Solr to MySQL. Any tools around that can help med perform such a task? I find a lot about data-import from SQL when googling, but nothing about export/extract. It is not all of the data in Solr I need to extract. It is only documents that full fill a normal

Re: Export big extract from Solr to [My]SQL

2014-05-02 Thread Siegfried Goeschl
Hi Per, basically I see three options * use a lot of memory to scope with huge result sets * user result set paging * SOLR 4.7 supports cursors (https://issues.apache.org/jira/browse/SOLR-5463) Cheers, Siegfried Goeschl On 02.05.14 13:32, Per Steffensen wrote: Hi I want to make extracts

Displaying ExternalFileField values in CSVResponse - Solr 4.6

2014-05-02 Thread Sanjeev Pragada
Hi,nbsp; nbsp;We are using Solr4.6 to index and search our ecommerce product details. We are using ExternalFileField option to incorporate some ranking signals.nbsp;The problem I am facing currently is that the values of ExternalFileField are not displayed in the CSVResponse of the solr.

PostingHighlighter complains about no offsets

2014-05-02 Thread Michael Sokolov
I've been wanting to try out the PostingsHighlighter, so I added storeOffsetsWithPositions to my field definition, enabled the highlighter in solrconfig.xml, reindexed and tried it out. When I issue a query I'm getting this error: |field 'text' was indexed without offsets, cannot highlight

Re: What are the best practices on Multiple Language support in Solr Cloud ?

2014-05-02 Thread Nicole Lacoste
Hi Shamik, I don't have an answer for you, just a couple of comments. Why not use dynamic field definitions in the schema? As you say most of your fields are not analysed you just add a language tag _en, _fr, _de, ...) to the field when you index or query. Then you can add languages as you need

Roll up query with original facets

2014-05-02 Thread Darin Amos
Hello All, I am having a query issue I cannot seem to find the correct answer for. I am searching against a list of items and returning facets for that list of items. I would like to group the result set on a field such as a “parentItemId”. parentItemId maps to other documents within the same

Re: PostingHighlighter complains about no offsets

2014-05-02 Thread Michael Sokolov
I checked using the analysis admin page, and I believe there are offsets being generated (I assume start/end=offsets). So IDK I am going to try reindexing again. Maybe I neglected to reload the config before I indexed last time. -Mike On 05/02/2014 09:34 AM, Michael Sokolov wrote: I've

Re: Displaying ExternalFileField values in CSVResponse - Solr 4.6

2014-05-02 Thread Ahmet Arslan
Hi Sanjeev, Here is the relevant jira : https://issues.apache.org/jira/browse/SOLR-5423 which has fix versions 4.7.1, 4.8, 5.0. So I recommend to use/download latest 4.8.0 version. Ahmet On Friday, May 2, 2014 2:46 PM, Sanjeev Pragada sanje...@rediff.co.in wrote: Hi,nbsp; nbsp;We are

Re: Block Join Score Highlighting

2014-05-02 Thread Mikhail Khludnev
On Fri, May 2, 2014 at 2:34 PM, StrW_dev r.j.bamb...@structweb.nl wrote: Mikhail Khludnev wrote Hello, Score support is addressed at https://issues.apache.org/jira/browse/SOLR-5882. Highlighting is another story. be aware of http://heliosearch.org/expand-block-join/ it might somehow

Re: Export big extract from Solr to [My]SQL

2014-05-02 Thread simon
The cursor-based deep paging in 4.7+ works very well and the performance on large extracts (for us, maybe up to 100K documents) is excellent, though it will obviously depend on the number and size of fields that you need to pull. I wrote a Perl module to do the extractions from Solr without

Re: Searching for tokens does not return any results

2014-05-02 Thread Erick Erickson
bq: but this index was created using a Java program using Lucene interface Elaborating a bit on Koji's comment... The fact that you used Lucene to index the doc means that the analysis page is almost, but not quite entirely, useless on the indexing side. It's looking at your field definition in

Re: RE : Shards don't return documents in same order

2014-05-02 Thread Erick Erickson
Francois: Yes, there are several means to examine the raw terms in the index. The admin/schema-browser page TermsComponent: https://cwiki.apache.org/confluence/display/solr/The+Terms+Component Luke the schema-browser is all set up for you, it's easiest. The TermsComponent should be

Re: Fastest way to import big amount of documents in SolrCloud

2014-05-02 Thread Erick Erickson
re: optimize after every import This is not recommended in 4.x unless and until you have evidence that it really does help, reviews are very mixed, and it's been renamed force merge in 4.x just so people don't think Of course I want to do this, who wouldn't?. bq: Doing a commit instead of

Re: Roll up query with original facets

2014-05-02 Thread Erick Erickson
I think this might be what you're looking for.. http://wiki.apache.org/solr/SimpleFacetParameters#Multi-Select_Faceting_and_LocalParams Best, Erick On Fri, May 2, 2014 at 7:19 AM, Darin Amos dari...@gmail.com wrote: Hello All, I am having a query issue I cannot seem to find the correct answer

ANNOUNCE: Apache Solr Reference Guide for 4.8

2014-05-02 Thread Chris Hostetter
The Lucene PMC is pleased to announce that there is a new version of the Solr Reference Guide available for Solr 4.8. The 396 page PDF serves as the definitive user's manual for Solr 4.8. It can be downloaded from the Apache mirror network:

Can't use 2 highlighting components in the same solrconfig

2014-05-02 Thread Cario, Elaine
Hoping someone can help me... I'm trying to use both the PostingsHighlighter and the FastVectorHighlighter in the same solrconfig (selection driven by different request handlers), but once I define 2 search components in the config, it always picks the Postings Highlighter (even if I never

RE: Searching for tokens does not return any results

2014-05-02 Thread Yetkin Ozkucur
Erick, Koji, Ahmet: Thank you all for your answers! I think I found the problem and I am on the right track to fix it. 1- As you suggested the problem was in the Java code populating the index. The analyzer in the Java code had to be consistent with the one defined in SOLR. I was able to

Re: Searching for tokens does not return any results

2014-05-02 Thread Erick Erickson
Glad to hear it! You shouldn't really have to customize the analyzer to get it to behave as it would if you just used Solr to ingest documents, just chain things together. That's what Solr does after all. Of course you may have special needs that are better served by more customization.

Spellchecking - looking for general advice

2014-05-02 Thread Maciej Dziardziel
Hi I was looking at spellcheck (Direct and FileBased) and testing that they can do. Direct works fine most of the time, but I'd like to find solution for few corner cases: 1) having recruted and recruiter in index, recruter should suggest the latter. Obviously the distance to the former is

Re: ANNOUNCE: Apache Solr Reference Guide for 4.8

2014-05-02 Thread Alexandre Rafalovitch
Somebody should create an offline search interface for it. :-) Regards, Alex On 02/05/2014 11:53 pm, Chris Hostetter hoss...@apache.org wrote: The Lucene PMC is pleased to announce that there is a new version of the Solr Reference Guide available for Solr 4.8. The 396 page PDF serves as