Re: Best way to dump out entire solr content?

2015-03-13 Thread vsriram30
Great! Thanks for providing more info Toke Eskildsen Thanks, Sriram -- View this message in context: http://lucene.472066.n3.nabble.com/Best-way-to-dump-out-entire-solr-content-tp4192734p4192892.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Facet pivot sorting while combining Stats Component With Pivots in Solr 5

2015-03-13 Thread Dominique Bejean
Thank you for the response This is something Heliosearch can do. Ionic Seeley, created a JIRA ticket to back port this feature to Solr 5. https://issues.apache.org/jira/browse/SOLR-7214 But in order to be available in Solr 5 this ticket should cover both http://heliosearch.org/json-facet-api/

get Multi-Valued field data from DocValues

2015-03-13 Thread Kevin Osborn
If I am finding the values of a long field for a single numeric field, I just do: DocValues.getNumeric(contex.reader(), myField).get(docNumber). This returns the value of the field and everything is good. However, my field is a multi-valued long field. So, I need to do:

Distributed IDF performance

2015-03-13 Thread Jack Krupansky
Does anybody have any actual performance data or even a rough formula for calculating the overhead for using the new Solr 5.0 Distributed IDF ( SOLR-1632 https://issues.apache.org/jira/browse/SOLR-1632)? And any guidance as far as which StatsInfo plugin is best to use? Are many people now using

Re: Solr 5 admin screen description

2015-03-13 Thread Shawn Heisey
On 3/13/2015 9:48 AM, davidphilip cherian wrote: I would like to understand the solr's dashboard screen, particularly the screen *system* properties displayed such as Physical Memory, Swap Space, File Desc. count and etc. 1. What is the importance of file descriptor count? 2. Dashboard shows

Re: get Multi-Valued field data from DocValues

2015-03-13 Thread Chris Hostetter
: If I am finding the values of a long field for a single numeric field, I : just do: : : DocValues.getNumeric(contex.reader(), myField).get(docNumber). This : returns the value of the field and everything is good. : : However, my field is a multi-valued long field. So, I need to do: : :

Re: get Multi-Valued field data from DocValues

2015-03-13 Thread Kevin Osborn
getSortedNumeric throws the following exception: unexpected docvalues type SORTED_SET for field 'space_list' (expected one of [SORTED_NUMERIC, NUMERIC]). Use UninvertingReader or index with docvalues. If I am reading the doumentation correctly, getSortedNumeric sorts the values, but it is still

Re: Distributed IDF performance

2015-03-13 Thread Anshum Gupta
There's no rough formula or performance data that I know of at this point. About he guidance, if you want to use Global stats, my obvious choice would be to use the LRUStatsCache. Before committing, I did run some tests on my macbook but as I said back then, they shouldn't be totally taken at face

Re: Distributed IDF performance

2015-03-13 Thread Jack Krupansky
Oops... I said StatsInfo and that should have been StatsCache (statsCache .../). -- Jack Krupansky On Fri, Mar 13, 2015 at 6:04 PM, Anshum Gupta ans...@anshumgupta.net wrote: There's no rough formula or performance data that I know of at this point. About he guidance, if you want to use

Re: Update solr schema.xml in real time for Solr 4.10.1

2015-03-13 Thread Zheng Lin Edwin Yeo
Hi Erick, The real time update of the schema means we can just do an update using REST-API curl instead of manually editing the schema.xml and restart the Solr server. In Solr 5.0, if Solr is loading the schema from the resource named in 'managedSchemaResourceName', instead of schema.xml, I can

Re: Update solr schema.xml in real time for Solr 4.10.1

2015-03-13 Thread Zheng Lin Edwin Yeo
Hi Nitin, To update the schema using Curl in Solr 5.0, we can use the following command if we are using ManagedIndexSchemaFactory: curl -X POST -H 'Content-type:application/json' --data-binary '{ add-field : { name:sell-by, type:tdate, stored:true} }'

Re: solr 4.7.2 mergeFactor/ Merge policy issue

2015-03-13 Thread Summer Shire
Hi All, Did anyone get a chance to look at my config and the InfoStream File ? I am very curious to see what you think thanks, Summer On Mar 6, 2015, at 5:20 PM, Summer Shire shiresum...@gmail.com wrote: Hi All, Here’s more update on where I am at with this. I enabled infoStream

Re: Parsing error on space

2015-03-13 Thread Jack Krupansky
First, as a general rule you must escape spaces in URL query parameters, either as a plus sign or %20. That said, I suspect that the code that extracts nested queries is stopping at the space. Try escaping the space with a backslash. Then you could try replacing the space in the nested query

Re: Distributed IDF performance

2015-03-13 Thread Anshum Gupta
np! I forgot to mention that I didn't notice any considerable performance hit in my tests. The QTimes were barely off by 5%. On Fri, Mar 13, 2015 at 3:13 PM, Jack Krupansky jack.krupan...@gmail.com wrote: Oops... I said StatsInfo and that should have been StatsCache (statsCache .../). --

Re: get Multi-Valued field data from DocValues

2015-03-13 Thread Kevin Osborn
I figured it out. Here is what you want to do (excuse the Scala syntax). docValues = DocValues.getSortedSet(contex.reader(), myField) docValues.setDocument(docNumber) val values = Stream.continually(docValues.nextOrd).takeWhile(_ != SortedSetDocValues.NO_MORE_ORDS).map(b =

Parsing error on space

2015-03-13 Thread Rajesh
Hi, I want to retrieve the parent document which contain Test Street in street field or if any of it's child contain Test Street in childStreet field. So, I've used the following syntax. q=street:Test Street OR {!parent which=type:parent}childStreet:Test Street If the query after the OR

Word frequency

2015-03-13 Thread phiroc
Hello, is it possible to create dynamic facets with SOLR 5.0.0? For instance, I would like to display the most-frequently occurring words in the left-hand side of my Velocity SOLR GUI (facet_fields.vm). Facet_fields.vm currently looks like this:

Re: data import

2015-03-13 Thread Antonio Jesús Sánchez Padial
Maybe you should add some info about: - your architecture, number of servers, etc - your schema.xml - and the data (ammount, type, ...) you are indexing Best. El 13/03/2015 a las 9:37, abhishek tiwari escribió: solr indexing taking too much time . What should i do to reduce time . working on

Re: response.results

2015-03-13 Thread Mikhail Khludnev
Hello response is assigned at https://github.com/apache/lucene-solr/blob/trunk/solr/contrib/velocity/src/java/org/apache/solr/response/VelocityResponseWriter.java#L224 On Fri, Mar 13, 2015 at 11:22 AM, phi...@free.fr wrote: Hello, could someone please explain how the current Velocity

Re: data import

2015-03-13 Thread Mikhail Khludnev
take a profile by Visual VM or so. On Fri, Mar 13, 2015 at 11:37 AM, abhishek tiwari test.mi...@gmail.com wrote: solr indexing taking too much time . What should i do to reduce time . working on solr 4.0. -- Sincerely yours Mikhail Khludnev Principal Engineer, Grid Dynamics

Re: Best way to dump out entire solr content?

2015-03-13 Thread Toke Eskildsen
On Fri, 2015-03-13 at 00:32 +0100, vsriram30 wrote: But as you say probably the internal skips using the cursor might be more efficient than the skip done with increasing the start, I will use the cursors. Kindly correct me if my understanding is not right. Let's say you want page 5.000 and

data import

2015-03-13 Thread abhishek tiwari
solr indexing taking too much time . What should i do to reduce time . working on solr 4.0.

RE: error message This IndexSchema is not mutable with a classicSchemaIndexFactory

2015-03-13 Thread Pedro Figueiredo
Hello Shawn, Yes! That was exactly what happened... After your recommendation the insert using solrj was successful. Many thanks! Best regards, Pedro Figueiredo Senior Engineer pjlfigueir...@criticalsoftware.com M. 934058150 Rua Engº Frederico Ulrich, nº 2650 4470-605 Moreira da Maia,

Re: how to store _text field

2015-03-13 Thread Mirko Torrisi
Hi Alexandre, I need to visualize the content of _txt. For some reasons, actual it is not showed in the results (the response). I guess that it doesn't happen because it isn't stored (for some default setting that I'd like to change). Thanks for your help, Mirko On 13/03/15 00:27,

Re: [Poll]: User need for Solr security

2015-03-13 Thread Dmitry Kan
Eric, right, filesystem level encryption is the way. Making encryption part of the lucene data structures would be a tall order. On Thu, Mar 12, 2015 at 5:22 PM, Erick Erickson erickerick...@gmail.com wrote: About 1. Gotta be careful here about what would be promised. You really _can't_

response.results

2015-03-13 Thread phiroc
Hello, could someone please explain how the current Velocity template examples provided with the 5.0.0 distribution retrieve documents from SOLR? result_list.vm contains the following line #foreach($doc in $response.results) but I can't figure out where $response.results is generated. Many

Re: [Poll]: User need for Solr security

2015-03-13 Thread Dmitry Kan
Jan, Index encryption is not really about trust to root users for us. It is about letting client company to be able to secure their index with their key. To prevent information loss through hacking to a server. What I agree with is that this does go beyond just search ;) Thanks for the JIRA,

Re: Word frequency

2015-03-13 Thread phiroc
Yes. Except that I don't want to facet the entire text field (as it can contain thousands of words). I would like to: - loop throught the documents in my core - extract the most-frequently-appearing words in each document's text field - generate a .vm which displays those words ranked number

Re: empty index, with simple texts

2015-03-13 Thread Alexandre Rafalovitch
Browse is configured for displaying specific fields from example schema. I bet you are not mapping anything to them. Try just normal search first in the Web Admin UI, see what fields you get back. Then look at solrconfig.xml at the definition for /browse handler (it's big). Then, choose the

Re: Word frequency

2015-03-13 Thread Alexandre Rafalovitch
On 13 March 2015 at 10:25, phi...@free.fr wrote: I would like to: - loop throught the documents in my core - extract the most-frequently-appearing words in each document's text field - generate a .vm which displays those words ranked number of occurrences, or, ideally, automatically

Re: Word frequency

2015-03-13 Thread phiroc
If you are asking whether users have access to /browse, then the answer is yes. Currently, they can type keywords in the q input field to do searches. I plan to turn q into a hidden field and add a 'keywords' input field whose contents will be transferred to q when users press Search, using

Re: backport Heliosearch features to Solr

2015-03-13 Thread Shawn Heisey
On 3/12/2015 5:11 PM, Markus Jelsma wrote: Hello - i would assume off-heap would out perform any heap based data structure. G1 is only useful if you deal with very large heaps, and it eats CPU at the same time. As much as G1 is better than CMS in same cases, you would still have less wasted

Re: [Poll]: User need for Solr security

2015-03-13 Thread O. Klein
I used Tomcat to secure admin pages. Haven't looked into Jetty if/how to do this, but some basic security like you mentioned: SSL, authenticated login to Admin UI, authorization for Admin APIs would be nice to have. -- View this message in context:

Re: Word frequency

2015-03-13 Thread Alexandre Rafalovitch
The usual recommendation is to use Solr as a database, internally with a separate user-facing app in a different container. Solr is not really easy to secure, so best is to use O/S level protection, e.g. listening on localhost only or only on a secure IP address. This separate client also gives

Solr 5 admin screen description

2015-03-13 Thread davidphilip cherian
Hi Group, I would like to understand the solr's dashboard screen, particularly the screen *system* properties displayed such as Physical Memory, Swap Space, File Desc. count and etc. 1. What is the importance of file descriptor count? 2. Dashboard shows that it has taken 16 gb of space out of

Re: Word frequency

2015-03-13 Thread phiroc
Point taken, Shawn. Thanks for your input. - Mail original - De: Shawn Heisey apa...@elyograg.org À: solr-user@lucene.apache.org Envoyé: Vendredi 13 Mars 2015 16:12:46 Objet: Re: Word frequency On 3/13/2015 8:54 AM, phi...@free.fr wrote: If you are asking whether users have access to

Facet pivot sorting while combining Stats Component With Pivots in Solr 5

2015-03-13 Thread Dominique Bejean
Hi, Here is a query with a sample result set. http://localhost:8983/solr/myindex/select?q=*%3A*wt=jsonindent=truestats=truestats.field={!tag=piv1}sizefacet=truefacet.limit=10facet.pivot={!stats=piv1}objectrows=0 facet_counts:{ facet_queries:{}, facet_fields:{}, facet_dates:{},

Re: Word frequency

2015-03-13 Thread Shawn Heisey
On 3/13/2015 8:54 AM, phi...@free.fr wrote: If you are asking whether users have access to /browse, then the answer is yes. Currently, they can type keywords in the q input field to do searches. I plan to turn q into a hidden field and add a 'keywords' input field whose contents will

TooManyBasicQueries?

2015-03-13 Thread Ian Rose
I sometimes see the following in my logs: ERROR org.apache.solr.core.SolrCore – org.apache.lucene.queryparser.surround.query.TooManyBasicQueries: Exceeded maximum of 1000 basic queries. What does this mean? Does this mean that we have issued a query with too many terms? Or that the number of

Re: Word frequency

2015-03-13 Thread Erik Hatcher
Do you mean like faceting on one of your full text fields? Something like /browse?facet.field=_text or one of your other fields? — Erik Hatcher, Senior Solutions Architect http://www.lucidworks.com http://www.lucidworks.com/ On Mar 13, 2015, at 4:16 AM, phi...@free.fr wrote: Hello,

Re: TooManyBasicQueries?

2015-03-13 Thread Erik Hatcher
It results from a surround query with too many terms. Says the javadoc: * Exception thrown when {@link BasicQueryFactory} would exceed the limit * of query clauses. I’m curious, are you issuing a large {!surround} query or is it expanding to hit that limit? — Erik Hatcher, Senior Solutions

empty index, with simple texts

2015-03-13 Thread Nicolas Turenne
Hello, i am under Windows 7 (sp1) , i run Solr 5 after launching the service and running indexing command with tika like : java -Dauto -Durl=http://localhost:8983/solr/tika/update; -jar E://Apache//solr-5.0.0//example//exampledocs//post.jar

Re: Facet pivot sorting while combining Stats Component With Pivots in Solr 5

2015-03-13 Thread Chris Hostetter
: Is it possible to sort the facet pivot items by using a stat value like sum : or mean ? Not at the moment no. It's something i'd like to see implemented eventually, but it would require a *huge* amount of re-implementation of how the faceting code currently works. -Hoss