Great! Thanks for providing more info Toke Eskildsen
Thanks,
Sriram
--
View this message in context:
http://lucene.472066.n3.nabble.com/Best-way-to-dump-out-entire-solr-content-tp4192734p4192892.html
Sent from the Solr - User mailing list archive at Nabble.com.
Thank you for the response
This is something Heliosearch can do. Ionic Seeley, created a JIRA ticket
to back port this feature to Solr 5.
https://issues.apache.org/jira/browse/SOLR-7214
But in order to be available in Solr 5 this ticket should cover both
http://heliosearch.org/json-facet-api/
If I am finding the values of a long field for a single numeric field, I
just do:
DocValues.getNumeric(contex.reader(), myField).get(docNumber). This
returns the value of the field and everything is good.
However, my field is a multi-valued long field. So, I need to do:
Does anybody have any actual performance data or even a rough formula for
calculating the overhead for using the new Solr 5.0 Distributed IDF (
SOLR-1632 https://issues.apache.org/jira/browse/SOLR-1632)?
And any guidance as far as which StatsInfo plugin is best to use?
Are many people now using
On 3/13/2015 9:48 AM, davidphilip cherian wrote:
I would like to understand the solr's dashboard screen, particularly the
screen *system* properties displayed such as Physical Memory, Swap Space,
File Desc. count and etc. 1. What is the importance of file descriptor
count? 2. Dashboard shows
: If I am finding the values of a long field for a single numeric field, I
: just do:
:
: DocValues.getNumeric(contex.reader(), myField).get(docNumber). This
: returns the value of the field and everything is good.
:
: However, my field is a multi-valued long field. So, I need to do:
:
:
getSortedNumeric throws the following exception:
unexpected docvalues type SORTED_SET for field 'space_list' (expected one
of [SORTED_NUMERIC, NUMERIC]). Use UninvertingReader or index with
docvalues.
If I am reading the doumentation correctly, getSortedNumeric sorts the
values, but it is still
There's no rough formula or performance data that I know of at this point.
About he guidance, if you want to use Global stats, my obvious choice would
be to use the LRUStatsCache.
Before committing, I did run some tests on my macbook but as I said back
then, they shouldn't be totally taken at face
Oops... I said StatsInfo and that should have been StatsCache
(statsCache .../).
-- Jack Krupansky
On Fri, Mar 13, 2015 at 6:04 PM, Anshum Gupta ans...@anshumgupta.net
wrote:
There's no rough formula or performance data that I know of at this point.
About he guidance, if you want to use
Hi Erick,
The real time update of the schema means we can just do an update using
REST-API curl instead of manually editing the schema.xml and restart the
Solr server. In Solr 5.0, if Solr is loading the schema from the resource
named in 'managedSchemaResourceName', instead of schema.xml, I can
Hi Nitin,
To update the schema using Curl in Solr 5.0, we can use the following
command if we are using ManagedIndexSchemaFactory:
curl -X POST -H 'Content-type:application/json' --data-binary '{
add-field : {
name:sell-by,
type:tdate,
stored:true}
}'
Hi All,
Did anyone get a chance to look at my config and the InfoStream File ?
I am very curious to see what you think
thanks,
Summer
On Mar 6, 2015, at 5:20 PM, Summer Shire shiresum...@gmail.com wrote:
Hi All,
Here’s more update on where I am at with this.
I enabled infoStream
First, as a general rule you must escape spaces in URL query parameters,
either as a plus sign or %20.
That said, I suspect that the code that extracts nested queries is stopping
at the space.
Try escaping the space with a backslash.
Then you could try replacing the space in the nested query
np!
I forgot to mention that I didn't notice any considerable performance hit
in my tests. The QTimes were barely off by 5%.
On Fri, Mar 13, 2015 at 3:13 PM, Jack Krupansky jack.krupan...@gmail.com
wrote:
Oops... I said StatsInfo and that should have been StatsCache
(statsCache .../).
--
I figured it out. Here is what you want to do (excuse the Scala syntax).
docValues = DocValues.getSortedSet(contex.reader(), myField)
docValues.setDocument(docNumber)
val values = Stream.continually(docValues.nextOrd).takeWhile(_ !=
SortedSetDocValues.NO_MORE_ORDS).map(b =
Hi,
I want to retrieve the parent document which contain Test Street in street
field or if any of it's child contain Test Street in childStreet field.
So, I've used the following syntax.
q=street:Test Street OR {!parent which=type:parent}childStreet:Test
Street
If the query after the OR
Hello,
is it possible to create dynamic facets with SOLR 5.0.0?
For instance, I would like to display the most-frequently occurring words in
the left-hand side of my Velocity SOLR GUI (facet_fields.vm).
Facet_fields.vm currently looks like this:
Maybe you should add some info about:
- your architecture, number of servers, etc
- your schema.xml
- and the data (ammount, type, ...) you are indexing
Best.
El 13/03/2015 a las 9:37, abhishek tiwari escribió:
solr indexing taking too much time .
What should i do to reduce time . working on
Hello
response is assigned at
https://github.com/apache/lucene-solr/blob/trunk/solr/contrib/velocity/src/java/org/apache/solr/response/VelocityResponseWriter.java#L224
On Fri, Mar 13, 2015 at 11:22 AM, phi...@free.fr wrote:
Hello,
could someone please explain how the current Velocity
take a profile by Visual VM or so.
On Fri, Mar 13, 2015 at 11:37 AM, abhishek tiwari test.mi...@gmail.com
wrote:
solr indexing taking too much time .
What should i do to reduce time . working on solr 4.0.
--
Sincerely yours
Mikhail Khludnev
Principal Engineer,
Grid Dynamics
On Fri, 2015-03-13 at 00:32 +0100, vsriram30 wrote:
But as you say probably the internal skips using the cursor might be more
efficient than the skip done with increasing the start, I will use the
cursors. Kindly correct me if my understanding is not right.
Let's say you want page 5.000 and
solr indexing taking too much time .
What should i do to reduce time . working on solr 4.0.
Hello Shawn,
Yes! That was exactly what happened...
After your recommendation the insert using solrj was successful.
Many thanks!
Best regards,
Pedro Figueiredo
Senior Engineer
pjlfigueir...@criticalsoftware.com
M. 934058150
Rua Engº Frederico Ulrich, nº 2650 4470-605 Moreira da Maia,
Hi Alexandre,
I need to visualize the content of _txt. For some reasons, actual it is
not showed in the results (the response).
I guess that it doesn't happen because it isn't stored (for some default
setting that I'd like to change).
Thanks for your help,
Mirko
On 13/03/15 00:27,
Eric,
right, filesystem level encryption is the way. Making encryption part of
the lucene data structures would be a tall order.
On Thu, Mar 12, 2015 at 5:22 PM, Erick Erickson erickerick...@gmail.com
wrote:
About 1. Gotta be careful here about what would be promised. You
really _can't_
Hello,
could someone please explain how the current Velocity template examples
provided with the 5.0.0 distribution retrieve documents from SOLR?
result_list.vm contains the following line
#foreach($doc in $response.results)
but I can't figure out where $response.results is generated.
Many
Jan,
Index encryption is not really about trust to root users for us. It is
about letting client company to be able to secure their index with their
key. To prevent information loss through hacking to a server. What I agree
with is that this does go beyond just search ;)
Thanks for the JIRA,
Yes.
Except that I don't want to facet the entire text field (as it can contain
thousands of words).
I would like to:
- loop throught the documents in my core
- extract the most-frequently-appearing words in each document's text field
- generate a .vm which displays those words ranked number
Browse is configured for displaying specific fields from example schema. I
bet you are not mapping anything to them.
Try just normal search first in the Web Admin UI, see what fields you get
back. Then look at solrconfig.xml at the definition for /browse handler
(it's big). Then, choose the
On 13 March 2015 at 10:25, phi...@free.fr wrote:
I would like to:
- loop throught the documents in my core
- extract the most-frequently-appearing words in each document's text field
- generate a .vm which displays those words ranked number of occurrences,
or, ideally, automatically
If you are asking whether users have access to /browse, then the answer is yes.
Currently, they can type keywords in the q input field to do searches.
I plan to turn q into a hidden field and add a 'keywords' input field whose
contents will be transferred to q when users press Search, using
On 3/12/2015 5:11 PM, Markus Jelsma wrote:
Hello - i would assume off-heap would out perform any heap based data
structure. G1 is only useful if you deal with very large heaps, and it eats
CPU at the same time. As much as G1 is better than CMS in same cases, you
would still have less wasted
I used Tomcat to secure admin pages. Haven't looked into Jetty if/how to do
this, but some basic security like you mentioned: SSL, authenticated login
to Admin UI, authorization for Admin APIs would be nice to have.
--
View this message in context:
The usual recommendation is to use Solr as a database, internally with
a separate user-facing app in a different container. Solr is not
really easy to secure, so best is to use O/S level protection, e.g.
listening on localhost only or only on a secure IP address.
This separate client also gives
Hi Group,
I would like to understand the solr's dashboard screen, particularly the
screen *system* properties displayed such as Physical Memory, Swap Space,
File Desc. count and etc. 1. What is the importance of file descriptor
count? 2. Dashboard shows that it has taken 16 gb of space out of
Point taken, Shawn. Thanks for your input.
- Mail original -
De: Shawn Heisey apa...@elyograg.org
À: solr-user@lucene.apache.org
Envoyé: Vendredi 13 Mars 2015 16:12:46
Objet: Re: Word frequency
On 3/13/2015 8:54 AM, phi...@free.fr wrote:
If you are asking whether users have access to
Hi,
Here is a query with a sample result set.
http://localhost:8983/solr/myindex/select?q=*%3A*wt=jsonindent=truestats=truestats.field={!tag=piv1}sizefacet=truefacet.limit=10facet.pivot={!stats=piv1}objectrows=0
facet_counts:{
facet_queries:{},
facet_fields:{},
facet_dates:{},
On 3/13/2015 8:54 AM, phi...@free.fr wrote:
If you are asking whether users have access to /browse, then the answer is
yes.
Currently, they can type keywords in the q input field to do searches.
I plan to turn q into a hidden field and add a 'keywords' input field whose
contents will
I sometimes see the following in my logs:
ERROR org.apache.solr.core.SolrCore –
org.apache.lucene.queryparser.surround.query.TooManyBasicQueries: Exceeded
maximum of 1000 basic queries.
What does this mean? Does this mean that we have issued a query with too
many terms? Or that the number of
Do you mean like faceting on one of your full text fields? Something like
/browse?facet.field=_text or one of your other fields?
—
Erik Hatcher, Senior Solutions Architect
http://www.lucidworks.com http://www.lucidworks.com/
On Mar 13, 2015, at 4:16 AM, phi...@free.fr wrote:
Hello,
It results from a surround query with too many terms. Says the javadoc:
* Exception thrown when {@link BasicQueryFactory} would exceed the limit
* of query clauses.
I’m curious, are you issuing a large {!surround} query or is it expanding to
hit that limit?
—
Erik Hatcher, Senior Solutions
Hello,
i am under Windows 7 (sp1) , i run Solr 5
after launching the service and running indexing command with tika like :
java -Dauto -Durl=http://localhost:8983/solr/tika/update; -jar
E://Apache//solr-5.0.0//example//exampledocs//post.jar
: Is it possible to sort the facet pivot items by using a stat value like sum
: or mean ?
Not at the moment no.
It's something i'd like to see implemented eventually, but it would
require a *huge* amount of re-implementation of how the faceting code
currently works.
-Hoss
43 matches
Mail list logo