Bloom filters and optimized vs. unoptimized indices

2013-04-29 Thread Otis Gospodnetic
Hi, I was looking at http://lucene.apache.org/core/4_2_1/codecs/org/apache/lucene/codecs/bloom/BloomFilteringPostingsFormat.html and this piece of text: " A PostingsFormat useful for low doc-frequency fields such as primary keys. Bloom filters are maintained in a ".blm" file which offers "fast-fai

Using properties from solrcore.properties in data-config.xml (Solr 4.2.1)

2013-04-29 Thread Arun Rangarajan
We are trying to upgrade from Solr 3.6.2 to Solr 4.2.1 and are having problems with using properties in solrcore.properties inside data-config.xml. With Solr 3.6.2, we were able to directly use properties in solrcore.properties inside data-config.xml like ${jdbc.driver}, ${jdbc.username}, etc., bu

What Happens to Consistency if I kill a Leader and Startup it again?

2013-04-29 Thread Furkan KAMACI
I think about such situation: Let's assume that I am indexing at my SolrCloud. My leader has a version of higher than replica as well (I have one leader and one replica for each shard). If I kill leader, replica will be leader as well. When I startup old leader again it will be a replica for my sh

Re: 4.2.1 Tutorial

2013-04-29 Thread Furkan KAMACI
Check your logs when you startup Solr if you get that error: "There exists no core with the name "collection1"". Do you get any error as like core:collection1 could not create or something like that? 2013/4/29 Jon Strayer > I can't be the only person to run into this, but I can't find any mentio

LinkedIn'de bağlantı kurma daveti

2013-04-29 Thread somer81
LinkedIn vibhoreng04 Lucene], Sizi LinkedIn'deki profesyonel ağıma eklemek istiyorum. - ömer sevinç ömer sevinç Ondokuzmayıs Üniversitesi Uzaktan Eğitim Merkezi şirketinde Öğr. Gör. Bilgisayar Müh. pozisyonunda Samsun, Türkiye ömer sevinç adlı kişiyi tanıdığınızı onaylayın: htt

4.2.1 Tutorial

2013-04-29 Thread Jon Strayer
I can't be the only person to run into this, but I can't find any mention of it anywhere. I have Solr 4.2.1 installed under OSX 10.8.3. I'm working my way through the tutorial. When I click on this link: http://localhost:8983/solr/#/collection1/queryI get the error message "There exists no core

Re: Customizing Solr GUI

2013-04-29 Thread Alexandre Rafalovitch
Does the website have a middleware? As in, is it static website or something served dynamically from PHP, Ruby, Java, etc? If the later, then you do the same thing blacklight does: you run your Solr server and your middleware talks to it over HTTP connection. Then, you have to figure out how to get

RE: Bizarre Solr issue

2013-04-29 Thread Jack.Drysdale.ctr
Hello, Shawn, and thanks for your reply. I will look into this, ASAP. I know that on one of the dev environments the persistent flag is set to "true"; I'll check the others and the production. I will also see if someone can get me a copy of the logs from the production environment to see if any

Re: Customizing Solr GUI

2013-04-29 Thread kneerosh
Thanks a lot for the responses. Now Im sure I need blacklight. Suppose I had a website designed using any other standard method- how would I have embedded a solr search in it? Velocity & Blacklight are , as I understand useful when you are building a system from scratch and can design a new searc

Re: Bizarre Solr issue

2013-04-29 Thread Shawn Heisey
On 4/29/2013 8:15 AM, jack.drysdale@ustranscom.mil wrote: Production environment is *nix running CF 9.0.0, with both Verity and Solr collections. Trying to list collections is breaking - one collection in particular is breaking the CFCOLLECTION action="list": Error message states that the so

RE: java.lang.NullPointerException. I am trying to use CachedSqlEntityProcessor

2013-04-29 Thread Dyer, James
This sounds like https://issues.apache.org/jira/browse/SOLR-3791, which was resolved in 3.6.2 / 4.0. James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: srinalluri [mailto:nallurisr...@yahoo.com] Sent: Monday, April 29, 2013 11:41 AM To: solr-user@lucene.apache.org

Exact and Partial Matches

2013-04-29 Thread Sandeep Mestry
Dear Experts, I have a requirement for the exact matches and applying alphabetical sorting thereafter. To illustrate, the results should be sorted in exact matches and all later alphabetical. So, if there are 5 documents as below Doc1 title: trees Doc 2 title: plum trees Doc 3 title: Money Tr

java.lang.NullPointerException. I am trying to use CachedSqlEntityProcessor

2013-04-29 Thread srinalluri
I am in Solr 3.6.1. The following entity gives java.lang.NullPointerException. How to debug this? Here I am usingCachedSqlEntityProcessor. Here is the e

Re: why does "*" affect case sensitivity of query results

2013-04-29 Thread geeky2
here is the jira link: https://issues.apache.org/jira/browse/SOLR-219 -- View this message in context: http://lucene.472066.n3.nabble.com/why-does-affect-case-sensitivity-of-query-results-tp4059801p4059814.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: why does "*" affect case sensitivity of query results

2013-04-29 Thread geeky2
was looking in Smiley's book on page 129 and 130. from the book, >> No text analysis is performed on the search word containing the wildcard, not even lowercasing. So if you want to find a word starting with Sma, then sma* is required instead of Sma*, assuming the index side of the field's type i

Re: why does "*" affect case sensitivity of query results

2013-04-29 Thread Alexandre Rafalovitch
http://wiki.apache.org/solr/MultitermQueryAnalysis Sorry, not for your version of Solr. Regards, Alex. Personal blog: http://blog.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch - Time is the quality of nature that keeps events from happening all at once. Lately, i

Re: facet.offset issue (previosly: [solr 3.4] anomaly during distributed facet query with 102 shards)

2013-04-29 Thread Dmitry Kan
Hi guys, do you have any ideas? Does it even make sense to paginate in facet searches, if we require deep paging? Dmitry On Fri, Apr 26, 2013 at 11:09 PM, Dmitry Kan wrote: > Hi list, > > We have encountered a weird bug related to the facet.offset parameter. In > short: the more general quer

solr query- get results without scanning files

2013-04-29 Thread dafnashkedy
I would like to execute a solr query and get only the uniquKey I've defined. The documents are very big so defining fl='my_key' is not fast enough - all the matching documents are still scanned and the query can take hours (even though the search itself was fast - numFound takes few seconds to retu

RE: Bizarre Solr issue

2013-04-29 Thread Jack.Drysdale.ctr
I don’t know if this will make any difference, or not, but production is two load-balanced servers (as far as I know, both identical). If I run the script specifically on the first server, it errors as I have described. If I run the script specifically on the second server, it lists the collect

why does "*" affect case sensitivity of query results

2013-04-29 Thread geeky2
hello, environment: solr 3.5 problem statement: when query has "*" appended, it turns case sensitive. assumption: query should NOT be case sensitive actual value in database at time of index: 4387828BULK here is a snapshot of what works and does not work. what works: itemModelNoExactMatch

Re: createNodeSet

2013-04-29 Thread Mark Miller
What version of Solr? That should work in Jetty in 4.2 and not before and in Tomcat in 4.3 and not before. - Mark On Apr 29, 2013, at 10:19 AM, Arkadi Colson wrote: > When I first do a linkconfig the route:implicit seems to be gone! So > recreating the collection will solve this. The problem

RE: Bizarre Solr issue

2013-04-29 Thread Jack.Drysdale.ctr
Hello, Alex, and thank you for your reply. I just looked it up: ColdFusion Server 9 ships with Solr version 1.4.1. Both dev and production environments use the same version. The script that I wrote takes environment into consideration - with three Windows dev environments and one Linux product

Current Has A Red Emblem, Slave Has A Higher Version And Doesn't Do Anything To Catch Up Master

2013-04-29 Thread Furkan KAMACI
If you can help me it would be nice. I have tested crawling at my amazon instances and I have a weird situation: My slave version is higher than master (actually I have killed my master and started up it again at some time) Replication (Slave) Version Gen Size Master: 1367243029412 49 1.29 GB Sla

Re: solr 3.6 hang for few seconds, need help

2013-04-29 Thread Erick Erickson
Garbage collection would be my first guess too. Here's an excellent article on GC: http://searchhub.org/2011/03/27/garbage-collection-bootcamp-1-0/ Best Erick On Mon, Apr 29, 2013 at 5:56 AM, Christian von Wendt-Jensen wrote: > I'm experiencing the same issue in my setup. > > If you do not see

Re: Bizarre Solr issue

2013-04-29 Thread Alexandre Rafalovitch
Version of Solr would help here. Solr 4+ will log where it find the collections if enabled (not sure about earlier version). The most likely problem is related to path. Perhaps you are hardcoding '\' separator somewhere on Windows and that messes up the path on Unix. Or you have different Solr ver

Re: createNodeSet

2013-04-29 Thread Arkadi Colson
When I first do a linkconfig the route:implicit seems to be gone! So recreating the collection will solve this. The problem that I cannot request a collection that does not exists on that host is still there. Arkadi On 04/29/2013 03:31 PM, Arkadi Colson wrote: The strange thing is that I creat

Bizarre Solr issue

2013-04-29 Thread Jack.Drysdale.ctr
Hello, everyone. I have a really bizarre Solr issue that I hope someone can help me resolve. Production environment is *nix running CF 9.0.0, with both Verity and Solr collections. Trying to list collections is breaking - one collection in particular is breaking the CFCOLLECTION action="list": E

Re: relevance when merging results

2013-04-29 Thread Erick Erickson
You cannot rely on scores to be comparable between two queries, or between two cores with very different kinds of data. Scores are only a way to sort results within the _same_ query and the _same_ type of core. By "type" I mean, say, shards where the schemas are identical and the statical character

Re: Customizing Solr GUI

2013-04-29 Thread Erick Erickson
Give me access to your raw Solr URLs, and I can submit the following: .../update?commit=ture&term.body=*:*<.query> which will remove all documents from your index. You really have to take control of the requests you allow to get to Solr... Best Erick On Fri, Apr 26, 2013 at 9:59 AM, Alexandre Raf

Re: Atomic Update and stored copy-fields

2013-04-29 Thread Erick Erickson
I'd ask it a different way, why in the world would you store the destinations of copyFields? It just bloats your index to no good purpose since all the sources are stored. As you can tell, I don;t have a good answer for your question, but for an explicit warning like that, I'd heed it and/or exami

Re: SolrJ Custom RowMapper

2013-04-29 Thread Erick Erickson
Luis: On the chance that this is an "XY" problem, why do you want a custom RowMapper? Perhaps there's an easy way to do what you want, but without a problem statement it's hard to tell. Best Erick On Thu, Apr 25, 2013 at 5:21 PM, Sudhakar Maddineni wrote: > Hey Luis, > Check this example in the

Re: Solr 4.2.1 SSLInitializationException

2013-04-29 Thread Sarita Nair
:I'm confused ... it seems that you (or GlassFish) has created a  :Catch-22... Glassfish specifies keystore as a system property, but does not require specifying the password for the keystore as a system property.  GF uses a keychain mechanism, which requires the password to be passed from the

Re: Solr performance issues for simple query - q=*:* with start and rows

2013-04-29 Thread Dmitry Kan
Thanks. Only question is how to transition to this model. Our facet (string) fields contain timestamp prefixes, that are reverse ordered starting from the freshest value. In theory, we could try computing the filter queries for those. But before doing so, we would need the matched ids from solr,

Re: createNodeSet

2013-04-29 Thread Arkadi Colson
The strange thing is that I created some time ago 2 other collections and there the router:implicit has not been set. Is it possible to create a collection withour the router:implicit? http://solr01:8983/solr/admin/collections?action=CREATE&name=lvs&numShards=5&replicationFactor=2&collection.co

Re: createNodeSet

2013-04-29 Thread Michael Della Bitta
That means that documents will be indexed and stored on the node they're sent to. It shouldn't keep Solr Cloud from loadbalancing reads. Fixing that won't address the problem you're asking about, but it may clear up other unintended behaviors. What version of Solr are you using, and what servlet c

Re: Solr performance issues for simple query - q=*:* with start and rows

2013-04-29 Thread Michael Della Bitta
I guess so, you'd have to use a filter query to page through the set of documents you were faceting against and sum them all at the end. It's not quite the same operation as paging through results, because facets are aggregate statistics, but if you're willing to go through the trouble, I bet it wo

Re: createNodeSet

2013-04-29 Thread Arkadi Colson
I found this in the zookeeper directory /collections/collectionX/ { "configName":"smsc", "router":"implicit"} Is "router:implicit" the cause of this? Is it possible to fix? Thx! On 04/29/2013 01:24 PM, Arkadi Colson wrote: Is it correct that if I create a collection B with parameter crea

Re: Solr performance issues for simple query - q=*:* with start and rows

2013-04-29 Thread Dmitry Kan
Michael, Interesting! Do (Can) you apply this to facet searches as well? Dmitry On Mon, Apr 29, 2013 at 4:02 PM, Michael Della Bitta < michael.della.bi...@appinions.com> wrote: > We've found that you can do a lot for yourself by using a filter query > to page through your data if it has a natu

Re: Solr performance issues for simple query - q=*:* with start and rows

2013-04-29 Thread Michael Della Bitta
We've found that you can do a lot for yourself by using a filter query to page through your data if it has a natural range to do so instead of start and rows. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www

Re: solr 3.6 hang for few seconds, need help

2013-04-29 Thread Christian von Wendt-Jensen
I'm experiencing the same issue in my setup. If you do not see any logging for several seconds, then it _could_ be due to garbage collection. If you experience heavy traffic and have very large caches, then the JVM might be forced to do a full garbage collection from time to time, halting all p

solr 3.6 hang for few seconds, need help

2013-04-29 Thread mizayah
Hi, Im running solr 3.6 on tomcat, under some traffic about 20r/s I got 6 different cores on it. I was testing one by quering every 1 second with simple request and time param. INFO: [core1] webapp=/solr3.4-tomcat path=/select params= ... 1:55:05 ... Apr 29, 2013 1:55:06 PM org.apache.solr.cor

createNodeSet

2013-04-29 Thread Arkadi Colson
Is it correct that if I create a collection B with parameter createNodeSet = hostB and I query on hostA something for collectionA it could not be found? BR, Arkadi

Re: Issue regarding Indexing PDFs into Solr.

2013-04-29 Thread Furkan KAMACI
It seems that your solrconfig.xml can not find libraries. Here is an example path from solrconfig.xml: 2013/4/29 Krishna Venkateswaran > Hi > > I have installed Solr over Apache Tomcat. > I have used Apache Tomcat v6.x for Solr to work. > > When trying to upload a file using SolrJ to index it

Re: Solr performance issues for simple query - q=*:* with start and rows

2013-04-29 Thread Dmitry Kan
Abhishek, There is a wiki regarding this: http://wiki.apache.org/solr/CommonQueryParameters search "pageDoc and pageScore". On Mon, Apr 29, 2013 at 1:17 PM, Abhishek Sanoujam wrote: > We have a single shard, and all the data is in a single box only. > Definitely looks like "deep-paging" is ha

Re: Solr performance issues for simple query - q=*:* with start and rows

2013-04-29 Thread Abhishek Sanoujam
We have a single shard, and all the data is in a single box only. Definitely looks like "deep-paging" is having problems. Just to understand, is the searcher looping over the result set everytime and skipping the first "start" count? This will definitely take a toll when we reach higher "start

Re: Solr performance issues for simple query - q=*:* with start and rows

2013-04-29 Thread Dmitry Kan
Jan, Would the same distrib=false help for distributed faceting? We are running into a similar issue with facet paging. Dmitry On Mon, Apr 29, 2013 at 11:58 AM, Jan Høydahl wrote: > Hi, > > How many shards do you have? This is a known issue with deep paging with > multi shard, see https://is

Issue regarding Indexing PDFs into Solr.

2013-04-29 Thread Krishna Venkateswaran
Hi I have installed Solr over Apache Tomcat. I have used Apache Tomcat v6.x for Solr to work. When trying to upload a file using SolrJ to index it into Solr, I am getting an exception as follows: Server at http://localhost:8080/solr-example returned non ok status:500, message:Internal Server Err

Re: Re: Re: Re: Shard update error when using DIH

2013-04-29 Thread heaven
Whoops, yes, that works. Will check if that helped to fix the original error now. Monday 29 April 2013, you wrote: On 29 April 2013 14:55, heaven <[hidden email][1]> wrote: > Got these errors after switching the field type to long: > * *crm-test:* > org.apache.solr.common.SolrException:o

Re: Re: Re: Shard update error when using DIH

2013-04-29 Thread Gora Mohanty
On 29 April 2013 14:55, heaven wrote: > Got these errors after switching the field type to long: > * *crm-test:* > org.apache.solr.common.SolrException:org.apache.solr.common.SolrException: > Unknown fieldtype 'long' specified on field _version_ You have probably edited your schema. The def

Re: Re: Re: Shard update error when using DIH

2013-04-29 Thread heaven
Got these errors after switching the field type to long: * *crm-test:* org.apache.solr.common.SolrException:org.apache.solr.common.SolrException: Unknown fieldtype 'long' specified on field _version_ * *crm-prod:* org.apache.solr.common.SolrException:org.apache.solr.common.SolrExcep

Re: Solr performance issues for simple query - q=*:* with start and rows

2013-04-29 Thread Jan Høydahl
Hi, How many shards do you have? This is a known issue with deep paging with multi shard, see https://issues.apache.org/jira/browse/SOLR-1726 You may be more successful in going to each shard, one at a time (with &distrib=false) to avoid this issue. -- Jan Høydahl, search solution architect Co

Re: Re: Shard update error when using DIH

2013-04-29 Thread Raymond Wiker
You have --- I think this needs to be "long".

Re: Re: Shard update error when using DIH

2013-04-29 Thread heaven
Yes, here is the full schema: http://pastebin.com/pFPbD749[1] On Mon, Apr 29, 2013 at 10:01 AM, heaven <[hidden email][2]> wrote: *If you reply to this email, your message will be added to the discussion below:* http://lucene.472066.n3.nabble.com/Shard-update-error-wh

Re: Shard update error when using DIH

2013-04-29 Thread Raymond Wiker
On Mon, Apr 29, 2013 at 10:01 AM, heaven wrote: > Hi, seems like I have exactly the same error: > > Apr 28, 2013 11:41:57 PM org.apache.solr.common.SolrException log > SEVERE: null:java.lang.UnsupportedOperationException > at > > org.apache.lucene.queries.function.FunctionValues.longVal(F

Re: Shard update error when using DIH

2013-04-29 Thread heaven
Hi, seems like I have exactly the same error: Apr 28, 2013 11:41:57 PM org.apache.solr.common.SolrException log SEVERE: null:java.lang.UnsupportedOperationException at org.apache.lucene.queries.function.FunctionValues.longVal(FunctionValues.java:46) at org.apache.solr.update.Versio

Re: Update on shards

2013-04-29 Thread Arkadi Colson
Anyone else having this problem that an update needs to go to a host where a shard exists. java version "1.7.0_17" Java(TM) SE Runtime Environment (build 1.7.0_17-b02) Java HotSpot(TM) 64-Bit Server VM (build 23.7-b01, mixed mode) Solr 4.2.1 apache-tomcat-7.0.33 Thx! Met vriendelijke groeten

Solr performance issues for simple query - q=*:* with start and rows

2013-04-29 Thread Abhishek Sanoujam
We have a solr core with about 115 million documents. We are trying to migrate data and running a simple query with *:* query and with start and rows param. The performance is becoming too slow in solr, its taking almost 2 mins to get 4000 rows and migration is being just too slow. Logs snippet