Re: data/index naming format

2013-09-05 Thread Jason Hellman
The circumstance I've most typically seen the index. show up is when an update is sent to a slave server. The replication then appears to preserve the updated slave index in a separate folder while still respecting the correct data from the master. On Sep 5, 2013, at 8:03 PM, Shawn Heisey w

Re: data/index naming format

2013-09-05 Thread Shawn Heisey
On 9/5/2013 6:48 PM, Aditya Sakhuja wrote: > I am running solr 4.1 for now, and am confused about the structure and > naming of the contents of the data dir. I do not see the index.properties > being generated on a fresh solr node start either. > > Can someone clarify when should one expect to see

Re: subindex

2013-09-05 Thread Erick Erickson
Nope. You can do this if you've stored _all_ the fields (with the exception of _version_ and the destinations of copyField directives). But there's no way I know of to do what you want if you haven't. If you have, you'd be essentially spinning through all your docs and re-indexing just the fields

data/index naming format

2013-09-05 Thread Aditya Sakhuja
Hello, I am running solr 4.1 for now, and am confused about the structure and naming of the contents of the data dir. I do not see the index.properties being generated on a fresh solr node start either. Can someone clarify when should one expect to see data/index vs. data/index., and the index.p

solrcloud shards backup/restoration

2013-09-05 Thread Aditya Sakhuja
Hello, I was looking for a good backup / recovery solution for the solrcloud indexes. I am more looking for restoring the indexes from the index snapshot, which can be taken using the replicationHandler's backup command. I am looking for something that works with solrcloud 4.3 eventually, but sti

Re: unknown _stream_source_info while indexing rich doc in solr

2013-09-05 Thread Chris Hostetter
: yes sir i did restart the tomcat. When you look at the Schema Browser for your default solr core (i'm guessing it's collection1?), does it list ignored_* as a dynamic field? does this URL below show you that "ignored_*" is using type "ignored" ? ... http://localhost:8983/solr/#/collection1

Re: SolrCloud 4.x hangs under high update volume

2013-09-05 Thread Tim Vaillancourt
Update: It is a bit too soon to tell, but about 6 hours into testing there are no crashes with this patch. :) We are pushing 500 batches of 10 updates per second to a 3 node, 3 shard cluster I mentioned above. 5000 updates per second total. More tomorrow after a 24 hr soak! Tim On Wednesday, 4

Re: Solr 4.3 Startup with Multiple Cores Hangs on "Registering Core"

2013-09-05 Thread Chris Hostetter
: I currently have Solr 4.3 set up with about 400 cores set to load upon : start up. When starting Solr with an empty index for each core, Solr is : able to load all of the cores and start up normally as expected. : However, after running a dataimport on all cores and restarting Solr, it : h

Solr substring search

2013-09-05 Thread Scott Schneider
Hello, I'm trying to find out how Solr runs a query for "*foo*". Google tells me that you need to use NGramFilterFactory for that kind of substring search, but I find that even with very simple fieldTypes, it just works. (Perhaps because I'm testing on very small data sets, Solr is willing to

Solr Cell Question

2013-09-05 Thread Jamie Johnson
Is it possible to configure solr cell to only extract and store the body of a document when indexing? I'm currently doing the following which I thought would work ModifiableSolrParams params = new ModifiableSolrParams(); params.set("defaultField", "content"); params.set("xpath", "/xhtml:html/

Re: More on topic of Meta-search/Federated Search with Solr

2013-09-05 Thread Paul Libbrecht
Hello list, A student of a friend of mine made his masters on that topic, especially about federated ranking. I have copied his text here: http://direct.hoplahup.net/tmp/FederatedRanking-Koblischke-2009.pdf Feel free to contact me to contact Robert Koblischke for questions. Pa

Re: charfilter doesn't do anything

2013-09-05 Thread Shawn Heisey
On 9/5/2013 10:03 AM, Andreas Owen wrote: > i would like to filter / replace a word during indexing but it doesn't do > anything and i dont get a error. > > in schema.xml i have the following: > > multiValued="true"/> > > > > > pattern="Zahlungsverkehr" replacement="A

Re: charfilter doesn't do anything

2013-09-05 Thread Jack Krupansky
And show us an input string and a query that fail. -- Jack Krupansky -Original Message- From: Shawn Heisey Sent: Thursday, September 05, 2013 2:41 PM To: solr-user@lucene.apache.org Subject: Re: charfilter doesn't do anything On 9/5/2013 10:03 AM, Andreas Owen wrote: i would like to f

Re: Numeric fields and payload

2013-09-05 Thread Erick Erickson
Peter: I don't quite get this. Formatting to display is trivial as it's usually done for just a few docs anyway. You could also just store the original unaltered value and add an additional "normalized" field. Best Erick On Wed, Sep 4, 2013 at 2:02 PM, PETER LENAHAN wrote: > Chris Hostetter

Odd behavior after adding an additional core.

2013-09-05 Thread mike st. john
using solr 4.4 , i used collection admin to create a collection 4shards replication - factor of 1 i did this so i could index my data, then bring in replicas later by adding cores via coreadmin i added a new core via coreadmin, what i noticed shortly after adding the core, the leader of the

Solr documents update on index

2013-09-05 Thread Luis Portela Afonso
Hi, I'm having a problem when solr indexes. It is updating documents already indexed. Is this a normal behavior? If a document with the same key already exists is it supposed to be updated? I has thinking that is supposed to just update if the information on the rss has changed. Appreciate your h

bucket count for facets

2013-09-05 Thread Steven Bower
Is there a way to get the count of buckets (ie unique values) for a field facet? the rudimentary approach of course is to get back all buckets, but in some cases this is a huge amount of data. thanks, steve

Loading a SpellCheck dynamically

2013-09-05 Thread Mr Havercamp
I currently have multiple spellchecks configured in my solrconfig.xml to handle a variety of different spell suggestions in different languages. In the snippet below, I have a catch-all spellcheck as well as an English only one for more accurate matching (I.e. my schema.xml is set up to captur

Solr 4.3 Startup with Multiple Cores Hangs on "Registering Core"

2013-09-05 Thread Austin Rasmussen
Hello, I currently have Solr 4.3 set up with about 400 cores set to load upon start up. When starting Solr with an empty index for each core, Solr is able to load all of the cores and start up normally as expected. However, after running a dataimport on all cores and restarting Solr, it hangs

charfilter doesn't do anything

2013-09-05 Thread Andreas Owen
i would like to filter / replace a word during indexing but it doesn't do anything and i dont get a error. in schema.xml i have the following: my 2. question is where can i say that the expression is multilined like in javascript i can

Re: JSON update request handler & commitWithin

2013-09-05 Thread Ryan, Brent
Ya, looks like this is a bug in Datastax Enterprise 3.1.2. I'm using their enterprise cluster search product which is built on SOLR 4. :( On 9/5/13 11:24 AM, "Jack Krupansky" wrote: >I just tried commitWithin with the standard Solr example in Solr 4.4 and >it works fine. > >Can you reproduce

Re: JSON update request handler & commitWithin

2013-09-05 Thread Jason Hellman
They have modified the mechanisms for committing documents…Solr in DSE is not stock Solr...so you are likely encountering a boundary where stock Solr behavior is not fully supported. I would definitely reach out to them to find out if they support the request. On Sep 5, 2013, at 8:27 AM, "Ryan,

Re: JSON update request handler & commitWithin

2013-09-05 Thread Jack Krupansky
I just tried commitWithin with the standard Solr example in Solr 4.4 and it works fine. Can you reproduce your problem using the standard Solr example in Solr 4.4? -- Jack Krupansky From: Ryan, Brent Sent: Thursday, September 05, 2013 10:39 AM To: solr-user@lucene.apache.org Subject: JSON upd

JSON update request handler & commitWithin

2013-09-05 Thread Ryan, Brent
I'm prototyping a search product for us and I was trying to use the "commitWithin" parameter for posting updated JSON documents like so: curl -v 'http://localhost:8983/solr/proposal.solr/update/json?commitWithin=1' --data-binary @rfp.json -H 'Content-type:application/json' However, the com

Re: Tweaking boosts for more search results variety

2013-09-05 Thread Jack Krupansky
The grouping (field collapsing) feature somewhat addresses this - group by a "site" field and then if more than one or a few top pages are from the same site they get grouped or collapsed so that you can see more sites in a few results. See: http://wiki.apache.org/solr/FieldCollapsing https://

Re: Solr Cloud hangs when replicating updates

2013-09-05 Thread Erick Erickson
If you run into this again, try a jstack trace. You should see evidence of being stuck in SolrCmdDistributor on a variable called "semaphore"... On current 4x this is around line 420. If you're using SolrJ, then SOLR-4816 is another thing to try. But Mark's patch would be best of all to test, If

Re: Need help on Joining and sorting syntax and limitations between multiple documents in solr-4.4.0

2013-09-05 Thread Erick Erickson
The very first thing I'd do is see if you can _not_ use joins. Especially if you're coming from a RDBMS background. Joins in Solr are somewhat specialized and are NOT equivalent to db joins. First of all there's no way to get fields from the "from" part of the join returned in the results. Secondl

Tweaking Edismax on the Phrase Fields

2013-09-05 Thread Bruno René Santos
Hi, I have a doubt about the raw query that is parsed from a edismax query. Form example the query: _query_:"{!edismax mm=100% bf='log(div(9900,producttier))' pf='name_synonyms~100^3 name~100^6 heading~100^20' pf2='name_synonyms~100^3 name~100^6 heading~100^20' qf='name_synonyms^3 name^6 heading^

Re: Little XsltResponseWriter documentation bug (Attn: Wiki Admin)

2013-09-05 Thread Stefan Matheis
Dimitri I've added you to the https://wiki.apache.org/solr/ContributorsGroup - feel free to improve the wiki :) - Stefan On Wednesday, September 4, 2013 at 11:46 PM, Dmitri Popov wrote: > Upayavira, > > I could edit that page myself, but need to be confirmed human according to > http://wiki

Re: Solr 4.3: Recovering from "Too many values for UnInvertedField faceting on field"

2013-09-05 Thread Dmitry Kan
We had a similar case for multivalued fields with a lot of unique values per field in some cases. Using facet.method=enum instead of facet.method=fc fixed the problem. Can run slower though. Dmitry On Tue, Sep 3, 2013 at 5:04 PM, Dennis Schafroth wrote: > We are harvesting and indexing bibliogr