Re: Searching special characters

2016-02-12 Thread Modassar Ather
You can search them by escaping with backslash. Best, Modassar

Re: Searching special characters

2016-02-12 Thread Anil
Thanks for quick response. Should these be treated differently during index ? I have tried *\"Audit* which is returning results of *Audit *also which is incorrect. what do you say ? On 12 February 2016 at 15:07, Modassar Ather wrote: > You can search them by escaping

Searching special characters

2016-02-12 Thread Anil
HI, How can we search special characters like *, " (double quote) where these are actually solr uses for exact and wild card searches. Please advice. Regards, Anil

Re: slave is getting full synced every polling

2016-02-12 Thread Novin Novin
Typo? That's 60 seconds, but that's not especially interesting either way. Yes, I was thinking about this too and I have changed it to 59 actually. Do the actual segment's look identical after the polling? Well no. How I am handling master slave. How we do this use sym link for master and

Re: Searching special characters

2016-02-12 Thread Modassar Ather
These special characters can be removed if at begging or end or can be taken care by the relevant filters depending on the schema defined. E.g "Audit"/*Audit should be searched by query Audit so I see no reason of indexing "/* of the content. You can use PatternReplaceFilter for replacing these

Re: slave is getting full synced every polling

2016-02-12 Thread Novin Novin
sorry core name is wmsapp_analysis which is big core On Fri, 12 Feb 2016 at 12:01 Novin Novin wrote: > Well It started again. > > Below is are the errors from solr logging on admin ui. > Log error message in master > 2/12/2016, 11:39:24 AM

Weird behaviour related to facetting

2016-02-12 Thread Sebastian Geerken
Hi! I've experienced a strange behaviour with several versions of SOLR (currently testing with 5.4.1, but this effects can also be reproduced with 5.3.1). Some facet values are not returned when querying "*:*", but only when I search for something special, say text "foo". I've stripped down both

Solr-kerbarose URL not accessible

2016-02-12 Thread vidya
Hi When I am trying to access my solrCloud web UI page, deployed in cloudera cluster, I have encountered with the error "DEFECTED TOKENS DETECTED" . Find the attachment of the error that is added here. It is because of kerbarose installed on cluster. Is there any other way that I can access

Re: slave is getting full synced every polling

2016-02-12 Thread Novin Novin
Well It started again. Below is are the errors from solr logging on admin ui. Log error message in master 2/12/2016, 11:39:24 AM null:java.lang.IllegalStateException: file: MMapDirectory@/var/solr/data/wmsapp_analysis/data/index.20160211204900750

query knowledge graph

2016-02-12 Thread Midas A
Please suggest how to create query knowledge graph for e-commerce application . please describe in detail . our mote is to improve relevancy . we are from LAMP back ground .

Re: slave is getting full synced every polling

2016-02-12 Thread Novin Novin
Details here are important. Do you understand what Erick was asking when he was talking about segments? The segments are the files in the index directory, which is usually data/index inside the core's instance directory. Thanks Shawn, If I am thinking right these segments also appears on core

Re: Solr-kerbarose URL not accessible

2016-02-12 Thread Anil
through use jaas-configuration through java API connectivity. System.setProperty("java.security.auth.login.config", "location of jaas configration file"); HttpClientUtil.setConfigurer(new Krb5HttpClientConfigurer()); add above two lines when your creating solr cloud you can find jaas conf

Re: Weird behaviour related to facetting

2016-02-12 Thread Alessandro Benedetti
I know sometime it happens, unfortunately you simply ignored the facet.limit parameter ... By default you show only the first 100 facets. Showing more is going to show also the one you were thinking were missing ( but actually were simply not shown) . Cheers On 12 February 2016 at 10:59,

Re: How is Tika used with Solr

2016-02-12 Thread xavi jmlucjav
Of course, but that code is very tricky, so if the extraction library takes care of all that, it's a huge gain. The Aperture library I used worked very well in that regard, and even though it did not use processes as Timothy says, it never got stuck if I remember correctly. On Fri, Feb 12, 2016

Re: Solr architecture

2016-02-12 Thread Mark Robinson
Thanks All for your suggestions! Rgds, Mark. On Thu, Feb 11, 2016 at 9:45 AM, Upayavira wrote: > Your biggest issue here is likely to be http connections. Making an HTTP > connection to Solr is way more expensive than the ask of adding a single > document to the index. If you

Re: Weird behaviour related to facetting

2016-02-12 Thread Sebastian Geerken
Alessandro, thank you for the hint. Setting facet.limit to a higher value fixes the problem. Regards Sebastian On Fr, Feb 12, 2016, Alessandro Benedetti wrote: > I know sometime it happens, unfortunately you simply ignored the > facet.limit parameter ... > By default you show only the first 100

Re: slave is getting full synced every polling

2016-02-12 Thread Alessandro Benedetti
Have you customised the merge factor ? Is it aggressive ? In case a lot of merge happens, you can potentially incur in a big trasnfer of files each replication . You need to check the segments in the slave every minutes. When the replication is triggered what are the difference from the Master

Re: optimize requests that fetch 1000 rows

2016-02-12 Thread Shawn Heisey
On 2/12/2016 2:57 AM, Matteo Grolla wrote: > tell me if I'm wrong but qtime accounts for search time excluding the > fetch of stored fields (I have a 90ms qtime and a ~30s time to obtain the > results on the client on a LAN infrastructure for 300kB response). debug > explains how much of

Re: Solr-kerbarose URL not accessible

2016-02-12 Thread Shawn Heisey
On 2/12/2016 4:28 AM, vidya wrote: > When I am trying to access my solrCloud web UI page, deployed in cloudera > cluster, I have encountered with the error "DEFECTED TOKENS DETECTED" . Find > the attachment of the error that is added here. It is because of kerbarose > installed on cluster. > >

Re: optimize requests that fetch 1000 rows

2016-02-12 Thread Jack Krupansky
Thanks for that critical clarification. Try... 1. A different response writer to see if that impacts the clock time. 2. Selectively remove fields from the fl field list to see if some particular field has some issue. 3. If you simply return only the ID for the document, how fast/slow is that?

Re: slave is getting full synced every polling

2016-02-12 Thread Shawn Heisey
On 2/12/2016 1:58 AM, Novin Novin wrote: > Typo? That's 60 seconds, but that's not especially interesting either way. > > Yes, I was thinking about this too and I have changed it to 59 actually. If you want the polling to occur once an hour, pollInterval will need to be set to 01:00:00 ... not

Re: optimize requests that fetch 1000 rows

2016-02-12 Thread Matteo Grolla
Hi Jack, tell me if I'm wrong but qtime accounts for search time excluding the fetch of stored fields (I have a 90ms qtime and a ~30s time to obtain the results on the client on a LAN infrastructure for 300kB response). debug explains how much of qtime is used by each search component. For me

Re: un-Boosting some Docs at index time

2016-02-12 Thread Steven White
Thanks Erick!! Yes, SolrInputDocument.setDocumentBoost() is what I'm looking for. I was under the impression boosting is on fields only. Steve On Fri, Feb 12, 2016 at 11:36 AM, Erick Erickson wrote: > You can use index-time boosting on a per-field basis, here's a

Re: edismax query parser - pf field question

2016-02-12 Thread Senthil
It does not work with comma as well. In fact, no DisjunctionMaxQuery is added for any of pf fields if I add comma. (+((DisjunctionMaxQuery((P_NAME:refriger^1.5 | CategoryName:refrigerator)~1.0) DisjunctionMaxQuery((P_NAME:water^1.5 | CategoryName:water)~1.0) DisjunctionMaxQuery((P_NAME:filter^1.5

Re: optimize requests that fetch 1000 rows

2016-02-12 Thread Erick Erickson
I agree with everyone else that this seems very unusual, but here are some additional possible options: If (and only if) you're returning "simple" (i.e. numerics and strings) you could consider the Streaming Aggregation stuff. It's built to return rows without going to disk. The restriction is

Re: slave is getting full synced every polling

2016-02-12 Thread Erick Erickson
bq: What I have done when the problem started, I changed slave to master and master to slave. OK, other things aside, if you're really saying that every time you switch the slave and master around and restart, you get a full sync then I'd reply "don't do that". Why are you switching slave and

un-Boosting some Docs at index time

2016-02-12 Thread Steven White
Hi everyone, I'm trying to figure out if this is possible, if so how do I do it. I'm indexing records from my database. The Solr doc has 2 basic fields: the ID and the Data field. I lump the data of each field from the record into Solr's Data field. At search time, I search on this single

Re: Need to move on SOlr cloud (help required)

2016-02-12 Thread Erick Erickson
bq: in case of solrcloud architecture we need not to have load balancer First, my comment about a load balancer was for the master/slave architecture where the load balancer points to the slaves. Second, for SolrCloud you don't necessarily need a load balancer as if you're using a SolrJ client

Re: Searching special characters

2016-02-12 Thread Erick Erickson
Also look at the admin/analysis page to see the effects of various filters in your analysis chain. It's very likely that the * is not even _in_ the index. Here is a partial list of elements that _may_ be in your analysis chain: https://cwiki.apache.org/confluence/display/solr/Filter+Descriptions

Re: un-Boosting some Docs at index time

2016-02-12 Thread Erick Erickson
You can use index-time boosting on a per-field basis, here's a place to start: https://lucidworks.com/blog/2011/12/14/options-to-tune-documents-relevance-in-solr/ Does that work? Best, Erick On Fri, Feb 12, 2016 at 8:30 AM, Steven White wrote: > Hi everyone, > > I'm

Re: slave is getting full synced every polling

2016-02-12 Thread Novin Novin
you're trying to accomplish X and asking about Y where Y is the index replication. What's X? What is the purpose of switching the master and slave and how often do you do it and why? I think I didn't explain it quit properly. So I have situation in which data is getting index every 20 seconds or

Re: boolean query with score and with out score

2016-02-12 Thread Erik Hatcher
What are the parsed queries from debug=true? Maybe it's an Or/OR thing? > On Feb 12, 2016, at 23:47, sara hajili wrote: > > hi i have a Boolean query > like this > query = caption:apple Or caption:bannana^1.0003 OR > caption:pineapple^1.0023 > and get a result like >

boolean query with score and with out score

2016-02-12 Thread sara hajili
hi i have a Boolean query like this query = caption:apple Or caption:bannana^1.0003 OR caption:pineapple^1.0023 and get a result like doc1 doc2 doc3 but this result does not satisfy me at all. because i had a doc that contain some of this term but i did not get these docs. but when i change my

Re: slave is getting full synced every polling

2016-02-12 Thread Shawn Heisey
On 2/12/2016 11:47 AM, Novin Novin wrote: > I think I didn't explain it quit properly. So I have situation in which > data is getting index every 20 seconds or less and I can't loose data while > indexing. I use searching a lot in website, if I have to restart my solr > machine because of kernel

Re: query knowledge graph

2016-02-12 Thread Jack Krupansky
"knowledge graph" is kind of vague - what did you have in mind? An example would help. -- Jack Krupansky On Fri, Feb 12, 2016 at 7:27 AM, Midas A wrote: > Please suggest how to create query knowledge graph for e-commerce > application . > > > please describe in detail .

Re: query knowledge graph

2016-02-12 Thread Alexandre Rafalovitch
The last Lucene/Solr Revolution had a number of presentations on relevancy. I would recommend watching them as a first step. They are on YouTube under Lucidworks channel. There is also an early release book from Mannings called Relevant Search which you will find very useful. Regards, Alex.

Re: slave is getting full synced every polling

2016-02-12 Thread Erick Erickson
If you have to stay on master/slave, then the full replication when you do this switch is probably just a price you'll have to pay. The indexes are different so to be on the safe side Solr will replicate the whole thing. Is it really that much of a problem? As Shawn says, though, much of this