Error when use block-join filters in json api

2015-10-08 Thread Iana Bondarska
Hello, I'm trying to use block join feature with json api. I got following error when add query with "which parent" or "child of " prefixes to query facet. My query is : {!parent which="state:Idaho"} AND category:Books If I remove block-join prefixes -- query runs without errors. Are such

Re: Instant Page Previews

2015-10-08 Thread Charlie Hull
On 08/10/2015 09:00, Paul Libbrecht wrote: This is a very nice start Charlie, Thanks! I just hope it's not too elderly to serve as a basis. I'd warn a bit however, on the value of such previews: automated previews of web-page can be quite far from what users might be remembering a page

Re: Unexpected delayed document deletion with atomic updates

2015-10-08 Thread John Smith
The ids are all different: they're unique numbers followed by a couple of keywords. I've made a test with a small collection of 10 documents to make sure I can manage them manually: all ids are confirmed as different. I also dumped the exact command, here's one example: 101084385_Sebago_ sebago

Re: Exclude documents having same data in two fields

2015-10-08 Thread NutchDev
One option could be creating another boolean field field1_equals_field2 and set it to true for documents matching it while indexing. Use this field as a filter criteria while querying solr. -- View this message in context:

Re: Fuzzy search for names and phrases

2015-10-08 Thread NutchDev
WordDelimiterFilterFactory can handle cases like, wi-fi ==> wifi SD500 ==> sd 500 PowerShot ==> Power Shot you can get more information at wiki page here, https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory -- View this message in context:

Re: Unexpected delayed document deletion with atomic updates

2015-10-08 Thread Upayavira
Look for the DedupUpdateProcessor in an update chain. that is there, but commented out IIRC in the techproducts sample configs. Perhaps you uncommented it to use your own update processors, but didn't remove that component? On Thu, Oct 8, 2015, at 07:38 AM, John Smith wrote: > Oh, I forgot

Re: Unexpected delayed document deletion with atomic updates

2015-10-08 Thread John Smith
After some further investigation, for those interested: the SignatureUpdateProcessorFactory fields were somehow mis-configured (I guess copied over from another collection). The initial import had been made using a data import handler: I suppose the update chain isn't called in this process and no

Re: Unexpected delayed document deletion with atomic updates

2015-10-08 Thread John Smith
Yes indeed, the update chain had been activated... I commented it out again and the problem vanished. Good job, thanks Erick and Upayavira! John On 08/10/15 08:58, Upayavira wrote: > Look for the DedupUpdateProcessor in an update chain. > > that is there, but commented out IIRC in the

Re: Instant Page Previews

2015-10-08 Thread Paul Libbrecht
This is a very nice start Charlie, I'd warn a bit however, on the value of such previews: automated previews of web-page can be quite far from what users might be remembering a page should look like. In particular all tool pages typically show quite "empty" or "initial" state in such automatic

Re: Unexpected delayed document deletion with atomic updates

2015-10-08 Thread John Smith
Oh, I forgot Erick's mention of the logs: there's nothing unusual in INFO level, the update request just gets mentioned. No exception. I reran it with the DEBUG level, but most of the log was related to jetty. Here's a line I noticed though: org.apache.solr.servlet.HttpSolrCall; Closing out

Re: Unexpected delayed document deletion with atomic updates

2015-10-08 Thread Upayavira
Yay! On Thu, Oct 8, 2015, at 08:38 AM, John Smith wrote: > Yes indeed, the update chain had been activated... I commented it out > again and the problem vanished. > > Good job, thanks Erick and Upayavira! > John > > > On 08/10/15 08:58, Upayavira wrote: > > Look for the DedupUpdateProcessor in

Re: Lose Solr config on zookeeper when it is restarted

2015-10-08 Thread Upayavira
Are all instances of Solr the same version? Mixing versions could cause what Erick describes. On Thu, Oct 8, 2015, at 03:19 AM, Erick Erickson wrote: > Sounds like you're somehow mixing old and new versions of the ZK state > when you restart. I have no idea how that would be happening, but... >

Re: Instant Page Previews

2015-10-08 Thread Charlie Hull
On 07/10/2015 18:49, Lewin Joy (TMS) wrote: Hi, Is there anyway we can implement instant page previews in solr? Just saw that Google Search Appliance has this out of the box. Just like what google.com had previously. We need to display the content of the result record when hovering over the

Re: Instant Page Previews

2015-10-08 Thread Charlie Hull
On 07/10/2015 18:49, Lewin Joy (TMS) wrote: Hi, Is there anyway we can implement instant page previews in solr? Just saw that Google Search Appliance has this out of the box. Just like what google.com had previously. We need to display the content of the result record when hovering over the

Re: Error when use block-join filters in json api

2015-10-08 Thread Mikhail Khludnev
Hello, Yana! It's not clear what happens. I appreciate if you put exact queries (up to obfuscated values) and exceptions or actual results (and expectations); sample data is also useful. What I can note so far, user filters can't be used as parent mask in _which_ and _of_. See

Re: Exclude documents having same data in two fields

2015-10-08 Thread Aman Tandon
But I want to find do it at run time without index extra field With Regards Aman Tandon On Thu, Oct 8, 2015 at 11:55 AM, NutchDev wrote: > One option could be creating another boolean field field1_equals_field2 and > set it to true for documents matching it while

Re: How to show some documents ahead of others

2015-10-08 Thread NutchDev
Hi Christian, You can take a look at Solr's QueryElevationComponent . It will allow you to configure the top results for a given query regardless of the normal lucene scoring. Also you can specify exclude document list to exclude certain

Re: faceting is unusable slow since upgrade to 5.3.0

2015-10-08 Thread Uwe Reh
Sorry for the delay. I had an ugly flu. SOLR-7730 seems to work fine. Using docValues with Solr 5.4.0-2015-09-29_08-29-55 1705813 makes my faceted queries fast again. (90ms vs. 2ms) :-) Thanks Uwe Am 27.09.2015 um 20:32 schrieb Mikhail Khludnev: On Sun, Sep 27, 2015 at 2:00 PM, Uwe

Re: Unexpected delayed document deletion with atomic updates

2015-10-08 Thread Upayavira
You can either specify the update chain via an update.chain request parameter, or you can configure a new request parameter with its own URL and separate update.chain value. I have no idea how you would then reference that in the DIH - I've never really used it. Upayavira On Thu, Oct 8, 2015,

How to show some documents ahead of others

2015-10-08 Thread liviuchristian
Hi everybody, I'm building a recipe search engine based on solr. Paid postings must be listed on the front page, ahead of non-paid postings. When a user performs a query based on some keywords, solr returns documents in the decreasing order of their score. However, I don't know how to make

Re: Error when use block-join filters in json api

2015-10-08 Thread Iana Bondarska
Hello Mikhail, here are json.facet parameters that I tried: c_gender, c_window belong to child documents, rest - to parent. 1. returns no results, can we combine filters from different levels in queries { high_popularity : { type : query, q : "{!child of=city:Auburn}city:Auburn AND

Re: Exclude documents having same data in two fields

2015-10-08 Thread NutchDev
Hi Aman, Have a look at this , it has query time approach also using Solr function query, http://stackoverflow.com/questions/15927893/how-to-check-equality-of-two-solr-fields http://stackoverflow.com/questions/16258605/query-for-document-that-two-fields-are-equal -- View this message in

Re: faceting is unusable slow since upgrade to 5.3.0

2015-10-08 Thread Mikhail Khludnev
Uwe, it's good to know! I mean that you've recovered. Take care! On Thu, Oct 8, 2015 at 1:24 PM, Uwe Reh wrote: > Sorry for the delay. I had an ugly flu. > > SOLR-7730 seems to work fine. Using docValues with Solr > 5.4.0-2015-09-29_08-29-55 1705813 makes my faceted

Re: Error when use block-join filters in json api

2015-10-08 Thread Iana Bondarska
sorry,missed example input data: child document: { "c_gender": "female", "c_window": "seaview", "_root_": 1673891436 } parent document: { "_id": 1673891436, "county_code": "26021", "city": "Auburn", "year": 2012, "county": "Berrien", "Sales": 112808, "state": "Washington", "product_group":

Re: Error when use block-join filters in json api

2015-10-08 Thread Mikhail Khludnev
> > { high_popularity : { > type : query, > q : "{!child of=city:Auburn}city:Auburn AND c_window:seaview", > facet :{top_genres:{type: terms,field: c_gender}} > } > } I'm not sure about facets, but query isn't correct in should be something like q : "+c_window:seaview +{!child

Re: how to deployed another web project into jetty server(solr inbuilt)

2015-10-08 Thread Mugeesh Husain
Thank you Daniel Collins. Client is not providing tomcat or any other server that why i was looking for it. any i ask again for server installation. Thanks, Mugeesh Husain -- View this message in context:

Re: How to show some documents ahead of others

2015-10-08 Thread Upayavira
Or just have a field in your index - paid: true/false Then sort=paid desc, score desc (you may need to sort paid asc, not sure which way a boolean would sort) Question is whether you want to show ALL paid posts, or just a set of them. For the latter you could use result grouping on the paid

Re: Unexpected delayed document deletion with atomic updates

2015-10-08 Thread John Smith
Well, every day we update a lot of documents (usually several millions) so the DIH is a good fit. Calling the update chain would make sense there: after all a data import is just a batch update. Otherwise, the same operations would have to be made upfront, possibly in another environment and/or

Re: How to show some documents ahead of others

2015-10-08 Thread Andrea Roggerone
Hi guys, I don't think that sorting is a good solution in this case as it doesn't allow any meaningful customization.I believe that the advised QueryElevationComponent is one of the viable alternative. Another one would be to boost at query time a particular field, like for instance paid. That

Re: How to show some documents ahead of others

2015-10-08 Thread Alessandro Benedetti
Is it possible to understand better this : "as it doesn't allow any meaningful customization " ? Cheers On 8 October 2015 at 15:27, Andrea Roggerone wrote: > Hi guys, > I don't think that sorting is a good solution in this case as it doesn't > allow any

Re: Best Indexing Approaches - To max the throughput

2015-10-08 Thread Mugeesh Husain
Good way Using SolrJ with Thread pool executor framework, increase number of Thread as per your requirement -- View this message in context: http://lucene.472066.n3.nabble.com/Best-Indexing-Approaches-To-max-the-throughput-tp4232740p4233513.html Sent from the Solr - User mailing list archive

Re: How to show some documents ahead of others

2015-10-08 Thread Walter Underwood
Sorting all paid above all unpaid will give bad results when there are many matches. It will show 1000 paid items, include all the barely relevant ones, before it shows the first highly relevant unpaid recipe. What if that was the only correct result? Two approaches that work: 1. Boost paid

Re: Error when use block-join filters in json api

2015-10-08 Thread Iana Bondarska
thanks for help, I managed to get some results with json.facet: { high_popularity : { type : query, q : "+{!child of=state:*}state:Michigan+{!child of=city:*}city:'Benton'", facet :{top_genres:{type: terms,field: c_gender}} } } but in this case operator to link conditions is defined

Re: Unexpected delayed document deletion with atomic updates

2015-10-08 Thread Alessandro Benedetti
Not related to the deletion problem, only as a curiosity for your use case : 1 Have i misunderstood your use case, or you should use : inc Increments a numeric value by a specific amount. Must be specified as a single numeric value. Basically overtime you click, you always set the value for

Re: Fuzzy search for names and phrases

2015-10-08 Thread Alessandro Benedetti
Am i the only one that sees this messages out of context in the Mailing list ? Is this the expected behaviour ? Cheers On 8 October 2015 at 07:37, NutchDev wrote: > WordDelimiterFilterFactory can handle cases like, > > wi-fi ==> wifi > SD500 ==> sd 500 > PowerShot ==>

Please add me to ContributorsGroup of the Solr Wiki

2015-10-08 Thread Nikola Smolenski
Hello, Could you please add me to the ContributorsGroup of the Solr Wiki? I have made Serbian analyzer for Solr [ https://issues.apache.org/jira/browse/LUCENE-6053] and would now like to write about some Serbian search considerations. My wiki username is NikolaSmolenski. -- Nikola Smolenski

Re: Please add me to ContributorsGroup of the Solr Wiki

2015-10-08 Thread Nikola Smolenski
Sorry, I have somehow not seen your initial response. I can log in and edit. On Thu, Oct 8, 2015 at 5:32 PM, Erick Erickson wrote: > I think I did this a few days ago, your name has been in the auth file > since 5-Oct. > > So... > 1> you haven't checked > or > 2> I

Re: Exclude documents having same data in two fields

2015-10-08 Thread Alessandro Benedetti
Hi agree with Nutch, using the Function Range Query Parser, should do your trick : https://lucene.apache.org/solr/5_3_0/solr-core/org/apache/solr/search/FunctionRangeQParserPlugin.html Cheers On 8 October 2015 at 13:31, NutchDev wrote: > Hi Aman, > > Have a look at

Re: How to show some documents ahead of others

2015-10-08 Thread Andrea Roggerone
Sure. Let's say that as Upayavira was saying you have in your index: "paid: true/false Then sort=paid desc, score desc" In that case, paid=true and higher score would come up first. After that you decide that you want to add a set of offers: Offer 1: cost 1000 euros Offer 2: cost 100 euros Offer

Re: Please add me to ContributorsGroup of the Solr Wiki

2015-10-08 Thread Erick Erickson
I think I did this a few days ago, your name has been in the auth file since 5-Oct. So... 1> you haven't checked or 2> I messed it up somehow. or 3> You really need access to the _Lucene_ contributor's group rather than the Solr contributor's group, they're separate auth lists. Let me

Re: How to show some documents ahead of others

2015-10-08 Thread Alessandro Benedetti
Thanks Andrea, I agree with you. It seems much likely the classic " Relevancy biased by date" . But instead of having new docs we have paying docs. Probably a boost function can be helpful as already said. Cheers On 8 October 2015 at 17:03, Upayavira wrote: > Hence the

Re: How to show some documents ahead of others

2015-10-08 Thread Upayavira
Hence the suggestion to group by the paid field - would give you two lists of the number you ask for. What I'm trying to say is that the QueryElevationComponent might do it, but it is also relatively clunky, so a pure search solution might do it. However, the thing we lack right now is a full

Re: Best Indexing Approaches - To max the throughput

2015-10-08 Thread Alessandro Benedetti
This depends of the number of active producers, but ideally it's ok. Different threads will access the ThreadSafe ConcurrentUpdateSolrClient and send the document in batches. Or you were meaning something different ? On 8 October 2015 at 16:00, Mugeesh Husain wrote: > Good

Re: Error when use block-join filters in json api

2015-10-08 Thread Mikhail Khludnev
Iana, Such complex structured queries are really hard to forge in Solr (-here Elastic's DSL gets over much-). I suggest to check http://blog.griddynamics.com/2013/09/solr-block-join-support.html and http://blog.griddynamics.com/2013/12/grandchildren-and-siblings-with-block.html to get known corner

Re: tlog replay

2015-10-08 Thread Rallavagu
As a follow up. Eventually the tlog file is disappeared (could not track the time it took to clear out completely). However, following messages were noticed in follower's log. 5120638 [recoveryExecutor-14-thread-2] WARN org.apache.solr.update.UpdateLog – Starting log replay tlog On

Re: Scramble data

2015-10-08 Thread Susheel Kumar
Like Erick said, would something like using replace function on individual sensitive fields in fl param would work? replacing to something REDACTED etc. On Thu, Oct 8, 2015 at 2:58 PM, Tarala, Magesh wrote: > I already have the data ingested and it takes several days to do

Re: Scramble data

2015-10-08 Thread Uwe Reh
Hi, my suggestions are probably to simple, because they are not a real protection of privacy. But maybe one fits to your needs. Most simple: Declare your 'hidden' fields just as "indexed=true stored=false", the data will be used for searching, but the fields are not listed in the query

Re: Scramble data

2015-10-08 Thread Roman Chyla
Or you could also apply XSL to returned records: https://wiki.apache.org/solr/XsltResponseWriter On Thu, Oct 8, 2015 at 5:06 PM, Uwe Reh wrote: > Hi, > > my suggestions are probably to simple, because they are not a real > protection of privacy. But maybe one fits

Re: No live SolrServers available to handle this request

2015-10-08 Thread Mark Miller
Your Lucene and Solr versions must match. On Thu, Oct 8, 2015 at 4:02 PM Steve wrote: > I've loaded the Films data into a 4 node cluster. Indexing went well, but > when I issue a query, I get this: > > "error": { > "msg":

Re: Scramble data

2015-10-08 Thread Doug Turnbull
Can you just generate a fake data set for testing? There are numerous libraries that create fake names, phone numbers, etc that you can use to create mock data. Faker is one we have used in sensitive situations https://github.com/joke2k/faker I think this is going to be a better long-term

Re: tlog replay

2015-10-08 Thread Rallavagu
Erick, Actually, configured autocommit to 15 seconds and openSearcher is set to false. Neither 2 nor 3 happened. However, softCommit is set to 10 min. ${solr.autoCommit.maxTime:15000} false Working on upgrading to 5.3 which will take a bit of time and trying to get this

Re: Lose Solr config on zookeeper when it is restarted

2015-10-08 Thread CrazyDiamond
i have one instance of solr. the thing is when i create collection the running solr is used but when i upload config i use zkcli -- View this message in context: http://lucene.472066.n3.nabble.com/Lose-Solr-config-on-zookeeper-when-it-is-restarted-tp421p4233626.html Sent from the Solr -

Re: tlog replay

2015-10-08 Thread Erick Erickson
right, so the scenario is 1> somehow you didn't do a hard commit (openSearcher=true or false doesn't matter) for a really long time while indexing. 2> Solr abnormally terminated. 3> When Solr started back up it replayed the entire log. How <1> happened is the mystery though. With a hard commit

which one is faster synonym_edismax & edismax faster?

2015-10-08 Thread Aman Tandon
Hi, Currently we are using the *synonym_edismax query parser* plugin to handle the multi-word synonym. I want to know which is more faster *edismax* or *synonym_edismax*. As we are having the very less amount of multi-words in our dictionary so we are thinking to use standard edismax query

Re: how to deployed another web project into jetty server(solr inbuilt)

2015-10-08 Thread Upayavira
On Thu, Oct 8, 2015, at 03:21 PM, Mugeesh Husain wrote: > Thank you Daniel Collins. > > Client is not providing tomcat or any other server that why i was looking > for it. > any i ask again for server installation. There is good reason for what Daniel told you. Sure, you can work out how to

Re: Best Indexing Approaches - To max the throughput

2015-10-08 Thread Susheel Kumar
The ConcurrentUpdateSolrClient is not cloud aware or takes zkHostString as input. So only option is to use CloudSolrClient with SolrJ & Thread pool executor framework. On Thu, Oct 8, 2015 at 12:50 PM, Alessandro Benedetti < benedetti.ale...@gmail.com> wrote: > This depends of the number of

RE: Scramble data

2015-10-08 Thread Tarala, Magesh
I already have the data ingested and it takes several days to do that. I was trying to avoid re-ingesting the data. Thanks, Magesh -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Wednesday, October 07, 2015 9:26 PM To: solr-user@lucene.apache.org

No live SolrServers available to handle this request

2015-10-08 Thread Steve
I've loaded the Films data into a 4 node cluster. Indexing went well, but when I issue a query, I get this: "error": { "msg": "org.apache.solr.client.solrj.SolrServerException: No live SolrServers available to handle this request: [