Re: extract multi-features for one solr feature extractor in solr learning to rank

2017-04-21 Thread alessandro.benedetti
Hi Jianxiong, this is definitely interesting. Briefly reviewing the paper you linked the use case seems clear : You want similar "family" of features, to be calculated on each field. Let's take as example the TF feature, you may want to define in the features.json only one feature including all

Re: Graph traversel

2017-04-21 Thread Ganesh M
I also tried with the sample data mentioned in this link. https://cwiki.apache.org/confluence/display/solr/Other+Parsers#OtherParsers-GraphQueryParser even for that, after loading the data and for the query http://localhost:8983/solr/graph/query?q={!graph%20from=in_edge%20to=out_edge}id:A=id I

Re: Update schema.xml without restarting Solr?

2017-04-21 Thread Lingeshm
Hello Team , I can’t change the schema name of an existing index. I want to change the schema name from “schemaV1" to “schemaV2 for one of the existing index curl -XPUT http://localhost:8098/search/index/my_idx-H "Content-Type: application/json" -d '{"schema":"schemaV2"}' the funny part is

RE: DistributedUpdateProcessorFactory was explicitly disabled from this updateRequestProcessorChain

2017-04-21 Thread alessandro.benedetti
Let's make a quick differentiation between PRE and POST processors in a Solr Cloud atchitecture : "In a single node, stand-alone Solr, each update is run through all the update processors in a chain exactly once. But the behavior of update request processors in SolrCloud deserves special

Re: prefix facet performance

2017-04-21 Thread alessandro.benedetti
Hi Maria, If you have 100-500.000 unique values for the field you are interested in, and the cardinality of your search results is actually quite small in comparison, I am not that sure term enum will help you that much ... To simplify, with the term enum approach, you iterate over each unique

Re: Advice on how to work with pure JSON data.

2017-04-21 Thread Mikhail Khludnev
Hello, See below. On Fri, Apr 21, 2017 at 8:21 AM, wrote: > One thing I forgot to mention in my original post is that I wish to do > this using the SolrJ client. > I have my own rest server that presents a common API to our users, but the > back-end can be >

Backup not working

2017-04-21 Thread vrindavda
Hello, I am trying to backup the Solr index data using collection API. I have \collection2_shard1_replica1\data\index\segments_6 in my data folder, but when I try to backup files, It expects \collection2_shard1_replica1\data\index\segments_5 which is not there in data folder, hence giving

Graph traversel

2017-04-21 Thread Ganesh M
Hi I am trying graph traversal based on the documentation available over here http://solr.pl/en/2016/04/18/solr-6-0-and-graph-traversal-support/ But the it's not working as expected. For this query http://localhost:8983/solr/graph/query?q=*:*={!graph%20from=parent_id%20to=id}id:1 ( which is

Re: Backup not working

2017-04-21 Thread vrindavda
I realized that Segments_1 is getting created in Shard2 and Segments_2 in Shard1. Backup API is looking for Segments_1 in Shard1. Please correct if I have configured something wrongly. I have created collection using collection API and am using data_driven_schema_configs configs. -- View this

Re: Running Solr6 on Tomcat7

2017-04-21 Thread ankur.168
As Shawn said, it is not recommended, still if you want to do this you can follow these steps(picked from following post http://lucene.472066.n3.nabble.com/Running-Solr-6-3-on-Tomcat-Help-Please-td4320874.html) The following instructions work with Solr 6.2 + Tomcat 8.5: 1. Copy

Re: Update schema.xml without restarting Solr?

2017-04-21 Thread Mikhail Khludnev
> the funny part is curl is not returning anything such either error or success can you add -v or so to curl, to see http status code at least? On Fri, Apr 21, 2017 at 11:20 AM, Lingeshm wrote: > Hello Team , > > I can’t change the schema name of an existing index. >

Re: Update schema.xml without restarting Solr?

2017-04-21 Thread Shawn Heisey
On 4/21/2017 2:20 AM, Lingeshm wrote: > I can’t change the schema name of an existing index. > > I want to change the schema name from “schemaV1" to “schemaV2 for one of the > existing index > > curl -XPUT http://localhost:8098/search/index/my_idx-H "Content-Type: > application/json" -d

Re: Running Solr6 on Tomcat7

2017-04-21 Thread rgummadi
Thanks. I will try your steps. -- View this message in context: http://lucene.472066.n3.nabble.com/Running-Solr6-on-Tomcat7-tp4330500p4331258.html Sent from the Solr - User mailing list archive at Nabble.com.

Overseer session expires on multiple collection creation

2017-04-21 Thread apoorvqwerty
Hi, I am trying to create multiple collections with 2 shards and 2 replications each. After 5-6 successful overseer status response for 5 creations shows 40k requests for collection_operations=>am_i_leader which is a bit odd. and I get Am I not supposed to create 8-10 collections one after the

Re: Update schema.xml without restarting Solr?

2017-04-21 Thread Alexandre Rafalovitch
I would say that all this points at existence of middle-ware in front of Solr. Therefore, the next action would be to identify the middle-ware and ask this question on _their_ mailing list. Regards, Alex. http://www.solr-start.com/ - Resources for Solr users, new and experienced On 21

Re: prefix facet performance

2017-04-21 Thread Maria Muslea
Actually using facet.method=enum made a HUGE difference even in my case where I have many unique values. I am happy with the query response time now. Is there a way in SOLR to count the unique values for a field? If not, I could run the reindexing and count the unique values while I add them to

Re: HttpSolrServer commit is taking more time

2017-04-21 Thread Venkateswarlu Bommineni
Thanks for the reply. I can see same configuration as given in mail in Solr configuration file But I can see same performance issues while querying also through solrJ. Thanks, Venkat. On 21 Apr 2017 9:30 am, "Shawn Heisey" wrote: > On 4/20/2017 9:23 PM, Venkateswarlu

Re: Enable https for Solr

2017-04-21 Thread Steve Rowe
Hi Edwin, See . -- Steve www.lucidworks.com > On Apr 21, 2017, at 12:03 AM, Zheng Lin Edwin Yeo > wrote: > > Hi, > > I would like to find out, how can we allow Solr to accept secure > connections via https?

How to use Wordnet in solr?

2017-04-21 Thread Pablo Anzorena
Hey, I'm planning to use Wordnet and I want to know how. There's a class called *WordnetSynonymParser *, does anybody use it? It says it is experimental... I'm using solr 5.2.1 Briefly speaking about my needs: I have different collections in different languages (fr, pr, sp, en). When the user

Re: How to use Wordnet in solr?

2017-04-21 Thread Alexandre Rafalovitch
I am not sure WordnetSynonymParser is accessible from Solr. At least I never heard anybody mention it. I am also aware of https://github.com/nicholasding/solr-lemmatizer but that's lematizer, not a synonym builder. But perhaps there are some lessons/code in there that could be useful. Regards,

Modify solr score

2017-04-21 Thread tstusr
Hi. We are making an application that searches for certain specific topics, as many captured words on a document the higher the score. We have 2 scenarios of testing. The first one with documents that users tag as relevant and other ones that contains documents out of our domain. In first

Re: How to use Wordnet in solr?

2017-04-21 Thread Steve Rowe
From (also applies to SynonymFilter): - format: (optional; default: solr) Controls how the synonyms will be parsed. The short names solr (for SolrSynonymParser) and wordnet (for

Re: Modify solr score

2017-04-21 Thread Walter Underwood
It isn’t going to work. The score is not an absolute relevance measurement. It only says that the first document is more relevant than the second, and so on. Scores are not comparable between different queries. The score cannot be used to say that the first hit for query A is a better match

Re: How to use Wordnet in solr?

2017-04-21 Thread alessandro.benedetti
Hi Pablo, with wordnet format , Solr will just parse synonyms from a different file format [1] . The rest will work exactly the same. You will use a managed resource to load the file and then potentially update it. If you were thinking to use directly the online resource, you may need to customize

DateRangeField and Faceting

2017-04-21 Thread Stephen Weiss
Hi everyone, Just trying to do a sense check on this. I'm trying to do a facet based off a DateRangeField and I'm hitting this error: Error from server at http://172.20.141.150:8983/solr/instock_au_shard1_replica0: Unable to range facet on

Re: prefix facet performance

2017-04-21 Thread alessandro.benedetti
That is quite interesting ! You can use the stats module ( in association with the Json facets if you need it) to calculate an accurate approximation of the unique values [1] [2] . Good to know it improved your scenario, I may need to update my knowledge of term enum internals! Can you describe

Re: Modify solr score

2017-04-21 Thread alessandro.benedetti
It has been discussed countless times, never rely on score values. Rely on the ranking of your results. It seems you model a as a least of keywords and then you just run a query for each topic. Essentially for you, a is a query. The ranking of your results will already be affected by how many

Re: Modify solr score

2017-04-21 Thread tstusr
Since we report the score, we think there will be some relation between them. As far as we know scoring (and then ranking) are calculated based on tf-idf. What we want to do is to make a qualitative ranking, it means, according to one topic we will tag documents as "very related", "fairly

Re: Modify solr score

2017-04-21 Thread Rick Leir
Ulf: Maybe there is a way you could filter out the unrelated documents. Qf? Rick On April 21, 2017 2:18:59 PM EDT, tstusr wrote: >Well, I know they can change. > >I think, the main problem here it that (in this point) documents >completely >unrelated to a topic are being

Re: prefix facet performance

2017-04-21 Thread Maria Muslea
The field is: and using unique() I found that it has 700K+ unique values. The query before (that takes ~10s): wt=json=true=*:*=0=true=concept=A/ the query after (that is almost instant): wt=json=true=*:*=0=true=concept=A/=enum' Maria On Fri, Apr 21, 2017 at 8:59 AM, alessandro.benedetti

Re: Modify solr score

2017-04-21 Thread tstusr
Well, maybe I explain it wrong. We have entry points, each of them are related to a topic. It mens that when we select the first topic all information has to be related in some way to this vocabulary. So, it can work since we select documents not related to each vocabulary of every entry point.

Re: Modify solr score

2017-04-21 Thread Walter Underwood
Using a minimum score cut off does not work. The score is not an absolute estimate of relevance. The idf component of the score is a whole-corpus metric. When you add or delete documents, the scores for the exact same query can change. wunder Walter Underwood wun...@wunderwood.org

Re: Enable https for Solr

2017-04-21 Thread Zheng Lin Edwin Yeo
Thank you Steve. I have managed to set up the SSL, and the query via https is working now. However, I am getting this error when I tried to do indexing using SolrJ. I have already changed the URL to pass using https. What could be the reason that causes this?

Re: Enable https for Solr

2017-04-21 Thread Steve Rowe
Edwin, Did you set the required keystore/truststore/password system properties? See the example at -- Steve www.lucidworks.com > On Apr 21, 2017, at 1:44 PM, Zheng Lin Edwin Yeo

Re: Modify solr score

2017-04-21 Thread tstusr
Well, I know they can change. I think, the main problem here it that (in this point) documents completely unrelated to a topic are being ranked as high as documents related. So, in order to penalize them we are trying to use the ratio or term frequency/word length. Nevertheless we aren't able to

Re: How to use Wordnet in solr?

2017-04-21 Thread Pablo Anzorena
Thanks to everybody. I will try first Alessandro and Steve recommendation. If i don't misunderstood, you are telling me that I have to customize the prolog files to "solr txt synonyms syntax"? If that is correct, what is the point of format:wordnet ? 2017-04-21 12:52 GMT-03:00

php apache solr client - Solr HTTP Error 58: 'Problem with the local SSL certificate'

2017-04-21 Thread bay chae
Hi, Apologies if this is an inappropriate place to contact please redirect if this is the case. I have successfully setup solr(6.5.0) with ssl in my dev environment and can get a proper response using the following curl request: curl -E ./etc/solr-ssl.keystore.p12:secret --cacert

Re: DateRangeField and Faceting

2017-04-21 Thread Stephen Weiss
One small detail - I just realized I've been doing JSON faceting and the wiki refers to old-school faceting. Old-school faceting indeed does work but the problem is the facet is ultimately one of a whole tree of stats I'm collecting, so JSON facet is far more convenient for my use case (I

Re: prefix facet performance

2017-04-21 Thread Yonik Seeley
On Fri, Apr 21, 2017 at 4:25 PM, Maria Muslea wrote: > The field is: > > > > and using unique() I found that it has 700K+ unique values. > > The query before (that takes ~10s): > > wt=json=true=*:*=0=true=concept=A/ > > the query after (that is almost instant): > >

Re: prefix facet performance

2017-04-21 Thread Maria Muslea
I see. Once I specify a prefix the number of terms is MUCH smaller. Thank you again for all your help. Maria On Fri, Apr 21, 2017 at 1:46 PM, Yonik Seeley wrote: > On Fri, Apr 21, 2017 at 4:25 PM, Maria Muslea > wrote: > > The field is: > > > > > >