Re: Solr Auto-Complete

2015-12-04 Thread Alexandre Rafalovitch
You can see an example of similar use at: http://www.solr-start.com/javadoc/solr-lucene/index.html (search box). The corresponding schema is here: https://github.com/arafalov/Solr-Javadoc/blob/master/JavadocIndex/JavadocCollection/conf/schema.xml#L24 . It does have some extra special-case stuff

Re: Highlighting large documents

2015-12-04 Thread Zheng Lin Edwin Yeo
Hi Andrea, I'm using the original highlighter. Below is my configuration for the highlighter in solrconfig.xml explicit 10 json true text id, title, content_type, last_modified, url, score on id, title, content, author

Re: Highlighting large documents

2015-12-04 Thread Andrea Gazzarini
Hi Zheng, just curiousity, because shortly I will have to deal with a similar scenario (Solr 5.3.1 + large documents + highlighting). Which highlighter are you using? Andrea 2015-12-04 16:51 GMT+01:00 Zheng Lin Edwin Yeo : > Hi, > > I'm using Solr 5.3.0 > > I found that in

Some errors migrating to solr cloud

2015-12-04 Thread tedsolr
I had a fairly simple plan for migrating my single solr instance with multiple cores, to a solrcloud implementation where core => collection. My testing locally (windows) worked fine, but the first linux (development) environment I tried to migrate had some failures. This is v5.2.1. The setup:

Re: Solr 5: Schema.xml vs. Managed Schema - which is advisable?

2015-12-04 Thread Rick Leir
On Fri, Dec 4, 2015 at 12:59 AM, wrote: > > >Just wondering if folks have any suggestions on using Schema.xml vs. > >Managed Schema going forward. > > We are using loosely typed languages (Perl and Javascript), and a loosely typed DB (CouchDB). This is

Re: Solr Auto-Complete

2015-12-04 Thread Salman Ansari
Thanks Alan, Alessandaro and Andrea for your great explanations. I will follow the path of adding edge ngrams to the field type for my use case. Regards, Salman On Thu, Dec 3, 2015 at 12:23 PM, Alessandro Benedetti wrote: > "Sounds good but I heard "/suggest" component

Highlighting large documents

2015-12-04 Thread Zheng Lin Edwin Yeo
Hi, I'm using Solr 5.3.0 I found that in large documents, sometimes I face situation that when I do a highlight query, the resultset that is returned does not contain the highlighted query. There are actually matches in the documents, but just that they located further back in the documents. I

Re: Solrcloud - adding a node as a replica?

2015-12-04 Thread Mugeesh Husain
Hi, kamaci I have 3 server, solr1, solr2 and solr3 I want to create 3 core in server solr1,and solr2 Not solr 3 in any core/collection. I want to create 3 replica wrt to above core, replica would be in server solr3. When i create one core using bin/solr create -c abc -shard 1 -replicaFactor 1,

Solrcloud: 1 server, 1 configset, multiple collections, multiple schemas

2015-12-04 Thread bengates
Hello, I'm having usage issues with *Solrcloud*. What I want to do: - Manage a solr server *only with the API* (create / reload / delete collections, create / replace / delete fields, etc). - A new collection should* start with pre-defined default fields, fieldTypes and copyFields* (let's say,

"OnException" extension to SearchComponents / finalize on search components

2015-12-04 Thread deansg
I was recently writing a SearchComponent that performs a certain action in the prepare method when in distributed context, and then must perform another action after the query finished running to clean up (one of the finishStage calls). I realized that if the SearchHandler on the server throws an

Re: migrate(or copy) data from one core1(node2) to anothere core2(node1)

2015-12-04 Thread Mugeesh Husain
Hello Erick, I did shutdown all node in solrcloud and copy data directory from non solcoud, after that i start all of node, but new data is not refleted. One doubt >> Am i need to shutdown zookeeper node also ? or also clean data from zk directory -- View this message in

Reloading the collection timed out

2015-12-04 Thread Troy Edwards
After running Solr on a linux box for about 15 days; today when I tried to reload collections I got the following error reload the collection time out:180s org.apache.solr.common.SolrException: reload the collection time out:180s at

RE: Use multiple istance simultaneously

2015-12-04 Thread Gian Maria Ricci - aka Alkampfer
Many thanks for your response. I worked with Solr until early version 4.0, then switched to ElasticSearch for a variety of reasons. I've used replication in the past with SolR, but with Elasticsearch basically I had no problem because it works similar to SolrCloud by default and with almost zero

Re: Solrcloud: 1 server, 1 configset, multiple collections, multiple schemas

2015-12-04 Thread Jeff Wartes
If you want two different collections to have two different schemas, those collections need to reference two different configsets. So you need another copy of your config available using a different name, and to reference that other name when you create the second collection. On 12/4/15, 6:26

Re: Highlighting large documents

2015-12-04 Thread Andrea Gazzarini
No no, sorry, the project is not yet started so I didn't experience your issue, but I'll be a careful listener of this thread Best, Andrea 2015-12-04 17:04 GMT+01:00 Zheng Lin Edwin Yeo : > Hi Andrea, > > I'm using the original highlighter. > > Below is my configuration

Re: schema fileds and Typefield in solr-5.3.1

2015-12-04 Thread kostali hassan
thank you Erick, i follow you advice and take a look to config apache tika, I have modifie my request handler /update/extract: last_modified ignored_ true links ignored_ D:\solr\solr-5.3.1\server\solr\tika-data-config.xml and config tika :

Re: schema fileds and Typefield in solr-5.3.1

2015-12-04 Thread Erik Hatcher
Kostali - See if the "Introspect rich document parsing and extraction” section of http://lucidworks.com/blog/2015/08/04/solr-5-new-binpost-utility/ helps*. You’ll be able to see the output of /update/extract (aka Tika) and adjust your mappings and configurations accordingly. * And apologies

Re: schema fileds and Typefield in solr-5.3.1

2015-12-04 Thread kostali hassan
thank you , that's why I choose to add the exact value using solarium PHP Client, but the time out stop indexing after 30seconde: $dir = new Folder($dossier); $files = $dir->find('.*\.*'); foreach ($files as $file) { $file = new File($dir->pwd() . DS . $file); $query =

Re: migrate(or copy) data from one core1(node2) to anothere core2(node1)

2015-12-04 Thread Erick Erickson
Really, this is probably not a great path to go down. If you are saying you want a leader and follower situation, you should just copy to the leader, bring it up then use the ADDREPLICA command to add the replica. That said, I'd seriously consider just defining a new cluster in your SolrCloud

RE: Data Import Handler - Multivalued fields - splitBy

2015-12-04 Thread Dyer, James
Brian, Be sure to have... transformer="RegexTransformer" ...in your tag. It’s the RegexTransformer class that looks for "splitBy". See https://wiki.apache.org/solr/DataImportHandler#RegexTransformer for more information. James Dyer Ingram Content Group -Original Message- From:

Re: Solr 5: Schema.xml vs. Managed Schema - which is advisable?

2015-12-04 Thread Erick Erickson
Actually, I rather agree with your colleagues, but then I'm something of a curmudgeon. More accurately, unless you _strictly_ control the input documents, you never know what you have in your index. I'd rather have docs fail indexing than be indexed with, say, typos in the field names FWIW,

Re: Solr 5: Schema.xml vs. Managed Schema - which is advisable?

2015-12-04 Thread Alexandre Rafalovitch
Not that hard to setup a cron and diff job and email when the diff is not-empty. A sort-of "is that what you expected" report. But, for myself, I also prefer schema and then managed. I do not like schemaless mode, even for development. Instead, I prefer to do "dynamicField *". P.s. I am thinking

RE: Solr 5: Schema.xml vs. Managed Schema - which is advisable?

2015-12-04 Thread Davis, Daniel (NIH/NLM) [C]
So, I actually went to an Elastic Search one day conference. One person spoke about having to re-index everything because they had their field mappings wrong. I've also worked on Linked Data, RDF, where the fact that everything is a triple is supposed to make SQL schemas unneeded. The

Data Import Handler - Multivalued fields - splitBy

2015-12-04 Thread Brian Narsi
I have the following: I believe I had the following working (splitting on pipe delimited) But it does not work now. In-fact now I have even tried But I cannot get the values to split into an array. Any thoughts/suggestions what may be wrong? Thanks,

Indexing Wikipedia

2015-12-04 Thread Kate Kas
Hi, i tried to index .xml files from wikipedia articles ( https://dumps.wikimedia.org/enwiki/20150702/) using the method, which is proposed by solr tutorial ( https://wiki.apache.org/solr/DataImportHandler#Example:_Indexing_wikipedia). I think that some fields are not indexed, because when i use

Re: Using properties placeholder ${someProperty} for xml node attribute in solrconfig

2015-12-04 Thread Pushkar Raste
Thanks Erick, I verified that we can use properties placeholders for attributes on a xml node. One last question. I was reading through CommitTracker and looks like setting maxTime for 'autoCommit' or ' autoSoftCommit' will disable commits. Is my understanding right? On 3 December 2015 at 15:40,

Re: Solr 5: Schema.xml vs. Managed Schema - which is advisable?

2015-12-04 Thread Upayavira
This is exactly right. Schemaless can be a great discovery tool, but not something it is useful to use in production, I'd say. On Fri, Dec 4, 2015, at 08:21 PM, Davis, Daniel (NIH/NLM) [C] wrote: > So, I actually went to an Elastic Search one day conference. One person > spoke about having to

Authorization API versus zkcli.sh

2015-12-04 Thread Oakley, Craig (NIH/NLM/NCBI) [C]
Looking through cwiki.apache.org/confluence/display/solr/Authentication+and+Authorization+Plugins one notices that security.json is initially created by zkcli.sh, and then modified by means of the Authentication API and the Authorization API. By and large, this sounds like a good way to

Re: Stop adding content in Solr through /update URL

2015-12-04 Thread Chris Hostetter
: You could add 'enable' flag in the solrconfig.xml and then : enable/disable it differently on different servers: Off the top of my head, i'm not certain if enable="false" on a "/update" handler will actaully do what the user wants -- it might prevent a handler from existing at that path; but

Max indexing threads & RamBuffered size

2015-12-04 Thread KNitin
Hi, The max indexing threads in the solrconfig.xml is set to 8 by default. Does this mean only 8 concurrent indexing threads will be allowed per collection level? or per core level? Buffered size : This seems to be set at 64Mb. If we have beefier machine that can take more load, can we set this

Re: Stop adding content in Solr through /update URL

2015-12-04 Thread Alexandre Rafalovitch
On 4 December 2015 at 19:23, Chris Hostetter wrote: > NotFoundRequestHandler Totally not in either Wiki or Reference Guide. :-( Must be part of the secret committer's lore. Thank you for sharing it with us, pure plebs :-) Newsletter and resources for Solr

Re: Data Import Handler - Multivalued fields - splitBy

2015-12-04 Thread Brian Narsi
That was it! Thank you! On Fri, Dec 4, 2015 at 3:13 PM, Dyer, James wrote: > Brian, > > Be sure to have... > > transformer="RegexTransformer" > > ...in your tag. It’s the RegexTransformer class that looks > for "splitBy". > > See

Re: Stop adding content in Solr through /update URL

2015-12-04 Thread Jack Krupansky
Never made it into CHANGES.txt either. Not part of any patch either. Appears to have been secretly committed as a part of SOLR-6787 (Blob API) via Revision *1650448 * in Solr 5.1. -- Jack Krupansky On Fri, Dec 4, 2015 at 10:54 PM, Alexandre

Re: Indexing Wikipedia

2015-12-04 Thread Paul Libbrecht
SImply... some fields are not stored so they are only searched through (being indexed) but not given back? (title and text in the tutorial you refer to). Are these the missing fields? Paul > Kate Kas > 5 décembre 2015 00:23 > Hi, > > i tried to index .xml files from