issue in launching SolrCloud windows/cygwin

2014-10-19 Thread Anurag Sharma
Here is the issue am facing issue in using the 'solr' script on Windows with cygwin terminal: $ bin/solr -e cloud bin/solr: line 16: $'\r': command not found bin/solr: line 17: $'\r': command not found bin/solr: line 46: $'\r': command not found which: no lsof in

Re: issue in launching SolrCloud windows/cygwin

2014-10-19 Thread Jürgen Wagner (DVT)
Hello Anurag, the CRLF problem with Cygwin can be cured by running the scripts all through this filter: tr -d '\r' $script $script.new ; mv $script.new $script with $script holding the path of the script to be massaged. Generally, however, I would advise to use the standard scripts only for

Re: issue in launching SolrCloud windows/cygwin

2014-10-19 Thread Anurag Sharma
Hello Jurgen, Thanks a lot for yoru prompt response. It solved the CRLF problem but the script is not supported on cygwin due to severe limitations and lack of adherence to BASH standards, such as lack of lsof, curl, and ps options. I found there is a native solr.cmd script for windows which

Re: issue in launching SolrCloud windows/cygwin

2014-10-19 Thread Nazik Huq
Run Solr straight from the Windows cmd if CygWin isn't a requirement. For example, running java -jar start.jar from the example directory will start single instance Solr. To run SolrCloud follow the instructions in Simple Two-Shard Cluster on the Same Machine from this link http://bit.ly/1rlmYvF

CopyField from text to multi value

2014-10-19 Thread Tomer Levi
Hi, I would like to copy a textual field content into a multivalue filed. For example, Let's say my field text contains: I am a solr user I would like to have a multi-value copyFields with the following content: [I, am, a, solr, user] Thanks, Tomer Levi Software Engineer Big Data Group Product

Re: issue in launching SolrCloud windows/cygwin

2014-10-19 Thread Anurag Sharma
Hi Nazik, Thanks for the response. The link mentioned by you is very useful. I used the windows cmd and started the cloud using solr.cmd script. The script is very rich in taking multiple options. Anurag On Sun, Oct 19, 2014 at 5:01 PM, Nazik Huq nazik...@gmail.com wrote: Run Solr straight

Re: CopyField from text to multi value

2014-10-19 Thread Erick Erickson
Not quite sure what you're asking here. If you do a copyField, the raw input is, well, copied to the destination field and _then_ the analysis chain is applied. Which seems to be what you want, the destination field would be a text-based field, perhaps text_general or some such from the distro.

RE: CopyField from text to multi value

2014-10-19 Thread Tomer Levi
Hi Erick, Thanks for the explanation, I understand that the analysis chain is applied after the raw input was copied. I need to store the output of the analysis chain as a new multi-value field, and I think that ShingleFilterFactory might do that, isn’t it? Tomer -Original Message-

Re: CopyField from text to multi value

2014-10-19 Thread Jack Krupansky
As always, you need to first examine how you intend to query the fields before you dive into data modeling. In this case, is there any particular reason that you need the individual terms as separate values, as opposed to simply using a tokenized text field? -- Jack Krupansky From: Tomer Levi

Re: CopyField from text to multi value

2014-10-19 Thread Erick Erickson
This really feels like an XY problem, which I think Jack is alluding to. bq: I understand that the analysis chain is applied after the raw input was copied. I need to store the output of the analysis chain as a new multi-value field This statement is really confusing. You can't have the output

RE: CopyField from text to multi value

2014-10-19 Thread Tomer Levi
Thanks again for the help. The use case is this. In my UI I would like to indicate which words leaded to every document in the response. It actually seems like a simple highlight case but instead of getting the highlight result as this is a brlong/br string brwith/br text, Our UI team

Query parsing - difference between Analysis and parsedquery_toString output

2014-10-19 Thread tinush
Hi, I use Solr 4.9 and imported about 20K documents from CSV data. In schema there is following definition for text_general field which I want to process by tokenization, stop word removal, stemming. fieldType name=text_general class=solr.TextField positionIncrementGap=100 analyzer

Re: CopyField from text to multi value

2014-10-19 Thread Walter Underwood
I think that info is available with termvectors. That should give a list of the query terms that matched each document, if I understand it correctly. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ On Oct 19, 2014, at 7:37 AM, Tomer Levi tomer.l...@nice.com wrote:

Re: Query parsing - difference between Analysis and parsedquery_toString output

2014-10-19 Thread Erick Erickson
This trips _everybody_ up. Analysis doesn't happen until things get through the query parser. So, let's assume your query is q=manufacture_t:The Hershey Company^100 OR title_t:The Hershey Company^1000 The problem is that the query _parser_ doesn't understand that your intent is that the hershey

Re: Recovering from Out of Mem

2014-10-19 Thread Salman Akram
I assume you will have to write a script to restart the service as well? On Fri, Oct 17, 2014 at 7:17 PM, Tim Potter tim.pot...@lucidworks.com wrote: You'd still want to kill it ... so you'll need to register a cmd script with the JVM using -XX:OnOutOfMemoryError=kill.cmd and then you could

Re: Recovering from Out of Mem

2014-10-19 Thread Ramzi Alqrainy
You can create a script to ping on Solr every 10 sec. if no response, then restart it (Kill process id and run Solr again). This is the fastest and easiest way to do that on windows. -- View this message in context:

Re: How to properly use Levenstein distance with ~ in Java

2014-10-19 Thread Ramzi Alqrainy
You can use Levenstein Distance algorithm inside solr without writing code by specifing the source of terms in solrconfig.xml searchComponent name=spellcheck class=solr.SpellCheckComponent lst name=spellchecker str name=classnamesolr.IndexBasedSpellChecker/str str