Database vs Solr : ID based filtering

2015-03-26 Thread Aman Tandon
Hi, Does an ID based filtering on solr will perform poor than DB? - http://localhost:8983/solr/select?q=*&fq=id:153 *OR* - select * from TABLE where id=153 With Regards Aman Tandon

ZFS File System for SOLR 3.6 and SOLR 4

2015-03-26 Thread abhi Abhishek
Hello, i am trying to use ZFS as filesystem for my Linux Environment. are there any performance implications of using any filesystem other than ext-3/ext-4 with SOLR? Thanks in Advance Best Regards, Abhishek

Re: SOLR Index in shared/Network folder

2015-03-26 Thread Shawn Heisey
On 3/27/2015 12:06 AM, abhi Abhishek wrote: > Greetings, > I am trying to use a network shared location as my index directory. > are there any known problems in using a Network File System for running a > SOLR Instance? It is not recommended. You will probably need to change the lockType, .

SOLR Index in shared/Network folder

2015-03-26 Thread abhi Abhishek
Greetings, I am trying to use a network shared location as my index directory. are there any known problems in using a Network File System for running a SOLR Instance? Thanks in Advance. Best Regards, Abhishek

Different methods of sending documents to Solr

2015-03-26 Thread zhangxin0804
Hi All, I am trying to post data into Solr using curl command. Does anybody could tell me the difference between the following two methods? Method1: curl "http://localhost:8983/solr/update/extract?literal.id=doc1&commit=true"; -F "myfile=@tutorial.html" The -F flag instructs curl to POST dat

solr server datetime

2015-03-26 Thread fjq
Is it possible to retrieve the server datetime? -- View this message in context: http://lucene.472066.n3.nabble.com/solr-server-datetime-tp4195728.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: i'm a newb: questions about schema.xml

2015-03-26 Thread Zheng Lin Edwin Yeo
Yes, this is the correct page which will tell you more about this "managed-schema" thing in Solr 5.0.0. I got stuck in this for quite a while previously too. Regards, Edwin On 27 March 2015 at 08:20, Mark Bramer wrote: > Pretty sure I found what I am looking for: > https://cwiki.apache.org/conf

Re: Solr Monitoring - Stored Stats?

2015-03-26 Thread Otis Gospodnetic
Matt, SPM will give you all that out of the box with alerts, anomaly detection etc. See http://sematext.com/spm Otis > On Mar 25, 2015, at 11:26, Matt Kuiper wrote: > > Hello, > > I am familiar with the JMX points that Solr exposes to allow for monitoring > of statistics like QPS, numdoc

RE: i'm a newb: questions about schema.xml

2015-03-26 Thread Mark Bramer
Pretty sure I found what I am looking for: https://cwiki.apache.org/confluence/display/solr/Managed+Schema+Definition+in+SolrConfig I noticed the "managed-schema" file and a couple Google searches with that finally landed me at that link. Interesting that the file is hidden from the Files lis

Re: i'm a newb: questions about schema.xml

2015-03-26 Thread Erick Erickson
This is key: managed-schema You've managed to get things started with the "managed schema". Therefore, you need to use the REST API to add/subtract/multiply/divide. This is different than "schemaless", although it _is_ related. And they're both different than having a schema.xml to edit. Or start

Re: Build index from Oracle, adding fields

2015-03-26 Thread Julian Perry
On 27/03/2015 12:42, Shawn Heisey wrote: If that's not practical, then the only real option you have is to drop back to one entity, and build a single SELECT statement (using JOIN and some form of CONCAT) that will gather all the information from all the tables at the same time, and combine multi

RE: i'm a newb: questions about schema.xml

2015-03-26 Thread Mark Bramer
Hi Shawn, Definitely helpful to know about the instance and files stuff in Admin. I'm not running cloud, so I looked in the /conf directory but there's no schema.xml: Here's what's in my core's Files: currency.xml elevate.xml lang params.json protwords.txt solrconfig.xml stopword

Re: Build index from Oracle, adding fields

2015-03-26 Thread Shawn Heisey
On 3/26/2015 5:19 PM, Julian Perry wrote: > I have an index with, say, 10 fields. > > I load that index directly from Oracle - data-config.xml using > JDBC. I can load 10 million rows very quickly. This direct > way of loading from Oracle straight into SOLR is fantastic - > really efficient and s

Re: i'm a newb: questions about schema.xml

2015-03-26 Thread Shawn Heisey
On 3/26/2015 4:57 PM, Mark Bramer wrote: > I'm a Solr newb. I've been poking around for several days on my own test > instance, and also online at the info available. But one thing just isn't > jiving and I can't put my finger on why. I've searched many many times but I > don't see what I'm l

Build index from Oracle, adding fields

2015-03-26 Thread Julian Perry
Hi I have looked and cannot see any clear answers to this on the Interwebs. I have an index with, say, 10 fields. I load that index directly from Oracle - data-config.xml using JDBC. I can load 10 million rows very quickly. This direct way of loading from Oracle straight into SOLR is fantast

Re: SolrCloud -- Blocking access to administration commands while keeping the solr internal communication

2015-03-26 Thread Shawn Heisey
On 3/26/2015 3:38 PM, Oded Sofer wrote: > There are many blogs discussing this issue but it is hard to find if someone > had managed to resolve that. > We have many nodes in the SolrCloud, implementing the iptable restriction > will fill the iptable with many rules that will affect performance.

i'm a newb: questions about schema.xml

2015-03-26 Thread Mark Bramer
Hello, I'm a Solr newb. I've been poking around for several days on my own test instance, and also online at the info available. But one thing just isn't jiving and I can't put my finger on why. I've searched many many times but I don't see what I'm looking for, so I'm thinking perhaps I hav

SolrCloud -- Blocking access to administration commands while keeping the solr internal communication

2015-03-26 Thread Oded Sofer
Hello there, There are many blogs discussing this issue but it is hard to find if someone had managed to resolve that. We have many nodes in the SolrCloud, implementing the iptable restriction will fill the iptable with many rules that will affect performance. We are using 4.3.10, on Tomcat 5

Re: Custom TokenFilter

2015-03-26 Thread Test Test
Hi Erick,  For me, this classCastException is caused by the wrong use of TokenFilter.In fieldType declaration (schema.xml), i've put :And instead using TokenizerFactory in my class, i utilize TokenFilterFactory like this :public class SentenceTokenizerFactory  extends TokenFilterFactory  So when

Performance json vs javabin

2015-03-26 Thread Tech MOnkey
Has anyone done performance tests between json and javabin? Scale tipped towards javabin when compared to XML(https://issues.apache.org/jira/browse/SOLR-486). I am curious to know if it is same with json when load is 600 per minute, for example. Thanks,

delta import on changes in entity within a document

2015-03-26 Thread PeterKerk
I have the following data-config: Now, when the object in the [locations] table is updated, my delta import (/dataimport?comm

Re: How to create a core by API?

2015-03-26 Thread Yonik Seeley
On Thu, Mar 26, 2015 at 1:45 PM, Mark E. Haase wrote: > I'm not saying you're wrong. The configSet parameter doesn't work at all in > my set up, so you might be right... I'm just wondering where that's > documented. Trying on current trunk, I got it to work: /opt/code/lusolr_trunk/solr$ curl -XP

Re: Uneven index distribution using composite router

2015-03-26 Thread shamik
Thanks for your reply Eric. In my case, I've 14 languages, out of which 50% of the documents belong to English. German and CHS will probably constitute another 25%. I'm not using copyfield, rather, each language has it's dedicated field such as title_enu, text_enu, title_ger,text_ger, etc. Since I

Re: How to create a core by API?

2015-03-26 Thread Erick Erickson
Got to the comments section and add any corrections you'd like, that'll get bubbled up. Best, Erick On Thu, Mar 26, 2015 at 10:45 AM, Mark E. Haase wrote: > On Thu, Mar 26, 2015 at 1:31 PM, Erick Erickson > wrote: > >> Hmmm, looks like I stand corrected. I haven't kept complete track >> there,

Re: How to create a core by API?

2015-03-26 Thread Mark E. Haase
On Thu, Mar 26, 2015 at 1:31 PM, Erick Erickson wrote: > Hmmm, looks like I stand corrected. I haven't kept complete track > there, looks like this one didn't stick in my head. I'm not saying you're wrong. The configSet parameter doesn't work at all in my set up, so you might be right... I'm ju

Re: Solr Monitoring - Stored Stats?

2015-03-26 Thread Upayavira
Have a look at the admin UI, plugins/stats. I’ve just spent the time to re-implement it in AngularJS, so I know the functionality is there - twice :-) You can “watch for changes” - it pulls in a reference XML, and posts that back to the server, which only reports back changes. Dunno if that give

Re: Uneven index distribution using composite router

2015-03-26 Thread Erick Erickson
right, when you take over routing, making sure the distribution is even is now your responsibility. Your assumption is that the amount of _text_ in each doc is roughly the same between your three languages, have you verified this? And are you doing anything like copyFields that are kicking in on o

Re: How to create a core by API?

2015-03-26 Thread Erick Erickson
Hmmm, looks like I stand corrected. I haven't kept complete track there, looks like this one didn't stick in my head. As far as the docs are concerned, all patches welcome! Best, Erick On Thu, Mar 26, 2015 at 10:26 AM, Mark E. Haase wrote: > Erick, are you sure that configSets don't apply to si

Uneven index distribution using composite router

2015-03-26 Thread Shamik Bandopadhyay
Hi, I'm using a three level composite router in a solr cloud environment, primarily for multi-tenant and field collapsing. The format is as follows. *language!topic!url*. An example would be : ENU!12345!www.testurl.com/enu/doc1 GER!12345!www.testurl.com/ger/doc2 CHS!67890!www.testurl.com/chs

Re: How to create a core by API?

2015-03-26 Thread Mark E. Haase
Erick, are you sure that configSets don't apply to single-node Solr instances? https://cwiki.apache.org/confluence/display/solr/Config+Sets I don't see anything about Solr cloud there. Also, "configSet" is a documented argument to the Core Admin API: https://cwiki.apache.org/confluence/display/s

Re: How to create a core by API?

2015-03-26 Thread Mark E. Haase
Okay, thanks for the feedback. I'll admit that I do find the cloud vs non-cloud deployment options a constant source of confusion, not the least of which is due to the name. If I run a single Solr instance on EC2, that's not "cloud", but if I run a few instances with ZK on my local LAN, that is "cl

Re: How to create a core by API?

2015-03-26 Thread Erick Erickson
Ok, you're being confused by cloud, non cloud and all that kinda stuff Configsets are SolrCloud only, so forget them since you specified it's not SolrCloud. bq: surely the HTTP API doesn't require the caller to create a directory and copy files first, does it In fact, yes. The thing to remem

Re: How to create a core by API?

2015-03-26 Thread Shawn Heisey
On 3/26/2015 10:24 AM, Mark E. Haase wrote: > I can't get the Core Admin API to work. I have a brand new installation of > Solr 5.0.0 (in non-cloud mode). I installed using the installation script > (a nice addition!) with default options, so I have Solr in /opt/solr and > its data in /var/solr. >

Re: Applying Tokenizers and Filters to CopyFields

2015-03-26 Thread Erick Erickson
Glad it worked out... Looking back, I can't believe I didn't mention adding &debug=query to the URL. That would have shown you exactly what the parsed query looked like and you'd have seen right off that it wasn't searching against the field you thought it was. It's one of the first things I do wh

How to create a core by API?

2015-03-26 Thread Mark E. Haase
I can't get the Core Admin API to work. I have a brand new installation of Solr 5.0.0 (in non-cloud mode). I installed using the installation script (a nice addition!) with default options, so I have Solr in /opt/solr and its data in /var/solr. Here's what I'm trying: curl 'http://localhost:8

Re: Replacing a group of documents (Delete/Insert) without a query on the index ever showing an empty list (Docs)

2015-03-26 Thread Shawn Heisey
On 3/26/2015 9:53 AM, Russell Taylor wrote: > I have an index which is made up of groups of documents, each group is > defined by a field called keyField (keyField:A). > I need to delete all the keyField:A documents and replace them with a brand > new set without the index ever returning > zero d

Replacing a group of documents (Delete/Insert) without a query on the index ever showing an empty list (Docs)

2015-03-26 Thread Russell Taylor
Hi, I have an index which is made up of groups of documents, each group is defined by a field called keyField (keyField:A). I need to delete all the keyField:A documents and replace them with a brand new set without the index ever returning zero documents on a query. At the moment I deleteByQuer

Re: Applying Tokenizers and Filters to CopyFields

2015-03-26 Thread Michael Della Bitta
Glad you are sorted out! Michael Della Bitta Senior Software Engineer o: +1 646 532 3062 appinions inc. “The Science of Influence Marketing” 18 East 41st Street New York, NY 10017 t: @appinions | g+: plus.google.com/appinions

RE: Solr Monitoring - Stored Stats?

2015-03-26 Thread Matt Kuiper
Erick, Shawn, Thanks for your responses. I figured this was the case, just wanted to check to be sure. I have used Zabbix to configure JMX points to monitor over time, but it was a bit of work to get configured. We are looking to create a simple dashboard of a few stats over time. Looks lik

Re: Applying Tokenizers and Filters to CopyFields

2015-03-26 Thread Martin Wunderlich
Thanks so much, Erick and Michael, for all the additional explanation. The crucial information in the end turned out to be the one about the Default Search Field („df“). In solrconfig.xml this parameter was to point to the original text, which is why the expanded queries didn’t work. When I set

Re: Running test cases with ant

2015-03-26 Thread Shawn Heisey
On 3/26/2015 6:40 AM, Mrinali Agarwal wrote: > I am trying to run my test cases in solr using ant . > I am using below command > > ant test –Dtestcase=Test -Dtests.leaveTemporary=true > > Now , here i have my own custom schema & solrConfig . On running the above > command on solr directiory

Running test cases with ant

2015-03-26 Thread Mrinali Agarwal
Hello , I am trying to run my test cases in solr using ant . I am using below command ant test –Dtestcase=Test -Dtests.leaveTemporary=true Now , here i have my own custom schema & solrConfig . On running the above command on solr directiory , it builds the project again which overrides my

Installing the auto-phrase-tokenfilter

2015-03-26 Thread luismart
hello, I am after installing the auto-phrase-tokenfilter from https://github.com/LucidWorks/auto-phrase-tokenfilter. Can anyone point me to some documentation on how to do this? Thanks Luis Martinez -- View this message in context: http://lucene.472066.n3.nabble.com/Installing-the-auto-ph

RE: German Compound Splitter words.fst causing problems.

2015-03-26 Thread Christopher Morley
Thanks for the tip Markus. We are using this filter to decompound German words. Update: I am on the path to victory. The words.fst file is actually built by the plugin, however there is a basic input&output file format mismatch (at the byte level) that doesn't occur with 4.0. As soon as you

Re: How partial are partial updates

2015-03-26 Thread Mikhail Khludnev
On Thu, Mar 26, 2015 at 12:23 PM, kennyk wrote: > Does solr have to reindex the whole document and not just the modified > fields? > yep. you are right. > If so, can you give me an idea of the amount (factor) of speed > gained by partial re-indexing? > it's exactly the same what you have in in

How partial are partial updates

2015-03-26 Thread kennyk
Hi all, I have a question. Here I read that /Solr supports several modifiers that atomically update values of a document. This allows updating only specific fields,/ and that /All original source fields must be stor