Re: Using SimpleNaiveBayesClassifier in solr

2015-10-14 Thread Alessandro Benedetti
ahahah absolutely not, you don't sound dumb. You need only a basic knowledge of how Lucene manage IndexReaders and IndexSearchers. On 14 October 2015 at 09:08, Yewint Ko wrote: > Thank Ales and Tommaso for your replies > > So, is it like the classifier query the whole index db and load onto mem

Re: Using SimpleNaiveBayesClassifier in solr

2015-10-14 Thread Yewint Ko
Thank Ales and Tommaso for your replies So, is it like the classifier query the whole index db and load onto memory first before running tokenizer against InputDocument? It sounds like if I don't close the classifier and my index is big, i might need bigger machine. Anyway to reverse the order? D

Re: Using SimpleNaiveBayesClassifier in solr

2015-10-12 Thread Alessandro Benedetti
Hi Yewint, > > The sample test code inside seems like that classifier read the whole index > db to train the model everytime when classification happened for > inputDocument. or am I misunderstanding something here? I would suggest you to take a look to a couple of articles I wrote last summer ab

Re: Using SimpleNaiveBayesClassifier in solr

2015-10-12 Thread Tommaso Teofili
Hi Yewint, the SNB classifier is not an online one, so you should retrain it every time you want to update it. What you pass to the Classifier is a Reader therefore you should grant that this keeps being accessible (not close it) for classification to work. Regarding performance SNB becomes slower