Re: Facets, termvectors, relevancy and Multi word tokenizing

2014-03-11 Thread epnRui
Hi Iorixxx! I have not optimized the index, but the day after this post I saw that I didn't have this problem anymore. I will follow your advice next time! Now I'm avoiding so much manipulation at index time and I'm doing more work in the Java code on the client side. If I had time I would

Re: Implementing a customised tokenizer

2014-03-11 Thread epnRui
Hi Ahmet, I think the expungeDeletes is done automatically through SolrJ, so I don't think it was that. The problem apparently solved itself. I wonder if it has to do with an automatic optimization of Solr indexes? Otherwise it was something similar to an XY problem :P Thanks for the help!

Re: Implementing a customised tokenizer

2014-03-07 Thread epnRui
Hi iorixxx! Thanks for replying. I managed to work around it well enough not to need a custom tokenizer implementation. That would be a pain in ... Anyway, now I have another problem, which is related to the following: - I had previously used replace chars and replace patterns, charfilters and

Re: Facets, termvectors, relevancy and Multi word tokenizing

2014-03-07 Thread epnRui
Hi guys! I solved my problem on the client side but at least I solved it... Anyway, now I have another problem, which is related to the following: - I had previously used replace chars and replace patterns, charfilters and filters, at index time to replace EP by European Parliament. At that

Implementing a customised tokenizer

2014-03-05 Thread epnRui
I have managed to understand how to properly implement and change the words in a CharFilter and a Filter, but I fail to understand how the Tokenizer works... I also can't find any tutorials on it. Could you provide some example implementation of incrementToken and how to manipulate the

Re: Facets, termvectors, relevancy and Multi word tokenizing

2014-03-05 Thread epnRui
Hi guys, So, I keep facing this problem which I can't solve. I thought it was due to HTML anchors containing the name of the hashtag, and thus repeating it, but it's not. So the use case is: 1 - I need to consider hashtags as tokens. 2 - The hashtag has to show up in the facets. Right now if I

Re: Facets, termvectors, relevancy and Multi word tokenizing

2014-03-03 Thread epnRui
Hi guys, I'm on my way to solving it properly. This is what my field looks like now: <fieldType name="text_en" class="solr.TextField" positionIncrementGap="100"> <analyzer type="index"> <charFilter class="solr.PatternReplaceCharFilterFactory" pattern="(#)|(%23)"

Re: Facets, termvectors, relevancy and Multi word tokenizing

2014-02-28 Thread epnRui
Hi Ahmet!! I went ahead and did something I thought was not a clean solution, and then when I read your post I found we had thought of the same solution, including the European_Parliament with the _ :) So I guess there would be no way to do this more cleanly, maybe only implementing my own

Facets, termvectors, relevancy and Multi word tokenizing

2014-02-27 Thread epnRui
Hi everyone! I'm having a problem; I have searched and haven't found a solution yet, and I am rather confused at the moment. I have an application that stores human-readable texts in my Solr index. It finds the most relevant terms in that human-readable text, I think using termvectors and

Re: setting up master and slave in same machine with diff ip's and same port

2013-01-31 Thread epnRui
Hi, I solved the issue by setting up two different virtual network adapters in Ubuntu Server. Case closed ;) Thanks for the help!!

Stopping solr

2013-01-31 Thread epnRui
Hi people, First of all, this forum is a godsend!!! Second: I have a master / slave configuration, using replication. Currently in production I have only one server; there's no backup server (really...). The web application is a public web application, everyone can see it. - How often, in

setting up master and slave in same machine with diff ip's and same port

2013-01-23 Thread epnRui
Hi everyone, it's my first post here so I hope I'm doing it in the right place. I'm a software developer and I'm setting up a DEV environment in Ubuntu with the same configuration as in PROD. (Apparently this IT department doesn't know the difference between a developer and a sys admin.) In PROD we