RE: run on reboot on windows

2010-05-03 Thread Frederico Azeiteiro
Hi Ahmed, I need to achieve that also. Did you manage to install it as a service and start Solr with Jetty? After installing and starting Jetty as a service, how do you start Solr? Thanks, Frederico -Original Message- From: S Ahmed [mailto:sahmed1...@gmail.com] Sent: Monday, 3 May

Where to check optimize status

2010-06-29 Thread Frederico Azeiteiro
, Frederico Azeiteiro

RE: Where to check optimize status

2010-06-29 Thread Frederico Azeiteiro
Lucene Merge Thread. If it's there, then optimize is still running. Also, index-filesize and filenames in your index-dir are changing a lot... On Tuesday 29 June 2010 12:54:54 Frederico Azeiteiro wrote: Hi, I'm using the Solr 1.4.0 default installation. Is there a place where I can find

steps to improve search

2010-07-02 Thread Frederico Azeiteiro
Hi, I'm using the default text field type in my schema. Is there a quick way to do more accurate searches, such as searching for "paying for it" and getting back only docs with the full expression "paying for it", and not articles with the word "pay" as it does now? Thanks, Frederico

RE: steps to improve search

2010-07-02 Thread Frederico Azeiteiro
Message- From: Leonardo Menezes [mailto:leonardo.menez...@googlemail.com] Sent: Friday, 2 July 2010 12:07 To: solr-user@lucene.apache.org Subject: Re: steps to improve search Try field:text to search On Fri, Jul 2, 2010 at 12:57 PM, Frederico Azeiteiro frederico.azeite

RE: steps to improve search

2010-07-02 Thread Frederico Azeiteiro
. Searching with quotes ("") surrounding the text you are searching for has exactly the effect you are looking for. Try it... On Fri, Jul 2, 2010 at 1:23 PM, Frederico Azeiteiro frederico.azeite...@cision.com wrote: I'm sorry, maybe I didn't explain correctly. The issue is using the default text FIELD
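The advice in this reply appears to be about phrase queries: wrapping the search text in double quotes so that the words must occur together and in order. A minimal sketch of building such a query string on the client side follows. The helper name `phrase_query` and the field name `text` are illustrative assumptions, not part of the thread.

```python
def phrase_query(text, field=None):
    """Wrap text in double quotes so the query parser treats it as one phrase."""
    # Escape backslashes and embedded double quotes so they cannot end the phrase early.
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    quoted = f'"{escaped}"'
    # Prefix with the field name only when one is given (hypothetical field names).
    return f"{field}:{quoted}" if field else quoted


print(phrase_query("paying for it"))
print(phrase_query("paying for it", field="text"))
```

Note that the phrase is still run through the field's analyzer, so a field type with stemming or stop-word filters may match more than the literal expression, which seems to be what the later messages in this thread discuss.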

RE: steps to improve search

2010-07-02 Thread Frederico Azeiteiro
For the example given, I need the full expression "paying for it", so yes, all the words. -Original Message- From: Ahmet Arslan [mailto:iori...@yahoo.com] Sent: Friday, 2 July 2010 12:30 To: solr-user@lucene.apache.org Subject: RE: steps to improve search I need to know how to

RE: steps to improve search

2010-07-02 Thread Frederico Azeiteiro
RemoveDuplicatesTokenFilterFactory StopFilterFactory you get those fake matches. Try going into the admin, to the analysis section. There you can simulate the index/search of a document and see how it's actually searched/indexed. It will give you some clues... On Fri, Jul 2, 2010 at 1:50 PM, Frederico Azeiteiro

Query: URl too long

2010-07-12 Thread Frederico Azeiteiro
Hi, I need to perform a search using a list of values (about 2000). I'm using SolrNET QueryInList function that creates the searchstring like: fieldName: value1 OR fieldName: value2 OR fieldName: value3... (2000 values) This method created a string with about 100 000 chars and the

RE: Query: URl too long

2010-07-12 Thread Frederico Azeiteiro
...@vyre.com] Sent: Monday, 12 July 2010 11:56 To: solr-user@lucene.apache.org Subject: Re: Query: URl too long Hi there, We had a similar issue. It's an easy fix, simply change the request type from GET to POST. Jon On 12 Jul 2010, at 11:18, Frederico Azeiteiro wrote: Hi, I need
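The GET-to-POST suggestion can be sketched as follows: the long OR query goes into a form-encoded request body instead of the URL, so the URL length limit no longer applies. This is only an illustration in Python, not the SolrNet API. The endpoint URL, the `rows` and `wt` values, and the helper names are assumptions.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical endpoint; adjust host, port and path to your own installation.
SOLR_SELECT_URL = "http://localhost:8983/solr/select"


def build_in_list_query(field, values):
    """Build 'field:v1 OR field:v2 OR ...' as described in the original post."""
    return " OR ".join(f"{field}:{value}" for value in values)


def build_post_request(query, rows=10):
    """Put the query parameters in the request body rather than the URL."""
    body = urlencode({"q": query, "rows": rows, "wt": "json"}).encode("utf-8")
    headers = {"Content-Type": "application/x-www-form-urlencoded; charset=UTF-8"}
    # Passing data makes urllib issue a POST instead of a GET.
    return Request(SOLR_SELECT_URL, data=body, headers=headers)


values = [f"value{i}" for i in range(1, 2001)]
request = build_post_request(build_in_list_query("fieldName", values))
print(request.get_method(), len(request.data))
# To actually send it: urllib.request.urlopen(request)
```

The server may still enforce a maximum form size or a maximum number of boolean clauses, so a very large list can need further configuration changes.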

RE: Query: URl too long

2010-07-12 Thread Frederico Azeiteiro
a similar issue. It's an easy fix, simply change the request type from GET to POST. Jon On 12 Jul 2010, at 11:18, Frederico Azeiteiro wrote: Hi, I need to perform a search using a list of values (about 2000). I'm using SolrNET QueryInList function that creates the searchstring

RE: Query: URl too long

2010-07-12 Thread Frederico Azeiteiro
Not an option, because the query has other fields to query also. They are generated through a list of choices (which could go up to 5000 strings of 7 chars each...). I don't know if this could be considered off-topic (please advise...) but: I'm doing some tests with Lucene (Lucene.Net 2.9.2) but the

RE: Query: URl too long

2010-07-12 Thread Frederico Azeiteiro
Ok, I posted on the SOLRNet forum asking how I can reduce the URL string using the POST method. But I'm giving SOLRJ a try. I think that may be the right way to do it. -Original Message- From: Mauricio Scheffer [mailto:mauricioschef...@gmail.com] Sent: Monday, 12 July 2010

java.lang.NullPointerException

2010-07-13 Thread Frederico Azeiteiro
Hi again, I changed the search options to decrease my query size and now I get past the "URI too long" error from the other thread. I already added: <Set name="headerBufferSize">819200</Set> <Set name="requestBufferSize">819200</Set> to the Jetty config, but now I'm stuck again on: 13/Jul/2010 9:41:38

wildcard and proximity searches

2010-07-29 Thread Frederico Azeiteiro
Hi, What approach should I use to perform wildcard and proximity searches? Like: "solr mail*"~10, to get docs where solr is within 10 words of mailing, for instance? Thanks, Frederico

RE: wildcard and proximity searches

2010-07-30 Thread Frederico Azeiteiro
Hi Ahmet, Thank you. I'll be happy to test it if I manage to install it ok. I'm a newbie at Solr but I'm going to try the instructions in the thread to load it. Another doubt I have about wildcard searches: a) I think wildcard search is by default case sensitive? Is there a way to make case

RE: wildcard and proximity searches

2010-07-30 Thread Frederico Azeiteiro
Hi Ahmet, a) I think wildcard search is by default case sensitive? Is there a way to make it case insensitive? Wildcard searches are not analyzed. For case-insensitive search you can lowercase query terms on the client side (while using a lowercase filter at index time), e.g. Mail* = mail* I discovered
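The client-side lowercasing suggested in this message could look roughly like the sketch below. It assumes the indexed field already applies a lowercase filter and that the query is a simple whitespace-separated string. The function name is hypothetical, and quoted phrases or escaped characters are not handled.

```python
def lowercase_wildcard_terms(query):
    """Lowercase only the terms that contain a wildcard, leaving the rest as typed."""
    tokens = []
    for token in query.split():
        if "*" in token or "?" in token:
            # Keep an optional 'field:' prefix untouched; lowercase only the term itself.
            field, sep, term = token.rpartition(":")
            token = f"{field}{sep}{term.lower()}"
        tokens.append(token)
    return " ".join(tokens)


print(lowercase_wildcard_terms("Mail*"))
print(lowercase_wildcard_terms("title:Mail* AND Solr"))
```

Operators such as AND and OR are left alone because they contain no wildcard character.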

RE: wildcard and proximity searches

2010-08-04 Thread Frederico Azeiteiro
the heavy queries Frederico -Original Message- From: Jonathan Rochkind [mailto:rochk...@jhu.edu] Sent: Wednesday, 4 August 2010 01:41 To: solr-user@lucene.apache.org Subject: Re: wildcard and proximity searches Frederico Azeiteiro wrote: But it is unusual to use both leading

timestamp field

2010-08-11 Thread Frederico Azeiteiro
Hi, I have in my schema <field name="timestamp" type="date" indexed="true" stored="true" default="NOW" /> This field is returned as <date name="timestamp">2010-08-11T10:11:03.354Z</date> for an article added at 2010-08-11T11:11:03.354Z! And the server has the time of 2010-08-11T11:11:03.354Z...
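Solr date fields are stored and returned in UTC (the trailing Z), so a one-hour gap like the one described would be consistent with a server clock running at UTC+1. That offset is an assumption read off the two timestamps in the message, not something the thread states. A small sketch of converting the returned value to local time on the client:

```python
from datetime import datetime, timedelta, timezone


def solr_utc_to_local(stamp, utc_offset_hours):
    """Parse a Solr UTC timestamp and shift it to a fixed local offset."""
    utc_time = datetime.strptime(stamp, "%Y-%m-%dT%H:%M:%S.%fZ").replace(tzinfo=timezone.utc)
    # A fixed offset is used for illustration; real code would use a named time zone.
    return utc_time.astimezone(timezone(timedelta(hours=utc_offset_hours)))


local = solr_utc_to_local("2010-08-11T10:11:03.354Z", 1)
print(local.isoformat(timespec="milliseconds"))  # 2010-08-11T11:11:03.354+01:00
```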

RE: timestamp field

2010-08-11 Thread Frederico Azeiteiro
- www.solrtraining.com On 11. aug. 2010, at 13.02, Frederico Azeiteiro wrote: Hi, I have in my schema <field name="timestamp" type="date" indexed="true" stored="true" default="NOW" /> This field is returned as <date name="timestamp">2010-08-11T10:11:03.354Z</date> for an article added

Problem comitting on 40GB index

2010-01-12 Thread Frederico Azeiteiro
Hi all, I started working with Solr about 1 month ago, and everything was running well, both indexing and searching documents. I have a 40GB index with about 10 000 000 documents available. I index 3k docs every 10 min and commit after each insert. Since yesterday, I can't commit any articles to

RE: Problem comitting on 40GB index

2010-01-12 Thread Frederico Azeiteiro
a previous reader opened. So I suspect that sometime recently you've opened another IR. HTH Erick On Tue, Jan 12, 2010 at 8:03 AM, Frederico Azeiteiro frederico.azeite...@cision.com wrote: Hi all, I started working with solr about 1 month ago, and everything was running well both indexing

RE: Problem comitting on 40GB index

2010-01-12 Thread Frederico Azeiteiro
at 9:22 AM, Frederico Azeiteiro frederico.azeite...@cision.com wrote: Hi Erik, I'm a newbie to Solr... By IR, you mean searcher? Is there a place where I can check the open searchers? And shouldn't rebooting the machine have closed those searchers? Thanks, -Original Message- From

RE: Problem comitting on 40GB index

2010-01-13 Thread Frederico Azeiteiro
Sorry, my bad... I replied to a current mailing list message, only changing the subject... I didn't know about this hijacking problem. It will not happen again. Just to close this issue, if I understand correctly, for an index of 40G, I will need, for running an optimize: - 40G if all activity on

RE: Problem comitting on 40GB index

2010-01-13 Thread Frederico Azeiteiro
[mailto:erickerick...@gmail.com] Sent: 13 January 2010 13:33 To: solr-user@lucene.apache.org Subject: Re: Problem comitting on 40GB index That's my understanding.. But fortunately disk space is cheap G On Wed, Jan 13, 2010 at 5:01 AM, Frederico Azeiteiro frederico.azeite...@cision.com wrote

query all filled field?

2010-02-03 Thread Frederico Azeiteiro
Hi all, Is it possible to query some field in order to get only not empty documents? All documents where field x is filled? Thanks, Frederico

RE: query all filled field?

2010-02-03 Thread Frederico Azeiteiro
Ok, if anyone needs it: I tried fieldX:[* TO *] and I think this is correct. In my case I found out that I was not indexing this field correctly, because the values are all empty. :) -Original Message- From: Frederico Azeiteiro [mailto:frederico.azeite...@cision.com] Sent: Wednesday, 3

RE: query all filled field?

2010-02-03 Thread Frederico Azeiteiro
Hum, strange... I reindexed some docs with the field corrected. Now I'm sure the field is filled because fieldX:(*a*) returns docs. But fieldX:[* TO *] is returning the same as *:* (all results). I tried with -fieldX:[* TO *] and I get no results at all. I wonder if someone has tried this

RE: query all filled field?

2010-02-04 Thread Frederico Azeiteiro
: Thursday, 4 February 2010 05:38 To: solr-user@lucene.apache.org Subject: Re: query all filled field? Queries that start with minus or NOT don't work. You have to do this: *:* AND -fieldX:[* TO *] On Wed, Feb 3, 2010 at 5:04 AM, Frederico Azeiteiro frederico.azeite...@cision.com wrote
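The two query shapes discussed in this thread can be written down as small helpers: the open range `[* TO *]` for documents that have any value in a field, and the match-all query combined with a negated range for documents that do not. The function names are hypothetical and `fieldX` is the placeholder field name used in the thread.

```python
def field_has_value(field):
    """Query for documents that have some indexed value in the field."""
    return f"{field}:[* TO *]"


def field_is_missing(field):
    """Query for documents with no value; the *:* gives the negation a set to subtract from."""
    return f"*:* AND -{field}:[* TO *]"


print(field_has_value("fieldX"))
print(field_is_missing("fieldX"))
```

As the later messages in the thread suggest, a field that was indexed with an empty or whitespace value may still count as having a value, so these queries only help once the indexing side omits the field entirely for empty documents.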

RE: query all filled field?

2010-02-04 Thread Frederico Azeiteiro
with field not empty? Thanks again, Frederico -Original Message- From: Frederico Azeiteiro [mailto:frederico.azeite...@cision.com] Sent: Thursday, 4 February 2010 10:55 To: solr-user@lucene.apache.org Subject: RE: query all filled field? Thanks, but still no luck

RE: query all filled field?

2010-02-04 Thread Frederico Azeiteiro
Theoretically yes, it's correct, but I have about 1/10 of the docs with this field not empty and the rest are empty. Most of the articles have the field empty, as I can see when I query *:*. So the queries don't make sense... -Original Message- From: Ankit Bhatnagar

RE: query all filled field?

2010-02-04 Thread Frederico Azeiteiro
XML update. I'm serializing the doc in .NET, and then using solrsharp to insert/update the doc in SOLR. The result is: <doc> <str name="fieldX"/> </doc> Does this mean I'm adding a whitespace on XML update? Frederico -Original Message- From: Ahmet Arslan [mailto:iori...@yahoo.com]

RE: query all filled field?

2010-02-04 Thread Frederico Azeiteiro
I've analyzed my indexing application and checked the XML before executing the HTTP request, and the field is empty: <field name="fieldX" /> It should be empty in SOLR. Probably something on the way between my application (.NET) and SOLR (Jetty on Ubuntu) adds the whitespace. Anyway, I'll try

Reindex after changing defaultSearchField?

2010-02-17 Thread Frederico Azeiteiro
Hi, If I change the defaultSearchField in the core schema, do I need to recreate the index? Thanks, Frederico

Solrsharp

2010-02-26 Thread Frederico Azeiteiro
Hi, I don't know if this list includes this kind of help, but I'm using Solrsharp with C# to operate SOLR. Please advise if this is off-topic. I'm having a little trouble making a search with exclude terms using the query parameters. Does anyone use Solrsharp around here? Do

RE: Solrsharp

2010-02-27 Thread Frederico Azeiteiro
solr.net: http://code.google.com/p/solrnet/. Disclaimer: I haven't evaluated solr.net myself but it looks to be more robust than solr# and is more actively maintained than solr#. S -Original Message- From: Frederico Azeiteiro frederico.azeite...@cision.com To: solr-user@lucene.apache.org

fieldType text

2010-03-02 Thread Frederico Azeiteiro
Hi, I'm using the default text field type that comes with the example. When searching for simple words such as 'HP' or 'TCS', Solr is returning results that contain 'HP1' or 'TCS'. Is there a way to avoid this? Thanks, Frederico

search and count ocurrences

2010-03-02 Thread Frederico Azeiteiro
Hi, I need to implement a search where I should count the number of times the string appears in the search field, i.e. only return articles that mention the word 'HP' at least 2x. I'm currently doing this after the SOLR search with my own methods. Is there a way that SOLR does this type of
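The client-side approach the poster describes (filtering after the search) might look like the sketch below. It is not a Solr feature. It assumes the stored field text is returned with each hit, and the document shape, field name `body`, and helper names are hypothetical.

```python
import re


def mentions_at_least(text, term, minimum=2):
    """True if term occurs as a whole word at least `minimum` times in text."""
    pattern = re.compile(rf"\b{re.escape(term)}\b")
    return len(pattern.findall(text)) >= minimum


def filter_hits(docs, field, term, minimum=2):
    """Keep only the returned documents whose field mentions the term often enough."""
    return [doc for doc in docs if mentions_at_least(doc.get(field, ""), term, minimum)]


hits = [
    {"id": 1, "body": "HP launched a laptop. HP also cut prices."},
    {"id": 2, "body": "The HP1 model is not made by HP."},
]
print([doc["id"] for doc in filter_hits(hits, "body", "HP")])  # [1]
```

The word-boundary match means 'HP1' is not counted as a mention of 'HP', so the second document has only one occurrence and is dropped.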

RE: search and count ocurrences

2010-03-10 Thread Frederico Azeiteiro
Thanks Chris. Could something like that be implemented in C#? :) Does anyone have any link where I can start digging? This is not an urgent matter, just something to investigate and implement in the near future. Frederico -Original Message- From: Chris Hostetter

Cant commit on 125 GB index

2010-03-11 Thread Frederico Azeiteiro
Hi, I'm having timeouts committing on a 125 GB index with about 2200 docs. I'm inserting new docs every 5 min and committing after that. I would like to try the autocommit option and see if I can get better results. I need the indexed docs available for searching within about 10 minutes after

RE: Cant commit on 125 GB index

2010-03-13 Thread Frederico Azeiteiro
, Frederico Azeiteiro frederico.azeite...@cision.com wrote: Hi, I'm having timeouts committing on a 125 GB index with about 2200 docs. I'm inserting new docs every 5 min and committing after that. I would like to try the autocommit option and see if I can get better results. I need

search within sentence or paragraph

2010-03-30 Thread Frederico Azeiteiro
Hi all, Is it possible to search for a combination of words within the same sentence or paragraph? Ex: American and McDonalds. Returns: McDonalds is an American company. Does not return: ...went to McDonalds. After that we saw the American flag... Is this possible? Frederico

Solr 3.5.0 - different behaviour on rows?

2012-03-15 Thread Frederico Azeiteiro
Hi all, Just testing SOLR 3.5.0 and noticed a different behavior in this new version: select?rows=10&q=sig%3a(54ba3e8fd3d5d8371f0e01c403085a0c)? this query returns no results on my indexes, but works for SOLR 1.4.0 and returns Java heap space java.lang.OutOfMemoryError: Java heap

Error loading class solr.CJKBigramFilterFactory

2012-11-14 Thread Frederico Azeiteiro
Hi, I've been testing some CJK tokenizers and I managed to get acceptable results using: <fieldType name="text_cjk" class="solr.TextField" positionIncrementGap="100"> <analyzer> <tokenizer class="solr.StandardTokenizerFactory"/> <!--

RE: Error loading class solr.CJKBigramFilterFactory

2012-11-14 Thread Frederico Azeiteiro
solr.CJKBigramFilterFactory On Wed, Nov 14, 2012 at 8:12 AM, Frederico Azeiteiro frederico.azeite...@cision.com wrote: To do some further testing I installed SOLR 3.5.0 using the default Jetty server. When I tried to start SOLR using the same schema I get: SEVERE

RE: Error loading class solr.CJKBigramFilterFactory

2012-11-15 Thread Frederico Azeiteiro
@lucene.apache.org Subject: Re: Error loading class solr.CJKBigramFilterFactory I'm sure. I added it to 3.6 ;) You must have something funky with your tomcat configuration, like an exploded war with different versions of jars or some other form of jar hell. On Wed, Nov 14, 2012 at 9:32 AM, Frederico Azeiteiro

RE: Search differences between solr 1.4.0 and 3.6.1

2012-11-28 Thread Frederico Azeiteiro
? The generateNumberParts=0 keeps the '12' from making it through the filter in 1.4 and 3.6 so I suspect you're not quite doing something the same way in both. Perhaps looking at index tokenization in one and query in the other? Best Erick On Mon, Nov 26, 2012 at 9:06 AM, Frederico Azeiteiro

RE: Search differences between solr 1.4.0 and 3.6.1

2012-11-28 Thread Frederico Azeiteiro
On Wed, Nov 28, 2012 at 4:34 AM, Frederico Azeiteiro frederico.azeite...@cision.com wrote: I just reloaded both indexes to make sure that all definitions are loaded. In the Analysis tool I can see differences, even though the fields are defined the same way: Query Analyser for 3.6.1

RE: Search differences between solr 1.4.0 and 3.6.1

2012-11-28 Thread Frederico Azeiteiro
Ok, I'll test that and let you know. Is there some test I can easily do to confirm that it was really a side-effect of the bug? Frederico Azeiteiro Developer -Original Message- From: Jack Krupansky [mailto:j...@basetechnology.com] Sent

RE: Search differences between solr 1.4.0 and 3.6.1

2012-11-28 Thread Frederico Azeiteiro
- From: Frederico Azeiteiro [mailto:frederico.azeite...@cision.com] Sent: Wednesday, 28 November 2012 17:19 To: solr-user@lucene.apache.org Subject: RE: Search differences between solr 1.4.0 and 3.6.1 Ok, I'll test that and let you know. Is there some test I can easily do to confirm

Recover index

2011-10-28 Thread Frederico Azeiteiro
Hello all, When moving a SOLR index to another instance I lost the files: segments.gen segments_xk I have the complete .cfs file. What are my options to recover the data? Any idea that I can test? Thank you. Frederico Azeiteiro

RE: wildcard and proximity searches

2010-10-05 Thread Frederico Azeiteiro
these functionalities? thanks Mark On Wed, Aug 4, 2010 at 2:24 PM, Frederico Azeiteiro frederico.azeite...@cision.com wrote: Thanks for your idea. At this point I'm logging each query time. My idea is to divide my queries into normal queries and heavy queries. I have some heavy queries with 1 minute

Strange query behaviour using splitOnCaseChange=1

2011-01-11 Thread Frederico Azeiteiro
searching for InterContinental the article is NOT returned. Can anyone explain why the last search didn't return the article? Thank you, Frederico Azeiteiro

Using MLT feature

2011-04-04 Thread Frederico Azeiteiro
? Is it a good approach? Is there a better way to perform this comparison? Thank you for your help. Best regards, Frederico Azeiteiro

RE: Using MLT feature

2011-04-04 Thread Frederico Azeiteiro
don't want to index exact.Similar though, I think has to be based off a document in the index. On Apr 4, 2011, at 5:16, Frederico Azeiteiro frederico.azeite...@cision.com wrote: Hi, I would like to hear your opinion about the MLT feature and if it's a good solution to what I need

RE: Using MLT feature

2011-04-04 Thread Frederico Azeiteiro
and not added to my index? Is it as simple as that? Does it work even if the medianame should be an exact match (not a similar match, as the headline and bodytext are)? Thank you for your help, Frederico Azeiteiro Developer -Original Message

RE: Using MLT feature

2011-04-04 Thread Frederico Azeiteiro
the article. 2. Before adding the doc, create a signature (using the same algorithm that SOLR uses) in my indexing app and then verify whether the signature exists before adding. Am I thinking the right way here? :) Thank you, Frederico -Original Message- From: Frederico Azeiteiro

RE: Using MLT feature

2011-04-05 Thread Frederico Azeiteiro
using the TextProfileSignature with success? Thank you, Frederico -Original Message- From: Markus Jelsma [mailto:markus.jel...@openindex.io] Sent: Monday, 4 April 2011 16:47 To: solr-user@lucene.apache.org Cc: Frederico Azeiteiro Subject: Re: Using MLT feature Hi again, I

RE: Using MLT feature

2011-04-05 Thread Frederico Azeiteiro
="minTokenLen">5</str> on the processor tag. Best regards, Frederico -Original Message- From: Markus Jelsma [mailto:markus.jel...@openindex.io] Sent: Tuesday, 5 April 2011 12:01 To: solr-user@lucene.apache.org Cc: Frederico Azeiteiro Subject: Re: Using MLT feature On Tuesday 05 April 2011

RE: Using MLT feature

2011-04-06 Thread Frederico Azeiteiro
...@openindex.io] Sent: Tuesday, 5 April 2011 15:20 To: solr-user@lucene.apache.org Cc: Frederico Azeiteiro Subject: Re: Using MLT feature If you check the code for TextProfileSignature [1] you'll notice the init method reading params. You can set those params as you did. Reading Javadoc

RE: Using MLT feature

2011-04-08 Thread Frederico Azeiteiro
. Ludovic. 2011/4/7 Frederico Azeiteiro [via Lucene] ml-node+2790579-1141723501-383...@n3.nabble.com Well at this point I'm more dedicated to the Deduplicate issue. Using a Min_token_len of 4 I'm getting nice comparison results. MLT returns a lot of similar docs that I don't consider similar

RE: Using MLT feature

2011-04-08 Thread Frederico Azeiteiro
10:11 To: solr-user@lucene.apache.org Subject: Re: Using MLT feature Couldn't you extend the TextProfileSignature and modify the TokenComparator class to use lexical order when tokens have the same frequency? Ludovic. 2011/4/8 Frederico Azeiteiro [via Lucene] ml-node+2794604-1683988626-383...@n3