Re: Best strategy migrate indexes

2022-11-08 Thread Pablo Vázquez Blázquez
wrote: > > > > > The process itself sounds like it should work (it's basically a > > > reindex so it should be safer than trying to migrate directly.) > > > > > > I would check that the Luke version matches the Lucene version - if > > >

Re: Best strategy migrate indexes

2022-11-07 Thread Pablo Vázquez Blázquez
tly.) > > I would check that the Luke version matches the Lucene version - if > the two match, it shouldn't be possible to get issues like this. > That is, the precise versions of Lucene each is using. > > TX > > > On Mon, 7 Nov 2022 at 22:09, Pablo Vázquez Blázquez

Re: Best strategy migrate indexes

2022-11-07 Thread Pablo Vázquez Blázquez
://github.com/hakanai/luceneupgrader>", I got: java -jar luceneupgrader-0.5.2-SNAPSHOT.jar info tests_small_index-7.x-migrator Lucene index version: 7 What am I doing wrong or misunderstanding? Thanks! On Wed, 2 Nov 2022 at 21:13, Pablo Vázquez Blázquez () wrote: > Hi, > > Luckil

Re: Best strategy migrate indexes

2022-11-02 Thread Pablo Vázquez Blázquez
> might want to perform. e.g., you might want to create points. Or > remove an index for a field. Or add an index for a field. > > TX > > > On Tue, 1 Nov 2022 at 02:57, Pablo Vázquez Blázquez > wrote: > > > > Hi all, > > > > Thank you all for your respo

Re: Best strategy migrate indexes

2022-10-31 Thread Pablo Vázquez Blázquez
d > > data since when Lucene changes version, > > even minor version, I always reindex. > > > > Best regards > > > > From: Gus Heck > > Sent: Saturday, October 29, 2022 2:17 PM > > To: [email protected] >

Best strategy migrate indexes

2022-10-28 Thread Pablo Vázquez Blázquez
e any tool to migrate the indices? Is it mandatory to reindex? In this case, how can I deal with this when I do not have the sources of documents that generated my current indices (I mean, I just have the indices themselves)? Thanks, -- Pablo Vázquez ([email protected])

Re: Lucene 5.3.0 GeoPointDistanceQuery

2015-09-21 Thread Pablo Mincz
Thanks for the response. Does anyone have an example of how to use a FunctionQuery in a CustomScoreQuery with the haversin function? Thanks! Regards, Pablo. On Mon, Aug 24, 2015 at 7:40 PM, Uwe Schindler wrote: > Hi, > > you can use the lucene-expressions module and use the distance funct
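For readers following this thread, the haversine formula the question refers to can be sketched in plain Java. This is not the lucene-expressions API and not a FunctionQuery; the class name HaversineSketch, the method distanceKm, the Earth-radius constant, and the sample coordinates are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical helper, not part of Lucene: plain haversine great-circle distance.
final class HaversineSketch {
    // Mean Earth radius in kilometres (assumed value; use whatever your application needs).
    static final double EARTH_RADIUS_KM = 6371.0088;

    // Distance in km between two (latitude, longitude) points given in degrees.
    static double distanceKm(double lat1, double lon1, double lat2, double lon2) {
        double dLat = Math.toRadians(lat2 - lat1);
        double dLon = Math.toRadians(lon2 - lon1);
        double a = Math.pow(Math.sin(dLat / 2), 2)
                + Math.cos(Math.toRadians(lat1)) * Math.cos(Math.toRadians(lat2))
                * Math.pow(Math.sin(dLon / 2), 2);
        // Clamp to 1.0 so rounding error can never push asin out of its domain.
        return 2 * EARTH_RADIUS_KM * Math.asin(Math.min(1.0, Math.sqrt(a)));
    }

    public static void main(String[] args) {
        // Sort a few sample points by distance to an origin (sample values only).
        double originLat = 40.4168, originLon = -3.7038;
        List<double[]> points = new ArrayList<>();
        points.add(new double[] {48.8566, 2.3522});
        points.add(new double[] {41.3874, 2.1686});
        points.add(new double[] {51.5072, -0.1276});
        points.sort(Comparator.comparingDouble(
                p -> distanceKm(originLat, originLon, p[0], p[1])));
        for (double[] p : points) {
            System.out.printf("%.4f, %.4f -> %.1f km%n",
                    p[0], p[1], distanceKm(originLat, originLon, p[0], p[1]));
        }
    }
}
```

Sorting hits this way happens outside the index, so it only suits result sets that are small enough to post-process in memory.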

Lucene 5.3.0 GeoPointDistanceQuery

2015-08-24 Thread Pablo Mincz
Hi, I'm using GeoPointDistanceQuery to filter the results near a certain coordinate, but now I want to sort them by distance to that coordinate. I did not find anything in the GitHub repository on how to do this. Does anyone know how to do this? Thanks for the help! Regards,

Re: Update DocValues and Query Time Join performance on DocValues

2013-03-25 Thread Pablo Guerrero
Thanks Gopal, I had the impression that it was easier to update DocValues than to update any document in general, but we will have to wait for this. Cheers, Pablo On Mon, Mar 25, 2013 at 3:37 AM, Gopal Patwa wrote: > Unfortunately updateable docvalues is not supported yet, but it fut

Update DocValues and Query Time Join performance on DocValues

2013-03-23 Thread Pablo Guerrero
Joins should be really fast using DocValues, as you have random access, but I would like to know the real cost of traversing a relationship: is it constant, logarithmic, or linear? Thanks in advance, Pablo

Re: Lucene reliability as primary store

2013-03-23 Thread Pablo Guerrero
Thanks Simon, I'll see if I can implement some kind of transaction log to avoid committing every change. Cheers, Pablo On Fri, Mar 22, 2013 at 8:13 PM, Simon Willnauer wrote: > On Fri, Mar 22, 2013 at 2:00 PM, Pablo Guerrero wrote: > > Hi all, > > > > I'm e

Lucene reliability as primary store

2013-03-22 Thread Pablo Guerrero
? Thank you in advance, Pablo

Re: HighFreqTerms patch

2011-02-10 Thread Pablo Mendes
6:16 PM, Michael McCandless > wrote: > > Hmm, which version of Lucene are you using? Newer versions let you > > specify a field... > > > > Mike > > > > On Wed, Feb 9, 2011 at 12:06 PM, Pablo Mendes > wrote: > >> Guys, > >> this is tiny and pro

HighFreqTerms patch

2011-02-09 Thread Pablo Mendes
Guys, this is tiny and probably not relevant. But I'll bet a beer that at least a dozen people had to dirtymod this class while they could have run it from command line. A 15 min time save that took 15 min to create. I guess it's a tie. Best, Pablo --- HighFreq

Re: Scaling Lucene to 1bln docs

2010-08-10 Thread Pablo Mendes
Shelly, Do you mind sharing with the list the final settings you used for your best results? Cheers, Pablo On Tue, Aug 10, 2010 at 3:49 PM, [email protected] wrote: > Hey Shelly, > If you want to get more info on lucene, I'd recommend you get a copy of > lucene in action

Modifying idf()?

2010-07-30 Thread Pablo Mendes
ere? Or in TermScorer? Can anybody shed some light on this issue? Thanks in advance, Pablo [1] http://lucene.apache.org/java/3_0_2/api/all/org/apache/lucene/search/DefaultSimilarity.html#idf%28int,%20int%29

IndexWriter.mergeDocument(Term term, Document doc)

2010-06-29 Thread Pablo Mendes
elf. Is the implementation of IndexWriter.updateDocument itself a good starting point? Does anybody have suggestions on how to go about it? Or is it there in some other shape or form? Or is it absolutely discouraged? Thanks, Pablo [1] http://lucene.apache.org/java/3_0_2/api/core/org/apache/lucene/document/class-use/Document.html

Which way would you recommend for successive-words similarity and scoring?

2010-05-03 Thread Pablo
orer and/or a Similarity, or can an analyser and a specific stemmer be sufficient? Thanks, [I first wrote to dev, which wasn't the right place.] Pablo - To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]

Re: Lucene problem:No document handler defined for the name "test"

2009-07-16 Thread Pablo Mosquera Saenz
; simon > > On Thu, Jul 16, 2009 at 12:20 PM, Pablo Mosquera > Saenz wrote: > > Hi, I have downloaded the springmodule for lucene, version 0.9 and tried > to > > test the sample > > > > I have used the lucene core library 2.4.1 > > > > The first problem I

Lucene problem:No document handler defined for the name "test"

2009-07-16 Thread Pablo Mosquera Saenz
Hi, I have downloaded the springmodule for lucene, version 0.9, and tried to test the sample. I have used the lucene core library 2.4.1. The first problem I found is that, with the initial configuration with SingleSearcherFactory, I get an error at startup because ther

Re: LUCENE-831 (complete cache overhaul) -> mem use

2008-11-14 Thread Pablo Saavedra
I have the same problem with cache and too many sorted fields, and had to implement a big workaround to be able to plug my own cache implementation in lucene 2.3.2. What I'd really like to see in the new cache implementation is easier pluggability and extension of the lucene classes, which is curre

Re: OutOfMemory Problems Lucene 2.4 / Tomcat

2008-11-04 Thread Pablo Saavedra
I hope that helps, if you find anything interesting do post it somewhere. I'm afraid I'm a little bit far away from New Orleans at the moment. Regards. 2008/11/4 Todd Benge <[EMAIL PROTECTED]> > Thanks Pablo. > > I'll be flying to New Orleans tomorrow for

Re: Storing special characters in Lucene

2008-08-21 Thread Juan Pablo Morales
It was, after all, an XML issue: the servlets creating the content that was being indexed were not sending UTF, but the XML declaration stated that the encoding WAS UTF, so it really was not a Lucene issue. Thanks for all the help. On Thu, Aug 21, 2008 at 6:18 PM, Juan Pablo Morales <[EM
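The failure mode described here (bytes written in one charset but declared as another) can be reproduced without Lucene or XML at all. The following is a minimal sketch using only the JDK; the class name EncodingMismatchSketch and the method roundTrip are hypothetical, and ISO-8859-1 is an assumption about what the servlets were actually sending, since the thread does not say.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo, not taken from the thread: encode text with one charset and
// decode it with another, as happens when a declared encoding does not match the bytes.
final class EncodingMismatchSketch {
    // Encode 'text' with the charset actually used by the producer,
    // then decode it with the charset the consumer was told to expect.
    static String roundTrip(String text, Charset written, Charset declared) {
        return new String(text.getBytes(written), declared);
    }

    public static void main(String[] args) {
        String original = "ni\u00F1os"; // "niños", escaped to keep this source file ASCII-only

        // Matching charsets: the text survives unchanged.
        String ok = roundTrip(original, StandardCharsets.UTF_8, StandardCharsets.UTF_8);
        System.out.println("matching charsets preserved the text: " + ok.equals(original));

        // Mismatch: Latin-1 bytes decoded as UTF-8 are malformed,
        // so the decoder substitutes U+FFFD (the replacement character).
        String broken = roundTrip(original, StandardCharsets.ISO_8859_1, StandardCharsets.UTF_8);
        System.out.println("mismatch produced U+FFFD: " + broken.contains("\uFFFD"));
    }
}
```

Once the replacement character has been stored, the original byte is gone, which is why the fix has to happen where the content is produced rather than in the index.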

Re: Storing special characters in Lucene

2008-08-21 Thread Juan Pablo Morales
>Document theDoc = hits.doc(0); >String nombre = theDoc.get("name"); >System.out.println("Nombre: " + nombre); > } > } > > > When I run this in IntelliJ, I get: > > Query: name:niñ > Nombre: niños > > Process finished with exit code 0 > > > Are you by chance indexing XML? Indirectly, yes > > -- Juan Pablo Morales Ingenian Software ltda

Re: Storing special characters in Lucene

2008-08-21 Thread Juan Pablo Morales
On Thu, Aug 21, 2008 at 12:47 PM, Steven A Rowe <[EMAIL PROTECTED]> wrote: > Hola Juan, Hi Steve > > > On 08/21/2008 at 1:16 PM, Juan Pablo Morales wrote: > > I have an index in Spanish and I use Snowball to stem and > > analyze and it works perfectly. However,

Storing special characters in Lucene

2008-08-21 Thread Juan Pablo Morales
" Looking at the index with Luke it shows me "ni�os" but when I want to see the full text (by right clicking) it shows me ni�os. I know Lucene is supposed to store fields in UTF8, but then, how can I make sure I sotre something and get it back just as it was, including special chara

"No tvx file" error

2008-05-29 Thread Pablo B.
not index those files, so later, in my own code, when I call termFreqVector.getTerms() for the failing file, it results in a null pointer exception, which does not make sense because the file has text in it! Can anyone help? Thank you in advance, Pablo. -- View this message in cont

question about IndexWriter.maxFieldLength

2005-05-17 Thread Pablo Gomes Ludermir
But which terms are those? The first 10,000 to be indexed? Or is there any feature selection approach? Like, the most frequent 10,000 terms are indexed and the rest are discarded? Does anyone know? If this is not the case, is it possible to implement? Regards Pablo -- Pablo Gomes Luder

what is QueryFilter.bits ???

2005-05-05 Thread Pablo Gomes Ludermir
idea? Regards, Pablo -- Pablo Gomes Ludermir [EMAIL PROTECTED]

Re: categorized search

2005-05-05 Thread Pablo Gomes Ludermir
"search engines" and "open source software", and it appears in the documents 1, 2 and 3 in the category "search engines" and in documents 4 and 7 in the categoy "open source". I could not get it to work yet (maybe because of my lack of experience with Lucene). Some

indexing synonyms / reducing the index size

2005-05-04 Thread Pablo Gomes Ludermir
y own weighting scheme, and it is a quite costly algorithm, so the fewer terms I have the better it performs. Regards, Pablo -- Pablo Gomes Ludermir [EMAIL PROTECTED]

Fwd: getting document metadata

2005-05-03 Thread Pablo Gomes Ludermir
Forgot to send to the list. -- Forwarded message -- From: Pablo Gomes Ludermir <[EMAIL PROTECTED]> Date: May 3, 2005 9:07 PM Subject: Re: getting document metadata To: Luke Shannon <[EMAIL PROTECTED]> I actually would like to have a single field on the Document o

getting document metadata

2005-05-03 Thread Pablo Gomes Ludermir
the contents of the index, and I would like to have only one field: the body contents). Is there another way to get the document's path if we don't index it? Or just with magic? :) Regards, -- Pablo Gomes Ludermir [EMAIL

Re: skip document header while indexing

2005-04-29 Thread Pablo Gomes Ludermir
Could you give me some pointers (example or website) to how I could do that? On 4/29/05, Erik Hatcher <[EMAIL PROTECTED]> wrote: > > On Apr 29, 2005, at 7:50 AM, Pablo Gomes Ludermir wrote: > > > Hello all, > > > > Is it possible to skip the first "xx"

skip document header while indexing

2005-04-29 Thread Pablo Gomes Ludermir
m is = new FileInputStream(file); Reader reader = new BufferedReader(new InputStreamReader(is)); doc.add(Field.Text(PATH_FIELD, artifactModel)); doc.add(Field.Text(CONTENTS_FIELD, reader, true)); Regards, Pablo -- Pablo Gomes Ludermir [EMAIL PROTECTED]

categorized search

2005-04-24 Thread Pablo Gomes Ludermir
Hi all, I have indexed a field that describes the "category" of the document. Thus, I want to know how many categories have a specific term. Could someone help me to get this with good performance? Regards, Pablo -- Pablo Gomes Ludermir [EMAIL

getting the number of occurrences within a document

2005-04-14 Thread Pablo Gomes Ludermir
Hello all, I would like to get the following information from the index: 1. Given a term, how many times the term occurs in each document. Something like a triple: <Term, Doc1, Freq>, ... Is it possible to do that? Regards, Pablo -- Pablo Gomes Ludermir [EMAIL PRO
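The shape of the requested result (one <Term, Doc, Freq> entry per matching document) can be illustrated with a toy in-memory sketch. This is deliberately not Lucene's API: the class TermFreqSketch, the method termFrequencies, the simple lowercase-and-split tokenization, and the sample documents are all assumptions for illustration. In the Lucene releases of that period, I believe the same information was exposed through IndexReader.termDocs(Term) and the TermDocs methods doc() and freq(), but check the Javadoc for your version.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Toy illustration only: counts occurrences of a term per document in memory.
final class TermFreqSketch {
    // Returns docId -> number of occurrences of 'term'; documents without the term are omitted.
    static Map<Integer, Integer> termFrequencies(String term, List<String> docs) {
        Map<Integer, Integer> freqs = new LinkedHashMap<>();
        for (int docId = 0; docId < docs.size(); docId++) {
            int count = 0;
            // Assumed tokenization: lowercase, split on runs of non-word characters.
            for (String token : docs.get(docId).toLowerCase(Locale.ROOT).split("\\W+")) {
                if (token.equals(term)) {
                    count++;
                }
            }
            if (count > 0) {
                freqs.put(docId, count);
            }
        }
        return freqs;
    }

    public static void main(String[] args) {
        List<String> docs = Arrays.asList(
                "Lucene is a search library",
                "search, search and more search",
                "nothing relevant here");
        // Print one <Term, Doc, Freq> triple per matching document.
        for (Map.Entry<Integer, Integer> e : termFrequencies("search", docs).entrySet()) {
            System.out.println("<search, Doc" + e.getKey() + ", " + e.getValue() + ">");
        }
    }
}
```

A real index stores these postings precomputed, so reading them does not require rescanning document text the way this sketch does.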