Re: Lucene Index Cloud Replication

2019-07-11 Thread Anton Zenkov
; > > I'd like to share what I've got for 1 and 3, based on S3 and DynamoDB, > but > > I'd like to do it with interfaces that lend themselves to other > > implementations for blob and metadata storage. > > > > Is it worth opening a Jira issue for this? Is this

Re: is Document match Query

2018-12-17 Thread Anton Zenkov
https://lucene.apache.org/core/7_6_0/memory/org/apache/lucene/index/memory/MemoryIndex.html Anton On Mon, Dec 17, 2018 at 8:06 AM Valentin Popov wrote: > Hello. > > I need implement a feature, that answer for a question: is a Document > match a Query. > > Right no

Re: 500 millions document for loop.

2015-11-12 Thread Anton Zenkov
Which version of Lucene are you using? On Thu, Nov 12, 2015 at 11:39 AM, Valentin Popov wrote: > Hello everyone. > > We have ~10 indexes for 500M documents, each document has «archive date», > and «to» address, one of our task is calculate statistics of «to» for last >

sizes of non-fdt flies affected by compression settings

2015-10-01 Thread Anton Zenkov
) text docs. I guess I could go through the code myself to understand this but may be someone can shed some light on this. Thanks! Anton

Re: IndexWriter is not closing the FDs (deleted files)

2015-08-31 Thread Anton Zenkov
Are you sure you are not holding open readers somewhere? On Mon, Aug 31, 2015 at 7:46 PM, Marcio Napoli wrote: > Hey! :) > > It seems IndexWriter is not closing the descriptors of the removed files, > see the log below. > > Thanks, > Napoli > > [root@server01 log]# ls

Re: Lucene [3.0] Skip Matching Document.

2015-07-28 Thread Anton Zenkov
You can always throw an exception in the collector to stop the collection process. Anton On Tue, Jul 28, 2015 at 4:26 AM, Muhammad Ismail it.is.ism...@gmail.com wrote: Can we skip matching lucene document by using custom collector or some other way. Like I want to bring all document created

Re: CachingWrapperQuery performance

2015-06-17 Thread Anton Lyska
advice. Regards, Anton 2015-06-16 0:35 GMT+03:00 Adrien Grand jpou...@gmail.com: Hi Anton, Thanks for reporting this. It is indeed a bit surprising given that both classes work in a very similar way. Can you confirm that the response times that you are reporting both happen on Lucene 5.2

CachingWrapperQuery performance

2015-06-15 Thread Anton Lyska
Hi, I have performance issues with CachingWrapperQuery with lucene 5.2 and dont know how to solve it. Prehistory: I have search with different parameters, where some parameters are used more frequently then others. For these params I used filters(and cached them), and my search looked

scanning whole index stored fields while using best compression mode

2015-06-03 Thread Anton Zenkov
(); fieldsReader.visitDocument(i, visitor); visitor.getDocument(); } } I was wondering if there is better way of doing this and if there are plans to make access to the faster document loading through some API. Should I try to come up with a patch for this? Thanks! Anton

Re: scanning whole index stored fields while using best compression mode

2015-06-03 Thread Anton Zenkov
Reindexing. If I want to add new fields or change existing fields in the index I need to go through all documents of the index. On Wed, Jun 3, 2015 at 4:46 PM, Robert Muir rcm...@gmail.com wrote: On Wed, Jun 3, 2015 at 4:00 PM, Anton Zenkov azen...@crimsonhexagon.com wrote: for (int i = 0

question about spatial module in lucene 5

2015-03-30 Thread Anton Lyska
Hi I have tried to upgrade lucene from 4.x to 5.0 recently.I found out from documentation that FieldCache is removed from lucene 5, and I should use DocValues fields for sorting. I upgraded my sources, and everything works fine except spatial sorting by distance. When I looked into

lucene search sentence

2006-04-27 Thread Anton Feldmann
the hit. 3) How do I display the sentence before and after the sentence the hit is in? Cherrs anton - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Highlight

2006-04-26 Thread anton feldmann
Hi I wrote a program that make a pdf document to an Lucene document. The field ate contents, sentence, : How do i display the sentence the query String is in? and how do I Highlight the String? cheers anton feldmann package de.coli.seek.lucene; import java.io.File; import

How to serach in sentence and dispaly the whole sentence

2006-04-23 Thread anton feldmann
, how I have to change the IndexFile, whether that conforms with the Lucene Team. cheers anton feldmann - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Clusterization of searching

2006-04-11 Thread anton
What be way for clusterizations of searching? - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: Grouping results by choosen field

2006-03-21 Thread anton
Good grouping by domain realized in nutch... Nutch can serve good example of group on certain field. -Original Message- From: Java Programmer [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 21, 2006 3:56 PM To: java-user@lucene.apache.org Subject: Re: Grouping results by choosen field On

speed

2006-03-10 Thread Anton Potehin
for 1.644 sec. Second way return results fot 3.088 sec. Why first way faster then second? Can I accelerate it? And how? Can you offer any ways for get number of results? --CountCollector import org.apache.lucene.search.HitCollector; /** * @author Anton Potekhin * @date: 03.03.2006

RE: speed

2006-03-10 Thread anton
of queries are you issuing? Even 1.6 seconds is a LONG time for a search in most Lucene indexes. Erik On Mar 10, 2006, at 5:38 AM, Anton Potehin wrote: I try 2 way for get count results: 1) Query q = IndexSearcher is = int count = Is.search(q).length(); 2) Query q

RE: speed

2006-03-10 Thread anton
documents in your index and what kind of queries are you issuing? Even 1.6 seconds is a LONG time for a search in most Lucene indexes. Erik On Mar 10, 2006, at 5:38 AM, Anton Potehin wrote: I try 2 way for get count results: 1) Query q = IndexSearcher is = int count

RE: speed

2006-03-10 Thread anton
documents in your index and what kind of queries are you issuing? Even 1.6 seconds is a LONG time for a search in most Lucene indexes. Erik On Mar 10, 2006, at 5:38 AM, Anton Potehin wrote: I try 2 way for get count results: 1) Query q = IndexSearcher

RE: speed

2006-03-10 Thread anton
Each variant started in different process -Original Message- From: Eric Jain [mailto:[EMAIL PROTECTED] Sent: Friday, March 10, 2006 5:15 PM To: java-user@lucene.apache.org Subject: Re: speed Importance: High [EMAIL PROTECTED] wrote: When I make search I get count = 37. May be

RE: speed

2006-03-10 Thread anton
Sorry. I was repeated test. And now both way have same speed. -Original Message- From: Anton Potehin [mailto:[EMAIL PROTECTED] Sent: Friday, March 10, 2006 1:39 PM To: java-user@lucene.apache.org Subject: speed Importance: High I try 2 way for get count results: 1) Query q

RE: speed

2006-03-10 Thread anton
was repeated test. And now both way have same speed. -Original Message- From: Anton Potehin [mailto:[EMAIL PROTECTED] Sent: Friday, March 10, 2006 1:39 PM To: java-user@lucene.apache.org Subject: speed Importance: High I try 2 way for get count results: 1) Query q = IndexSearcher

RE: Get only count

2006-03-08 Thread anton
Signifies this that method collect can be called for document with score = 0 ? -Original Message- From: Yonik Seeley [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 07, 2006 6:35 PM To: java-user@lucene.apache.org Subject: Re: Get only count Importance: High On 3/7/06, [EMAIL PROTECTED]

sub search

2006-03-07 Thread Anton Potehin
Is it possible to make search among results of previous search?   For example: I made search: Searcher searcher =... Query query = ... Hits hits = hits = Searcher.search(query);   After it I want to not make a new search, I want to make search among found

RE: sub search

2006-03-07 Thread anton
); -Original Message- From: hu andy [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 07, 2006 12:40 PM To: java-user@lucene.apache.org Subject: Re: sub search Importance: High 2006/3/7, Anton Potehin [EMAIL PROTECTED]: Is it possible to make search among results of previous search

RE: Get only count

2006-03-07 Thread anton
Importance: High Anton Potehin wrote: Now I create new search for get number of results. For example: IndexSearcher is = ... Query q = ... numberOfResults = Is.search(q).length(); Can I accelerate this example ? And how ? Perhaps something like: class CountingHitCollector implements

RE: Get only count

2006-03-07 Thread anton
Importance: High Anton Potehin wrote: Now I create new search for get number of results. For example: IndexSearcher is = ... Query q = ... numberOfResults = Is.search(q).length(); Can I accelerate this example ? And how ? Perhaps something like: class CountingHitCollector implements

RE: Get only count

2006-03-07 Thread anton
Can have matching document score equals zero ? -Original Message- From: Yonik Seeley [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 07, 2006 6:20 PM To: java-user@lucene.apache.org Subject: Re: Get only count Importance: High On 3/7/06, [EMAIL PROTECTED] [EMAIL PROTECTED] wrote: While

RE: Get only count

2006-03-07 Thread anton
Can have matching document score equals zero ? -Original Message- From: Yonik Seeley [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 07, 2006 6:20 PM To: java-user@lucene.apache.org Subject: Re: Get only count Importance: High On 3/7/06, [EMAIL PROTECTED] [EMAIL PROTECTED] wrote: While

RE: search problem

2006-03-03 Thread anton
2) assuming what you want is not all brands and all prices, but just the prices and brands of the documents in you results, then i would strongly recommend doing your search twice -- once do get the Hits object you need and once using a HitCollector. Within the HitCollector, use FieldCaches to

RE: Indexing speed

2006-02-27 Thread anton
maxBufferedDocs parameters. You can also look for my article about indexing with Lucene (link in the Wiki), which includes code for playing with various parameters and explains what's going on, etc. Sorry, but where this link ?

RE: Indexing speed

2006-02-27 Thread anton
maxBufferedDocs parameters. You can also look for my article about indexing with Lucene (link in the Wiki), which includes code for playing with various parameters and explains what's going on, etc. Sorry, but where this link ? Where placed your article ? Please, give me url.

Duplicates recods in index

2006-02-08 Thread Anton Potehin
Is it possible to add records into lucene index using following algorithm: 1) create Document object 2) add 5 fields into Document (id, name, field1, field2, field3). All fields are stored, indexed and tokenized 3) check if the document with current id and name was added before 4) if yes

non-standard query

2006-01-19 Thread Anton Potehin
I've the following problem: I've a big number of documents indexed. Suppose that the search query contains 20 terms. It is necessary to find all documents which contains at least 5 terms from search query. Is it possible to implement? If yes, what problems may arise during the solving of

RE: What is stemming?

2005-11-20 Thread anton
About stemmers you can read on http://snowball.tartarus.org/ -Original Message- From: Koji Sekiguchi [mailto:[EMAIL PROTECTED] Sent: Monday, November 21, 2005 2:37 AM To: java-user@lucene.apache.org Subject: RE: What is stemming? Gekkokid, Daniel, Giovanni, Thank you very much for