Re: A problem of searching word "3D"

2010-01-26 Thread digy digy
while ( token!=null )* *{* *System.Diagnostics.Debug.WriteLine(token.Term());* *token = ts.Next();* *}* DIGY On Wed, Jan 27, 2010 at 3:08 AM, Li Bing wrote: > According to my experiences, "3D" should not be indexed in

Re: Lucene.net in Medium Trust

2010-02-16 Thread digy digy
The reason may be the "FlushFileBuffers" in "SupportClass.FileSupport.Sync". Can you change the code as below and try again? public static void Sync(System.IO.FileStream fileStream) { if (fileStream == null) throw new ArgumentNullException("fileStream"); fileS

Re: Lucene.NET Community Status

2010-11-02 Thread digy digy
See previous discussions about IKVM http://comments.gmane.org/gmane.comp.jakarta.lucene.net.user/806 http://comments.gmane.org/gmane.comp.jakarta.lucene.net.user/802 DIGY On Tue, Nov 2, 2010 at 4:58 PM, Hans Merkl wrote: > Has anybody tried using Lucene with IKVM.NET? I haven't

Re: How Can I Deal with Special Chars like #, /,

2010-11-11 Thread digy digy
It seems that the problem is not related with parsing or escaping the string, rather choosing an inappropriate analyzer for your needs. You can not search what you haven't indexed. You can use below code to see what is indexed with different type of analyzers. DIGY Analyzer analyzer

Re: Is comma a special character in Lucene query parsing.

2010-11-23 Thread digy digy
Again same answer. http://mail-archives.apache.org/mod_mbox/lucene-lucene-net-user/201011.mbox/%3caanlktim6kyuzhwb8p7g=hvqx6dy1fkarchro0hyw+...@mail.gmail.com%3e Use the code and see what StandardAnalyzer does with your input string. DIGY On Tue, Nov 23, 2010 at 11:34 AM, Umer Khalid Qureshi

Re: partial update over lucene index

2011-02-03 Thread digy digy
What do you mean by "partially updating" the index. This seems to be a XY-problem (http://www.perlmonks.org/index.pl?node_id=542341) DIGY On Thu, Feb 3, 2011 at 2:15 PM, K a r n a v wrote: > I want to do partial update over lucene index (rather than doing complete > indexing eve

Re: How to Merge Separate Indexes (If Possible?)

2011-02-04 Thread digy digy
Yes. Use IndexWriter's AddIndexes method. DIGY On Fri, Feb 4, 2011 at 1:28 PM, Nicholas Petersen wrote: > Is it possible to merge a number of indexes that were generated completely > separate from each other? I have a case use where this could really be > useful (where indexes h

Re: [Lucene.Net] lucene.net exception

2011-03-15 Thread digy digy
There is nothing related with Lucene.Net in the stack trace. It would be better to post your question to NHibernate maling list. DIGY On Tue, Mar 15, 2011 at 10:13 AM, Christian Setzkorn wrote: > Hi, > > > > I have a method like this: > > > > public IList Query(stri

Re: [Lucene.Net] Lucene.Net writing/reading synchronization

2011-03-25 Thread digy digy
Yes, for all. Lucene.Net is thread safe and you make indexing and searching at the same time. Just refer to Java documents to learn about indexwriter.GetReader, indexreader.Reopen etc. DIGY On Fri, Mar 25, 2011 at 3:48 PM, Edward83 wrote: > Hello!!! > > > Please tell me: > >

Re: [Lucene.Net] English Language Concatenated Word Tokenizer

2011-04-03 Thread digy digy
"www.worldbestwebsites.com", you can search "world", "best", "web", "website", "bestweb" or "stwebsi" :) etc. DIGY On Mon, Apr 4, 2011 at 7:15 AM, Thomas Rankin wrote: > Hey everyone, I'm trying to figure out the best way to get lu

Re: [Lucene.Net] Index and search for a phrase including plus (+) sign.

2011-04-05 Thread digy digy
Working with "+" can be problematic and you may need to write your own custom analyzer. You can use code below to see what you are indexing and searching. DIGY * string Test(string docTextToIndex, string textToSearch)* *{* * * *StringBuilder sb = new Str

Re: [Lucene.Net] Lucene .Net search with Standard Analyzer does not work properly with keywords containing - (hyphen)

2011-05-12 Thread digy digy
rves > the > right to monitor and review the content of all messages sent to or from > this e-mail > address. Messages sent to or from this e-mail address may be stored on the > Infosys e-mail system. > ***INFOSYS End of Disclaimer INFOSYS*** > DIGY

Re: [Lucene.Net] Faceted Search in Lucene.NET without using the Bits-method

2011-05-20 Thread digy digy
Take a look at OpenBitSetDISI in Lucene.Net.Util DIGY On Fri, May 20, 2011 at 10:20 AM, lars aslin wrote: > Hi > Im building a search function with Lucene.NET where I want faceted > searches to be supported. To accomplish this in an efficient way I > thought a good idea would

Re: [Lucene.Net] Faceted Search in Lucene.NET without using the Bits-method

2011-05-20 Thread digy digy
I prepared a sample. Maybe, this can help http://people.apache.org/~digy/FacetedSearch.cs DIGY On Fri, May 20, 2011 at 10:43 AM, K a r n a v wrote: > I've tried with OpenBitSetDISI...its not a performance effective one > but > able generate results with BitArr

Re: [Lucene.Net] Faceted Search in Lucene.NET without using the Bits-method

2011-05-20 Thread digy digy
List categories = new List(); TermEnum te = reader.Terms(new Term("cat","")); categories.Add(te.Term().Text()); while (te.Next()) { if (te.Term().Field() != "cat") break; categories.Add(te.Term().Text());

Re: [Lucene.Net] Faceted Search in Lucene.NET without using the Bits-method

2011-05-20 Thread digy digy
"not analyzed", you can also try to use TermQuery while searching. DIGY On Fri, May 20, 2011 at 3:36 PM, K a r n a v wrote: > How can we handle special characters like .;: / \?"\<>~`*!@#$%^&-_+={}[]|( > When I search with hyderabad-india...internally it is removing

Re: [Lucene.Net] Faceted Search in Lucene.NET without using the Bits-method

2011-05-23 Thread digy digy
See https://issues.apache.org/jira/browse/LUCENENET-415 DIGY On Mon, May 23, 2011 at 1:16 PM, Marco Dissel wrote: > Can you share your implementation (if it's generic) > > Thanks > Op 23 mei 2011 11:47 schreef "lars aslin" het > volgende: > > Thanks Digg

Re: [Lucene.Net] Faceted Search in Lucene.NET without using the Bits-method

2011-05-25 Thread digy digy
I still don't understand, since these numbers can easily be obtained from current implementation. Just count HitsPerGroup.Name[i]' s. DIGY On Thu, May 26, 2011 at 12:21 AM, Marco Dissel wrote: > Let's take the sample data from http://goo.gl/UQubj > >

Re: [Lucene.Net] Faceted Search in Lucene.NET without using the Bits-method

2011-05-26 Thread digy digy
Indeed, I seems to be slow. With 30 facets, ~300K docs and 1.4GB index I get better results.(first query: ~100-400 ms, if I repeat it then <10ms) Do you use the code from https://svn.apache.org/repos/asf/incubator/lucene.net/branches/Lucene.Net_2_9_4g/src/contrib/SimpleFacetedSearch/ ? DIGY

Re: [Lucene.Net] search within a list of unique ID's

2011-05-27 Thread digy digy
Creation of SimplefacetedSearch is slow. Therefore it should only be created when a new reader is opened (or reopened). DIGY On Fri, May 27, 2011 at 10:02 AM, Marco Dissel wrote: > Our index contains documents with an unique ID (long number) corresponding > to a record in SQL database.

Re: [Lucene.Net] search within a list of unique ID's

2011-05-27 Thread digy digy
When term count in query increases, search time increases also. Therefore, if possible, use precomputed BitSets and "AND" it with user's query. Another costly operation is the fectching data from the index. Limiting the result count to smaller numbers may help too. DIGY On Fri,

Re: [Lucene.Net] Faceted Search in Lucene.NET without using the Bits-method

2011-05-27 Thread digy digy
Suppose every field contains 20 unique terms, then you have 64,000,000 combinations to compute in constructor. (I think, SFS finds only 949 of them usable(BitSets with >0 cardinality)). DIGY On Fri, May 27, 2011 at 12:14 PM, Marco Dissel wrote: > If I increase my number of facetfields to

Re: [Lucene.Net] Faceted Search in Lucene.NET without using the Bits-method

2011-05-27 Thread digy digy
I create the SFS only once and use two fields for faceting(added one more field after my eMail with 30 & 11 unique terms). That is all. DIGY On Fri, May 27, 2011 at 2:00 PM, Marco Dissel wrote: > Correct, but why do you get much better performance, according to a > previous > ema

Re: [Lucene.Net] Announcing Lucene.Net.Contrib

2011-06-16 Thread digy digy
ons" of faceted search in contrib. DIGY On Thu, Jun 16, 2011 at 4:38 PM, Itamar Syn-Hershko wrote: > Hi all, > > I created a new github repository where I'm dumping all extensions I write > or port from Java myself. The intention is having a community based > repositor

Re: [Lucene.Net] Memory leaks

2011-08-23 Thread digy digy
Hi Itamar, Can you send a simple self-contained test case showing the bug? DIGY On Tue, Aug 23, 2011 at 2:41 PM, Itamar Syn-Hershko wrote: > Hi, > > We recently discovered another memory leak within Lucene.NET, that is > related to LUCENENET-358 (CloseableThreadLocal me

Re: [Lucene.Net] Memory leaks

2011-08-26 Thread digy digy
> Does this include your changes to ClosableThreadLocal? No, the old implementation. DIGY On Fri, Aug 26, 2011 at 1:45 PM, Ayende Rahien wrote: > Does this include your changes to ClosableThreadLocal? > > Also, you probably won't see a memory leak in this fashion. We only s

Re: [Lucene.Net] How to index/search a file name

2011-09-06 Thread digy digy
That may help UnaccentedWordAnalyzer @ https://svn.apache.org/repos/asf/incubator/lucene.net/trunk/src/contrib/Core/Analysis/Ext/Analysis.Ext.cs DIGY On Tue, Sep 6, 2011 at 12:31 PM, Floyd Wu wrote: > Hi everyone, > > I have a question that annoying me many times. my situation is th

Re: [Lucene.Net] How to index/search a file name

2011-09-06 Thread digy digy
- Just copy and paste to your project :) and use it instead of StandardAnalyzer . - Yes, it is from 2.9.4. DIGY On Tue, Sep 6, 2011 at 12:47 PM, Floyd Wu wrote: > Hi Digy, > > Thank you. But how to apply this to my current project? Is this compitable > with lucene.net-2.9.

Re: [Lucene.Net] Indexing speed result of RAMDirectory MMapDirectory FSDirectory

2011-09-15 Thread digy digy
There must be something wrong. I can index 18,000,000 docs in ~20 mins :) DIGY On Thu, Sep 15, 2011 at 12:05 PM, Floyd Wu wrote: > Hi there, > > I'm doing a little test to find out what is the better implementation > for my situation. > > I'm indexing 18,000 docs and

RE: byteBuf in NIOFSDirectory.cs is never set

2009-12-25 Thread Digy
lmost ready. "NIOFSDirectory" and "MmapDirectory" may be problematic(as you hit one of them). If you avoid using them, I don't think that you will face with a problem. DIGY -Original Message- From: nima dilmaghani [mailto:nim...@gmail.com] Sent: Friday, Decemb

RE: No Stopwords

2009-12-25 Thread Digy
I don't think that there will be a measurable performance difference in using HashTable or string[]. Just to remind, StandardAnalyzer constructors that accept string[] for stopwords are deprecated and will be removed with 3.0. DIGY -Original Message- From: T. R. Halvorson [mai

RE: Possible memory leak in Lucene.NET 2.4?

2010-01-05 Thread Digy
have an open IndexReader , IndexWriter is not going to delete the necessary files till no one uses it. Maybe, "IndexReader.IsCurrent" can solve your problem. DIGY -Original Message- From: Jeff Pennal [mailto:je...@openroad.ca] Sent: Tuesday, January 05, 2010 10:42

RE: Possible memory leak in Lucene.NET 2.4?

2010-01-05 Thread Digy
As Michael stated, I prefer also not hosting "indexing and searching sevices" in IIS. There are many alternatives such as WCF, Remoting etc. With a separate service for Lucene, you can control anything you want. DIGY -Original Message- From: Michael Garski [mailto:mgar...@myspa

RE: Possible memory leak in Lucene.NET 2.4?

2010-01-07 Thread Digy
"(instead it is subclass of MultiTermQueryWrapperFilter). DIGY -Original Message- From: Simone Chiaretta [mailto:simone.chiare...@gmail.com] Sent: Thursday, January 07, 2010 12:39 PM To: lucene-net-user Subject: Re: Possible memory leak in Lucene.NET 2.4? I'm having a look at the

RE: at least one doc

2010-01-14 Thread Digy
The formal way is throwing exception in the HitCollector.Collect to stop iteration. DIGY -Original Message- From: Artem Chereisky [mailto:a.cherei...@gmail.com] Sent: Thursday, January 14, 2010 1:16 AM To: lucene-net-u...@incubator.apache.org; lucene-net-user@lucene.apache.org Subject

RE: at least one doc

2010-01-14 Thread Digy
I also have thought many times why HitCollector.Collect doesn't return a boolean value indicating no more results are needed. Maybe, a small performance increment. DIGY -Original Message- From: Nicholas Paldino [.NET/C# MVP] [mailto:casper...@caspershouse.com] Sent: Thursday, Janua

RE: at least one doc

2010-01-14 Thread Digy
uot;. The problem arises if you have a millions of results: You have to iterate all over the results although you want to have top MaxDox# docs. The real performance problem is, which one is better: an empty iteration of millions of times or an exception (say at 100 or 500)? DIGY

RE: How to get the "greater" term/word in a single-word numeric field

2010-01-18 Thread Digy
You can use TermEnum to find the min. value since terms are stored *sorted* in the index. But finding max. may not be so easy. Storing these values in a control doc(such as doc{ fieldMinValue:m ; fieldMaxValue:n} ) or at an external location (while indexing) may be a solution. DIGY

RE: How to get the "greater" term/word in a single-word numeric field

2010-01-21 Thread Digy
Hi Luis, Why do you post the same question twice? Please read previous answers. Here is a code that may give you a hint private void Form2_Load(object sender, EventArgs e) { IndexInt(); FindMinAndMax(); } L

RE: How to get the "greater" term/word in a single-word numeric field

2010-01-21 Thread Digy
Is this a joke? http://mail-archives.apache.org/mod_mbox/lucene-lucene-net-user/201001.mbox/date?1 DIGY -Original Message- From: Luis Fco. Ramriez Daza Glez [mailto:luis.francisco@gmail.com] Sent: Thursday, January 21, 2010 8:29 PM To: lucene-net-user@lucene.apache.org Subject: How

RE: An Index File Cannot Be Found

2010-01-23 Thread Digy
Which version of Lucene.Net are you using? Is it possible that you're not closing the IndexWriter before opening a new one and two open IndexWriters accidentally share the same index? (such as using IndexWriter.Unlock etc.). DIGY -Original Message- From: Li Bing [mailto:lbl...@gmai

RE: An Index File Cannot Be Found

2010-01-25 Thread Digy
2.0 is really old. I am pretty sure you won't get this error with a newer(2.3.2, 2.4.0, 2.9.1) version. https://svn.apache.org/repos/asf/lucene/lucene.net/ DIGY -Original Message- From: Li Bing [mailto:lbl...@gmail.com] Sent: Monday, January 25, 2010 2:11 AM To: lucene-net

RE: Bug on query serialization between 2.3 and current trunk ?

2010-01-27 Thread Digy
Nice Bug:-) Can you open a JIRA issue for that? DIGY -Original Message- From: Moray McConnachie [mailto:mmcco...@oxford-analytica.com] Sent: Wednesday, January 27, 2010 7:19 PM To: lucene-net-u...@incubator.apache.org Subject: Bug on query serialization between 2.3 and current trunk

RE: Unauthorized Exception Error

2010-01-31 Thread Digy
ecessary permissions). DIGY -Original Message- From: Tim Haughton [mailto:timhaugh...@gmail.com] Sent: Sunday, January 31, 2010 1:31 PM To: lucene-net-u...@incubator.apache.org Subject: Re: Unauthorized Exception Error Some more relevant info: Version: Dev version Exception Date: 30 January

RE: Attempt fix of memory leak at FieldCacheImpl of Lucene.Net 2.1.0

2010-02-02 Thread Digy
Have you read http://issues.apache.org/jira/browse/LUCENENET-106 ? DIGY From: Billy Ho [mailto:bill...@jobsdb.com] Sent: Tuesday, February 02, 2010 5:17 PM To: lucene-net-user@lucene.apache.org Subject: Attempt fix of memory leak at FieldCacheImpl of Lucene.Net 2.1.0 Hi, We have

RE: Attempt fix of memory leak at FieldCacheImpl of Lucene.Net 2.1.0

2010-02-02 Thread Digy
> DIGY's patch to provide a functional equivalent of Java's WeakHashTable is > currently in the trunk, and is in all versions after the 2.3.1 tag in SVN. It was a good collaborative work of many people. But the Oscar goes to Eyal Post for the final version of WeakHashTable. DIG

RE: Lucene.net in Medium Trust

2010-02-10 Thread Digy
Hi Simone, Can you try to open/create a file in that directory within your app(without using Lucene.NET) to narrow down the source of the problem. DIGY -Original Message- From: Simone Chiaretta [mailto:simone.chiare...@gmail.com] Sent: Wednesday, February 10, 2010 9:46 PM To: lucene

RE: text as query of search

2010-02-23 Thread Digy
You can try Similarity.Net. it is available at https://svn.apache.org/repos/asf/lucene/lucene.net/trunk/C#/contrib/Similari ty.Net DIGY -Original Message- From: Giovanni Caputo [mailto:giovannicaput...@gmail.com] Sent: Tuesday, February 23, 2010 4:54 PM To: lucene-net-user

RE: Series of post about Lucene.net (and how Lucene.net was implemented in Subtext)

2010-02-26 Thread Digy
We don't still have any feedback about your medium-trust problem. Did it work? DIGY -Original Message- From: Simone Chiaretta [mailto:simone.chiare...@gmail.com] Sent: Friday, February 26, 2010 10:13 PM To: lucene-net-user Subject: Series of post about Lucene.net (and how Lucene.ne

RE: Lucene index file container

2010-02-26 Thread Digy
rmance). Second, Lucene.Net does not have a built-in support for zipped index. So, you have to develop your own "ZippedDirectory" class. DIGY -Original Message- From: Andrew Schuler [mailto:andrew.schu...@gmail.com] Sent: Friday, February 26, 2010 11:48 PM To: lucene-net-user@luce

RE: Lucene index file container

2010-02-26 Thread Digy
Hi Nick, Suppose that I have 1G file with a compressed size of 100M and I want to read just a 4K block from offset 900M. Considering the SharpZip Lib,DotNetZip or similars , would be the cost more CPU and less IO? Or more CPU more IO? DIGY -Original Message- From: Nicholas Paldino [.NET

RE: Lucene index file container

2010-02-27 Thread Digy
ek request is made, you have to unzip whole file at the beginning, and use that in your app. But this is not what I understand from Andrew's statement "Does any one have experience running an index directly out of zip file?" DIGY -Original Message- From: Nichola

RE: Lucene index file container

2010-02-27 Thread Digy
r own storage on disk. DIGY -Original Message- From: Andrew Schuler [mailto:andrew.schu...@gmail.com] Sent: Friday, February 26, 2010 9:47 PM To: lucene-net-user@lucene.apache.org Subject: Re: Lucene index file container Yes, that is do-able. I was just thinking it would be cleaner to

RE: Commit / Optimize taking too long.

2010-03-03 Thread Digy
As you said, It seems like an open IndexReader blocking the IndexWriter. Can you try to open IndexReader as readonly or better use IndexWriter.GetIndexReader. DIGY. -Original Message- From: Luis Fco. Ramriez Daza Glez [mailto:luisfc...@yahoo.com.mx] Sent: Wednesday, March 03, 2010 6

RE: How to do prefix/phrase matching with term-length-sensitive scoring?

2010-03-03 Thread Digy
> Architectural changes mean that this query generation code no longer has access to an IndexReader and can only use a Searcher Why? IndexSearcher has a method "GetIndexReader" (or possibly, I couldn't understand the problem) DIGY -Original Message- From: Alex Davidson

RE: Commit / Optimize taking too long.

2010-03-03 Thread Digy
and pass to SetInfoStream. DIGY -Original Message- From: Luis Fco. Ramriez Daza Glez [mailto:luisfc...@yahoo.com.mx] Sent: Wednesday, March 03, 2010 6:05 AM To: lucene-net-user@lucene.apache.org Subject: Commit / Optimize taking too long. Hi all We have a problem with some of our prog

RE: Commit / Optimize taking too long.

2010-03-03 Thread Digy
No. :-( DIGY -Original Message- From: Luis Fco. Ramriez Daza Glez [mailto:luisfc...@yahoo.com.mx] Sent: Wednesday, March 03, 2010 9:48 PM To: lucene-net-user@lucene.apache.org Subject: RE: Commit / Optimize taking too long. Hi Digy Thanks, that’s what I had t do. Also I don’t think

RE: Commit / Optimize taking too long.

2010-03-03 Thread Digy
=false" to be able to see the newly added docs, indexWriter can commit the changes. DIGY -Original Message- From: Luis Fco Ramirez Daza Glez [mailto:luis.francisco@gmail.com] Sent: Wednesday, March 03, 2010 10:08 PM To: lucene-net-user@lucene.apache.org Subject: RE: Commit

RE: Setting up lucene.net

2010-03-05 Thread Digy
Is it possible that you are VB.Net user and didn't include C# while installing Visual Studio? DIGY -Original Message- From: rafia qutab [mailto:rafiama...@gmail.com] Sent: Friday, March 05, 2010 7:03 PM To: lucene-net-u...@incubator.apache.org Subject: Setting up lucene.net Hi,

RE: Working with Query classes

2010-03-10 Thread Digy
Maybe that can help; queryParser.SetMultiTermRewriteMethod(MultiTermQuery.SCORING_BOOLEAN_QUERY_R EWRITE); query=queryParser.Parse(...); query.Rewrite(indexReader).ToString(); DIGY -Original Message- From: Matt Honeycutt [mailto:mbhoneyc...@gmail.com] Sent: Wednesday, March 10, 2010 3

RE: 2.9 numeric fields and index size

2010-03-12 Thread Digy
I've always found it easier to index the data as string whenever possible. Same as the numeric data; I mostly pad them with '0's and index as text. At the end, Lucene is a full-"text" search engine. DIGY. -Original Message- From: Michael Garski [mailto:mgar.

RE: hit Highlighting and Mutli Term Search

2010-03-13 Thread Digy
:Ford")), While iterating through the results pass "query for highlight" to FVH. DIGY -Original Message- From: Mike Prager [mailto:m...@tripartum.com] Sent: Saturday, March 13, 2010 1:04 AM To: lucene-net-user@lucene.apache.org Subject: hit Highlighting and Mutli Term Search H

RE: Query question

2010-03-15 Thread Digy
Concatenating name and surname (and then lowercasing) with a custom analyzer? like johnlennon ringostar paulmccartney georgeharrison DIGY -Original Message- From: Michael Garski [mailto:mgar...@myspace-inc.com] Sent: Monday, March 15, 2010 6:09 PM To: lucene-net-user@lucene.apache.org

RE: IndexReader.LastModified giving invalid date?

2010-03-16 Thread Digy
issue for this. DIGY -Original Message- From: Paulo Ferreira [mailto:pfferre...@gmail.com] Sent: Tuesday, March 16, 2010 6:22 AM To: lucene-net-user@lucene.apache.org Subject: IndexReader.LastModified giving invalid date? Hi there, I am moving an app over from using Lucene 2.0, to Lucene

RE: Spellchecking

2010-03-17 Thread Digy
1- Please subscribe to mailing list to post messages. 2- You are right, LuceneDictionary should be fixed as you suggested. 3- Please go to http://issues.apache.org/jira/browse/LUCENENET, signup and create a new JIRA issue describing this bug. Thanks, DIGY -Original Message- From

RE: Are there any analyzers for HTML or RTF files?

2010-03-30 Thread Digy
No. But you can use IFilter interface to convert any registered app's format to text. For ex, if you have MS Office installed, than this means, you already have word2text, excel2text etc. convertors. See http://www.codeproject.com/KB/cs/IFilter.aspx DIGY -Original Message-

RE: Are there any analyzers for HTML or RTF files?

2010-03-31 Thread Digy
want deal with all sort of file formats and write some highlighter code for each of them) to highlight the original document. DIGY -Original Message- From: Hans Merkl [mailto:h...@hmerkl.com] Sent: Wednesday, March 31, 2010 4:03 AM To: lucene-net-user Subject: Re: Are there any analyzer

RE: thread safety

2010-04-02 Thread Digy
Just use a single IndexReader in all threads. No need for lock or other types of sync. primitives. (true since > 2.3.2 but better use 2.9.2 in trunk) DIGY -Original Message- From: emagdnim paradox [mailto:bwsithspaw...@yahoo.com] Sent: Friday, April 02, 2010 8:54 PM To: lucene-net-u

RE: thread safety

2010-04-02 Thread Digy
(); } } //Called by different threads. No "lock" is needed! void Search() { IndexSearcher src = new IndexSearcher( singletonIndexWriter.GetReader() ); //search src.Close(); } DIGY -Original Message- From: emagdnim paradox [mailto:bwsithspaw...@yahoo.

RE: Another set of optimisation questions...

2010-04-07 Thread Digy
to "reopen". Besides that, don't optimize the index at all, you won't lose anything remarkable in terms of performance because of changes in internals of Lucene 2.9.X such as "per segment readers". PS: Using single instances of IndexWriter/IndexReader are safe

RE: FileNotFoundException thrown when I expected Lucene.NET to create the file

2010-04-09 Thread Digy
Hi Nima, Your code runs fine. No exception. DIGY PS: I used Lucene.Net 2.9.2 -Original Message- From: nima dilmaghani [mailto:nim...@gmail.com] Sent: Friday, April 09, 2010 8:46 PM To: lucene-net-user@lucene.apache.org Subject: FileNotFoundException thrown when I expected Lucene.NET

RE: Searching for the Numerals in an Alphanumeric Data

2010-04-21 Thread Digy
You can use wildcard searches but they are slow. If I were you, I would write a custom analyzer to separate tokens as [uche] and [8861]. DIGY. -Original Message- From: EZEILO,UCHENNA JEREMIAH [mailto:ueze...@cenbank.org] Sent: Wednesday, April 21, 2010 6:26 PM To: lucene-net-user

RE: phrase filter

2010-04-21 Thread Digy
cene/lucene.net/trunk/C%23/contrib/Contri b.Net/Contrib.Net/Analysis/Ext/Analysis.Ext.cs DIGY -Original Message- From: emagdnim paradox [mailto:bwsithspaw...@yahoo.com] Sent: Wednesday, April 21, 2010 8:03 PM To: lucene-net-user@lucene.apache.org Subject: phrase filter Hey all, When I

RE: shuffle or randomize method?

2010-04-21 Thread Digy
I am not sure that I understand your need well. But why don't you just shuffle the search results? DIGY -Original Message- From: nima dilmaghani [mailto:nim...@gmail.com] Sent: Wednesday, April 21, 2010 8:40 PM To: lucene-net-user@lucene.apache.org Subject: shuffle or randomize m

RE: Highlighter highlights single words for exact phrase query

2010-04-23 Thread Digy
I don't know much about highlighter. Have you tried FastvectorHighlighter? it can give you more correct results related with phrase query. DIGY -Original Message- From: Hans Merkl [mailto:h...@hmerkl.com] Sent: Friday, April 23, 2010 9:31 PM To: lucene-net-user Subject: Highli

RE: Weighing relevance vs confidence

2010-04-26 Thread Digy
Have you tried "CustomScoreQuery"? DIGY -Original Message- From: Nicholas Paldino [.NET/C# MVP] [mailto:casper...@caspershouse.com] Sent: Monday, April 26, 2010 10:57 PM To: lucene-net-user@lucene.apache.org Subject: Weighing relevance vs confidence So

RE: New user: multi fields query question

2010-04-28 Thread Digy
Hi Nenad, To get 3 matches, you should have indexed 3 documents(either same docs. in the index or 3 docs each having one field). Otherwise it should have worked as you expected. For the second part, other that boosting "keyword", I don't think that you have to do somethi

RE: Find articles I like

2010-04-29 Thread Digy
See "MoreLikeThisQuery" in "Contrib/Queries.Net". It does what you are seeking for. DIGY -Original Message- From: Robert Pohl [mailto:robba...@gmail.com] Sent: Thursday, April 29, 2010 11:18 PM To: lucene-net-user@lucene.apache.org Subject: Find articles I like

RE: Handling Hierarchical Document Categories

2010-04-30 Thread Digy
child1" Doc4 : Category="Root Child1 Grandchild2" Doc5 : Category="Root Child2" Doc6 : Category="Root Child2 Grandchild3" Doc7 : Category="Root Child2 Grandchild4" DIGY -Original Message- From: Matt Honeycutt [mailto:mbhoneyc..

RE: Why did fix for LUCENENET-183 get reverted

2010-05-04 Thread Digy
Thanks Bernie, This patch is lost while working on 2.9.0 port. I recommitted that patch. (for 2.9.1 & 2.9.2 in trunk) and added also a test case(A small variation of your C# code) not to lose that again. DIGY -Original Message- From: Bernie Solomon [mailto:bjsj...@roadrunner.com]

RE: Unwanted Tokens

2010-05-06 Thread Digy
>>>>> return input.Next(); How can you can be sure that input.Next() is not a one-char token? DIGY -Original Message- From: Markus Doerig [mailto:farang...@gmail.com] Sent: Thursday, May 06, 2010 4:43 PM To: lucene-net-user@lucene.apache.org Subject: Unwanted Tokens

RE: Unwanted Tokens

2010-05-06 Thread Digy
For input "This is a b text", your code would output [this] [is] [b] [text]. Call "return this.Next()" instead of "return input.Next()" DIGY -Original Message- From: Markus Doerig [mailto:farang...@gmail.com] Sent: Thursday, May 06, 2010 4:43 PM To: luc

RE: Lucene.net in Medium Trust

2010-05-06 Thread Digy
I updated the title of LUCENENET-357 Attaching your patch to that issue would be good. DIGY -Original Message- From: Simone Chiaretta [mailto:simone.chiare...@gmail.com] Sent: Thursday, May 06, 2010 6:54 PM To: lucene-net-user@lucene.apache.org Subject: Re: Lucene.net in Medium Trust

RE: My Lucene queries only ever return one hit

2010-05-09 Thread Digy
Because you recreate(purge) the index. New IndexWriter(indexDir, New Standard.StandardAnalyzer(), >>>> True <<<<) Last Boolean parameter can be something like "Not System.IO.Directory.Exists(indexDir)" DIGY -Original Message- From: Bob Eastbro

RE: FastVectorHighlighter.Net returning null on GetBestFragment

2010-05-09 Thread Digy
r.WITH_POSITIONS_OFFSETS) ); //<-- Field used in highlighting with a query something like [f1:abc OR f2:abc], you would get a hit but no highlight. DIGY -Original Message- From: Midhat Ali [mailto:midhat...@gmail.com] Sent: Friday, May 07, 2010 9:03 PM To: lucene-net-user Subject:

RE: FastVectorHighlighter.Net returning null on GetBestFragment

2010-05-10 Thread Digy
Below is what I can create from your messy data. It works as expected and returns "The Company has noth". DIGY Lucene.Net.Store.RAMDirectory DIR = new Lucene.Net.Store.RAMDirectory(); IndexWriter wr = new IndexWriter(DIR, new Standa

RE: FastVectorHighlighter.Net returning null on GetBestFragment

2010-05-10 Thread Digy
One more thing, What is "i". Is it index of a loop or docID? fvHighlighter.GetBestFragment(fvHighlighter.GetFieldQuery(query),searcher.Ge tIndexReader(), i, "FullContent", 20); DIGY -Original Message- From: Midhat Ali [mailto:midhat...@gmail.com] Sent: Mo

RE: IndexWriter holding on to files?

2010-05-18 Thread Digy
I close it only when the application terminates). * I never optimize the Index. * With every search request, I invoke "IndexWriter.GetReader", make the search, and close the reader. * I use Lucene.Net 2.9.2 DIGY -Original Message- From: Josh Handel [mailto:josh.han...@catapults

RE: Highlighter and ConstantScoreQuery don't play together

2010-05-18 Thread Digy
Then, you should think of using Query.Rewrite for your queries. DIGY. -Original Message- From: TJ Kolev [mailto:tjko...@gmail.com] Sent: Tuesday, May 18, 2010 11:41 PM To: lucene-net-user@lucene.apache.org Subject: Re: Highlighter and ConstantScoreQuery don't play together I a

RE: IndexWriter holding on to files?

2010-05-19 Thread Digy
ing stuff is handled internally. DIGY -Original Message- From: Josh Handel [mailto:josh.han...@catapultsystems.com] Sent: Wednesday, May 19, 2010 4:52 PM To: lucene-net-user@lucene.apache.org Subject: RE: IndexWriter holding on to files? I'm pretty sure (IE watched it many many times i

RE: QueryParser

2010-05-20 Thread Digy
QueryParser always use analyzers. To search in fields that are not analyzed, use TermQuery. DIGY -Original Message- From: Josh Handel [mailto:josh.han...@catapultsystems.com] Sent: Thursday, May 20, 2010 8:05 PM To: lucene-net-user@lucene.apache.org Subject: RE: QueryParser If you are

RE: Still a problem with medium trust: unmanaged code

2010-05-24 Thread Digy
Hi Simone, Unmanaged flush is the direct port of java. I think using only the managed one would be enough. (There were discussions about that in the past, you can find them with some googling.) DIGY -Original Message- From: Simone Chiaretta [mailto:simone.chiare...@gmail.com] Sent

RE: Big data Suggestions. (Out of Memory)

2010-05-24 Thread Digy
Try to use "commit" (for ex, at every 10000 docs) DIGY -Original Message- From: Josh Handel [mailto:josh.han...@catapultsystems.com] Sent: Monday, May 24, 2010 9:57 PM To: lucene-net-user@lucene.apache.org Subject: Big data Suggestions. (Out of Memory) I hate to ping multipl

RE: Still a problem with medium trust: unmanaged code

2010-05-24 Thread Digy
In 2.4.0 we only used managed flush code and I never saw a bug. DIGY -Original Message- From: Simone Chiaretta [mailto:simone.chiare...@gmail.com] Sent: Monday, May 24, 2010 10:15 PM To: lucene-net-user@lucene.apache.org Subject: Re: Still a problem with medium trust: unmanaged code So

RE: Still a problem with medium trust: unmanaged code

2010-05-24 Thread Digy
Hi Simone, Since you will get a SecurityException everytime sync is called in medium trust, why don't you just comment out that code. If I were 100% sure, I would do the same for the Lucene.Net core. DIGY -Original Message- From: Simone Chiaretta [mailto:simone.chiare...@gmai

RE: Still a problem with medium trust: unmanaged code

2010-05-24 Thread Digy
No, It is not about mono. My hesitation is: If a managed process crashes after it's called filestream.Flush(), can we know for sure that all the data will eventually be written to disk. DIGY -Original Message- From: Simone Chiaretta [mailto:simone.chiare...@gmail.com] Sent: Tuesday

RE: Big data Suggestions. (Out of Memory)

2010-05-25 Thread Digy
it yesterday) DIGY -Original Message- From: Josh Handel [mailto:josh.han...@catapultsystems.com] Sent: Tuesday, May 25, 2010 4:55 PM To: lucene-net-user@lucene.apache.org Subject: RE: Big data Suggestions. (Out of Memory) Set the commit to every 1000 documents and changed to just 1

RE: Normalized scoring of queries against short documents?

2010-05-25 Thread Digy
It is an expected behaviour. Score can be any number .GT. 0 and isn't normalized(although it mostly falls into [0-1] range). DIGY -Original Message- From: Christian Klauß [mailto:s8788...@inf.tu-dresden.de] Sent: Tuesday, May 25, 2010 2:42 PM To: lucene-net-user@lucene.apach

RE: Lucene.NET issue with ASP.NET. NoSuchDirectoryException and LockObtainFailedException error. Windows

2010-05-26 Thread Digy
Killing a process without closing the IndexWriter may leave lock-files behind. You can use the static method "IndexWriter.Unlock" to forcibly unlock the index. DIGY -Original Message- From: Adam smalin [mailto:acidzombi...@gmail.com] Sent: Wednesday, May 26, 2010 8:29 PM To:

RE: querying for certain document term frequency

2010-05-27 Thread Digy
ader.Document(td.Doc()); .. } DIGY -Original Message- From: Artem Chereisky [mailto:a.cherei...@gmail.com] Sent: Thursday, May 27, 2010 2:02 PM To: lucene-net-user@lucene.apache.org Subject: querying for certain document term frequency Hi, I'm l

  1   2   >