Possible location of word inside the file.

2013-07-04 Thread Ankit Murarka
Dear Team, I have a potential use case. I have a large number of log files which are archived in a particular directory. Now the administrator would like to view certain information which might or might not be present in any of the files inside the directory. Using Lucene, I was

Re: Possible location of word inside the file.

2013-07-04 Thread Ankit Murarka
:[12340 TO 12350] to get 5 lines either side. Use a NumericField and NumericRangeQuery for lineno if you are concerned about performance. See recent thread on this list for more on that. -- Ian. On Thu, Jul 4, 2013 at 8:10 AM, Ankit Murarka ankit.mura...@rancoretech.com wrote: Dear Team

Compare the input string with stored string and Take decision.

2013-07-11 Thread Ankit Murarka
to use etc. -- Regards Ankit Murarka Peace is found not in what surrounds us, but in what we hold within. - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h

Re: Compare the input string with stored string and Take decision.

2013-07-11 Thread Ankit Murarka
characters I might have to use Lucene. Please advise whether it will be helpful, because the entire comparison mechanism is expected to complete in milliseconds. On 7/11/2013 11:59 AM, Ivan Krišto wrote: On 07/11/2013 08:04 AM, Ankit Murarka wrote: a. I have a string of 190 characters. b. I need

Trying to search java.lang.NullPointerException in log file.

2013-07-22 Thread Ankit Murarka
Hello. I am trying to search for java.lang.NullPointerException in a log file. The log file is huge. However, I am unable to find it. This is because the StandardAnalyzer must be splitting the words on spaces, and since there is no space present here, the entire string is converted into one token.

Re: Trying to search java.lang.NullPointerException in log file.

2013-07-22 Thread Ankit Murarka
on periods into separate words and then just use phrase query to find java.lang.NullPointerException, but it depends entirely on your analysis chain. Best Erick On Mon, Jul 22, 2013 at 6:24 AM, Ankit Murarka ankit.mura...@rancoretech.com wrote: Hello. I am trying to search
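As a rough illustration of the idea in the reply above, here is a minimal sketch in plain Java, assuming an analysis chain that splits on periods and whitespace and lowercases each piece. The class and method names are hypothetical and this is not a Lucene analyzer; it only shows the terms such a chain could produce, which a phrase query would then match in order.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

class PeriodSplitSketch {
    // Hypothetical helper: splits on periods and whitespace, then lowercases each piece.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.split("[.\\s]+")) {
            if (!piece.isEmpty()) { // a leading separator yields an empty first piece
                tokens.add(piece.toLowerCase(Locale.ROOT));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        // Prints the three terms a phrase query would need to match in sequence.
        System.out.println(tokenize("java.lang.NullPointerException"));
    }
}
```

Whatever splitting is chosen, the query string has to go through the same steps as the indexed text, otherwise the terms will not line up.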

Re: Trying to search java.lang.NullPointerException in log file.

2013-07-22 Thread Ankit Murarka
to hand-analyze the query string and simulate exactly what the standard analyzer did at index time. So, please clarify your situation. -- Jack Krupansky -Original Message- From: Ankit Murarka Sent: Monday, July 22, 2013 6:24 AM To: java-user@lucene.apache.org Subject: Trying to search

Search a Part of the Sentence/Complete sentence in lucene 4.3

2013-07-24 Thread Ankit Murarka
Dear All, Suppose I have 3 documents. The sample text is: File 1: Mr X David is a manager of the company. He is the senior most manager. I also want to become manager of the company. File 2: Mr X David manager of the company is also very senior. He happens to be the senior

Re: Search a Part of the Sentence/Complete sentence in lucene 4.3

2013-07-24 Thread Ankit Murarka
) but should work well in practice ... Mike McCandless http://blog.mikemccandless.com On Wed, Jul 24, 2013 at 4:31 AM, Ankit Murarka ankit.mura...@rancoretech.com wrote: Dear All, Say suppose I have 3 documents. The sample text is /*File 1 : */ Mr X David is a manager of the company. He

Re: Search a Part of the Sentence/Complete sentence in lucene 4.3

2013-07-26 Thread Ankit Murarka
solution here too, but they make your index larger. CommonGramsFilter lets you shingle only specific words, e.g. you could pass your stop words to it. Mike McCandless http://blog.mikemccandless.com On Wed, Jul 24, 2013 at 7:34 AM, Ankit Murarka ankit.mura...@rancoretech.com wrote: I tried using

Re: Search a Part of the Sentence/Complete sentence in lucene 4.3

2013-07-27 Thread Ankit Murarka
of tokens, so the bloat is much less. Mike McCandless http://blog.mikemccandless.com On Fri, Jul 26, 2013 at 7:34 AM, Ankit Murarka ankit.mura...@rancoretech.com wrote: Hello, can you elaborate more on this? I seem to be lost here. Since I am new to Lucene, yesterday I was going

Re: Search a Part of the Sentence/Complete sentence in lucene 4.3

2013-07-28 Thread Ankit Murarka

Did you Mean search on Indexes created by Different Files.

2013-07-29 Thread Ankit Murarka
ache heavy as one possible suggestion. I haven't tried it yet, but it would surely be an absolute beauty to have. Also, the examples available on the net for Did You Mean are very old, and the API has undergone significant changes, making them not very useful. -- Regards Ankit Murarka

Re: Did you Mean search on Indexes created by Different Files.

2013-07-29 Thread Ankit Murarka
Can anyone guide me on how to achieve the below-mentioned objective? I am kind of stuck and cannot figure out what is wrong. I also tried spellChecker.setSpellIndex(directory); but still got no suggestions. On 7/29/2013 4:36 PM, Ankit Murarka wrote: Since I am new to this, I can't stop exploring

Re: Did you Mean search on Indexes created by Different Files.

2013-07-30 Thread Ankit Murarka
); } Please guide. Basically, the suggested words should come from the indexes I have created, not from any dictionary. Is that possible? On 7/29/2013 9:34 PM, Varun Thacker wrote: Hi, On Mon, Jul 29, 2013 at 4:36 PM, Ankit Murarka ankit.mura...@rancoretech.com wrote

Re: Did you Mean search on Indexes created by Different Files.

2013-07-30 Thread Ankit Murarka
Any help on this will be highly appreciated. I have been trying all possible options, but to no avail. I also tried LuceneDictionary, but this does not seem to help either. Please guide. On 7/30/2013 4:49 PM, Ankit Murarka wrote: Hello. Using DirectSpellChecker is not serving my

Re: Did you Mean search on Indexes created by Different Files.

2013-07-31 Thread Ankit Murarka
Can anyone please guide me on how to implement Did You Mean search using indexes created from a supplied set of files as input? On 7/31/2013 11:15 AM, Ankit Murarka wrote: Any help on this will be highly appreciated. I have been trying all possible options, but to no avail

Re: Did you Mean search on Indexes created by Different Files.

2013-08-01 Thread Ankit Murarka
you are creating the term, you should analyze the string with the same analyzer you used for indexing the contents field. On Tue, Jul 30, 2013 at 4:49 PM, Ankit Murarka ankit.mura...@rancoretech.com wrote: Hello. Using DirectSpellChecker is not serving my purpose. This seems to return word

Re: Did you Mean search on Indexes created by Different Files. -Completed.

2013-08-01 Thread Ankit Murarka
obviously). Then I gave this new directory path to the spellChecker to search and it gave me what I wanted: word suggestions from the documents I supplied as input. Hopefully someone may find it useful. On 8/1/2013 10:44 AM, Ankit Murarka wrote: Can anyone please guide me on how to implement

Complete phrase Suggest Feature in Apache Lucene

2013-08-02 Thread Ankit Murarka
Hello All, Just like the spellcheck feature, which was implemented after a lot of trouble, is it possible to implement a Complete Phrase Suggest feature in Lucene 4.3? If I enter an incorrect phrase, it should suggest a few possible valid phrases. One way could be to get a suggestion for each word in

Re: Complete phrase Suggest Feature in Apache Lucene

2013-08-02 Thread Ankit Murarka
/KojiSekiguchi/lucene-terms-extraction By using it, I got a phrase list. The phrase list can be used for autocomplete and did you mean features. koji -- Regards Ankit Murarka What lies behind us and what lies before us are tiny matters compared with what lies within us

How to Index each file and then each Line for Complete Phrase Match. Sample Data shown.

2013-08-03 Thread Ankit Murarka
suggest. How is this possible using Lucene and its API? The Javadoc does not seem to guide me anywhere for this case. -- Regards Ankit Murarka

Re: How to Index each file and then each Line for Complete Phrase Match. Sample Data shown.

2013-08-05 Thread Ankit Murarka
line as a tokenized text field and then do PhraseQuery against that text field? Is there something else you need beyond that? -- Jack Krupansky -Original Message- From: Ankit Murarka Sent: Saturday, August 03, 2013 3:22 AM To: java-user@lucene.apache.org Subject: How to Index each file

Re: How to Index each file and then each Line for Complete Phrase Match. Sample Data shown.

2013-08-06 Thread Ankit Murarka
AM, Ankit Murarka ankit.mura...@rancoretech.com wrote: Hello. 1. What I am trying to implement is Complete Suggestion Match-Did You Mean feature for a phrase. I did it for Single Word. I want to do it now for Sentence. 2. What my understanding of indexing each line as a valid phrase

Re: Complete phrase Suggest Feature in Apache Lucene

2013-08-06 Thread Ankit Murarka
by indexing it line by line. Each phrase is actually a line. On 8/2/2013 7:58 PM, Ivan Krišto wrote: On 08/02/2013 10:16 AM, Ankit Murarka wrote: is it possible to implement Complete Phrase Suggest Feature in Lucene 4.3 . So if I enter an incorrect phrase it can suggest me few possible valid

Boolean Query when indexing each line as a document.

2013-08-13 Thread Ankit Murarka
Hello All, I have 2 different use cases. I am trying to provide both boolean query and phrase search query in the application. In every line of the document which I am indexing, I have content like: attribute name=remedial action value=Checking/\ Due to the phrase search

Re: Boolean Query when indexing each line as a document.

2013-08-13 Thread Ankit Murarka
:56 AM, Ankit Murarka ankit.mura...@rancoretech.com wrote: Hello All, I have 2 different usecases. I am trying to provide both boolean query and phrase search query in the application. In every line of the document which I am indexing I have content like : attribute name

Trying to store Offsets. Dont know the exact meaning of some terms.

2013-08-13 Thread Ankit Murarka
Hello, I generally add fields to my document in the following manner: doc.add(new StringField("contents", line, Field.Store.YES)); I wish to also store offsets for this field. So, I went through the Javadoc and found I need to use FieldType. So, I ended up using:

Re: Boolean Query when indexing each line as a document.

2013-08-14 Thread Ankit Murarka
. You need to give us something more to go on. -- Ian. On Tue, Aug 13, 2013 at 11:13 AM, Ankit Murarka ankit.mura...@rancoretech.com wrote: Hello, I am aware of that link and I have been through that link many number of times. Problem I have is: 1. Each line is indexed. So indexed

Re: Boolean Query when indexing each line as a document.

2013-08-14 Thread Ankit Murarka
program that demonstrates the problem. -- Ian. On Wed, Aug 14, 2013 at 2:50 PM, Ankit Murarka ankit.mura...@rancoretech.com wrote: Hello. The problem does not seem to be getting solved. As mentioned, I am indexing each line of each file. The sample text present inside LUKE is am

Re: Boolean Query when indexing each line as a document.

2013-08-14 Thread Ankit Murarka
somewhere but building a BooleanQuery out of a TermQuery or two. Are you sure (test it and prove it) that the strings you pass to the TermQuery are EXACTLY what has been indexed? -- Ian. On Wed, Aug 14, 2013 at 3:29 PM, Ankit Murarka ankit.mura...@rancoretech.com wrote: Hello. The problem

Re: Boolean Query when indexing each line as a document.

2013-08-17 Thread Ankit Murarka
build and post the smallest possible self-contained program, using RAMDirectory and no external classes. If you are using a custom analyzer try it without - if that works you've got a clue as to where to look next. Good luck. On Wed, Aug 14, 2013 at 3:46 PM, Ankit Murarka ankit.mura

Re: Boolean Query when indexing each line as a document.

2013-08-19 Thread Ankit Murarka
query/wildcard query/prefix query to return hits even when I parse all the files line by line... On 8/17/2013 1:15 PM, Ankit Murarka wrote: Hello. Reference to CustomAnalyzer is what I had mentioned. I created a custom analyzer by using the StandardAnalyzer code. Only change I made

Re: Boolean Query when indexing each line as a document.

2013-08-21 Thread Ankit Murarka
of Phrase Suggestion which is not serving any purpose. This is not desirable. My PhraseSuggestion/InfixSuggestion etc. is now not working properly. Please guide. This is a complete blocker. On 8/19/2013 12:28 PM, Roberto Ragusa wrote: On 08/19/2013 08:17 AM, Ankit Murarka wrote

Re: Boolean Query when indexing each line as a document.

2013-08-21 Thread Ankit Murarka
PM, Roberto Ragusa wrote: On 08/21/2013 08:38 AM, Ankit Murarka wrote: Hello. I tried with doc.add(new Field("contents", line, Field.Store.YES, Field.Index.ANALYZED)); The BooleanQuery/PrefixMatch/WildCard all started running fine, but it broke the existing code for Phrase Suggestion

Re: Boolean Query when indexing each line as a document.

2013-08-21 Thread Ankit Murarka
and YYY and 100 lines below XXX and YYY. Any suggestion/guidance will be appreciated. On 8/21/2013 2:39 PM, Roberto Ragusa wrote: On 08/21/2013 09:51 AM, Ankit Murarka wrote: Yeah..I eventually DID THIS Just a small question : Knowing that BooleanQuery/PrefixQuery/WildCardQuery might also

Files greater than 20 MB not getting Indexed. No files generated except write.lock even after 8-9 minutes.

2013-08-28 Thread Ankit Murarka
and 512 MB heap. -- Regards Ankit Murarka

Re: Files greater than 20 MB not getting Indexed. No files generated except write.lock even after 8-9 minutes.

2013-08-29 Thread Ankit Murarka
Any help would be highly appreciated. I am kind of stuck and unable to find a possible solution. On 8/29/2013 11:21 AM, Ankit Murarka wrote: Hello all, Faced with a typical issue. I have many files which I am indexing. Problem Faced: a. File having size less than 20 MB

Re: Files greater than 20 MB not getting Indexed. No files generated except write.lock even after 8-9 minutes.

2013-08-29 Thread Ankit Murarka
, Ankit Murarka ankit.mura...@rancoretech.com wrote: Hello all, Faced with a typical issue. I have many files which I am indexing. Problem Faced: a. Files smaller than 20 MB are successfully indexed and merged. b. Files larger than 20 MB are not getting indexed. No exception is being thrown

Re: Files greater than 20 MB not getting Indexed. No files generated except write.lock even after 8-9 minutes.

2013-08-29 Thread Ankit Murarka
BufferedReader(new InputStreamReader(fis, "UTF-8")); I think that will slurp the whole file in one go, which will obviously need more memory on larger files than on smaller ones. Or just run the program with more memory. -- Ian. On Thu, Aug 29, 2013 at 1:05 PM, Ankit Murarka ankit.mura
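For readers following the memory discussion, here is a minimal sketch in plain Java, assuming the aim is to avoid holding a whole large file in memory by handling one line at a time. The class name, the LineHandler callback, and the sample input are all hypothetical; the point where a Lucene document would be built is left to the callback rather than shown.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.UncheckedIOException;

class LineByLineSketch {
    /** Hypothetical callback; an indexer could build one document per line here. */
    interface LineHandler {
        void handle(long lineNo, String line);
    }

    /** Reads the source one line at a time and returns the number of lines seen. */
    static long forEachLine(Reader source, LineHandler handler) {
        long lineNo = 0;
        // try-with-resources closes the reader exactly once, even if the handler throws
        try (BufferedReader reader = new BufferedReader(source)) {
            String line;
            while ((line = reader.readLine()) != null) {
                lineNo++;
                handler.handle(lineNo, line);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return lineNo;
    }

    public static void main(String[] args) {
        // For a real file, pass e.g. Files.newBufferedReader(path, StandardCharsets.UTF_8).
        Reader sample = new StringReader("first line\nsecond line\nthird line\n");
        long count = forEachLine(sample, (n, line) -> System.out.println(n + ": " + line));
        System.out.println("lines read: " + count);
    }
}
```

With this shape only the current line and the reader's buffer are held at once, so memory use does not grow with file size unless the handler itself accumulates lines.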

Re: Files greater than 20 MB not getting Indexed. No files generated except write.lock even after 8-9 minutes.

2013-08-30 Thread Ankit Murarka
file you are ever going to index? The server may have 8GB but how much memory are you allowing the JVM? What are the command line flags? I think you mentioned 128Mb in an earlier email. That isn't much. -- Ian. On Thu, Aug 29, 2013 at 2:14 PM, Ankit Murarka ankit.mura...@rancoretech.com

Re: Files greater than 20 MB not getting Indexed. No files generated except write.lock even after 8-9 minutes.

2013-08-30 Thread Ankit Murarka
:42:22,008 INFO Indexing to directory Any guidance will be highly appreciated...... Server Opts are -server -Xms8192m -Xmx16384m -XX:MaxPermSize=512m On 8/30/2013 3:13 PM, Ankit Murarka wrote: Hello. The server has much more memory. I have given minimum 8 GB to Application Server

Re: Files greater than 20 MB not getting Indexed. No files generated except write.lock even after 8-9 minutes.

2013-08-30 Thread Ankit Murarka
(MainClass1.java:136) Can anyone please guide? There has to be some way for a file of, say, 20 MB to be properly indexed. Any guidance is highly appreciated. On 8/30/2013 6:49 PM, Ankit Murarka wrote: Hello, The following exception is being printed on the server console when trying to index

Re: Files greater than 20 MB not getting Indexed. No files generated except write.lock even after 8-9 minutes.

2013-08-31 Thread Ankit Murarka
of the JVM. Can you come up with a small piece of code that reproduces the error you are encountering? This would help us see if there is something wrong in the indexing code and try to debug it otherwise. -- Regards Ankit Murarka

Stream Closed Exception and Lock Obtain Failed Exception while reading the file in chunks iteratively.

2013-09-01 Thread Ankit Murarka
lies. Trying to figure it out, but to no avail. -- Regards Ankit Murarka

Re: Stream Closed Exception and Lock Obtain Failed Exception while reading the file in chunks iteratively.

2013-09-02 Thread Ankit Murarka
) { e.printStackTrace(); } finally { fis.close(); } } } } } Kindly guide as to where the possible problem lies. Trying to figure it out, but to no avail. -- Regards Ankit Murarka

Re: Stream Closed Exception and Lock Obtain Failed Exception while reading the file in chunks iteratively.

2013-09-02 Thread Ankit Murarka
block. But I don't think this is possibly the root of this problem. There seems to be another problem which is causing this issue..Would appreciate some guidance.. On 9/2/2013 12:11 PM, Trejkaz wrote: On Mon, Sep 2, 2013 at 4:10 PM, Ankit Murarka ankit.mura...@rancoretech.com wrote

Basic understanding and difference between getSuggestion and loopup method of InfixSuggester.

2013-09-05 Thread Ankit Murarka
for a more concrete description of both. Can anyone please explain both parts to me, as well as the basic difference between them? -- Regards Ankit Murarka

Re: Basic understanding and difference between getSuggestion and loopup method of InfixSuggester.

2013-09-05 Thread Ankit Murarka
Dear All, Can you please explain the possible difference between getSuggestion and InfixSuggester.lookup in terms of the suggestions provided by each? On 9/5/2013 1:04 PM, Ankit Murarka wrote: Hello All, I would like to know the basic difference between providing a phrase

Providing special characters in input when using QueryParser.

2013-09-10 Thread Ankit Murarka
cannot make any changes in the indexing process. Also, inputs like this will be very common from the user end. Kindly guide. TIA. -- Regards Ankit Murarka

Re: Providing special characters in input when using QueryParser.

2013-09-10 Thread Ankit Murarka
OK, the problem seems to be solved. Using the escape method of QueryParser, I am able to parse the given input with special characters as well. If this leads to another pitfall, please let me know. On 9/10/2013 12:19 PM, Ankit Murarka wrote: Hello, I am using StandardAnalyzer
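To make the escaping step concrete, here is a minimal sketch in plain Java of what backslash-escaping query syntax characters looks like. It is a stand-in written for illustration, not Lucene's own code: the class name is hypothetical and the set of special characters is an assumption based on the classic query parser syntax. In a real application the QueryParser escape method mentioned above would be used instead.

```java
class EscapeSketch {
    // Assumed set of characters that carry special meaning in the classic query syntax.
    private static final String SPECIAL = "\\+-!():^[]\"{}~*?|&/";

    /** Prefixes every special character with a backslash; other characters pass through. */
    static String escape(String input) {
        StringBuilder out = new StringBuilder(input.length() + 8);
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i);
            if (SPECIAL.indexOf(c) >= 0) {
                out.append('\\');
            }
            out.append(c);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // The slash and asterisk are escaped; letters and '=' are left alone.
        System.out.println(escape("value=Checking/*"));
    }
}
```

One consequence of escaping everything is that `*` and `?` are then matched literally rather than as wildcards, and whitespace is not escaped, so the input is still split into separate terms.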

Check if Term present in Existing Index before Merging indexes from Directory.

2013-09-11 Thread Ankit Murarka
.. Javadoc does not seem to help me. TIA. -- Regards Ankit Murarka

Query type always Boolean Query even if * and ? are present.

2013-09-12 Thread Ankit Murarka
Absolutely no manipulation is done on the string between it being given as input and it being provided to this search function. Kindly guide. TIA. -- Regards Ankit Murarka

Re: Query type always Boolean Query even if * and ? are present.

2013-09-12 Thread Ankit Murarka
needed.. TIA.. On 9/12/2013 8:52 PM, Ankit Murarka wrote: I also tried it with this query: param name=user_name value=USER_NAME_MENTIONED/* I am still getting it as Boolean Query.. It should be Prefix... On 9/12/2013 8:50 PM, Jack Krupansky wrote: The trailing asterisk in your query input

Re: Query type always Boolean Query even if * and ? are present.

2013-09-12 Thread Ankit Murarka
will not treat it as a wildcard. -- Jack Krupansky -Original Message- From: Ankit Murarka Sent: Thursday, September 12, 2013 10:19 AM To: java-user@lucene.apache.org Subject: Query type always Boolean Query even if * and ? are present. Hello. I am faced with a trivial issue: Everytime my Query

Re: Query type always Boolean Query even if * and ? are present.

2013-09-12 Thread Ankit Murarka
not escaping white space, so your input will be a sequence of terms, which should generate a BooleanQuery. What is the last clause of the BQ? It should be your PrefixQuery. -- Jack Krupansky -Original Message- From: Ankit Murarka Sent: Thursday, September 12, 2013 11:25 AM To: java-user

Number of Times 1 Field has occured in a document within a Given TimeRange,.

2013-12-02 Thread Ankit Murarka
know if any point is not clear and I will clarify it again. -- Regards Ankit Murarka

Re: Number of Times 1 Field has occured in a document within a Given TimeRange,.

2013-12-04 Thread Ankit Murarka
Hello. Would really appreciate if someone can guide me on the below mentioned issue. On 03-12-2013 12:48, Ankit Murarka wrote: Hello. This might be a long mail but I have mentioned everything very clearly so that I can get needed assistance. Indexing: I have a use case. I am indexing two

Re: Number of Times 1 Field has occured in a document within a Given TimeRange,.

2013-12-06 Thread Ankit Murarka
Well, this is a bit strange, as this is the first time I am not receiving any reply to a question even after sending it again. It would be very helpful if someone could throw some light on the problem. On 04-12-2013 18:54, Ankit Murarka wrote: Hello. Would really appreciate if someone can guide me