Re: [CLucene-dev] Endless loop in IndexWriter?

2009-11-11 Thread Veit Jahns
Hi Itamar! 2009/11/10 Itamar Syn-Hershko ita...@divrei-tora.com You might be right, I wonder why this hasn't been discovered by now? Would you mind writing a minified test case for this and possibly also demonstrating how your proposed fix fixes this issue? I will then have to verify this

[CLucene-dev] Multithreaded write and search

2010-03-19 Thread Veit Jahns
Dear all, in our application we have several threads writing and searching on the index. Writing and searching isn't a problem as long as the IndexSearcher isn't updated, i.e., deleting the old and creating a new IndexSearcher. But if the IndexSearcher is updated, then the application crashes. Not

Re: [CLucene-dev] PerFieldAnalyzerWrapper memory leak

2010-03-19 Thread Veit Jahns
Itamar, I have a question about one of your commits made in the course of this discussion, commit 364c216fb93 [1]. You replaced a TokenStream pointer with a void pointer in the thread local storage of Analyzer::Internal. Now everything can be saved as a previous stream, but

Re: [CLucene-dev] Clucene search - An invalid field name was

2010-04-21 Thread Veit Jahns
. Search is throwing the error and the default field I have given is the field which is tokenized. I am indexing the Title of the document and this title is given as default searchable field. Message: 2 Date: Wed, 7 Apr 2010 21:11:20 +0200 From: Veit Jahns nuncupa...@googlemail.com Subject: Re

Re: [CLucene-dev] Multithreaded write and search

2010-04-23 Thread Veit Jahns
Hi Itamar! 2010/4/16 Itamar Syn-Hershko ita...@divrei-tora.com: Apparently Ben has been working on the same issues on his branch. I applied your patches along with his work, plus some more cleanup, into the atomicthreads branch. Unfortunately, while it compiles and runs perfectly well on

Re: [CLucene-dev] IndexModifier exception during destruction

2010-06-30 Thread Veit Jahns
2010/6/17 Itamar Syn-Hershko ita...@divrei-tora.com: You can send it on the mailing list, or to me privately. Either as a code file, or (ultimately) a patch for cl_test introducing the new test case. I saw this test (IndexModifierExceptionTest) on the git repository. But the IndexModifier of

Re: [CLucene-dev] cl_demo memory leaks discovery

2010-07-01 Thread Veit Jahns
2010/7/1 Itamar Syn-Hershko ita...@code972.com: On one hand, we should be following the Java behavior. If they always call close, we should always do as well. When having a reader object on the stack this is definitely how that should work. On the other hand, if calling close from the

Re: [CLucene-dev] How to use RAMDirectory for index that is 2G

2010-07-09 Thread Veit Jahns
2010/7/9 Liu bbskill bbsk...@gmail.com: I have a large index whose size is 5.8G, and I fail to use RAMDirectory to load it into memory with a segmentation fault. I dived into the code, and found the index of RAMDirectory is limited to 2G. I am curious about the reason, and

Re: [CLucene-dev] Exception during thread finish

2010-07-27 Thread Veit Jahns
This message looks familiar to me. Maybe it is the same problem I once had [1]. Veit [1] http://comments.gmane.org/gmane.comp.jakarta.lucene.clucene.devel/3358

[CLucene-dev] Bug in BitSet::writeDgaps()

2010-08-03 Thread Veit Jahns
Hi, I observed that the index becomes corrupted (Read past EOF) after several updates and deletes. The reason was that the deletions file wasn't written out completely if a bit was set at the end of the underlying bitset and the DGaps format was used. I made a test case [1] as well as a fix [2]

Re: [CLucene-dev] Bug in BitSet::writeDgaps()

2010-08-03 Thread Veit Jahns
2010/8/3 Itamar Syn-Hershko ita...@code972.com: The beauty of tests is they speak for themselves... Is it possible to have a test showing the corruption issue you mentioned if the BitSet patch isn't applied? Just pick the commit with the test case. Then the BitSet test should fail. Veit

Re: [CLucene-dev] Bug in BitSet::writeDgaps()

2010-08-03 Thread Veit Jahns
2010/8/3 Itamar Syn-Hershko ita...@code972.com: What I'm looking for is a test showing the index corruption scenario you described - if it can be reproduced in a test, and then to see the BitSet patch fixes it. Without this test we only assume the BitSet bug caused it. While this makes sense,

Re: [CLucene-dev] Bug in BitSet::writeDgaps()

2010-08-03 Thread Veit Jahns
2010/8/3 Itamar Syn-Hershko ita...@code972.com: On 3/8/2010 4:51 PM, Veit Jahns wrote: 2010/8/3 Itamar Syn-Hershko ita...@code972.com: What I'm looking for is a test showing the index corruption scenario you described - if it can be reproduced in a test, and then to see the BitSet patch fixes

Re: [CLucene-dev] RAMDirectory testing (using MockRAMDirectory)

2010-08-10 Thread Veit Jahns
2010/8/10 Kostka Bořivoj kos...@tovek.cz: I added MockRAMDirectory ported by Veit into TestRAMDirectory and found following: 1) sizeInBytes members in RAMDirectory and RAMFile aren’t correct, I fixed this somehow, but I’m not sure if it shouldn’t be fixed different way, could anybody

Re: [CLucene-dev] TestIndexWriter::testWickedLongTerm failure

2010-08-10 Thread Veit Jahns
2010/8/10 Kostka Bořivoj kos...@tovek.cz: The problem is the bigTerm (16383 B long word) added in doc isn't returned as one token during indexing. StandardTokenizer splits it into a set of tokens, each 256 bytes long. So the term isn't skipped as too long but indexed as a set of tokens. Then, of

Re: [CLucene-dev] RAMDirectory testing (using MockRAMDirectory)

2010-08-11 Thread Veit Jahns
2010/8/11 Kostka Bořivoj kos...@tovek.cz: For the patch of the RAMFile destructor I'm in doubt. Looking at the corresponding JLucene classes, subtracting the file size is done in the deleteFile method of RAMDirectory (see diff). By the way, I needed some inline keywords removed in

Re: [CLucene-dev] Vote for merging of atomicthreads branch into master

2010-08-20 Thread Veit Jahns
2010/8/17 Itamar Syn-Hershko ita...@code972.com: So I intend to merge it to master soon - that's the only move that makes sense at this point. Any objections? Any volunteers for last minute testing? cl_test of the branch atomicthreads works fine for me on different platforms. But when I use this

Re: [CLucene-dev] Span Queries have been ported

2010-09-01 Thread Veit Jahns
2010/8/31 Šplíchal Jiří splic...@tovek.cz: I have just pushed the SpanQuery implementation and its basic tests into the jiri-working branch. It would be great if you could test it so that it can be merged into the current HEAD. Great! I tested it with Win7/VS2010 and the span query tests

Re: [CLucene-dev] wild card query

2010-09-01 Thread Veit Jahns
Hi Mark, in wildcard queries the tokens are not processed by the analyzer (see also [1]). So in your case the searcher will look for terms beginning with the characters 1ab. And because the original term is split up by the StandardAnalyzer into two terms, there is no such term in the index. I

Re: [CLucene-dev] Span Queries have been ported

2010-09-06 Thread Veit Jahns
2010/9/5 Šplíchal Jiří splic...@tovek.cz: Hello Veit, yes, please feel free to push your changes to my branch. Pushed them. But one of your recent commits [1]. Because of the additional decRef() in the SegmentReader's destructor the assertion in line 84 of this file fails in the

Re: [CLucene-dev] Span Queries have been ported

2010-09-07 Thread Veit Jahns
2010/9/7 Šplíchal Jiří splic...@tovek.cz: Yes, me and Borek wanted to merge our branches so I pushed all my changes - and forgot not to push this one. I removed it again in my local repository but unfortunately I am not able to push the change to the original repository - I always got the

Re: [CLucene-dev] Span Queries have been ported

2010-09-07 Thread Veit Jahns
2010/9/7 Veit Jahns nuncupa...@googlemail.com: 2010/9/7 Šplíchal Jiří splic...@tovek.cz: I tried to find the right position for setting the closed flag to true (in the doClose() method?) but I did not finish the fix yet. The simplest solution is of course not to forget to call the close

Re: [CLucene-dev] Span Queries have been ported

2010-09-07 Thread Veit Jahns
The output looks good. Another guess of mine is that something is wrong with your repository. So I'd check the [remote "origin"] entry in the file .git/config, or clone the clucene repository again into another directory and try to push a commit from that repository. Veit

Re: [CLucene-dev] Vote for merging of atomicthreads branch into master

2010-09-08 Thread Veit Jahns
2010/8/20 Veit Jahns nuncupa...@googlemail.com: 2010/8/17 Itamar Syn-Hershko ita...@code972.com: So I intend to merge it to master soon - that's the only move that makes sense at this point. Any objections? Any volunteers for last minute testing? cl_test of the branch atomicthreads works fine

Re: [CLucene-dev] Current branches state

2010-09-22 Thread Veit Jahns
2010/9/20 Šplíchal Jiří splic...@tovek.cz: I tested once more our code in order to check the memory leaks. Here is the list of leaking test suites: [...] Let us get rid of the leaks! I will start from the bottom Ok. As I was examining memory leaks in CLucene, I observed, that some leaks

Re: [CLucene-dev] FSDirectory fixed

2010-09-30 Thread Veit Jahns
2010/9/27 Kostka Bořivoj kos...@tovek.cz: Exception CL_ERR_IO  path not a directory was (sometimes) thrown, if getDirectory is called with path to non-existing folder. Wrong condition fixed. Looks good to me. But I guess, Itamar will ask for a test case. ;) By the way, fileStat will be

Re: [CLucene-dev] CLucene-developers Digest, Vol 54, Issue 1

2010-10-11 Thread Veit Jahns
the crash occurs on startup before using any thing of the clucene. Can you attach a debugger, so you can see exactly where in the code the crash occurs? Veit

Re: [CLucene-dev] NearSpansUnordered bug fix

2010-11-15 Thread Veit Jahns
2010/11/13 Itamar Syn-Hershko ita...@code972.com: [...] veit_working [is] still there even though [it is] merged into master already - owners, please verify and remove them. Done. Veit

Re: [CLucene-dev] Highlighter does not compile?

2010-11-15 Thread Veit Jahns
Hi Mohammad, it is an encoding issue. I attached a patch to avoid this. Can you check, if this patch works for you? If so, I'd commit it to the master branch (or a fix branch, if needed). Kind regards, Veit 2010/11/13 muhammad ismael m.ismae...@gmail.com: Hi, I want to use highlighter but

Re: [CLucene-dev] Have anybody run cl_test or cl_demo on solaris sparc?

2010-12-21 Thread Veit Jahns
Just an idea regarding: make cl_demo gives this linking error. Linking CXX executable ../../bin/cl_demo Undefined first referenced symbol in file lucene::document::Field::Field(const wchar_t*,lucene::util::ValueArray<unsigned

Re: [CLucene-dev] Have anybody run cl_test or cl_demo on solaris sparc?

2010-12-21 Thread Veit Jahns
2010/12/21 Veit Jahns nuncupa...@googlemail.com: [...] (Sun, I guess). Scratch that! It was clearly stated in the subject. Veit

Re: [CLucene-dev] contrib: SpellChecker

2011-01-06 Thread Veit Jahns
2011/1/5 Veit Jahns nuncupa...@googlemail.com: It works on Windows (I will check this on other platforms too), but I saw that you used wstring/wstringstream, e.g., in SpellCheckerC. Thus, it works only if the ASCII mode is not enabled. With ASCII mode enabled it fails to compile. You have

Re: [CLucene-dev] contrib: SpellChecker

2011-01-07 Thread Veit Jahns
2011/1/5 Freiholz Manuel m.freih...@cadenas.de: Hi, ok I will take a look at it. Thanks! I changed your files a little bit, so that it compiles on Ubuntu. But the test spellcheckersearch fails, because the method suggestSimilar() returns no result (NULL pointer). Any idea? Veit

Re: [CLucene-dev] Bug in sorting

2011-01-22 Thread Veit Jahns
Hi Alexander, I created a test case based on the code you once sent me. I get the same result, i.e., that there seems to be a bug. I will take a closer look tomorrow. Kind regards, Veit

Re: [CLucene-dev] (Fixed) Bug in BooleanScorer2 with prohibited scorers

2011-02-07 Thread Veit Jahns
2011/2/7 Itamar Syn-Hershko ita...@code972.com: Hi Veit, good catch! That is quite a straight-forward fix, can you please merge it to master? Done. Veit

Re: [CLucene-dev] Bug in sorting

2011-02-09 Thread Veit Jahns
Hi Alexander, it seems that addIndexesNoOptimize() causes the trouble here. I can't open the index with Luke. I got a read past EOF error. But only from Luke. The CLucene-IndexReader has no problem with the index---besides that sorting doesn't work correctly. I will keep at it. Kind regards,

Re: [CLucene-dev] Bug in sorting

2011-02-16 Thread Veit Jahns
2011/2/9 Veit Jahns nuncupa...@googlemail.com: it seems that addIndexesNoOptimize() causes the trouble here. I can't open the index with Luke. I got a read past EOF error. But only from Luke. The CLucene-IndexReader has no problem with the index---besides that sorting doesn't work correctly. I

Re: [CLucene-dev] Clucene 2.3.2 vs 0.9.21b

2011-03-10 Thread Veit Jahns
Hi Shivaji! 2011/3/8 shivaji badade shivaji.bad...@gmail.com: Does anybody have the diff between these two versions in performance data, which will help me in moving to version 2.3.2? I have no diff. But I changed once from 0.9.21b to 2.3.2 (about 2 years ago). Most of the changes I had to make were in

Re: [CLucene-dev] Clucene packaging

2011-03-18 Thread Veit Jahns
Hi Ben! 2011/3/17 Ben van Klinken bvanklin...@gmail.com: Dear CLuceners It's been a very long time since I've done anything on CLucene, and I hope I'm not stepping on anyone's toes if I jump in here now. I just wanted to keep you up with a few things I've been doing lately. 1. I've done

Re: [CLucene-dev] Clucene packaging

2011-03-31 Thread Veit Jahns
Should go to the mailing list: 2011/3/30 Veit Jahns nuncupa...@googlemail.com: 2011/3/30 Ben van Klinken bvanklin...@gmail.com: Are Jens's Solaris fixes also included? It also has at least one memory leak---see my mail sent some minutes ago. the master is merged with it. where are those

Re: [CLucene-dev] CLucene's future

2011-04-05 Thread Veit Jahns
Hi Alan! 2011/4/4 Alan Wright alanwright.h...@googlemail.com: The intention behind Lucene++ has always been to closely reflect the java lucene code with a view to being able to keep up with the pace of development. As Itamar wrote, I work on this too. One question that came into my mind was,

Re: [CLucene-dev] it isn't working the functions STRCPY_AtoTand STRCPY_TtoA

2011-05-16 Thread Veit Jahns
2011/5/11 juliana nunes juliana@gmail.com: this is the code: ... line 43 STRCPY_AtoT( TQuery, buffer, strlen(buffer) ); Query *qdesc; qdesc = QueryParser::parse( TQuery, _T("oferta"), analyzer); Hits *hdesc = s.search(qdesc); HitCollector *algo = new

Re: [CLucene-dev] Indexing a document

2011-05-26 Thread Veit Jahns
Hi Emerson! 2011/5/26 Emerson Espínola emersonespin...@gmail.com void Index::NewDoc (const char *contents) Using char here means that you use CLucene in an ASCII mode. Right? { Document* doc = _CLNEW Document(); doc->add(*_CLNEW Field(_T(contents), _T(contents), Field::STORE_YES |

Re: [CLucene-dev] Compile Clucene with mingw on windows 7 Error.

2011-05-27 Thread Veit Jahns
Hi Scott! 2011/5/17 Scott Zhang macromars...@gmail.com: D:\wxwidgets\projects\DocFrog\clucene-core-2.3.3.4\mbuild>mingw32-make [  0%] Building CXX object src/shared/CMakeFiles/clucene-shared.dir/CLucene/SharedHeader.obj In file included from

Re: [CLucene-dev] Indexing a document

2011-05-27 Thread Veit Jahns
Hi Emerson! 2011/5/27 Emerson Espínola emersonespin...@gmail.com Hi Veit. Thank you very much for your answer. Great explanation. You don't wonder how much you're helping me. You are welcome! 1. I'll try english documents. 2. Ok. 3. Does BrazilianAnalyzer work similar to

Re: [CLucene-dev] how to trace memory leak in clucene

2011-06-20 Thread Veit Jahns
Hi, I don't know if this macro is still used in the current version. I think Ben or Itamar should know more about this. But you can also use other tools to trace the memory leak. Depending on your platform and compiler, you can use, for instance, the Visual Studio facilities [1] or valgrind

Re: [CLucene-dev] Counting segments

2011-08-25 Thread Veit Jahns
Hi Timo! 2011/8/17 Timo Sirainen t...@iki.fi: Is there some API call to tell me how many segment files there are currently, so I could try to guess if I should optimize the index (after a small incremental update)? I don't know it exactly, but the SegmentInfos (part of IndexWriter) may be

Re: [CLucene-dev] Compiling v.2.3.3.4 with VC6

2011-09-17 Thread Veit Jahns
Hi Celto! 2011/9/14 cel tix44 celti...@gmail.com: Team When compiled with VC6, v.2.3.3.4 throws some 300 errors. May I please ask the DEV team whether Clucene is still going to be compatible with VC6? I think, it was not intended in the first place. This release includes some parts of the

Re: [CLucene-dev] Compiling v.2.3.3.4 with VC6

2011-09-19 Thread Veit Jahns
Hi Celto! 2011/9/19 cel tix44 celti...@gmail.com: Veit Thanks very much for your response. Attached is a list of compilation errors as is -- produced after getting the source code from GIT, config'ed with Cmake 2.8 (default settings, no boost, no zlib) and built with vc6 -- essentially, for

Re: [CLucene-dev] pre-built clucene windows binaries (32 and 64 bit) for download ?

2011-09-30 Thread Veit Jahns
Hi Klaus, what are your troubles with compiling clucene? Using the Cmake tool compiling clucene is rather simple. Just execute Cmake and open and build the created VS solution file. Kind regards, Veit Jahns 2011/9/28 Klaus Gruen klausgr...@rocketmail.com: Hi all, I am not a compiling guru

Re: [CLucene-dev] Compiling v.2.3.3.4 with VC6

2011-10-14 Thread Veit Jahns
2011/10/10 Veit Jahns nuncupa...@googlemail.com: Microsoft suggests this: #ifdef _MSC_VER #if _MSC_VER == 1200 std::ostream& operator<<(std::ostream& os, __int64 i ) // borrowed from http://support.microsoft.com/kb/168440 { char buf[20]; sprintf(buf,"%I64d", i ); os << buf

Re: [CLucene-dev] Compiling v.2.3.3.4 with VC6

2011-10-28 Thread Veit Jahns
Hi Celto, I'm sorry. I was distracted by other things from your e-mail. 2011/10/17 cel tix44 celti...@gmail.com: Veit A couple of minor problems in the latest update --- 1) When generating code, CMAKE throws this error: CMake Error at src/core/CMakeLists.txt:194 (add_library):   Cannot find

Re: [CLucene-dev] Clucene on iOS?

2011-10-28 Thread Veit Jahns
Hi Albin! 2011/10/20 Albin Stigö albin.st...@gmail.com: I wonder if anyone could give me some pointers on how to build clucene as a static library for iOS? I'm not familiar with iOS and never tried to build CLucene on iOS. But I think it is possible using CMake and Clang (somehow). Have you

Re: [CLucene-dev] Indexing a document

2011-11-05 Thread Veit Jahns
Hi Emerson! 2011/11/4 Emerson Espínola emersonespin...@gmail.com I'm willing to write the BrazilianAnalyzer. What should I do? Do you have an idea about how much time is needed to write it? 1 week? 2 weeks? 1 month? If the BrazilianAnalyzer from Java Lucene fits your needs, then the only

Re: [CLucene-dev] developer volunteer

2011-11-30 Thread Veit Jahns
Hi! 2011/11/29 pradeepa kumar cdprade...@gmail.com: Hi I am a C++ developer and would like to volunteer in development tasks Could you please let me know how and where do I start Actually, there is a development task. Some months ago we started to give CLucene a new code base. It was intended

Re: [CLucene-dev] [ CLucene-developers Digest ] Help on functions that doesn't exist in CLucene

2012-02-27 Thread Veit Jahns
Hi Mena, as far as I know, every function from Lucene 2.3.2 is ported to C++. What is missing are most of the contribs. Some are there, but many are not. But they can be ported to C++. I did it once myself for a special analyzer. Wasn't so difficult. Kind regards, Veit 2012/2/24 Mena Hanna

Re: [CLucene-dev] help ! - using LanguageBasedAnalyzer/CJKTokenizer returns wrong result

2012-02-28 Thread Veit Jahns
Hi Vivien! 2012/2/27 Vivien Meng v.m...@qsr.com.au Hi Veit: Thanks for your reply. Do you know if there is anyone using CJKTokenizer? No, I don't know anyone. Sorry. Here is the code, no parsing error that I can see:

Re: [CLucene-dev] help ! - using LanguageBasedAnalyzer/CJKTokenizer returns wrong result

2012-03-11 Thread Veit Jahns
Hi Vivien, sorry, it took some time. Maybe this is an encoding issue. What encoding do your source code files have? I got the same result when the encoding is UTF-8. But CLucene works internally with UTF-16 and in this case handles the string and the query as a UTF-16 encoded string. Thus, the

Re: [CLucene-dev] searchAfter / TopDocs Pagination

2012-03-17 Thread Veit Jahns
2012/2/28 du...@identicalsoftware.com: Maybe Lucene++ [1] supports this. It is a port of a later Lucene version by Alan and Ben. We tried once to make this the new CLucene version, but haven't finished it yet. How stable is Lucene++? Does it just have some unimplemented functionality from

Re: [CLucene-dev] Does CLucene 32 bits version work on 64 bits windows?

2012-04-01 Thread Veit Jahns
Hi Ahmed, the same here. We also use CLucene on 64 bit Windows machines. Up to now, we didn't face any problems with CLucene crashing. Best regards, Veit 2012/3/21 Ahmed ci7nu...@gmail.com: Hi, Our application uses CLucene and it's quite stable on 32 bits, but some users say that the

Re: [CLucene-dev] Facing Problem in getting IndexWriter in my Metro App (Windows 8)

2012-07-15 Thread Veit Jahns
Hi Santhosh, some ideas off the top of my head: - Do you have this problem also, if you use an older version of Visual Studio? - What result do you get, if you execute the unit tests of CLucene (cl_test)? Regards, Veit 2012/7/13 santhosh k sant...@gmail.com: Hi, Currently I am using

Re: [CLucene-dev] problem building clucene-php

2012-10-25 Thread Veit Jahns
Hi Paul, the Misc.h is part of the sublibrary clucene-shared. Looking into the CMakeLists.txt of this sublibrary, I discovered that the install command for adding the headers is missing. I guess something similar to what is in the CMakeLists.txt of the core (lines 224--233) has to be added to the

Re: [CLucene-dev] Getting link error

2013-01-06 Thread Veit Jahns
Hi, just a guess: if it is an issue with the directories, shouldn't there then be more than three unresolved externals errors? To me it sounds like a problem with the export of these functions from the dll file. Rajeev, how did you create the files? Did you compile them yourself? Best regards,

Re: [CLucene-dev] search key_words in log

2013-01-06 Thread Veit Jahns
Hi! 2012/12/6 Ji Cai ji@telecom-sudparis.eu: hi all, I'm new here, i use clucene a week ago, and i worked on the demo and found that it searches document. Now i want to finish the work like this : i have a log.txt, i want to search the key words and return the sentences that include the

Re: [CLucene-dev] CLucene cannot compile with Visual Studio 2015

2016-12-26 Thread Veit Jahns
Hi Jonas, I worked on CLucene a while ago. There are two options. One is to add the missing header file where needed. The other is to add the legacy library file in the CMake file. At the moment, I don't know which is better. I have to take a look at the source code as well. Best regards Veit Am