Hi Itamar!
2009/11/10 Itamar Syn-Hershko ita...@divrei-tora.com
You might be right, I wonder why this hasn't been discovered by now? Would
you mind writing a minified test case for this and possibly also
demonstrating how your proposed fix fixes this issue? I will then have to
verify this
Dear all,
in our application we have several threads writing and searching on
the index. Writing and searching isn't a problem as long the
IndexSearcher isn't updated,i.e. deleting the old and creating a new
IndexSearcher. But if the IndexSearcher is updated, then the
application crashes. Not
Itamar,
I have a question to one of your commits done in the course of this
discussion. The commit is the commit 364c216fb93 [1]. You replaced
a TokenStream pointer with a void pointer in the thread local storage
of Analyzer::Internal. Now everything can be saved as a previous
stream, but
. Search is throwing the error and the
default field I have given is the field which is tokenized. I am
indexing the Title of the document and this title is given as default
searchable field.
Message: 2
Date: Wed, 7 Apr 2010 21:11:20 +0200
From: Veit Jahns nuncupa...@googlemail.com
Subject: Re
Hi Itamar!
2010/4/16 Itamar Syn-Hershko ita...@divrei-tora.com:
Apparently Ben has been working on the same issues on his branch. I applied
your patches along with his work, plus some more cleanup, into the
atomicthreads branch. Unfortunately, while it compiles and runs perfectly
well on
2010/6/17 Itamar Syn-Hershko ita...@divrei-tora.com:
You can send it on the mailing list, or to me privately. Either as a code
file, or (ultimately) a patch for cl_test introducing the new test case.
I saw this test (IndexModifierExceptionTest) on the git repository.
But the IndexModifier of
2010/7/1 Itamar Syn-Hershko ita...@code972.com:
On one hand, we should be following the Java behavior. If they always
call close, we should always do as well. When having a reader object on
the stack this is definitely how that should work. On the other hand, if
calling close from the
2010/7/9 Liu bbskill bbsk...@gmail.com:
I have a large index whose size is 5.8G, and I fail to use
RAMDirectory to load it into memory with a segment fault.
I dived into the code, and found the index of RAMDirectory is limited
to 2G.
I an curious about the reason, and
This message looks familiar to me. Maybe it is the same problem I once had [1].
Veit
[1] http://comments.gmane.org/gmane.comp.jakarta.lucene.clucene.devel/3358
--
The Palm PDK Hot Apps Program offers developers who use
Hi,
I observed that the index becomes corrupted (Read past EOF) after
several updates and deletes. The reason was that the deletions file
wasn't written out completely, if a bit was setted at the end of the
underlying bitset and the DGaps format was used. I made a test case
[1] as well as fix [2]
2010/8/3 Itamar Syn-Hershko ita...@code972.com:
The beauty of tests is they speak for themselves...
Is it possible to have a test showing the corruption issue you mentioned
if the BitSet patch isn't applied?
Just pick the commit with the test case. Then the BitSet test should fail.
Veit
2010/8/3 Itamar Syn-Hershko ita...@code972.com:
What I'm looking for is a test showing the index corruption scenario you
described - if it can be reproduced in a test, and then to see the
BitSet patch fixes it. Without this test we only assume the BitSet bug
caused it. While this makes sense,
2010/8/3 Itamar Syn-Hershko ita...@code972.com:
On 3/8/2010 4:51 PM, Veit Jahns wrote:
2010/8/3 Itamar Syn-Hershkoita...@code972.com:
What I'm looking for is a test showing the index corruption scenario you
described - if it can be reproduced in a test, and then to see the
BitSet patch fixes
2010/8/10 Kostka Bořivoj kos...@tovek.cz:
I added MockRAMDirectory ported by Veit into TestRAMDirectory and found
following:
1) sizeInBytes members in RAMDirectory and RAMFile aren’t correct, I
fixed this somehow, but I’m not sure if it shouldn’t be fixed different way,
could anybody
2010/8/10 Kostka Bořivoj kos...@tovek.cz:
The problem is the bigTerm (16383 B long word) added in doc isn't returned
as one token during indexing. StandardTokenizer splits them to set of tokens,
each 256 bytes long. So the term isn't skipped as too long but indexed as set
of
tokens. Then, of
2010/8/11 Kostka Bořivoj kos...@tovek.cz:
For the patch of the RAMFile desctructor I'm in doubt. Looking at the
correspondong
JLucene classes subtracting the file size is done in the deleteFile method of
RAMDirectory (see diff).
By the way, I needed some inline keywords removed in
2010/8/17 Itamar Syn-Hershko ita...@code972.com:
So I intend to merge it to master soon - thats the only move that makes
sense at this point. Any objections? any volunteers for last minute testing?
cl_test of the branch atomicthreads works fine for me on different
platforms. But when I use this
2010/8/31 Šplíchal Jiří splic...@tovek.cz:
I have just pushed SpanQuery implementation and its basic tests in to the
jiri-working branch.
It would great if you could test it so that it can be merged into the
current HEAD.
Great! The tested it on with Win7/VS2010 and the span query tests
Hi Mark,
in wildcard queries the tokens are not processed by the analyzer (see
also [1]). So in your case the searcher will look for terms beginning
the the characters 1ab. And because they original term is split up
by the StandardAnalyzer into two terms, there is no such term in the
index.
I
2010/9/5 Šplíchal Jiří splic...@tovek.cz:
Hello Veit,
yes, please feel free to push your changes to my branch.
Pushed them. But one of your recent commits [1]. Because of the
additional decRef() in the SegmentReader's destructor the assertion in
line 84 of this file fails in the
2010/9/7 Šplíchal Jiří splic...@tovek.cz:
Yes, me and Borek wanted to merge our branches so I pushed all my changes -
and forgot
not to push this one. I removed it again in my local repository but
unfortunately I am not able to push the change
to the original repository - I always got the
2010/9/7 Veit Jahns nuncupa...@googlemail.com:
2010/9/7 Šplíchal Jiří splic...@tovek.cz:
I tried to find the right position for setting the closed flag to true (in
the doClose() method?) but I did not finish
the fix yet. The simplest solution is ofcourse not to forget to call the
close
The output looks good. Another guess of mine is that something with
your repository is wrong. So I'd check [remote origin] entry the
file .git/config or to clone the clucene repository again in another
directory and try to push a commit from this repository.
Veit
2010/8/20 Veit Jahns nuncupa...@googlemail.com:
2010/8/17 Itamar Syn-Hershko ita...@code972.com:
So I intend to merge it to master soon - thats the only move that makes
sense at this point. Any objections? any volunteers for last minute testing?
cl_test of the branch atomicthreads works fine
2010/9/20 Šplíchal Jiří splic...@tovek.cz:
I tested once more our code in order to check the memory leaks.
Here is the list of leaking test suites:
[...]
Let us get rid of the leaks! I will start from the bottom
Ok. As I was examining memory leaks in CLucene, I observed, that some
leaks
2010/9/27 Kostka Bořivoj kos...@tovek.cz:
Exception CL_ERR_IO path not a directory was (sometimes) thrown, if
getDirectory is called with path to non-existing folder.
Wrong condition fixed.
Looks good to me. But I guess, Itamar will ask for a test case. ;)
By the way, fileStat will be
the crash occurs on startup before using any thing of the clucene.
Can you attach a debugger, so you can see exactly where in the code
the crash occurs?
Veit
--
Beautiful is writing same markup. Internet Explorer 9
2010/11/13 Itamar Syn-Hershko ita...@code972.com:
[...] veit_working [is] still there even though [it is] merged into master
already - owners, please verify and remove them.
Done.
Veit
--
Centralized Desktop Delivery:
Hi Mohammad,
it is an encoding issue. I attached a patch to avoid this. Can you
check, if this patch works for you? If so, I'd commit it to the master
branch (or a fix branch, if needed).
Kind regards,
Veit
2010/11/13 muhammad ismael m.ismae...@gmail.com:
Hi,
I want to use highlighter but
Just an idea regarding:
make cl_demo gives this linking error.
Linking CXX executable ../../bin/cl_demo
Undefined first referenced
symbol in file
lucene::document::Field::Field(const
wchar_t*,lucene::util::ValueArrayunsigned
2010/12/21 Veit Jahns nuncupa...@googlemail.com:
[...] (Sun, I guess).
Scratch that! It was clearly stated in the subject.
Veit
--
Lotusphere 2011
Register now for Lotusphere 2011 and learn how
to connect the dots
2011/1/5 Veit Jahns nuncupa...@googlemail.com:
It work on Windows (I will check this also on other platforms too),
but I saw that you used wstring/wstringstream, e.g., in SpellCheckerC.
Thus, it works only, if the ASCII mode is not enabled. With ASCII mode
enabled it fails to compile. You have
2011/1/5 Freiholz Manuel m.freih...@cadenas.de:
Hi,
ok i wil take a look at it.
Thanks! I changed your files a little bit, so that it compiles on
Ubuntu. But the test spellcheckersearch fails, because the the method
suggestSimilar() returns no result (NULL pointer). Any idea?
Veit
Hi Alexander,
I created a test cased based on the code you once sent me. I get the
same result, i.e., that there seems to be a bug. I will take a closer
look tomorrow.
Kind regards,
Veit
--
Special Offer-- Download
2011/2/7 Itamar Syn-Hershko ita...@code972.com:
Hi Veit, good catch!
That is quite a straight-forward fix, can you please merge it to master?
Done.
Veit
--
The modern datacenter depends on network connectivity to
Hi Alexander,
it seems that addIndexesNoOptimize() cause the trouble here. I can't
open the index with Luke. I got a read past EOF error. But only from
Luke. The CLucene-IndexReader has no problem with the index---besides
that sorting doesn't work correctly. I will keep at it.
Kind regards,
2011/2/9 Veit Jahns nuncupa...@googlemail.com:
it seems that addIndexesNoOptimize() cause the trouble here. I can't
open the index with Luke. I got a read past EOF error. But only from
Luke. The CLucene-IndexReader has no problem with the index---besides
that sorting doesn't work correctly. I
Hi Shivaji!
2011/3/8 shivaji badade shivaji.bad...@gmail.com:
Do anybody has the diff in these two version on performance data, which will
help in moving me to 2.3.2 version.
I have no diff. But I changed once from 0.9.21b to 2.3.2 (about 2
years ago). The most changes I had to made where in
Hi Ben!
2011/3/17 Ben van Klinken bvanklin...@gmail.com:
Dear CLuceners
It's been a very long time since I've done anything on CLucene, and I
hope I'm not stepping on anyone's toes if I jump in here now. I just
wanted to keep you up with a few things I've been doing lately.
1. I've done
Should go to the mailing list:
2011/3/30 Veit Jahns nuncupa...@googlemail.com:
2011/3/30 Ben van Klinken bvanklin...@gmail.com:
Are Jens's Solaris fixes also included? It has at also least one
memory leak---see my mail sent some minutes ago.
the master is merged with it. where are those
Hi Allan!
2011/4/4 Alan Wright alanwright.h...@googlemail.com:
The intention behind Lucene++ has always been to closely reflect the java
lucene code with a view to being able to keep up with the pace of development.
As Itamar wrote, I work on this too. One question that came into my
mind was,
2011/5/11 juliana nunes juliana@gmail.com:
this is the code:
...
line 43 STRCPY_AtoT( TQuery, buffer, strlen(buffer) );
Query *qdesc;
qdesc = QueryParser::parse( TQuery, _T(oferta), analyzer);
Hits *hdesc = s.search(qdesc);
HitCollector *algo = new
Hi Emerson!
2011/5/26 Emerson Espínola emersonespin...@gmail.com
void Index::NewDoc (const char *contents)
Using char here means that you use CLucene in an ASCII mode. Right?
{
Document* doc = _CLNEW Document();
doc-add(*_CLNEW Field(_T(contents), _T(contents), Field::STORE_YES |
Hi Scott!
2011/5/17 Scott Zhang macromars...@gmail.com:
D:\wxwidgets\projects\DocFrog\
clucene-core-2.3.3.4\mbuildmingw32-make
[ 0%] Building CXX object
src/shared/CMakeFiles/clucene-shared.dir/CLucene/Shar
edHeader.obj
In file included from
Hi Emerson!
2011/5/27 Emerson Espínola emersonespin...@gmail.com
Hi Veit.
Thank yo very much for your answer. Great explanation. You don't wonder how
much you're helping me.
You are welcome!
1. I'll try english documents.
2. Ok.
3. Does BrazilianAnalyzer work similar to
Hi,
I don't know, if this macro is still used in the current version. I
think Ben or Itamar should know more about this. But you can also use
other tools to trace the memory leak. Depending on your plattform and
compiler, you can use, for instance, the Visual Studio facilities [1]
or valgrind
Hi Timo!
2011/8/17 Timo Sirainen t...@iki.fi:
Is there some API call to tell me how many segment files there are
currently, so I could try to guess if I should optimize the index (after
a small incremental update)?
I don't know it excatly, but the SegmentInfos (part of IndexWriter) may
be
Hi Celto!
2011/9/14 cel tix44 celti...@gmail.com:
Team
When compiled with VC6, v.2.3.3.4 throws some 300 errors.
May I please ask the DEV team whether Clucene is still going to be
compatible with VC6?
I think, it was not intended in the first place. This release includes
some parts of the
Hi Celto!
2011/9/19 cel tix44 celti...@gmail.com:
Veit
Thanks very much for your response.
Attached is a list of compilation errors as is -- produced after getting
the source code from GIT, config'ed with Cmake 2.8 (default settings, no
boost, no zlib) and built with vc6 -- essentially, for
Hi Klaus,
what are your troubles with compiling clucene? Using the Cmake tool
compiling clucene is rather simple. Just execute Cmake and open and
build the created VS solution file.
Kind regards,
Veit Jahns
2011/9/28 Klaus Gruen klausgr...@rocketmail.com:
Hi all,
I am not a compiling guru
2011/10/10 Veit Jahns nuncupa...@googlemail.com:
Microsoft suggests this:
#ifdef _MSC_VER
#if _MSC_VER == 1200
std::ostream operator(std::ostream os, __int64 i ) // borrowed from
http://support.microsoft.com/kb/168440
{
char buf[20];
sprintf(buf,%I64d, i );
os buf
Hi Celto,
I'm sorry. I was distracted by other things from your e-mail.
2011/10/17 cel tix44 celti...@gmail.com:
Veit
A couple of minor problems in the latest update ---
1) When generating code, CMAKE throws this error:
CMake Error at src/core/CMakeLists.txt:194 (add_library):
Cannot find
Hi Albin!
2011/10/20 Albin Stigö albin.st...@gmail.com:
I wonder if anyone could give me some pointers on how to build clucene
as a static library for iOS?
I'm not familiar with iOS and never tried to build CLucene on iOS. But
think it is possible by using CMake and CLang (somehow). Have you
Hi Emerson!
2011/11/4 Emerson Espínola emersonespin...@gmail.com
I'm willing to write the BrazillianAnalyzer. What should I do? Do you have
an idea about how much time is needed to write it? 1 week? 2 weeks? 1 month?
If the BrazillianAnalyzer from Java Lucene fits your needs, then the only
Hi!
2011/11/29 pradeepa kumar cdprade...@gmail.com:
Hi
I am a C++ developer and would like to volunteer in development tasks
Could you pease let me know how and where do I start
Actually, there is a development task. Some months ago we started to
give CLucene a new code base. It was intended
Hi Mena,
as far as I know, every function from Lucene 2.3.2 is ported to C++.
What is missing are most of the contribs. Some are there, but many are
not. But they can ported to C++. I did it once myself for the an
special analyzer. Wasn't so difficult.
Kind regards,
Veit
2012/2/24 Mena Hanna
Hi Vivien!
2012/2/27 Vivien Meng v.m...@qsr.com.au
Hi Veit:
** **
** **
Thanks for your reply.
** **
Do you know if there is anyone using CJKTokenizer?
No, I don't know anyone. Sorry.
** **
Here is the code, no parsing error that I can see:
** **
Hi Vivien,
sorry, it took some time. Maybe this is an encoding issue. What
encoding do your source code files have? I got the same result, when
the encoding is UTF-8. But CLucene works internally with UTF-16 and in
this case handles the string and the query as an UTF-16 encoded
string. Thus, the
2012/2/28 du...@identicalsoftware.com:
Maybe Lucene++ [1] supports this. It is a port of a later Lucene version
by
Alan and Ben. We tried once to make this the new CLucene version, but
havn't finished it yet.
How stable is Lucene++? Does it just have some unimplemented functionality
from
Hi Ahmed,
the same here. We also use CLucene on 64 bit Windows machines. Up to
now, we didn't face any problems with CLucene crashing.
Best regards,
Veit
2012/3/21 Ahmed ci7nu...@gmail.com:
Hi,
Our application use CLucene and it's quiet stable on 32 bits, but some users
say that the
Hi Santhosh,
some ideas from the top of my head:
- Do you have this problem also, if you use an older version of Visual Studio?
- What result do you get, if you execute the unit tests of CLucene (cl_test)?
Regards,
Veit
2012/7/13 santhosh k sant...@gmail.com:
Hi,
Currently I am using
Hi Paul,
the Misc.h is part of the sublibrary clucene-shared. Looking into the
CMakeLists.txt of this sublibrary, I discovered that the install
command for adding the headers ist missing. I guess, somehing similar
as in the CMakeLists.txt of the core (lines 224--233) has to be added
to the
Hi,
just a guess: if it is a issue with the directories, should it be then
not more than three unresolved externals errors? To me sounds like a
problem with the export of these function from the dll file.
Rajeev, how die you created the files? Did you compile them by yourself?
Best regards,
Hi!
2012/12/6 Ji Cai ji@telecom-sudparis.eu:
hi all,
I'm new here, i use clucene a week ago, and i worked on the demo and found
that it searches document.
Now i want to finish the work like this : i have a log.txt, i want to search
the key words and return the sentences that include the
Hi Jonas,
I worked on CLucene a while ago. There are two options, one is to add the
missing header file where needed. The other, to add the legacy library file
in CMake file. At the moment, I don't know what is better. I have to take a
look at the source code as well.
Best regards
Veit
Am
65 matches
Mail list logo