[ANNOUNCE] Apache Lucene 4.6 released

2013-11-24 Thread Simon Willnauer
October 2013, Apache Lucene™ 4.6 available The Lucene PMC is pleased to announce the release of Apache Lucene 4.6 Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text

[ANNOUNCE] Apache Solr 4.6 released.

2013-11-24 Thread Simon Willnauer
24 November 2013, Apache Solr™ 4.6 available The Lucene PMC is pleased to announce the release of Apache Solr 4.6 Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted

[ANNOUNCE] Apache Solr 4.3 released

2013-05-06 Thread Simon Willnauer
May 2013, Apache Solr™ 4.3 available The Lucene PMC is pleased to announce the release of Apache Solr 4.3. Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search,

[ANNOUNCE] Apache Lucene 4.3 released

2013-05-06 Thread Simon Willnauer
May 2013, Apache Lucene™ 4.3 available The Lucene PMC is pleased to announce the release of Apache Lucene 4.3 Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java. It is a technology suitable for nearly any application that requires full-text

Board report June 2012

2012-06-11 Thread Simon Willnauer
Hey folks, I added a board report draft for June 2012: https://svn.apache.org/repos/asf/lucene/board-reports/2012/board-report-june.txt please gimme feedback or change as you like until tomorrow night. The report is due on Wednesday. simon

Re: Lucene Greek Analyzer

2012-05-31 Thread Simon Willnauer
in your lucene distribution you downloaded to obtain the jar file you should find another jar called lucene-analysis in the contrib section . put that in your classpatch and you should be ready to go. simon On Thu, May 31, 2012 at 5:10 PM, ddweb kara...@siteland.gr wrote: And how can i do this?

Re: Lucene Greek Analyzer

2012-05-21 Thread Simon Willnauer
you need to pull in the analysis jar too to get access to the greek analyzer. simon On Tue, May 15, 2012 at 11:52 AM, ddweb kara...@siteland.gr wrote: Hello i use the open source media platform subsonic, that include the lucene for the search. The problem is that i have problem with the greek

Re: are Lucene indexing and search components thread-safe?

2012-05-01 Thread Simon Willnauer
short answer is: yes! common search and indexing classes are threadsafe! IndexWriter IndexReader (IndexSearcher) can be shared between threads. Any other particular class you are in doubt with? simon On Mon, Apr 30, 2012 at 8:25 PM, Ilya Zavorin izavo...@caci.com wrote: Hi All, I am looking

[REPORT] Revised Report Apache Lucene March 2012

2012-03-22 Thread Simon Willnauer
=== Lucene Status Report: March 2012 === TLP We have added several new PMC members: * Martijn van Groningen * Dawid Weiss * Jan Jøydahl Trademarks: We have not made progress on trademarks since the last board report, but do intend to finish the necessary pieces.    * Project Naming And

Call for Submission Berlin Buzzwords 2012all for Submission Berlin Buzzwords - http://berlinbuzzwords.de

2012-01-11 Thread Simon Willnauer
Chairs:  *  Isabel Drost (Nokia Apache Mahout)  *  Jan Lehnardt (CouchBase Apache CouchDB)  *  Simon Willnauer (SearchWorkings Apache Lucene)  *  Grant Ingersoll (Lucid Imagination Apache Lucene)  *  Owen O’Malley (Yahoo Inc. Apache Hadoop)  *  Jim Webber (Neo Technology Neo4j)  *  Sean Treadway

[Report] Apache Lucene

2011-12-13 Thread Simon Willnauer
=== Lucene Status Report: December 2011 === TLP Trademarks: We have not made progress on trademarks since the last board report, but do intend to finish the necessary pieces. * Project Naming And Descriptions : We believe this is complete, but are still reviewing. * Website

JVM Bugs affecting Lucene Solr

2011-11-15 Thread Simon Willnauer
hey folks, we lately looked into https://issues.apache.org/jira/browse/LUCENE-3235 again, an issue where a class using ConcurrentHashMap hangs / deadlocks on specific JVMs in combination with specific CPUs. It turns out its a JVM bug in Sun / Oracle Java 1.5 as well as Java 1.6. Its apparently

Re: no result

2011-10-13 Thread Simon Willnauer
hey, see my comments inline... On Thu, Oct 13, 2011 at 1:03 PM, xdzgor1 p...@alpha-solutions.dk wrote: Hi, can someone please help me with the following simple example. Why does my search not result in a hit? Analyzer indexAnalyzer = new StandardAnalyzer(Version.LUCENE_33); IndexWriterConfig

Re: Lucene search -- Concurrent users issue.. If more than 300 users are using at a time search results are getting delayed

2011-10-12 Thread Simon Willnauer
hey, concurrency should not be an issue for lucene. Can you provide some more information on what you are doing? 1 to 2 seconds sounds very very long for a single user. Are you doing something like opening the indexreader for each request? I'd also be interested in what kind of queries you are

Re: Lucene search -- Concurrent users issue.. If more than 300 users are using at a time search results are getting delayed

2011-10-12 Thread Simon Willnauer
alright :) This is going to kill your app :) Directory dir = FSDirectory.open(new File(indexDir)); IndexSearcher is = new IndexSearcher(dir); you should try to keep this searcher open as long as possible and reuse it. Its threadsafe so you can simply pass it to all your searches. Once you

Re: Board Report Apache Lucene Sept. 2011

2011-09-11 Thread Simon Willnauer
I am going to send / submit this tomorrow. simon On Fri, Sep 9, 2011 at 10:49 AM, Simon Willnauer simon.willna...@googlemail.com wrote: done! On Fri, Sep 9, 2011 at 10:45 AM, Simon Willnauer simon.willna...@googlemail.com wrote: yeah good points I will add something... simon On Thu, Sep

Re: Board Report Apache Lucene Sept. 2011

2011-09-09 Thread Simon Willnauer
yeah good points I will add something... simon On Thu, Sep 8, 2011 at 8:34 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : FYI, I plan to submit the board report for September [1] by Sept. 10th : if nobody objects. two things we might wnat to consider adding: * something in

Re: Board Report Apache Lucene Sept. 2011

2011-09-09 Thread Simon Willnauer
done! On Fri, Sep 9, 2011 at 10:45 AM, Simon Willnauer simon.willna...@googlemail.com wrote: yeah good points I will add something... simon On Thu, Sep 8, 2011 at 8:34 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : FYI, I plan to submit the board report for September [1] by Sept

Re: Board Report Sept 2011

2011-09-01 Thread Simon Willnauer
thanks andi! Simon On Thu, Sep 1, 2011 at 8:11 PM, Andi Vajda va...@apache.org wrote: On Thu, 1 Sep 2011, Simon Willnauer wrote: hey folks, I just committed a board report draft to our svn. There is still a todo open for PyLucene since I am not on top of the development down there. Andy

Re: java.lang.RuntimeException: after flush

2011-08-12 Thread Simon Willnauer
so here are a couple of things that I wonder about: 1. are you using NFS or any other network file system? 2. is it possible to enable the infoStream on the IW and get the output? 3. is it possible that the lucene index is deleted somehow by a second process while we are flushing? simon On Fri,

Re: AlreadyClosedException after few hours of normal work

2011-07-20 Thread Simon Willnauer
Hey, are you guys calling IW#rollback() by any chance? the rollback will close you IW and you have to reopen it on the index. If so in which case are you calling rollback? simon On Wed, Jul 20, 2011 at 6:03 PM, drazen.nis d.niko...@youngculture.com wrote: Hello, We are using single instance

Re: Indexing with Lucene

2011-07-20 Thread Simon Willnauer
On Wed, Jul 20, 2011 at 3:17 PM, raphael812 or...@eecs.qmul.ac.uk wrote: Hello everyone, I am quite new to lucene and i am using the book lucene in action to learn. I need help in extracting the body content of a html page using tika. The implementation from the book only extracts the html's

Re: Where is import org.apache.lucene.analysis.ar package?

2011-07-15 Thread Simon Willnauer
On Fri, Jul 15, 2011 at 7:38 AM, deniz denizdurmu...@gmail.com wrote: Hi all, I would like to implement a plugin and as there is lack of documentation ( well there are some but no example code ) i stared checking the source code... and while checking, i have seen a class importing

Re: June Board Report Due

2011-06-10 Thread Simon Willnauer
On Fri, Jun 10, 2011 at 4:50 AM, Grant Ingersoll gsing...@apache.org wrote: OK, I intend to file the report in SVN tomorrow morning. +1 On Jun 6, 2011, at 5:01 PM, Andi Vajda wrote: On Mon, 6 Jun 2011, Grant Ingersoll wrote: Please fill out

Re: Directory Encryption Enabled - while searching index we get Exception in thread main java.io.IOException: read past EOF

2011-05-20 Thread Simon Willnauer
Hey Arvind, can you give us a little more details about the encryption agent you are using so I can try to reproduce this problem. CAn you also provide some info which lucene version you are using? simon On Fri, May 20, 2011 at 1:19 AM, arvindjce arvind...@gmail.com wrote: We have the

Re: Special Board Report for May 2011

2011-05-16 Thread Simon Willnauer
On Sun, May 15, 2011 at 8:22 PM, Grant Ingersoll gsing...@apache.org wrote: Lucener's, Thank you to everyone who has participated in this thread and the many others during this time.  I know it's been heated at times as well as painful.  I also know this is a pretty big and broad community

Re: Special Board Report for May 2011

2011-05-07 Thread Simon Willnauer
On Sat, May 7, 2011 at 9:52 AM, Greg Stein gst...@gmail.com wrote: I've seen several people note that IRC is not logged. Fine. LOG IT. I see absolutely no reason for you guys not to set up logging for the channel that you use. We do this for Subversion development:  

33 Days left to Berlin Buzzwords 2011

2011-05-04 Thread Simon Willnauer
hey folks, BerlinBuzzwords 2011 is close only 33 days left until the big Search, Store and Scale opensource crowd is gathering in Berlin on June 6th/7th. The conference again focuses on the topics search, data analysis and NoSQL. It is to take place on June 6/7th 2011 in Berlin. We are looking

Re: [VOTE] Create Solr TLP

2011-04-27 Thread Simon Willnauer
On Wed, Apr 27, 2011 at 3:34 AM, Grant Ingersoll gsing...@apache.org wrote: On Apr 26, 2011, at 8:43 PM, Robert Muir wrote: But, this is important context to the discussion. Unfortunately, to the uninformed community, asf board, members etc, my behavior likely appears very irrational: but

Re: [VOTE] Create Solr TLP

2011-04-27 Thread Simon Willnauer
On Wed, Apr 27, 2011 at 10:08 AM, Doron Cohen cdor...@gmail.com wrote: -1 for unmerge. Another thing to keep in mind is that Solr can be considered not just an application that uses Lucene, but rather the *ideal* application layer for Lucene. Solr is basically an awesome example of what

Berlin Buzzwords - conference schedule released

2011-04-12 Thread Simon Willnauer
gives a presentation on how to integrate Solr with J2EE applications. The second day features presentations by Jonathan Gray on Facebook's use of HBase in their Messaging architecture, Dawid Weiss, Simon Willnauer and Uwe Schindler are showing the latest Apache Lucene developments, Mark Miller

Re: regarding apache lucene search engine

2011-03-25 Thread Simon Willnauer
Maybe you can use WhitespaceAnalyzer to just split on whitespace. That analyzer at least does not lowercase you should maybe tell us more about what you wanna do in terms of analysis to give you good advice. asking on java-u...@lucene.apache.org might also yield better results this

Re: Field.Store Field.Index

2011-02-27 Thread Simon Willnauer
On Sun, Feb 27, 2011 at 11:02 AM, aneuryzma patrick.divia...@gmail.com wrote: It is not clear to me how these parameters of org.apache.lucene.document.Field class  exactly work: Field.Store.YES / NO If I set them to NO, then I don't save the content itself but only metadata about it ? YES

Re: Digester / Lucene: How to make the XML parser more filexible ?

2011-02-25 Thread Simon Willnauer
maybe you should ask on the Digester list for advice simon On Fri, Feb 25, 2011 at 9:54 AM, aneuryzma patrick.divia...@gmail.com wrote: I'm using Apache Commons Digester together with Lucene to index a xml collection. Some fields in the colleciton are not always available. See example:

Re: Welcome Dawid Weiss and Stanislaw Osinski as Lucene/Solr committers

2011-02-08 Thread Simon Willnauer
Welcome! ;) Simon On Tue, Feb 8, 2011 at 8:06 PM, Steven A Rowe sar...@syr.edu wrote: Welcome Stanisław and Dawid! -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Tuesday, February 08, 2011 1:13 PM To: general@lucene.apache.org; d...@lucene.apache.org Subject:

Re: Too many open Files

2011-02-02 Thread Simon Willnauer
On Tue, Feb 1, 2011 at 1:10 PM, Fernando ferna...@digitaldoc.com.br wrote: Hi, i upgrade my app, from lucene 2.4.1 to 2.9.1, Now I am having serious problems on my linux clients, index of the files not being released. I have been see the FAQ, but im closing de Writers, and Searchers,

Re: [VOTE] Close down the Lucene.NET project, effective January 31, 2011

2011-01-02 Thread Simon Willnauer
On Fri, Dec 31, 2010 at 4:43 PM, Grant Ingersoll gsing...@apache.org wrote: On Dec 30, 2010, at 5:48 PM, Chris Hostetter wrote: : My vote is -1 to the current resolution, because it lacks a plan for : continuity for the userbase. It seems that although development has : stagnated, there is

Re: [VOTE] Close down the Lucene.NET project, effective January 31, 2011

2010-12-29 Thread Simon Willnauer
+1 On Wed, Dec 29, 2010 at 5:35 PM, Mark Miller markrmil...@gmail.com wrote: On Dec 29, 2010, at 11:03 AM, Grant Ingersoll wrote: On Dec 29, 2010, at 11:01 AM, Grant Ingersoll wrote: Resolution: Whereas the Lucene.NET project has stagnated due to lack of committer activity and, Whereas

Re: Board Report Draft

2010-12-01 Thread Simon Willnauer
On Wed, Dec 1, 2010 at 9:09 PM, Grant Ingersoll gsing...@apache.org wrote: Is in https://svn.apache.org/repos/asf/lucene/board-reports/2010/board-report-dec.txt Comments and additions welcome.  In particular, I need reports from PyLucene and Lucene.NET. -Grant Grant, maybe I miss

Re: Welcome Adriano Crestani as Lucene/Solr committer!

2010-11-18 Thread Simon Willnauer
On Thu, Nov 18, 2010 at 9:30 AM, Michael Busch busch...@gmail.com wrote:  The PMC has accepted Adriano Crestani as Lucene/Solr committer! Welcome on board, Adriano!! Welcome Adriano! simon  Michael PS:  Adriano, it's tradition to introduce yourself here as a new committer! :)

Re: Welcome Steven Rowe as Lucene/Solr committer!

2010-09-22 Thread Simon Willnauer
Welcome Steven! On Wed, Sep 22, 2010 at 3:28 PM, Mark Miller markrmil...@gmail.com wrote: On 9/22/10 9:19 AM, Robert Muir wrote: I'm pleased to announce that the PMC has accepted Steven Rowe as Lucene/Solr committer! Welcome Steven! Welcome Steven! - Mark

Re: Adding more mailing list moderators?

2010-08-24 Thread Simon Willnauer
Thanks Chris! simon On Tue, Aug 24, 2010 at 12:04 AM, Chris Hostetter hossman_luc...@fucit.org wrote: Ok, so it sounds like Grant, Uwe, Simon, Yonik (via IRC) and myself are all willing to be added as moderators for all of these lists. (some folks are already moderators for some lists) I

Re: Adding more mailing list moderators?

2010-08-21 Thread Simon Willnauer
No problem hoss - I will too. simon On 8/19/10, Uwe Schindler u...@thetaphi.de wrote: I could also help its not much traffic to moderate! But my priority will be on migrating Hudson to the FreeBSD VM. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail:

Re: [PMC] [DISCUSS] Lucy

2010-06-13 Thread Simon Willnauer
On Sun, Jun 13, 2010 at 11:33 AM, Michael McCandless luc...@mikemccandless.com wrote: Technically, it's clear that Lucy is taking an innovative and well-thought-out approach, building a search engine that folds in what's been learned from all the painful experiences of those before it.  Marvin

Re: [VOTE] Apache Lucene Java 2.9.3 and 3.0.2 artifacts to be released

2010-06-11 Thread Simon Willnauer
On Fri, Jun 11, 2010 at 7:22 PM, Yonik Seeley yo...@lucidimagination.com wrote: On Fri, Jun 11, 2010 at 1:04 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : See 3.0.2: http://s.apache.org/6kf : vs. 3.0.1: http://s.apache.org/t5 Ugh... ok, well i guess the precident has allready been

Re: Board Report Due by June 13th

2010-06-10 Thread Simon Willnauer
On Thu, Jun 10, 2010 at 11:37 PM, Andi Vajda va...@apache.org wrote: On Thu, 10 Jun 2010, Simon Willnauer wrote: I can not commit to this part of the repo but here is a diff I just updated the PyLucene section and took the opportunity to paste in your changes as well (rev 953471). Thanks

Re: Board Report Due by June 13th

2010-06-10 Thread Simon Willnauer
is in progress. The community is working +towards a 2.9.3 and 3.0.2 release. Added Simon Willnauer as a committer On Thu, Jun 10, 2010 at 9:45 PM, Grant Ingersoll gsing...@apache.org wrote: Checked into   https://svn.apache.org/repos/asf/lucene/board-reports/2010/board-report-june.txt Need

Re: Welcome Uwe Schindler to the Lucene PMC

2010-04-01 Thread Simon Willnauer
Congrats Uwe!! :) On Thu, Apr 1, 2010 at 3:17 PM, Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov wrote: Welcome, Uwe! Cheers, Chris On 4/1/10 4:05 AM, Grant Ignersoll gsing...@apache.org wrote: I'm pleased to announce that the Lucene PMC has voted to add Uwe Schindler to the

Re: Will your project feature at ApacheCon North America 2010?

2010-03-16 Thread Simon Willnauer
On Tue, Mar 16, 2010 at 2:03 PM, Grant Ingersoll gsing...@apache.org wrote: Begin forwarded message: From: Nóirín Shirley for the ApacheCon 2010 Planning Team planners-2010...@apachecon.com Date: March 15, 2010 7:22:14 PM EDT To: p...@apache.org, gene...@incubator.apache.org Subject:

Re: [VOTE] merge lucene/solr development (take 3)

2010-03-12 Thread Simon Willnauer
On Fri, Mar 12, 2010 at 1:00 PM, Grant Ingersoll gsing...@apache.org wrote: On Mar 12, 2010, at 1:39 AM, Simon Willnauer wrote: On Fri, Mar 12, 2010 at 5:39 AM, patrick o'leary pj...@pjaol.com wrote: Hows that? Which vote has been passed? 1,2 or 3? Considering how much has been discussed

Re: [VOTE] merge lucene/solr development (take 3)

2010-03-11 Thread Simon Willnauer
On Fri, Mar 12, 2010 at 5:39 AM, patrick o'leary pj...@pjaol.com wrote: Hows that? Which vote has been passed? 1,2 or 3? Considering how much has been discussed / altered in email threads, what's actually been voted upon? The proposition is definitely unclear, and needs full fleshing out

Re: Problem when the synonym has 2 words

2010-03-11 Thread Simon Willnauer
On Fri, Mar 12, 2010 at 3:51 AM, Robert Muir rcm...@gmail.com wrote: On Thu, Mar 11, 2010 at 7:36 PM, Claudio R user...@yahoo.com.br wrote: Hi Robert. The Solr's SynonymFilter recommends expanding the synonym at indexing time. But it will make my index grow much. I can do a replace in the

Re: [VOTE] merge lucene/solr development (take 3)

2010-03-10 Thread Simon Willnauer
On Tue, Mar 9, 2010 at 11:54 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : from Solr into Lucene don't need to be in there. All of a sudden I'm agreeing : with Hoss about goals rather than actual steps ;) Because those points are not : important to this vote at all - they are more

Re: [VOTE] merge lucene/solr development

2010-03-04 Thread Simon Willnauer
On Thu, Mar 4, 2010 at 2:40 PM, Michael McCandless luc...@mikemccandless.com wrote: On Thu, Mar 4, 2010 at 5:51 AM, Uwe Schindler u...@thetaphi.de wrote: - And last but not least the whole merge should be done *after* the   current code bases are again closer to each other, especially Flex

Re: Factor out a standalone, shared analysis package for Nutch/Solr/Lucene?

2010-03-01 Thread Simon Willnauer
IMO the only downside is that we risk a longer release cycle if we merge. I requires a certain level of discipline but has this been the case since ever?! Anything else seems to be a win to both communities and I personally would love to see the communities coming closer again. I was working on

Re: Factor out a standalone, shared analysis package for Nutch/Solr/Lucene?

2010-02-26 Thread Simon Willnauer
+1 So many people ask me when Solr will have all the lucene features and how quickly solr keeps up. If we can make it somehow I think it would be a huge improvement. Except of mark millers resume :) simon On Fri, Feb 26, 2010 at 10:11 PM, Robert Muir rcm...@gmail.com wrote: +1 On Fri, Feb

Re: Factor out a standalone, shared analysis package for Nutch/Solr/Lucene?

2010-02-24 Thread Simon Willnauer
Mike, thanks for moving out of the JIRA issue. For completeness I just add the link to the issue where this thread started though. -- https://issues.apache.org/jira/browse/LUCENE-2279 I also think we need a solution for this problem but it does not seem to be that easy. Would moving the analysis

Re: Factor out a standalone, shared analysis package for Nutch/Solr/Lucene?

2010-02-24 Thread Simon Willnauer
On Wed, Feb 24, 2010 at 8:09 PM, Doug Cutting cutt...@apache.org wrote: Michael McCandless wrote: I think, in order to stop duplicating our analysis code across Nutch/Solr/Lucene, we should separate out the analyzers into a standalone package, and maybe as its own sub-project under the Lucene

Re: [VOTE] Lucene Java 2.9.2 and 3.0.1 release artifacts

2010-02-18 Thread Simon Willnauer
+1 from here I put the 3.0.1 into several apps and everything seems to run smoothly for the last couple of days. All tests pass simon On Thu, Feb 18, 2010 at 12:13 AM, Ted Dunning ted.dunn...@gmail.com wrote: +0.  I only have time to read the release documents.  Uwe's apologies were

Re: A Question About Lucene

2010-02-05 Thread Simon Willnauer
On Fri, Feb 5, 2010 at 3:05 AM, Grant Ingersoll gsing...@apache.org wrote: On Feb 4, 2010, at 7:15 PM, Mohammed Aziz Parande wrote: Hi, I am a graduate student in the Department of Information Systems at the University of Maryland, Baltimore County (UMBC). I am doing research in the area

Re: Lucene PMC += Mark Miller

2010-01-14 Thread Simon Willnauer
Congrats, Mark!! On Thu, Jan 14, 2010 at 4:37 PM, Grant Ingersoll gsing...@apache.org wrote: I'm pleased to announce the Lucene PMC has elected to add Mark Miller to its ranks in recognition of his longstanding contributions to the Lucene community as a committer on both Lucene Java and

Re: [spatial] Cartesian Tiers nomenclature

2009-12-28 Thread Simon Willnauer
I would extremely prefer a common well know name instead of Cartensian tiers. While the API is still in flux changing the name is not that much of a deal either. Either grid or tiles is fine for me though while I would prefer the most common of the two - grid seems to be the better choice though.

Re: Combine WildcardQuerys

2009-12-22 Thread Simon Willnauer
/3_0_0/api/core/org/apache/lucene/search/MultiTermQuery.html and friends for details. Simon On Tue, Dec 22, 2009 at 2:22 PM, Simon Willnauer simon.willna...@googlemail.com wrote: Hi Claudio, your query setup is fine for what you trying to do but as you are using wildcards lucene internally

Lucene Logo Vectorgraphic

2009-12-04 Thread Simon Willnauer
Hey folks, has anybody a vectorized lucene logo or does something like that exists somewhere in public? Thanks in advance, simon

Re: Lucene Logo Vectorgraphic

2009-12-04 Thread Simon Willnauer
Thanks for the pointers! simon On Fri, Dec 4, 2009 at 7:01 PM, Doug Cutting cutt...@apache.org wrote: Marvin Humphrey wrote: http://svn.apache.org/repos/asf/lucene/site/publish/images/logo.eps Or, more simply: http://lucene.apache.org/images/logo.eps Doug

Re: heavy committing

2009-11-10 Thread Simon Willnauer
Isabel, you rock! :) On Tue, Nov 10, 2009 at 1:58 PM, Isabel Drost isa...@apache.org wrote: wget http://www.heise.de/open/artikel/ApacheCon-2009-Lucene-Hadoop-und-mehr-852929.html grep 'hea' ApacheCon-2009-Lucene-Hadoop-und-mehr-852929.html Have fun, Isabel

Lucene Mahout @ openexpo (Germany)

2009-11-10 Thread Simon Willnauer
Hi all, I just wanna give you a heads-up for an upcoming event. Apache Lucene and Apache Mahout will be present at the openexpo (free open source conference) in Karlsruhe (Nov 15./16. 2009). http://it-republik.de/konferenzen/opensourceexpo09/ You can meet us at the Lucene /. Mahout booth, would

Re: Progress bar for indexing process.?

2009-11-04 Thread Simon Willnauer
. Thanks Simon Willnauer wrote: There is no API which supports what you are asking for but building yourself should not be a problem at all. simon On Tue, Nov 3, 2009 at 3:10 PM,  diego.cassin...@mercadolibre.com wrote: Why not?  Indexing is very fast. --Original Message

Re: how to Index only newly added documents?

2009-11-04 Thread Simon Willnauer
The common approach is to use a UUID field in the index and run an updateDocument with a delete term holding the UUID for a document. That way only the latest added document for a UUID is gonna end up in the index. simon On Wed, Nov 4, 2009 at 6:41 AM, tarunsapra t.sapr...@gmail.com wrote:

Re: Progress bar for indexing process.?

2009-11-03 Thread Simon Willnauer
There is no API which supports what you are asking for but building yourself should not be a problem at all. simon On Tue, Nov 3, 2009 at 3:10 PM, diego.cassin...@mercadolibre.com wrote: Why not?  Indexing is very fast. --Original Message-- From: tarunsapra To:

Re: [VOTE] Graduate Lucene.Net as a subproject under Apache Lucene

2009-10-10 Thread Simon Willnauer
+1 On Sat, Oct 10, 2009 at 7:03 AM, Otis Gospodnetic otis_gospodne...@yahoo.com wrote: +1 Otis -- Sematext is hiring -- http://sematext.com/about/jobs.html?mls Lucene, Solr, Nutch, Katta, Hadoop, HBase, UIMA, NLP, NER, IR - Original Message From: George Aroush

Re: how is the fieldNorm factor computed from?

2009-08-11 Thread Simon Willnauer
On Tue, Aug 11, 2009 at 10:35 AM, leibnizse3g2...@gmail.com wrote: i saw someone said that this fieldNor factor is consisted out of :docboost ,fieldboost and lengthNorm. but i tried to computed like this,but the result is not the same as lucene's output. The factors you use seem to be correct

Re: [ORP] JIRA

2009-06-29 Thread Simon Willnauer
Hey Shalin, this url is an admin only url. Try this one: https://issues.apache.org/jira/browse/ORP Website and mailinglists are out there as well, see: http://lucene.apache.org/openrelevance/ simon On Mon, Jun 29, 2009 at 10:58 AM, Shalin Shekhar Mangarshalinman...@gmail.com wrote: On Thu, Jun

Re: [Open Relevance] Mailing List Status

2009-06-25 Thread Simon Willnauer
Task status was changed to Fixed 6 hours ago! https://issues.apache.org/jira/browse/INFRA-2086 we should have openrelevance-u...@lucene.apache.org openrelevance-...@lucene.apache.org by now. On Wed, Jun 24, 2009 at 11:18 PM, Grant Ingersollgsing...@apache.org wrote: Hi Simon, Any word on

Re: [Open Relevance] Mailing List Status

2009-06-25 Thread Simon Willnauer
You may subscribe here: openrelevance-user-subscr...@lucene.apache.org and openrelevance-dev-subscr...@lucene.apache.org simon On Thu, Jun 25, 2009 at 9:54 AM, Simon Willnauersimon.willna...@googlemail.com wrote: Task status was changed to Fixed 6 hours ago!

Re: [ORP] JIRA

2009-06-25 Thread Simon Willnauer
On Thu, Jun 25, 2009 at 2:24 PM, Andrzej Bialeckia...@getopt.org wrote: Simon Willnauer wrote: It worked for me, can you please try again and verify. simon I confirm it worked for me. Cool thanks, we should have somebody not being a moderator to confirm too. simon -- Best regards

Re: [ORP] JIRA

2009-06-25 Thread Simon Willnauer
Thanks guys! :) no further confirmation needed. On Thu, Jun 25, 2009 at 3:36 PM, Michael McCandlessluc...@mikemccandless.com wrote: On Thu, Jun 25, 2009 at 9:13 AM, Marvin Humphreymar...@rectangular.com wrote: I've successfully subscribed to both lists. Me too. Mike

Re: Open Relevance Project Kickoff

2009-06-08 Thread Simon Willnauer
Willnauer (sim...@apache.org) simon On Sat, Jun 6, 2009 at 11:42 PM, Grant Ingersollgsing...@apache.org wrote: On Jun 6, 2009, at 2:33 PM, Simon Willnauer wrote: I'm about to kick off the mailing list request for openrelevance-u...@lucene.apache.org We need to announce list moderators

Re: How to structure lucene query?

2009-06-07 Thread Simon Willnauer
could you please give us more details of you query or an example that might help to understand what you are trying to do. I had the same impression as Ted though. simon On Sun, Jun 7, 2009 at 4:28 PM, ywlee522ywlee...@gmail.com wrote: Thanks for the tip.  But, no, it is not same as finding

Re: Open Relevance Project Kickoff

2009-06-06 Thread Simon Willnauer
Hi there, On Fri, Jun 5, 2009 at 6:17 PM, Grant Ingersollgsing...@apache.org wrote: On Jun 2, 2009, at 12:00 PM, Simon Willnauer wrote: On Tue, Jun 2, 2009 at 7:34 PM, Grant Ingersoll gsing...@apache.org wrote: The Open Relevance Project vote has passed.  Now the fun begins. To get

Re: Open Relevance Project Kickoff

2009-06-02 Thread Simon Willnauer
On Tue, Jun 2, 2009 at 7:34 PM, Grant Ingersoll gsing...@apache.org wrote: The Open Relevance Project vote has passed.  Now the fun begins. To get started, I think we need to get some infrastructure in place.  I'd suggest: 1. Subversion (I can take care of this) 2. Mailing list.  I think we

Re: [VOTE] Make the Open Relevance Project (ORP) and official Lucene subproject

2009-05-28 Thread Simon Willnauer
+1 On Thu, May 28, 2009 at 2:11 PM, Uwe Schindler u...@thetaphi.de wrote: +1 I think it is a good idea to collect such data that is freely available. Maybe there will be licensing issues, but this would need to be discussed in this project. Uwe - Uwe Schindler H.-H.-Meier-Allee 63,

Re: Open Relevance Infrastucture Request

2009-05-26 Thread Simon Willnauer
I wonder if a P2P network would be an option at all? I doubt that P2P is feasible for 100s of GB but we might get peers with bigger pipes supporting the ASF. Providing a Bittorrent download could work as soon as it is boostrapped. simon On Tue, May 26, 2009 at 4:14 PM, Grant Ingersoll

Re: Lucene rich-text search with returned hyperlinks

2009-05-10 Thread Simon Willnauer
Hey there, On Mon, May 11, 2009 at 4:43 AM, apgw anth...@databaserepublic.com wrote: I am new to Lucene. Is this the right utility to use for the following use case: 1) Find a search term - eg. 'lithium battery' in some technical rich-text data (can be in any european language), 4K - 64K