AW: Problem building Lucene
Hello, I downloaded the lucene source and have been trying to build using ant. I am getting the following error message: -- - Buildfile: build.xml init: javacc_check: compile: [javacc] java was not found in /usr/local/apps/java/bin/sparc/native_threads/java BUILD FAILED /users/science/user/lucene/lucene-1.2-src/build.xml:96: java failed with return code 1 -- - The JavaCC version is 2.1. Platform is Sun sparc solaris. JAVA_HOME env variable has been set to /usr/local/apps/java. Any help will be most appreciated. It seems, that ant couldn't find your javacc. Write a configuration file for ant named build.properties. Put it to the basedir of your sources or your home directory and add the following line: - snip - java.home=/opt/lib/javacc2.1/bin - snip - Please change the path to the directory where javacc lives. You may also add the line debug=on to get more details if any error occured. Read the ant manual for further details. Regards, Wolf-Dietrich Materna -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: Fun project?
I'm rather partial to Jini for distributed systems, but I agree that JXTA would definitely be the way to go on this type of peer-to-peer scenario. Scott [EMAIL PROTECTED] wrote: I'll be doing something very similar some time in the next 12 months for the project I'm working on. I'll be more than happy than happy to contribute the code when its done, but the rest of the project has been implemented with CORBA, and it had been my plan to use CORBA for the distributed index servers as well. I'll look into JXTA though, as I hadn't come across it before. Kiril Otis Gospodnetic 21/11/2002 16:57 Please respond to Lucene Users List To: Lucene Users List cc: Subject:Re: Fun project? Yeah, I thought of that, too. JXTA is the P2P piece that you are asking about. A recent post on Slashdot mentioned something that IBM did that sounds similar. Time... :) Otis --- Robert A. Decker wrote: I wish I had time to work on this for fun, but I was thinking about what could be a fun lucene project... One could build a peer-to-peer document search application. Each client would index the documents on its harddrive, or documents in a particular directory. When the user at the computer does a search it will look at the documents on its harddrive, but also send out a request for the search on the P2P network. First though, are there any P2P java frameworks out there? One could build one, perhaps with OpenJMS, but it would be nice if one already existed. Hmm... if anyone else thinks this would be cool I'd be willing to work on this with you. thanks, Robert A. Decker http://www.robdecker.com/ http://www.planetside.com/ -- To unsubscribe, e-mail: For additional commands, e-mail: __ Do you Yahoo!? Yahoo! Mail Plus ? Powerful. Affordable. Sign up now. http://mailplus.yahoo.com -- To unsubscribe, e-mail: For additional commands, e-mail: -- To unsubscribe, e-mail: For additional commands, e-mail: -- Brain: Pinky, are you pondering what Im pondering? Pinky: I think so, Brain, but calling it a pu-pu platter? Huh, what were they thinking? -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: AW: Problem building Lucene
Hi Materna: Thanks for your help. I created the build.properties file and that did not fix the problem. Turns out my JAVA_HOME env variable was pointing to a version that was not 'native_threads' enabled. I looked around and found one that was and could compile successfully. Thanks again. Nita --- Hello, I downloaded the lucene source and have been trying to build using ant. I am getting the following error message: -- - Buildfile: build.xml init: javacc_check: compile: [javacc] java was not found in /usr/local/apps/java/bin/sparc/native_threads/java BUILD FAILED /users/science/user/lucene/lucene-1.2-src/build.xml:96: java failed with return code 1 -- - The JavaCC version is 2.1. Platform is Sun sparc solaris. JAVA_HOME env variable has been set to /usr/local/apps/java. Any help will be most appreciated. It seems, that ant couldn't find your javacc. Write a configuration file for ant named build.properties. Put it to the basedir of your sources or your home directory and add the following line: - snip - java.home=/opt/lib/javacc2.1/bin - snip - Please change the path to the directory where javacc lives. You may also add the line debug=on to get more details if any error occured. Read the ant manual for further details. Regards, Wolf-Dietrich Materna -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: Help on creating and maintaining an index that changes
I want to do something similiar with Lucene, but I don't know how to approach it. I thought maybe keeping the first hashmap as is, and building a Directory in lucene that replaces the master Hashmap. When I get hits back from lucene I look them up in the first hashmap, and return those. If your index is big its probably best to do it this way. I got indexes that takes up to 12 hours to build and takes about 1gb of harddrive space but searching is still fast. if you put the client id's into keyword fields you can use lucenes to filter out hits from the clients you know is offline by using a boolean NOT, either manually or through the queryparser. How do I put the needed information into Directory so I can look them up in the first hashmap. I would need the unique id identifying the client, and a key that identifies the document that the client has. you add a keyword field to each document that contains the unique id identifying the client. This way you can search for documents from a client, and also filter out documents from that client. Then how do I clean up the Directory when a client is not available? How do I remove a document from Lucene's Directory? the org.apache.lucene.index.IndexReader class contains a delete() function to delete documents from lucene. But as said before, if your index is big it's best not to delete the documents just because a client goes offline, its better to filter out the hits. mvh karl øie -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: Book
There is a book by Wrox called Professional JSP Site Design (I think) that has a chapter on searching and it mentions Lucene, but its coverage on Lucene is *VERY* thin. I wouldn't recommend this book for learning Lucene. I have an article on Lucene to appear in December's Java Developer's Journal. It's not as complete a coverage of Lucene as I would have liked it to be, but with limited space in a magazine I couldn't go into much more than an introduction. I'd have probably written it differently if I had it to do over again. Oh well. Let me know what you think of the article when it comes out. William W wrote: I would like to buy a book about Lucene. Who could write it ? : ) _ STOP MORE SPAM with the new MSN 8 and get 2 months FREE* http://join.msn.com/?page=features/junkmail -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: Book
Craig I do not subscribe to Java Developer's Journal. Are the articles online? Or could it be posted here after the article is published? Thanks, Dave Kendig There is a book by Wrox called Professional JSP Site Design (I think) that has a chapter on searching and it mentions Lucene, but its coverage on Lucene is *VERY* thin. I wouldn't recommend this book for learning Lucene. I have an article on Lucene to appear in December's Java Developer's Journal. It's not as complete a coverage of Lucene as I would have liked it to be, but with limited space in a magazine I couldn't go into much more than an introduction. I'd have probably written it differently if I had it to do over again. Oh well. Let me know what you think of the article when it comes out. William W wrote: I would like to buy a book about Lucene. Who could write it ? : ) _ STOP MORE SPAM with the new MSN 8 and get 2 months FREE* http://join.msn.com/?page=features/junkmail -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: Book
Once they are published I will list them on the Lucene resource page. Otis --- William W [EMAIL PROTECTED] wrote: Otis, I would like to read your articles . Is it possible ? Thanks, William. From: Otis Gospodnetic [EMAIL PROTECTED] Reply-To: Lucene Users List [EMAIL PROTECTED] To: Lucene Users List [EMAIL PROTECTED] Subject: Re: Book Date: Wed, 20 Nov 2002 20:59:40 -0800 (PST) I wrote a few articles that I'm trying to publish somewhere now. Cheaper than a book :) Otis --- William W [EMAIL PROTECTED] wrote: I would like to buy a book about Lucene. Who could write it ? : ) __ Do you Yahoo!? Yahoo! Mail Plus Powerful. Affordable. Sign up now. http://mailplus.yahoo.com -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] _ MSN 8 with e-mail virus protection service: 2 months FREE* http://join.msn.com/?page=features/virus -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] __ Do you Yahoo!? Yahoo! Mail Plus Powerful. Affordable. Sign up now. http://mailplus.yahoo.com -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: Book
JDJ usually makes their articles available online, but I'm not really sure how that works or when they'll be online. I'll keep everyone posted. David Kendig wrote: Craig I do not subscribe to Java Developer's Journal. Are the articles online? Or could it be posted here after the article is published? Thanks, Dave Kendig There is a book by Wrox called Professional JSP Site Design (I think) that has a chapter on searching and it mentions Lucene, but its coverage on Lucene is *VERY* thin. I wouldn't recommend this book for learning Lucene. I have an article on Lucene to appear in December's Java Developer's Journal. It's not as complete a coverage of Lucene as I would have liked it to be, but with limited space in a magazine I couldn't go into much more than an introduction. I'd have probably written it differently if I had it to do over again. Oh well. Let me know what you think of the article when it comes out. William W wrote: I would like to buy a book about Lucene. Who could write it ? : ) _ STOP MORE SPAM with the new MSN 8 and get 2 months FREE* http://join.msn.com/?page=features/junkmail -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: Book
Craig Walls wrote: There is a book by Wrox called Professional JSP Site Design (I think) that has a chapter on searching and it mentions Lucene, but its coverage on Lucene is *VERY* thin. I wouldn't recommend this book for learning Lucene. I have an article on Lucene to appear in December's Java Developer's Journal. It's not as complete a coverage of Lucene as I would have liked it to be, but with limited space in a magazine I couldn't go into much more than an introduction. I'd have probably written it differently if I had it to do over again. Oh well. Let me know what you think of the article when it comes out. You're not in marketing are you? ;-) William W wrote: I would like to buy a book about Lucene. Who could write it ? : ) _ STOP MORE SPAM with the new MSN 8 and get 2 months FREE* http://join.msn.com/?page=features/junkmail -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: Book
[gasp] No, I'm not in marketing? Why do you ask? Andrew C. Oliver wrote: Craig Walls wrote: There is a book by Wrox called Professional JSP Site Design (I think) that has a chapter on searching and it mentions Lucene, but its coverage on Lucene is *VERY* thin. I wouldn't recommend this book for learning Lucene. I have an article on Lucene to appear in December's Java Developer's Journal. It's not as complete a coverage of Lucene as I would have liked it to be, but with limited space in a magazine I couldn't go into much more than an introduction. I'd have probably written it differently if I had it to do over again. Oh well. Let me know what you think of the article when it comes out. You're not in marketing are you? ;-) William W wrote: I would like to buy a book about Lucene. Who could write it ? : ) _ STOP MORE SPAM with the new MSN 8 and get 2 months FREE* http://join.msn.com/?page=features/junkmail -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: Book
You sell your article so well ;-) Craig Walls wrote: [gasp] No, I'm not in marketing? Why do you ask? Andrew C. Oliver wrote: Craig Walls wrote: There is a book by Wrox called Professional JSP Site Design (I think) that has a chapter on searching and it mentions Lucene, but its coverage on Lucene is *VERY* thin. I wouldn't recommend this book for learning Lucene. I have an article on Lucene to appear in December's Java Developer's Journal. It's not as complete a coverage of Lucene as I would have liked it to be, but with limited space in a magazine I couldn't go into much more than an introduction. I'd have probably written it differently if I had it to do over again. Oh well. Let me know what you think of the article when it comes out. You're not in marketing are you? ;-) William W wrote: I would like to buy a book about Lucene. Who could write it ? : ) _ STOP MORE SPAM with the new MSN 8 and get 2 months FREE* http://join.msn.com/?page=features/junkmail -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: Book
Yeah, I got the joke after I replied to your e-mail. (Sorry, I'm a bit slow today) Don't get me wrong, I really like my article and I'm really proud of it (and the PDF they sent me of it looks fantastic!)...It's just that since I wrote the thing much time has passed and I've learned more about Lucene and had more opportunity to rethink some parts of it. I still think it's a great article (and worth reading...buy a copy for all of your friends!)--I'd just have done it differently given the chance. Andrew C. Oliver wrote: You sell your article so well ;-) Craig Walls wrote: [gasp] No, I'm not in marketing? Why do you ask? Andrew C. Oliver wrote: Craig Walls wrote: There is a book by Wrox called Professional JSP Site Design (I think) that has a chapter on searching and it mentions Lucene, but its coverage on Lucene is *VERY* thin. I wouldn't recommend this book for learning Lucene. I have an article on Lucene to appear in December's Java Developer's Journal. It's not as complete a coverage of Lucene as I would have liked it to be, but with limited space in a magazine I couldn't go into much more than an introduction. I'd have probably written it differently if I had it to do over again. Oh well. Let me know what you think of the article when it comes out. You're not in marketing are you? ;-) William W wrote: I would like to buy a book about Lucene. Who could write it ? : ) _ STOP MORE SPAM with the new MSN 8 and get 2 months FREE* http://join.msn.com/?page=features/junkmail -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: StandardFilter that works for French
On Thu, 21 Nov 2002, Konrad Scherer wrote: In French you have 6 words (me, te, se, le/la , ne, de) where the e is replaced with an apostrophe when the following word starts with a vowel. For example me aider becomes m'aider. Currently Lucene indexes m'aider, s'aider, n'aider as different words when in fact they should be analyzed as me aider, se aider, ne aider, etc. So I modified Standard filter to send back these words as two words. I had to add a one Token buffer. I toyed with modifying StandardTokenizer.jj but I was worried about unintended changes in behavior. This change will not effect English indexing. The only change I can think of is that a word like m'lord would be indexed as me lord. Still it might be better to make a French package and add this to a French Filter. There are a number of contractions in English that could be affected if you're using the apostrophe as a marker, e.g.: isn't, wouldn't, I'd, he's, hasn't. (Granted, these are often considered stop words.) Thus, I think that your idea of incorporating this change into a French filter, rather than modifying Standard filter, is a good idea. Joshua O'Madadhain [EMAIL PROTECTED] Per Obscuriuswww.ics.uci.edu/~jmadden Joshua O'Madadhain: Information Scientist, Musician, Philosopher-At-Tall It's that moment of dawning comprehension that I live for. -- Bill Watterson My opinions are too rational and insightful to be those of any organization. -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: StandardFilter that works for French
Alng these same lines, has anyone developed a Spanish filter? I have looked but have not turned anything up. Justin -Original Message- From: Joshua O'Madadhain [mailto:[EMAIL PROTECTED]] Sent: Thursday, November 21, 2002 4:00 PM To: Lucene Users List Cc: Joshua Rhys Taliesin O'Madadhain Subject: Re: StandardFilter that works for French On Thu, 21 Nov 2002, Konrad Scherer wrote: In French you have 6 words (me, te, se, le/la , ne, de) where the e is replaced with an apostrophe when the following word starts with a vowel. For example me aider becomes m'aider. Currently Lucene indexes m'aider, s'aider, n'aider as different words when in fact they should be analyzed as me aider, se aider, ne aider, etc. So I modified Standard filter to send back these words as two words. I had to add a one Token buffer. I toyed with modifying StandardTokenizer.jj but I was worried about unintended changes in behavior. This change will not effect English indexing. The only change I can think of is that a word like m'lord would be indexed as me lord. Still it might be better to make a French package and add this to a French Filter. There are a number of contractions in English that could be affected if you're using the apostrophe as a marker, e.g.: isn't, wouldn't, I'd, he's, hasn't. (Granted, these are often considered stop words.) Thus, I think that your idea of incorporating this change into a French filter, rather than modifying Standard filter, is a good idea. Joshua O'Madadhain [EMAIL PROTECTED] Per Obscuriuswww.ics.uci.edu/~jmadden Joshua O'Madadhain: Information Scientist, Musician, Philosopher-At-Tall It's that moment of dawning comprehension that I live for. -- Bill Watterson My opinions are too rational and insightful to be those of any organization. -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED] -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Re: StandardFilter that works for French
There are a number of contractions in English that could be affected if you're using the apostrophe as a marker, e.g.: isn't, wouldn't, I'd, he's, hasn't. (Granted, these are often considered stop words.) Thus, I think that your idea of incorporating this change into a French filter, rather than modifying Standard filter, is a good idea. Sorry I forgot to mention that it only looks at words where the apostrophe occurs in the second letter and only for words that start with the six magic letters m,t,s,l,n,d . If filtering the very English specific 's and 'S possessives is good enough for the StandardFilter then why not French as well? In the comments of StandardTokenizer.jj we have This should be a good tokenizer for most European-language documents. Most people will use this one, why not have it work as well as possible? The standard tokenizer is very english centric and the code I posted was for those who may not be aware of it. I work with a lot of bilingual documents (english and french) and my case, this filter improves the quality of the index. More philosophically, there probably shouldn't even be a standard analyzer, just language specific ones. All the best Konrad -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Changing QueryParser Syntax
I'm trying to get our query syntax mapped into the QueryParsers Syntax. I have two ideas for approaching this. 1.) I could try to do this externally to Lucene. I could just take my queries and filter them into lucene queries. 2.) However, I have an idea that I might be able th modify the the JavaCC file to produce another QueryParser that works with my syntax, my syntax is similar to that already used by queryparser, but has a couple of minor differences. (like ='s instead of :'s). It would be good for me to experiment with the JavaCC QueryParser stuff because I would also like to write utilities that translate one syntax into another so that when I have external indexes that need to be added to my project I can map thier syntax into my own as well. I figure if I can use the QueryParser to produce query objects from query strings, I should be able to pass Query Objects to the QueryParser and get Back query strings in that particular parser format. Is this a valid Idea? -Mark Diggory -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]
Query Syntax Continued.
I've also been working on the idea of a Generic Query Markup Language (QML), that describes any search query in XML format, this allows one to use a SAX Parser or and XSLT transform to process one query syntax into another syntax by mapping them both to the XML format and back. My idea is that oneday, instead of trying constantly map one query syntax to another for every query syntax that may exist out there, that all one would need to do is map a syntax to QML, then that syntax would be available for translation in my system. Has anyone already heard of something like this? -Mark Diggory -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]