Overall

2008-06-09 Thread Mihails Agafonovs
Hi! Some questions: 1) Is it possible to make Solr to use, for example, MySQL database, or it only supports *.xml files as a database? 2) Is there a way to add data in the search database using some online interface, or the only way is manually adding the data in the *.xml files? 3) Is there any

Re: Overall

2008-06-09 Thread Umar Shah
2008/6/9 Mihails Agafonovs [EMAIL PROTECTED]: Hi! Some questions: 1) Is it possible to make Solr to use, for example, MySQL database, or it only supports *.xml files as a database? you can use DataImportHandler to index from MySql (or other databases) 2) Is there a way to add data in

Re: Overall

2008-06-09 Thread Mihails Agafonovs
1) ok 2) This means developing some custom program, so there is no such functionality in Solr :( 3) I have some connection problems and I really can't load these mailing list archives at all! Anyway, I want to understand, how can I use Solr in my site or any other usage? Quoting Umar Shah :

Re: Overall

2008-06-09 Thread Dom Stockdale
Hi Mihails, I don't know about points 1 and 2 as I'm just starting with Solr but for point 3 you need to understand that Solr is just going to return xml for your queries so you can use any web language to parse the xml of the results. It might return other formats like json as well,

setAllowLeadingWildcard

2008-06-09 Thread Dom Stockdale
Hello list, I really need to setAllowLeadingWildcard to true and I'm wondering if you can advise me on the best way to do this. I am a newbie so forgive me if I'm being a dummy. I've established that it's not set-able in the 1.2.0 version which seems to be quite old so I've been looking

DataImport

2008-06-09 Thread Mihails Agafonovs
Looked through the tutorial on data import, section Full Import Example. 1) Where is this dataimport.jar? There is no such file in the extracted example-solr-home.jar. 2) Use the solr folder inside example-data-config folder as your solr home. What does this mean? Anyway, there is no folder

Re: DataImport

2008-06-09 Thread Shalin Shekhar Mangar
1. Correct, there is no jar. You can use the solr.war file. If you really need a jar, you'll need to use the SOLR-469.patch at http://issues.apache.org/jira/browse/SOLR-469 and build solr from source after applying that patch. 2. The jar contains a folder named example-solr-home. Please check

Solr system and numbers

2008-06-09 Thread dudes dudes
Hello experts, How does Solr deal with numbers or phone numbers .. For example if you have 1234 and 12 34 or 1 234... with spaces between the numbers .. Or this is dealt by lucene ? any documentations or tutorial on this ? many thanks, ak

Re: DataImport

2008-06-09 Thread Mihails Agafonovs
I've placed the solr.war under the tomcat directory, restarted tomcat to deploy the solr.war. But still... there is no .jar, no folder named example-data-config, and hitting http://localhost:8983/solr/dataimport doesn't work. Do I need the original Solr instance to use this .war with? Quoting

RE: Solr system and numbers

2008-06-09 Thread dudes dudes
great info ,,, thanks a lot all Date: Mon, 9 Jun 2008 05:58:50 -0700 From: [EMAIL PROTECTED] Subject: Re: Solr system and numbers To: solr-user@lucene.apache.org Hi, Solr/Lucene can treat phone numbers as strings. If you want to clean them up

Re: Problems in solrJ trunk

2008-06-09 Thread Alexander Ramos Jardim
Well, There is a simple case here. I tried to update SolrJ to use the last one and got the application selected for test broke. So, I developed an alternative interface for SolrServer and a wrapper to CommonsHttpSolrServer. Altered my aoolication to use it and everything is working nice. When

Re: DataImport

2008-06-09 Thread Shalin Shekhar Mangar
No, the steps are as follows: 1. Download the example-solr-home.jar from the DataImportHandler wiki page 2. Extract it. You'll find a folder named example-solr-home and a solr.war file after extraction 3. Copy the solr.war to tomcat_home/webapps. You don't need any other solr instance. This war is

Re: Problems in solrJ trunk

2008-06-09 Thread Otis Gospodnetic
Hi, This interface vs. abstract class and maintenance/backwards compatibility question comes up pretty often. I suggest using markmail.org and searching for things like: interface abstract solr -jira interface abstract lucene -jira I think that will lead to some explanations without anyone

Re: Problems in solrJ trunk

2008-06-09 Thread Lucas F. A. Teixeira
Exactly, And adding the methods in the abstract class in the minor releases, and in the interface in major releases. []s, Lucas Lucas Frare A. Teixeira [EMAIL PROTECTED] mailto:[EMAIL PROTECTED] Tel: +55 11 3660.1622 - R3018 Alexander Ramos Jardim escreveu: Well, There is a simple case

Re: Problems in solrJ trunk

2008-06-09 Thread Alexander Ramos Jardim
Thank you Lucas, You caught my point nicely and even got a clearer idea of what to do. Sorry Solr Dev Team, but I don't there is any reasonable excuse for making such an argument interface vs abstract class as they are complements and don't have the same role in OOP. Anyways, Solr is a great

solrj client in mven repository?

2008-06-09 Thread Zsolt Czinkos
Hello all I'm new to solr, and have a question about the java client. Is it going to be available from central maven repository? I had a look, and saw that it is under development (1.3 dev), but someone may have tha answer. I built the trunk and solrj code seems to be separated from solr

Re: NullPointerException at lucene.analysis.StopFilter with 1.3

2008-06-09 Thread Ronald K. Braun
: I'm just looking into transitioning from solr 1.2 to 1.3 (trunk). I : have some legacy handler code (called AdvancedRequestHandler) that : used to work with 1.2 but now throws an exception using 1.3 (latest : nightly build). This is an interesting use case that wasn't really considered

XSL scripting

2008-06-09 Thread Lance Norskog
This started out in the num-docs thread, but deserves its own. And a wiki page. There is a more complex and general way to get the number of documents in the index. I run a query against solr and postprocess the output with an XSL script. Install this xsl script as home/conf/xslt/numfound.xsl.

Re: solrj client in mven repository?

2008-06-09 Thread spencer.c
It is not in a central repo yet, though this has been requested. See the issue I filed here: https://issues.apache.org/jira/browse/SOLR-586 If you follow the outline there, you can build/install into your own repo pretty easily. Zsolt Czinkos-2 wrote: Hello all I'm new to solr, and

html to text based on some sort of uniqueness metric

2008-06-09 Thread Cam Bazz
Hello, I am indexing newspaper articles as an excercise in solr. When dealing with newspaper articles in previous experiences I always tried to get the div or the table that contains the actual news, using nekohtml traversing tru the dom tree and getting the text from the div or table that

Re: solrj client in mven repository?

2008-06-09 Thread Alexander Ramos Jardim
I have done mine already. It is really simple. 2008/6/9 spencer.c [EMAIL PROTECTED]: It is not in a central repo yet, though this has been requested. See the issue I filed here: https://issues.apache.org/jira/browse/SOLR-586 If you follow the outline there, you can build/install into your

Re: Solr system and numbers

2008-06-09 Thread Cam Bazz
I got a similar question: how would one normalize or even detect if a string is a phone number? On Mon, Jun 9, 2008 at 4:17 PM, dudes dudes [EMAIL PROTECTED] wrote: great info ,,, thanks a lot all Date: Mon, 9 Jun 2008 05:58:50 -0700 From: [EMAIL

Re: XSL scripting

2008-06-09 Thread Otis Gospodnetic
Lance, Thanks, want to put it up on the Wiki? Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Lance Norskog [EMAIL PROTECTED] To: solr-user@lucene.apache.org Sent: Monday, June 9, 2008 1:12:35 PM Subject: XSL scripting This started

Re: Solr system and numbers

2008-06-09 Thread Otis Gospodnetic
Not sure. Perhaps it can be done by training a language model and treating phone numbers as named entities? Not sure if it would work. But I know there are a few NLP people subscribed, maybe they'll have some good ideas. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch

Re: html to text based on some sort of uniqueness metric

2008-06-09 Thread Otis Gospodnetic
I have not looked at the code yet, but look for NovelAnalyzer in Lucene JIRA. I believe it's supposed to do something similar. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Cam Bazz [EMAIL PROTECTED] To: solr-user@lucene.apache.org

Re: Solr system and numbers

2008-06-09 Thread Otis Gospodnetic
Doh, I forgot. Regular expressions worked well for me when I dealt with that problem many years ago. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message From: Otis Gospodnetic [EMAIL PROTECTED] To: solr-user@lucene.apache.org Sent: Monday, June 9,

Re: Overall

2008-06-09 Thread Alexander Ramos Jardim
2008/6/9 Mihails Agafonovs [EMAIL PROTECTED]: Hi! Some questions: 1) Is it possible to make Solr to use, for example, MySQL database, or it only supports *.xml files as a database? If you do that, use MySQL own full text search capabilities and not Solr, as it is built from Lucene. 2) Is

Re: solrj client in mven repository?

2008-06-09 Thread Zsolt Czinkos
I've already installed the jars into my local repo, but the pom files are very useful. Thank you zsolt On Mon, Jun 9, 2008 at 10:02 PM, Alexander Ramos Jardim [EMAIL PROTECTED] wrote: I have done mine already. It is really simple. 2008/6/9 spencer.c [EMAIL PROTECTED]: It is not in a

Re: Num docs

2008-06-09 Thread Otis Gospodnetic
Exactly. I think I mentioned this once before several months ago. One can take various hardware specs (# cores, CPU speed, FSB, RAM, etc.), performance numbers, etc. and come up with a number for each server's overall capacity. As a matter of fact, I think this would be useful to have right

searching only within allowed documents

2008-06-09 Thread Stephen Weiss
Hi, I'm new to Solr (and Lucene) and I'm trying to work out just how I could fit this technology into my app (I'm moving over from using MySQL fulltext indexes). Things are actually going really well - the facet functionality fits in just perfectly, and the basic full-text searching is

Re: Overall

2008-06-09 Thread Shalin Shekhar Mangar
2) Take a look at DataImportHandler for indexing data at http://wiki.apache.org/solr/DataImportHandler 2008/6/10 Alexander Ramos Jardim [EMAIL PROTECTED]: 2008/6/9 Mihails Agafonovs [EMAIL PROTECTED]: Hi! Some questions: 1) Is it possible to make Solr to use, for example, MySQL