Re: dynamic changes to schema

2009-08-19 Thread Constantijn Visinescu
huh? I think I lost you :) You want to use a multivalued field to list what dynamic fields you have in your document? Also if you program your application correctly you should be able to restrict your users from doing anything you please (or don't please in this case). On Tue, Aug 18, 2009 at

RE: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-19 Thread Fuad Efendi
DO NOT RELY on your hosting provider. They use automated tools creating complete mess with approved for production on CentOS versions of Lucene, Servlet-API, java.util.* package, and etc; look at this: Here is my classpath entry when Tomcat starts up java.library.path:

Re: DataImportHandler ignoring most rows

2009-08-19 Thread Noble Paul നോബിള്‍ नोब्ळ्
this comment says that str name=Total Rows Fetched7/str the query fetched only 7 rows. If possible open a tool and just run the same query and see how many rows are returned On Wed, Aug 19, 2009 at 3:46 AM, Erik Earleerikea...@yahoo.com wrote: Using: - apache-solr-1.3.0 - java 1.6 -

Re: Replication over multi-core solr

2009-08-19 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Wed, Aug 19, 2009 at 2:27 AM, vivek sarvivex...@gmail.com wrote: Hi,  We use multi-core setup for Solr, where new cores are added dynamically to solr.xml. Only one core is active at a time. My question is how can the replication be done for multi-core - so every core is replicated on the

Re: Is negative boost possible?

2009-08-19 Thread Marc Sturlese
:the only way to negative boost is to positively boost the inverse... : : (*:* -field1:value_to_penalize)^10 This will do the job aswell as bq supports pure negative queries (at least in trunk): bq=-field1:value_to_penalize^10

Re: Replication over multi-core solr

2009-08-19 Thread Licinio Fernández Maurelo
Hi Vivek, currently we want to add cores dynamically when the active one reaches some capacity, can you give me some hints to achieve such this functionality? (Just wondering if you have used shell-scripting or you have code some 100% Java based solution) Thx 2009/8/19 Noble Paul നോബിള്‍

Problems importing HTML content contained within XML document

2009-08-19 Thread venn hardy
Hello, I have just started trying out SOLR to index some XML documents that I receive. I am using the SOLR 1.3 and its HttpDataSource in conjunction with the XPathEntityProcessor. I am finding the data import really useful so far, but I am having a few problems when I try and import HTML

Re: CorruptIndexException: Unknown format version

2009-08-19 Thread Licinio Fernández Maurelo
It looks like your solr lucene-core version doesn't match with the lucene version used to generate the index, as Yonik said, looks like there is a lucene library conflict. 2009/8/19 Chris Hostetter hossman_luc...@fucit.org: : how can that happen, it is a new index, and it is already corrupt? :

Re: Replication over multi-core solr

2009-08-19 Thread vivek sar
Licinio, Please open a separate thread - as it's a different issue - and I can respond there. -vivek 2009/8/19 Licinio Fernández Maurelo licinio.fernan...@gmail.com: Hi Vivek, currently we want to add cores dynamically when the active one reaches some capacity, can you give me some hints

Adding cores dynamically

2009-08-19 Thread Licinio Fernández Maurelo
Hi there, currently we want to add cores dynamically when the active one reaches some capacity, can anyone give me some hints to achieve such this functionality? (Just wondering if you have used shell-scripting or you have code some 100% Java based solution) Thx -- Lici

Re: Replication over multi-core solr

2009-08-19 Thread Licinio Fernández Maurelo
Ok 2009/8/19 vivek sar vivex...@gmail.com: Licinio,  Please open a separate thread - as it's a different issue - and I can respond there. -vivek 2009/8/19 Licinio Fernández Maurelo licinio.fernan...@gmail.com: Hi Vivek, currently we want to add cores dynamically when the active one

Re: Spanish Stemmer

2009-08-19 Thread Licinio Fernández Maurelo
Hi, take a look at this: !-- Tipo de campo para Textos (con stemming en español) -- fieldtype name=textTypeWithStemming class=solr.TextField analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.WordDelimiterFilterFactory

Re: Problems importing HTML content contained within XML document

2009-08-19 Thread Martijn v Groningen
Hi Venn, I think what is happening when the BODY element is being processed by xpath expressen (/document/category/BODY), is that it does not retrieve the text content from the P elements inside the body element. The expression will only retrieve text content that is directly a child of the BODY

Re: Problems importing HTML content contained within XML document

2009-08-19 Thread Noble Paul നോബിള്‍ नोब्ळ्
try this field column=textContent xpath=/document/category/BODY faltten=true/ this should slurp al the tags under body On Wed, Aug 19, 2009 at 1:44 PM, venn hardyvenn.ha...@hotmail.com wrote: Hello, I have just started trying out SOLR to index some XML documents that I receive. I am using

Re: Problems importing HTML content contained within XML document

2009-08-19 Thread Noble Paul നോബിള്‍ नोब्ळ्
sorry field column=textContent xpath=/document/category/BODY flatten=true/ 2009/8/19 Noble Paul നോബിള്‍ नोब्ळ् noble.p...@corp.aol.com: try this field column=textContent xpath=/document/category/BODY faltten=true/ this should slurp al the tags under body On Wed, Aug 19, 2009 at 1:44 PM,

Re: Relevant results with DisMaxRequestHandler

2009-08-19 Thread Vincent Pérès
Wow, it's like the 'mm' parameters is just appeared for the first time... Yes, I read the doc few times, but never understood that the documents who doesn't match any of the expressions will not be return... my apologize everything seems more clear now thanks to the min number parameter. Thank

Re: JVM Heap utilization Memory leaks with Solr

2009-08-19 Thread Rahul R
Fuad, We have around 5 million documents and around 3700 fields. All documents will not have values for all the fields JRockit is not approved for use within my organization. But thanks for the info anyway. Regards Rahul On Tue, Aug 18, 2009 at 9:41 AM, Funtick f...@efendi.ca wrote: BTW,

Solr-773 (GEO Module) question

2009-08-19 Thread johan . sjoberg
Hi, we're glancing at the GEO search module known from the jira issue 773 (http://issues.apache.org/jira/browse/SOLR-773). It seems to us that the issue is still open and not yet included in the nightly builds. Is there a release plan for the nightly builds, and is this module considered core

Re: MultiCore Queries? are they possible

2009-08-19 Thread Shalin Shekhar Mangar
On Tue, Aug 18, 2009 at 5:47 PM, Ninad Raut hbase.user.ni...@gmail.comwrote: Hi, Can we create a Join query between two indexes on two cores? Is this possible in Solr? I have a index which stores author profiles and other index which stores content and a author id as a reference. Can I query

Re: Strange error with shards

2009-08-19 Thread Shalin Shekhar Mangar
On Tue, Aug 18, 2009 at 9:01 PM, ahammad ahmed.ham...@gmail.com wrote: HTTP Status 500 - null java.lang.NullPointerException at org.apache.solr.handler.component.QueryComponent.mergeIds(QueryComponent.java:437) at The way I created this shard was to copy an existing one, erasing all the

Re: Passing a Cookie in SolrJ

2009-08-19 Thread Shalin Shekhar Mangar
On Tue, Aug 18, 2009 at 10:18 PM, Ramirez, Paul M (388J) paul.m.rami...@jpl.nasa.gov wrote: Hi All, The project I am working on is using Solr and OpenSSO (Sun's single sign on service). I need to write some sample code for our users that shows them how to query Solr and I would just like to

Re: How to boost fields with many terms against single-term?

2009-08-19 Thread Shalin Shekhar Mangar
On Wed, Aug 19, 2009 at 12:32 AM, Fuad Efendi f...@efendi.ca wrote: I don't want single-term docs such as home to appear in top for simple search for a home; I need home improvement made easy in top... How to implement it at query time? If you always want home improvement made easy on top

Re: Strange error with shards

2009-08-19 Thread ahammad
Each core has a different database as a datasource, which means that they have different DB structures and fields. That is why the schemas are different. I figured out the cause of this problem. You were right, it was the uniqueKey field. All of my cores have that field set to id but for this

Re: Strange error with shards

2009-08-19 Thread Shalin Shekhar Mangar
On Wed, Aug 19, 2009 at 6:44 PM, ahammad ahmed.ham...@gmail.com wrote: Each core has a different database as a datasource, which means that they have different DB structures and fields. That is why the schemas are different. If all the shards should have the same schema, then what is the

RE: JVM Heap utilization Memory leaks with Solr

2009-08-19 Thread Fuad Efendi
Hi Rahul, JRockit could be used at least in a test environment to monitor JVM (and troubleshoot SOLR, licensed for-free for developers!); they have even Eclipse plugin now, and it is licensed by Oracle (BEA)... But, of course, in large companies test environment is in hands of testers :)

multi words synonyms

2009-08-19 Thread Jae Joo
Hi, I would like to make the synonym for internal medicine to physician or doctor. but it is not working properly. Anyone help me? synonym.index.txt internal medicine = physician synonyms.query.txt physician, internal medicine = physician, doctor In the Analysis tool, I can see clearly that

Shutdown Solr

2009-08-19 Thread Miller, Michael P.
Does anyone know a graceful way to shutdown Solr? (other than killing the process with Ctrl-C)

Re: Shutdown Solr

2009-08-19 Thread Tobias Brennecke
it catches the kill signal and shuts down as it should, I guess :) because it writes stuff to the log after pressing ^c 2009/8/19 Miller, Michael P. m.mil...@radium.ncsc.mil Does anyone know a graceful way to shutdown Solr? (other than killing the process with Ctrl-C)

Re: Data Modeling

2009-08-19 Thread Smiley, David W.
This is the sort of Solr fundamentals question my book (chapter 2) will help you with. Think about what your user interface is. What are users searching for? That is, what exactly comes back from search results? It's not clear from your description what your search scenario is. ~ David

Re: Solr-773 (GEO Module) question

2009-08-19 Thread Ryan McKinley
On Aug 19, 2009, at 6:45 AM, johan.sjob...@findwise.se wrote: Hi, we're glancing at the GEO search module known from the jira issue 773 (http://issues.apache.org/jira/browse/SOLR-773). It seems to us that the issue is still open and not yet included in the nightly builds. correct

RE: Shutdown Solr

2009-08-19 Thread Fuad Efendi
catalina.sh stop But SolrServlet catches everything and forgets to implement destroy()! I am absolutely unsure about Ctrl-C and even have many concerns regarding catalina.sh stop... J2EE/JEE does not specify any support for threads outside than container-managed... I hope SolrServlet closes

RE: Shutdown Solr

2009-08-19 Thread Fuad Efendi
Most probably Ctrl-C is graceful for Tomcat, and kill -9 too... Tomcat is smart... I prefer /etc/init.d/my_tomcat wrapper around catalina.sh (su tomcat, /var/lock etc...) - ok then, Graceful Shutdown depends on how you started Tomcat.

strange sorting results: each word in field is sorted

2009-08-19 Thread Paul Rosen
I'm trying to sort, but I am not always getting the correct results and I'm not sure where to start tracking down the problem. You can see the problem here (at least until it's fixed!): http://nines.performantsoftware.com/search/saved?user=paulname=poem If you sort by Title/Ascending, you

Re: Shutdown Solr

2009-08-19 Thread Paul Tomblin
On Wed, Aug 19, 2009 at 2:43 PM, Fuad Efendif...@efendi.ca wrote: Most probably Ctrl-C is graceful for Tomcat, and kill -9 too... Tomcat is smart... I prefer /etc/init.d/my_tomcat wrapper around catalina.sh (su tomcat, /var/lock etc...) - ok then, Graceful Shutdown depends on how you started

Re: strange sorting results: each word in field is sorted

2009-08-19 Thread Erik Hatcher
On Aug 19, 2009, at 2:45 PM, Paul Rosen wrote: You can see the problem here (at least until it's fixed!): http://nines.performantsoftware.com/search/saved?user=paulname=poem Hi Paul - that project looks familiar! :) If you sort by Title/Ascending, you get partially sorted results, but it

RE: Shutdown Solr

2009-08-19 Thread Fuad Efendi
Thanks... kill should be / can be graceful; kill -9 should kill immediately... no any hang, whole point... http://www.nabble.com/Is-kill--9-safe-or-not--td24866506.html -Original Message- From: ptomb...@gmail.com [mailto:ptomb...@gmail.com] On Behalf Of Paul Tomblin Sent: August-19-09

WordDelimiterFilter = MultiPhraseQuery?

2009-08-19 Thread jOhn
My issue is with the use of WordDelimiterFilter and how the QueryParser (Dismax) converts the query into a MultiPhraseQuery. This is on solr 1.3 / lucene 2.4.1. For example: 1. yuma - 3:10 to Yuma 2. yUma - no results For #2 it gets split into y + uma and becomes a MultiPhraseQuery requiring

Re: strange sorting results: each word in field is sorted

2009-08-19 Thread Paul Rosen
Erik Hatcher wrote: On Aug 19, 2009, at 2:45 PM, Paul Rosen wrote: You can see the problem here (at least until it's fixed!): http://nines.performantsoftware.com/search/saved?user=paulname=poem Hi Paul - that project looks familiar! :) Hi Erik! I should hope so! And I've gone a year

FW: Data Modeling

2009-08-19 Thread Vladimir Landman
I hit reply and sent this to just David, but I think it should go to the whole list: Hi David, I want to do 2 kinds of things with Solr  Maybe 3 in the future 1. I want to use  it on our website so that a customer can filter down products by different attributes.  So suppose we have:

Re: Adding cores dynamically

2009-08-19 Thread vivek sar
Lici, We're doing similar thing with multi-core - when a core reaches capacity (in our case 200 million records) we start a new core. We are doing this via web service call (Create web service), http://wiki.apache.org/solr/CoreAdmin This is all done in java code - before writing we check

Re: strange sorting results: each word in field is sorted

2009-08-19 Thread Erik Hatcher
On Aug 19, 2009, at 3:50 PM, Paul Rosen wrote: I'm surprised you're not seeing an exception when trying to sort on title given this configuration. Sorting must be done on single valued indexed fields, that have at most a single term indexed per document. I recommend you use copyField to

Re: Passing a Cookie in SolrJ

2009-08-19 Thread Lance Norskog
SolrJ uses the Apache Commons HTTP client. This describes the authentication system: http://hc.apache.org/httpclient-3.x/authentication.html http://hc.apache.org/httpclient-3.x/apidocs/org/apache/commons/httpclient/auth/package-frame.html *This has code to use authentication*

Re: Shutdown Solr

2009-08-19 Thread Lance Norskog
In production systems I have done a three-stage technique. First, use the container's standard shutdown tool. Tomcat, JBoss, Jetty all have their own. Then, sleep for maybe 60 seconds. Then do kill, sleep more, then 'kill -9'. On Wed, Aug 19, 2009 at 12:21 PM, Fuad Efendi f...@efendi.ca wrote:

Re: DataImportHandler ignoring most rows

2009-08-19 Thread Lance Norskog
It usually helps to make a database view of your query, and then load the DIH from that view. There are cases where some query syntaxes are mangled on the way to the DB. 2009/8/18 Noble Paul നോബിള്‍ नोब्ळ् noble.p...@corp.aol.com this comment says that str name=Total Rows Fetched7/str the

Re: DataImportHandler ignoring most rows

2009-08-19 Thread erikea...@yahoo.com
I switched to the ms driver an now all is well. Must be an incompatibility with the JSQLConnect driver. Sent from my iPhone On Aug 18, 2009, at 11:47 PM, Noble Paul നോബിള്‍ नो ब्ळ् noble.p...@corp.aol.com wrote: this comment says that str name=Total Rows Fetched7/str the query

RE: Data Modeling

2009-08-19 Thread Smiley, David W.
It's getting clearer Vladimir. So fundamentally your users are searching for products (apparently auto parts) and the different attributes would become navigation filters. If this is right, then your initial schema (the first email) is a start, although it's a little ambigous to interpert it

Re: dynamic changes to schema

2009-08-19 Thread Marco Westermann
Hi, thanks for your answers, I think I have to go more in deatail. we are talking about a shop-application which have products I want to search for. This products normally have the standard attributes like sku, a name, a price and so on. But the user can add attributes to the product. So for

Re: dynamic changes to schema

2009-08-19 Thread Erik Hatcher
However, you can have a dynamic * field mapping that catches all field names that aren't already defined - though all of the fields will be the same field type. Erik On Aug 19, 2009, at 5:48 PM, Marco Westermann wrote: Hi, thanks for your answers, I think I have to go more in

【solr DIH】A problem about solr delta-imports

2009-08-19 Thread huenzhao
Hi all, There is a problem when I use solr delta-imports to update the index. I have added the last_modified column in the table. After I use the full-import command to index the database data, the dataimport.properties file contains nothing, and when I use the delta-import command to update

Re: 【solr DIH】A problem about solr delta-imports

2009-08-19 Thread Noble Paul നോബിള്‍ नोब्ळ्
which version of solr are you using? .Solr1.3 had a bug with this. On Thu, Aug 20, 2009 at 9:42 AM, huenzhaohuenz...@126.com wrote: Hi all, There is a problem when I use solr delta-imports to update the index. I have added the last_modified column in the table. After I use the full-import

Re: 【solr DIH】A problem about solr delta-imports

2009-08-19 Thread huenzhao
The version is 1.3. After I used the full-import, the tomcat log show that the solr did not call the SolrWriter class. Do you know the solution of this bug? Noble Paul നോബിള്‍ नोब्ळ्-2 wrote: which version of solr are you using? .Solr1.3 had a bug with this. On Thu, Aug 20, 2009 at