Re: dynamic changes to schema

2009-08-20 Thread Constantijn Visinescu
There's that or you can just change the user entered my_author field into my_author_customattribute in code after the user has entered it and add a *_customattribute to your schema. you'd have to add the postfix in code also at querytime and off you go. Constantijn On Wed, Aug 19, 2009 at 11:52

Re: JVM Heap utilization Memory leaks with Solr

2009-08-20 Thread Rahul R
All these 3700 fields are single valued non-boolean fields. Thanks Regards Rahul On Wed, Aug 19, 2009 at 8:33 PM, Fuad Efendi f...@efendi.ca wrote: Hi Rahul, JRockit could be used at least in a test environment to monitor JVM (and troubleshoot SOLR, licensed for-free for developers!); they

RE: Problems importing HTML content contained within XML document

2009-08-20 Thread venn hardy
Thanks Paul, I upgraded to solr 1.4 and used the flatten attribute as you suggested. It works well. From: noble.p...@corp.aol.com Date: Wed, 19 Aug 2009 15:05:48 +0530 Subject: Re: Problems importing HTML content contained within XML document To: solr-user@lucene.apache.org try this

Fetching Query Results from Solr

2009-08-20 Thread bhaskar chandrasekar
Hi,   I am using Solr to retrive records and display them thru carrot. Assuming that i give Google as search , it will display me all the records relating to that. It should give me exact word search only. It shld not look for combination of words in the search query and display them. How can i

Re: Fetching Query Results from Solr

2009-08-20 Thread bhaskar chandrasekar
  Hi,   Which Java class needs to be modified to get the exact search results in Solr. either   1) SearchServlet.java 2) SolrQuerySession.java 3) SolrQuery.java   and where it should be modified.   Scenario: I am using Solr to retrive records and display them thru carrot. Assuming that i give

Re: Fetching Query Results from Solr

2009-08-20 Thread Erik Hatcher
I think you need to elaborate a bit more ... I don't understand what you're asking. Exact word search only? What is not working as you'd like/expect currently? Erik On Aug 20, 2009, at 7:35 AM, bhaskar chandrasekar wrote: Hi, Which Java class needs to be modified to get the

Group by field in Solr

2009-08-20 Thread Daniel Löfquist
Hello, I'm trying to accomplish something akin to GROUP BY in SQL but in Solr. I have an index full of songs (one song per document in Solr) by various artists and I would like to construct a search that gives me all of the artists back, one row per artist. The current search returns one row

Re: Group by field in Solr

2009-08-20 Thread Constantijn Visinescu
You'll want to use faceting, try to use a query like this http://localhost:8080/Solr/select/?q=artist%3Awarversion=2.2start=0rows=0facet=truefacet.limit=-1facet.field=artist replace localhost:8080 with your own :) On Thu, Aug 20, 2009 at 2:40 PM, Daniel Löfquist daniel.lofqu...@it.cdon.com

Re: Is wildcard search not correctly analyzed at query?

2009-08-20 Thread Alexander Herzog
It seems like the analyzer/filter isn't affected at all, since the query http://localhost:8983/solr/select/?q=PhysicalDescription:nü*debugQuery=true does not return a str name=parsedqueryPhysicalDescription:nu*/str as I would expect. So can I just have a you're right, wildcard search is passed

Adding a prefix to fields

2009-08-20 Thread ahammad
Hello, Is it possible to add a prefix to the data in a Solr field? For example, right now, I have a field called id that gets data from a DB through the DataImportHandler. The DB returns a 4-character string like ag5f. Would it be possible to add a prefix to the data that is received? In this

Re: Adding a prefix to fields

2009-08-20 Thread Shalin Shekhar Mangar
On Thu, Aug 20, 2009 at 7:07 PM, ahammad ahmed.ham...@gmail.com wrote: Hello, Is it possible to add a prefix to the data in a Solr field? For example, right now, I have a field called id that gets data from a DB through the DataImportHandler. The DB returns a 4-character string like ag5f.

Solr Range Query Anomalities

2009-08-20 Thread johan . sjoberg
Hi, we're performing range queries of a field which is of type double. Some queries which should generate results does not, and I think it's best explained by the following examples; it's also expected to exist data in all ranges: ?q=field:[10.0 TO 20.0] // OK ?q=field:[9.0 TO 20.0] // NOT OK

Re: Solr Range Query Anomalities

2009-08-20 Thread Shalin Shekhar Mangar
On Thu, Aug 20, 2009 at 7:37 PM, johan.sjob...@findwise.se wrote: Hi, we're performing range queries of a field which is of type double. Some queries which should generate results does not, and I think it's best explained by the following examples; it's also expected to exist data in all

Re: Solr Range Query Anomalities

2009-08-20 Thread Andrew Clegg
Try a sdouble or sfloat field type? Andrew. johan.sjoberg wrote: Hi, we're performing range queries of a field which is of type double. Some queries which should generate results does not, and I think it's best explained by the following examples; it's also expected to exist data in

Re: Solr Range Query Anomalities

2009-08-20 Thread Øystein F. Steimler
On Thursday 20 August 2009 16:07, johan.sjob...@findwise.se wrote: we're performing range queries of a field which is of type double. Some queries which should generate results does not, and I think it's best explained by the following examples; it's also expected to exist data in all ranges:

Re: Solr Range Query Anomalities [Solved]

2009-08-20 Thread johan . sjoberg
SortableDoubleField works excellent, haven't tried TrieField though. Thanks for the super fast support everyone. Regards, Johan On Thursday 20 August 2009 16:07, johan.sjob...@findwise.se wrote: we're performing range queries of a field which is of type double. Some queries which should

Re: Wildcard seaches?

2009-08-20 Thread Andrew Clegg
Paul Tomblin wrote: Is there such a thing as a wildcard search? If I have a simple solr.StrField with no analyzer defined, can I query for foo* or foo.* and get everything that starts with foo such as 'foobar and foobaz? Yes. foo* is fine even on a simple string field. Andrew. --

Remove data from index

2009-08-20 Thread clico
Hello I'm trying a way to do that : I index a db query like select id from table_documents Some documents are updated or deleted from the data table. Using DIH, I can indexe the updated document But I want to remove from the index the documents that were removed in the database. How

Re: Remove data from index

2009-08-20 Thread clico
I hope it could be a solution. But I think I understood that u can use deletePkQuery like this select document_id from table_document where statusDeleted= 'Y' In my case I have no status like statusDeleted. The request I would like to write is Delete from my solr Index the id that are no

Re: Remove data from index

2009-08-20 Thread Constantijn Visinescu
You could write a solr query that queries *:* and only returns the id field and then throw out all the IDs from select id from databaseTable and then run a delete query for all the IDs that are left after wards. However you'd have to write a seperate program/script to do this i think as the DIH

Re: Remove data from index

2009-08-20 Thread Marc Sturlese
As far as I know you can not do that with DIH. What size is your index? Probably the best you can do is index from scratch again with full-import. clico wrote: I hope it could be a solution. But I think I understood that u can use deletePkQuery like this select document_id from

Re: How to reduce the Solr index size..

2009-08-20 Thread Grant Ingersoll
On Aug 20, 2009, at 11:00 AM, Silent Surfer wrote: Hi, I am newbie to Solr. We recently started using Solr. We are using Solr to process the server logs. We are creating the indexes for each line of the logs, so that users would be able to do a fine grain search upto second/ms. Now

Re: Wildcard seaches?

2009-08-20 Thread Paul Tomblin
On Thu, Aug 20, 2009 at 10:51 AM, Andrew Cleggandrew.cl...@gmail.com wrote: Paul Tomblin wrote: Is there such a thing as a wildcard search?  If I have a simple solr.StrField with no analyzer defined, can I query for foo* or foo.* and get everything that starts with foo such as 'foobar and

Solr Quoted search confusions

2009-08-20 Thread Vannia Rajan
Hi,* *I need some help to clarify how solr indexes documents. I have 6 documents with various forms of the word ilike (complete word and not i like) - one having ilike as such and others having a special character in between i and like. What i expected from solr is that, when i do a Quoted

Re: Remove data from index

2009-08-20 Thread Noble Paul നോബിള്‍ नोब्ळ्
did you see the deletedPkQuery? On Thu, Aug 20, 2009 at 8:27 PM, clicocl...@mairie-marseille.fr wrote: Hello I'm trying a way to do that : I index a db query like  select id from table_documents Some documents are updated or deleted from the data table. Using DIH, I can indexe the

How to reduce the Solr index size..

2009-08-20 Thread Silent Surfer
Hi, I am newbie to Solr. We recently started using Solr. We are using Solr to process the server logs. We are creating the indexes for each line of the logs, so that users would be able to do a fine grain search upto second/ms. Now what we are observing is , the index size that is being

Re: Solr Quoted search confusions

2009-08-20 Thread Chris Male
Hi, What analyzers/filters have you configured for the field that you are searching? One could be causing the various versions of ilike to be indexed the same way. Thanks Chris On Thu, Aug 20, 2009 at 5:29 PM, Vannia Rajan kvanniara...@gmail.comwrote: Hi,* *I need some help to clarify how

EmbeddedSolrServer restart

2009-08-20 Thread Ron Chan
Is it possible to restart an EmbeddedSolrServer using code without having to stop and start the holding application? -- View this message in context: http://www.nabble.com/EmbeddedSolrServer-restart-tp25065189p25065189.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: EmbeddedSolrServer restart

2009-08-20 Thread Mark Miller
Ron Chan wrote: Is it possible to restart an EmbeddedSolrServer using code without having to stop and start the holding application? Reload the core? -- - Mark http://www.lucidimagination.com

Re: EmbeddedSolrServer restart

2009-08-20 Thread Ron Chan
would that be the reload method in CoreContainer? will this pick up changes in schema.xml? Thanks markrmiller wrote: Ron Chan wrote: Is it possible to restart an EmbeddedSolrServer using code without having to stop and start the holding application? Reload the core? -- -

Re: EmbeddedSolrServer restart

2009-08-20 Thread Mark Miller
Yes and yes. -- - Mark http://www.lucidimagination.com Ron Chan wrote: would that be the reload method in CoreContainer? will this pick up changes in schema.xml? Thanks markrmiller wrote: Ron Chan wrote: Is it possible to restart an EmbeddedSolrServer using code without

Re: Is wildcard search not correctly analyzed at query?

2009-08-20 Thread Avlesh Singh
Wildcard queries are not analyzed by Lucene and hence the behavior. A similar thread earlier - http://www.lucidimagination.com/search/document/a6b9144ecab9d0ff/search_phrase_wildcard Cheers Avlesh On Thu, Aug 20, 2009 at 7:03 PM, Alexander Herzog herz...@ait.co.at wrote: It seems like the

can solr accept other tag other than field?

2009-08-20 Thread Elaine Li
Hi, I am new solr user. I want to use solr search to run query against many xml files I have. I have set up the solr server to run query against the example files. One problem is my xml does not have field tag and name attribute. My format is rather easy: doc segment id=1/segment segment

Re: can solr accept other tag other than field?

2009-08-20 Thread Andrew Clegg
You can use the Data Import Handler to pull data out of any XML or SQL data source: http://wiki.apache.org/solr/DataImportHandler Andrew. Elaine Li wrote: Hi, I am new solr user. I want to use solr search to run query against many xml files I have. I have set up the solr server to

where to get solr 1.4 nightly

2009-08-20 Thread Joe Calderon
i want to try out the improvements in 1.4 but the nightly site is down http://people.apache.org/builds/lucene/solr/nightly/ is there a mirror for nightlies? --joe

Re: where to get solr 1.4 nightly

2009-08-20 Thread Shalin Shekhar Mangar
On Thu, Aug 20, 2009 at 11:31 PM, Joe Calderon calderon@gmail.comwrote: i want to try out the improvements in 1.4 but the nightly site is down http://people.apache.org/builds/lucene/solr/nightly/ Yeah, its going to be down for 24 hours. is there a mirror for nightlies? No, but you

Implementing customized Scorer with solr API 1.4

2009-08-20 Thread Jérôme Etévé
Hi all, I'm kind of struggling with a customized lucene.Scorer of mine, since I use solr 1.4. Here's the problem: I wrote a DocSetQuery which inherit from a lucene.Query. This query is a decorator for a lucene.Query that filters out the documents which are not in a given set of predefined

WordDelimiterFilter to QueryParser to MultiPhraseQuery?

2009-08-20 Thread jOhn
If you have several tokens, for example after a WordDelimiterFilter, there is almost no way NOT to trigger a MultiPhraseQuery when you have catenateWords=1 or catenateAll=1. For example the title: Jokers Wild In the index it is: jokers wild, jokers, wild, jokerswild. When you query jOkerswild

solr and approximate string matching

2009-08-20 Thread Ryszard Szopa
Hi, I've been using Solr for some time in the simplest possible way (as a backend to a search engine for English documents) and I've been really happy about it. However, now I need to do something which is a bit non-standard, and unfortunately I am desperately stuck. To make things more

Re: Implementing customized Scorer with solr API 1.4

2009-08-20 Thread Mark Miller
You might be interested in this issue: http://issues.apache.org/jira/browse/LUCENE-1821 -- - Mark http://www.lucidimagination.com Jérôme Etévé wrote: Hi all, I'm kind of struggling with a customized lucene.Scorer of mine, since I use solr 1.4. Here's the problem: I wrote a

Re: Implementing customized Scorer with solr API 1.4

2009-08-20 Thread Jason Rutherglen
We should probably move to using Lucene's Filters/DocIdSets instead of DocSets and merge the two. Then we will not need to maintain two separate but similar and confusing functionality classes. This will make seamlessly integrating searching with Solr's Filters/DocSets into Lucene's new per

Re: Implementing customized Scorer with solr API 1.4

2009-08-20 Thread Mark Miller
Jérôme Etévé wrote: Hi all, I'm kind of struggling with a customized lucene.Scorer of mine, since I use solr 1.4. Here's the problem: I wrote a DocSetQuery which inherit from a lucene.Query. This query is a decorator for a lucene.Query that filters out the documents which are not in a

Common Solr Question

2009-08-20 Thread darniz
Hi i have some basic yet important question about solr and that too with terminology. I want to be crystal clear about it. Please answer the following questions. Question 1 1) Incoming queries are processed by the appropriate SolrRequestHandler. For the purposes of this discussion, you will use

RE: [ANNOUNCEMENT] Newly released book: Solr 1.4 Enterprise Search Server

2009-08-20 Thread Chenini, Mohamed
Hi, Is there any promotional code I may use to get a discount? Thanks, Mohamed -Original Message- From: Smiley, David W. [mailto:dsmi...@mitre.org] Sent: Wednesday, August 19, 2009 12:38 AM To: solr-user@lucene.apache.org Subject: RE: [ANNOUNCEMENT] Newly released book: Solr 1.4

Multi-shard query with error on one shard

2009-08-20 Thread Phillip Farber
What will the client receive from the primary solr instance if that instance doesn't get HTTP 200 from all the shards in a multi-shard query? Thanks, Phil

WordDelimiterFilter misunderstanding

2009-08-20 Thread jOhn
I've misunderstood WordDelimiterFilter. You might think that catenateAll=1 would append the full phrase (sans delimiters) as an OR against the query. So jOkersWild would produce: j (okers wild) OR jokerswild But you thought wrong. Its actually: j (okers wild jokerswild) Which is confusing

Re: WordDelimiterFilter misunderstanding

2009-08-20 Thread Yonik Seeley
This is unfortunately outside the scope of what filters can currently do at query time. This is why the example schema has WordDelimiterFilter only producing subwords at query time (not catenating them). -Yonik http://www.lucidimagination.com On Thu, Aug 20, 2009 at 5:29 PM,

Re: Embedded Solr Clustering

2009-08-20 Thread Chris Hostetter
: Yes we are using Solr for a non-traditional search purpose and the : performance is critical. However it sounds like that sharing the same index : could slow down reading / writing to the index. And access synchronization : is tricky as well. no matter how you use Solr (HTTP or Embedded) only

Snapshot backups with new replication

2009-08-20 Thread KaktuChakarabati
Hey, I was wondering if there is any equivalent in new in-process replication to what could previously be achieved by running snapcleaner -Nnumber which would essentially allow me to keep backups of N latest indices pulls on a search node. This is of course very important for failover operation

Facet filtering

2009-08-20 Thread Asif Rahman
Is there any way to assign metadata to terms in a field and then filter on that metadata when using that field as a facet? For example, I have a collection of news articles in my index. Each article has a field that contains tags based on the topics discussed in the article. An article might

Re: Retrieving the boost factor using Solrj CommonsHttpSolrServer

2009-08-20 Thread Chris Hostetter
: Subject: Retrieving the boost factor using Solrj CommonsHttpSolrServer : References: : e3cd93650907311406r5261c055yfa2676b864405...@mail.gmail.com957081.80086 : @web50309.mail.re2.yahoo.come3cd93650908010915j162baaddved542c8482d8e :

Re: Facet filtering

2009-08-20 Thread Uri Boness
Another solution is to use hierachical values. So for example, instead of having a Barack Obama value you'll have person/Barak Obama. To filter on a person you can just use wildcards (e.g. person/*). Asif Rahman wrote: Is there any way to assign metadata to terms in a field and then filter on

Re: Facet filtering

2009-08-20 Thread Avlesh Singh
You can use a dynamic field called tag_*. If a patch for SOLR 247https://issues.apache.org/jira/browse/SOLR-247gets committed, you can perform a facet query like facet.field=tag_*. Cheers Avlesh On Fri, Aug 21, 2009 at 3:21 AM, Asif Rahman a...@newscred.com wrote: Is there any way to assign

Re: Overview of Query Parsing API Stack? / Dismax parsing, new 1.4 parsing, etc.

2009-08-20 Thread Chris Hostetter
: Subject: Overview of Query Parsing API Stack? / Dismax parsing, : new 1.4 parsing, etc. Oh, what i would give for time to sit and document in depth how some of this stuff works (assuming i first had time to verify that it really does work the way i think) The nutshell answer is that as

Re: Question mark glyphs in indexed content

2009-08-20 Thread Chris Hostetter
: Hello, I am using the latest Solr4j to index content. When I look at : that content in the Solr Admin web utility I see weird characters like : this: : : http://brockwine.com/images/solrglyphs.png : : When I look at the text in the MySQL DB those chars appear to just be : plain hyphens. The

Re: Common Solr Question

2009-08-20 Thread Uri Boness
Hi, 1. that change you made should work. Just remember that request parameters (query string parameters) override the configured defaults. 2. That is correct 3. not quite sure what you mean by that. 4. I guess you're asking in your statement is correct... it is. I think you should have a look

Results from Solr

2009-08-20 Thread bhaskar chandrasekar
Hi,   Can some one help me with the below situation? To elaborate more on this. Assuming i give BHASKAR as input string. Scenario 1: It should give me search results pertaining to BHASKAR only. Select * from MASTER where name =Bhaskar; Example:It should not display search results as

Re: Overview of Query Parsing API Stack? / Dismax parsing, new 1.4 parsing, etc.

2009-08-20 Thread Yonik Seeley
On Thu, Aug 20, 2009 at 10:16 PM, Chris Hostetterhossman_luc...@fucit.org wrote: coming in Lucene 2.9 (which is what Solr 1.4 will use) is a completley new QueryParser framework, which (i'm told) is suppose to make it much easier to create custom query parser syntaxs, I've quickly looked, but

Re: Using Lucene's payload in Solr

2009-08-20 Thread Chris Hostetter
: of the field are correct but the delimiter and payload are stored so they : appear in the response also. Here is an example: ... : I am thinking maybe I can do this instead when indexing: : : XML for indexing: : field name=title payload=2.0Solr In Action/field : : This will simplify

Re: Passing a Cookie in SolrJ

2009-08-20 Thread Chris Hostetter
: but I can't see an easy way to be able to pass a cookie with the request. : The cookie is needed to be able to get through the SSO layer but will just Unless i'm remembering wrong, and HttpClient instance will manage cookies for you, so why not just document how your users can use an

Re: Solr Range Query Anomalities

2009-08-20 Thread Chris Hostetter
: Subject: Solr Range Query Anomalities : In-Reply-To: 42aac72d6e244561cb364739bf3c7517.squir...@webmail01.one.com : References: 42aac72d6e244561cb364739bf3c7517.squir...@webmail01.one.com http://people.apache.org/~hossman/#threadhijack Thread Hijacking on Mailing Lists When starting a new

Re: Results from Solr

2009-08-20 Thread Stephen Weiss
If I'm not mistaken, you should index name as field type string - right now you are probably using text so it is tokenizing on the Uppercase characters. If you use string type this shouldn't happen. You could use a copyField to make a separate name_string field so that you can do both #1

Re: Results from Solr

2009-08-20 Thread Avlesh Singh
Or maybe tweak the splitOnCaseChange property in the WordDelimiterFilterFactory for the text field. Cheers Avlesh On Fri, Aug 21, 2009 at 8:46 AM, Stephen Weiss swe...@stylesight.comwrote: If I'm not mistaken, you should index name as field type string - right now you are probably using text

Re: Remove data from index

2009-08-20 Thread Noble Paul നോബിള്‍ नोब्ळ्
On Thu, Aug 20, 2009 at 8:39 PM, clicocl...@mairie-marseille.fr wrote: I hope it could be a solution. But I think I understood that u can use deletePkQuery like this select document_id from table_document where statusDeleted= 'Y' In my case I have no status like statusDeleted. I don't

Implementing a logout

2009-08-20 Thread Rahul R
Hello, Can somebody give me some pointers on the Solr objects I need to clean up/release while doing a logout on a Solr Application. I find that only the SolrCore object has a close() method. I typically do a lot of faceting queries on a large dataset with my application. I am using Solr 1.3.0.

Re: Is wildcard search not correctly analyzed at query? [solved]

2009-08-20 Thread Alexander Herzog
Hi Thanks for the info! best, Alexander Avlesh Singh schrieb: Wildcard queries are not analyzed by Lucene and hence the behavior. A similar thread earlier - http://www.lucidimagination.com/search/document/a6b9144ecab9d0ff/search_phrase_wildcard Cheers Avlesh On Thu, Aug 20, 2009 at

Re: Solr Quoted search confusions

2009-08-20 Thread Vannia Rajan
Hi, On Thu, Aug 20, 2009 at 9:13 PM, Chris Male gento...@gmail.com wrote: Hi, What analyzers/filters have you configured for the field that you are searching? One could be causing the various versions of ilike to be indexed the same way. I'm using text field with the following analyzers

Re: Snapshot backups with new replication

2009-08-20 Thread Noble Paul നോബിള്‍ नोब्ळ्
cleaning of snapshots is not a feature in the current version. probably a feature can be added On Fri, Aug 21, 2009 at 3:11 AM, KaktuChakarabatijimmoe...@gmail.com wrote: Hey, I was wondering if there is any equivalent in new in-process replication to what could previously be achieved by

Re: how do i - include the items without a facet

2009-08-20 Thread Chris Hostetter
: location_name is a text field , copyto puts it in facet.location_name : : i'm thinking this could be because the field was not entered as NULL but an : empty string ? assuming facet.location_name is a StrField then that would certainly be your problem -- because the empty string is a

Re: Remove data from index

2009-08-20 Thread Chris Hostetter
: The request I would like to write is : : Delete from my solr Index the id that are no longer present in my : table_document : : With Lucene I had a way to do that : : open IndexReader, : for each lucene document : check in table_document and remove in lucene : index if document is no