payload queries running slow

2009-12-17 Thread Raghuveer Kancherla
Hi, With help from the group here, I have been able to set up a search application with payloads enabled. However, there is a noticeable increase in query response times with payloads as compared to the same queries without payloads. I am also seeing a lot more disk IO (I have a 7200 rpm disk) and

Problem with words thats amost similar

2009-12-17 Thread Steinar Asbjørnsen
Hi all. I have a delicate problem when it comes to two words that are rather similar in the way they are typed, but when it comes to the meaning of the word they are completely different. The actual words are restaurant (as in restaurant) and restaurering (as in restoration). Solr seems to

Re: Problem with words thats amost similar

2009-12-17 Thread Shalin Shekhar Mangar
2009/12/17 Steinar Asbjørnsen steinar...@gmail.com Hi all. I have a delicate problem when it comes to two words that are rather similar in the way they are typed, but when it comes to the meaning of the word they are completely different. The actual words are restaurant (as in restaurant)

shards parameter

2009-12-17 Thread pcurila
Hello, is there any way to configure shards parameter in solrconfig.xml? So I do not need provide it in the url. Thanks Peter -- View this message in context: http://old.nabble.com/shards-parameter-tp26826908p26826908.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: shards parameter

2009-12-17 Thread Noble Paul നോബിള്‍ नोब्ळ्
yes. put it under the defaults section in your standard requesthandler. On Thu, Dec 17, 2009 at 5:22 PM, pcurila p...@eea.sk wrote: Hello, is there any way to configure shards parameter in solrconfig.xml? So I do not need provide it in the url. Thanks Peter -- View this message in context:

Re: Problem with words thats amost similar

2009-12-17 Thread Steinar Asbjørnsen
Den 17. des. 2009 kl. 12.42 skrev Shalin Shekhar Mangar: 2009/12/17 Steinar Asbjørnsen steinar...@gmail.com Hi all. I have a delicate problem when it comes to two words that are rather similar in the way they are typed, but when it comes to the meaning of the word they are completely

Re: Problem with words thats amost similar

2009-12-17 Thread Shalin Shekhar Mangar
2009/12/17 Steinar Asbjørnsen steinar...@gmail.com Den 17. des. 2009 kl. 12.42 skrev Shalin Shekhar Mangar: For specific cases like this, you can add the word to a file and specify it in schema, for example: filter class=solr.SnowballPorterFilterFactory language=English

Adaptive search?

2009-12-17 Thread Siddhant Goel
Hi, Does Solr provide adaptive searching? Can it adapt to user clicks within the search results it provides? Or that has to be done externally? I couldn't find anything on googling for it. Thanks, -- - Siddhant

Solr Web Courses

2009-12-17 Thread Tommy Molto
Hi, I need to train a team in Solr. Since we are in Brazil, and i didnt find any course about it around here, do you have any recommendation of some web course of Solr? Att,

Re: regarding QParserPlugin

2009-12-17 Thread Grant Ingersoll
I'd probably use the logging stuff instead of System.out, maybe the stream isn't be flushed as expected. On Dec 16, 2009, at 8:29 AM, gudumba l wrote: Hello all, I am trying to use query parser plugin feature of solr. But its really strange that everytime its behaving in a

Re: payload queries running slow

2009-12-17 Thread Grant Ingersoll
On Dec 17, 2009, at 4:52 AM, Raghuveer Kancherla wrote: Hi, With help from the group here, I have been able to set up a search application with payloads enabled. However, there is a noticeable increase in query response times with payloads as compared to the same queries without payloads. I

Dynamically changing the stored-state of a dynamicField

2009-12-17 Thread Jan-Simon Winkelmann
Hi, i am currently building a Solr configuration for a rather large search index. To allow for indexing of differently named fields for each dataset, i have includ the following dynamicField: dynamicField name=*_index type=text indexed=true stored=true required=false / What i don't like about

Re: Adaptive search?

2009-12-17 Thread Paul Libbrecht
What can it mean to adapt to user clicks ? Quite many things in my head. Do you have maybe a citation that inspires you here? paul Le 17-déc.-09 à 13:52, Siddhant Goel a écrit : Does Solr provide adaptive searching? Can it adapt to user clicks within the search results it provides? Or

Re: debugging javascript DIH

2009-12-17 Thread Luca Molteni
Of course you can. Just use remote debugging. Which app server are you using? IDE? L.M. 2009/12/3 Joel Nylund jnyl...@yahoo.com: is there a way to print to std out or anything from my javascript DIH transformer? thanks Joel

Re: Solr Web Courses

2009-12-17 Thread Koji Sekiguchi
Tommy Molto wrote: Hi, I need to train a team in Solr. Since we are in Brazil, and i didnt find any course about it around here, do you have any recommendation of some web course of Solr? Att, It seems that Lucid Imagination provides Online Training courses for Solr:

Upgrade from 1.2 to 1.4

2009-12-17 Thread ERIC TREECE
Has anyone successfully upgraded from 1.2 to 1.4 by just changing the war file? When we tried this we could not longer sort to retrieve the highest numbered date/id in our index. Previously with 1.2 the following would return the most recent data and now it returns data from around the same

Re: Adaptive search?

2009-12-17 Thread Siddhant Goel
Let say we have a search engine (a simple front end - web app kind of a thing - responsible for querying Solr and then displaying the results in a human readable form) based on Solr. If a user searches for something, gets quite a few search results, and then clicks on one such result - is there

Re: Solr Web Courses

2009-12-17 Thread Tommy Molto
Yes, i entered in contact with them, i'm waiting an answer. But i was looking for a plan b too, if we cant close the deal with them. Att, On Thu, Dec 17, 2009 at 12:55 PM, Koji Sekiguchi k...@r.email.ne.jp wrote: Tommy Molto wrote: Hi, I need to train a team in Solr. Since we are in

Re: Can solr do the equivalent of select distinct(field)?

2009-12-17 Thread Aleksander Stensby
A follow up question on this Hoss: If I have a set of documents, let's say this email thread. Each email has a unique author. All emails in the thread are indexed with threadid=33 If I want to count the number of unique authors in this email thread, I could go along the lines you mention at the

Re: Can solr do the equivalent of select distinct(field)?

2009-12-17 Thread Aleksander Stensby
Forgot to add facet.mincount=1, obviously. But still, is this the only or prefered way of doing something along these lines? Or is there a different (better) approach? Best regards, Aleksander On Thu, Dec 17, 2009 at 5:59 PM, Aleksander Stensby aleksander.sten...@integrasco.com wrote: A

Re: Dynamically changing the stored-state of a dynamicField

2009-12-17 Thread Erik Hatcher
You'll have to do it the way you mentioned, with two dynamic field mappings. Erik On Dec 17, 2009, at 8:33 AM, Jan-Simon Winkelmann wrote: Hi, i am currently building a Solr configuration for a rather large search index. To allow for indexing of differently named fields for each

Re: Upgrade from 1.2 to 1.4

2009-12-17 Thread Erik Hatcher
Eric, The sort changed, so it is now a separate parameter. q=sort=... The query;sort syntax has been deprecated. Be sure to reach Solr's CHANGES.txt when upgrading. There's this section the upgrading from 1.3 (which would apply when going from 1.2 also!) mentioning this: The

Re: solr core size on disk

2009-12-17 Thread Matthieu Labour
Paul Thank you for your reply I did du -sh in /solr_env/index/data and it shows 36G It is distributed among 700 cores with most of them being 150M Is that a big index that should be sharded ? 2009/12/17 Noble Paul നോബിള്‍ नोब्ळ् noble.p...@corp.aol.com look at the index dir and see the size

Re: Can solr do the equivalent of select distinct(field)?

2009-12-17 Thread Erik Hatcher
On Dec 17, 2009, at 11:59 AM, Aleksander Stensby wrote: A follow up question on this Hoss: If I have a set of documents, let's say this email thread. Each email has a unique author. All emails in the thread are indexed with threadid=33 If I want to count the number of unique authors in this

Re: solr core size on disk

2009-12-17 Thread Erik Hatcher
Sharding isn't necessarily decided upon by index size. Is your search performance ok? Got enough free disk space to optimize? Then don't shard. But no, 150M is not a large index size. 700 cores, now that's a lot! Erik On Dec 17, 2009, at 1:27 PM, Matthieu Labour wrote: Paul

Re: Can solr do the equivalent of select distinct(field)?

2009-12-17 Thread Aleksander Stensby
Thanks for your reply Erik! The speed of my suggested query is actually very fast once we add the facet.mincount=1 (when searching within a limited set of documents). The set-back seem to be in the sharding of our data.. And that puzzles me a little bit... I can't really see why SOLR is so slow

can't start solr by java -jar start.jar

2009-12-17 Thread Jill Han
I downloaded solr1.4, and unzipped it at C:\apache-solr-1.4.0 . At C:\apache-solr-1.4.0\example, I try to run solr by java -jar start.jar as instructed by Solr tutorial. However, it failed. C:\apache-solr-1.4.0\examplejava -jar start.jar java.lang.NullPointerException at

Re: Searching .msg files

2009-12-17 Thread Lance Norskog
As to the indexing part: This is an automated document input tool: http://wiki.apache.org/solr/DataImportHandler This is a plugin for it that pulls mail from an IMAP server: http://wiki.apache.org/solr/MailEntityProcessor This is a comment about microsoft MSG files and parsing in Java:

Re: store content only of documents

2009-12-17 Thread javaxmlsoapdev
Anyone? javaxmlsoapdev wrote: I store document in a field content field defiend as follow in schema.xml field name=content type=text indexed=true stored=true multiValued=true/ and following in solrconfig.xml requestHandler name=/update/extract

Re: Document model suggestion

2009-12-17 Thread Lance Norskog
Role-based authentication is one level of sophistication up from user-based authentication. Users can have different roles, and authentication goes against roles. Documents with multiple viewers would be assigned special roles. All users would also have their own matching role. On Tue, Dec 15,

Re: Document model suggestion

2009-12-17 Thread caman
Are you suggesting that roles should be maintained in the index? We do manage out authentication based on roles but at granular level, user rights play a big role as well. I know we need to compromise, just need to find a balance. Thanks Lance Norskog-2 wrote: Role-based authentication is

RE: Question about updateRequestProcessorChain

2009-12-17 Thread Manepalli, Kalyan
I specified another updateRequestChain with out the custom updateProcessor and it worked. Thanks for the quick help. -Kalyan -Original Message- From: noble.p...@gmail.com [mailto:noble.p...@gmail.com] On Behalf Of Noble Paul ??? ?? Sent: Thursday, December 17, 2009 5:59 AM

Re: Spellchecking - Is there a way to do this?

2009-12-17 Thread Lance Norskog
Character-based NGrams are a good tool for this problem. MLT is a document-wide numerical analysis. If the common types of OCR mistakes are different than what NGrams create, you might tune the ngram generator. For example, swapping letters might not happen very often. SIngle- and multi-word

Re: Converting java date to solr date and querying dates

2009-12-17 Thread Lance Norskog
Or add a new field type to Solr. This would accept a date and include a Java formatting string. The field would parse the date, print it with that format string, and store the results. You could use any aspect of the date. On Tue, Dec 15, 2009 at 1:06 PM, Chris Hostetter hossman_luc...@fucit.org

Re: Simple Wildcard Search Question

2009-12-17 Thread Erick Erickson
I think your problem is WordDelimiterFilterFactory. For reference, see: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactoryAs I understand it, your input name01 gets

Re: shards parameter

2009-12-17 Thread Yonik Seeley
You're setting up an infinite loop by adding a shards parameter on the default search handler. Create a new search handler and put your default under that. -Yonik http://www.lucidimagination.com On Thu, Dec 17, 2009 at 7:47 AM, pcurila p...@eea.sk wrote: I tried it out. But there is another

Re: Spellchecking - Is there a way to do this?

2009-12-17 Thread Lance Norskog
Another thing you might check into is stemming. The Porter stemmer included in Solr is aggressive, meaning that it will tend to do weird things with misspellings. There is a different stemmer called KStem which is available from www.lucidimagination.com/Downloads is less aggressive. Porter turns

Re: Searching .msg files

2009-12-17 Thread javaxmlsoapdev
1)use tika to index .msg files (Tika does support Microsoft outlook format and I am already using Tika: http://lucene.apache.org/tika/formats.html). 2)while indexing you'll have to write handler to extract To, CC, Bcc values and store it in a separate field in index. 3)when user searches on .msg

Re: shards parameter

2009-12-17 Thread Jacob Elder
If the goal is to save time when using the admin interface, you can just add this to conf/admin-extra.html: script src=http://www.google.com/jsapi;/script script google.load(prototype, 1.6); /script script type=text/javascript Event.observe( window, 'load',

Re: dose solr sopport distribute index storage ?

2009-12-17 Thread Camilo Aguilar
HI! I have the same question Thanks in advance On Mon, Oct 12, 2009 at 1:55 PM, Pieter Steyn pieter...@gmail.com wrote: Sorry for the hijack, but s replication necessary when using a cluster file-system such as GFS2. Where the files are the same for any instance of Solr? On Mon, Oct 12,

Re: Results after using Field Collapsing are not matching the results without using Field Collapsing

2009-12-17 Thread Varun Gupta
After a lot of debugging, I finally found why the order of collapse results are not matching the uncollapsed results. I can't say if it is a bug in the implementation of fieldcollapse or not. *Explaination:* Actually, I am querying the fieldcollapse with some filters to restrict the collapsing to