Re: Search results after importing from Dih

2010-08-27 Thread Grijesh.singh
have u commited the data, use *:* query to see that data is commited yet or not -- View this message in context: http://lucene.472066.n3.nabble.com/Search-results-after-importing-from-Dih-tp1365720p1365927.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Search results after importing from Dih

2010-08-27 Thread hemant.verma
Check your index folder, does it contains files other than segment files? If yes then your data is in index, then you need to commit it. Try restart your solr. -- View this message in context: http://lucene.472066.n3.nabble.com/Search-results-after-importing-from-Dih-tp1365720p1366104.html Sent

catching SqlEXceptions in DataImport Time

2010-08-27 Thread kishan
HI all, iam using solr 1.4.0 recently i observed in my solr logs that because of the invalid userName i got java.sql.SQLException: Access denied for user '1234'@'localhost i resolved this but iam not able to capture this in my code . how can i catch the SqlEXceptions in Solr-DataImport

full control over norm values?

2010-08-27 Thread Michael Zach
Hello all, is there a way in solr to directly set the norm value for fields? I have been using index time boost values so far, but they just are one part of norm value calculation. How can I influence that? Kind regards Michael

Re: Search Results optimization

2010-08-27 Thread Hasnain
Thank you so much for valuable suggestions this is the query im using q=swingline red stapler hammer hand rigidfl=nameqt=standard1rows=35debugQuery=true here is my output, as you can see close to last records, Swingline Stapler scored more than Swingline Red Stapler - 747 series, but I want

Re: full control over norm values?

2010-08-27 Thread MitchK
Hi Micheal, have a look at SweetSpotSimilarity (Lucene). Kind regards, - Mitch -- View this message in context: http://lucene.472066.n3.nabble.com/full-control-over-norm-values-tp1366910p1367462.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Search results after importing from Dih

2010-08-27 Thread hemantverm...@gmail.com
check is there any file other than segments file. if yes then your data is in index, commit it. try restart solr. -- View this message in context: http://lucene.472066.n3.nabble.com/Search-results-after-importing-from-Dih-tp1365720p1368560.html Sent from the Solr - User mailing list archive at

Re: Document Section in Solr

2010-08-27 Thread Markus Jelsma
You cannot divide a document into sections as far as i know. You could, however, store divisions in different fields, if your use-case allows this, and retrieve only the fields that you need. This way you can avoid downloading 20MiB at once. On Friday 27 August 2010 11:26:05 maheshkumar wrote:

Re: Status of Solr in the cloud?

2010-08-27 Thread Markus Jelsma
That would be Solr 4.0, or maybe 3.1 first. http://wiki.apache.org/solr/Solr3.1 http://wiki.apache.org/solr/Solr4.0 On Thursday 26 August 2010 23:58:25 Charlie Jackson wrote: There seem to be a few parallel efforts at putting Solr in a cloud configuration. See

Re: Auto ID for Documents indexed

2010-08-27 Thread Markus Jelsma
No. Solr doesn't require a unique ID nor is an auto incrementing value really useful in indices spanning multiple machines. Maybe SOLR-308 could help you out but then the question remains, why would you need a feature like this? https://issues.apache.org/jira/browse/SOLR-308 On Friday 27

Re: Auto ID for Documents indexed

2010-08-27 Thread Grijesh.singh
No there is no such feature,your indexing component have to keep generation of ids -- View this message in context: http://lucene.472066.n3.nabble.com/Auto-ID-for-Documents-indexed-tp1367972p1369051.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Multiple passes with WordDelimiterFilterFactory

2010-08-27 Thread Markus Jelsma
It's just a configured filter, so you should be able to define it twice. Have you tried it? But it might be tricky, the output from the first will be the input of the second so i doubt the usefulness of this approach. On Thursday 26 August 2010 17:45:45 Shawn Heisey wrote: Can I pass my

Re: Auto ID for Documents indexed

2010-08-27 Thread Lucas F. A. Teixeira
Oh, wait. If you do not have to have sequential, but only unique keys, you can use the UUIDType. You can set SOLR to handle this and generate the field ids for you. Works great for me. []s, Lucas Frare Teixeira .·. - lucas...@gmail.com - lucastex.com.br - blog.lucastex.com -

Re: A few query issues with solr

2010-08-27 Thread hemantverm...@gmail.com
this link will help you: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory -- View this message in context: http://lucene.472066.n3.nabble.com/A-few-query-issues-with-solr-tp1358081p1369261.html Sent from the Solr - User mailing list archive at

Re: how to deal with virtual collection in solr?

2010-08-27 Thread Jan Høydahl / Cominvent
Hi, Version 1.4.1 does not support the SolrCloud style sharding. In 1.4.1, please use this style: shards=localhost:8983/solr/aaprivate,localhost:8983/solr/aapublic/ However, since schema is the same, I'd opt for one index with a collections field as the filter. You can add that field to your

Re: A few query issues with solr

2010-08-27 Thread Markus Jelsma
For solving the car/car-rent issue you'll need to add a SynonymFilter to your analyzer chain and configure it accordingly. On Friday 27 August 2010 13:40:15 hemantverm...@gmail.com wrote: this link will help you: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimi

Re: Creating new Solr cores using relative paths

2010-08-27 Thread Jan Høydahl / Cominvent
Yes, this is really a pain sometimes. I'd prefer a well defined base path, which could be assumed everywhere unless otherwise documented. SolrHome is one natural choice. For backward compat we could add a config in solr(config).xml to easily switch to old behaviour. Also, it makes sense to

multiple values

2010-08-27 Thread Brad Dewar
Hi In a multiValued field, I want to write a query that returns all the documents that actually have more than 1 value in that field. (don't care what the value is). Maybe this should be obvious, but I just can't think of how to formulate such a query. Any help? Thanks, Brad

Re: multiple values

2010-08-27 Thread Erik Hatcher
My recommendation is if you need to query on something, index it as you need... so in this case index another field with the number of values in that field. This is easy if you're writing a custom indexer, but maybe not so trivial if you're indexing other ways - so a custom update

RE: how to deal with virtual collection in solr?

2010-08-27 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
Thanks so much, I really appreciate your help! Have a great weekend! Xiaohui -Original Message- From: Jan Høydahl / Cominvent [mailto:jan@cominvent.com] Sent: Friday, August 27, 2010 7:42 AM To: solr-user@lucene.apache.org Subject: Re: how to deal with virtual collection in solr?

RE: how to deal with virtual collection in solr?

2010-08-27 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]
Thank you, Jan Høydahl. I used http://localhost:8983/solr/select?shards=localhost:8983/solr/aaprivate,localhost:8983/solr/aapublic/. I got a error Missing solr core name in path. I have aapublic and aaprivate cores. I also got a error if I used

Problem related to Sorting in Solr1.4

2010-08-27 Thread deepak agrawal
Hi, I have one Text fileld in our schema i want to do the sorting for that column. field name=TITLE type=text indexed=true stored=true / field name=UPDBY type=text indexed=true stored=true / I have these two columns i want to use the SORT for these two columns. any one can please suggest what

Re: Problem related to Sorting in Solr1.4

2010-08-27 Thread Markus Jelsma
What seems to be the problem? Did you consult the wiki on this matter? http://wiki.apache.org/solr/CommonQueryParameters#sort On Friday 27 August 2010 15:14:06 deepak agrawal wrote: Hi, I have one Text fileld in our schema i want to do the sorting for that column. field name=TITLE

Re: Problem related to Sorting in Solr1.4

2010-08-27 Thread kenf_nc
the 'text' fieldType is not suitable for sorting. You need to use the copyField directive in your schema and at indexing time copy the data to your TITLE and UPDBY fields, and you need to create 2 new fields: field name=TITLE_sort type=string indexed=true stored=true / field name=UPDBY_sort

Re: Searching for words with accented characters.

2010-08-27 Thread Muneeb Ali
Hey Robert, Just wondering if you ever got to solve this problem? We are facing a similar issue with our catalog search :( look forward to hearing from you. -Thanks, Muneeb -- View this message in context:

Re: Document Section in Solr

2010-08-27 Thread Erick Erickson
I don't understand your use-case behind this question. Even displaying 2M is still more than most users want. Can you explain what you're trying to accomplish? If you're trying to show just some context around the search terms, highlighting might help. Best Erick On Fri, Aug 27, 2010 at 4:04

Re: Multiple passes with WordDelimiterFilterFactory

2010-08-27 Thread Erick Erickson
I agree with Marcus, the usefulness of passing through WDF twice is suspect. You can always do a copyfield to a completely different field and do whatever you want there, copyfield forks the raw input to the second field, not the analyzed stream... What is it you're really trying to accomplish?

Private data within SOLR Schema

2010-08-27 Thread Manuel Portela
Hi all, I'm wondering how to handle following scenario: There is a pool of data which can be searched by all users via a web interface. Now some users would like to add own documents, or private fields to existing documents which shall not be accessable by others. Concerning private fields .. I

Re: Private data within SOLR Schema

2010-08-27 Thread kenf_nc
my feeling is that private fields in a public document will be the hardest nut to crack, unless you have an intermediary layer that users call instead of hitting your solr instance directly. If you front it with a web service you could handle various authorization scenarios a little easier.

Re: Private data within SOLR Schema

2010-08-27 Thread Lance Norskog
User security tends to change often. You may find it easier to use user/role security. You could create a unique role for a user's docs and store that role instead. You need a separate user-role database. Later, the user can choose to share docs with someone else and you would then change the

Surge 2010 Early Registration ends Tuesday!

2010-08-27 Thread Jason Dixon
Early Bird Registration for Surge Scalability Conference 2010 ends next Tuesday, August 31. We have a killer lineup of speakers and architects from across the Internet. Listen to experts talk about the newest methods and technologies for scaling your Web presence.

Re: Duplicating a Solr Doc

2010-08-27 Thread Lance Norskog
On further investigation: DocumentBuilder.loadStoredFields() is used in one utility function which is only called from one unit test. This should be considered dead code. Don't use it. SolrPluginUtils.docListToSolrDocument() SolrPluginUtilsTest.testDocListConversion() On Wed, Aug 25, 2010 at

Can I use an ExternalFileField as an input to a boost query?

2010-08-27 Thread Andy
I have a field popularity that is changing frequently. So I'd like to put it in an ExternalFileField. If I do that, can I still use popularity in a boosted query such as: {!boost b=log(popularity)}foo Thanks.

Re: sort by field length

2010-08-27 Thread Lance Norskog
You might be better off starting with the Lucene CheckIndex program. It walks all of the Lucene index data structures. I have done forensics by fiddling with the CheckIndex code. On Thu, Aug 26, 2010 at 9:11 AM, Shawn Heisey s...@elyograg.org wrote:  On 5/24/2010 6:30 AM, Sascha Szott wrote:

Re: spellcheck index blown away during rebuild

2010-08-27 Thread Lance Norskog
To make a dictionary with a 'minimum document count' you need to make the dictionary from the facets. Facets will create this for you; but will allocate memory for every last term. The last N facets will have the smallest # of terms. To get term counts for hundreds of millions of terms, I think