date:20100827

Re: Private data within SOLR Schema

2010-08-27 Thread Lance Norskog

A problem with this as recently surfaced: spelling suggestions. A spelling checker built from the index pulls all terms. You cannot give it a filter query. But, you don't want to show people words from documents they should not see. On Fri, Aug 27, 2010 at 12:01 PM, Lance Norskog wrote: > User se

Re: Problem related to Sorting in Solr1.4

2010-08-27 Thread Lance Norskog

Make the sort fields multiValued="false". This guarantees that the field is sortable. If there is one term more than the number of documents, you get an ArrayIndexOutOfBounds exception. Also, sorting works from the indexed terms, not the stored values. You don't need to store the _sort fields: O

Re: how to deal with virtual collection in solr?

2010-08-27 Thread Lance Norskog

For XML files that are not in the Solr document upload format, you would use the DataImportHandler. http://wiki.apache.org/solr/DataImportHandler#XPathEntityProcessor Look for the wikipedia example. It shows how to read XML files from disk. You give XPath expressions for different items in the XM

Re: multiple values

2010-08-27 Thread Lance Norskog

Would "number of values" be an interesting value source/function? Is it easy to do with a Lucene query? On Fri, Aug 27, 2010 at 5:25 AM, Erik Hatcher wrote: > My recommendation is if you need to query on something, index it as you > need... so in this case index another field with the number of v

Re: Searching for words with accented characters.

2010-08-27 Thread Lance Norskog

This was a 2-year-old question :) Have you made sure that UTF-8 character encoding is set in all phases of your project? Servlet container, XML input header, etc? Character encodings are hell to debug on Windows, so I would suggest checking it on Linux or a Mac. Since this is a one-character fumb

Re: spellcheck index blown away during rebuild

2010-08-27 Thread Lance Norskog

To make a dictionary with a 'minimum document count' you need to make the dictionary from the facets. Facets will create this for you; but will allocate memory for every last term. The last N facets will have the smallest # of terms. To get term counts for hundreds of millions of terms, I think yo

Re: sort by field length

2010-08-27 Thread Lance Norskog

You might be better off starting with the Lucene CheckIndex program. It walks all of the Lucene index data structures. I have done forensics by fiddling with the CheckIndex code. On Thu, Aug 26, 2010 at 9:11 AM, Shawn Heisey wrote: > On 5/24/2010 6:30 AM, Sascha Szott wrote: >> >> Hi folks, >> >

Can I use an ExternalFileField as an input to a boost query?

2010-08-27 Thread Andy

I have a field "popularity" that is changing frequently. So I'd like to put it in an ExternalFileField. If I do that, can I still use "popularity" in a boosted query such as: {!boost b=log(popularity)}foo Thanks.

Re: Duplicating a Solr Doc

2010-08-27 Thread Lance Norskog

On further investigation: DocumentBuilder.loadStoredFields() is used in one utility function which is only called from one unit test. This should be considered dead code. Don't use it. SolrPluginUtils.docListToSolrDocument() SolrPluginUtilsTest.testDocListConversion() On Wed, Aug 25, 2010 at 9:

Surge 2010 Early Registration ends Tuesday!

2010-08-27 Thread Jason Dixon

Early Bird Registration for Surge Scalability Conference 2010 ends next Tuesday, August 31. We have a killer lineup of speakers and architects from across the Internet. Listen to experts talk about the newest methods and technologies for scaling your Web presence. http://omniti.com/surge/2010/re

Re: Private data within SOLR Schema

2010-08-27 Thread Lance Norskog

User security tends to change often. You may find it easier to use user/role security. You could create a unique role for a user's docs and store that role instead. You need a separate user->role database. Later, the user can choose to share docs with someone else and you would then change the mapp

Re: Private data within SOLR Schema

2010-08-27 Thread kenf_nc

my feeling is that private fields in a public document will be the hardest nut to crack, unless you have an intermediary layer that users call instead of hitting your solr instance directly. If you front it with a web service you could handle various authorization scenarios a little easier. Priva

Private data within SOLR Schema

2010-08-27 Thread Manuel Portela

Hi all, I'm wondering how to handle following scenario: There is a pool of data which can be searched by all users via a web interface. Now some users would like to add own documents, or private fields to existing documents which shall not be accessable by others. Concerning private fields .. I

Re: Multiple passes with WordDelimiterFilterFactory

2010-08-27 Thread Erick Erickson

I agree with Marcus, the usefulness of passing through WDF twice is suspect. You can always do a copyfield to a completely different field and do whatever you want there, copyfield forks the raw input to the second field, not the analyzed stream... What is it you're really trying to accomplish? Yo

Re: Document Section in Solr

2010-08-27 Thread Erick Erickson

I don't understand your use-case behind this question. Even displaying 2M is still more than most users want. Can you explain what you're trying to accomplish? If you're trying to show just some context around the search terms, highlighting might help. Best Erick On Fri, Aug 27, 2010 at 4:04 AM,

Re: Searching for words with accented characters.

2010-08-27 Thread Muneeb Ali

Hey Robert, Just wondering if you ever got to solve this problem? We are facing a similar issue with our catalog search :( look forward to hearing from you. -Thanks, Muneeb -- View this message in context: http://lucene.472066.n3.nabble.com/Searching-for-words-with-accented-characters-tp4863

Re: Problem related to Sorting in Solr1.4

2010-08-27 Thread kenf_nc

the 'text' fieldType is not suitable for sorting. You need to use the copyField directive in your schema and at indexing time copy the data to your TITLE and UPDBY fields, and you need to create 2 new fields: then you Search on TITLE but Sort on TITLE_sort -- View this message in context: h

Re: Problem related to Sorting in Solr1.4

2010-08-27 Thread Markus Jelsma

What seems to be the problem? Did you consult the wiki on this matter? http://wiki.apache.org/solr/CommonQueryParameters#sort On Friday 27 August 2010 15:14:06 deepak agrawal wrote: > Hi, > > I have one Text fileld in our schema i want to do the sorting for that > column. > > > > > > I ha

Problem related to Sorting in Solr1.4

2010-08-27 Thread deepak agrawal

Hi, I have one Text fileld in our schema i want to do the sorting for that column. I have these two columns i want to use the SORT for these two columns. any one can please suggest what should i need to do for that. I am currently using Solr1.4. -- DEEPAK AGRAWAL +91-9379433455 GOOD LUCK...

RE: how to deal with virtual collection in solr?

2010-08-27 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]

Thank you, Jan Høydahl. I used http://localhost:8983/solr/select?&shards=localhost:8983/solr/aaprivate,localhost:8983/solr/aapublic/. I got a error "Missing solr core name in path". I have aapublic and aaprivate cores. I also got a error if I used http://localhost:8983/solr/aapublic/select?&s

RE: how to deal with virtual collection in solr?

2010-08-27 Thread Ma, Xiaohui (NIH/NLM/LHC) [C]

Thanks so much, I really appreciate your help! Have a great weekend! Xiaohui -Original Message- From: Jan Høydahl / Cominvent [mailto:jan@cominvent.com] Sent: Friday, August 27, 2010 7:42 AM To: solr-user@lucene.apache.org Subject: Re: how to deal with virtual collection in solr? Hi

Re: multiple values

2010-08-27 Thread Erik Hatcher

My recommendation is if you need to query on something, index it as you need... so in this case index another field with the number of values in that field. This is easy if you're writing a custom indexer, but maybe not so trivial if you're indexing other ways - so a custom update processo

multiple values

2010-08-27 Thread Brad Dewar

Hi In a multiValued field, I want to write a query that returns all the documents that actually have more than 1 value in that field. (don't care what the value is). Maybe this should be obvious, but I just can't think of how to formulate such a query. Any help? Thanks, Brad

Re: Creating new Solr cores using relative paths

2010-08-27 Thread Jan Høydahl / Cominvent

Yes, this is really a pain sometimes. I'd prefer a well defined base path, which could be assumed everywhere unless otherwise documented. SolrHome is one natural choice. For backward compat we could add a config in solr(config).xml to easily switch to old behaviour. Also, it makes sense to defi

Re: A few query issues with solr

2010-08-27 Thread Markus Jelsma

For solving the car/car-rent issue you'll need to add a SynonymFilter to your analyzer chain and configure it accordingly. On Friday 27 August 2010 13:40:15 hemantverm...@gmail.com wrote: > this link will help you: > http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimi > te

Re: how to deal with virtual collection in solr?

2010-08-27 Thread Jan Høydahl / Cominvent

Hi, Version 1.4.1 does not support the SolrCloud style sharding. In 1.4.1, please use this style: &shards=localhost:8983/solr/aaprivate,localhost:8983/solr/aapublic/ However, since schema is the same, I'd opt for one index with a "collections" field as the filter. You can add that field to yo

Re: A few query issues with solr

2010-08-27 Thread hemantverm...@gmail.com

this link will help you: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory -- View this message in context: http://lucene.472066.n3.nabble.com/A-few-query-issues-with-solr-tp1358081p1369261.html Sent from the Solr - User mailing list archive at Nabble.co

Re: Auto ID for Documents indexed

2010-08-27 Thread Lucas F. A. Teixeira

Oh, wait. If you do not have to have sequential, but only unique keys, you can use the UUIDType. You can set SOLR to handle this and generate the field ids for you. Works great for me. []s, Lucas Frare Teixeira .·. - lucas...@gmail.com - lucastex.com.br - blog.lucastex.com - twitter.com/lucaste

Re: Multiple passes with WordDelimiterFilterFactory

2010-08-27 Thread Markus Jelsma

It's just a configured filter, so you should be able to define it twice. Have you tried it? But it might be tricky, the output from the first will be the input of the second so i doubt the usefulness of this approach. On Thursday 26 August 2010 17:45:45 Shawn Heisey wrote: > Can I pass my dat

Re: Auto ID for Documents indexed

2010-08-27 Thread Grijesh.singh

No there is no such feature,your indexing component have to keep generation of ids -- View this message in context: http://lucene.472066.n3.nabble.com/Auto-ID-for-Documents-indexed-tp1367972p1369051.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Auto ID for Documents indexed

2010-08-27 Thread Markus Jelsma

No. Solr doesn't require a unique ID nor is an auto incrementing value really useful in indices spanning multiple machines. Maybe SOLR-308 could help you out but then the question remains, why would you need a feature like this? https://issues.apache.org/jira/browse/SOLR-308 On Friday 27 Augu

Re: Status of Solr in the cloud?

2010-08-27 Thread Markus Jelsma

That would be Solr 4.0, or maybe 3.1 first. http://wiki.apache.org/solr/Solr3.1 http://wiki.apache.org/solr/Solr4.0 On Thursday 26 August 2010 23:58:25 Charlie Jackson wrote: > There seem to be a few parallel efforts at putting Solr in a cloud > configuration. See http://wiki.apache.org/solr/Kat

Re: Document Section in Solr

2010-08-27 Thread Markus Jelsma

You cannot divide a document into sections as far as i know. You could, however, store divisions in different fields, if your use-case allows this, and retrieve only the fields that you need. This way you can avoid downloading 20MiB at once. On Friday 27 August 2010 11:26:05 maheshkumar wrote:

Re: Search results after importing from Dih

2010-08-27 Thread hemantverm...@gmail.com

check is there any file other than segments file. if yes then your data is in index, commit it. try restart solr. -- View this message in context: http://lucene.472066.n3.nabble.com/Search-results-after-importing-from-Dih-tp1365720p1368560.html Sent from the Solr - User mailing list archive at N

Re: full control over norm values?

2010-08-27 Thread MitchK

Hi Micheal, have a look at SweetSpotSimilarity (Lucene). Kind regards, - Mitch -- View this message in context: http://lucene.472066.n3.nabble.com/full-control-over-norm-values-tp1366910p1367462.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Search Results optimization

2010-08-27 Thread Hasnain

Thank you so much for valuable suggestions this is the query im using q=swingline red stapler hammer hand rigid&fl=name&qt=standard1&rows=35&debugQuery=true here is my output, as you can see close to last records, "Swingline Stapler" scored more than "Swingline Red Stapler - 747 series, but I

full control over norm values?

2010-08-27 Thread Michael Zach

Hello all, is there a way in solr to directly set the norm value for fields? I have been using index time boost values so far, but they just are one part of norm value calculation. How can I influence that? Kind regards Michael

catching SqlEXceptions in DataImport Time

2010-08-27 Thread kishan

HI all, iam using solr 1.4.0 recently i observed in my solr logs that because of the invalid userName i got java.sql.SQLException: Access denied for user '1234'@'localhost i resolved this but iam not able to capture this in my code . how can i catch the SqlEXceptions in Solr-DataImport ple

38 matches

Mail list logo