A problem with this has recently surfaced: spelling suggestions. A
spelling checker built from the index pulls all terms. You cannot give
it a filter query. But, you don't want to show people words from
documents they should not see.
On Fri, Aug 27, 2010 at 12:01 PM, Lance Norskog wrote:
> User se
Make the sort fields multiValued="false". This guarantees that the
field is sortable. If the field has even one more term than there are
documents, you get an ArrayIndexOutOfBoundsException. Also, sorting
works from the indexed terms, not the stored values. You don't need to
store the _sort fields:
O
For XML files that are not in the Solr document upload format, you
would use the DataImportHandler.
http://wiki.apache.org/solr/DataImportHandler#XPathEntityProcessor
Look for the wikipedia example. It shows how to read XML files from
disk. You give XPath expressions for different items in the XM
Would "number of values" be an interesting value source/function? Is
it easy to do with a Lucene query?
On Fri, Aug 27, 2010 at 5:25 AM, Erik Hatcher wrote:
> My recommendation is if you need to query on something, index it as you
> need... so in this case index another field with the number of v
This was a 2-year-old question :)
Have you made sure that UTF-8 character encoding is set in all phases
of your project? Servlet container, XML input header, etc? Character
encodings are hell to debug on Windows, so I would suggest checking it
on Linux or a Mac.
Since this is a one-character fumb
To make a dictionary with a 'minimum document count' you need to make
the dictionary from the facets. Facets will create this for you; but
will allocate memory for every last term. The last N facets will have
the smallest # of terms.
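
A minimal sketch of the facet request this describes, in Python. The field name "spell" and the select URL are assumptions for illustration; facet.mincount, facet.limit and facet.field are standard Solr facet parameters. As noted above, faceting still allocates memory for every term, so this is only a starting point for small to medium term counts.

```python
from urllib.parse import urlencode

def facet_dictionary_url(base_url, field, min_count):
    """Build a facet request that lists every term of `field`
    occurring in at least `min_count` documents."""
    params = [
        ("q", "*:*"),                    # match all documents
        ("rows", 0),                     # no documents needed, only facet counts
        ("facet", "true"),
        ("facet.field", field),
        ("facet.mincount", min_count),   # the 'minimum document count'
        ("facet.limit", -1),             # -1 means: return all terms
    ]
    return base_url + "?" + urlencode(params)

# Hypothetical field "spell" on a local Solr instance.
url = facet_dictionary_url("http://localhost:8983/solr/select", "spell", 5)
print(url)
```

The terms in the response can then be written to a plain word list and used as the spelling dictionary.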
To get term counts for hundreds of millions of terms, I think yo
You might be better off starting with the Lucene CheckIndex program.
It walks all of the Lucene index data structures. I have done
forensics by fiddling with the CheckIndex code.
On Thu, Aug 26, 2010 at 9:11 AM, Shawn Heisey wrote:
> On 5/24/2010 6:30 AM, Sascha Szott wrote:
>>
>> Hi folks,
>>
>
I have a field "popularity" that is changing frequently. So I'd like to put it
in an ExternalFileField.
If I do that, can I still use "popularity" in a boosted query such as:
{!boost b=log(popularity)}foo
Thanks.
On further investigation: DocumentBuilder.loadStoredFields() is used
in one utility function which is only called from one unit test. This
should be considered dead code. Don't use it.
SolrPluginUtils.docListToSolrDocument()
SolrPluginUtilsTest.testDocListConversion()
On Wed, Aug 25, 2010 at 9:
Early Bird Registration for Surge Scalability Conference 2010 ends next
Tuesday, August 31. We have a killer lineup of speakers and architects
from across the Internet. Listen to experts talk about the newest
methods and technologies for scaling your Web presence.
http://omniti.com/surge/2010/re
User security tends to change often. You may find it easier to use
user/role security. You could create a unique role for a user's docs
and store that role instead. You need a separate user->role database.
Later, the user can choose to share docs with someone else and you
would then change the mapp
My feeling is that private fields in a public document will be the hardest
nut to crack, unless you have an intermediary layer that users call instead
of hitting your solr instance directly. If you front it with a web service
you could handle various authorization scenarios a little easier.
Priva
Hi all,
I'm wondering how to handle the following scenario:
There is a pool of data which can be searched by all users via a web
interface. Now some users would like to add their own documents, or private
fields to existing documents, which shall not be accessible by others.
Concerning private fields
.. I
I agree with Marcus, the usefulness of passing through WDF twice
is suspect. You can always do a copyField to a completely different
field and do whatever you want there; copyField forks the raw input
to the second field, not the analyzed stream...
What is it you're really trying to accomplish? Yo
I don't understand your use-case behind this question. Even displaying
2M is still more than most users want. Can you explain what you're
trying to accomplish?
If you're trying to show just some context around the search terms,
highlighting might help.
Best
Erick
On Fri, Aug 27, 2010 at 4:04 AM,
Hey Robert,
Just wondering if you ever got to solve this problem?
We are facing a similar issue with our catalog search :(
Look forward to hearing from you.
-Thanks,
Muneeb
--
View this message in context:
http://lucene.472066.n3.nabble.com/Searching-for-words-with-accented-characters-tp4863
The 'text' fieldType is not suitable for sorting. You need to create 2 new
sort fields and use the copyField directive in your schema to copy the data
from your TITLE and UPDBY fields into them at indexing time.
Then you search on TITLE but sort on TITLE_sort.
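
A small sketch of the resulting request, in Python. The select URL and the query term are assumptions; TITLE and TITLE_sort are the field names used above, and "sort" is the standard Solr query parameter.

```python
from urllib.parse import urlencode

def search_sorted_url(base_url, query, sort_field, direction="asc"):
    """Search with `query` but order the results by a separate sort field."""
    params = [
        ("q", query),
        ("sort", f"{sort_field} {direction}"),  # e.g. "TITLE_sort asc"
    ]
    return base_url + "?" + urlencode(params)

# Search on the analyzed TITLE field, sort on the TITLE_sort copy.
url = search_sorted_url("http://localhost:8983/solr/select", "TITLE:solr", "TITLE_sort")
print(url)
```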
--
View this message in context:
h
What seems to be the problem? Did you consult the wiki on this matter?
http://wiki.apache.org/solr/CommonQueryParameters#sort
On Friday 27 August 2010 15:14:06 deepak agrawal wrote:
> Hi,
>
> I have one Text field in our schema i want to do the sorting for that
> column.
>
>
>
>
>
> I ha
Hi,
I have one text field in our schema, and I want to sort on that column.
I have these two columns, and I want to sort on both of them.
Can anyone please suggest what I need to do for that?
I am currently using Solr 1.4.
--
DEEPAK AGRAWAL
+91-9379433455
GOOD LUCK...
Thank you, Jan Høydahl.
I used
http://localhost:8983/solr/select?&shards=localhost:8983/solr/aaprivate,localhost:8983/solr/aapublic/.
I got an error: "Missing solr core name in path". I have aapublic and aaprivate
cores. I also got an error if I used
http://localhost:8983/solr/aapublic/select?&s
Thanks so much, I really appreciate your help!
Have a great weekend!
Xiaohui
-----Original Message-----
From: Jan Høydahl / Cominvent [mailto:jan@cominvent.com]
Sent: Friday, August 27, 2010 7:42 AM
To: solr-user@lucene.apache.org
Subject: Re: how to deal with virtual collection in solr?
Hi
My recommendation is if you need to query on something, index it as
you need... so in this case index another field with the number of
values in that field. This is easy if you're writing a custom
indexer, but maybe not so trivial if you're indexing other ways - so a
custom update processo
Hi
In a multiValued field, I want to write a query that returns all the documents
that actually have more than 1 value in that field. (don't care what the value
is).
Maybe this should be obvious, but I just can't think of how to formulate such a
query.
Any help?
Thanks,
Brad
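
One way to approach this, following the recommendation elsewhere in this thread to index the number of values in a separate field, is sketched below in Python. The field names "tags" and "tags_count" are hypothetical, and the sketch assumes you control the code that builds documents before they are sent to Solr.

```python
def add_value_count(doc, field, count_field):
    """Return a copy of `doc` with `count_field` set to the number of
    values that `field` holds (0 if the field is missing)."""
    values = doc.get(field, [])
    if not isinstance(values, (list, tuple)):
        values = [values]  # a single value counts as one
    enriched = dict(doc)
    enriched[count_field] = len(values)
    return enriched

def more_than_one_value_query(count_field):
    """Range query matching documents whose count field is 2 or more."""
    return f"{count_field}:[2 TO *]"

doc = add_value_count({"id": "1", "tags": ["a", "b"]}, "tags", "tags_count")
print(doc["tags_count"])
print(more_than_one_value_query("tags_count"))
```

The count field would need to be declared as an indexed integer field in the schema for the range query to work.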
Yes, this is really a pain sometimes.
I'd prefer a well defined base path, which could be assumed everywhere unless
otherwise documented.
SolrHome is one natural choice. For backward compat we could add a config in
solr(config).xml to easily switch to old behaviour.
Also, it makes sense to defi
For solving the car/car-rent issue you'll need to add a SynonymFilter to your
analyzer chain and configure it accordingly.
On Friday 27 August 2010 13:40:15 hemantverm...@gmail.com wrote:
> this link will help you:
> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimi
> te
Hi,
Version 1.4.1 does not support the SolrCloud style sharding. In 1.4.1, please
use this style:
&shards=localhost:8983/solr/aaprivate,localhost:8983/solr/aapublic/
However, since schema is the same, I'd opt for one index with a "collections"
field as the filter.
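
Both options can be sketched in Python as follows. The host, port and the aaprivate/aapublic core names come from this thread. Sending the distributed request to one core's /select handler, the core name "combined" and the collection value "public" are assumptions for illustration.

```python
from urllib.parse import urlencode

SOLR = "localhost:8983/solr"  # host and port as used in this thread

def sharded_select_url(entry_core, shard_cores, query):
    """1.4.1 style distributed search: list every shard explicitly."""
    shards = ",".join(f"{SOLR}/{core}" for core in shard_cores)
    params = [("q", query), ("shards", shards)]
    # safe=":/," keeps the shard list readable instead of percent-encoding it
    return f"http://{SOLR}/{entry_core}/select?" + urlencode(params, safe=":/,")

def filtered_select_url(core, query, collection):
    """Single index alternative: restrict by the "collections" field."""
    params = [("q", query), ("fq", f"collections:{collection}")]
    return f"http://{SOLR}/{core}/select?" + urlencode(params, safe=":/,")

sharded = sharded_select_url("aapublic", ["aaprivate", "aapublic"], "foo")
filtered = filtered_select_url("combined", "foo", "public")  # hypothetical core and value
print(sharded)
print(filtered)
```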
You can add that field to yo
this link will help you:
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory
--
View this message in context:
http://lucene.472066.n3.nabble.com/A-few-query-issues-with-solr-tp1358081p1369261.html
Sent from the Solr - User mailing list archive at Nabble.co
Oh, wait.
If you do not have to have sequential, but only unique keys, you can use the
UUIDType.
You can set SOLR to handle this and generate the field ids for you.
Works great for me.
[]s,
Lucas Frare Teixeira .·.
- lucas...@gmail.com
- lucastex.com.br
- blog.lucastex.com
- twitter.com/lucaste
It's just a configured filter, so you should be able to define it twice. Have
you tried it? But it might be tricky: the output from the first will be the
input of the second, so I doubt the usefulness of this approach.
On Thursday 26 August 2010 17:45:45 Shawn Heisey wrote:
> Can I pass my dat
No, there is no such feature; your indexing component has to handle the
generation of ids.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Auto-ID-for-Documents-indexed-tp1367972p1369051.html
Sent from the Solr - User mailing list archive at Nabble.com.
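
If ids are generated on the indexing side, as suggested above, and they only need to be unique rather than sequential, a minimal Python sketch could look like this. The field name "id" is an assumption; use whatever your schema declares as the unique key.

```python
import uuid

def with_generated_id(doc, id_field="id"):
    """Return a copy of `doc`; if it has no id yet, assign a random UUID."""
    enriched = dict(doc)
    if not enriched.get(id_field):
        enriched[id_field] = str(uuid.uuid4())  # unique, but not sequential
    return enriched

new_doc = with_generated_id({"title": "example"})
kept_doc = with_generated_id({"id": "abc", "title": "example"})
print(new_doc["id"])
print(kept_doc["id"])
```

Random UUIDs also avoid the coordination problem that an auto-incrementing counter would have when several machines index at once.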
No. Solr doesn't require a unique ID nor is an auto incrementing value really
useful in indices spanning multiple machines. Maybe SOLR-308 could help you
out but then the question remains, why would you need a feature like this?
https://issues.apache.org/jira/browse/SOLR-308
On Friday 27 Augu
That would be Solr 4.0, or maybe 3.1 first.
http://wiki.apache.org/solr/Solr3.1
http://wiki.apache.org/solr/Solr4.0
On Thursday 26 August 2010 23:58:25 Charlie Jackson wrote:
> There seem to be a few parallel efforts at putting Solr in a cloud
> configuration. See http://wiki.apache.org/solr/Kat
You cannot divide a document into sections as far as I know. You could,
however, store divisions in different fields, if your use-case allows this,
and retrieve only the fields that you need. This way you can avoid downloading
20MiB at once.
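
A short Python sketch of retrieving only selected fields with the standard "fl" parameter. The field names "id" and "section_1", the document id and the select URL are hypothetical.

```python
from urllib.parse import urlencode

def fetch_fields_url(base_url, doc_id, fields):
    """Request one document but return only the listed stored fields."""
    params = [
        ("q", f"id:{doc_id}"),       # assumes a simple id without special characters
        ("fl", ",".join(fields)),    # field list: only these fields come back
    ]
    return base_url + "?" + urlencode(params, safe=":,")

url = fetch_fields_url("http://localhost:8983/solr/select", "doc42", ["id", "section_1"])
print(url)
```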
On Friday 27 August 2010 11:26:05 maheshkumar wrote:
Check whether there are any files other than the segments file.
If yes, then your data is in the index; commit it.
Try restarting Solr.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Search-results-after-importing-from-Dih-tp1365720p1368560.html
Sent from the Solr - User mailing list archive at N
Hi Michael,
Have a look at SweetSpotSimilarity (Lucene).
Kind regards,
- Mitch
--
View this message in context:
http://lucene.472066.n3.nabble.com/full-control-over-norm-values-tp1366910p1367462.html
Sent from the Solr - User mailing list archive at Nabble.com.
Thank you so much for the valuable suggestions.
This is the query I'm using:
q=swingline red stapler hammer hand rigid&fl=name&qt=standard1&rows=35&debugQuery=true
Here is my output. As you can see near the last records, "Swingline Stapler"
scored more than "Swingline Red Stapler - 747 series, but I
Hello all,
is there a way in solr to directly set the norm value for fields?
I have been using index time boost values so far, but they just are one part of
norm value calculation.
How can I influence that?
Kind regards
Michael
Hi all,
I am using Solr 1.4.0.
Recently I observed in my Solr logs that, because of an invalid userName,
I got java.sql.SQLException: Access denied for user '1234'@'localhost
I resolved this, but I am not able to capture this in my code.
How can I catch the SQLExceptions in Solr-DataImport?
ple