Hi,
With help from the group here, I have been able to set up a search
application with payloads enabled. However, there is a noticeable increase
in query response times with payloads as compared to the same queries
without payloads. I am also seeing a lot more disk IO (I have a 7200 rpm
disk) and
Hi all.
I have a delicate problem when it comes to two words that are rather similar in
the way they are typed, but when it comes to the meaning of the word they are
completely different.
The actual words are restaurant (as in restaurant) and restaurering (as in
restoration).
Solr seems to
2009/12/17 Steinar Asbjørnsen steinar...@gmail.com
Hi all.
I have a delicate problem when it comes to two words that are rather
similar in the way they are typed, but when it comes to the meaning of the
word they are completely different.
The actual words are restaurant (as in restaurant)
Hello, is there any way to configure shards parameter in solrconfig.xml? So I
do not need provide it in the url. Thanks Peter
--
View this message in context:
http://old.nabble.com/shards-parameter-tp26826908p26826908.html
Sent from the Solr - User mailing list archive at Nabble.com.
yes.
put it under the defaults section in your standard requesthandler.
On Thu, Dec 17, 2009 at 5:22 PM, pcurila p...@eea.sk wrote:
Hello, is there any way to configure shards parameter in solrconfig.xml? So I
do not need provide it in the url. Thanks Peter
--
View this message in context:
Den 17. des. 2009 kl. 12.42 skrev Shalin Shekhar Mangar:
2009/12/17 Steinar Asbjørnsen steinar...@gmail.com
Hi all.
I have a delicate problem when it comes to two words that are rather
similar in the way they are typed, but when it comes to the meaning of the
word they are completely
2009/12/17 Steinar Asbjørnsen steinar...@gmail.com
Den 17. des. 2009 kl. 12.42 skrev Shalin Shekhar Mangar:
For specific cases like this, you can add the word to a file and specify
it
in schema, for example:
filter class=solr.SnowballPorterFilterFactory language=English
Hi,
Does Solr provide adaptive searching? Can it adapt to user clicks within the
search results it provides? Or that has to be done externally?
I couldn't find anything on googling for it.
Thanks,
--
- Siddhant
Hi,
I need to train a team in Solr. Since we are in Brazil, and i didnt find any
course about it around here, do you have any recommendation of some web
course of Solr?
Att,
I'd probably use the logging stuff instead of System.out, maybe the stream
isn't be flushed as expected.
On Dec 16, 2009, at 8:29 AM, gudumba l wrote:
Hello all,
I am trying to use query parser plugin feature of solr.
But its really strange that everytime its behaving in a
On Dec 17, 2009, at 4:52 AM, Raghuveer Kancherla wrote:
Hi,
With help from the group here, I have been able to set up a search
application with payloads enabled. However, there is a noticeable increase
in query response times with payloads as compared to the same queries
without payloads. I
Hi,
i am currently building a Solr configuration for a rather large search
index. To allow for indexing of differently named fields for each dataset, i
have includ the following dynamicField:
dynamicField name=*_index type=text indexed=true stored=true
required=false /
What i don't like about
What can it mean to adapt to user clicks ? Quite many things in my
head.
Do you have maybe a citation that inspires you here?
paul
Le 17-déc.-09 à 13:52, Siddhant Goel a écrit :
Does Solr provide adaptive searching? Can it adapt to user clicks
within the
search results it provides? Or
Of course you can. Just use remote debugging. Which app server are you
using? IDE?
L.M.
2009/12/3 Joel Nylund jnyl...@yahoo.com:
is there a way to print to std out or anything from my javascript DIH
transformer?
thanks
Joel
Tommy Molto wrote:
Hi,
I need to train a team in Solr. Since we are in Brazil, and i didnt find any
course about it around here, do you have any recommendation of some web
course of Solr?
Att,
It seems that Lucid Imagination provides Online Training courses for Solr:
Has anyone successfully upgraded from 1.2 to 1.4 by just changing the war file?
When we tried this we could not longer sort to retrieve the highest numbered
date/id in our index. Previously with 1.2 the following would return the most
recent data and now it returns data from around the same
Let say we have a search engine (a simple front end - web app kind of a
thing - responsible for querying Solr and then displaying the results in a
human readable form) based on Solr. If a user searches for something, gets
quite a few search results, and then clicks on one such result - is there
Yes, i entered in contact with them, i'm waiting an answer.
But i was looking for a plan b too, if we cant close the deal with them.
Att,
On Thu, Dec 17, 2009 at 12:55 PM, Koji Sekiguchi k...@r.email.ne.jp wrote:
Tommy Molto wrote:
Hi,
I need to train a team in Solr. Since we are in
A follow up question on this Hoss:
If I have a set of documents, let's say this email thread. Each email has a
unique author. All emails in the thread are indexed with threadid=33 If I
want to count the number of unique authors in this email thread, I could go
along the lines you mention at the
Forgot to add facet.mincount=1, obviously. But still, is this the only or
prefered way of doing something along these lines? Or is there a different
(better) approach?
Best regards,
Aleksander
On Thu, Dec 17, 2009 at 5:59 PM, Aleksander Stensby
aleksander.sten...@integrasco.com wrote:
A
You'll have to do it the way you mentioned, with two dynamic field
mappings.
Erik
On Dec 17, 2009, at 8:33 AM, Jan-Simon Winkelmann wrote:
Hi,
i am currently building a Solr configuration for a rather large search
index. To allow for indexing of differently named fields for each
Eric,
The sort changed, so it is now a separate parameter. q=sort=...
The query;sort syntax has been deprecated.
Be sure to reach Solr's CHANGES.txt when upgrading. There's this
section the upgrading from 1.3 (which would apply when going from 1.2
also!) mentioning this:
The
Paul
Thank you for your reply
I did du -sh in /solr_env/index/data
and it shows
36G
It is distributed among 700 cores with most of them being 150M
Is that a big index that should be sharded ?
2009/12/17 Noble Paul നോബിള് नोब्ळ् noble.p...@corp.aol.com
look at the index dir and see the size
On Dec 17, 2009, at 11:59 AM, Aleksander Stensby wrote:
A follow up question on this Hoss:
If I have a set of documents, let's say this email thread. Each
email has a
unique author. All emails in the thread are indexed with
threadid=33 If I
want to count the number of unique authors in this
Sharding isn't necessarily decided upon by index size. Is your search
performance ok? Got enough free disk space to optimize? Then don't
shard.
But no, 150M is not a large index size.
700 cores, now that's a lot!
Erik
On Dec 17, 2009, at 1:27 PM, Matthieu Labour wrote:
Paul
Thanks for your reply Erik!
The speed of my suggested query is actually very fast once we add the
facet.mincount=1 (when searching within a limited set of documents).
The set-back seem to be in the sharding of our data.. And that puzzles me a
little bit...
I can't really see why SOLR is so slow
I downloaded solr1.4, and unzipped it at C:\apache-solr-1.4.0 .
At C:\apache-solr-1.4.0\example, I try to run solr by java -jar
start.jar as instructed by Solr tutorial.
However, it failed.
C:\apache-solr-1.4.0\examplejava -jar start.jar
java.lang.NullPointerException
at
As to the indexing part:
This is an automated document input tool:
http://wiki.apache.org/solr/DataImportHandler
This is a plugin for it that pulls mail from an IMAP server:
http://wiki.apache.org/solr/MailEntityProcessor
This is a comment about microsoft MSG files and parsing in Java:
Anyone?
javaxmlsoapdev wrote:
I store document in a field content field defiend as follow in
schema.xml
field name=content type=text indexed=true stored=true
multiValued=true/
and following in solrconfig.xml
requestHandler name=/update/extract
Role-based authentication is one level of sophistication up from
user-based authentication. Users can have different roles, and
authentication goes against roles. Documents with multiple viewers
would be assigned special roles. All users would also have their own
matching role.
On Tue, Dec 15,
Are you suggesting that roles should be maintained in the index? We do manage
out authentication based on roles but at granular level, user rights play a
big role as well.
I know we need to compromise, just need to find a balance.
Thanks
Lance Norskog-2 wrote:
Role-based authentication is
I specified another updateRequestChain with out the custom updateProcessor and
it worked.
Thanks for the quick help.
-Kalyan
-Original Message-
From: noble.p...@gmail.com [mailto:noble.p...@gmail.com] On Behalf Of Noble
Paul ??? ??
Sent: Thursday, December 17, 2009 5:59 AM
Character-based NGrams are a good tool for this problem. MLT is a
document-wide numerical analysis.
If the common types of OCR mistakes are different than what NGrams
create, you might tune the ngram generator. For example, swapping
letters might not happen very often. SIngle- and multi-word
Or add a new field type to Solr. This would accept a date and include
a Java formatting string. The field would parse the date, print it
with that format string, and store the results. You could use any
aspect of the date.
On Tue, Dec 15, 2009 at 1:06 PM, Chris Hostetter
hossman_luc...@fucit.org
I think your problem is WordDelimiterFilterFactory. For reference, see:
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactoryAs
I understand it, your input name01 gets
You're setting up an infinite loop by adding a shards parameter on the
default search handler.
Create a new search handler and put your default under that.
-Yonik
http://www.lucidimagination.com
On Thu, Dec 17, 2009 at 7:47 AM, pcurila p...@eea.sk wrote:
I tried it out. But there is another
Another thing you might check into is stemming. The Porter stemmer
included in Solr is aggressive, meaning that it will tend to do
weird things with misspellings. There is a different stemmer called
KStem which is available from www.lucidimagination.com/Downloads is
less aggressive. Porter turns
1)use tika to index .msg files (Tika does support Microsoft outlook format
and I am already using Tika: http://lucene.apache.org/tika/formats.html).
2)while indexing you'll have to write handler to extract To, CC, Bcc values
and store it in a separate field in index.
3)when user searches on .msg
If the goal is to save time when using the admin interface, you can just add
this to conf/admin-extra.html:
script src=http://www.google.com/jsapi;/script
script
google.load(prototype, 1.6);
/script
script type=text/javascript
Event.observe(
window,
'load',
HI!
I have the same question
Thanks in advance
On Mon, Oct 12, 2009 at 1:55 PM, Pieter Steyn pieter...@gmail.com wrote:
Sorry for the hijack, but s replication necessary when using a cluster
file-system such as GFS2. Where the files are the same for any
instance of Solr?
On Mon, Oct 12,
After a lot of debugging, I finally found why the order of collapse results
are not matching the uncollapsed results. I can't say if it is a bug in the
implementation of fieldcollapse or not.
*Explaination:*
Actually, I am querying the fieldcollapse with some filters to restrict the
collapsing to
41 matches
Mail list logo