Hi Eric
Yes, it was a classpath issue.
Sowmya.
On Mon, Jul 25, 2011 at 4:01 PM, Erick Erickson erickerick...@gmail.comwrote:
Well, MultiMapSolrParams is a subclass of SolrParams, so you actually
do use it in your code G
But this looks like a classpath problem. You say your code
On Mon, Jul 25, 2011 at 12:24 PM, Stefan Matheis
matheis.ste...@googlemail.com wrote:
Hi Remy,
so you may open an Issue for this on the github Project? i mean .. just
creating another client, because i have one problem, does not sound like a
good plan?
Agreed, and thanks for calling my
Hi,
2011/7/25 Gabriel Farrell gsf...@gmail.com
On Mon, Jul 25, 2011 at 12:24 PM, Stefan Matheis
matheis.ste...@googlemail.com wrote:
Hi Remy,
so you may open an Issue for this on the github Project? i mean .. just
creating another client, because i have one problem, does not sound like
When crating cores through solr.xml, I am able to specify custom
properties, to be referenced in solrconfig.xml. For example:
cores adminPath=/admin/cores defaultCoreName=master
core name=master instanceDir=core1 shard=shard1 collection=myconf
property name=enable.master value=true /
Hmmm, I think that should be fq=field:[* TO *] since the problem is to
include docs with anything in that field.
You could also index a special EMPTY_FIELD string and do something
like fq=-field:[EMPTY_FIELD]..
Best
Erick
On Mon, Jul 25, 2011 at 12:10 PM, Stefan Matheis
I'm a little confused. Are you searching against these
different titles or is the search something else and you're really
only interested in displaying different titles for documents returned
for the query?
If it's just a display issue, you can use multivalued fields, the order
in which you put
: A followup. The wiki has a whole discussion of the 'update' XML
: message. But solrj has nothing like it. Does that really exist? Is
: there a reason to use it? If I just 'add' the document a second time,
: it will replace?
You should only see update in Solr docs used in the context of
I have three documents with the following product titles in a text field
called moreWords with analysis stack matching the solr example text
field definition.
1. HP LaserJet P1102W Monochrome Laser Printer
http://www.buy.com/prod/hp-laserjet-p1102w-monochrome-laser-printer/q/l
Sorry, to clarify a search for P1102W matches all three docs but a
search for p1102w LaserJet only matches the second two. Someone asked
me a question while I was typing and I got distracted, apologies for any
confusion.
-Original Message-
From: Robert Petersen [mailto:rober...@buy.com]
thanks!! it worked.
I was just wondering if xpath can be used to use process default xml format
for solr index doc
--
View this message in context:
http://lucene.472066.n3.nabble.com/dih-fetching-but-not-adding-records-to-index-tp3189438p3198705.html
Sent from the Solr - User mailing list
Hmmm, I'm assuming that moreWords is your default text field, yes?
But it works for me (tm), using 1.4.1. What version of Solr are you on?
Also, take a glance at the admin/analysis page, that might help...
Gotta run
Erick
On Mon, Jul 25, 2011 at 4:52 PM, Robert Petersen rober...@buy.com
hi erick,
I will be searching only on search_term.
I did exactly as u said in application layer,
I was not sure how multi-valued fields works in co-relation
--
View this message in context:
http://lucene.472066.n3.nabble.com/multivalue-or-denormalise-tp3197942p3198710.html
Sent from the
I'm still on solr 1.4.0 and the analysis page looks like they should match, and
other products with the same content do in fact match. I'm reindexing the
non-matching ones to rule that out.
-Original Message-
From: Erick Erickson [mailto:erickerick...@gmail.com]
Sent: Monday, July 25,
As in http://wiki.apache.org/solr/UpdateXmlMessages?
On Mon, Jul 25, 2011 at 4:10 PM, Chris Hostetter
hossman_luc...@fucit.org wrote:
: A followup. The wiki has a whole discussion of the 'update' XML
: message. But solrj has nothing like it. Does that really exist? Is
: there a reason to use
Hmmm, I can't find a convenient 1.4.0 to download, but re-indexing is a good
idea since this seems like it *should* work.
Erick
On Mon, Jul 25, 2011 at 5:32 PM, Robert Petersen rober...@buy.com wrote:
I'm still on solr 1.4.0 and the analysis page looks like they should match,
and other
This is a pretty low level issue with inverted indexes (i.e. the underlying
data structure used) and not so much the architecture. It is possible, I
suppose, to solve it at the architectural level, but in many cases this causes
performance problems that are not usually acceptable.
On Jul 20,
Thanks, this is helpful.
I do indeed periodically update or delete just about every doc in the
index, so it makes sense that optimization might be neccesary even in
post 1.4, but I'm still on 1.4 -- add this to another thing to look into
rather than assume after I upgrade.
Indeed I was
: As in http://wiki.apache.org/solr/UpdateXmlMessages?
Exactly ... the title is XML Messages for Updating a Solr Index
But i do see some confusing usages of add/update in the context of
documents that definitely don't belong there -- so i've changed them to
add/replace.
Thanks for bringing
Anybody who knows how to query an solr server whether it is optimized or not ?
As replication can config slave to pull the indexes after optimized ,so I
think there must be someway to query that .But I didn't find any document to
identify that , anyone knows ?
Thanks so much O(n_n)O
Yes - I am using edismax but the reason is not obvious to me can you give
me a pointer?
Thanks
Yee
--
View this message in context:
http://lucene.472066.n3.nabble.com/Logically-equivalent-queries-but-vastly-different-no-of-results-tp3190278p3199362.html
Sent from the Solr - User mailing
Hi Yonik:
On Jul 17, 2011, at 9:30 AM, Yonik Seeley wrote:
On Sun, Jul 17, 2011 at 10:38 AM, Jeff Schmidt j...@535consulting.com wrote:
I don't want to query for a particular facet value, but rather have Solr do
a grouping of facet values. I'm not sure about the appropriate nomenclature
Hi all,
I need to make date faceted query and I tried to use facet.range but can't
get result I need.
I want to make 4 facet like following.
1 Months,3 Months, 6Months, more than 1 Year
The onlinedate field in schema.xml like this
field name=onlinedate type=tdate indexed=true stored=true/
I
On Fri, Jul 22, 2011 at 12:42 AM, abhayd ajdabhol...@hotmail.com wrote:
hi
I m trying to load data into solr index from a xml file using dih
my promotions.xml file
--
add
doc
field name=id3/field
/doc
doc
field
LotsOfCores does not exist as a feature. It is just a write-up, some jira
issues and a couple of patches. Did I miss something?
On Sun, Jul 24, 2011 at 8:26 PM, Markus Jelsma
markus.jel...@openindex.iowrote:
It works fine but you would keep an eye on additional overhead, cores
`stealing` too
No i missed something and interpreted the question as using a lot of cores.
LotsOfCores does not exist as a feature. It is just a write-up, some jira
issues and a couple of patches. Did I miss something?
On Sun, Jul 24, 2011 at 8:26 PM, Markus Jelsma
markus.jel...@openindex.iowrote:
It
Hey Stephan,
Thanks, but I already used this solr client and I got an error when I add
too much documents FATAL ERROR: JS Allocation failed - process out of
memory.
I didn't find the source of the problem in the solr client. So I decided to
write my own without this error hopefully and also I'm
Hi, I recently went through a little hell when I upgraded my Solr
servers to 3.2.0. What I didn't anticipate was that my Java SolrJ
clients depend on the server version.
I would like to add a note about this in the SolrJ docs:
http://wiki.apache.org/solr/Solrj#Streaming_documents_for_an_update
hi,
while searching for word SOLR in
highlighting in solr can be manipulated
with frag-size =10 .
how is the fragment decided ? how many characters are taken before the world
SOLR and after the word SOLR ?
jame
You've three analyzer elements, i wonder what that would do. You need to add
the char filter to the index-time analyzer.
On Monday 25 July 2011 13:09:14 Merlin Morgenstern wrote:
Hi there,
I am trying to strip html tags from the data before adding the documents to
the index. To do that I
sounds logical. I just changed it to the following, restarted and reindexed
with commit:
fieldType name=text class=solr.TextField
positionIncrementGap=100 autoGeneratePhraseQueries=true
analyzer type=index
tokenizer
Are you looking at the returned result set or what you've actually indexed?
Analyzers are not run on the stored data, only on indexed data.
On Monday 25 July 2011 15:03:18 Merlin Morgenstern wrote:
sounds logical. I just changed it to the following, restarted and reindexed
with commit:
http://lucene.apache.org/solr/api/org/apache/solr/client/solrj/request/CoreAdminRequest.html
That should get you started.
Best
Erick
On Tue, Jul 19, 2011 at 11:40 PM, Jiang mingyuan
mailtojiangmingy...@gmail.com wrote:
hi all,
Our solr server contains two cores:core0,core1,and they both
dismax is a fairly narrow-use parser. By that I mean it was created
to solve a specific issue. It has some pronounced warts as you've
discovered.
edismax is the preferred parser if you have access to it. I'd just
ignore dismax if you have access to edismax. There's been some
talk of deprecating
I'd seriously consider going with SolrJ as your indexing strategy, it allows
you to do anything you need to do in Java code. You can call the Tika
library yourself on the files pointed to by your rows as you see fit, indexing
them as you choose, perhaps one Solr doc per attachment, perhaps one
per
hi
when u highlight and get back snippet fragments , can you over write the
default hl.regex.pattern through url .
can some quote an example url of that sort ?
what if i make pass hl.slop=0 will this stop considering regex pattern at
all ?
--
-JAME
Well, MultiMapSolrParams is a subclass of SolrParams, so you actually
do use it in your code G
But this looks like a classpath problem. You say your code compiles,
but do you make all the jars you path to during compilation available
to your servlet? And/or do you have any old jar files in
Well, WAY_ANALYZED:de l hotel de ville parses as
WAY_ANALYZED:de default:l default:hotel default:de default:ville
You probably want something like WAY_ANALYZED:(de l hotel de ville),
perhaps with AND between them. Try adding debugQuery=on to your
queries and you can sometimes see this kind of
I'm no expert on frange, but fq clauses are intersections. So if your
two frange queries have no terms in common, you won't get anything.
You can think of it as an implied AND between all the fq clauses you specify...
Best
Erick
On Thu, Jul 21, 2011 at 5:29 AM, Rohit Gupta ro...@in-rev.com
Yes and that's causing some problems in my application. Is there a way to
truncate the 7th decimal place in regards to sorting by the score?
On Fri, Jul 22, 2011 at 4:27 PM, Yonik Seeley yo...@lucidimagination.comwrote:
On Fri, Jul 22, 2011 at 4:11 PM, Brian Lamb
brian.l...@journalexperts.com
Thanks so much Erick (and Stefan). Yes, I did some reading on SolrJ and
Tika and you are spot-on. We will write our own importer using SolrJ and
then we can grab the DB records and parse any attachments along the way.
Now it comes down to a schema design question. The issue I'm struggling
with
Hi,
I try to use the suggest component (solr 3.3) with multiple cores.
I added a search component and a request handler as described in the docs (
http://wiki.apache.org/solr/Suggester) to my solrconfig.
That works fine for 1 core but querying my solr instance with the shards
parameter does not
thanks
using parenthesis
select?q=VINCI Parkfq=WAY_ANALYZED:(rue de l hotel de ville) AND
(TOWN_ANALYZED:paris OR
DEPARTEMENT_ANALYZED:paris)rows=200fl=NAME,TOWN,WAY,scoredebugQuery=on
works
but I would rather not use parenthesis or AND between those words
this brings another question:
I'm afraid there currently isn't much support for correcting misplaced
whitespace. Solr is going to look at each word individually and won't even try
to combine ajacent words (or split a word into 2 or more). So there is no good
way to get these kinds of suggestions.
One thing that might
Yeah, the 1.4 code base is older. That is, optimization will have more
effect on that vintage code than on 3.x and trunk code.
I should have been a bit more explicit in that other thread. In the case
where you add a bunch of documents, optimization doesn't buy you all
that much currently. If you
Are you using stemming on that field? Sometimes stemming and wildcards don't
get along very well. If you are, take a look at how the terms that should
match ban?le are analyzed on the Analysis section of the admin.
On Sat, Jul 23, 2011 at 6:33 AM, Romi romijain3...@gmail.com wrote:
I am using
Travis,
that sounds like a perfect usecase for dynamic fields .. attachment_*
and there you go. works for no attachment, as well as one, three or 50.
for the user interface, you could iterate over them and show them as
list - or something else that would fit your need.
also, maybe, you
Well, the attachment_1, attachment_2 idea would be awkward
to form queries (i.e. there would be 100 clauses if there were 100 docs?)
Dynamic fields have this same problem.
You could certainly index them all into a big field, just make it
multivalued and do a SolrDocument.add(bigtextfield,
Not that I know of, although it does give you the parsed fq results, which
you could then use as query parameters (i.e. the 'q' parameter) for debugging...
You have to use parens or fully qualify each term
(e.g. WAY_ANALYZED:rue WAY_ANALYZED:de), that's just how the
query parsing works...
Am 25.07.2011 16:58, schrieb Erick Erickson:
Well, the attachment_1, attachment_2 idea would be awkward
to form queries (i.e. there would be 100 clauses if there were 100 docs?)
Dynamic fields have this same problem.
Oh, yes .. correct .. overlooked that part :/ sorry.
This will work for mispelled compounds indeed but not when the compound word
is actually queried as two separate correctly spelled words. Most likely both
sail and boat exist in the index as single token.
There is a work around but that's limited to a scenario where users never use
more than 1
Related to this is this jira issue:
https://issues.apache.org/jira/browse/SOLR-2585 . With this patch, Solr will
consider alternatives in cases where a word is mispelled in its context, but
nevertheless exists in the index and/or dictionary. This is a work-in-progress
and is for trunk only,
Hello all,
I'm getting a strange suggestion for a purposely mistyped word in Solr 1.4.1
I search for the term snia, and I would expect the term sina to be
suggested, as this is a fairly common word in quite a bit of the indexed
documents.
Instead, I'm getting india as a suggestion, which is
Hi all,
I have an optional field called common_names. I would like to keep this
field optional but at the same, occasionally do a search where I do not
include results where there is no value set for this field. Is this possible
to do within solr?
In other words, I would like to do a search
I think you need to list the charfilter earlier in the analysis chain;
before the tokenizer. Porbably Solr should tell you this...
-Mike
On 07/25/2011 09:03 AM, Merlin Morgenstern wrote:
sounds logical. I just changed it to the following, restarted and reindexed
with commit:
charFilters are executed first regardless of their position in the analyzer.
On Monday 25 July 2011 17:53:59 Mike Sokolov wrote:
I think you need to list the charfilter earlier in the analysis chain;
before the tokenizer. Porbably Solr should tell you this...
-Mike
On 07/25/2011 09:03
hi
thanks for the response
I am aware of post.sh but i wanted to make use of dih and scheduling. We can
not use cron due to some other issues.
So was thinking of using scheduling Data import
--
View this message in context:
Hmm - I'm not sure about that; see
https://issues.apache.org/jira/browse/SOLR-2119
On 07/25/2011 12:01 PM, Markus Jelsma wrote:
charFilters are executed first regardless of their position in the analyzer.
On Monday 25 July 2011 17:53:59 Mike Sokolov wrote:
I think you need to list the
So, with other words you want to exlude those records? :)
FilterQuery with a Range-Query could help: fq=-field:[* TO *]
Regards
Stefan
Am 25.07.2011 17:53, schrieb Brian Lamb:
Hi all,
I have an optional field called common_names. I would like to keep this
field optional but at the same,
On Mon, Jul 25, 2011 at 9:36 PM, abhayd ajdabhol...@hotmail.com wrote:
hi
thanks for the response
I am aware of post.sh but i wanted to make use of dih and scheduling. We can
not use cron due to some other issues.
So was thinking of using scheduling Data import
[...]
OK, though in that
I've seen that issue too and read comments on the list yet i've never had
trouble with the order, don't know what's going on. Check this analyzer, i've
moved the charFilter to the bottom:
analyzer type=index
tokenizer class=solr.WhitespaceTokenizerFactory/
filter
Hi Remy,
so you may open an Issue for this on the github Project? i mean .. just
creating another client, because i have one problem, does not sound like
a good plan?
Regards
Stefan
Am 25.07.2011 10:56, schrieb Remy Loubradou:
Hey Stephan,
Thanks, but I already used this solr client and I
hi
What i want to do is get title_1 if NOW is between st_date_1 and end_date_1
Also at the same time get title_2 if NOW is between st_date_2 and
end_date_2
and so on
at present i have a schema like this denorm'. I cant figure it out a
single solr query to do this.
field name=id
Hmm that looks like it's working fine. I stand corrected.
On 07/25/2011 12:24 PM, Markus Jelsma wrote:
I've seen that issue too and read comments on the list yet i've never had
trouble with the order, don't know what's going on. Check this analyzer, i've
moved the charFilter to the bottom:
63 matches
Mail list logo