Hi Mark,
I suspect that issue you are facing is
https://issues.apache.org/jira/browse/SOLR-2649
You can verify this by toggling default operator between 'AND' and 'OR'.
--- On Wed, 8/10/11, Mark juszczec mark.juszc...@gmail.com wrote:
From: Mark juszczec mark.juszc...@gmail.com
Subject:
I tried tweaking WordDelimiterFactory but I won't accept # OR @ symbols
and it ignored totally.
I need solution plz suggest.
On 4 August 2011 21:08, Jonathan Rochkind rochk...@jhu.edu wrote:
It's the WordDelimiterFactory in your filter chain that's removing the
punctuation entirely from your
From what I see on my slaves, yes.
After replication has finished and new index is in place and new reader has
started
I have always a write.lock file in my index directory on slaves, even though
the index
on master is optimized.
Regards
Bernd
Am 10.08.2011 09:12, schrieb Pranav Prakash:
On Wed, Aug 10, 2011 at 1:11 PM, Bernd Fehling
bernd.fehl...@uni-bielefeld.de wrote:
From what I see on my slaves, yes.
After replication has finished and new index is in place and new reader has
started
I have always a write.lock file in my index directory on slaves, even
though the index
Sure there is actually no optimizing on the slave needed,
but after calling optimize on the slave the write.lock will be removed.
So why is the replication process not doing this?
Regards
Bernd
Am 10.08.2011 10:57, schrieb Shalin Shekhar Mangar:
On Wed, Aug 10, 2011 at 1:11 PM, Bernd Fehling
Hello,
First of all, I am a beginner and i am trying to develop a sample
application using SolrNet.
I am struggling about schema definition i need to use to correspond my
needs. In database, i have Books(bookId, name) and Pages(pageId, bookId,
text) tables. They have master-detail relationship.
That is not true. Replication is roughly a copy of the diff between the
master and the slave's index.
In my case, during replication entire index is copied from master to slave,
during which the size of index goes a little over double. Then it shrinks to
its original size. Am I doing something
It really does depend upon what you want to do in your app but from
the info given I'd go for denormalizing by repeating the least number
of values. So in your case that would be book
PageID+BookID(uniqueKey), pageID, PageVal1, PageValn, BookID, BookName
On 10 August 2011 09:46, directorscott
Hi All,
I am trying to sort the results on a unix timestamp using this query.
http://url.com:8983/solr/db/select/?indent=onversion=2.1q={!frange%20l=0.25}query($qq)qq=nokiasort=unix-timestamp%20descstart=0rows=10qt=dismaxwt=dismaxfl=*,scorehl=onhl.snippets=1
When I run this query, it says 'no
I have had a mistake with the configs files. From the example directory all
works correctly. Thanks to all.
---
Rode González
Libnova, SL
Paseo de la Castellana, 153-Madrid
[t]91 449 08 94 [f]91 141 21 21
www.libnova.es
-Mensaje original-
De: Rode González [mailto:r...@libnova.es]
Worked fine. Thanks a lot!
Massimo
On 09/08/2011 11:58, Jayendra Patil wrote:
Try using -
str name=hl.tag.pre![CDATA[b]]/str
str name=hl.tag.post![CDATA[/b]]/str
Regards,
Jayendra
On Tue, Aug 9, 2011 at 4:46 AM, Massimo Schiavonmschia...@volunia.com wrote:
In my Solr (3.3)
Could you please tell me schema.xml fields tag content for such case?
Currently index data is something like this:
PageID BookID Text
1 1some text
2 1some text
3 1some text
4 1some text
5 2some text
6 2
Hi,
I'm trying date faceting per last 24 hours, three days and last week, but I
don't know how to do it.
I have a DateField and I want to set different ranges, it is posible?
I understand the example from solr
hi,
i want to retrieve all the data from solr (say 10,000 ids ) and my page size
is 1000 .
how do i get back the data (pages) one after other ?do i have to increment
the start value each time by the page size from 0 and do the iteration ?
In this case am i querying the index 10 time instead of one
While going through my error logs of Solr, i found that a user had fired a
query - jawapan ujian bulanan thn 4 (bahasa melayu). This was converted to
following for autosuggest purposes -
jawapan?ujian?bulanan?thn?4?(bahasa?melayu)* by the javascript code. Solr
threw the exception
Cannot parse
I would use facet queries:
facet.query=date:[NOW-1DAY TO NOW]
facet.query=date:[NOW-3DAY TO NOW]
facet.query=date:[NOW-7DAY TO NOW]
--
View this message in context:
http://lucene.472066.n3.nabble.com/Date-faceting-per-last-hour-three-days-and-last-week-tp3242364p3242574.html
Sent from the Solr
I think this is because ) is treated as a token delimiter. So (foo)bar is
treated the same as (foo) bar (that is, bar is treated as a separate word).
So (foo)* is really parsed as (foo) * and thus the * is treated as the
start of a new word.
-Michael
Hi,
We are in the process of putting up a new Lucene/Solr/PyLucene/OpenRelevance
website. You can see a preview at http://lucene.staging.apache.org/lucene/.
It is more or less a look and feel copy of Mahout and Open For Biz websites.
This new site, IMO, both looks better than the old one
Well, it depends (tm).
If you're talking about *indexed* terms, then the value is stored only
once in both the cases you mentioned below. There's really very little
difference between a non-multi-valued field and a multi-valued field
in terms of how it's stored in the searchable portion of the
With the first option you can be page specific in your search results
and searches.
Field collapsing/grouping will help with your normalisation issue.
(what you have listed is different to what I listed you don't have a
unique key)
Option 2 means you loose any ability to reference page, but as
Thanks,
for this quick and enlightening answer!
I didn't consider that a Transformer can create new columns. In combination
with dynamic fields it is exactly what I was looking for.
Thanks James ^^
-Ursprüngliche Nachricht-
Von: Dyer, James [mailto:james.d...@ingrambook.com]
Please look more carefully at the documentation for WDDF,
specifically:
split on intra-word delimiters (all non alpha-numeric characters).
WordDelimiterFilterFactory will always throw away non alpha-numeric
characters, you can't tell it do to otherwise. Try some of the other
tokenizers/analyzers
Could you tell us what you're trying to achieve with the range query ?
It's not clear.
-Simon
On Wed, Aug 10, 2011 at 5:57 AM, Amit Sawhney sawhney.a...@gmail.com wrote:
Hi All,
I am trying to sort the results on a unix timestamp using this query.
I meant the frange query, of course
On Wed, Aug 10, 2011 at 10:21 AM, simon mtnes...@gmail.com wrote:
Could you tell us what you're trying to achieve with the range query ?
It's not clear.
-Simon
On Wed, Aug 10, 2011 at 5:57 AM, Amit Sawhney sawhney.a...@gmail.com wrote:
Hi All,
I am
This is expected behavior. You might be optimizing
your index on the master after every set of changes,
in which case the entire index is copied. During this
period, the space on disk will at least double, there's no
way around that.
If you do NOT optimize, then the slave will only copy changed
Well, if you really want to you can specify start=0 and rows=1 and
get them all back at once.
You can do page-by-page by incrementing the start parameter as you
indicated.
You can keep from re-executing the search by setting your queryResultCache
appropriately, but this affects all searches
Worth remembering there are some performance penalties with deep
paging, if you use the page-by-page approach. may not be too much of a
problem if you really are only looking to retrieve 10K docs.
-Simon
On Wed, Aug 10, 2011 at 10:32 AM, Erick Erickson
erickerick...@gmail.com wrote:
Well, if
I would imagine the performance penalties with deep paging will ALSO be there
if you just ask for 1 rows all at once though, instead of in, say, 100 row
paged batches. Yes? No?
-Original Message-
From: simon [mailto:mtnes...@gmail.com]
Sent: Wednesday, August 10, 2011 10:44 AM
To:
Hi - I'm trying to do a (I think) simple facet query, but I'm not getting the
results I expect. I have a field, MyField, and I want to get facets for
specific values of that field. That is, I want a FacetField if MyField is
ABC, DEF, etc. (a specific list of values), but not if MyField is any
when you say queryResultCache, does it only cache n number of result for the
last one query or more than one queries?
On 10 August 2011 20:14, simon mtnes...@gmail.com wrote:
Worth remembering there are some performance penalties with deep
paging, if you use the page-by-page approach. may not
Okay, with this command it hangs.
Also: I managed to get a Thread Dump (attached).
regards
Am 05.08.2011 15:08, schrieb Yonik Seeley:
On Fri, Aug 5, 2011 at 7:33 AM, alexander sulza.s...@digiconcept.net wrote:
Usually you get a XML-Response when doing commits or optimize, in this case
I get
Hi,
Apologies if this is really basic. I'm trying to learn how to create a
custom request handler, so I wrote the minimal class (attached), compiled
and jar'd it, and placed it in example/lib. I added this to solrconfig.xml:
requestHandler name=/flaxtest class=FlaxTestHandler /
When I
Oops. I think I found it. My desiredFieldValues list has the wrong info. Knew
there was something simple wrong.
From: Simon, Richard T
Sent: Wednesday, August 10, 2011 10:55 AM
To: solr-user@lucene.apache.org
Cc: Simon, Richard T
Subject: Building a facet query in SolrJ
Hi - I'm trying to do a
On Wed, Aug 10, 2011 at 11:00 AM, alexander sulz a.s...@digiconcept.net wrote:
Okay, with this command it hangs.
It doesn't look like a hang from this thread dump. It doesn't look
like any solr requests are executing at the time the dump was taken.
Did you do this from the command line?
curl
Consider putting a cache (memcached, redis, etc) *in front* of your
solr slaves. Just make sure to update it when replication occurs.
didier
On Tue, Aug 9, 2011 at 6:07 PM, arian487 akarb...@tagged.com wrote:
I'm wondering if the caches on all the slaves are replicated across (such as
Hi all-
I apologize in advance if this turns out to be a problem between the keyboard
and the chair, but I'm confused about why my date field is correct in the
index, but wrong in SolrJ.
I have a field defined as a date in the index:
field name=FILE_DATE type=date indexed=true
The Date difference is coming because of different time zones.
In Solr the date is stored as Zulu time zone and Solrj is returning date in
CDT timezone (jvm is picking system time zone.)
date name=FILE_DATE2002-05-13T00:00:00Z/date
I get:
-- Sun May 12 19:00:00 CDT 2002
Ah, great! I knew the problem was between the keyboard and the chair. Thanks!
-Original Message-
From: Sethi, Parampreet [mailto:parampreet.se...@teamaol.com]
Sent: Wednesday, August 10, 2011 10:25 AM
To: solr-user@lucene.apache.org
Subject: Re: Dates off by 1 day?
The Date difference
Th attachment isn't showing up (in gmail, at least). Can you inline
the relevant bits of code ?
On Wed, Aug 10, 2011 at 11:05 AM, Tom Mortimer t...@flax.co.uk wrote:
Hi,
Apologies if this is really basic. I'm trying to learn how to create a
custom request handler, so I wrote the minimal class
Sure -
import org.apache.solr.request.SolrQueryRequest;
import org.apache.solr.response.SolrQueryResponse;
import org.apache.solr.handler.RequestHandlerBase;
public class FlaxTestHandler extends RequestHandlerBase {
public FlaxTestHandler() { }
public void
You can use solr.LowerCaseFilterFactory in an analyser chain for both
indexing and queries. The schema.xml supplied with example has several field
types using this (including text_general).
Tom
On 10 August 2011 16:42, nagarjuna nagarjuna.avul...@gmail.com wrote:
Hi please help me ..
Very well explained. Thanks. Yes, we do optimize Index before replication. I
am not particularly worried about disk space usage. I was more curious of
that behavior.
*Pranav Prakash*
temet nosce
Twitter http://twitter.com/pranavprakash | Blog http://blog.myblive.com |
Google
The site looks great. And thank you for including the ManifoldCF link. ;-)
Karl
-Original Message-
From: ext Grant Ingersoll [mailto:gsing...@apache.org]
Sent: Wednesday, August 10, 2011 10:09 AM
To: solr-user@lucene.apache.org; java-u...@lucene.apache.org
Subject: [Help Wanted]
Looks nice! Font seems too light to read with comfort though.
Hi,
We are in the process of putting up a new
Lucene/Solr/PyLucene/OpenRelevance website. You can see a preview at
http://lucene.staging.apache.org/lucene/. It is more or less a look and
feel copy of Mahout and Open For Biz
It's working for me. Compiled, inserted in solr/lib, added the config
line to solrconfig.
when I send a /flaxtest request i get
response
lst name=responseHeader
int name=status0/int
int name=QTime16/int
/lst
str name=FlaxTestHello!/str
/response
I was doing this within a core defined in
Hi,
I've noticed poor performance for my solr queries in the past few days.
Queries of that type :
http://server:5000/solr/select?q=story_search_field_en:(water boston) OR
story_search_field_fr:(water boston)rows=350start=0sort=r_modify_date
Interesting.. is this in trunk (4.0)? Maybe I've broken mine somehow!
What classpath did you use for compiling? And did you copy anything other
than the new jar into lib/ ?
thanks,
Tom
On 10 August 2011 18:07, simon mtnes...@gmail.com wrote:
It's working for me. Compiled, inserted in
I take it back. I didn't find it. I corrected my values and the facet queries
still don't find what I want.
The values I'm looking for are URIs, so they look like: http://place.org/abc/def
I add the facet query like so:
query.addFacetQuery(MyField + : + \ + uri + \);
I print the query, just
This is in trunk (up to date). Compiler is 1.6.0_26
classpath was
dist/apache-solr-solrj-4.0-SNAPSHOT.jar:dist/apache-solr-core-4.0-SNAPSHOT.jar
built from trunk just prior by 'ant dist'
I'd try again with a clean trunk .
-Simon
On Wed, Aug 10, 2011 at 1:20 PM, Tom Mortimer t...@flax.co.uk
Off the top of my head ...
Can you tell if GC is happening more frequently than usual/expected ?
Is the index optimized - if not, how many segments ?
It's possible that one of the shards is behind a flaky network connection.
Is the 10s performance just for the Solr query or wallclock time at
Hello
So, I'm a newbie to Solr and Tika and whatnot, so please use simple words
for me :P
I am running Solr on Tomcat 7 on Windows Server 2008 r2, running as the
search engine for a Drupal web site.
Up until recently, everything has been fine - searching works, faceting
works, etc.
Recently a
Thanks Simon. I'll try again tomorrow.
Tom
On 10 August 2011 18:46, simon mtnes...@gmail.com wrote:
This is in trunk (up to date). Compiler is 1.6.0_26
classpath was
dist/apache-solr-solrj-4.0-SNAPSHOT.jar:dist/apache-solr-core-4.0-SNAPSHOT.jar
built from trunk just prior by 'ant dist'
Try making your queries, manually, to see this closer in action...
q=MyField:uri and see what you get. In this case, because your URI contains
characters that make the default query parser unhappy, do this sort of query
instead:
{!term f=MyField}uri
That way the query is parsed properly
Hi -- I do get facets for all the values of MyField when I specify the facet
field, but that's not what I want. I just want facets for a subset of the
values of MyField. That's why I'm trying to use the facet queries, to just get
facets for those values.
-Rich
-Original Message-
Thanks Simon for these tracks.
Here's my answers :
Can you tell if GC is happening more frequently than usual/expected ?
GC is OK.
Is the index optimized - if not, how many segments ?
According to the statistics page from the admin :
One shard (master/slave) has 10 segments
The other shard
I would like to combine the ShingleFilterFactory with a SynonymFilterFactory in
a field type.
I've looked at something like this using the analysis.jsp tool:
fieldType name=TestTerm class=solr.TextField
positionIncrementGap=100
analyzer type=index
tokenizer
: custom request handler, so I wrote the minimal class (attached), compiled
: and jar'd it, and placed it in example/lib. I added this to solrconfig.xml:
that's the crux of hte issue.
example/lib is where the jetty libraries live -- not solr plugins.
you should either put your custom jar's in
Hi Jeff,
Hi Jeff,
You have configured ShingleFilterFactory with a token separator of , so e.g.
International Corporation will output the shingle InternationalCorporation.
If this is the form you want to use for synonym matching, it must exist in
your synonym file. Does it?
Steve
: query.addFacetQuery(MyField + : + \ + uri + \);
...
: But when I examine queryResponse.getFacetFields, it's an empty list, if
facet.query constraints+counts do not come back in the facet.field
section of hte response. they come back in the facet.query section of
the response (look
If I were to build a master with multiple slaves, is it possible to promote
a slave to be the new master if the original master fails? Will all the
slaves pickup right where they left off, or any time the master fails will
we need to completely regenerate all the data?
If this is possible, are
Hi, Guys,
Based on the document below, I should be able to include a file under the
same directory by specifying relative path via xinclude in solrconfig.xml:
http://wiki.apache.org/solr/SolrConfigXml
However I am getting the following error when I use relative path (absolute
path works fine
Sorry for the spam. I just figured it out. Thanks.
On Wed, Aug 10, 2011 at 2:17 PM, Way Cool way1.wayc...@gmail.com wrote:
Hi, Guys,
Based on the document below, I should be able to include a file under the
same directory by specifying relative path via xinclude in solrconfig.xml:
Hi Steven,
The token separator was certainly a deliberate choice, are you saying that
after applying shingles, synonyms can only match shingled terms? The term
analysis suggests the original tokens still exist.
You've made me realize that only certain synonyms seem to have problems though,
Hello, all!
I want to create a good DIH configuration for my Oracle database with deltas
support. Unfortunately I am not able to do it well as DIH has the strange
restrictions.
I want to explain a problem on a simple example. In a reality my database
has very difficult structure.
Initial
Hi,
I have been trying to increase the size of the highlight snippets using
hl.fragSize parameter, without much success. It seems that hl.fragSize is
not making any difference at all in terms of snippet size.
For example, compare the following two set of query/results:
Yes you can promote a slave to be master refer
http://wiki.apache.org/solr/SolrReplication#enable.2BAC8-disable_master.2BAC8-slave_in_a_node
In AWS one can use an elastic IP(http://aws.amazon.com/articles/1346) to
refer to the master and this can be assigned to slaves as they assume the
role of
an hl.fragsize of 1000 is problematical, as Solr parses that
parameter as a 32 bit int... that's several bits more.
-Simon
On Wed, Aug 10, 2011 at 4:59 PM, Sang Yum sang...@gmail.com wrote:
Hi,
I have been trying to increase the size of the highlight snippets using
hl.fragSize
I was just trying to set it a ridiculously large number to make it work.
What I am seeing is that hl.fragsize doesn't seem to make any difference in
term of highlight snippet size... I just tried the query with hl.fragsize
set to 1000. Same result as 10.
On Wed, Aug 10, 2011 at 2:20 PM, simon
Thanks for the advice paul, but post processing is a must for me given the
nature of my application. I haven't had problems yet though.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Cache-replication-tp3240708p3244202.html
Sent from the Solr - User mailing list archive
After some further playing around, I think I understand what's going on.
Because the SynonymFilterFactory pays attention to term position when it
inserts a multi-word synonym, I had assumed it scanned for matches in a way
that respected term position as well. (ie, for a two-word synonym, I
Well, only after I posted this question in a public forum, I found the cause
of my problem. I was using hl.fragSize, instead of hl.fragsize. After
correcting the case, it worked as expected.
Thanks.
On Wed, Aug 10, 2011 at 3:19 PM, Sang Yum sang...@gmail.com wrote:
I was just trying to set it
On Wed, Aug 10, 2011 at 7:10 PM, Jeff Wartes jwar...@whitepages.com wrote:
After some further playing around, I think I understand what's going on.
Because the SynonymFilterFactory pays attention to term position when it
inserts a multi-word synonym, I had assumed it scanned for matches in a
Whenever I try to build this on our hudson server it says it can't find
org.apache.lucene:lucene-xercesImpl:jar:4.0-SNAPSHOT. Is the Apache repo
lacking this artifact?
--
View this message in context:
http://lucene.472066.n3.nabble.com/Hudson-build-issues-tp3244563p3244563.html
Sent from the
Hi,
We are doing streaming update to solr for multiple user,
We are getting
Aug 10, 2011 11:56:55 AM org.apache.solr.common.SolrException log
SEVERE: org.apache.lucene.store.LockObtainFailedException: Lock obtain timed
out: NativeFSLock@/var/lib/solr/data/index/write.lock
at
Do you really want a search on ipad to *fail* to match input of #ipad?
Or
vice-versa?
My requirement is : I want to search both '#ipad' and 'ipad' for q='ipad'
BUT for q='#ipad' I want to search ONLY '#ipad' excluding 'ipad'.
On 10 August 2011 19:49, Erick Erickson erickerick...@gmail.com
75 matches
Mail list logo