Chris Hostetter wrote:
: However, I'd like to hear a comment on the approach of doing the parsing
: using Lucene and then constructing a SolrQuery from a Lucene Query:
I believe you are asking about doing this in the client code? using the
Lucene QueryParser to parse a string using an analyzer,
NO I actually Bothered About the QueryTime(Qtime which shows in the solr
Log).
Taking around 4-5 secs for each Query which results 18 Lacks Records After i
added the "fl" parameter to fecth The required Fields.
I want to Reduce this Qtime further.
Iam using solr 1.3 version with multicore ( whi
Hi all,
I am forming a query to boost a certain ids, the list of ids can go till
2000 too. I am sometimes getting the error for too many clauses in the
boolean query and otherwise i am getting a null page. Can you suggest any
config changes regarding this.
I am using solr 1.3.
Regards,
Pooja
Hi Rama,
What field types are these Title and Description?
You may go to SOLR admin console and try "Analysis", and select the field type
that you have used for Title and Description and provide those words Prepaid
and Postpaid in the indexing analyzer and see how is it storing the information.
I have indexed person names in solr using synonym expansion and am getting a
match when I explicitly use that field in my query (name:query). However,
when I copy that field into another field using copyfield and search on that
field, I don't get a match. Below are excerpts from schema.txt. I am n
You are absolutely right. The fields have trailing spaces in it. Thanks Erick
for your time. Really appreciated!
Thanks,
Alex
On May 12, 2010, at 8:29 PM, Erick Erickson wrote:
> Click the "schema browser" link on the admin page.
> On the next page click
> the "fields" link, then the field in
Found the problem! It's because all the values in productType field have
trailing spaces in it like this: "ProductBean ". Thanks Hoss for your
suggestion of using Luke query which exposed the problem.
You guys are awesome!
Thanks,
Alex
On May 12, 2010, at 10:12 PM, Alex Wang wrote:
Sorr
Sorry please discard my query results here, because I was playing with the
field type and changed it to "text" from "string" and forgot to change it back.
I will change it back to "string" and post the query results shortly.
I apologize for the careless mistake.
Thanks,
Alex
On May 12, 2010,
Anyone know of any way to accomplish (or at least simulate) this?
Thanks again
--
View this message in context:
http://lucene.472066.n3.nabble.com/MLT-Boost-Function-tp811227p813982.html
Sent from the Solr - User mailing list archive at Nabble.com.
Thanks Hoss. Please see the query results as follows:
>
> :
> :
> :
> : productType:ProductBean
> :
> :
>
> ...can you please disable the QueryElevationComponent and see if that
> changes things?
>
> : productType:ProductBean
>
> What are the numFound values for these queries? (bot
Hi,
Thanks for your response. Attached are the Schema.xml and sample docs
that were indexed. The query and response are as below. The attachment
Prodsku4270257.xml has a field "paymenttype" whose value is 'prepaid'.
query:
q=prepaid&start=0&rows=10&fl=*%2Cscore&qt=standard&wt=json&debugQ
Hmmm, there's not much information to go on here.
You might review this page:
http://wiki.apache.org/solr/UsingMailingLists
and post with more information. At minimum,
the field definitions, the query output (include
&debugQuery=on), perhaps what comes out
of the analysis admin page for both indexi
Click the "schema browser" link on the admin page.
On the next page click
the "fields" link, then the field in question.
But first I'd do whatever Chris suggested.
BTW, the field definition you pasted isn't the one that
really counts here, "fieldtype" is the one that does, but in this case
th
Hi,
We are trying to use SOLR for searching our catalog online and during QA
came across a interesting case where SOLR is not returning results that it
should.
Specificially, we have indexed things like "Title" and "Description", of the
words in the Title happens to be "Prepaid' and "Postpa
If you do go this route, I'd use something besides a colon, it's too easily
confused with a field delimiter. That's just *asking* for trouble...
Any non-alpha character you use will cause some grief if you choose an
incompatible Analyzer. Even upper/lower case can be split in ways
you wouldn't wan
:
:
:
: productType:ProductBean
:
:
...can you please disable the QueryElevationComponent and see if that
changes things?
: productType:ProductBean
What are the numFound values for these queries? (both with and w/o
QueryElevationComponent) ...
/admin/select?q=productType:[*+T
: pf has the format as qf.
:
: titlePhrase^10.0
...uh, i'm not sure what that xml syntax is suppose to convey, but if you
are putting it in a solrconfig file as a default it would be...
titlePhrase^10.0
Note also that Dickens qf is also malformed...
: > 'pSubject^2 pContent^1.5'
...thos
In our deployment, we thought that complications might arise when attempting
to hit the Solr server with addresses of too many cores. For instance, we
have 15+ cores running at the moment. At the worst case, we will have to use
all 15+ addresses of all the cores to search all our data. What we
eve
: On the wiki, I've read something about MediaWiki indexation:
: http://wiki.apache.org/solr/DataImportHandler#Example:_Indexing_wikipedia
...
: I do not know MW database very well, but should the schema be really as
: simple as the one given on the above page?
thta example demonstrates i
: I got a fundamental understanding question that Mike's posting did not
: answer:
: You say "q=apple iPhone & qf=title^5 manufacturer & mm=100%" is correct.
: That means:
: title: "iphone" -> matches "iphone" but not "apple"
: manufacturer: "apple" -> matches "apple" but not "iphone"
: According
Hi Lance,
On Wed, May 12, 2010 at 11:48 AM, Lance Norskog wrote:
> The core name is set in solr.xml.
>
Ah. Ok. I'll look into that.
>
> Start with the example/multicore directory in the solr distribution.
> This shows how to set up multiple cores.
>
Do I need to set up multiple cores to use
: However, I'd like to hear a comment on the approach of doing the parsing
: using Lucene and then constructing a SolrQuery from a Lucene Query:
I believe you are asking about doing this in the client code? using the
Lucene QueryParser to parse a string using an analyzer, then toString'ing
that
: Does anyone know if there is any way to create a new Core with specified
: properties or to alter and reload Core Properties for a Core without
: restarting the service?
:
: I tried to do this in three steps:
:
: 1) Create a new core;
:
: 2) Edit solr.xml directly to add propertie
Hi Ahmed,
Interesting, I did not think of a multi-core approach.
I am not sure, but we might have upto 10 different kinds of data to contend
with like property, pets, farming, electronics, travel, auto, jobs, sport
etc that might complicate things.
Also, one practical limitation we have, is that
: Is there some way to override the data directory in the Tomcat context file?
i don't believe so.
work was done a long time ago to support system property substitution when
the solrconfig.xml file is loaded, but i don't think that was ever
generalized to support JNDI values as well (which is
: this test fails for requests built from a SimpleRequestParser or
: StandardRequestParser where the parameter "key" was given, but empty ( e.g.
: localhost:8393/select/?key=¶1=val1&parm2=val2 ).
:
: The reason is that oas.request.ServletSolrParams returns null for values with
: length() == 0,
:
I had the same problem as you last year, i.e. indexing stuff from different
sources with different characteristics. The way I approached it is by
setting up a multi-core environment, with each core representing one type of
data. Within each core, I had a "data type" sort of field that would define
Because leading negative clauses don't work. The (*:* AND x) syntax
means "select everything AND also select x".
You could also do
(+category:xyz +price:[100 TO *]) -category:xyz
On Tue, May 11, 2010 at 12:36 PM, Satish Kumar
wrote:
> thanks Ahmet.
>
> (+category:xyz +price:[100 TO *]) (+*:* -ca
The core name is set in solr.xml.
Start with the example/multicore directory in the solr distribution.
This shows how to set up multiple cores.
Also, spaces in URLs are translated as + signs, and maybe translated
back. People generally use alphanumeric and underscore names for
cores; these work b
Hi Erick,
Thank your for your thoughts,
I had exactly the same idea like your screenLCD suggestion (but with a
semicolon)
For example:
range1 range2 range3 range_flagsproperties
890 2001 range1;km, range2;kw, range3;year group;auto,
make;audi, model;a4, col
Erick,
I tried the lower case search (productType:productbean) and I did not get any
results either. Luke shows that "ProductBean" for the field, not sure whether
it's indexed term or stored term. Does this mean that this is not a case issue?
Here is the field definition in the schema:
Thank
dont know if its the best solution but i have a field i facet on
called type its either 0,1, combined with collapse.facet=before i just
sum all the values of the facet field to get the total number found
if you dont have such a field u can always add a field with a single value
--joe
On Wed, May
We're running into an out of memory problem when sending a large file to our
SOLR server using the ContentStreamUpdateRequest. It appears that this
happens because when the request method of CommonsHttpSolrServer is called
(this is called even when using a StreamingUpdateSolrServer instance becaus
Hi, fellows!
I use field collapsing to collapse near-duplicate documents based on
document fuzzy signature calculated at index time.
The problem is that, when field collapsing is enabled, in query
response numFound is equal to the number of rows requested.
For instance, with solr example schema i
Hi
I ran into a replication issue yesterday and I have no explanation for it. I
see the following in my logs:
SEVERE: Unable to move index file from:
/my/dir/Solr/data/property/index.20100511050029/_3zj.fdt to:
/my/dir/Solr/data/property/index.20100511042539/_3zj.fdt
I restarted the subscriber th
Not til this evening, don't have a handy SOLR implementation to ping...
But another option is to get a copy of Luke and look at the index, but the
same caution
about seeing terms not stored data holds.
Or you could just try your troublesome search with all lower case for your
term (not field)
and
Hello,
I am not reusing the context object. The remaining part of the code takes in
a "Blob" object, converts it to a FileInputStream, and reads the contents
using PDFBox. It does not deal with anything related to Solr.
The Transformer doesn't even execute the remaining part of the code. It
does
Is the cleanup of indexes using Solr 1.4 Replication documented
somewhere? I can't find any information regarding this at:
http://wiki.apache.org/solr/SolrReplication
Too many snapshot indexes are being left around, and so they need to
be cleaned up.
Sorry Erick, can you tell me how to find the raw *indexed* terms from the admin
console? I am not familiar with the admin console.
Thanks,
On May 12, 2010, at 10:18 AM, Erick Erickson wrote:
> Hmmm, nothing looks odd about that, except perhaps the casing. If you use
> the admin
> console to loo
I'm not sure I understand how your results are truncated. They both find 21502
documents. The fact that you are sorting on '_erstelldatum' ascending and not
seeing any results for that field on the first page leads me to think that you
have 'sortMissingLast="false"' on that field's fieldType. In
Are u reusing the context object? It may help if u can paste the relevant
part of ur code
On 10 May 2010 19:03, "ahammad" wrote:
I have a Solr core that retrieves data from an Oracle DB. The DB table has a
few columns, one of which is a Blob that represents a PDF document. In order
to retrieve
Hmmm, nothing looks odd about that, except perhaps the casing. If you use
the admin
console to look at the raw terms, is productbean mixed case or all lower
case? If the
latter, that would explain things
Be a bit cautious because if you look at the *stored* data it will be in
mixed case, but
t
I'm not entirely sure this is germane, but there's absolutely no requirement
that
all documents in SOLR have the same fields. So it's possible for you to
index
the "wildly different content" in "wildly different fields" . Then
searching for
screen:LCD would be straightforward. Of course this may no
Mike,
This only happens when I attempt to do a delta-import without first deleting
the index dir before doing a full-index.
For example these will work correctly.
1) Delete /home/corename/data
2) Full-Import
3) Delta-Import
However I attempt to do the following, it will result in an error
1)
Gary:
If you could make a patch out of your changes that would be great. The dev
list would be
a good place to get it some attention. There's no need to open a JIRA before
posting
to the dev list, but one should be created if there's agreement that this is
something
that should be fixed.
Best
Eri
Well, you haven't identified where the time is actually being taken up, so
suggesting
config settings isn't easy. Is the actual search taking a long time or is
the time
spent packaging up the return? Your SOLR logs will help you narrow this
down, you
might also get some insight by trying your troub
Hello,
I was doing some more testing but I could not find a definitive reason for
this behavior. The following is my transformer:
public Map transformRow(Map row, Context
context) {
List> fields = context.getAllEntityFields();
for (Map field : fields)
sorry, the problem was sitting in front of the monitor!
it is not an error or something, i forgot that some documents didn't have all
fields filled,
so its absolutly normal, that not all fields were in the result.
markus
> -Ursprüngliche Nachricht-
> Von: markus.rietz...@rzf.fin-nrw
Jira ticket added https://issues.apache.org/jira/browse/SOLR-1908
Wiki entry updated http://wiki.apache.org/solr/Deduplication
On Tuesday 11 May 2010 17:24:10 Mark Miller wrote:
> 1. You need to set the sig field to indexed.
> 2. This should be added to the wiki
> 3. Want to make a JIRA issue? Thi
when i do a search, eg.
http://xxx:8983/solr/select?q=steuer&fl=score,id,__intern,title,__source,_dienststelle,_erstelldatum,__cyear,_stelle
i get a normal result, like
1.3633566
2009
0
zzz
xyz
2009020200
Presse- u. Informationsreferat
34931684
Merkblatt Vereine und Steuern
when i do a
okay i found it.
Hi,
On 07.05.2010 22:47, Chris Hostetter wrote:
so it's the full request time, and would be inclusive of any postCommit
event handlers -- that's important to know. the logs will help clear up
wether the underlying "commit" is really taking up a large amount of time
or if it's some postCommit ev
Sorry the previous copy & paste of the output seems to be messed up. Hope this
one is better:
0
15
true
on
0
productType:ProductBean
10
2.2
productType:ProductBean
productType:ProductBean
productType:ProductBean
productType:ProductBean
productType:ProductBean
Thanks Erick for your response. Here is the debugQuery output:
-<%5Cl%20%22%22>
-<%5Cl%20%22%22>
0
15
-<%5Cl%20%22%22>
true
on
0
productType:ProductBean
10
2.2
-<%5Cl%20%22%22>
-<%5Cl%20%22%22>
productType:ProductBean
productType:ProductBean
productType:ProductBean
i only find $docBoost
http://wiki.apache.org/solr/DataImportHandler?highlight=(docboost)
but i dont understand how to use this and where ???
pls help ;)
--
View this message in context:
http://lucene.472066.n3.nabble.com/Boost-at-IndexTime-for-special-docs-tp812277p812359.html
Sent from the
please help me.. I am stuck..
I tried different way using terminal but still not finding way to patch...
-- Forwarded message --
From: Jonty Rhods
Date: Wed, May 12, 2010 at 11:56 AM
Subject: Re: how to patch solr-236 in mac os
To: solr-user@lucene.apache.org
I tried to patch
Hello..
how can i boost docs where the field "is_highlight = 1" in my DIH ??
can i put an if-block into the DIH ?
i found not so much about this .. =(
th
--
View this message in context:
http://lucene.472066.n3.nabble.com/Boost-at-IndexTime-for-special-docs-tp812277p812277.html
Sent from th
Hi Aditya,
Thanks for your response.
Yes, a category type would be needed.
One thing I am not clear about,
If you have multi-values like "toshiba, tecra, LCD"
it is then clear that you can run solr queries like:
fq=mymultivaluefield:LCD
but for associated fields like:
"make=toshiba, model=te
You should do a preprocessing(multiply your document as many documents as
values you have in your multivalue field, with the principalFlag:T in your
first document) before you indexing the data with that logic
Marco Martínez Bautista
http://www.paradigmatecnologico.com
Avenida de Europa, 26. Ática
Hi Eric
I catch the NPE in the NonAdjacentDocumentCollapser class and now it does
return the data field collapsed.
However I can not promise how accurate or correct this fix is becuase I have not
got allot of time to study all the code.
It would be best if some of the experts could give us a
hi Marco,
Thanks for quick reply..
I have another doubt: In 2nd solution: How to set flag for duplicate value.
because I am not sure about the no fo duplicate rows (it could be random
no..)
so how can I set the flag..
thank
On Wed, May 12, 2010 at 12:59 PM, Marco Martinez <
mmarti...@paradigmatec
Hi,
2º solution:
Not use multiValue fields, instead use two single fields, in your example
will be:
doc1:
dept: student1
city: city1
principalFlag:T
doc2:
dept: student2
city: city2
principalFlag:F
So, if you search without specify any city or dept, you should put
princiaplFlag:T for no get dup
Hello Eric,
Certainly it is possible. I would strongly advice to have field which
differentiates the record type (RECORD_TYPE:"CAR" / "PROPERTY").
>>In general I was also wondering how Solr developers implement websites
that
uses tag filters.For example, a user clicks on "Hard drives" then get ta
Hi Marco,
I am trying to patch for collapse component support (till now no luck)..
In mean time I would like to know the 2nd and 3rd option you mentioned
(logic in solrj)..
with regards
On Thu, May 6, 2010 at 2:36 PM, Marco Martinez <
mmarti...@paradigmatecnologico.com> wrote:
> Hi Jonty,
>
> I
Hi,
You can use localsolr (http://www.gissearch.com/localsolr) that supports
sharding if you need this feature.
Marco Martínez Bautista
http://www.paradigmatecnologico.com
Avenida de Europa, 26. Ática 5. 3ª Planta
28224 Pozuelo de Alarcón
Tel.: 91 352 59 42
2010/5/11 Jean-Sebastien Vachon
Hallo Solr community,
We are considering Solr for searching on content from various partners
with wildly different content.
Is it possible or practical to work with multi-valued associated fields like
this?
"Make:Audi, Model:A4, Color:Blue, Year:1998, KM:20, Extras:GPS"
"Type:Flat, Rooms:2, P
66 matches
Mail list logo