You should use file instead of fileName in column
field column=file name=fileName/
Don't forget to add the 'fileName' to the schema.xml in the fields section.
field name=fileName type=string indexed=true stored=true /
Have fun,
Daniel Rijkhof
06 12 14 12 17
On Mon, May 16, 2011 at 4:20
: query that in fact returns the negative results. As a simple example,
: I believe that, for a boolean field, -field:true is exactly the same as
: +field:false, but the former is a negative query and the latter is a
that's not strictly true in all cases...
* if the field is
Hi,
We are moving to a multi-core Solr installation with each of the core having
millions of documents, also documents would be added to the index on an
hourly basis. Everything seems to run find and I getting the expected
result and performance, except where sorting is concerned.
I have
Explicit Warming of Sort Fields
If you do a lot of field based sorting, it is advantageous to add explicitly
warming queries to the newSearcher and firstSearcher event listeners in
your solrconfig which sort on those fields, so the FieldCache is populated
prior to any queries being executed by
Hi,
We are trying build enterprise search solution using SOLR , out data source
is Database which is interfaced with JPA.
Solution looks like
SOLR INDEX JPA Oracle database.
We need help to findout what is the best approch integrate Solr Index with
JPA.
We tried out two appoches
Approch
Hi,
The query is generated dynamically and can be more or less complex depending on
different parameters. I'm also not free to give many details of our
implementation, but I'll give you the minimal query string that fails and the
relevant pieces of the config.
The query string is:
Hi,
How do I write/build a Solr query using dismax handler for my application
specific use case explained below:
Snippet of fields definition from schema.xml:
field name=documentid type=string indexed=true stored=true
required=true /
field name=companyid type=long indexed=true stored=true
Thanks for pointing me in the right direction, now I see the configuration
for firstsearcher or newsearcher, the str name=q needs to configured
previously. In my case the q is every changing, users can actually search
for anything and the possibilities of queries unlimited.
How can I make this
Hi All!
Kindly provide me the links for suitable patches that are applied to
solr version 1.4.1 and 3.0 so that field collapsing should work properly.
Thanks in advance!
Isha garg
edismax supports full query format of lucene parser.But you can search using
filter queries eg.
qf=textfield1, textfield2, textfield3fq=textfield1:solr AND
strutsfq=textfield2:solr AND strutsfq=textfield3:solr AND struts
fq=companyid:100
-
Thanx:
Grijesh
www.gettinhahead.co.in
--
View
lookups to work with an arbitrary query, you would either need to changed
the cache structure from Query=DocSet to a mapping of
Query=[DocSet,inverseionBit] and store the same cache value needs needs
with two keys -- both the positive and the negative; or you keep the
Well, I don't know
By the way, I was wrong when saying that using bf instead of !boost did not
work either. I probably hit more than one problem at the same time when I first
tested that.
I've retested now and this works:
/select?q=+id:12345^0.01 +_query_:{!dismax
Hello,
Does anyone know if there is a v 3.1 book coming any time soon?
Regards,
Savvas
Thank you Gora in advance!
However, I decided to create a bean for indexing something like that:
...
String[] vertices
String[] edges
int[] triple_inx_levels
...
So I can search in vertices text edge text in vertices edges array
fields, and I hope to find the relation from triple_inx_levels
Hello!
Take a look at the Solr resources page on the wiki
(http://wiki.apache.org/solr/SolrResources).
--
Regards,
Rafał Kuć
http://solr.pl
edismax supports full query format of lucene parser.But you can search
using
filter queries eg.
qf=textfield1, textfield2, textfield3fq=textfield1:solr AND
strutsfq=textfield2:solr AND strutsfq=textfield3:solr AND struts
fq=companyid:100
Is it not possible to build query without filter
Hi All,
I am having some problems with the presence of unnecessary parenthesis in my
query.
A query such as:
title:software AND (title:engineer)
will return no results. Remove the parenthesis fix the issue but then since my
user can enter the parenthesis by himself I need to
great, thanks!
So, I guess the Solr In Action and Solr Cookbook will be based on 3.1..
:)
2011/5/19 Rafał Kuć ra...@alud.com.pl
Hello!
Take a look at the Solr resources page on the wiki
(http://wiki.apache.org/solr/SolrResources).
--
Regards,
Rafał Kuć
Hi Erick,
It is about ordering the facet information. The result set is empty via
rows=0.
Here is the logics and example:
Each doc has string field someStr and a date field associated with it, and
same doc id has same value of the date field. Question: is it possible to
sort the facet values
Well the good news is FuzzyQuery is indeed much faster in Lucene/Solr 4.0.
But the bad news is... FuzzyQuery won't do what you need here. You
need some sort of FuzzyPhraseQuery, which is able to replace terms
similar to one another (comp/company/corporation) by some metric. I
don't know of
The warming queries warm up the caches used in sorting. So
just including the sort=. will warm the sort caches. the terms
searched are not important. The same is true with facets...
However, I don't understand how that relates to your OOM problems. I'd
expect the OOM to start happening on
Here's the root issue, and all available patches:
https://issues.apache.org/jira/browse/SOLR-236
I confess I have no clue what's what here, so
you're largely on your own. There are some
encouraging titles (note you can sort the patches
by date, which might help in figuring out which
to use)..
How do you construct a query in java for spatial search ? not the default
solr REST interface
--
View this message in context:
http://lucene.472066.n3.nabble.com/Spatial-search-with-SolrJ-3-1-How-to-tp2961136p2961136.html
Sent from the Solr - User mailing list archive at Nabble.com.
Hi Erick,
My OOM problem starts when I query the core with 13217121 documents. My
schema and other details are given below,
1 how is your sort field defined? String? Integer? If it's a string and you
could change it to a numeric type, you'd use a lot less memory.
We primarily use two different
On Thu, May 19, 2011 at 8:52 AM, martin_groenhof
martin.groen...@yahoo.com wrote:
How do you construct a query in java for spatial search ? not the default
solr REST interface
It depends on what you are trying to do - a spatial request (as
currently implemented in Solr) is typically more than
You, or any other solr member, knows a good fuzzy string matching library to
recommend?
On Thu, May 19, 2011 at 9:39 AM, Michael McCandless
luc...@mikemccandless.com wrote:
Well the good news is FuzzyQuery is indeed much faster in Lucene/Solr
4.0.
But the bad news is... FuzzyQuery won't
Hi!
I would like to announce a white paper that describes the technical
details of Near Real Time implementation with Solr and the
RankingAlgorithm. The paper discusses the modifications made to enable NRT.
You can download the white paper from here:
The only two ways to influence facet order is by count and alphabetically.
facet.sort=index will sort by alpha, the default is facet.sort=count
All that said, I still don't quite understand what you're asking for. Facets
are simply a count of the documents that have unique values for, in your
Dmitry,
how should that work? Take a this short sample-data:
id | date
T-AS_1386229 | 1995-12-31T23:59:59Z
T-AS_1386181 | 1996-12-31T23:59:59Z
T-AS_1386229 | 1997-12-31T23:59:59Z
So, you'll have two facets for the ids .. but how should they be
sorted? One (of the two) is the first and the other
See below:
On Thu, May 19, 2011 at 9:06 AM, Rohit ro...@in-rev.com wrote:
Hi Erick,
My OOM problem starts when I query the core with 13217121 documents. My
schema and other details are given below,
H, how many cores are you running and what are they doing? Because they
all use the same
Hi,
Thanks for the questions, guys, and sorry for the confusion. I should start
with a broader picture of what we are trying to achieve. The only problem is
that I cannot speak about specifics of the task we are solving the way we
do. We currently sort the facets on the client side, having the
What version of Solr are you using? Because this works fine for me.
Could you attach the results of adding debugQuery=on in both instances?
The parsed form of the query is identical in 1.4.1 as far as I can tell. The bug
you're referencing is a peculiarity of the not (-) operator I think.
Best
I don't care about the method, I just want results within let's say 10km of a
lat,lng ?
(I can do this with REST) but don't know how to with a Java API
[code]SpatialOptions spatialOptions =
new SpatialOptions(company.getLatitude() + , +
company.getLongitude(),
Hey all!
I have a few questions concerning the field cache method for faceting.
The wiki says for enum method: This was the default (and only) method
for faceting multi-valued fields prior to Solr 1.4. . And for fc
method: This was the default method for single valued fields prior to
Solr
Hello
i want to index some datefields with this dateformat: -mm-dd. Solr
thwows an exception like this: can not be represented as java.sql.Date
i am unsing ...transformer=DateFormatTransformer
and ...zeroDateTimeBehavoir=convertToNull
how can i say to DIH to convert this fields in correct
On Thu, May 19, 2011 at 9:56 AM, Erik Fäßler erik.faess...@uni-jena.de wrote:
I have a few questions concerning the field cache method for faceting.
The wiki says for enum method: This was the default (and only) method for
faceting multi-valued fields prior to Solr 1.4. . And for fc method:
Oh, isn't that ducky. The facet.sort parameter only sorts ascending
as far as I can tell. Which is exactly the reverse of what you want.
Would it work to cleverly encode the facet field to do what you want
just by a lexical sort? Something like use a very large constant,
subtract the date for
Thanks Erick, this sounds solid to me!
It of course will require the repost of the entire index (pretty big one,
sharded), but that's not an issue as we periodically do that anyway.
Thanks and regards,
Dmitry
On Thu, May 19, 2011 at 5:08 PM, Erick Erickson erickerick...@gmail.comwrote:
Oh,
On 5/18/2011 9:07 PM, Chris Hostetter wrote:
You could implement a parser like that relatively easily -- just make sure
you put a MatchAllDocsQuery in every BooleanQuery object thta you
construct, and only ever use the PROHIBITED and MANDATORY clause types
(never OPTIONAL) ... the thing is, a
Try this in your query:
TIME_FORMAT(timeDb, '%H:%i') as timefield
http://www.java2s.com/Tutorial/MySQL/0280__Date-Time-Functions/TIMEFORMATtimeformat.htm
--
View this message in context:
This is more a speculation than direction, I don't currently use Field
Collapsing but my take on it is that it returns the number of docs
collapsed. So instead of faceting could you do a search returning DocID,
collapsing on DocID sorting on date, then the count of collapsed docs
*should* match
Hi,
1. Is it possible to produce the collapsed docs count in the same query?
2. What is the performance of Field Collapsing versus Facet Search?
Dmitry
On Thu, May 19, 2011 at 5:36 PM, kenf_nc ken.fos...@realestate.com wrote:
This is more a speculation than direction, I don't currently use
did you mean something like this ?
DATE_FORMAT(cp.field, '%Y-%m-%di %H:%i:%s') AS field ???
i think i need to add the timestamp to my date fields? or not ?
why cannot DIH handle with this ?
-
--- System
One Server, 12
Oooh, that's clever
The glitch is that field collapsing is scheduled for 3.2, but that
probably means
the patch is close to being applicable to 3.1 but I don't know that for sure.
Erick
On Thu, May 19, 2011 at 10:36 AM, kenf_nc ken.fos...@realestate.com wrote:
This is more a speculation
Offhand, I don't think the problem is DIH since your stack trace
specifies a SQL error. What is the SQL you're using? And
the DIH configuration?
Best
Erick
On Thu, May 19, 2011 at 10:53 AM, stockii stock.jo...@googlemail.com wrote:
did you mean something like this ?
DATE_FORMAT(cp.field,
Hi,
I'd like to announce the release of a new version of my Python-Solr
library, sunburnt:
http://pypi.python.org/pypi/sunburnt/0.5
Documentation and tutorial examples are available at:
http://opensource.timetric.com/sunburnt/
and there's a mailing list for discussion at
entity name=foo pk=cp_id transformer=DateFormatTransformer
query=SELECT ...,
...some fields ...
cp.start_date_1,
cp.start_date_2,
cp.end_date_1,
cp.end_date_2,
.. some other fields ..
FROM ...
/entity
okay, i found the problem.
i put the fields two times in my data-config ;-)
-
--- System
One Server, 12 GB RAM, 2 Solr Instances, 7 Cores,
1 Core with 31 Million Documents other Cores 100.000
- Solr1 for
Hm, anyone?
On Sat, May 14, 2011 at 7:11 PM, Stefan Matheis
matheis.ste...@googlemail.com wrote:
Hi Guys,
while working on the UI for Replication, i've got confused sometimes because
of the following response (from /replication?command=details):
?xml version=1.0 encoding=UTF-8?
response
Am 19.05.2011 16:07, schrieb Yonik Seeley:
On Thu, May 19, 2011 at 9:56 AM, Erik Fäßlererik.faess...@uni-jena.de wrote:
I have a few questions concerning the field cache method for faceting.
The wiki says for enum method: This was the default (and only) method for
faceting multi-valued fields
Hello,
We have configured solr for delta processing through DIH and we kick off the
index request from within a batch process.
However, we somehow need to know whether our indexing request succeeded or
not because we want to be able to rollback a db transaction if that step
fails.
By looking at
Hi all,
Based on advice I received on a previous email thread, I applied patch
https://issues.apache.org/jira/browse/SOLR-2338. My goal was to be able to
apply a similarity class to certain fields but not all fields.
I ran the following commands:
$ cd your Solr trunk checkout dir
$ svn up
$
Also, I've tried adding:
similarity class=org.apache.lucene.misc.SweetSpotSimilarity/
To the end of the schema file so that it is applied globally but it does not
appear to change the score either. What am I doing incorrectly?
Thanks,
Brian Lamb
On Thu, May 19, 2011 at 2:45 PM, Brian Lamb
I tried editing the SweetSpotSimilarity class located at
lucene/contrib/misc/src/java/org/apache/lucene/misc/SweetSpotSimilarity.java
to just return 1 for each function and the score does not change at all.
This has led me to believe that it does not recognize similarity at all. At
this point, all
: sort=query({!v=area_id: 78153}) desc, score desc
:
: What I want to achieve is sort by if there is a match with area_id, then
: sort by the actual score
I think you can use the map function here to map all scores greater then
zero (matching docs) to some fixed value. something like this
Hi,
I have a code logic to push documents to SOLR using SOLRj APIs.
Due to an error in schema, i get appropriate error in SOLR logs printed in
catalina.log inside tomcat. Here is a snippet:
SEVERE: org.apache.solr.common.SolrException: ERROR: multiple values
encountered for non multiValued copy
Newbie at SOLR,
When I ran through my test data config, it was able to find my 91 rows
sample test. However, it didn't add any into my index.
Can someone help me and tell me why?
Please find the data config below:
dataConfig
dataSource
: It is fairly simple to generate facets for ranges or 'buckets' of
: distance in Solr:
: http://wiki.apache.org/solr/SpatialSearch#How_to_facet_by_distance.
: What isnt described is how to generate the links for these facets
any query you specify in a facet.query to generate a constraint count
: Thanks for the reply. I'm at home right now, or I'd try this myself, but is
: the suggestion that two optimize() calls in a row would resolve the issue?
it might ... I think the situations in which it happens have evolved a bit
over the years as IndexWRiter has gotten smarter about knowing
What is JPA?
You are better off pulling from JPA yourself than coding with the
DataImportHandler. It will be much easier.
EmbeddedSolr is just like web solr: when you commit data it is on the
disk. If you crash during indexing, it may or may not be available to
commit. EmbeddedSolr does not do
Ahh, thanks. I might try a basic commit() then and see, although it's not a
huge deal for me. It occurred to me that two optimize() calls would probably
leave exactly the same problem behind.
On 20 May 2011 09:52, Chris Hostetter hossman_luc...@fucit.org wrote:
: Thanks for the reply. I'm at
Hi,
i make the same query to import my data with mysql and postgres.
But only postgres index all data (17090).
While Mysql index 17086, after 197085, after 17087... never 17090. But the
response tell me that it has skipped 0 documents. I don't understand!
Help me please, i woul to use Mysql for
Excuse me, i wrong to write 197085, correct is 17085. But never the same
count...
--
View this message in context:
http://lucene.472066.n3.nabble.com/Mysql-vs-Postgres-DIH-tp2963822p2963824.html
Sent from the Solr - User mailing list archive at Nabble.com.
thank you dan... i have checked the code that produces XML for solr and then
fixed nbsp problem
-
Zeki ama calismiyor... Calissa yapar...
--
View this message in context:
http://lucene.472066.n3.nabble.com/error-while-doing-full-import-tp2951185p2963832.html
Sent from the Solr - User
hi Gora,
i guess you are right, i have checked and url seems serving data slowly...
maybe its because of the crappy test env too...
thank you so much
-
Zeki ama calismiyor... Calissa yapar...
--
View this message in context:
I'm using Solr 1.4...
I thought I had a case without a NOT but it seems to work now :S
It might be a glitch on my server.
The problem is easily reproducible with the NOT operator
http://10.0.5.221:8983/jobs/select?q=title:java%20AND%20(-title:programmer)
you mean you have change the code of the solr admin page to remove all
indexes? and also, when you by indexes are gone you mean they are deleted
or solr sees no indexes when you run it? a little bit confusing post :)
-
Zeki ama calismiyor... Calissa yapar...
--
View this message in context:
sorry for the typos in the prev msg... a little bit drowsy still...
so if you can make a little bit more clear about your problem, we can help
you
-
Zeki ama calismiyor... Calissa yapar...
--
View this message in context:
Apologies if this is obvious, but I've been banging my head against a wall.
I can define a query like the following:
http://HOST_NAME/solr/select?q=$search_termbq=boost_high:$search_term^1.5bq=boost_medium:$search_term^1.3bq=boost_max:$search_term^1.7bq=boost_low:$search_term^1.1
This does
Could you please unsubscribe me.
From: ronveenstra ron-s...@agathongroup.com
Reply-To: solr-user@lucene.apache.org
Date: Thu, 19 May 2011 18:52:52 -0700 (PDT)
To: solr-user@lucene.apache.org
Subject: Re: Using Boost fields for a sum total score.
Apologies if this is obvious, but I've been
Dear team.
I installed to cores on my tomcat ,
http://localhost:8983/solr/fund_dih/admin/
http://localhost:8983/solr/fund_tika/admin/
How can I send one query request via Solrj to these URL?
Thanks and Regards
Zane
We are having an issue with facet counts and grouping
We have multiple doctors with addresses. How do I search these lat longs?
1. Using SOLR 3.1, I can duplicate all fields except lat_long, and use
group.field for the key.
2. I can use David Smiley's solution for multiple points (but it seems
Put everything except q in solrconfig... Then just use
qt=nameinsolrconfigq=
On 5/19/11 7:52 PM, ronveenstra ron-s...@agathongroup.com wrote:
Apologies if this is obvious, but I've been banging my head against a
wall.
I can define a query like the following:
73 matches
Mail list logo