the key phrase was this one :) :
A sloppy phrase query specifies a maximum slop, or the number of
positions tokens need to be moved to get a match.
so you could search for foo bar~101 in your example.
Ludovic.
-
Jouve
France.
--
View this message in context:
I would prefer to put a higher slop number instead of a boolean clause : 200
perhaps in your specific case.
Ludovic.
-
Jouve
France.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Order-of-words-in-proximity-search-tp2938427p2946645.html
Sent from the Solr - User
I don't know if it is allowed to modify Lucid code and add it to jira.
If someone from Lucid would give me the permission and the Solr developers
have nothing against it I won't mind adding the Lucid KStemmer to jira
for Solr 3.x and 4.x.
There are several Lucid KStemmer users which I can see
Hi,
I am using ASP.Net MVC and solrnet for my search tool. The files I index
include pdf files, word docs, excel etc...
I am able to search and retrieve all the docs with a hit. Now the problem
lies in opening the files with a hit.
When I open the file, it should open at the location where the
On Mon, May 16, 2011 at 12:00 PM, Vignesh Raj
vignesh...@greatminds.co.in wrote:
Hi,
I am using ASP.Net MVC and solrnet for my search tool. The files I index
include pdf files, word docs, excel etc...
I am able to search and retrieve all the docs with a hit. Now the problem
lies in opening
Hi,
The strange part is that i have actually tried a slop of 1000 (1K),
and the results are still different. This even when the test data has
a limiter of 10K for each sentence.
(This means that a sloppy phrase should only give hits where the
complete sentence is found, yet it is not the
The analyzer of the field you are using could impact the Phrase Query Slop.
Could you copy/paste the part of the schema ?
Ludovic.
-
Jouve
France.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Order-of-words-in-proximity-search-tp2938427p2946764.html
Sent from the
It is by default commented in solrconfig.xml
On Sat, May 14, 2011 at 10:49 PM, Gabriele Kahlout gabri...@mysimpatico.com
wrote:
I guess you are having issues with the datadir. Did you set the datadir in
solrconfig.xml?
On Sat, May 14, 2011 at 4:10 PM, Pawan Darira pawan.dar...@gmail.com
I want to be able to dynamically change scores without having to
update the entire document.
For this, I started using the External File Field.
I set a fieldType called idRankFile and field called idRank in schema.xml :
fieldType name=idRankFile keyField=id defVal=0 stored=false
indexed=false
Hello,
if you want to take the descriptor from a jar, provided that you configured
the jar inside a lib element in solrconfig, then you just need to write
the correct classpath in the analysisEngine element.
For example if your descriptor resides in com/something/desc/ path inside
the jar then
http://pastebin.com/svyefmM6
Pretty standard :)
/Tor
On Mon, May 16, 2011 at 9:18 AM, lboutros boutr...@gmail.com wrote:
The analyzer of the field you are using could impact the Phrase Query Slop.
Could you copy/paste the part of the schema ?
Ludovic.
-
Jouve
France.
--
View this
Hi,
This poll is to investigate how you currently do or would like to do logging
with Solr when deploying solr.war to a SEPARATE java application server (such
as Tomcat, Resin etc) outside of the bundled solr/example. For background on
how things work in Solr now, see
[X] I always use the JDK logging as bundled in solr.war, that's perfect
[ ] I sometimes use log4j or another framework and am happy with
re-packaging solr.war
[ ] Give me solr.war WITHOUT an slf4j logger binding, so I can
choose at deploy time
[ ] Let me choose whether to bundle a binding
[ ] I always use the JDK logging as bundled in solr.war, that's perfect
[X ] I sometimes use log4j or another framework and am happy with
re-packaging solr.war
[ ] Give me solr.war WITHOUT an slf4j logger binding, so I can
choose at deploy time
[ ] Let me choose whether to bundle a binding
On Sun, May 15, 2011 at 7:44 PM, Mark Miller markrmil...@gmail.com wrote:
Could you please revert your commit, until we've reached some
consensus on this discussion first?
Let's reach some consensus, but why revert? This has been the behavior -
shouldn't the consensus onus be on changing it
On Mon, May 16, 2011 at 2:13 PM, Jan Høydahl jan@cominvent.com wrote:
[...]
Please tick one of the options below with an [X]:
[ X] I always use the JDK logging as bundled in solr.war, that's perfect
[ ] I sometimes use log4j or another framework and am happy with
re-packaging solr.war
[ ] I always use the JDK logging as bundled in solr.war, that's perfect
[ ] I sometimes use log4j or another framework and am happy with
re-packaging solr.war
[X] Give me solr.war WITHOUT an slf4j logger binding, so I can choose at
deploy time
[ ] Let me choose whether to bundle a binding or
Please tick one of the options below with an [X]:
[ ] I always use the JDK logging as bundled in solr.war, that's perfect
[X] I sometimes use log4j or another framework and am happy with
re-packaging solr.war
actually : not so happy because our operations team has to repackage it.
But
[X] I sometimes use log4j or another framework and am happy with
re-packaging solr.war
actually : not so happy because our operations team has to repackage it.
But there is no option for
[X] add the logger configuration to the server's classpath, no
repackaging!
That's what happens if
My solr index is updated simultaneously by multiple clients via REST. I use
commitWithing attribute in the add/add command to direct auto commits.
I start getting this error after a couple of days of usage. How do i fix
this ? Please find the error log below. Using solr 3.1 with tomcat Thanks
On Mon, May 16, 2011 at 5:29 PM, Jasneet Sabharwal
jasneet.sabhar...@ngicorporation.com wrote:
Hi
Where do I set the default value of clean = false when a full-import is
done.
Append it to the URL, e.g., dataimport?command=full-importclean=false
Regards,
Gora
I have been doing that, but I want to set it as False by default, so
that even if the admin forgets to set clean=false in the URL, it doesn't
do it on its own.
On 16-05-2011 17:38, Gora Mohanty wrote:
On Mon, May 16, 2011 at 5:29 PM, Jasneet Sabharwal
jasneet.sabhar...@ngicorporation.com
Hi,
I have a question about Solr Cell please.
I index some files. For example, if I want to extract the filename, then use
a hash function on it like MD5 and then store it on Solr ; the correct way
is to use Tika « manually » to extract the metadata I want, do the
transformations on it and
Jasneet,
what about defining the value as a default in the dataimport
request-handler? like the sample at
http://wiki.apache.org/solr/SolrRequestHandler does?
Regards
Stefan
On Mon, May 16, 2011 at 2:10 PM, Jasneet Sabharwal
jasneet.sabhar...@ngicorporation.com wrote:
I have been doing that,
On May 16, 2011, at 5:30 AM, Michael McCandless wrote:
Does anyone disagree that Yonik's commit was inappropriate? This is
not how we work at Apache.
Ah - dunno yet - I obviously missed part of the conversation here. I thought
you where talking about reversing 'autophrase off' as the
Stefan,
I have added the DIH request handler in the solrconfig.xml. Do I have to
add the clean=false in that or somewhere else ?
Regards
Jasneet
On 16-05-2011 18:03, Stefan Matheis wrote:
Jasneet,
what about defining the value as a default in the dataimport
request-handler? like the sample
Jasneet
On Mon, May 16, 2011 at 3:10 PM, Jasneet Sabharwal
jasneet.sabhar...@ngicorporation.com wrote:
I have added the DIH request handler in the solrconfig.xml.
Exactly there :)
Regards
Stefan
Hi,
I am doing a full import in one of the cores. But I am getting Null poniter
exception and the import is failing again and again. I also tried clearing
the indexes and started the full import, but still indexing failed.
The full import request is prefect and I verified it with other full
Dear list,
Might have missed it from the literature and the list, sorry if so, but:
SOLR 1.4.1
solrQueryParser defaultOperator=AND/
Consider the query:
term1 term2 OR term1 term2 OR term1 term3
Problem: The query produces a hit containing only term1.
Solution: Modified query, grouping with
Stefan
requestHandler name=/dataimport
class=org.apache.solr.handler.dataimport.DataImportHandler
lst name=defaults
str
name=config/home/jasneet/apache-solr-3.1.0/example/solr/conf/data-config.xml/str
str name=cleanfalse/str
/lst
/requestHandler
Should it be like this ?
On 16-05-2011 18:48,
Yep that fixed my problem ...many thanks !
-Original Message-
From: Chris Hostetter [mailto:hossman_luc...@fucit.org]
Sent: Friday, May 13, 2011 6:37 PM
To: solr-user@lucene.apache.org
Subject: RE: SolrDispatchFilter
: This problem is only occurring when using IE8 ( Chrome
On Sun, May 15, 2011 at 1:48 PM, Michael McCandless
luc...@mikemccandless.com wrote:
Could you please revert your commit, until we've reached some
consensus on this discussion first?
Huh?
I thought everyone was in agreement that we needed more field types
for different languages?
I added my
Hi,
I'm using Apache Solr v3.1.
How do I configure/allow Solr to both index and query the term c# as a
whole word/term? From Analysis page, I could see that the term c# is
being reduced/converted into just c by solr.WordDelimiterFilterFactory.
Regards,
Gnanam
[ ] I always use the JDK logging as bundled in solr.war, that's perfect
[x] I sometimes use log4j or another framework and am happy with
re-packaging solr.war
[ ] Give me solr.war WITHOUT an slf4j logger binding, so I can choose at
deploy time
[ ] Let me choose whether to bundle a
On Mon, May 16, 2011 at 3:27 PM, Jasneet Sabharwal
jasneet.sabhar...@ngicorporation.com wrote:
Should it be like this ?
Never tried it myself, but what i guess from the Wiki ... Yes. doesn't
work for you, or just asked to be sure, before integrating it?
On Mon, May 16, 2011 at 5:30 AM, Michael McCandless
luc...@mikemccandless.com wrote:
To be clear, I'm asking that Yonik revert his commit from yesterday
(rev 1103444), where he added text_nwd fieldType and dynamic fields
*_nwd to the example schema.xml.
So... your position is that until the
On Mon, May 16, 2011 at 3:51 PM, Yonik Seeley
yo...@lucidimagination.com wrote:
On Mon, May 16, 2011 at 5:30 AM, Michael McCandless
luc...@mikemccandless.com wrote:
To be clear, I'm asking that Yonik revert his commit from yesterday
(rev 1103444), where he added text_nwd fieldType and dynamic
On Mon, May 16, 2011 at 9:51 AM, Yonik Seeley
yo...@lucidimagination.com wrote:
To be clear, I'm asking that Yonik revert his commit from yesterday
(rev 1103444), where he added text_nwd fieldType and dynamic fields
*_nwd to the example schema.xml.
So... your position is that until the text
Hi, thanks for the reply.
I tried a couple of things both in the tika-test entity and in the entity
named 'f'.
In the tika-test entity I tried:
field column=fileName name=${f.fileName} /
field column=fileName name=${f.file} /
even
field column=fileName name=${f.fileAbsolutePath} /
I also
On Mon, May 16, 2011 at 10:06 AM, Michael McCandless
luc...@mikemccandless.com wrote:
On Mon, May 16, 2011 at 9:51 AM, Yonik Seeley
yo...@lucidimagination.com wrote:
To be clear, I'm asking that Yonik revert his commit from yesterday
(rev 1103444), where he added text_nwd fieldType and
I don't think you'd want to use the string type here. String type is
almost never appropriate for a field you want to actually search on (it
is appropriate for fields to facet on).
But you may want to use Text type with different analyzers selected.
You probably want Text type so the value
Why? Becuase of how the solr/lucene query parser parses?
It parses into seperate tokens/phrases, and then marks each unit as
mandatory or optional. The operator's joining the tokens/phrases are
used to determine if a unit is mandatory or optional.
Since your defaultOperator=AND
term1
I have always just converted terms like 'C#' or 'C++' into 'csharp' and
'cplusplus' before indexing them and similarly converted those terms if
someone searched on them. That always has worked just fine for me...
:)
-Original Message-
From: Jonathan Rochkind [mailto:rochk...@jhu.edu]
Before indexing so outside Solr? Using the SynonymFilter would be easier i
guess.
On Monday 16 May 2011 17:44:24 Robert Petersen wrote:
I have always just converted terms like 'C#' or 'C++' into 'csharp' and
'cplusplus' before indexing them and similarly converted those terms if
someone
Hi Jonathan,
Well, I clearly understand, why 'term1 term2 OR ...' gives exactly same
results as 'term1 AND term2 OR ...', but what I do not get is, why grouping
with parentheses is required to have both term1 and term2 in the same hit
even though AND is the default operator and space between
On 05/15/2011 11:48 AM, Erick Erickson wrote:
Where are the documents coming from? Because storing them ONLY in
Solr risks losing them if your index is somehow hosed.
In our case, we generally have source documents and can reproduce the
index if need be, but that's a good point.
Storing
Hi,
I'm new to Solr and I'm trying to use my custom Similarity class but I've
not succeeded on that.
I added some debug information and my class is loaded, but it is not used
when queries are made.
Does someone could help me? If any further information is relevant, I can
provide it.
Thanks in
On Mon, May 16, 2011 at 10:22 AM, Yonik Seeley
yo...@lucidimagination.com wrote:
My position is: please don't suddenly commit changes, with your way,
while we're still discussing how to solve the issue. That's not the
Apache way.
Dude... everyone has always agreed we need more fieldtypes to
Both of the clustering algorithms that ship with Solr (Lingo and STC) are
designed to allow one document to appear in more than one cluster, which
actually does make sense in many scenarios. There's no easy way to force
them to produce hard clusterings because this would require a complete
Thanks Erick !
As I re-checked the configuration files, it turns out someone had
modified the /solr/conf/*stopwords.txt* on the production server,
and now we know what problem we're dealing with, which seems to be
related to:
-
On 05/16/2011 09:24 AM, Dmitry Kan wrote:
Dear list,
Might have missed it from the literature and the list, sorry if so, but:
SOLR 1.4.1
solrQueryParser defaultOperator=AND/
Consider the query:
term1 term2 OR term1 term2 OR term1 term3
I think what's happening is that your query gets
Thanks much Stan,
Ramdev
On May 16, 2011, at 11:38 AM, Stanislaw Osinski wrote:
Both of the clustering algorithms that ship with Solr
(Lingo and STC) are designed to allow one document to appear in more than one
cluster, which actually does make sense in many
We use log4j explicitly and find it irritating to deal with the built-in
JDK logging default. We also have conflicts with other packages that
have their own ideas about how to bind slf4j, so the less of this the
better, IMO. The 1.6.1 no-op default behavior seems a bit unfortunate
as
Hi Tommaso,
Thanks for the quick reply. I had copied the lib files and
followed instructions on http://wiki.apache.org/solr/SolrUIMA#Installation.
However i get this error. The AnalysisEngine has the default class path
which is /org/apache/uima/desc/.
SEVERE:
On Mon, May 16, 2011 at 10:04 PM, Alex Grilo a...@umamao.com wrote:
Hi,
I'm new to Solr and I'm trying to use my custom Similarity class but I've
not succeeded on that.
I added some debug information and my class is loaded, but it is not used
when queries are made.
Does someone could help
I solved the problem of velocity.log following this tutorial:
http://kris-itproblems.blogspot.com/2010/11/velocitylog-permission-denied.html
On Thu, May 12, 2011 at 6:36 PM, Yuhan Zhang yzh...@onescreen.com wrote:
hi all,
I'm new to solr, and trying to install it on tomcat. however, an
The error you pasted doesn't seem to be related to a (class)path issue but more
likely to be related to a Solr instance at 1.4.1/3.1.0 and Solr-UIMA module at
3.1.0/4.0-SNAPSHOT(trunk); it seems that the error raises from
UpdateRequestProcessorFactory API changed.
Hope this helps,
Tommaso
Il
I’m pretty
new to Solr and I have a question about indexing data using CSV.
I have a
Blacklight-application running on my Mac 10.6.7 and I configured the schema.xml
and solrconfig.xml in the separate Apache-Solr-directory according to the
guidelines on the Blacklight-website. I
The code is here: http://pastebin.com/50ugqRfA
http://pastebin.com/50ugqRfAand my schema.xml configuration entry for
similarity is:
similarity class=com.umamao.solr.ShortFieldNormSimilarity/
Thanks
Alex
On Mon, May 16, 2011 at 2:01 PM, Gora Mohanty g...@mimirtech.com wrote:
On Mon, May 16,
: Does anyone disagree that Yonik's commit was inappropriate? This is
: not how we work at Apache.
FWIW: I don't see how Yonik's commit was inappropriate at all
He added some new example configuration to trunk that was unused, and in
no way un-did or blocked any other attempts at improving
Sorry I am also using a synonyms.txt for this in the analysis stack. I
was not clear, sorry for any confusion. I am not doing it outside of
Solr but on the way into the index it is converted... :)
-Original Message-
From: Markus Jelsma [mailto:markus.jel...@openindex.io]
Sent: Monday,
All,
I have just installed Solr 3.1 running on Tomcat 7. I am noticing a possible
issue with Highlighting. I have a filed in my index called story. The solr
document that I am testing with the data in the story field starts with the
following snippet (remaining data in the field is not
Hello,
is it possible to index graph - named vertices and named edges? My target
is, with text search to find whether particular node is connected(direct or
indirect) with another. Thank you.
--
View this message in context:
Dani,
i'm actually playing with Neo4j .. and the have a Lucene-Indexing and
plan to have Solr-Integration (no idea what the current state is).
http://lists.neo4j.org/pipermail/user/2010-January/002372.html
Regards
Stefan
Am 16.05.2011 21:50, schrieb dani.b.angelov:
Hello,
is it possible to
[ ] I always use the JDK logging as bundled in solr.war, that's perfect
[ ] I sometimes use log4j or another framework and am happy with
re-packaging solr.war
[X] Give me solr.war WITHOUT an slf4j logger binding, so I can choose at
deploy time
[ ] Let me choose whether to bundle a binding or
Thank you Gora,
1. Could you confirm, that the context of IMHO is 'In My Humble Opinion'.
2. Could you point example of graph database.
--
View this message in context:
http://lucene.472066.n3.nabble.com/indexing-directed-graph-tp2949556p2949734.html
Sent from the Solr - User mailing list
Hello,
is it possible to index graph - named vertices and named edges? My target
is, with text search to find whether particular node is connected(direct or
indirect) with another. Thank you.
--
View this message in context:
The other advantage to the synonyms approach is it will be much less
of a headache down the road.
For instance, imagine you've defined whitespacetokenizer and
lowercasefilter.
That'll fix your example just fine. It'll also cause all punctuation
to be included in
the tokens, so if you indexed try
I am wandering, whether the following idea is worth.
We can describe the graph with series of triples. So can we create some bean
with fields, for example:
...
@Field
String[] sybjects;
@Field
String[] predicates;
@Field
String[] objects;
@Field
int[] level;
...
or other combination of metadata.
You can certainly index it, the problem will be being able to make the
kinds of queries you want to make on it once indexed. Indexing it in a
way that will let you do such queries.
The kind of typical queries I'd imagine you wanting to run on such a
graph -- I can't think of any way to index
Absolutely you can index each point or line of the graph with it's own
document in Solr, perhaps as a triple. (Sounds like you are specifically
talking about RDF-type data, huh? Asking about that specifically might
get you more useful ideas than asking graphs in general).
But if you want to
: just a very basic question, but I haven't been able to find the answer in
: the Solr wiki: how many updateHandlers can one Solr config have? Just one?
: Or many?
There can only be one updateHandler / declaration in solrconfig.xml,
it's
responsible for owning updates to the index.
But there
Lucid's KStemmer is LGPL and the Solr committers have shown that they don't
want LGPL libraries shipping with Solr. If you are intent on releasing your
changes, I suggest attaching both the modified source and the compiled jar onto
Solr's k-stemmer wiki page; and of course say that it's LGPL
: The code is here: http://pastebin.com/50ugqRfA
:
: http://pastebin.com/50ugqRfAand my schema.xml configuration entry for
: similarity is:
: similarity class=com.umamao.solr.ShortFieldNormSimilarity/
exactly what version of Solr are you using?
what does the full field/fieldType declaration
On 5/16/2011 at 5:33 PM, David W. Smiley wrote:
Lucid's KStemmer is LGPL and the Solr committers have shown that they
don't want LGPL libraries shipping with Solr. If you are intent on
releasing your changes, I suggest attaching both the modified source and
the compiled jar onto Solr's
On Mon, May 16, 2011 at 5:33 PM, Smiley, David W. dsmi...@mitre.org wrote:
Lucid's KStemmer is LGPL and the Solr committers have shown that they don't
want LGPL libraries shipping with Solr. If you are intent on releasing your
changes, I suggest attaching both the modified source and the
: This poll is to investigate how you currently do or would like to do
: logging with Solr when deploying solr.war to a SEPARATE java application
: server (such as Tomcat, Resin etc) outside of the bundled
FWIW...
a) the context of this poll is SOLR-2487
b) this poll seems flawed to me, as
My answers...
: [X] I always use the JDK logging as bundled in solr.war, that's perfect
: [X] I sometimes use log4j or another framework and am happy with
re-packaging solr.war
: [ ] Give me solr.war WITHOUT an slf4j logger binding, so I can choose at
deploy time
: [X] Let me choose
The map function lets you replace an arbitrary range of values with a
new value, so you could map any value greater then the ms that today
started on to any other point in history...
http://wiki.apache.org/solr/FunctionQuery#map
An easier approach would be probably be to apply some logic at
: http://code.google.com/p/solr-geonames/wiki/DeveloperInstall
: It's worth noting that the build has also been run on Mac and Solaris now,
: and the Solr index is about half the size. We suspect the optimize() call in
: Embedded Solr is not working correctly under Windows.
:
: We've observed
Thanks for the reply. I'm at home right now, or I'd try this myself, but is
the suggestion that two optimize() calls in a row would resolve the issue?
The process in question is a JVM devoted entirely to harvesting, calls
optimize() then shuts down.
The least processor intensive way of triggering
(11/05/17 3:27), Nemani, Raj wrote:
All,
I have just installed Solr 3.1 running on Tomcat 7. I am noticing a possible issue with
Highlighting. I have a filed in my index called story. The solr document
that I am testing with the data in the story field starts with the following snippet
How does MoreLikeThis use termVectors?
My documents (full sample at the bottom) frequently include lines more or less
like this
M /trunk/home/.Aquamacs/Preferences.el
I want to MoreLikeThis based on the full path, but not the M. But what I
actually display as a search result should include
I want create field from extract value from another field with some java
code ( using regular expressions ) . How to make this ?
--
View this message in context:
http://lucene.472066.n3.nabble.com/How-to-set-a-common-field-to-several-values-types-tp2922192p2951036.html
Sent from the Solr -
I set the datadir in solrconfig.xml. actually m using core based structures.
is it creating any problem
On Sat, May 14, 2011 at 10:49 PM, Gabriele Kahlout gabri...@mysimpatico.com
wrote:
I guess you are having issues with the datadir. Did you set the datadir in
solrconfig.xml?
On Sat, May
org.apache.solr.handler.dataimport.DataImportHandlerException: Parsing failed
for xml, url:http://xxx.xxx.xxx/frontend_dev.php/xxx/xxx/xxx rows
processed:0 Processing Document # 1 at
org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:72)
at
86 matches
Mail list logo