Good day,
I recently moved to solrj 3.6.1. As the CommonsHttpSolrServer class is deprecated in that version I
migrated to HttpSolrServer. But now tika does not generate the stream_size field correctly, it is
saying in the result response for an arbitrary jpeg file str
Hi all,
Today i'm using solritas as front-end for the solr search engine.
But i would like to do url rewriting to deliver urls more compliant with
SEO.
First the end user types that kind of url : http://host.com/query/myquery
So this url should be rewriten internally (kind of reverse proxy) in
https://issues.apache.org/jira/browse/SOLR-4037
-Original message-
From:Mark Miller markrmil...@gmail.com
Sent: Sat 03-Nov-2012 14:24
To: solr-user@lucene.apache.org
Subject: Re: Continuous Ping query caused exception:
java.util.concurrent.RejectedExecutionException
On Nov
https://issues.apache.org/jira/browse/SOLR-4038
Still trying to gather the logs
-Original message-
From:Mark Miller markrmil...@gmail.com
Sent: Sat 03-Nov-2012 14:17
To: Markus Jelsma markus.jel...@openindex.io
Cc: solr-user@lucene.apache.org
Subject: Re: SolrCloud indexing
LucidFind is a searchable archive of Solr documentation and email lists:
http://find.searchhub.org/?q=solrcloud
- Original Message -
| From: Jack Krupansky j...@basetechnology.com
| To: solr-user@lucene.apache.org
| Sent: Monday, November 5, 2012 4:44:46 AM
| Subject: Re: Where to get
The question you meant to ask is: Does MoreLikeThis support Distributed
Search? and the answer apparently is no. This is the issue to get it working:
https://issues.apache.org/jira/browse/SOLR-788
(Distributed Search is independent of SolrCloud.) If you want to make unit
tests, that would
Hi,
We are running a small solr cluster with 8 cores on 4 machines. This
database has about 1E9 very small documents. One of the statistics we
need requires a facet on a text field with high cardinality.
During the uninvert phase of this text field the searchers experience
long stalls because of
Hi Alexey, responses are inline:
Zookeeper manages not only the cluster state, but also the common
configuration files.
My question is, what are the exact rules of precedence? That is, when SOLR
node will decide to download new configuration files?
When the SolrCore is started.
Will
Hi *,
I want to import some data to build a Solr index. For this import, I need to
skip some documents from importing. In my data-config file it looks like
this:
field column=$skipDoc regex=^MyPattern .* replaceWith=true
sourceColName=text/
As I also need to search my 'titles' I tried this:
bq: where can i find all the items on the road map?
Well, you really can't G... There's no official roadmap. I happen to
know this since I follow the developer's list and I've seen references to
this being important to the folks doing SolrCloud development work and it's
been a recurring theme on
Not that I know of. This would be extremely expensive in the usual case.
Loading up configs, reconfiguring all the handlers etc. would add a huge
amount of overhead to the commit operation, which is heavy enough as it is.
What's the use-case here? Changing your configs really often and reading
Hi,
Given following values in the document:
Doc1: Engine
Doc2. Engineer
Doc3. ResidentEngineer
We need to return all three documents when someone searches for engi.
Basically we need to implement partial word search. Currently, we have a
wild card on the right side of search term (term*). Is
I think you're conflating shards and cores. Shards are physical slices of a
singe logical index. An incoming query is sent to each and every shard and
the results tallied.
The case you're talking about seems to be more you have N separate indexes
(cores), where each core is for a specific user.
Velocity/Solaritas was never intended to be a user-facing app. How are you
locking things down so a user can't enter, or instance,
q=deletequery*:*/query/deletecommit=true?
I'd really recommend a proper middleware layer unless you have a trusted
user base...
FWIW,
Erick
On Tue, Nov 6, 2012
Hi Erick,
Thanks for your help.
OK except the php client delivered as a sample, do you have a preference
for an out of the box front end easly deployable?
My main use case is to be compliant with SEO, or at least to give nice
(url) entry point.
Thanks.
2012/11/6 Erick Erickson
Add an edge n-gram filter (EdgeNGramFilterFactory) to your index
analyzer. This will add all the prefixes of words to the index, so that a
query of engi will be equivalent to but much faster than the wildcard
engi*. You can specify a minimum size, such as 3 or 4 to eliminate tons of
too-short
So is it a better approach to query for smaller rows, say 500, and keep
increasing the start parameter? wouldnt that be slower since I have an
increasing start parameter and I will also be sorting by the same field in
each of my queries made to the multiple shards?
Also, does it make sense to
Thanks a million, Erick! You're right about killing both nodes hosting
the shard. I'll get the wiki corrected.
Nick
On 11/3/2012 10:51 PM, Erick Erickson wrote:
SolrCloud doesn't work unless every shard has at least one server that is
up and running.
I _think_ you might be killing
Thank you very much for taking the time to do this. This version is
able to read the index files, but there is at least one issue:
The home screen reports ERROR: can't count terms per field and this
exception is thrown:
java.util.NoSuchElementException
at
Hi we are extending SearchHandler to provide a custom search request
handler. Basically we've added NamedLists called allowed , whiteList,
maxMinList etc.
These look like the default, append and invariant namedLists in the
standard search handler config. In handleRequestBody we then remove params
Thanks Jack.
In the configuration below:
fieldType name=text_edgngrm class=solr.TextField
positionIncrementGap=100
analyzer
tokenizer class=solr.EdgeNGramTokenizerFactory side=front
minGramSize=1 maxGramSize=1/
/analyzer
/fieldType
What are the possible values for side?
If I
I got the following error in browser console:
http://localhost:8080/indexer/$%7BcontextPath%7D$%7BadminPath%7D?wt=json
We can't see the contents of that link.. Could you post it on
pastebin.com or something?
Michael Della Bitta
Appinions
18
Hi Michael, thank for your answer.
I already posted it in stackoverflow (
http://stackoverflow.com/questions/13236383/migrating-from-solr3-to-solr4 ),
but, this looks like a encoding issue, actually, is exactly the error.
I'm not sure, but I look in all xml files in my JBoss and also in app,
Forward to solr-user mailing list. We forgot to reply to it, :-/
2012/11/5 Luis Cappa Banda luisca...@gmail.com
Hello, Mark!
I´ve been testing more and more and things are going better. I have tested
what you told me about -Dbootstrap_conf=true and works fine, but the
problem is that if I
Hey Carlos
just had a quick look at our changes and figured out the revision which
introduced this change, which might help you while having another look?
http://svn.apache.org/viewvc?view=revisionrevision=1297578
The LoadAdminUiServlet is responsible for replacing those placeholders which
On 11/6/2012 7:45 AM, Carrie Coy wrote:
Thank you very much for taking the time to do this. This version is
able to read the index files, but there is at least one issue:
The home screen reports ERROR: can't count terms per field and this
exception is thrown:
On Nov 6, 2012 at 6:06 AM, Arend-Jan Wijtzes
ajwyt...@wise-guys.nlmailto:ajwyt...@wise-guys.nl wrote:
...
During the uninvert phase of this text field the searchers experience
long stalls because of the garbage collecting (20+ seconds pauses) which
causes Solr to lose the Zookeeper lease. Often
Hi Stefan,
Thank you very much, I just realized that I didn't updated the web.xml, so,
I not has the LoadAdminUiServlet configured, that's why it was not working.
By now, the only problem I still have, is that it tries to access
solr.home/collection1/conf, and I used to have it in
Hi,
On Mon, Nov 5, 2012 at 8:24 PM, SuoNayi suonayi2...@163.com wrote:
Thanks jack and thanks for the great country.
All big famous websites such as google, slideshares and blogspot etc are
blocked.
What I want to know about is more details about solrcloud, here is my
questions:
1.Can we
I have a five node SolrCloud implementation running as a test with no
replication using a three node zookeeper ensemble. Admittedly, I'm new to Solr
and just grinding it out. Accidently re-initialized zookeeper with the wrong
conf dir and I'm trying to recover. I re-ran the initialization
DIH SolrJ don't really support what you want to do. But you can make it work
with code like this, which reloads the DIH configuration and checks for the
response. Just note this is quite brittle: whenever the response changes in
future versions of DIH, it'll break your code.
MapString,
Not really. Mostly it's whatever you are most comfortable with. Since the
app - solr connection is just HTTP, the front-end is wide open.
FWIW,
Erick
On Tue, Nov 6, 2012 at 8:30 AM, Sébastien Dartigues
sebastien.dartig...@gmail.com wrote:
Hi Erick,
Thanks for your help.
OK except the php
I was right for once G..
Thanks for updating the Wiki!
Erick
On Tue, Nov 6, 2012 at 9:42 AM, Nick Chase nch...@earthlink.net wrote:
Thanks a million, Erick! You're right about killing both nodes hosting
the shard. I'll get the wiki corrected.
Nick
On 11/3/2012 10:51 PM, Erick
This is a complex setup, all right.
A pluggable sharding strategy is definitely something that is on the
roadmap for SolrCloud, but hasn't made it into the code base yet.
Keep in mind, though, that all the SolrCloud goodness centers around the
idea of a single index that may be sharded. I don't
got it. thanks a lot
On Tue, Nov 6, 2012 at 8:43 PM, Erick Erickson erickerick...@gmail.comwrote:
bq: where can i find all the items on the road map?
Well, you really can't G... There's no official roadmap. I happen to
know this since I follow the developer's list and I've seen references
Erick
We have a requirement where seach admin can add or remove some synonyms and
would want these changes to be reflected in search thereafter.
yes, we looked at reload command and it seems to be suitable for that
purpose. We have a master and slave setup so it should be OK to issue
reload
thanks for your feedback Erick.
I am also aware of the current limitation of shard number in a collection is
fixed. changing the number will need re-config and re-index. Let's say if
the limitation gets levitated in near future release, I would then consider
setup collection for each customer,
Hi,
Note about modifying synonyms - you need to reindex, really, if using
index-time synonyms. And if you're using search-time synonyms you have
multi-word synonym issue described on the Wiki.
Otis
--
Performance Monitoring - http://sematext.com/spm
On Nov 6, 2012 11:02 PM, roz dev
Hi all,sorry for questions about solrcloud from newbie.
here is my two questions:
1.If I have a solrcloud cluster with two shards and 0 replica on two different
server.
when one of server restarts will the solr instance on that server replay
the transaction log to make sure these operations
Erik Hatcher-4 wrote
There's an open issue (with a patch!) that enables this, it seems:
lt;https://issues.apache.org/jira/browse/SOLR-3911gt;
Erik
well patch seems not doing that... i have tried and still getting some error
lines about the dir types
-
Zeki ama calismiyor...
Thanks Otis for pointing this out.
We may end up using search time synonyms for single word synonym and use
index time synonym for multi world synonyms.
-Saroj
On Tue, Nov 6, 2012 at 8:09 PM, Otis Gospodnetic otis.gospodne...@gmail.com
wrote:
Hi,
Note about modifying synonyms - you need
41 matches
Mail list logo