str name=stream_sizenull/str when using HttpSolrServer

2012-11-06 Thread sh
Good day, I recently moved to solrj 3.6.1. As the CommonsHttpSolrServer class is deprecated in that version I migrated to HttpSolrServer. But now tika does not generate the stream_size field correctly, it is saying in the result response for an arbitrary jpeg file str

Solr / Velocity url rewrite

2012-11-06 Thread Sébastien Dartigues
Hi all, Today i'm using solritas as front-end for the solr search engine. But i would like to do url rewriting to deliver urls more compliant with SEO. First the end user types that kind of url : http://host.com/query/myquery So this url should be rewriten internally (kind of reverse proxy) in

RE: Continuous Ping query caused exception: java.util.concurrent.RejectedExecutionException

2012-11-06 Thread Markus Jelsma
https://issues.apache.org/jira/browse/SOLR-4037 -Original message- From:Mark Miller markrmil...@gmail.com Sent: Sat 03-Nov-2012 14:24 To: solr-user@lucene.apache.org Subject: Re: Continuous Ping query caused exception: java.util.concurrent.RejectedExecutionException On Nov

RE: SolrCloud indexing blocks if node is recovering

2012-11-06 Thread Markus Jelsma
https://issues.apache.org/jira/browse/SOLR-4038 Still trying to gather the logs -Original message- From:Mark Miller markrmil...@gmail.com Sent: Sat 03-Nov-2012 14:17 To: Markus Jelsma markus.jel...@openindex.io Cc: solr-user@lucene.apache.org Subject: Re: SolrCloud indexing

Re: Where to get more documents or references about sold cloud?

2012-11-06 Thread Lance Norskog
LucidFind is a searchable archive of Solr documentation and email lists: http://find.searchhub.org/?q=solrcloud - Original Message - | From: Jack Krupansky j...@basetechnology.com | To: solr-user@lucene.apache.org | Sent: Monday, November 5, 2012 4:44:46 AM | Subject: Re: Where to get

Re: Does SolrCloud supports MoreLikeThis?

2012-11-06 Thread Lance Norskog
The question you meant to ask is: Does MoreLikeThis support Distributed Search? and the answer apparently is no. This is the issue to get it working: https://issues.apache.org/jira/browse/SOLR-788 (Distributed Search is independent of SolrCloud.) If you want to make unit tests, that would

GC stalls cause Zookeeper timeout during uninvert for facet field

2012-11-06 Thread Arend-Jan Wijtzes
Hi, We are running a small solr cluster with 8 cores on 4 machines. This database has about 1E9 very small documents. One of the statistics we need requires a facet on a text field with high cardinality. During the uninvert phase of this text field the searchers experience long stalls because of

Re: SolrCloud - configuration management in ZooKeeper

2012-11-06 Thread Tomás Fernández Löbbe
Hi Alexey, responses are inline: Zookeeper manages not only the cluster state, but also the common configuration files. My question is, what are the exact rules of precedence? That is, when SOLR node will decide to download new configuration files? When the SolrCore is started. Will

Solr4 data import skipdoc and regex

2012-11-06 Thread Randy
Hi *, I want to import some data to build a Solr index. For this import, I need to skip some documents from importing. In my data-config file it looks like this: field column=$skipDoc regex=^MyPattern .* replaceWith=true sourceColName=text/ As I also need to search my 'titles' I tried this:

Re: Add new shard will be treated as replicas in Solr4.0?

2012-11-06 Thread Erick Erickson
bq: where can i find all the items on the road map? Well, you really can't G... There's no official roadmap. I happen to know this since I follow the developer's list and I've seen references to this being important to the folks doing SolrCloud development work and it's been a recurring theme on

Re: How to re-read the config files in Solr, on a commit

2012-11-06 Thread Erick Erickson
Not that I know of. This would be extremely expensive in the usual case. Loading up configs, reconfiguring all the handlers etc. would add a huge amount of overhead to the commit operation, which is heavy enough as it is. What's the use-case here? Changing your configs really often and reading

Searching for Partial Words

2012-11-06 Thread Sohail Aboobaker
Hi, Given following values in the document: Doc1: Engine Doc2. Engineer Doc3. ResidentEngineer We need to return all three documents when someone searches for engi. Basically we need to implement partial word search. Currently, we have a wild card on the right side of search term (term*). Is

Re: load balance with SolrCloud

2012-11-06 Thread Erick Erickson
I think you're conflating shards and cores. Shards are physical slices of a singe logical index. An incoming query is sent to each and every shard and the results tallied. The case you're talking about seems to be more you have N separate indexes (cores), where each core is for a specific user.

Re: Solr / Velocity url rewrite

2012-11-06 Thread Erick Erickson
Velocity/Solaritas was never intended to be a user-facing app. How are you locking things down so a user can't enter, or instance, q=deletequery*:*/query/deletecommit=true? I'd really recommend a proper middleware layer unless you have a trusted user base... FWIW, Erick On Tue, Nov 6, 2012

Re: Solr / Velocity url rewrite

2012-11-06 Thread Sébastien Dartigues
Hi Erick, Thanks for your help. OK except the php client delivered as a sample, do you have a preference for an out of the box front end easly deployable? My main use case is to be compliant with SEO, or at least to give nice (url) entry point. Thanks. 2012/11/6 Erick Erickson

Re: Searching for Partial Words

2012-11-06 Thread Jack Krupansky
Add an edge n-gram filter (EdgeNGramFilterFactory) to your index analyzer. This will add all the prefixes of words to the index, so that a query of engi will be equivalent to but much faster than the wildcard engi*. You can specify a minimum size, such as 3 or 4 to eliminate tons of too-short

Re: Solr 4.0 simultaneous query problem

2012-11-06 Thread Rohit Harchandani
So is it a better approach to query for smaller rows, say 500, and keep increasing the start parameter? wouldnt that be slower since I have an increasing start parameter and I will also be sorting by the same field in each of my queries made to the multiple shards? Also, does it make sense to

Re: SolrCloud failover behavior

2012-11-06 Thread Nick Chase
Thanks a million, Erick! You're right about killing both nodes hosting the shard. I'll get the wiki corrected. Nick On 11/3/2012 10:51 PM, Erick Erickson wrote: SolrCloud doesn't work unless every shard has at least one server that is up and running. I _think_ you might be killing

Re: lukeall.jar for Solr4r?

2012-11-06 Thread Carrie Coy
Thank you very much for taking the time to do this. This version is able to read the index files, but there is at least one issue: The home screen reports ERROR: can't count terms per field and this exception is thrown: java.util.NoSuchElementException at

custom request handler

2012-11-06 Thread Lee Carroll
Hi we are extending SearchHandler to provide a custom search request handler. Basically we've added NamedLists called allowed , whiteList, maxMinList etc. These look like the default, append and invariant namedLists in the standard search handler config. In handleRequestBody we then remove params

Re: Searching for Partial Words

2012-11-06 Thread Sohail Aboobaker
Thanks Jack. In the configuration below: fieldType name=text_edgngrm class=solr.TextField positionIncrementGap=100 analyzer tokenizer class=solr.EdgeNGramTokenizerFactory side=front minGramSize=1 maxGramSize=1/ /analyzer /fieldType What are the possible values for side? If I

Re: migrating from solr3 to solr4

2012-11-06 Thread Michael Della Bitta
I got the following error in browser console: http://localhost:8080/indexer/$%7BcontextPath%7D$%7BadminPath%7D?wt=json We can't see the contents of that link.. Could you post it on pastebin.com or something? Michael Della Bitta Appinions 18

Re: migrating from solr3 to solr4

2012-11-06 Thread Carlos Alexandro Becker
Hi Michael, thank for your answer. I already posted it in stackoverflow ( http://stackoverflow.com/questions/13236383/migrating-from-solr3-to-solr4 ), but, this looks like a encoding issue, actually, is exactly the error. I'm not sure, but I look in all xml files in my JBoss and also in app,

Re: SolrCloud Tomcat configuration: problems and doubts.

2012-11-06 Thread Luis Cappa Banda
Forward to solr-user mailing list. We forgot to reply to it, :-/ 2012/11/5 Luis Cappa Banda luisca...@gmail.com Hello, Mark! I´ve been testing more and more and things are going better. I have tested what you told me about -Dbootstrap_conf=true and works fine, but the problem is that if I

Re: migrating from solr3 to solr4

2012-11-06 Thread Stefan Matheis
Hey Carlos just had a quick look at our changes and figured out the revision which introduced this change, which might help you while having another look? http://svn.apache.org/viewvc?view=revisionrevision=1297578 The LoadAdminUiServlet is responsible for replacing those placeholders which

Re: lukeall.jar for Solr4r?

2012-11-06 Thread Shawn Heisey
On 11/6/2012 7:45 AM, Carrie Coy wrote: Thank you very much for taking the time to do this. This version is able to read the index files, but there is at least one issue: The home screen reports ERROR: can't count terms per field and this exception is thrown:

Re: GC stalls cause Zookeeper timeout during uninvert for facet field

2012-11-06 Thread Gil Tene
On Nov 6, 2012 at 6:06 AM, Arend-Jan Wijtzes ajwyt...@wise-guys.nlmailto:ajwyt...@wise-guys.nl wrote: ... During the uninvert phase of this text field the searchers experience long stalls because of the garbage collecting (20+ seconds pauses) which causes Solr to lose the Zookeeper lease. Often

Re: migrating from solr3 to solr4

2012-11-06 Thread Carlos Alexandro Becker
Hi Stefan, Thank you very much, I just realized that I didn't updated the web.xml, so, I not has the LoadAdminUiServlet configured, that's why it was not working. By now, the only problem I still have, is that it tries to access solr.home/collection1/conf, and I used to have it in

Re: Reply:Re: Where to get more documents or references about sold cloud?

2012-11-06 Thread Otis Gospodnetic
Hi, On Mon, Nov 5, 2012 at 8:24 PM, SuoNayi suonayi2...@163.com wrote: Thanks jack and thanks for the great country. All big famous websites such as google, slideshares and blogspot etc are blocked. What I want to know about is more details about solrcloud, here is my questions: 1.Can we

New Index directory regardless of Solr.xml

2012-11-06 Thread Rasmussen, Chris
I have a five node SolrCloud implementation running as a test with no replication using a three node zookeeper ensemble. Admittedly, I'm new to Solr and just grinding it out. Accidently re-initialized zookeeper with the wrong conf dir and I'm trying to recover. I re-ran the initialization

RE: Access DIH from inside application (via Solrj)?

2012-11-06 Thread Dyer, James
DIH SolrJ don't really support what you want to do. But you can make it work with code like this, which reloads the DIH configuration and checks for the response. Just note this is quite brittle: whenever the response changes in future versions of DIH, it'll break your code. MapString,

Re: Solr / Velocity url rewrite

2012-11-06 Thread Erick Erickson
Not really. Mostly it's whatever you are most comfortable with. Since the app - solr connection is just HTTP, the front-end is wide open. FWIW, Erick On Tue, Nov 6, 2012 at 8:30 AM, Sébastien Dartigues sebastien.dartig...@gmail.com wrote: Hi Erick, Thanks for your help. OK except the php

Re: SolrCloud failover behavior

2012-11-06 Thread Erick Erickson
I was right for once G.. Thanks for updating the Wiki! Erick On Tue, Nov 6, 2012 at 9:42 AM, Nick Chase nch...@earthlink.net wrote: Thanks a million, Erick! You're right about killing both nodes hosting the shard. I'll get the wiki corrected. Nick On 11/3/2012 10:51 PM, Erick

Re: load balance with SolrCloud

2012-11-06 Thread Erick Erickson
This is a complex setup, all right. A pluggable sharding strategy is definitely something that is on the roadmap for SolrCloud, but hasn't made it into the code base yet. Keep in mind, though, that all the SolrCloud goodness centers around the idea of a single index that may be sharded. I don't

Re: Add new shard will be treated as replicas in Solr4.0?

2012-11-06 Thread Zeng Lames
got it. thanks a lot On Tue, Nov 6, 2012 at 8:43 PM, Erick Erickson erickerick...@gmail.comwrote: bq: where can i find all the items on the road map? Well, you really can't G... There's no official roadmap. I happen to know this since I follow the developer's list and I've seen references

Re: How to re-read the config files in Solr, on a commit

2012-11-06 Thread roz dev
Erick We have a requirement where seach admin can add or remove some synonyms and would want these changes to be reflected in search thereafter. yes, we looked at reload command and it seems to be suitable for that purpose. We have a master and slave setup so it should be OK to issue reload

Re: load balance with SolrCloud

2012-11-06 Thread Jie Sun
thanks for your feedback Erick. I am also aware of the current limitation of shard number in a collection is fixed. changing the number will need re-config and re-index. Let's say if the limitation gets levitated in near future release, I would then consider setup collection for each customer,

Re: How to re-read the config files in Solr, on a commit

2012-11-06 Thread Otis Gospodnetic
Hi, Note about modifying synonyms - you need to reindex, really, if using index-time synonyms. And if you're using search-time synonyms you have multi-word synonym issue described on the Wiki. Otis -- Performance Monitoring - http://sematext.com/spm On Nov 6, 2012 11:02 PM, roz dev

Two questions about solrcloud

2012-11-06 Thread SuoNayi
Hi all,sorry for questions about solrcloud from newbie. here is my two questions: 1.If I have a solrcloud cluster with two shards and 0 replica on two different server. when one of server restarts will the solr instance on that server replay the transaction log to make sure these operations

Re: Solr Replication is not Possible on RAMDirectory?

2012-11-06 Thread deniz
Erik Hatcher-4 wrote There's an open issue (with a patch!) that enables this, it seems: lt;https://issues.apache.org/jira/browse/SOLR-3911gt; Erik well patch seems not doing that... i have tried and still getting some error lines about the dir types - Zeki ama calismiyor...

Re: How to re-read the config files in Solr, on a commit

2012-11-06 Thread roz dev
Thanks Otis for pointing this out. We may end up using search time synonyms for single word synonym and use index time synonym for multi world synonyms. -Saroj On Tue, Nov 6, 2012 at 8:09 PM, Otis Gospodnetic otis.gospodne...@gmail.com wrote: Hi, Note about modifying synonyms - you need