Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
a cluster between my macbook and linux box. Each nodes talks to ZooKeeper to learn about the other nodes, including their addresses. Recovery is then done node to node using the appropriate addresses. - Mark Miller lucidimagination.com On Mar 16, 2012, at 3:00 PM, Matthew Parker wrote

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
The cluster is running on one machine. On Sun, Mar 18, 2012 at 2:07 PM, Mark Miller markrmil...@gmail.com wrote: From every node in your cluster you can hit http://MACHINE1:8084/solr in your browser and get a response? On Mar 18, 2012, at 1:46 PM, Matthew Parker wrote: My cloud instance

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
. On Sun, Mar 18, 2012 at 2:07 PM, Mark Miller markrmil...@gmail.com wrote: From every node in your cluster you can hit http://MACHINE1:8084/solr in your browser and get a response? On Mar 18, 2012, at 1:46 PM, Matthew Parker wrote: My cloud instance finally tried to sync. It looks like it's having

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
I had tried importing data from Manifold, and one document threw a Tika Exception. If I shut everything down and restart SOLR cloud, the system sync'd on startup. Could extraction errors be the issue? On Sun, Mar 18, 2012 at 2:50 PM, Matthew Parker mpar...@apogeeintegration.com wrote: I

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-18 Thread Matthew Parker
That idea was short lived. I excluded the document. The cluster isn't syncing even after shutting everything down and restarting. On Sun, Mar 18, 2012 at 2:58 PM, Matthew Parker mpar...@apogeeintegration.com wrote: I had tried importing data from Manifold, and one document threw a Tika

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-16 Thread Matthew Parker
I'm still having issues replicating in my work environment. Can anyone explain how the replication mechanism works? Is it communicating across ports or through zookeeper to manager the process? On Thu, Mar 8, 2012 at 10:57 PM, Matthew Parker mpar...@apogeeintegration.com wrote: All, I

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-08 Thread Matthew Parker
All, I recreated the cluster on my machine at home (Windows 7, Java 1.6.0.23, apache-solr-4.0-2012-02-29_09-07-30) , sent some document through Manifold using its crawler, and it looks like it's replicating fine once the documents are committed. This must be related to my environment somehow.

Re: XSLT Response Writer and content transformation

2012-03-05 Thread Matthew Parker
You can embed custom Java functions in XSLT: http://cafeconleche.org/books/xmljava/chapters/ch17s03.html On Mon, Mar 5, 2012 at 4:27 AM, darul daru...@gmail.com wrote: Hello, Using native XSLT Response Writer, we may need to alter content before processing xml solr output as a RSS Feed.

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-02 Thread Matthew Parker
I've ensured the SOLR data subdirectories and files were completed cleaned out, but the issue still occurs. On Fri, Mar 2, 2012 at 9:06 AM, Erick Erickson erickerick...@gmail.comwrote: Matt: Just for paranoia's sake, when I was playing around with this (the _version_ thing was one of my

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-01 Thread Matthew Parker
/ If you do have a _version_ field, there is something to track down here for sure. On Wed, Feb 29, 2012 at 1:15 PM, Matthew Parker mpar...@apogeeintegration.com wrote: Mark/Sami I ran the system with 3 zookeeper nodes, 2 solr cloud nodes, and left numShards set to its default value

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-01 Thread Matthew Parker
customizations you are making to solrconfig? On Mar 1, 2012, at 1:48 PM, Matthew Parker wrote: Added it back in. I still get the same result. On Wed, Feb 29, 2012 at 10:09 PM, Mark Miller markrmil...@gmail.com wrote: Do you have a _version_ field in your schema? I actually just came back

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-03-01 Thread Matthew Parker
, 2012, at 3:35 PM, Mark Miller wrote: Any other customizations you are making to solrconfig? On Mar 1, 2012, at 1:48 PM, Matthew Parker wrote: Added it back in. I still get the same result. On Wed, Feb 29, 2012 at 10:09 PM, Mark Miller markrmil...@gmail.com wrote: Do you have

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-29 Thread Matthew Parker
exactly this setup recently, so there must be something we are missing here... Any info you can offer might help. - Mark On Feb 28, 2012, at 1:00 PM, Matthew Parker wrote: Mark, I got the codebase from the 2/26/2012, and I got the same inconsistent results. I have solr running

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-29 Thread Matthew Parker
I tried running SOLR Cloud with the default number of shards (i.e. 1), and I get the same results. On Wed, Feb 29, 2012 at 10:46 AM, Matthew Parker mpar...@apogeeintegration.com wrote: Mark, Nothing appears to be wrong in the logs. I wiped the indexes and imported 37 files from SharePoint

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-29 Thread Matthew Parker
I also took out my requestHandler and used the standard /update/extract handler. Same result. On Wed, Feb 29, 2012 at 11:47 AM, Matthew Parker mpar...@apogeeintegration.com wrote: I tried running SOLR Cloud with the default number of shards (i.e. 1), and I get the same results. On Wed, Feb

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-29 Thread Matthew Parker
, Sami Siren ssi...@gmail.com wrote: On Wed, Feb 29, 2012 at 7:03 PM, Matthew Parker mpar...@apogeeintegration.com wrote: I also took out my requestHandler and used the standard /update/extract handler. Same result. How did you install/start the system this time? The same way as earlier

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-29 Thread Matthew Parker
with 2 shards works after waiting for the system to sync. Regards, Matt On Wed, Feb 29, 2012 at 12:03 PM, Matthew Parker mpar...@apogeeintegration.com wrote: I also took out my requestHandler and used the standard /update/extract handler. Same result. On Wed, Feb 29, 2012 at 11:47 AM, Matthew

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-28 Thread Matthew Parker
it seems the windows from 8081 and 8083 move responding to the query but there are no results. if the queries run on 8081/8082 or 8081/8084 then results come back ok. The query is nothing more than: q=*:* Regards, Matt On Mon, Feb 27, 2012 at 9:26 PM, Matthew Parker mpar...@apogeeintegration.com

Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-27 Thread Matthew Parker
TWIMC: Environment = Apache SOLR rev-1236154 Apache Zookeeper 3.3.4 Windows 7 JDK 1.6.0_23.b05 I have built a SOLR Cloud instance with 4 nodes using the embeded Jetty servers. I created a 3 node zookeeper ensemble to manage the solr configuration data. All the instances run on one

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-27 Thread Matthew Parker
? Are you using a custom index chain or anything out of the ordinary? - Mark On Feb 27, 2012, at 12:26 PM, Matthew Parker wrote: TWIMC: Environment = Apache SOLR rev-1236154 Apache Zookeeper 3.3.4 Windows 7 JDK 1.6.0_23.b05 I have built a SOLR Cloud instance with 4 nodes

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-27 Thread Matthew Parker
, 2012, at 2:22 PM, Matthew Parker wrote: Thanks for your reply Mark. I believe the build was towards the begining of the month. The solr.spec.version is 4.0.0.2012.01.10.38.09 I cannot access the clusterstate.json contents. I clicked on it a couple of times, but nothing happens

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-27 Thread Matthew Parker
, 2012, at 2:22 PM, Matthew Parker wrote: Thanks for your reply Mark. I believe the build was towards the begining of the month. The solr.spec.version is 4.0.0.2012.01.10.38.09 I cannot access the clusterstate.json contents. I clicked on it a couple of times, but nothing happens

Re: Inconsistent Results with ZooKeeper Ensemble and Four SOLR Cloud Nodes

2012-02-27 Thread Matthew Parker
with distrib=false to check that. If you build is a month old, I'd honestly recommend you try upgrading as well. - Mark On Feb 27, 2012, at 3:34 PM, Matthew Parker wrote: Here is most of the cluster state: Connected to Zookeeper localhost:2181, localhost: 2182, localhost:2183 /(v

TIKA Errors Importing MS Word Documents into SOLR Cloud

2012-02-26 Thread Matthew Parker
I tried to import some documents into SOLR Cloud using Apache Manifold. TIKA started throwing exceptions for various documents The exception reads like the following: org.apache.solr.common.SolrException at org.apache.solr.handler.extraction.ExtractionDocumentLoader.load(

Re: nutch in solr

2012-02-05 Thread Matthew Parker
Doesn't tomcat run on port 8080, and not port 8983? Or did you change the tomcat's default port to 8983? On Feb 5, 2012 5:17 AM, alessio crisantemi alessio.crisant...@gmail.com wrote: Hi All, I have some problems with integration of Nutch in Solr and Tomcat. I follo Nutch tutorial for

Re: nutch in solr

2012-02-05 Thread Matthew Parker
No, they all don't run on 8983. Tomcat's default port is 8080. If you're using the embedded server in SOLR, you are using Jetty, which runs on port 8983. On Sun, Feb 5, 2012 at 11:54 AM, alessio crisantemi alessio.crisant...@gmail.com wrote: no, all run on port 8983. .. 2012/2/5 Matthew

Re: How can I index this?

2012-01-18 Thread Matthew Parker
I just started trying Apache ManifoldCF, which has a SharePoint connector that appears to integrate through Sharepoint's web services. Nutch also has a SharePoint connector, and it can publish documents into SOLR for indexing. On Wed, Jan 18, 2012 at 3:34 PM, ahammad ahmed.ham...@gmail.com

Re: PlainTextEntityProcessor and RegexTransformer in DataImport Handler

2011-12-24 Thread Matthew Parker
I would try something like the following: dataConfig dataSource type=FileDataSource / script![CDATA[ function format(row){ var text = row.get(plainText) // do regex processsing with Javascript's RegExp object.

UpdateRequestProcessor - processCommit

2011-11-23 Thread Matthew Parker
TWIMC: I creating a custom UpdateRequestProcessor chain, where I need to commit records to a database once the import process has completed. I'm assuming the processCommit method is called for each UpdateRequestProcessor chain class when the records are being commited to the Lucene index. I'm