Re: phrase extraction from user paragraph input

2014-11-28 Thread Vineet Mishra
Hi Nokos, Can you quote an example for your usecase, I guess that will be helpful for understanding the problem more clearly. Cheers! On Fri, Nov 28, 2014 at 2:31 PM, Nikos Chaliasos nchal...@cs.uoi.gr wrote: Hello, I am investigating a university project where in a part of it, the user

Re: Inconsistent Behavior of Solr Cloud

2014-06-16 Thread Vineet Mishra
erickerick...@gmail.com wrote: It seems like for some reason you have shards that are not reachable. What does your cloud stat in the admin UI tell you when you don't get all the docs back? Best, Erick On Fri, Jun 13, 2014 at 1:37 AM, Vineet Mishra clearmido...@gmail.com wrote: Hi All

Inconsistent Behavior of Solr Cloud

2014-06-13 Thread Vineet Mishra
Hi All, I am having a Cloud setup with 3 Shards and 2 Replica running on 3 Tomcats with 3 External Zookeeper, all running on single machine. I have Indexed around 70 Mln Documents that seems to be querying back fine. When I index another 30 Mln to same, the result are vague as with the query *:*

Re: Collection communication internally

2014-06-10 Thread Vineet Mishra
think this is supported yet for distributed setups. Best, Erick On Mon, Jun 9, 2014 at 7:32 AM, Vineet Mishra clearmido...@gmail.com wrote: Hi All, I was curious to know how multiple Collection communication be achieved? If yes then by what means. The use case says, having multiple

Re: solr4 optimization

2014-06-10 Thread Vineet Mishra
As Otis mentioned, its obviously good to run Optimization once in a while or when you are done with most of your heavy indexing operation. Its not concern with the Disk Capacity rather with the IO and seeking in segements, When comparably it has less segments to query the IO operation will be less

Collection communication internally

2014-06-09 Thread Vineet Mishra
Hi All, I was curious to know how multiple Collection communication be achieved? If yes then by what means. The use case says, having multiple collection I need to query the first collection and get the unique ids from first collection to query the second one(Foreign Key Relation). Now if the

Re: Solr maximum Optimal Index Size per Shard

2014-06-06 Thread Vineet Mishra
to appropriate shard directories. Thanks! On Wed, Jun 4, 2014 at 12:43 PM, Shawn Heisey s...@elyograg.org wrote: On 6/4/2014 12:45 AM, Vineet Mishra wrote: Thanks all for your response. I presume this conversation concludes that indexing around 1Billion documents per shard won't be a problem

Re: Solr maximum Optimal Index Size per Shard

2014-06-06 Thread Vineet Mishra
for high volume, strive to assure that your per-shard index fits entirely into the system memory available for OS caching of file system pages. In any case, a proof of concept implementation will tell you everything you need to know. -- Jack Krupansky -Original Message- From: Vineet

Re: Solr maximum Optimal Index Size per Shard

2014-06-06 Thread Vineet Mishra
-06 at 12:32 +0200, Vineet Mishra wrote: *Does that mean for querying smoothly we need to have memory atleast equal or greater to the size of index? If you absolutely, positively have to reduce latency as much as possible, then yes. With an estimated index size of 2TB, I would guess that 10

Re: Solr maximum Optimal Index Size per Shard

2014-06-06 Thread Vineet Mishra
...@statsbiblioteket.dk wrote: On Fri, 2014-06-06 at 14:05 +0200, Vineet Mishra wrote: Could you state what indexing mechanism are you using, as I started with EmbeddedSolrServer but it was pretty slow after a few GB(~30+) of indexing. I suspect that is due to too-frequent commits, too small

Re: Solr maximum Optimal Index Size per Shard

2014-06-04 Thread Vineet Mishra
Thanks all for your response. I presume this conversation concludes that indexing around 1Billion documents per shard won't be a problem, as I have 10 Billion docs to index, so approx 10 shards with 1 Billion each should be fine with it and how about Memory, what size of RAM should be fine for

Solr maximum Optimal Index Size per Shard

2014-06-03 Thread Vineet Mishra
Hi All, Has anyone came across the maximum threshold document or size wise for each core of solr to hold. As I have indexed some 10 Million Documents of 18Gb and when I index another 5 (9Gb)Million Documents on top of these indexes it responds little slow with Stats query. Considering I have

Re: Offline Indexes Update to Shard

2014-06-02 Thread Vineet Mishra
29, 2014 at 7:52 PM, Otis Gospodnetic otis.gospodne...@gmail.com wrote: Hi, On Wed, May 28, 2014 at 4:25 AM, Vineet Mishra clearmido...@gmail.com wrote: Hi All, Has anyone tried with building Offline indexes with EmbeddedSolrServer and posting it to Shards. What do you mean

Re: Offline Indexes Update to Shard

2014-06-02 Thread Vineet Mishra
...@gmail.com wrote: Hi, On Wed, May 28, 2014 at 4:25 AM, Vineet Mishra clearmido...@gmail.com wrote: Hi All, Has anyone tried with building Offline indexes with EmbeddedSolrServer and posting it to Shards. What do you mean by posting it to shards? How is that different than

Re: Offline Indexes Update to Shard

2014-06-02 Thread Vineet Mishra
, Jun 2, 2014 at 2:24 PM, Wolfgang Hoschek whosc...@cloudera.com wrote: Sounds like you should consider using MapReduceIndexerTool. AFAIK, this is the most scalable indexing (and merging) solution out there. Wolfgang. On Jun 2, 2014, at 10:33 AM, Vineet Mishra clearmido...@gmail.com wrote

Offline Indexes Update to Shard

2014-05-28 Thread Vineet Mishra
Hi All, Has anyone tried with building Offline indexes with EmbeddedSolrServer and posting it to Shards. FYI, I am done building the indexes but looking out for a way to post these index files on shards. Copying the indexes manually to each shard's replica is possible and is working fine but I

Indexing Getting Failed

2014-05-16 Thread Vineet Mishra
Hi I have setup default cloud cluster 4.6.0 with inbuilt Zookeeper running on Jetty, as I started with indexing till a few thousand it goes fine but soon after some 5000 documents or so it started giving error(please find below) and stopped the indexing too as the Zookeeper Leader selection was

Re: Inconsistent response from Cloud Query

2014-05-15 Thread Vineet Mishra
Hi Shawn, There is no recovery case for me, neither the commit is pending. The case I am talking about is when I restart the Cloud all over again with index already flushed to disk. Thanks! On Sun, May 11, 2014 at 10:17 PM, Shawn Heisey s...@elyograg.org wrote: On 5/9/2014 11:42 AM, Cool

Fwd: Inconsistent response from Cloud Query

2014-05-13 Thread Vineet Mishra
Copying. Community: Looking forward for your response. -- Forwarded message -- From: Vineet Mishra clearmido...@gmail.com Date: Mon, May 12, 2014 at 5:57 PM Subject: Re: Inconsistent response from Cloud Query To: solr-user@lucene.apache.org Hi Shawn, There is no recovery case

Inconsistent response from Cloud Query

2014-05-06 Thread Vineet Mishra
Hi All, I have setup cloud-4.6.2 with default configuration on single machine with 2 shards and 2 replication through https://cwiki.apache.org/confluence/display/solr/Getting+Started+with+SolrCloud Cloud was up and running and I indexed the example data xml to it, it went fine. Now when I am

Re: Indexing Big Data With or Without Solr

2014-04-23 Thread Vineet Mishra
I did it with Tomcat and Zookeeper Ensemble, will mail you the steps shortly. Cheers On Sat, Apr 19, 2014 at 9:09 AM, Aman Tandon amantandon...@gmail.comwrote: Vineet please share after you setup for solr cloud Are you using jetty or tomcat.? On Saturday, April 19, 2014, Vineet Mishra

Re: Indexing Big Data With or Without Solr

2014-04-18 Thread Vineet Mishra
ask it here. Thanks; Furkan KAMACI 2014-04-15 13:20 GMT+03:00 Vineet Mishra clearmido...@gmail.com: Hi All, I have worked with Solr 3.5 to implement real time search on some 100GB data, that worked fine but was little slow on complex queries(Multiple group/joined queries). But now

Indexing Big Data With or Without Solr

2014-04-15 Thread Vineet Mishra
Hi All, I have worked with Solr 3.5 to implement real time search on some 100GB data, that worked fine but was little slow on complex queries(Multiple group/joined queries). But now I want to index some real Big Data(around 4 TB or even more), can SolrCloud be solution for it if not what could be

Re: SolrCloud with Tomcat

2014-03-10 Thread Vineet Mishra
GMT+02:00 Vineet Mishra clearmido...@gmail.com: Hi I am installing SolrCloud with 3 External Zookeeper(localhost:2181,localhost:2182,localhost:2183) and 2 Tomcats(localhost:8181,localhost:8182) all available on a single Machine(Just for getting started). By Following these links

SolrCloud with Tomcat

2014-03-06 Thread Vineet Mishra
Hi I am installing SolrCloud with 3 External Zookeeper(localhost:2181,localhost:2182,localhost:2183) and 2 Tomcats(localhost:8181,localhost:8182) all available on a single Machine(Just for getting started). By Following these links

Re: Fault Tolerant Technique of Solr Cloud

2014-02-27 Thread Vineet Mishra
Steffensen st...@designware.dk wrote: On 24/02/14 13:04, Vineet Mishra wrote: Can you brief as how to make a direct call to Zookeeper instead of Cloud Collection(as currently I was querying the Cloud something like *http://192.168.2.183:8900/solr/collection1/select?q=*:* http://192.168.2.183:8900

Scalability Limit of SolrCloud

2014-02-27 Thread Vineet Mishra
Hi All What is the Scalability Limit of CloudSolr, can it reach to index Billions of Documents and each document containing 400-500 Number Field(probably Float or Double). Is it possible and feasible to go with current CloudSolr Architecture or are there some other alternative or replacement.

Re: Fault Tolerant Technique of Solr Cloud

2014-02-24 Thread Vineet Mishra
...@designware.dk wrote: On 19/02/14 07:57, Vineet Mishra wrote: Thanks for all your response but my doubt is which *Server:Port* should the query be made as we don't know the crashed server or which server might crash in the future(as any server can go down). That is what CloudSolrServer

Fault Tolerant Technique of Solr Cloud

2014-02-18 Thread Vineet Mishra
Hi All, I want to have clear idea about the Fault Tolerant Capability of SolrCloud Considering I have setup the SolrCloud with a external Zookeeper, 2 shards, each having a replica with single collection as given in the official Solr Documentation.

Re: Fault Tolerant Technique of Solr Cloud

2014-02-18 Thread Vineet Mishra
for distributed search might work if any of the shard or replica goes down. Thanks On Tue, Feb 18, 2014 at 11:22 PM, Shawn Heisey s...@elyograg.org wrote: On 2/18/2014 8:32 AM, Shawn Heisey wrote: On 2/18/2014 6:05 AM, Vineet Mishra wrote: *Shard 1

pool-1-thread-4 java.lang.NoSuchMethodError: org.apache.solr.util.SimplePostTool

2013-12-06 Thread Vineet Mishra
pool-1-thread-4 java.lang.NoSuchMethodError: org.apache.solr.util.SimplePostTool I am getting this error while posting Data to Solr from XML generated file. Although the Solr post.jar is present in the Library Class Path and I also tried keeping the Source class of the Post Tool. Urgent Call.

Making a Web Request is failing with 403 Request Forbidden

2013-10-30 Thread Vineet Mishra
Hi All, I am making web server call to a website for Shortening the links, that is bit.ly but recieving a 403 Request Forbidden. Although if I use their webpage to short the web link its working good. Can any body tell me what might be the reason for such a vague behavior. Here is the code

Post Call to Solr RequestHandler

2013-08-09 Thread Vineet Mishra
Hi Currently I am working with RequestHandler in Solr, where the user defined query is processed at the class specified by the requesthandler in Solrconfig.xml. But my requirement is that I want to make it a Post call rather than a Get query call. Is it possible or are there some way we can

Re: Unexpected character '' (code 60) expected '='

2013-08-01 Thread Vineet Mishra
I am using Solr 3.5 with the posting XML file size of just 1Mb. On Wed, Jul 31, 2013 at 8:19 PM, Shawn Heisey s...@elyograg.org wrote: On 7/31/2013 7:16 AM, Vineet Mishra wrote: I checked the File. . .nothing is there. I mean the formatting is correct, its a valid XML file. What version

Re: Unexpected character '' (code 60) expected '='

2013-08-01 Thread Vineet Mishra
Management with Lock. On Thu, Aug 1, 2013 at 1:50 PM, Paul Masurel paul.masu...@gmail.com wrote: You can check for your xml validity with xmllint very simply. xmllint file Does this return an error? On Thu, Aug 1, 2013 at 9:59 AM, deniz denizdurmu...@gmail.com wrote: Vineet Mishra wrote

SimplePostTool: FATAL: Solr returned an error #400 Bad Request

2013-07-31 Thread Vineet Mishra
Hi All Currently I am in a mid of a project which Index some data to Solrs multiple instance. I have the Configuration as, on the same machine I have made multiple instances of Solr http://localhost:8080/solr1 http://localhost:8080/solr2 http://localhost:8080/solr3 http://localhost:8080/solr4

Re: SimplePostTool: FATAL: Solr returned an error #400 Bad Request

2013-07-31 Thread Vineet Mishra
the same document fail every time? What does it look like? You really haven't provided much information to go on. Best Erick On Wed, Jul 31, 2013 at 3:55 AM, Vineet Mishra clearmido...@gmail.com wrote: Hi All Currently I am in a mid of a project which Index some data to Solrs

Unexpected character '' (code 60) expected '='

2013-07-31 Thread Vineet Mishra
Hi All I am currently stuck in a Solr Issue while Posting some data to Solr Server. I have some record from Hbase which I am posting to Solr, but after posting some 1 Million of data records, it suddenly stopped. Checking the Catalina log trace it showed, *org.apache.solr.common.SolrException:

Re: Unexpected character '' (code 60) expected '='

2013-07-31 Thread Vineet Mishra
I checked the File. . .nothing is there. I mean the formatting is correct, its a valid XML file. On Wed, Jul 31, 2013 at 6:38 PM, Markus Jelsma markus.jel...@openindex.iowrote: This file is malformed: *SEVERE: org.apache.solr.common.SolrException: Unexpected character '' (code 60) expected

Group and performing statistics on groups

2013-07-26 Thread Vineet Mishra
Hi This is a urgent call, I am grouping the solr documents by a field name and want to get the Range(Min and Max) value for another field in that group. StatsComponent works fine on all the document as whole rendering the max and min of a field, is it possible to get the StatsComponent per group

Sorting the Solr Document after clubbing them from multiple instances

2013-07-24 Thread Vineet Mishra
Hi I have a Master Solr through which I am querying to multiple solr instance and aggregating their response and responding back to the user. Now the requirement is that when I get the data querying multiple solr instance, I want it to be sorted based on some field name. Say I have 3 Slave

Custom RequestHandlerBase XML Response Issue

2013-07-18 Thread Vineet Mishra
Hi all I am using a Custom RequestHandlerBase where I am querying from multiple different Solr instance and aggregating their output as a XML Document using DOM, now in the RequestHandler's function handleRequestBody(SolrQueryRequest req, SolrQueryResponse resp) I want to output this XML Document

Re: Custom RequestHandlerBase XML Response Issue

2013-07-18 Thread Vineet Mishra
18, 2013 at 3:31 PM, Vineet Mishra clearmido...@gmail.com wrote: Hi all I am using a Custom RequestHandlerBase where I am querying from multiple different Solr instance and aggregating their output as a XML Document using DOM, now in the RequestHandler's function handleRequestBody

Re: Custom RequestHandlerBase XML Response Issue

2013-07-18 Thread Vineet Mishra
/src/java/org/apache/solr/response/TextResponseWriter.java On Thu, Jul 18, 2013 at 4:08 PM, Vineet Mishra clearmido...@gmail.com wrote: Thanks for your response Shalin, so does that mean that we can't return a XML object in SolrQueryResponse through Custom RequestHandler? On Thu

Re: Custom RequestHandlerBase XML Response Issue

2013-07-18 Thread Vineet Mishra
So does that mean there is no way that we can write a XML or JSON object to the SolrQueryResponse and expect it to be formatted?

Re: Custom RequestHandlerBase XML Response Issue

2013-07-18 Thread Vineet Mishra
to serialize so it just calls .toString on it and writes it out. As long as you stick a known type into the SolrQueryResponse, you should be fine. On Thu, Jul 18, 2013 at 6:24 PM, Vineet Mishra clearmido...@gmail.com wrote: So does that mean there is no way that we can write a XML or JSON object