Dedupe in a SolrCloud

2015-06-17 Thread Markus.Mirsberger
Hi, I am trying to use the dedupe feature to detect and mark near-duplicate content in my collections. I don't want to prevent duplicate content. I would like to detect it and keep it for further processing. That's why I'm using an extra field and not the document's unique field. Here is how I

Sorting of facets

2014-01-10 Thread Markus.Mirsberger
Hi, is it possible to sort the facet results by fields other than the facet field? E.g. I have 3 int fields: directory, pages, links. Because I want all unique directories I have to use directory as the facet.field parameter. As far as I understand what I read I can now only sort the facet

ngroups does not show correct number of groups when used in SolrCloud

2013-06-14 Thread Markus.Mirsberger
Hi, I just noticed (after a long time testing and finally looking into the docs :p) that the ngroups parameter does not show the correct number of groups when used in anything other than a single-shard environment (in my case SolrCloud). Is there another way to get the number of all groups

Re: ngroups does not show correct number of groups when used in SolrCloud

2013-06-14 Thread Markus.Mirsberger
Hi Shreejay, Thanks for the info. I read about this too, but as far as I understand it, this feature is not really useful in my case. This means I would have to reindex my documents just to get the grouping that I need now. It would be OK to do that one time but I would have to do that

How to set the shardid?

2013-02-25 Thread Markus.Mirsberger
Hi, I have two servers, each server hosting one shard in a collection. I'd like to have one server have the same shardId for every collection I create (e.g. shard1 on server1 and shard2 on server2). I thought this would work by setting -DshardId=shard1 when starting the server. But the shardId's shard1

Problem with shard naming in Solr 4.1

2013-02-10 Thread Markus.Mirsberger
Hi, I upgraded today from a 4.1 nightly build to the 4.1 release. Now I noticed that the naming of collection shards changed. When I create a collection, an identifier for the shard and the replica is added automatically. Can I turn this off somehow or is there a way to retrieve the

Re: Crawl Anywhere -

2013-02-10 Thread Markus.Mirsberger
Hi, did you try Heritrix? The documents are stored as HTML inside a WARC file which can be postprocessed easily. Cheers, Markus On 11.02.2013 12:16, SivaKarthik wrote: Dear Erick, Thanks for your reply. Yes, Nutch can meet my requirement, but the problem is, I want to store

Is it possible to use something like sum() in a solr-query?

2012-10-29 Thread Markus.Mirsberger
Hi, the subject says it all :) Is there something like sum() available in a Solr query to sum all values of a field? Regards, Markus Mirsberger

Re: Is it possible to use something like sum() in a solr-query?

2012-10-29 Thread Markus.Mirsberger
: Markus.Mirsberger Sent: Monday, October 29, 2012 11:08 AM To: solr-user@lucene.apache.org Subject: Is it possible to use something like sum() in a solr-query? Hi, the subject says it all :) Is there something like sum() available in a Solr query to sum all values of a field? Regards, Markus Mirsberger

Re: SolrJ missing CollectionAdmin Api to create new collections dynamically

2012-10-26 Thread Markus.Mirsberger
Yes, thanks. But how can I check the status of a collection? The action STATUS does not exist in the CollectionAdmin, only in the CoreAdmin. At the moment probably the only way to get information about this is somehow through the ZkStateReader? Regards, Markus On 27.10.2012 06:37, Chris Hostetter

SolrJ missing CollectionAdmin Api to create new collections dynamically

2012-10-24 Thread Markus.Mirsberger
Hi, I can't find a good way to create a new collection with SolrJ. I need to create my collections dynamically and at the moment the only way I see is to call the CollectionAdmin with an HTTP call directly to any of my Solr servers. I don't like this because I think it's a better way only to

Solr3.6 DeleteByQuery with negated field not working

2012-10-22 Thread Markus.Mirsberger
Hi, I am trying to delete some documents in my index by query. When I select them with this negated query I get all the documents I want to delete, but when I use this query in the DeleteByQuery it is not working. I'm trying to delete all elements whose value ends with 'somename/' When I

Solr3.6 DeleteByQuery not working with negated query

2012-10-22 Thread Markus.Mirsberger
Hi, I am trying to delete some documents in my index by query. When I just select them with this negated query, I get all the documents I want to delete, but when I use this query in the DeleteByQuery it is not working. I'm trying to delete all elements whose value ends with 'somename/'

Re: Solr3.6 DeleteByQuery not working with negated query

2012-10-22 Thread Markus.Mirsberger
wrote: Hi Markus, Why do you think it's not deleting anything? Thanks, Patrick On 22 Oct 2012 08:36, Markus.Mirsberger markus.mirsber...@gmx.de wrote: Hi, I am trying to delete some documents in my index by query. When I just select them with this negated query, I get all

Re: Solr3.6 DeleteByQuery not working with negated query

2012-10-22 Thread Markus.Mirsberger
the delete? Patrick On 22 Oct 2012 08:43, Markus.Mirsberger markus.mirsber...@gmx.de wrote: Hi Patrick, Because I have the same number of documents in my index as before I performed the query. And when I use the negated query just to select the documents I can see they still

Re: Solr3.6 DeleteByQuery not working with negated query

2012-10-22 Thread Markus.Mirsberger
/collection1/update?commit=true&stream.body=<delete><query>*:* -store_0_coordinate:[* TO *]</query></delete> The curl you used is, of course, fine. I just find the above easier. Best Erick On Mon, Oct 22, 2012 at 4:22 AM, Markus.Mirsberger markus.mirsber...@gmx.de wrote: Yes I'm sure. I committed a second time