Re: Why no composite primary-key in lucene ?

2017-04-29 Thread Dorian Hoxha
ere are much more > valuable things you could spend engineering time on than the rabbit > hole of trying to work around Solr's way of dealing with > ;).. > > Best, > Erick > > On Sat, Apr 29, 2017 at 11:38 AM, Dorian Hoxha <dorian.ho...@gmail.com> > wrote: > > @Y

Re: Why no composite primary-key in lucene ?

2017-04-29 Thread Dorian Hoxha
..@gmail.com> wrote: > > On Sat, Apr 29, 2017 at 1:45 PM, Dorian Hoxha <dorian.ho...@gmail.com> > wrote: > > @Yonik > > Thanks, makes sense. So this means that the 'id' needs to be indexed (is > always?), (so you can get/update/delete docs not in translog), ri

Re: Why no composite primary-key in lucene ?

2017-04-29 Thread Dorian Hoxha
ction log. > Only when the document is not found in the transaction log will it go > and consult the lucene index (which can only search as of the last > commit). > > -Yonik > > On Sat, Apr 29, 2017 at 12:57 PM, Dorian Hoxha <dorian.ho...@gmail.com> > wrote: > > I kno

Re: Why no composite primary-key in lucene ?

2017-04-29 Thread Dorian Hoxha
s > specified. Then Solr tells Lucene to delete the doc with the internal > Lucene doc ID and then tells Lucene to index this new doc. Lucene > assigns it a new internal Lucene doc ID. > > > Best, > Erick > > On Sat, Apr 29, 2017 at 5:38 AM, Dorian Hoxha <dorian.ho...@gm

Re: Why no composite primary-key in lucene ?

2017-04-29 Thread Dorian Hoxha
<apa...@elyograg.org> wrote: > On 4/28/2017 6:16 AM, Dorian Hoxha wrote: > > I searched for this on mailing-list,issues etc, but couldn't find any > > post. > > > > So, why not have the possibility of ? > > Or nobody cared enough to implement it ? Or no

Why no composite primary-key in lucene ?

2017-04-28 Thread Dorian Hoxha
Hi friends, I searched for this on mailing-list,issues etc, but couldn't find any post. So, why not have the possibility of ? Or nobody cared enough to implement it ? Or no gains ? Cause I've had many cases (on es/solr) that the 'id' was a "tenant_id:project_id:item_id" and I still had those

Re: Requires subscription

2017-04-28 Thread Dorian Hoxha
What do you mean by subscription? Have you seen: https://wiki.apache.org/solr/HowToContribute ? On Fri, Apr 28, 2017 at 1:01 PM, Adi wrote: > Hi Team, > > I would interest in contribution of solr development ,please provide > subscription and instructions > > Thanks in

Make solr routing great again (child indexes)

2017-04-27 Thread Dorian Hoxha
Hey friends, So my reasoning is 1. Use the same `val1!val2` routing 2. Push all `val1!` to same shard 3. When doing split shard, if most values are for `val1!`, (a very big tenant) then convert the shard to a child_index(same schema/settings as parent), and add an alias (so

Re: Delta of delta encoding

2017-04-27 Thread Dorian Hoxha
@Adrien choosing different type of compression/storage depending on the data From BigQuery: https://cloud.google.com/blog/big-data/2016/04/inside-capacitor-bigquerys-next-generation-columnar-storage-format BigQuery has background processes that constantly look at all the stored data and

Re: Embeding distributed-solr in client-app without storing data

2017-04-19 Thread Dorian Hoxha
er.wunderwood.org/ (my blog) > > > On Apr 19, 2017, at 9:01 AM, Dorian Hoxha <dorian.ho...@gmail.com> wrote: > > @Walter > > Usually you have: client-app --> random-solr-node(mergerer) --> each other > node that has a shard > While what I want: client-app

Re: Embeding distributed-solr in client-app without storing data

2017-04-19 Thread Dorian Hoxha
wrote: > > Hello, Dorian. > I'm not sure about 1. But you can create EmbeddedSolrServer and add > "collection" parameter. It's what's done in > org.apache.solr.response.transform.SubQueryAugmenter > [subquery] > > On Wed, Apr 19, 2017 at 3:53 PM, Dorian Hoxha <dorian.ho...@

Embeding distributed-solr in client-app without storing data

2017-04-19 Thread Dorian Hoxha
Hi friends, Has anybody done this? The reason: one less HTTP request when doing a distributed search, while the client app does not store any data itself (like a search-only node), and the other nodes do not care about the search nodes. Does that make sense? Regards, Dorian

Re: Resources/pointers on hacking on solr

2017-04-19 Thread Dorian Hoxha
On Wed, Apr 19, 2017 at 12:41 AM, Dorian Hoxha <dorian.ho...@gmail.com> > wrote: > >> Hey friends, >> >> I'll be having some free time in the next 2 weeks and would like to get >> up to speed on hacking on solr. I've done very little java on university

Resources/pointers on hacking on solr

2017-04-18 Thread Dorian Hoxha
Hey friends, I'll be having some free time in the next 2 weeks and would like to get up to speed on hacking on Solr. I did very little Java at university, so I'd like to get up to speed on Java as well. So if you can give me some high-level pointers, if possible, on: 1. getting

Re: How would you architect solr/lucene if you were starting from scratch for them to be 10X+ faster/efficient ?

2017-02-10 Thread Dorian Hoxha
/RediSearch On Fri, Feb 10, 2017 at 1:36 PM, Dorian Hoxha <dorian.ho...@gmail.com> wrote: > > > On Wed, Feb 8, 2017 at 3:58 PM, Alexandre Rafalovitch <arafa...@gmail.com> > wrote: > >> Once you filter out the JIRA messages, the forum is very strong and >> alive.

Re: How would you architect solr/lucene if you were starting from scratch for them to be 10X+ faster/efficient ?

2017-02-10 Thread Dorian Hoxha
s parallel shared nothing architecture like the one in the TurboPFor inverted index app and a ram resident inverted index." > Regards and good luck, > Alex. > > http://www.solr-start.com/ - Resources for Solr users, new and experienced > > > On 8 February 2017 at 03

Re: How would you architect solr/lucene if you were starting from scratch for them to be 10X+ faster/efficient ?

2017-02-08 Thread Dorian Hoxha
So, am I asking too much (maybe), is this forum dead (then where to ask ? there is extreme noise here), is lucene perfect(of course not) ? On Wed, Jan 25, 2017 at 5:01 PM, Dorian Hoxha <dorian.ho...@gmail.com> wrote: > Was thinking also how bing doesn't use posting lists > <http:

Re: How would you architect solr/lucene if you were starting from scratch for them to be 10X+ faster/efficient ?

2017-01-25 Thread Dorian Hoxha
rd-per-core could probably be faked with multiple JVMs and numactl - > could be an interesting experiment. > > > > That said, I’m aware that a phenomenal amount of optimization effort has > gone into Lucene, and I’d also be interested in hearing about things that > worked well.

How would you architect solr/lucene if you were starting from scratch for them to be 10X+ faster/efficient ?

2017-01-20 Thread Dorian Hoxha
Hi friends, I was thinking about how ScyllaDB's architecture works compared to Cassandra's, which gives them 10x+ performance and lower latency. If you were starting Lucene and Solr from scratch, what would you do to achieve something similar? Different