Thank you for the response.
Can I please know the reason why bit map indexes are not appropriate for
big data.
Rather than using the traditional bitmap indexing techniques we are
planning to implement a combination of novel bitmap indexing techniques
like bit sliced indexes and projection indexes.
Furthermore, can I please know whether bloom filters have already been
implemented in Spark.

Thank you

On Thu, Jun 30, 2016 at 12:51 AM, Jörn Franke <jornfra...@gmail.com> wrote:

>
> Is it the traditional bitmap indexing? I would not recommend it for big
> data. You could use bloom filters and min/max indexes in-memory which look
> to be more appropriate. However, if you want to use bitmap indexes then you
> would have to do it as you say. However, bitmap indexes may consume a lot
> of memory, so I am not sure that simply caching them in-memory is desired.
>
> > On 29 Jun 2016, at 19:49, Nishadi Kirielle <ndime...@gmail.com> wrote:
> >
> > Hi All,
> >
> > I am a CSE undergraduate and as for our final year project, we are
> expecting to construct a cluster based, bit-oriented analytic platform
> (storage engine) to provide fast query performance when used for OLAP with
> the use of novel bitmap indexing techniques when and where appropriate.
> >
> > For that we are expecting to use Spark SQL. We will need to implement a
> way to cache the bit map indexes and in-cooperate the use of bitmap
> indexing at the catalyst optimizer level when it is possible.
> >
> > I would highly appreciate your feedback regarding the proposed approach.
> >
> > Thank you & Regards
> >
> > Nishadi Kirielle
> > Department of Computer Science and Engineering
> > University of Moratuwa
> > Sri Lanka
>

Reply via email to