Thank you Keith for the answers and material you sent me.
Just one more question about this solution:
  - What's the best way to consume data from Kafka to Flue. Do I need to
implement something like in the webindex project: Kafka (Common Crawl) ->
Spark -> Fluo? Or it's possible to ingest data directly from a Flue
application?

Thank you again!
Alan Camillo

-----Original Message-----
From: Keith Turner [mailto:ke...@deenlo.com]
Sent: Tuesday, December 19, 2017 4:52 PM
To: fluo-dev <dev@fluo.apache.org>
Subject: Re: Fluo application question

On Tue, Dec 19, 2017 at 1:25 PM, Alan Camillo <a...@blueshift.com.br> wrote:
> We just start a project that the objective is consolidate some
> personal information using some business rules. It's a kind of ranking
> of the best information of a person.
>
> Today they use to reprocess every batch they receive comparing the new
> data with all historical data. They're using Spark for this operation.
> I'd like to propose something like this:
> https://www.dropbox.com/s/glqhh7zzxd7g433/architecture.png?dl=0
>
> Two questions:
>  - is it possible create an observer to synchronizes with HBase?

You could use an export queue to make updates to an HBase instance.

http://fluo.apache.org/docs/fluo-recipes/1.1.0-incubating/export-queue/

Also the slides below discuss the export queue (slide 27) and the concept of
invert on export (slide 33).  Invert on export would likely be useful for a
key value store like hbase.

https://www.slideshare.net/AccumuloSummit/accumulo-summit-2016-tips-for-writing-fluo-applications

Fluo recipes does not currently have an exporter for HBase.  It would be
useful to add one to Fluo Recipes like the following for Accumulo.

http://fluo.apache.org/docs/fluo-recipes/1.1.0-incubating/accumulo-export-queue/

>  - Am I doing a good use of Fluo? If not, why?

It sounds like it may be a good fit.  However, the exporter for HBase would
need to be implemented.  The Accumulo exporter is written in such a way that
multiple transactions can share a single writer for efficiency.  Not sure if
this pattern should be followed for HBase.

>
> Thank you all!
>
> -----Original Message-----
> From: Keith Turner [mailto:ke...@deenlo.com]
> Sent: Tuesday, December 19, 2017 2:21 PM
> To: fluo-dev <dev@fluo.apache.org>
> Subject: Re: About user group
>
> On Tue, Dec 19, 2017 at 8:18 AM, Alan Camillo <a...@blueshift.com.br>
> wrote:
>> Hello Fluo group!
>>
>> My name is Alan, I'm a big date architect and owner of a company
>> called BlueShift Brasil. And I'm looking foward for Apache Fluo. I'd
>> like to know about a user group to because I was no able to find, is
>> it exist?
>
> We currently do not have a user list.  Feel free to ask any questions
> you have here on the dev list.
>
>>
>> I have many questions to do and I would'nt like to post those here.
>> If I could help if something in the project, please count on me.
>
> If you are interested in contributing, the following may be a good
> issue to start with.
>
> https://github.com/apache/fluo-docker/issues/9
>
>>
>> Thanks!
>> Alan Camillo
>> *BlueShift *I IT Director
>> Cel.: +55 11 98283-6358
>> Tel.: +55 11 4605-5082

Reply via email to