Thank you Keith for the answers and material you sent me. Just one more question about this solution: - What's the best way to consume data from Kafka to Flue. Do I need to implement something like in the webindex project: Kafka (Common Crawl) -> Spark -> Fluo? Or it's possible to ingest data directly from a Flue application?
Thank you again! Alan Camillo -----Original Message----- From: Keith Turner [mailto:ke...@deenlo.com] Sent: Tuesday, December 19, 2017 4:52 PM To: fluo-dev <dev@fluo.apache.org> Subject: Re: Fluo application question On Tue, Dec 19, 2017 at 1:25 PM, Alan Camillo <a...@blueshift.com.br> wrote: > We just start a project that the objective is consolidate some > personal information using some business rules. It's a kind of ranking > of the best information of a person. > > Today they use to reprocess every batch they receive comparing the new > data with all historical data. They're using Spark for this operation. > I'd like to propose something like this: > https://www.dropbox.com/s/glqhh7zzxd7g433/architecture.png?dl=0 > > Two questions: > - is it possible create an observer to synchronizes with HBase? You could use an export queue to make updates to an HBase instance. http://fluo.apache.org/docs/fluo-recipes/1.1.0-incubating/export-queue/ Also the slides below discuss the export queue (slide 27) and the concept of invert on export (slide 33). Invert on export would likely be useful for a key value store like hbase. https://www.slideshare.net/AccumuloSummit/accumulo-summit-2016-tips-for-writing-fluo-applications Fluo recipes does not currently have an exporter for HBase. It would be useful to add one to Fluo Recipes like the following for Accumulo. http://fluo.apache.org/docs/fluo-recipes/1.1.0-incubating/accumulo-export-queue/ > - Am I doing a good use of Fluo? If not, why? It sounds like it may be a good fit. However, the exporter for HBase would need to be implemented. The Accumulo exporter is written in such a way that multiple transactions can share a single writer for efficiency. Not sure if this pattern should be followed for HBase. > > Thank you all! > > -----Original Message----- > From: Keith Turner [mailto:ke...@deenlo.com] > Sent: Tuesday, December 19, 2017 2:21 PM > To: fluo-dev <dev@fluo.apache.org> > Subject: Re: About user group > > On Tue, Dec 19, 2017 at 8:18 AM, Alan Camillo <a...@blueshift.com.br> > wrote: >> Hello Fluo group! >> >> My name is Alan, I'm a big date architect and owner of a company >> called BlueShift Brasil. And I'm looking foward for Apache Fluo. I'd >> like to know about a user group to because I was no able to find, is >> it exist? > > We currently do not have a user list. Feel free to ask any questions > you have here on the dev list. > >> >> I have many questions to do and I would'nt like to post those here. >> If I could help if something in the project, please count on me. > > If you are interested in contributing, the following may be a good > issue to start with. > > https://github.com/apache/fluo-docker/issues/9 > >> >> Thanks! >> Alan Camillo >> *BlueShift *I IT Director >> Cel.: +55 11 98283-6358 >> Tel.: +55 11 4605-5082