Re: [DISCUSS] Incubating Proposal for Datark

2022-09-25 Thread hongbin ma
Good luck to Datark

On Mon, Sep 26, 2022 at 11:04 AM yOA zzha  wrote:

> Good luck to this project.
>
> Jiayi Liu  于2022年9月26日周一 10:55写道:
>
> > Good luck to Datark.
> >
> > Kaijie Chen  于2022年9月26日周一 10:47写道:
> >
> > > +1 (non-binding)
> > >
> > > Good luck.
> > >
> > > Kaijie
> > >
> > > -
> > > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> > > For additional commands, e-mail: general-h...@incubator.apache.org
> > >
> > >
> >
>


-- 
Regards,
Hongbin Ma


Re: [VOTE] Accept S2Graph into Apache Incubation

2015-11-23 Thread hongbin ma
+1 non-binding

On Tue, Nov 24, 2015 at 10:54 AM, Luke Han  wrote:

> +1 non-binding
>
>
> Best Regards!
> -
>
> Luke Han
>
> On Tue, Nov 24, 2015 at 8:53 AM, Hyunsik Choi  wrote:
>
> > Hello folks,
> >
> > Thanks for all the feedback on the S2Graph Proposal.
> >
> > I would like to call for a [VOTE] on S2Graph joining the ASF as an
> > incubation project.
> >
> > The vote is open for at least 72 hours:
> >
> > [ ] +1 accept S2Graph in the Incubator
> > [ ] ±0
> > [ ] -1 (please give reason)
> >
> > S2Graph provides a scalable distributed graph database engine over a
> > key/value store such as HBase. S2Graph provides a fully asynchronous
> > API to manipulate data as a property graph model and fast
> > breadth-first-search queries over the graph. S2Graph is designed for
> > OLTP-like workloads on graph data sets instead of batch processing,
> > and it also provides INSERT/UPDATE operations on them.
> >
> > The proposal is available on the wiki here:
> > https://wiki.apache.org/incubator/S2GraphProposal
> >
> > Best regards,
> > Hyunsik
> >
> >
> > 
> >
> >
> 
> > = S2Graph Proposal =
> >
> > == Abstract ==
> > S2Graph is a distributed and scalable OLTP graph database built on
> > Apache HBase to support fast traversal of extremely large graphs.
> >
> > == Proposal ==
> > S2Graph provides a scalable distributed graph database engine over a
> > key/value store such as HBase. S2Graph provides a fully asynchronous
> > API to manipulate data as a property graph model and fast
> > breadth-first-search queries over the graph. S2Graph is designed for
> > OLTP-like workloads on graph data sets instead of batch processing.
> > Also, S2Graph provides INSERT/UPDATE operations. Its name 'S2Graph' is
> > an abbreviated word of '''S'''uper '''S'''imple '''Graph''' Database.
> >
> > Here are additional materials to introduce S2Graph.
> >  * HBaseCon 2015 -
> http://www.slideshare.net/HBaseCon/use-cases-session-5
> >  * Apache: Big Data 2015 -
> > http://schd.ws/hosted_files/apachebigdata2015/06/s2graph_apache_con.pdf
> >
> > == Background ==
> > S2Graph initially started as an internal project at Kakao.com to
> > efficiently store user relations and user activities as one large
> > graph and to provide a unified query interface to traverse the graph.
> > It was open sourced on Github about a 3 months ago in June 2015.
> >
> > Over time, S2Graph using HBase as the storage tier has begun by
> > adapted into various applications, such as messaging, social feeds,
> > and realtime recommendations at Kakao.
> >
> > Users can benefit by using S2Graph`s generalized high level graph
> > abstraction API instead of querying via low-level key/value APIs, just
> > as Apache Phoenix provides a SQL layer over HBase.
> >
> > == Rationale ==
> > Graph data (highly interconnected data) is very abundant and important
> > these days. When users have a multitude of relationships, each with
> > complex properties associated with them, a graph model is more
> > intuitive and efficient than tabular formats (RDBMS).
> >
> > There are many ASF projects that provide SQL tiers, but there is no
> > ASF projects that provide a scalable graph layer on top of the
> > existing hadoop ecosystem. When graph data grows to the trillion edge
> > scale, the process of traversing takes a long time and can be costly.
> > However, with the benefit of HBase`s scalable architecture, S2Graph
> > can traverse large graphs in a breadth-first-search manner
> > efficiently.
> >
> > S2Graph also interoperates with several existing Apache projects
> > (HBase, Apache Spark) to provide means of merging real time events and
> > batch processed data using the property graph data model.
> >
> > Many developers run their own domain specific API servers to serve
> > their data products, but a graph model is general and the S2Graph API
> > fully supports traversal of the graph, so it can be used as a scalable
> > general purpose API serving layer for various domains. As long as data
> > can be modeled as graph, then users can avoid tedious work developing
> > customized API servers if they use S2Graph.
> >
> > == Initial Goals ==
> > The initial goals will be to move the existing codebase to Apache and
> > integrate with the Apache development process. Once this is
> > accomplished, we plan for incremental development and releases that
> > follow the Apache guidelines.
> >
> > == Current Status ==
> >
> > === Meritocracy ===
> > S2Graph operated on meritocratic principles from the get go.
> > Currently, all the discussions pertaining to S2Graph development are
> > public on Github. The current incubation proposal includes the major
> > code contributors to S2Graph. Several additional people have worked on
> > the S2graph codebase for industry use cases and would be interested in
> > becoming committers. We are starting with a 

Re: [DISCUSS] S2Graph Incubator Proposal

2015-11-23 Thread hongbin ma
hi Hyunsik

This is Hongbin Ma form Apache Kylin. Before working on Kylin I worked on
Microsoft Research Asia's Graph engine "Trinity"(
http://research.microsoft.com/en-us/projects/trinity/), mostly on a
ansi-like graph query language. My experience makes me very interested in
this project.

Usually graphdb tends to build upon native storage engine to boost
performance, but the idea of leveraging HBase is novel and interesting
(just like Kylin), can't wait to see how it worked out.


On Tue, Nov 24, 2015 at 9:22 AM, Marko Rodriguez <okramma...@gmail.com>
wrote:

> Hi,
>
> I just added a ticket describing the benefits of providing Apache
> TinkerPop support for S2Graph.
> https://github.com/kakao/s2graph/issues/175
> Just some thoughts in case you haven't considered TinkerPop.
>
> Good luck,
> Marko.
>
> http://markorodriguez.com
>
> On Nov 23, 2015, at 5:35 PM, Stack <st...@duboce.net> wrote:
>
> > Good luck!
> >
> > On Mon, Nov 23, 2015 at 4:31 PM, Hyunsik Choi <hyun...@apache.org>
> wrote:
> >
> >> Stack,
> >>
> >> I greatly appreciate your reviews. I reflected almost all comments you
> >> gave to the proposal. I'll make a vote thread soon.
> >>
> >> Best regards,
> >> Hyunsik
> >>
> >> On Thu, Nov 19, 2015 at 2:57 PM, Hyunsik Choi <hyun...@apache.org>
> wrote:
> >>> Thank you Stack for your kind work. I just understood. I will improve
> >>> the proposal from your suggestion.
> >>>
> >>> Best regards,
> >>> Hyunsik
> >>>
> >>> On Thu, Nov 19, 2015 at 11:35 AM, Stack <st...@duboce.net> wrote:
> >>>> On Wed, Nov 18, 2015 at 5:04 PM, Hyunsik Choi <hyun...@apache.org>
> >> wrote:
> >>>>
> >>>>> Thank you very much Stack! It definitely looks better than just wiki.
> >>>>> It would be helpful to improve the proposal.
> >>>>>
> >>>>>
> >>>> To be clear, proposal needs to be on the wiki. I just moved it over
> so I
> >>>> could show my edits as 'suggested' rather than change the original.
> Take
> >>>> whatever edits you think improve the proposal and then apply them to
> the
> >>>> wiki.
> >>>> St.Ack
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>> Best regards,
> >>>>> Hyunsik
> >>>>>
> >>>>> On Tue, Nov 17, 2015 at 10:20 PM, Stack <st...@duboce.net> wrote:
> >>>>>> Glad to see s2graph being put up as an incubator project.
> >>>>>>
> >>>>>> Here [1] are some suggested edits to try and help strengthen the
> >> proposal
> >>>>>> before it goes up for a vote. Hopefully they help.
> >>>>>>
> >>>>>> St.Ack
> >>>>>> 1.
> >>>>>>
> >>>>>
> >>
> https://docs.google.com/document/d/19iNc0u_-9ogb0kDC-WnLoWWg9J_LFeuSGB8GF_rQfEs/edit?usp=sharing
> >>>>>>
> >>>>>> On Tue, Nov 17, 2015 at 3:59 PM, Hyunsik Choi <hyun...@apache.org>
> >>>>> wrote:
> >>>>>>
> >>>>>>> Thank you Henry. Yes, we already had enough time to discuss the
> >>>>>>> S2Graph proposal. I'll make a vote thread soon.
> >>>>>>>
> >>>>>>> Best regards,
> >>>>>>> Hyunsik
> >>>>>>>
> >>>>>>> On Tue, Nov 17, 2015 at 2:22 PM, Henry Saputra <
> >> henry.sapu...@gmail.com
> >>>>>>
> >>>>>>> wrote:
> >>>>>>>> Looks like we have positive responses. I think it is time for VOTE
> >>>>>>> thread :)
> >>>>>>>>
> >>>>>>>> On Friday, November 6, 2015, Hyunsik Choi <hyun...@apache.org>
> >> wrote:
> >>>>>>>>
> >>>>>>>>> Hi folks,
> >>>>>>>>>
> >>>>>>>>> We would like to start a discussion on S2Graph as an incubation
> >>>>> project.
> >>>>>>>>>
> >>>>>>>>> S2Graph is a distributed and scalable OLTP graph database built
> >> on
> >>>>>>>>> HBase. It provides interactive queries for
> >> vertex/edge/sub-grap

Re: [VOTE] Accept Eagle into Apache Incubation

2015-10-25 Thread hongbin ma
+1 (non binding)

On Mon, Oct 26, 2015 at 12:20 AM, Ralph Goers 
wrote:

> +1 (binding)
>
> Ralph
>
> > On Oct 23, 2015, at 7:11 AM, Manoharan, Arun 
> wrote:
> >
> > Hello Everyone,
> >
> > Thanks for all the feedback on the Eagle Proposal.
> >
> > I would like to call for a [VOTE] on Eagle joining the ASF as an
> incubation project.
> >
> > The vote is open for 72 hours:
> >
> > [ ] +1 accept Eagle in the Incubator
> > [ ] ±0
> > [ ] -1 (please give reason)
> >
> > Eagle is a Monitoring solution for Hadoop to instantly identify access
> to sensitive data, recognize attacks, malicious activities and take actions
> in real time. Eagle supports a wide variety of policies on HDFS data and
> Hive. Eagle also provides machine learning models for detecting anomalous
> user behavior in Hadoop.
> >
> > The proposal is available on the wiki here:
> > https://wiki.apache.org/incubator/EagleProposal
> >
> > The text of the proposal is also available at the end of this email.
> >
> > Thanks for your time and help.
> >
> > Thanks,
> > Arun
> >
> > 
> >
> > Eagle
> >
> > Abstract
> > Eagle is an Open Source Monitoring solution for Hadoop to instantly
> identify access to sensitive data, recognize attacks, malicious activities
> in hadoop and take actions.
> >
> > Proposal
> > Eagle audits access to HDFS files, Hive and HBase tables in real time,
> enforces policies defined on sensitive data access and alerts or blocks
> user’s access to that sensitive data in real time. Eagle also creates user
> profiles based on the typical access behaviour for HDFS and Hive and sends
> alerts when anomalous behaviour is detected. Eagle can also import
> sensitive data information classified by external classification engines to
> help define its policies.
> >
> > Overview of Eagle
> > Eagle has 3 main parts.
> > 1.Data collection and storage - Eagle collects data from various hadoop
> logs in real time using Kafka/Yarn API and uses HDFS and HBase for storage.
> > 2.Data processing and policy engine - Eagle allows users to create
> policies based on various metadata properties on HDFS, Hive and HBase data.
> > 3.Eagle services - Eagle services include policy manager, query service
> and the visualization component. Eagle provides intuitive user interface to
> administer Eagle and an alert dashboard to respond to real time alerts.
> >
> > Data Collection and Storage:
> > Eagle provides programming API for extending Eagle to integrate any data
> source into Eagle policy evaluation framework. For example, Eagle hdfs
> audit monitoring collects data from Kafka which is populated from namenode
> log4j appender or from logstash agent. Eagle hive monitoring collects hive
> query logs from running job through YARN API, which is designed to be
> scalable and fault-tolerant. Eagle uses HBase as storage for storing
> metadata and metrics data, and also supports relational database through
> configuration change.
> >
> > Data Processing and Policy Engine:
> > Processing Engine: Eagle provides stream processing API which is an
> abstraction of Apache Storm. It can also be extended to other streaming
> engines. This abstraction allows developers to assemble data
> transformation, filtering, external data join etc. without physically bound
> to a specific streaming platform. Eagle streaming API allows developers to
> easily integrate business logic with Eagle policy engine and internally
> Eagle framework compiles business logic execution DAG into program
> primitives of underlying stream infrastructure e.g. Apache Storm. For
> example, Eagle HDFS monitoring transforms audit log from Namenode to object
> and joins sensitivity metadata, security zone metadata which are generated
> from external programs or configured by user. Eagle hive monitoring filters
> running jobs to get hive query string and parses query string into object
> and then joins sensitivity metadata.
> > Alerting Framework: Eagle Alert Framework includes stream metadata API,
> scalable policy engine framework, extensible policy engine framework.
> Stream metadata API allows developers to declare event schema including
> what attributes constitute an event, what is the type for each attribute,
> and how to dynamically resolve attribute value in runtime when user
> configures policy. Scalable policy engine framework allows policies to be
> executed on different physical nodes in parallel. It is also used to define
> your own policy partitioner class. Policy engine framework together with
> streaming partitioning capability provided by all streaming platforms will
> make sure policies and events can be evaluated in a fully distributed way.
> Extensible policy engine framework allows developer to plugin a new policy
> engine with a few lines of codes. WSO2 Siddhi CEP engine is the policy
> engine which Eagle supports as first-class citizen.
> > Machine Learning module: Eagle provides capabilities to define user
> activity patterns or 

Re: [VOTE] Release Apache Kylin 1.1-incubating (rc1)

2015-10-18 Thread hongbin ma
+1

unit test passed on my CI

On Mon, Oct 19, 2015 at 10:46 AM, Li Yang  wrote:

> +1
>
> Checksum is correct.
>
> Unit test passed on java version "1.7.0_71", OpenJDK Runtime Environment
> (rhel-2.5.3.1.el6-x86_64 u71-b14)
>
>
> On Thu, Oct 15, 2015 at 10:00 AM, Luke Han  wrote:
>
> > +1
> >
> > MD5 & SHA1 checksum verified
> > License files are all there
> >
> > Unit Test passed
> >
> >
> >
> > Best Regards!
> > -
> >
> > Luke Han
> >
> > On Thu, Oct 15, 2015 at 9:42 AM, ShaoFeng Shi 
> > wrote:
> >
> > > Hi all,
> > >
> > > The Apache Kylin community has voted on and approved a proposal to
> > release
> > > Apache Kylin 1.1-incubating.
> > >
> > > Proposal:http://s.apache.org/Jzu
> > >
> > > Vote result:
> > > 7 binding +1 votes
> > > 6 non-binding +1 votes
> > > No -1 voteshttp://s.apache.org/kylin-1.1-result_rc1
> > >
> > >
> > > The commit to be voted
> > > upon:
> > >
> >
> https://github.com/apache/incubator-kylin/commit/1955a2f9aea7b7f608f0496c00807715ea4246a5
> > >
> > > Its hash is 1955a2f9aea7b7f608f0496c00807715ea4246a5.
> > >
> > > The artifacts to be voted on are located
> > > here:
> > >
> >
> https://dist.apache.org/repos/dist/dev/incubator/kylin/apache-kylin-1.1-incubating-rc1/
> > >
> > > The hashes of the artifacts are as follows:
> > > apache-kylin-1.1-incubating-src.tar.gz.md5
> > 18dfbb012e1eb807b1a5c1b134a537aa
> > > apache-kylin-1.1-incubating-src.tar.gz.sha1
> > > 041ee010c67a0b5611d9dd06f7fd6f37388c5374
> > >
> > > A staged Maven repository is available for review
> > > at:
> > https://repository.apache.org/content/repositories/orgapachekylin-1012/
> > >
> > > Release artifacts are signed with the following
> > > key:https://people.apache.org/keys/committer/shaofengshi.asc
> > >
> > > Pursuant to the Releases section of the Incubation Policy and with
> > > the endorsement of our mentors we would now like to request
> > > the permission of the Incubator PMC to publish the release. The vote
> > > is open for 72 hours, or until the necessary number of votes (3 +1)
> > > is reached.
> > >
> > > [ ] +1 Release this package
> > > [ ]  0 I don't feel strongly about it, but I'm okay with the release
> > > [ ] -1 Do not release this package because...
> > >
> > >
> > > Shaofeng Shi, on behalf of Apache Kylin ppmcshaofeng...@apache.org
> > >
> >
>



-- 
Regards,

*Bin Mahone | 马洪宾*
Apache Kylin: http://kylin.io
Github: https://github.com/binmahone