[VOTE] Accept Marmotta into the incubator

Andy Seaborne Thu, 29 Nov 2012 03:28:49 -0800

Hi there,

Following the discussion thread, here is the formal vote on the Marmotta proposal:


Please cast your votes on whether to accept the Apache Marmotta proposal:

[ ] +1 Accept Marmotta into the Apache Incubator
[ ] +0 Indifferent to the acceptance of Marmotta
[ ] -1 Do not accept the Marmotta proposal because ...

The vote will be open until at least 23:59 Sunday 2nd December UTC
(which is three full days from midnight tonight)

        Andy

http://wiki.apache.org/incubator/MarmottaProposal

-----------------------

== Abstract

Marmotta is a Linked Data platform for industry-strength installations.

== Proposal

The goal of Apache Marmotta is to provide an open implementation of a Linked Data Platform that can be used, extended, and deployed easily by organizations who want to publish Linked Data or build custom applications on Linked Data.

The phrase "Linked Data" is used here idiosyncratically to refer to a data integration paradigm across the Web. The term was coined by Tim Berners-Lee in 2006, and it is based on four very simple principles which describe recommended best practices for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web using URIs and the RDF technology stack. Linked Data is therefore about using the Web to connect related data that wasn't previously linked, or using the Web to lower the barriers to linking data currently linked using other methods.

Marmotta will follow the core recommendations of the W3C on RDF, SPARQL and Linked Data publishing, particularly the emerging Linked Data Platform (LDP) recommendation. It will also offer extensions for frequently needed additional functionality such as Linked Data Querying, WebID, WebACL, Reasoning, and Versioning. Marmotta aims to cover both Linked Open Data and Enterprise Linked Data scenarios, providing facilities to deal with different data sources and requirements (small data/big data, open access/restricted access, etc.).


== Background

The Semantic Web isn't just about putting data on the web. It is about making links, so that a person or machine can explore the web of data. Moreover, the Web has quickly evolved to a Read-Write paradigm, and Linked Data technologies have too. Marmotta will address this challenge and offer a common infrastructure for organizations working in this area.

Marmotta comes as a continuation of the work in the Linked Media Framework (aka LMF) project. LMF is an easy-to-setup server application that bundles central Semantic Web technologies to offer some advanced services. The Linked Media Framework consists of LMF Core, which provides a Read-Write Linked Data server, plus some modules that complement the server with added capabilities such as SPARQL 1.1, LDPath, LDCache, Reasoning, Versioning, etc. In addition, LMF provides a Client Library, currently available in Java, PHP, and Javascript, as a convenient API abstraction around the LMF web services. Currently LMF integrates with other relevant tools (Apache Stanbol, Google Refine or Drupal) to cover a wider range of use cases and needs.


== Rationale

Linked Data technologies are now at a turning point from mostly research projects to industrial applications, and a lot of standardisation is currently in progress. Industrial applications require a reliable and scalable infrastructure that follows and helps define a standard way of publishing and consuming Linked Data on the Web. The proposers have a strong background in building such applications and have invested considerable effort in recent years in building an initial version of such a platform (the “Linked Media Framework” or “LMF”). Starting from this solid base, we strongly believe that Apache is the right environment to open the development of this project to a wider community.

Marmotta has the potential to be a reference implementation, and Apache provides a better environment for a collaborative development effort. Thanks to its well-established governance model, based on meritocracy, and its handling of IP/legal issues, people from different organizations can more easily contribute to the project. This will help unify the efforts of people implementing the Linked Data Platform specification and other Semantic Web standards. In addition, it would considerably help organizations in adopting Linked Data technologies and would provide a solid base for further research activities in the community.


== Initial Goals

* Foster the use of Semantic Web Technologies in industry

* Provide an open source and community-driven implementation of a Linked Data Platform and related Semantic Web standards, mainly LDP 1.0 Draft and SPARQL 1.1

* Move the existing LMF source from the current Google Code page to the Apache infrastructure

* Remove LMF extensions that are not relevant for a core Linked Data platform (e.g. semantic search and content enhancement)

* Define a pluggable architecture for providing a data governance framework for enterprise legacy sources

* Revise the architecture, moving to a non-proprietary RDF API (Sesame or Jena) and deciding whether to move to OSGi/Felix or stay with CDI/JavaEE as SOA framework

* Identify and replace dependencies with an incompatible license (e.g. replace XOM with JDOM)


== Current Status

The source for the current LMF is a stable software artifact that, having emerged from research circles, already has a significant number of real-world installations, e.g. Red Bull Media House, Salzburger Nachrichten, derStandard.at, etc.


== Meritocracy

LMF is the outcome of a number of research projects coordinated by, or with the participation of, Salzburg Research during the last five years. The original developers are still part of the core development team, while at the same time many new committers have joined the team. By taking this step we have made it clear to our community that, going forward, the community, rather than a single organization, will determine the future of Marmotta.

Meritocracy is inherent in the research community we come from, and since Apache Marmotta aims to be a unifying project for this community, it is only natural to continue this approach.


== Community

Marmotta addresses two target communities: on the one hand, researchers and developers who are working with Semantic Web technologies; on the other hand, companies or organizations that require Semantic Web infrastructure. The initial committers are active participants in both communities.


== Core Developers

Sebastian Schaffert (sebastian dot schaffert at salzburgresearch dot at)
Thomas Kurz (thomas dot kurz at salzburgresearch dot at)
Jakob Frank (jakob dot frank at salzburgresearch dot at)
Dietmar Glachs (dietmar dot glachs at salzburgresearch dot at)
Sergio Fernández (sergio dot fernandez at salzburgresearch dot at)

== Alignment

Marmotta complements and integrates well with the current landscape of Apache projects, especially with the emerging “semantic technologies” cluster within the ASF. Concretely, Marmotta will align with the following projects:

* Apache Commons (lang, logging, http and so on) is extensively used in many parts of the project

* Apache Tomcat is currently the primary platform for deployment; with Marmotta, Tomcat can be turned into a Linked Data server

* Apache Stanbol will very likely adopt parts of the Marmotta infrastructure, particularly for implementing the entity hub and for exposing the RDF data as Linked Data

* Apache Jena could become the RDF API used throughout Marmotta; an architectural decision is yet to be taken

* Apache Any23 could be integrated in the LMF as a wrapper around non-RDF data sources to consume them as Linked Data; a similar approach has already been taken by the LMF

* Apache Tika could be used for metadata extraction of content

* Apache Karaf and Apache Felix could become the OSGi container for running and configuring the Marmotta components

In addition to these more-or-less concrete proposals, there are some options that still require strategic decisions. For example, it may make sense to build a storage backend based on Apache Hadoop for large-scale installations using HBase (e.g. jena grande, h2rdf, hdrs, hadoop rdf). Several extensions also build on existing Apache projects, most importantly the LMF Semantic Search component, which offers semantic search over Linked Data resources.


== Known Risks

Probably one of the major risks is not being able to engage the community in addressing the new challenges. Knowing this, we will do our best to provide the facilities needed to attract new developers and organizations. In particular, we will try to actively engage developers from the Linked Data community through our networks.


== Orphaned Products

The current project is part of the business portfolio and a strategic project of the contributor organization, and will continue in that way, so there is no risk of any of the usual warning signs of orphaned or abandoned code.


== Inexperience with Open Source

The committers have extensive experience with open source development and communities. Several of the key committers have been actively involved in Open Source projects for 10-15 years or more. The initial code base of Marmotta has already been developed as an Open Source project over the last 5 years.


== Homogenous Developers

Because we are aware that the initial list of committers is not ideal in the long term, there is a strong commitment to broaden the project by creating a much more diverse development team. Part of the reason to enter the Apache incubation process is to open up the development to more interested participants.


== Reliance on Salaried Developers

Right now most or all of that work is salaried, but the developers identify strongly with the project. When opening up the development using Apache as a platform, we expect that future development will occur on both salaried and volunteer time, particularly by participants from the Linked Data community.


== Relationships with Other Apache Projects

Although the current RDF/SPARQL support in LMF is built on top of the OpenRDF Sesame API, Marmotta is closely related to many Apache projects, such as Stanbol, Jena and Any23. See “Alignment” above.


== An Excessive Fascination with the Apache Brand

While we expect the Apache brand may help attract more contributors, our interest in starting this project is based on the factors mentioned in the Rationale section.


== Documentation

Documentation for the current project can be found at:

    http://lmf.googlecode.com

    http://doc.lmf.googlecode.com/hg/api/index.html

    http://doc.lmf.googlecode.com/hg/rest/index.html

    http://doc.lmf.googlecode.com/hg/client/index.html

== Initial Source

LMF (formerly KiWi) has been developed since 2008. It is important to note that not all of LMF will be contributed to Marmotta, only those parts that make up the "Linked Data Platform" functionality (Linked Data Server, RDF Store, SPARQL, LDCache, Versioning, Reasoner and LDPath). The idea is to focus Marmotta on the core needs, keeping all surrounding functionality (basically the media-related modules and Semantic Search) out of the initial scope, although the community will ultimately decide which modules are relevant. Since LMF is a very modular software artifact, it will be fairly easy to partition it in this way to kick off Marmotta.

The current source code can be found at Google Code: http://lmf.googlecode.com


== Source and Intellectual Property Submission Plan

Salzburg Research Forschungsgesellschaft mbH is the sole copyright owner of the initial code to be contributed, so there should not be any problem with the standard IP clearance process. The current licence is already Apache Software License 2.0.


== External Dependencies

Most of the current dependencies should have Apache-compatible licenses, including BSD, CDDL, CPL, MPL and MIT licensed dependencies. We are aware of some incompatible licenses right now, but we will work to solve this issue. See Appendix A for a detailed list of dependencies.


== Cryptography

Does Not Apply.

== Required Resources

Mailing lists

    marmotta-dev
    marmotta-commits
    marmotta-users

Repository

    git://git.apache.org/marmotta.git

Issue Tracking

    Jira: MARMOTTA (Kanban board enabled at GreenHopper)

Other Resources

    Jenkins/Hudson for builds and test running.
    Wiki for internal documentation purposes
    Blog to improve the project dissemination

== Initial Committers

Sebastian Schaffert
   (sebastian dot schaffert at salzburgresearch dot at)
Thomas Kurz
   (thomas dot kurz at salzburgresearch dot at)
Jakob Frank
   (jakob dot frank at salzburgresearch dot at)
Dietmar Glachs
   (dietmar dot glachs at salzburgresearch dot at)
Sergio Fernández
   (sergio dot fernandez at salzburgresearch dot at)
Rupert Westenthaler
   (rwesten at apache dot org)

== Affiliations

All initial committers are currently affiliated with Salzburg Research Forschungsgesellschaft mbH.


== Sponsors

= Champion

    Andy Seaborne (andy at apache dot org)

= Nominated Mentors

    Fabian Christ (fchrist at apache dot org)
    Nandana Mihindukulasooriya (nandana at apache dot org)
    Andy Seaborne (andy at apache dot org)

= Sponsoring Entity

Apache Incubator PMC

