Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-11 Thread Andy Seaborne



On 10/11/10 23:28, Jeremy Carroll wrote:

- The core API for (mutable) graphs in:
http://incubator.apache.org/clerezza/mvn-site/org.apache.clerezza.rdf.core/apidocs/index.html


http://incubator.apache.org/clerezza/mvn-site/org.apache.clerezza.rdf.core/apidocs/org/apache/clerezza/rdf/core/TripleCollection.html#filter(org.apache.clerezza.rdf.core.NonLiteral,%20org.apache.clerezza.rdf.core.UriRef,%20org.apache.clerezza.rdf.core.Resource)


IteratorTriple filter(NonLiteral subject, UriRef predicate, Resource
object)

vs

http://jena.sourceforge.net/javadoc/com/hp/hpl/jena/graph/Graph.html#find(com.hp.hpl.jena.graph.Node,%20com.hp.hpl.jena.graph.Node,%20com.hp.hpl.jena.graph.Node)


ExtendedIteratorTriple find(Node s, Node p, Node o)

seems to be the fundamental choice.

The latter was the choice Chris Dollin and I made in 2002/2003 and I
still find it preferable, for program uniformity, to the closer to the
spec choice in Clerezza.
We were writing the spec at the same time, and I always saw it as a
description of a Web exchange format, and not of a programming interface
(for instance implementing RDF Semantics Rec is hard with the Clerezza
interface).


Isn't the model interface operation a more appropriate comparision 
because that is what the application sees?


StmtIterator listStatements(Resource s, Property p, RDFNode o)

Graph.find is the SPI interface to storage. The Graph level has named 
variables, not just RDF terms.  SPARQL uses this, heavily.


In SPARQL, literals can occur in any position during query processing. 
Patterns involving literals as subjects, or as predicates, just simply 
don't match the data (section 12.3.1).


Once upon a time, when we were going Jena1-Jena2, the idea was that the 
application API was just one presentation.  There could be other RDF 
APIs over the SPI.  There's not been a second RDF presentation API but 
the design concept was there and still is.  All the interfaces in the 
API are mainly implemented only once, and I'm not aware of any users 
which use the extensibility within the Resource API anymore 
(Parliament/BBN used to - I think they now use an associated 
datastructure to map to internal information for any API 
resources/literals from their storage).  The Resource-level API 
implementation could be simplified if theer is only one implementation 
of that presentation.  There is generality in Jena that we thought was a 
good idea at the time but looking at way the world has gone since, not 
all of it is used or useful nowadays.  Better use of factory/interface 
at the SPI would be more helpful. The experimental Jena3 core also has 
extension nodes and graph nodes with an eye to future possible needs 
from the standards world.



I am not quite sure what that means in terms of this discussion which is
more procedural than technical.


Same here.

Advice on an appropriate discussion forum welcome - while we're in a 
state of exploring the relationships of projects, I'm guessing that here 
is the only common place there is.


Andy

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-11 Thread Bertrand Delacretaz
Hi,

On Thu, Nov 11, 2010 at 11:21 AM, Andy Seaborne
andy.seabo...@epimorphics.com wrote:
 ...Advice on an appropriate discussion forum welcome - while we're in a state
 of exploring the relationships of projects, I'm guessing that here is the
 only common place there is...

Best might be to move this discussion to the clerezza dev list for now?

Email clerezza-dev-subscr...@incubator.apache.org to subscribe -
several Stanbol developers are subscribed AFAIK.

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-10 Thread Jeremy Carroll

- The core API for (mutable) graphs in:
http://incubator.apache.org/clerezza/mvn-site/org.apache.clerezza.rdf.core/apidocs/index.html

http://incubator.apache.org/clerezza/mvn-site/org.apache.clerezza.rdf.core/apidocs/org/apache/clerezza/rdf/core/TripleCollection.html#filter(org.apache.clerezza.rdf.core.NonLiteral,%20org.apache.clerezza.rdf.core.UriRef,%20org.apache.clerezza.rdf.core.Resource)

IteratorTriple filter(NonLiteral subject, UriRef predicate, 
Resource object)


vs

http://jena.sourceforge.net/javadoc/com/hp/hpl/jena/graph/Graph.html#find(com.hp.hpl.jena.graph.Node,%20com.hp.hpl.jena.graph.Node,%20com.hp.hpl.jena.graph.Node)

ExtendedIteratorTriple find(Node s, Node p, Node o)

seems to be the fundamental choice.

The latter was the choice Chris Dollin and I made in 2002/2003 and I 
still find it preferable, for program uniformity, to the closer to the 
spec choice in Clerezza.
We were writing the spec at the same time, and I always saw it as a 
description of a Web exchange format, and not of a programming interface 
(for instance implementing RDF Semantics Rec is  hard with the Clerezza 
interface).


I am not quite sure what that means in terms of this discussion which is 
more procedural than technical.
Like in all things people make different choices and have different 
preferences, and a decision to all use the same libraries would be a 
restriction in design freedom, on such issues, which might be good, or 
might be bad.


===

On
http://mail-archives.apache.org/mod_mbox/incubator-clerezza-dev/201011.mbox/%3caanlktinwbvruoemfhh8ohdvvesgm09z0affgserbw...@mail.gmail.com%3e
[[

- graph isomorphism code
]]

what are the goals of the Clerezza isomorphism code? The Jena code is 
essentially scoped to testing, so that I checked that small pathological 
cases were OK, and larger non-pathological cases, but it is not meant to 
have production level performance, particular on graphs for which 
something like nauty would be more appropriate.


Jeremy





-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-09 Thread Paolo Castagna

Olivier Grisel wrote:

On 8 November 2010 15:54, Bertrand Delacretaz bdelacre...@apache.org wrote:

Hi,

I'm following up on discussions here regarding relationships between
the incubating Clerezza podling and incoming Stanbol and Jena podlings
(see proposals at http://wiki.apache.org/incubator).

Do people agree with the following structure?

1. Clerezza, Stanbol and Jena are independent podlings, each aiming
for top-level status


There is a depedency relationship:

- Stanbol is a of application level HTTP services and set of OSGi
components that use:
- Clerezza as an OSGi service provider which in turn is using:
- Jena as a lib for parsing and serializing RDF models and as a SPARQL
enabled triple store.


These are, at the moment, the Jena classes used by some of the
Clerezza modules:

com.hp.hpl.jena.datatypes.TypeMapper
com.hp.hpl.jena.datatypes.xsd.impl.XMLLiteralType
com.hp.hpl.jena.graph.Graph
com.hp.hpl.jena.graph.impl.GraphBase
com.hp.hpl.jena.graph.Node
com.hp.hpl.jena.graph.Node_URI
com.hp.hpl.jena.graph.Reifier
com.hp.hpl.jena.graph.Triple
com.hp.hpl.jena.graph.TripleMatch
com.hp.hpl.jena.mem.TrackingTripleIterator
com.hp.hpl.jena.query.Dataset
com.hp.hpl.jena.query.QueryExecException
com.hp.hpl.jena.query.QueryExecution
com.hp.hpl.jena.query.QueryExecutionFactory
com.hp.hpl.jena.query.QueryFactory
com.hp.hpl.jena.query.QuerySolution
com.hp.hpl.jena.query.ResultSet
com.hp.hpl.jena.rdf.model.impl.NodeIteratorImpl
com.hp.hpl.jena.rdf.model.Literal
com.hp.hpl.jena.rdf.model.Model
com.hp.hpl.jena.rdf.model.ModelFactory
com.hp.hpl.jena.rdf.model.RDFNode
com.hp.hpl.jena.rdf.model.RDFWriter
com.hp.hpl.jena.rdf.model.Resource
com.hp.hpl.jena.rdf.model.RSIterator
com.hp.hpl.jena.rdf.model.Statement
com.hp.hpl.jena.rdf.model.StmtIterator
com.hp.hpl.jena.shared.AlreadyReifiedException
com.hp.hpl.jena.shared.Lock
com.hp.hpl.jena.shared.ReificationStyle
com.hp.hpl.jena.sparql.core.DatasetGraph
com.hp.hpl.jena.sparql.core.Quad
com.hp.hpl.jena.sparql.util.Context
com.hp.hpl.jena.tdb.TDB
com.hp.hpl.jena.tdb.TDBFactory
com.hp.hpl.jena.util.iterator.ExtendedIterator
com.hp.hpl.jena.util.iterator.Filter
com.hp.hpl.jena.util.iterator.NullIterator
com.hp.hpl.jena.vocabulary.DC
com.hp.hpl.jena.vocabulary.RDF
com.hp.hpl.jena.vocabulary.RDFS


Reto Bachmann-Gmür wrote (on jena-devel):
 In Clerezza Jena is used in the implementation of some modules:
 - Jena Based Serializers and Parsers
 - Jena Sparql
 - Jena TDB Based storage
 - Implementation of the Jena API on top of Clerezza Graphs (jena.facade)

 Apart for the last one the modules are independent of Jena in terms of
 the API they expose. Clerezza can be used without any of these modules
 but if one of these are used Clerezza needs Bundles for the whole of
 Jena, it would be nice if the various modules could depend only on the
 parts of Jena that are actually needed by them.


Paolo



The dependency between Clerezza and Jena is optional as many OSGi
services in Clerezza can also work with sesame as an alternative RDF
lib and triple store.


2. A Semantic Commons area is created for common code between these
(and other) projects. Details to be discussed, this does probably not
warrant a separate Apache project, but might be managed by Clerezza,
as they were here first.


Ok on the principle but I would suggest not to create this as long we
don't have any code that does not belong to one of the three previous
projects. Developers of the upstream project should try to take care
an not reinvent the wheel and try to move new features and bugfix to
upstream if they naturally belong there.


3. It is expected that those projects will have a number of committers
in common, as there are many collaboration possibilities.


+1



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-09 Thread Reto Bachmann-Gmuer
Hi Jeremy

One of Clerezza aims was to use an RDF api that is maximally close to RDF
abstract syntax and semantics, on this RDF core api we have different
façades and utilities as well as a frontend adapter implementing the jena
API. Related standards like SPARQL and the various serialization formats are
supported as well, respective engines can be added at runtime (when running
in a OSGI container). We decided to design our own API as we found the
various API available (jena, openrdf, rdf2go) would neither be as modular
nor as close to the spec as we wanted them to be. The API comes with the
typical utilities like a command line tool and a maven plugin for the
transformation of vocabularies into classes

Apart from core part tightly coupled to RDF and related specs Clerezza also
provides a framework for implementing rest applications (JAX-RS). The
encourages design pattern is that requests are answered in terms of RDF
(i.e. a graph and typically a selected resource within this graph), clerezza
takes care about content-negotiation and for RDF formats the serializer
registered for that media type is used. For non RDF formats a template
(typically a Scala Server Pages) is selected and takes care of the
rendering.

I described this parts of Clerezza because they seem to be quite close to
what you suggest for commons. As it is hard to share utilities without
having shared APIs for the core stuff our code deals with I think some
efforts in this area could have the greatest benefit.

If you have some time, I would like to encourage feedback on the respective
APIs as currently used in Clerezza

- The core API for (mutable) graphs in:
http://incubator.apache.org/clerezza/mvn-site/org.apache.clerezza.rdf.core/apidocs/index.html
- Utilities (including resource-centric API):
http://incubator.apache.org/clerezza/mvn-site/org.apache.clerezza.rdf.utils/apidocs/index.html

These two layers are similar to the Graph/Model separation in Jena.

Cheers,
Reto



On Mon, Nov 8, 2010 at 7:32 PM, Jeremy Carroll jer...@topquadrant.comwrote:

 To make the commons discussion more concrete I would suggest the following
  items for the commons:

 - an IRI library
 - some code to do with vocabularies.
 - connecting to a URL and doing semweb aware content negotiation (this is
 typically done badly)

 (Actually the IRI code should probably be wider, Jena initially used the
 xerces URI code but then the needs exceeded what they supported)

 Jeremy




 On 11/8/2010 7:22 AM, Ian Dickinson wrote:

 On 08/11/10 15:09, Mattmann, Chris A (388J) wrote:

 Wow! Jena is proposing to come to Apache??!

 Yep, the proposal has been under discussion for some time within the
 project, and Ross is now taking the lead in bringing it publicly into
 the incubator process.

 Bertrand wrote:

 1. Clerezza, Stanbol and Jena are independent podlings, each aiming
 for top-level status

 2. A Semantic Commons area is created for common code between these
 (and other) projects. Details to be discussed, this does probably not
 warrant a separate Apache project, but might be managed by Clerezza,
 as they were here first.

 3. It is expected that those projects will have a number of committers
 in common, as there are many collaboration possibilities.

 As an Apache newbie it's hard to comment definitively, but it's not
 clear to me what the common needs of the projects might be, and how
 dependencies would handled. Are there examples of existing commons
 areas between Apache projects, other than basic application-library
 dependencies?


 Ian




 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org




Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-09 Thread Andy Seaborne
I haven't understood yet the relationship of Stanbol and Clerezza to be 
able to say anything about how a commons area between those two systems 
might work.  In terms of direct dependencies, does Stanbol just directly 
depend on Clerezza and only indirectly on Sesame, Jena rdf2go and the 
OWLAPI that the proposal lists?


Jena has two levels of API, the Model/Statement/Resource that 
applications usually use, and the SPI Graph/Triple/Node thathe the 
storage and inference layers plug into.  Roughly, speaking, the API 
faces upwards to applications and the SPI downwards to components. 
There is also the same design for the support for RDF datasets for 
SPARQL.  SPARQL, especially SPARQL Update, works on quads for named graphs.


There are significant investments by users in uses of both API and SPI 
and the cost of change for users isn't trivial.  Worse, the SPI is used 
by systems that are distributed on making any change over messy. There 
needs to be a real advantage to change.


Speaking personally, I'm not strongly motivated by a common API; I don't 
think it helps the semantic web enough compared with other things I 
could work on.  If others are working on one, I'll do what I can to make 
my contributions work as smoothly as possible with that through 
discussing what hooks and flexibility I can contribute to Jena to make 
it work smoothly with other systems.



I'm not particularly worried about there being several APIs since that's 
the state we're in and I don't see it changing particularly quickly 
because there are large investments in deployed systems that exists.  As 
some of these systems themselves are distributed onwards, chnage woudl 
be slow.


Most of my contribution is storage and query which works at the lower 
level anyway, so it has less effect on me but the SPARQL related parts 
of the public APIs are mostly my fault. I am currently working on a 
SPARQL 1.1 compliant server - the on-the-wire standardization is more 
important.  Working on storage and query does result in a slight 
obsession with efficiency - storage and query need to be tightly coupled 
for performance.


Jeremy identified the IRI library as a potential contribution to a 
commons area.  It is free-standing, and does not use or call any Jena 
RDF code - it depends only on ICU4J (and JUnit + Jflex in the build).


The client-side content negotiation code in Jena could do with an 
upgrade so if there is system-independent code that can be reused for 
the systems here and more widely, that would be great.  I would use it 
as I need to provide code-level access to for client SPARQL 1.1 access.


If nothing else, shared knowledge and expertise would move things along 
faster, and even if functionality needs tying to specific systems, 
having all the projects in Apache greatly helps community.


Andy

PS Reto - GVS [*] isn't on the list of thing to contribute to Apache as 
we're focusing on the core system that is used and we support.  But that 
does not preclude including GVS either now or later.  It is covered by 
our agreement-in-principle with HP.  Do you want to do that?


[*] Reto wrote a graph versioning system for Jena several years ago.

On 08/11/10 21:39, Reto Bachmann-Gmuer wrote:

Hi Jeremy

One of Clerezza aims was to use an RDF api that is maximally close to RDF
abstract syntax and semantics, on this RDF core api we have different
façades and utilities as well as a frontend adapter implementing the jena
API. Related standards like SPARQL and the various serialization formats are
supported as well, respective engines can be added at runtime (when running
in a OSGI container). We decided to design our own API as we found the
various API available (jena, openrdf, rdf2go) would neither be as modular
nor as close to the spec as we wanted them to be. The API comes with the
typical utilities like a command line tool and a maven plugin for the
transformation of vocabularies into classes

Apart from core part tightly coupled to RDF and related specs Clerezza also
provides a framework for implementing rest applications (JAX-RS). The
encourages design pattern is that requests are answered in terms of RDF
(i.e. a graph and typically a selected resource within this graph), clerezza
takes care about content-negotiation and for RDF formats the serializer
registered for that media type is used. For non RDF formats a template
(typically a Scala Server Pages) is selected and takes care of the
rendering.

I described this parts of Clerezza because they seem to be quite close to
what you suggest for commons. As it is hard to share utilities without
having shared APIs for the core stuff our code deals with I think some
efforts in this area could have the greatest benefit.

If you have some time, I would like to encourage feedback on the respective
APIs as currently used in Clerezza

- The core API for (mutable) graphs in:
http://incubator.apache.org/clerezza/mvn-site/org.apache.clerezza.rdf.core/apidocs/index.html
- 

Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-09 Thread Florent Guillaume
On Tue, Nov 9, 2010 at 1:19 PM, Andy Seaborne
andy.seabo...@epimorphics.com wrote:
 Jeremy identified the IRI library as a potential contribution to a commons
 area.  It is free-standing, and does not use or call any Jena RDF code - it
 depends only on ICU4J (and JUnit + Jflex in the build).

Please note that Abdera already has an IRI library.
http://svn.apache.org/repos/asf/abdera/java/trunk/dependencies/i18n/src/main/java/org/apache/abdera/i18n/iri/

Florent

-- 
Florent Guillaume, Director of RD, Nuxeo
Open Source, Java EE based, Enterprise Content Management (ECM)
http://www.nuxeo.com   http://www.nuxeo.org   +33 1 40 33 79 87

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-09 Thread Andy Seaborne



On 09/11/10 16:22, Florent Guillaume wrote:

On Tue, Nov 9, 2010 at 1:19 PM, Andy Seaborne
andy.seabo...@epimorphics.com  wrote:

Jeremy identified the IRI library as a potential contribution to a commons
area.  It is free-standing, and does not use or call any Jena RDF code - it
depends only on ICU4J (and JUnit + Jflex in the build).


Please note that Abdera already has an IRI library.
http://svn.apache.org/repos/asf/abdera/java/trunk/dependencies/i18n/src/main/java/org/apache/abdera/i18n/iri/


Florent,

Thanks for pointing that out.  I see it has a test suite as well and it 
would be good to make sure we've got things right.


Illegal IRIs in data have been a bit of a plague in RDF data and the IRI 
library (written by Jeremy) is a response to that.  It checks both rules 
for specific IRI schemes and also recommended forms as IRIs are often 
com pared for equality.  The library is quite picky.  It includes 
profiles for RDF URI references, IRI and the compromise we use in Jena 
as a balance of legacy and spec exactness.


There is an online test service for RDF data in non-RDF/XML formats at:

http://sparql.org/data-validator.html

The IRIs are checked with the IRI library.

Andy

A few examples:

http://example/a b

Code: 17/WHITESPACE in PATH: A single whitespace character. These match 
no grammar rules of URIs/IRIs. These characters are permitted in RDF URI 
References, XML system identifiers, and XML Schema anyURIs.


http://example/a[]b

Code: 0/ILLEGAL_CHARACTER in PATH: The character violates the grammar 
rules for URIs/IRIs.


http://example:80/

Code: 13/DEFAULT_PORT_SHOULD_BE_OMITTED in PORT: If the port is the 
default one for the scheme it should be omitted.
http://example:80/ Code: 14/PORT_SHOULD_NOT_BE_WELL_KNOWN in PORT: 
Ports under 1024 should be accessed using the appropriate scheme name


urn:xyz

Code: 61/SCHEME_PATTERN_MATCH_FAILED in PATH: The scheme specific syntax 
rules are violated.


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-09 Thread Jeremy Carroll

On 11/9/2010 8:22 AM, Florent Guillaume wrote:

On Tue, Nov 9, 2010 at 1:19 PM, Andy Seaborne
andy.seabo...@epimorphics.com  wrote:

Jeremy identified the IRI library as a potential contribution to a commons
area.  It is free-standing, and does not use or call any Jena RDF code - it
depends only on ICU4J (and JUnit + Jflex in the build).

Please note that Abdera already has an IRI library.
http://svn.apache.org/repos/asf/abdera/java/trunk/dependencies/i18n/src/main/java/org/apache/abdera/i18n/iri/

Florent



I'll take a look, it may take a little time.

Jeremy



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-09 Thread Jeremy Carroll
I could be accused of having gone overboard ... each of the slightly 
different specs as an explicit representation in the code ...
Having changed job and looking at this from a different perspective I am 
less convinced by the pickiness.
There is a largely unrealized goal in the IRI code to link errors right 
back to the specs so that the error messages quote chapter and verse.


Jeremy


On 11/9/2010 8:52 AM, Andy Seaborne wrote:



On 09/11/10 16:22, Florent Guillaume wrote:

On Tue, Nov 9, 2010 at 1:19 PM, Andy Seaborne
andy.seabo...@epimorphics.com  wrote:
Jeremy identified the IRI library as a potential contribution to a 
commons
area.  It is free-standing, and does not use or call any Jena RDF 
code - it

depends only on ICU4J (and JUnit + Jflex in the build).


Please note that Abdera already has an IRI library.
http://svn.apache.org/repos/asf/abdera/java/trunk/dependencies/i18n/src/main/java/org/apache/abdera/i18n/iri/ 



Florent,

Thanks for pointing that out.  I see it has a test suite as well and 
it would be good to make sure we've got things right.


Illegal IRIs in data have been a bit of a plague in RDF data and the 
IRI library (written by Jeremy) is a response to that.  It checks both 
rules for specific IRI schemes and also recommended forms as IRIs are 
often com pared for equality.  The library is quite picky.  It 
includes profiles for RDF URI references, IRI and the compromise we 
use in Jena as a balance of legacy and spec exactness.


There is an online test service for RDF data in non-RDF/XML formats at:

http://sparql.org/data-validator.html

The IRIs are checked with the IRI library.

Andy

A few examples:

http://example/a b

Code: 17/WHITESPACE in PATH: A single whitespace character. These 
match no grammar rules of URIs/IRIs. These characters are permitted in 
RDF URI References, XML system identifiers, and XML Schema anyURIs.


http://example/a[]b

Code: 0/ILLEGAL_CHARACTER in PATH: The character violates the grammar 
rules for URIs/IRIs.


http://example:80/

Code: 13/DEFAULT_PORT_SHOULD_BE_OMITTED in PORT: If the port is the 
default one for the scheme it should be omitted.
http://example:80/ Code: 14/PORT_SHOULD_NOT_BE_WELL_KNOWN in PORT: 
Ports under 1024 should be accessed using the appropriate scheme name


urn:xyz

Code: 61/SCHEME_PATTERN_MATCH_FAILED in PATH: The scheme specific 
syntax rules are violated.


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org




-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-09 Thread Andy Seaborne



On 09/11/10 16:59, Jeremy Carroll wrote:

I could be accused of having gone overboard ...


yes ... :-)


Having changed job and looking at this from a different perspective I am
less convinced by the pickiness.


At least leave the picky mode there, please.  It's been very helpful in 
working with data generated by one organisation and used by another. 
Script generated IRIs have been


Once dubious data gets into a database, it's troublesome to get it out. 
Having the data storage only cover the required spec common ground is 
better.


TDB uses the IRI profile (not RDF URI references) as strict as possible 
except that these two warnings are switched off:


ViolationCodes.UNWISE_CHARACTER
  that is {, }, |, \, ^, `
  It's the | that I found used in generated IRIs.

Whitespace is handled separately.
  are syntax errors.

ViolationCodes.UNREGISTERED_IANA_SCHEME
  because people invent schemes for data.

I did find the most strict mode a bit too strict :-)

Andy

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-09 Thread Bertrand Delacretaz
On Mon, Nov 8, 2010 at 3:54 PM, Bertrand Delacretaz
bdelacre...@apache.org wrote:
 ...I'm following up on discussions here regarding relationships between
 the incubating Clerezza podling and incoming Stanbol and Jena podlings...

Thanks everybody for your replies, I think they show that the proposed
semantic commons might make sense, but we shouldn't rush things until
there's actually something to put in there.

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-09 Thread Tommaso Teofili
2010/11/8 Reto Bachmann-Gmuer reto.bachm...@trialox.org


 I described this parts of Clerezza because they seem to be quite close to
 what you suggest for commons. As it is hard to share utilities without
 having shared APIs for the core stuff our code deals with I think some
 efforts in this area could have the greatest benefit.


+1

Also in Clerezza there are some ontologies [1] defined which would be good
to share in a semantic repository for ontologies. I think especially
Clerezza and Stanbol would benefit from such a common repo. This, in my
opinion, would fit well into semantic commons.

Tommaso

[1] :
http://svn.apache.org/repos/asf/incubator/clerezza/trunk/org.apache.clerezza.parent/org.apache.clerezza.rdf.ontologies/src/main/resources/org/apache/clerezza/rdf/ontologies/


Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Bertrand Delacretaz
Hi,

I'm following up on discussions here regarding relationships between
the incubating Clerezza podling and incoming Stanbol and Jena podlings
(see proposals at http://wiki.apache.org/incubator).

Do people agree with the following structure?

1. Clerezza, Stanbol and Jena are independent podlings, each aiming
for top-level status

2. A Semantic Commons area is created for common code between these
(and other) projects. Details to be discussed, this does probably not
warrant a separate Apache project, but might be managed by Clerezza,
as they were here first.

3. It is expected that those projects will have a number of committers
in common, as there are many collaboration possibilities.

WDYT?

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Ross Gardler
As champion for Jena I agree in principle. I can't speak for the Jena team if 
course, but representatives are tracking this list and have started discussing 
in their project list. 

I will be making the formal Jena proposal here very soon (just as soon as I'm 
not limited to the iPhone) - its in the wiki if people want a preview. 

Ross



Sent from my mobile device.

On 8 Nov 2010, at 14:54, Bertrand Delacretaz bdelacre...@apache.org wrote:

 Hi,
 
 I'm following up on discussions here regarding relationships between
 the incubating Clerezza podling and incoming Stanbol and Jena podlings
 (see proposals at http://wiki.apache.org/incubator).
 
 Do people agree with the following structure?
 
 1. Clerezza, Stanbol and Jena are independent podlings, each aiming
 for top-level status
 
 2. A Semantic Commons area is created for common code between these
 (and other) projects. Details to be discussed, this does probably not
 warrant a separate Apache project, but might be managed by Clerezza,
 as they were here first.
 
 3. It is expected that those projects will have a number of committers
 in common, as there are many collaboration possibilities.
 
 WDYT?
 
 -Bertrand
 
 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org
 

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Mattmann, Chris A (388J)
Wow! Jena is proposing to come to Apache??! Sweet! We've used it in OODT since 
2003...looking forward to it!

Cheers,
Chris


On 11/8/10 7:04 AM, Ross Gardler rgard...@apache.org wrote:

As champion for Jena I agree in principle. I can't speak for the Jena team if 
course, but representatives are tracking this list and have started discussing 
in their project list.

I will be making the formal Jena proposal here very soon (just as soon as I'm 
not limited to the iPhone) - its in the wiki if people want a preview.

Ross



Sent from my mobile device.

On 8 Nov 2010, at 14:54, Bertrand Delacretaz bdelacre...@apache.org wrote:

 Hi,

 I'm following up on discussions here regarding relationships between
 the incubating Clerezza podling and incoming Stanbol and Jena podlings
 (see proposals at http://wiki.apache.org/incubator).

 Do people agree with the following structure?

 1. Clerezza, Stanbol and Jena are independent podlings, each aiming
 for top-level status

 2. A Semantic Commons area is created for common code between these
 (and other) projects. Details to be discussed, this does probably not
 warrant a separate Apache project, but might be managed by Clerezza,
 as they were here first.

 3. It is expected that those projects will have a number of committers
 in common, as there are many collaboration possibilities.

 WDYT?

 -Bertrand

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org




++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.mattm...@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Olivier Grisel
On 8 November 2010 15:54, Bertrand Delacretaz bdelacre...@apache.org wrote:
 Hi,

 I'm following up on discussions here regarding relationships between
 the incubating Clerezza podling and incoming Stanbol and Jena podlings
 (see proposals at http://wiki.apache.org/incubator).

 Do people agree with the following structure?

 1. Clerezza, Stanbol and Jena are independent podlings, each aiming
 for top-level status

There is a depedency relationship:

- Stanbol is a of application level HTTP services and set of OSGi
components that use:
- Clerezza as an OSGi service provider which in turn is using:
- Jena as a lib for parsing and serializing RDF models and as a SPARQL
enabled triple store.

The dependency between Clerezza and Jena is optional as many OSGi
services in Clerezza can also work with sesame as an alternative RDF
lib and triple store.

 2. A Semantic Commons area is created for common code between these
 (and other) projects. Details to be discussed, this does probably not
 warrant a separate Apache project, but might be managed by Clerezza,
 as they were here first.

Ok on the principle but I would suggest not to create this as long we
don't have any code that does not belong to one of the three previous
projects. Developers of the upstream project should try to take care
an not reinvent the wheel and try to move new features and bugfix to
upstream if they naturally belong there.

 3. It is expected that those projects will have a number of committers
 in common, as there are many collaboration possibilities.

+1

-- 
Olivier

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Bertrand Delacretaz
On Mon, Nov 8, 2010 at 4:14 PM, Olivier Grisel ogri...@nuxeo.com wrote:
 On 8 November 2010 15:54, Bertrand Delacretaz bdelacre...@apache.org wrote:
... 1. Clerezza, Stanbol and Jena are independent podlings, each aiming
 for top-level status

 There is a depedency relationship:...

Agreed - I meant independent communities, code dependencies are expected.

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Chris Mattmann
Correction: actually not sure about 2003 but seems like we been using it for 
that long!

Cheers,
Chris 
Sent from my Verizon Wireless BlackBerry

-Original Message-
From: Mattmann, Chris A (388J) chris.a.mattm...@jpl.nasa.gov
Date: Mon, 8 Nov 2010 07:09:13 
To: general@incubator.apache.orggeneral@incubator.apache.org
Reply-To: general@incubator.apache.org general@incubator.apache.org
Subject: Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

Wow! Jena is proposing to come to Apache??! Sweet! We've used it in OODT since 
2003...looking forward to it!

Cheers,
Chris


On 11/8/10 7:04 AM, Ross Gardler rgard...@apache.org wrote:

As champion for Jena I agree in principle. I can't speak for the Jena team if 
course, but representatives are tracking this list and have started discussing 
in their project list.

I will be making the formal Jena proposal here very soon (just as soon as I'm 
not limited to the iPhone) - its in the wiki if people want a preview.

Ross



Sent from my mobile device.

On 8 Nov 2010, at 14:54, Bertrand Delacretaz bdelacre...@apache.org wrote:

 Hi,

 I'm following up on discussions here regarding relationships between
 the incubating Clerezza podling and incoming Stanbol and Jena podlings
 (see proposals at http://wiki.apache.org/incubator).

 Do people agree with the following structure?

 1. Clerezza, Stanbol and Jena are independent podlings, each aiming
 for top-level status

 2. A Semantic Commons area is created for common code between these
 (and other) projects. Details to be discussed, this does probably not
 warrant a separate Apache project, but might be managed by Clerezza,
 as they were here first.

 3. It is expected that those projects will have a number of committers
 in common, as there are many collaboration possibilities.

 WDYT?

 -Bertrand

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org




++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.mattm...@jpl.nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Ian Dickinson
On 08/11/10 15:09, Mattmann, Chris A (388J) wrote:
 Wow! Jena is proposing to come to Apache??! 
Yep, the proposal has been under discussion for some time within the
project, and Ross is now taking the lead in bringing it publicly into
the incubator process.

Bertrand wrote:
 1. Clerezza, Stanbol and Jena are independent podlings, each aiming
 for top-level status

 2. A Semantic Commons area is created for common code between these
 (and other) projects. Details to be discussed, this does probably not
 warrant a separate Apache project, but might be managed by Clerezza,
 as they were here first.

 3. It is expected that those projects will have a number of committers
 in common, as there are many collaboration possibilities.
As an Apache newbie it's hard to comment definitively, but it's not
clear to me what the common needs of the projects might be, and how
dependencies would handled. Are there examples of existing commons
areas between Apache projects, other than basic application-library
dependencies?


Ian


-- 

Ian Dickinson   Epimorphics Ltd, Bristol, UK
mailto:i...@epimorphics.comhttp://www.epimorphics.com
cell: +44-7786-850536  landline: +44-1275-399069

Epimorphics Ltd.  is a limited company registered in England
(no. 7016688). Registered address: Court Lodge, 105 High St,
  Portishead, Bristol BS20 6PT, UK


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Bertrand Delacretaz
Hi Ian,

On Mon, Nov 8, 2010 at 4:22 PM, Ian Dickinson i...@epimorphics.com wrote:
 ...Bertrand wrote:
...
 2. A Semantic Commons area is created for common code between these
 (and other) projects. Details to be discussed, this does probably not
 warrant a separate Apache project, but might be managed by Clerezza,
 as they were here first

 As an Apache newbie it's hard to comment definitively, but it's not
 clear to me what the common needs of the projects might be, and how
 dependencies would handled. Are there examples of existing commons
 areas between Apache projects, other than basic application-library
 dependencies?


The best example is commons.apache.org of course - my idea is that
utilities and libraries that might be useful for several of those new
semantic projects should be shared whenever possible, to avoid
duplication of efforts.

As Olivier notes, it's best to create such a commons area only when we
really need it, there's no need to define things precisely right now.

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Donald Whytock
Perhaps db.apache.org would be a better example?  Should there be a
semantic.apache.org?

On Mon, Nov 8, 2010 at 11:51 AM, Bertrand Delacretaz
bdelacre...@apache.org wrote:
 Hi Ian,

 On Mon, Nov 8, 2010 at 4:22 PM, Ian Dickinson i...@epimorphics.com wrote:
 ...Bertrand wrote:
...
 2. A Semantic Commons area is created for common code between these
 (and other) projects. Details to be discussed, this does probably not
 warrant a separate Apache project, but might be managed by Clerezza,
 as they were here first

 As an Apache newbie it's hard to comment definitively, but it's not
 clear to me what the common needs of the projects might be, and how
 dependencies would handled. Are there examples of existing commons
 areas between Apache projects, other than basic application-library
 dependencies?


 The best example is commons.apache.org of course - my idea is that
 utilities and libraries that might be useful for several of those new
 semantic projects should be shared whenever possible, to avoid
 duplication of efforts.

 As Olivier notes, it's best to create such a commons area only when we
 really need it, there's no need to define things precisely right now.

 -Bertrand

 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Ian Dickinson
Hi Donald,

On 08/11/10 17:01, Donald Whytock wrote:
 Perhaps db.apache.org would be a better example?  Should there be a
 semantic.apache.org?
I looked around in db.apache.org and I couldn't see anything that said
what the goals of that project are, separate from the goals of the
individual sub-projects. Where should I be looking?

As others have noted, the three semantic web-related projects currently
being proposed have rather different objectives and provide different
capabilities. I personally don't object in principle to factoring out
common code or needs where that's useful, but +1 to Olivier: let's wait
until a clear requirement is identified.

Ian

-- 

Ian Dickinson   Epimorphics Ltd, Bristol, UK
mailto:i...@epimorphics.comhttp://www.epimorphics.com
cell: +44-7786-850536  landline: +44-1275-399069

Epimorphics Ltd.  is a limited company registered in England
(no. 7016688). Registered address: Court Lodge, 105 High St,
  Portishead, Bristol BS20 6PT, UK


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Ross Gardler
For me this proposal means we should be aware of possibilities. There is no 
intention of forcing collaboration on incubating projects. Each project should 
continue to work on it's own graduation. 

However there will probably be code that fan be shared. The sharif of 
appropriate code can only serve to strengthen each of the communities. The key 
word being appropriate

Sent from my mobile device.

On 8 Nov 2010, at 17:30, Ian Dickinson i...@epimorphics.com wrote:

 Hi Donald,
 
 On 08/11/10 17:01, Donald Whytock wrote:
 Perhaps db.apache.org would be a better example?  Should there be a
 semantic.apache.org?
 I looked around in db.apache.org and I couldn't see anything that said
 what the goals of that project are, separate from the goals of the
 individual sub-projects. Where should I be looking?
 
 As others have noted, the three semantic web-related projects currently
 being proposed have rather different objectives and provide different
 capabilities. I personally don't object in principle to factoring out
 common code or needs where that's useful, but +1 to Olivier: let's wait
 until a clear requirement is identified.
 
 Ian
 
 -- 
 
 Ian Dickinson   Epimorphics Ltd, Bristol, UK
 mailto:i...@epimorphics.comhttp://www.epimorphics.com
 cell: +44-7786-850536  landline: +44-1275-399069
 
 Epimorphics Ltd.  is a limited company registered in England
 (no. 7016688). Registered address: Court Lodge, 105 High St,
  Portishead, Bristol BS20 6PT, UK
 
 
 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org
 

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Benson Margulies
I sense an iPhone at work here.

Would that be Omar Sharif?

On Mon, Nov 8, 2010 at 1:04 PM, Ross Gardler rgard...@apache.org wrote:
 For me this proposal means we should be aware of possibilities. There is no 
 intention of forcing collaboration on incubating projects. Each project 
 should continue to work on it's own graduation.

 However there will probably be code that fan be shared. The sharif of 
 appropriate code can only serve to strengthen each of the communities. The 
 key word being appropriate

 Sent from my mobile device.

 On 8 Nov 2010, at 17:30, Ian Dickinson i...@epimorphics.com wrote:

 Hi Donald,

 On 08/11/10 17:01, Donald Whytock wrote:
 Perhaps db.apache.org would be a better example?  Should there be a
 semantic.apache.org?
 I looked around in db.apache.org and I couldn't see anything that said
 what the goals of that project are, separate from the goals of the
 individual sub-projects. Where should I be looking?

 As others have noted, the three semantic web-related projects currently
 being proposed have rather different objectives and provide different
 capabilities. I personally don't object in principle to factoring out
 common code or needs where that's useful, but +1 to Olivier: let's wait
 until a clear requirement is identified.

 Ian

 --
 
 Ian Dickinson                   Epimorphics Ltd, Bristol, UK
 mailto:i...@epimorphics.com        http://www.epimorphics.com
 cell: +44-7786-850536              landline: +44-1275-399069
 
 Epimorphics Ltd.  is a limited company registered in England
 (no. 7016688). Registered address: Court Lodge, 105 High St,
              Portishead, Bristol BS20 6PT, UK


 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org


 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org



-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Jeremy Carroll
To make the commons discussion more concrete I would suggest the 
following  items for the commons:


- an IRI library
- some code to do with vocabularies.
- connecting to a URL and doing semweb aware content negotiation (this 
is typically done badly)


(Actually the IRI code should probably be wider, Jena initially used the 
xerces URI code but then the needs exceeded what they supported)


Jeremy



On 11/8/2010 7:22 AM, Ian Dickinson wrote:

On 08/11/10 15:09, Mattmann, Chris A (388J) wrote:

Wow! Jena is proposing to come to Apache??!

Yep, the proposal has been under discussion for some time within the
project, and Ross is now taking the lead in bringing it publicly into
the incubator process.

Bertrand wrote:

1. Clerezza, Stanbol and Jena are independent podlings, each aiming
for top-level status

2. A Semantic Commons area is created for common code between these
(and other) projects. Details to be discussed, this does probably not
warrant a separate Apache project, but might be managed by Clerezza,
as they were here first.

3. It is expected that those projects will have a number of committers
in common, as there are many collaboration possibilities.

As an Apache newbie it's hard to comment definitively, but it's not
clear to me what the common needs of the projects might be, and how
dependencies would handled. Are there examples of existing commons
areas between Apache projects, other than basic application-library
dependencies?


Ian





-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Jeremy Carroll

On 11/8/2010 7:14 AM, Olivier Grisel wrote:


There is a depedency relationship:

- Stanbol is a of application level HTTP services and set of OSGi
components that use:
- Clerezza as an OSGi service provider which in turn is using:
- Jena as a lib for parsing and serializing RDF models and as a SPARQL
enabled triple store.



  Developers of the upstream project should try to take care
an not reinvent the wheel and try to move new features and bugfix to
upstream if they naturally belong there.


I think all of the items I identified are currently in the Jena 
ballpark, and could safely stay there with this structure.

Possibly no need to separate them out

However we may want a vocabulary area in Jena where any dependent apache 
project can commit their specific vocabularies


Jeremy

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Ross Gardler
Indeed this iPhone is a pain in so many ways. Even worse I seem to have become 
blind to so many of it's useful corrections. 

I suppose sharing of appropriate code makes more sense than sharif, but I 
wouldn't mind having Omar around if that can be pulled off. 

Sent from my mobile device.

On 8 Nov 2010, at 18:15, Benson Margulies bimargul...@gmail.com wrote:

 I sense an iPhone at work here.
 
 Would that be Omar Sharif?
 
 On Mon, Nov 8, 2010 at 1:04 PM, Ross Gardler rgard...@apache.org wrote:
 For me this proposal means we should be aware of possibilities. There is no 
 intention of forcing collaboration on incubating projects. Each project 
 should continue to work on it's own graduation.
 
 However there will probably be code that fan be shared. The sharif of 
 appropriate code can only serve to strengthen each of the communities. The 
 key word being appropriate
 
 Sent from my mobile device.
 
 On 8 Nov 2010, at 17:30, Ian Dickinson i...@epimorphics.com wrote:
 
 Hi Donald,
 
 On 08/11/10 17:01, Donald Whytock wrote:
 Perhaps db.apache.org would be a better example?  Should there be a
 semantic.apache.org?
 I looked around in db.apache.org and I couldn't see anything that said
 what the goals of that project are, separate from the goals of the
 individual sub-projects. Where should I be looking?
 
 As others have noted, the three semantic web-related projects currently
 being proposed have rather different objectives and provide different
 capabilities. I personally don't object in principle to factoring out
 common code or needs where that's useful, but +1 to Olivier: let's wait
 until a clear requirement is identified.
 
 Ian
 
 --
 
 Ian Dickinson   Epimorphics Ltd, Bristol, UK
 mailto:i...@epimorphics.comhttp://www.epimorphics.com
 cell: +44-7786-850536  landline: +44-1275-399069
 
 Epimorphics Ltd.  is a limited company registered in England
 (no. 7016688). Registered address: Court Lodge, 105 High St,
  Portishead, Bristol BS20 6PT, UK
 
 
 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org
 
 
 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org
 
 
 
 -
 To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
 For additional commands, e-mail: general-h...@incubator.apache.org
 

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Bertrand Delacretaz
On Mon, Nov 8, 2010 at 6:01 PM, Donald Whytock dwhyt...@gmail.com wrote:
 Perhaps db.apache.org would be a better example?  Should there be a
 semantic.apache.org?...

That's a possibility, but I would keep such a website for information
that's common to ASF semantic projects, as opposed to making it an
umbrella project.

db.apache.org (and http://ws.apache.org/) are such umbrella projects
that group a number of more or less closely related subprojects.

The current trend is to move away from such umbrella projects, so that
top-level projects are clearly focused and their PMCs can work towards
a clearly defined set of goals.

-Bertrand

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Andy Seaborne



On 08/11/10 18:32, Jeremy Carroll wrote:

To make the commons discussion more concrete I would suggest the
following items for the commons:

- an IRI library
- some code to do with vocabularies.
- connecting to a URL and doing semweb aware content negotiation (this
is typically done badly)

(Actually the IRI code should probably be wider, Jena initially used the
xerces URI code but then the needs exceeded what they supported)

Jeremy


Good idea.  The IRI code is independent of the rest of Jena and is 
valuable in it's own right.


ARP (Jena RDF/XML parser) is also independent of the Jena code structure 
and once was (is it still possible to get just ARP?).  It's just the 
final step of generation that turns the output of parsing into 
Jena-specific objects.  Might be worth splitting out if it would be useful.


The lowest level of RIOT parsing, which defines the tokens for creating 
any of the Turtle family of langauges, is not Jena dependent.  The 
actual RIOT parsers themselves are as they directly generate 
Jena-specific objects to avoid the copy overhead.  It's a performance 
trade-off.


[RIOT is a set of faster parsers for non-XML serializations of RDF, 
currently part of ARQ, but should migrate to Jena core when fully 
stable. - original need was parsers for formats capable of delivering to 
the TDB database at full loading speed without heavy CPU load.]


But the command line tools based on RIOT which parse or validate one 
format are reusable - they use Jena internally, but the input and output 
are completely standard.


The RDF validator Eyeball is also a useful tool in its own right.

Andy


-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org



Re: Clerezza, Stanbol, Jena, Semantic Commons, WDYT?

2010-11-08 Thread Andy Seaborne



On 08/11/10 17:30, Ian Dickinson wrote:

Hi Donald,

On 08/11/10 17:01, Donald Whytock wrote:

Perhaps db.apache.org would be a better example?  Should there be a
semantic.apache.org?

I looked around in db.apache.org and I couldn't see anything that said
what the goals of that project are, separate from the goals of the
individual sub-projects. Where should I be looking?

As others have noted, the three semantic web-related projects currently
being proposed have rather different objectives and provide different
capabilities. I personally don't object in principle to factoring out
common code or needs where that's useful, but +1 to Olivier: let's wait
until a clear requirement is identified.


I agree.  I don't think that a semantic XYZ is useful if that makes 
semantic some sort of different software.  Semantic web technology is 
getting used more widely and a semantic commons or formal grouping of 
projects risks creating the idea that it's all or nothing, which isn't 
the case, hopefully.


Andy

-
To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
For additional commands, e-mail: general-h...@incubator.apache.org