Re: What would you build with a web of data? Decision support

2010-04-13 Thread Wolfgang Orthuber

Hi Georgi,

First let me underline that the following is not a detached theory, it 
is very practical:


The web of data can support the clinician in his cycle of decision:

(a)The clinician makes measurements (in the broadest sense, also 
speaking with the patient and looking at a picture is a measurement).
(b)The clinician focuses on those measurement results which are 
interesting for his therapeutic decisions (feature extraction).
(c)The clinician compares these measurement results with experience. 
At this he may use rules or models which are derived from common experience.
(d)The clinician decides for therapy, and measures the effect of his 
decision, i.e. the cycle starts again with (a).


Good and large experience is very important for step (c).

The cycle of decision (measurements - feature extraction - comparison 
with experience - decision) is also effective outside medicine: Before 
every conscious decision we *compare* decision relevant data with 
experience (or a model which is derived from common experience). 
Experience says, at *similar* situations possibility X yields better 
results than other possibilities, so we decide for possibility X. Even 
if we try to decide best, our decisions are suboptimal due to limited 
experience.


The web of data can be designed in a way, that it collects experiences 
(also decision relevant measurements of machines) in a precise and 
*comparable* way (much more precise and better comparable than text). So 
the web of data can summarize experiences in well defined comparable way 
for decision support.


For this a clear similarity relation is necessary. The natural way to do 
this is a vectorial description of resources, i.e. quantification of the 
resource's properties and regarding the result (a sequence of numbers) 
as vector. After defining an appropriate metric (distance function) we 
can calculate similarity of vectors by calculating the distance between 
them - the less the distance, the more similar are the vectors and (in 
case of good quantification) the original resources. Using HTTP URIs 
allows that all domain name owners can define these vectors and 
optimized distance functions.


Therefore i suggest to introduce standardized Vectorial Resource 
Descriptors (VRDs) on the WEB - and it seems the best possibility to 
integrate these in Linked Data. The paper 
http://www.orthuber.com/wp1.pdf describes details. It is not completely 
up to date, and though the basal content of the VRDs (and Vector Space 
Descriptors - VSDs) is clear, I have not been sure about the syntax of 
the RDF examples (Chapters 2.2.2 and 2.2.3 currently) - and I would like 
to adapt the syntax to suggestions from the community.


So comments and suggestions are very welcome!

Best

Wolfgang


Georgi Kobilarov schrieb:

Yesterday issued a challenge on my blog for ideas for concrete linked open
data applications. Because talking about concrete apps helps shaping the
roadmap for the technical questions for the linked data community ahead. The
real questions, not the theoretical ones...

Richard MacManus of ReadWriteWeb picked up the challenge:
http://www.readwriteweb.com/archives/web_of_data_what_would_you_build.php

Let's be creative about stuff we'd build with the web of data. Assume the
Linked Data Web would be there already, what would build?

Cheers,
Georgi

--
Georgi Kobilarov
Uberblic Labs Berlin
http://blog.georgikobilarov.com




  




Re: ‘New ontology pages’ as Semantic Web foundation

2010-02-06 Thread Wolfgang Orthuber

Dear Alex,

you wrote
this is a reminder that an original idea of Semantic Web is based on 
three foundations, namely, XML, RDF and ontology pages.


but this is by far not complete, much more is possible. The Semantic Web 
could also integrate VRDs (Vectorial Resource Descriptors) which can be 
perceived as members of Conceptual Spaces (see e.g.

http://luisa.open.ac.uk/publications/Dietze-BlendingRealVirtual.pdf
or simply as describer of quantifiable resources (in the broadest sense, 
also sources for feature extraction) in arbitrary resolution.
http://www.orthuber.com/wpa.htm contain most information, with some old 
conventions and partially old nomenclature


Every VRD contains a VSI which is a HTTP URI which identifies the Vector 
Space to which the VRD belongs, and which points to a VSD (a Vector 
Space Descriptor) which contains all necessary information about the 
vector space, also about the distance function for similarity 
comparison. This Similarity is well defined and does *not* need 
explicit creation of links and Vector Spaces can be used to expand the 
Web of Linked Data, both concepts could enrich each other.


All the best

Wolfgang Orthuber



Alex Abramovich schrieb:


Hi all,

   This is a reminder that an original idea of Semantic Web is based 
on three foundations, namely, XML, RDF and ontology pages. Web 
resources’ content must be duplicated in the machine-readable form 
(ontology page); RDF will link a total Web content into one semantic 
network (Semantic Web); intelligent agents, defined on this semantic 
network, will serve the Web visitors.


   As it seems to me, ‘Berners-Lee at al’ expected that Web resource’s 
owners will write ontology pages themselves. Unfortunately, their 
expectation failed.


Why?

   On the one hand, Web resources’ owners don’t ready to pay more for 
their Web resources maintain, and any Web design’s complication 
conflict with real tendency of the simplification (and even 
automation) of Web resources’ construction.


   On the other hand, ‘Berners-Lee at al’ didn’t provide both any 
unified scenario for the ontology pages creation and connective 
semantic mechanism.  

   The ontology pages idea was rebranded, as rightly observed Dan 
Brickley, into other variations on the theme, related to Linked Data 
and such instruments as RDFa, GRDDL etc., which aim to integrate RDF 
more closely into user-facing Web content.


As a result Semantic Web engineers must link now more than 13 billion 
RDF triples and unknown quantity of independent ontologies.


   I suggest returning to the original ontology pages’ idea based on 
the new knowledge representation language Need Language (NL). 
Herewith, I assume that Web publisher is extremely interested in 
success of the publication, but he is against additional and 
unmotivated expenses on the maintenance of Web resource, and also he 
does not wish to penetrate into additional technical details.


   You know that any Web publisher has in mind a satisfaction of a 
certain need of Web visitors. The main problem is to detect what need 
exactly may be satisfied by the given Web resource.


   NL based engine will provide a query-answering session with both 
any Web publisher and any Web visitor using their professional or/and 
everyday slang.  

As a result Web resource’s content and Web visitor’s specification are 
represented the same semantic marked syntax, and NL based engine will 
get an opportunity to find for the visitor an appropriate Web 
resource. If except a need description Web publisher provides a way of 
this need satisfaction, we will get an opportunity to meet Web 
visitor’s need directly or to compose a new way of the given need 
satisfactions using available need-resources.


   In other words, I mean that Web publishers will rewrite (in the 
scope of the query-answering session) their published information in 
the new specific form (or input a description of certain need’s 
satisfaction) that *includes all necessary constructive elements 
including documents and audio/video data* in the corresponding places 
of the new presentation of their Web resource.


   As a result Web visitor will be relieved of necessity to look 
through Web content in search of relevant information. He will deal 
mainly with Web of needs. System engine will interview Web visitor and 
find or generate the actual way of the given need satisfaction. 
Herewith, system engine will demonstrate to the customer *only those* 
documents and audio/video resources, *which* *are related to the found 
way of the given need satisfaction.*


   Optional, Internet provider supplies Web of needs in form of 
configurations ordered by the customer that will allow to the 
governments to regulate the information flow.


Neither Web publisher nor Web visitor will be obliged to know 
something else except the particularities of their needs. They will be 
interacting with Web of needs using their professional slang.


   With the purpose

Re: Making human-friendly linked data pages more human-friendly (was: dbpedia not very visible, nor fun)

2009-09-15 Thread Wolfgang Orthuber

Linking of data can be very successful, if it is not restricted to RDF 
enthusiasts. In this case the
vocabulary can grow extremely. Consider e.g. integration of healthcare data. 
Existing vocabularies like SNOMED
CT
http://www.ihtsdo.org/news/article/view/snomed-ct-and-interoperable-healthcare-conference-tutorials-tuesday-1st-july-2008/
contain about 40 concepts with increasing tendency.

So if the vocabulary is huge, it is not adequate, that the browser software 
knows about the information for
human readable representation, but it could know how to download this 
information from the web using the
linked data concept.

If http URIs are used as identifier, it is possible to store the information 
for human readable representation
at the location where the http URI points to.

Are there up to now rules for this?

Best

Wolfgang

- Original Message - 
From: Richard Cyganiak rich...@cyganiak.de

To: Matthias Samwald samw...@gmx.at
Cc: public-lod@w3.org
Sent: Tuesday, September 15, 2009 12:46 PM
Subject: Re: Making human-friendly linked data pages more human-friendly (was: 
dbpedia not very visible, nor
fun)



Hi Matthias,

Please allow me to present a contrarian argument.

First, there are some datasets that combine linked data output with a  
traditional website, e.g., by
embedding some RDFa markup. Of course,  in that case, all the rules of good web 
design and information
presentation still apply, and the site has to first and foremost  fulfill the 
visitor's information needs in
order to be successful.  That's self-evident and not what we are talking about 
here.

Most linked data is different. The main purpose is not to create a web  site 
where visitors go to look up
stuff. The main purpose is to  publish data in a re-usable way, in order to 
allow repurposing of the  data
in new applications.

In that case, the audience for the human-readable versions of the  RDF data 
is *not* a visitor that came
to the site while googling for  some bit of information. It's more likely to be 
a data analyst, mashup
developer, or integration engineer. So what I suggest is to think of  these 
pages not as something that end
users see, but rather as  something akin to Javadoc. Javadoc pages are 
auto-generated pages that  describe a
public interface of your system. Linked data pages are the  same, but rather 
than a Java API, they describe
your URI space. And  unlike Javadoc, they are directly connected to the 
documented  artifacts (URIs).

I think that the pages should mostly answer the following questions:  What 
concept is identified? What
*exactly* is the URI of this concept  (careful with /html or #this at the end)? 
Who curates this identifier?
Can I trust it to be stable? Most linked data pages actually do a  fairly 
decent job at answering these.

Every data publisher has limited resources, and spending them on  prettifying 
the HTML views is very
low-impact. It's much more  important to increase data quality, publish more 
data,  improve other
documentation, and create compelling demos/apps on top of the data.  The namespace 
documentation is
usually good enough, and the  geekiness of the pages actually helps to drive 
home the point that  it's about
*re-using this data elsewhere*, rather than looking at the  data in the boring 
old web browser.

That being said, of course nicer-looking pages that present  information in a 
more useful way are of course
always better, but  that's a somewhat secondary problem in the linked data 
context.

Best,
Richard


On 15 Sep 2009, at 10:08, Matthias Samwald wrote:


A central idea of linked data is, in my understanding, that every  resource has 
not only a HTTP -
resolvable RDF description of itself,  but also a human-friendly rendering that 
can be viewed in a web
browser. With the increasing popularity of RDFa, the URIs of these  resources 
are not only hidden away in
triplestores, but become  increasingly exposed on web pages. People want to 
click on them,  and, hopefully,
not all of these people come from the core community  of RDF enthusiasts.

This means that the HTML rendering of linked data resources might  need to look 
a bit sexier than it does
today. I dare to say that the  Pubby-esque rendering of DBpedia pages such as
http://dbpedia.org/page/Primary_motor_cortex
is helpful to get a quick overview of the RDF triples about this  resource, but 
non-RDF-enthusiasts would
not find it very inviting.

This could be improved by changes in the layout, and possibly a  manually 
curated ordering of properties.
For example,
http://d.opencalais.com/er/company/ralg-tr1r/f8a13a13-8dbc-3d7e-82b6-1d7968476cae.html
definitely looks more inviting than the typical DBpedia page (albeit  still a 
bit sterile).

In the case of DBpedia, it might be better to expose the excellent  
human-readable Wikipedia page for each
resource, plus a prominently  positioned 'show raw data' tab at the top. For 
other linked data  resources
that are not derived from existing 

Re: URIs for great circle arcs

2009-06-22 Thread Wolfgang Orthuber

Thanks for this implementation! If we implement such distance calculation in a 
search engine, and sort the
result according to distance, this would be local search (search within a special region). In this case the 
search vector has two dimensions (GPS coordinates). This is a special case of general similarity search

http://www.orthuber.com/wpa.htm
In general similarity search the search result is ordered according to a distance functions which can be 
defined by all http URI owners (using the convention shown in Figure 2 of

http://www.orthuber.com/wp1.pdf ).
It would be possible to search generally for objects which have similar quantitative (numeric) description. So 
the general application would be very attractive.


Wolfgang

- Original Message - 
From: Toby A Inkster t...@g5n.co.uk

To: Linked Data community public-lod@w3.org
Sent: Sunday, June 21, 2009 10:45 AM
Subject: URIs for great circle arcs



Great circle arcs are the shortest surface paths between two points  on a 
spherical body. I've minted some
URIs (and am serving up RDF/ XML) for great circle arcs on the surface of the 
Earth. Amongst other  things,
the RDF/XML returned will tell you the distance between the  points, the 
compass bearing and the midpoint.

For example, the arc between London and Tokyo is represented as:

http://ontologi.es/place/arc/51.507778;-0.128056/35.68;139.77

which is a geo:SpatialThing. There is also:

http://ontologi.es/place/arc/ 51.507778;-0.128056/35.68;139.77#points

which is an rdfs:Container representing the (infinite) set of all  points 
between the two endpoints of the
line.

An additional feature is the ability to link to URIs representing the  
endpoints. e.g.:

http://ontologi.es/place/arc/ 
51.507778;-0.128056/35.68;139.77?uri1=http://dbpedia.org/
resource/Londonuri2=http://dbpedia.org/resource/Tokyo

The document you get returned still has a primaryTopic of

http://ontologi.es/place/arc/51.507778;-0.128056/35.68;139.77

(i.e. it doesn't alter the linked data URI) but now contains explicit  dbpedia 
references for the end
points. Those URIs don't have to be  from dbpedia - they could be anything - 
they're not used in
calculations of distances, etc - just included in the output.

Possible uses: flight and travel linked data.

Any ideas for improvements?

--
Toby A Inkster
mailto:m...@tobyinkster.co.uk
http://tobyinkster.co.uk










Re: numeric web search (Was: URLs instead of URNs)

2009-05-27 Thread Wolfgang Orthuber

After our dialogue of yesterday I again thought about the best term for the pattern 
name in
http://www.orthuber.com/wp1.pdf which is simultaneously a identifier and a 
address. Because of chapter 2.1 of
http://www4.wiwiss.fu-berlin.de/bizer/pub/LinkedDataTutorial/
the term  http URI  seems to be appropriate.

We have seen, that a complete, precise, and at once accessible definition in 
exactly one place on the web can
be very helpful to avoid misunderstandings, and can save much time. Therefore the 
identifier (the pattern
name) should not only identify (e.g. the meaning of some numbers), is should 
also point to (a file which
points to) all defining information (of this which should be identified, e.g. 
of numbers).

If numeric web search (similarity search) should be integrated into the semantic web and/or if there is 
interest in efficient representation of quantifiable objects, I would suggest to determine the concrete design 
in a meeting, e.g. a workshop.


Wolfgang

- Original Message - 
From: Kingsley Idehen kide...@openlinksw.com

To: Wolfgang Orthuber orthu...@kfo-zmk.uni-kiel.de
Cc: Dan Brickley dan...@danbri.org; semantic-web semantic-...@w3.org; 
Linked Data community
public-lod@w3.org
Sent: Tuesday, May 26, 2009 4:57 PM
Subject: Re: numeric web search (Was: URLs instead of URNs)



Wolfgang Orthuber wrote:

We know that a URL refers to a (unique) web address. If also
A URL is a Web Address based Identifier
then the Web Address determines also the URL. Because the Web address is 
globally unique, the URL is unique
and can be used as unique identifier.
Is this correct?

The URL can be used as an Identifier because you can use a globally unique 
Resource Location/Address as a
Name for a Thing (e.g. a Document), albeit with implications (i.e. mobility of 
the Thing you name).

(then I could write that the pattern name in http://www.orthuber.com/wp1.pdf is 
a URL, because it is based
on
the location of a unique linking file which points to all defining 
information)

http://www.orthuber.com/wp1.pdf is the Web Address constrained URI (nee. URL) 
for the resource: wp1.pdf
exposed to the Web via an HTTP server.  I've made no mention of all defining 
information .

Kingsley


Wolfgang

- Original Message - From: Kingsley Idehen kide...@openlinksw.com
To: Wolfgang Orthuber orthu...@kfo-zmk.uni-kiel.de
Cc: Dan Brickley dan...@danbri.org; semantic-web semantic-...@w3.org; 
Linked Data community
public-lod@w3.org
Sent: Tuesday, May 26, 2009 3:04 PM
Subject: Re: numeric web search (Was: URLs instead of URNs)



Wolfgang Orthuber wrote:

Dan,

can a http URI refer transiently or accidentally to some address?

Of course.

Which term do you suggest for something which permanently refers to a (unique, 
permanent) web address,
and
which differs if and only if the web address differs?

A URI that carries location/address specificity or dependency (transiently or 
accidentally).

An Identifier with endowed location specificity (overtly or covertly) isn't 
optimal, but that doesn't stop
it being an identifier.

A URL is a Web Address based Identifier -- a URI :-)



Kingsley


Wolfgang

- Original Message - From: Dan Brickley dan...@danbri.org
To: Wolfgang Orthuber orthu...@kfo-zmk.uni-kiel.de
Cc: semantic-web semantic-...@w3.org; Linked Data community 
public-lod@w3.org
Sent: Tuesday, May 26, 2009 1:31 PM
Subject: Re: numeric web search (Was: URLs instead of URNs)



On 26/5/09 15:17, Wolfgang Orthuber wrote:

Dan,

in http://www.w3.org/TR/uri-clarification/ I read An http URI is a URL
. So I concluded that a different http URI is a different URL (address).
At this I assumed, that all http URIs which refer to the same address
(case insensitive), are defined as identical. Is this correct?


I'd rather they'd have said URL is a technically obsolete but common 
colloquial term for http and
http-like URIs. Identity of identifiers is tricky because you have to try to 
distinguish between
identifiers which accidentally of transiently refer to the same thing, versus 
those where it is built-in
to the definition of the scheme (eg. the port 80 and domain name 
canonicalisation rules).

Dan









--


Regards,

Kingsley Idehen   Weblog: http://www.openlinksw.com/blog/~kidehen
President  CEO OpenLink Software Web: http://www.openlinksw.com












--


Regards,

Kingsley Idehen   Weblog: http://www.openlinksw.com/blog/~kidehen
President  CEO OpenLink Software Web: http://www.openlinksw.com











Re: numeric web search (Was: URLs instead of URNs)

2009-05-26 Thread Wolfgang Orthuber

David,

thanks for the address; I also think that entity identifiers should not be multiplied beyond necessity. Though 
different HTTP URIs always refer to different addresses, it is possbile that they have identical application 
(e.g. identification of a number with meaning Length). To reduce the frequency of redundancy, definitions of 
HTTP URIs could contain lists of keywords. Text search restricted to the definitions can disclose existing 
definitions about a specific topic to prevent from redefinitions. If there are nevertheless redefinitions, 
webmasters can ask web search engines for the most frequently used HTTP URI, and prefer it in their web pages. 
So we can get concentration to one name again. This is one possibility, I will contact okkam, perhaps we can 
exchange some ideas.


Best
Wolfgang

- Original Message - 
From: David Booth da...@dbooth.org

To: Wolfgang Orthuber orthu...@kfo-zmk.uni-kiel.de
Cc: semantic-web semantic-...@w3.org; Linked Data community 
public-lod@w3.org
Sent: Monday, May 25, 2009 4:55 PM
Subject: Re: numeric web search (Was: URLs instead of URNs)



Wolfgang,

It sounds like your work may be somewhat related to what the Okkam
project is doing:
http://www.okkam.org/

David Booth


On Mon, 2009-05-25 at 11:59 +0100, Wolfgang Orthuber wrote:

David,

we have independently drawn the same conclusions, this seems to be most efficient. Numeric similarity 
search

needs an efficient approach. Figure 2 in
http://www.orthuber.com/wp1.pdf shows that there are only a few steps that 
linked data are the first which
allow numeric web search with world wide task sharing. The searchable patterns 
are

HTTP URIs with feature vectors (sequences of numbers).

Sequences of numbers are the natural way to describe quantifiable objects, e.g. 
time as one floating point
number (e.g. seconds since 2000), GPS coordinates as two floating point numbers, complex measurement 
results

with more than 100 floating point numbers.

Concerning this suggestions and examples for the best (efficient) syntax are 
welcome!

You may also look at
http://www.bentham-open.org/pages/content.php?TOMINFOJ/2008/0002/0001/21TOMINFOJ.SGM
which shows an attractive application.

Best
Wolfgang

Address:
Dr. Wolfgang Orthuber, Mathematician, Orthodontist, University Clinic of 
Schleswig Holstein, Kiel, Germany
Arnold Heller Str. 16
24105 Kiel

- Original Message - 
From: David Booth da...@dbooth.org

To: W. Orthuber orthu...@kfo-zmk.uni-kiel.de
Cc: Hugh Glaser h...@ecs.soton.ac.uk; semantic-web semantic-...@w3.org; 
Linked Data community
public-lod@w3.org
Sent: Monday, May 25, 2009 3:45 AM
Subject: Re: URLs instead of URNs (Was URI lifecycle (Was: Owning URIs))


 On Thu, 2009-05-21 at 17:08 +0100, W. Orthuber wrote:
 David,

  In short, although semantic web architecture could be designed to permit
  unrestricted semantic drift,
 I think it is a better design -- better
  serving the semantic web community as a whole -- to adopt an
  architecture that permits the semantics of each URI to be anchored, by
  use of a URI declaration.
 Absolutement.
 Yes, I think also, URIs should be well defined. Up to now I thought they are, but your article shows 
 that

 URIs (which are not URLs)
 have not necessarily an unique definition! Moreover URI should be anchored; 
the best would be that they
 contain a link to all their
 definition and further bindingly associated information.

 Why not prefer URIs which are (special defining) URLs, which contain
 a link to a file which contains links to all defining
 information (unambiguous
 information, in multiple languages if wished)?
 So the anchor would be at once accessible and there would be exactly
 one location for the decisive information.

 Yes, the preferred way to do that is quite well described in Cool URIs
 for the Semantic Web:
 http://www.w3.org/TR/cooluris


 -- 
 David Booth, Ph.D.

 Cleveland Clinic (contractor)

 Opinions expressed herein are those of the author and do not necessarily
 reflect those of Cleveland Clinic.







--
David Booth, Ph.D.
Cleveland Clinic (contractor)

Opinions expressed herein are those of the author and do not necessarily
reflect those of Cleveland Clinic.







numeric web search (Was: URLs instead of URNs)

2009-05-25 Thread Wolfgang Orthuber

David,

we have independently drawn the same conclusions, this seems to be most efficient. Numeric similarity search 
needs an efficient approach. Figure 2 in
http://www.orthuber.com/wp1.pdf shows that there are only a few steps that linked data are the first which 
allow numeric web search with world wide task sharing. The searchable patterns are


HTTP URIs with feature vectors (sequences of numbers).

Sequences of numbers are the natural way to describe quantifiable objects, e.g. 
time as one floating point
number (e.g. seconds since 2000), GPS coordinates as two floating point 
numbers, complex measurement results
with more than 100 floating point numbers.

Concerning this suggestions and examples for the best (efficient) syntax are 
welcome!

You may also look at
http://www.bentham-open.org/pages/content.php?TOMINFOJ/2008/0002/0001/21TOMINFOJ.SGM
which shows an attractive application.

Best
Wolfgang

Address:
Dr. Wolfgang Orthuber, Mathematician, Orthodontist, University Clinic of 
Schleswig Holstein, Kiel, Germany
Arnold Heller Str. 16
24105 Kiel

- Original Message - 
From: David Booth da...@dbooth.org

To: W. Orthuber orthu...@kfo-zmk.uni-kiel.de
Cc: Hugh Glaser h...@ecs.soton.ac.uk; semantic-web semantic-...@w3.org; 
Linked Data community
public-lod@w3.org
Sent: Monday, May 25, 2009 3:45 AM
Subject: Re: URLs instead of URNs (Was URI lifecycle (Was: Owning URIs))



On Thu, 2009-05-21 at 17:08 +0100, W. Orthuber wrote:

David,

 In short, although semantic web architecture could be designed to permit
 unrestricted semantic drift,
I think it is a better design -- better
 serving the semantic web community as a whole -- to adopt an
 architecture that permits the semantics of each URI to be anchored, by
 use of a URI declaration.
Absolutement.
Yes, I think also, URIs should be well defined. Up to now I thought they are, 
but your article shows that
URIs (which are not URLs)
have not necessarily an unique definition! Moreover URI should be anchored; the 
best would be that they
contain a link to all their
definition and further bindingly associated information.

Why not prefer URIs which are (special defining) URLs, which contain
a link to a file which contains links to all defining
information (unambiguous
information, in multiple languages if wished)?
So the anchor would be at once accessible and there would be exactly
one location for the decisive information.


Yes, the preferred way to do that is quite well described in Cool URIs
for the Semantic Web:
http://www.w3.org/TR/cooluris




--
David Booth, Ph.D.
Cleveland Clinic (contractor)

Opinions expressed herein are those of the author and do not necessarily
reflect those of Cleveland Clinic.








Re: Owning URIs (Was: Yet Another LOD cloud browser)

2009-05-19 Thread Wolfgang Orthuber

Li,

it would be not difficult to define a syntax and rules so that exclusive 
ownership (standardization model) and
shared ownership (social wikipedia model) can exist simultaneously. At this the 
standardization model can be
designed very efficiently (decentrally defined templates for human readable 
representation in many languages
and metrics for similarity search are at once accessible, cf. my fist email of 
yesterday).

These advantages would be also possible for the social model, because shared 
ownership could be integrated
into the standardization model if only one socially good organized domain name 
owner (e.g. wikipedia.org)
allows this.

Of course this is only possible, if there is (official) support for a concrete 
design variant of the
standardization model. If there are questions concerning the concrete design in 
detail, please don't hesitate
to ask me.

Wolfgang

- Original Message - 
From: Li Ding di...@cs.rpi.edu

To: Wolfgang Orthuber orthu...@kfo-zmk.uni-kiel.de
Cc: Kingsley Idehen kide...@openlinksw.com; Linked Data community 
public-lod@w3.org;
semantic-...@w3.org
Sent: Monday, May 18, 2009 6:57 PM
Subject: Re: Owning URIs (Was: Yet Another LOD cloud browser)



If there is no explicit official standard and recommendation for such global
task sharing, there is the
danger that those who need a special vocabulary develop many incompatible
standards for exchange of machine
readable data.

The earlier such a standard for well defined task sharing is recommended by
W3C, the easier it can be
introduced.


well, we can learn from the past:
(i) biomedical terminologies are systematically maintained and
probably globalized -  I would say such terms are maintained via
standardization processes.  recent advance of DOI allows use to assign
an official URI for a publication. so does DNS.
(ii) English is evolving by daily life usage (that just reminded me an
earlier mail by Jeremy on live meaning and dead languages) - this is
case for a social vocabulary development

As mentioned in the previous email, standardization and social
evolution are useful and somehow complementary. However, they are more
or less an approach to the goal, which has been discussed in previous
email:
* given an URI in browser, a user really need to get it resolved (i.e.
be able to fetch its definition) , whether using the namespace of the
URI or use a search engine is just an option to the browser.
* knowing the ownership will be helpful for users to trust the
description of URI.  The coexistence of exclusive ownership
(standardization model) or shared ownership (social wikipedia model)
are complementary solutions.

As David said, social evolution could be dirty, but it also looks good:
* it is really not an easy job for one to maintain a comprehensive
description of URI for a long time (hard drive may crash, money may
run out, interests may switch, language may change). In addition to
the exclusive ownership for important concepts, open source style
social development should help too.  so why not allow multiple
ownership for one URI
* the web is open, how can we help end users to pick one from the
multiple URIs referring to the same concept. so why not use social
ranking (where ownership may play an important role) to enable the
Darwin evolution.


--
Li Ding
http://www.cs.rpi.edu/~dingl/






Re: Owning URIs (Was: Yet Another LOD cloud browser)

2009-05-18 Thread Wolfgang Orthuber

It is important to track the ownership (further provenance) of the
description of URI. we may want to know who published the definition,
and where the definition is copied from. Being able to connect RDF
triples with authors is an important step towards the social semantic
web.


In my proposal the name of the defining domain is included in the URI. It 
determines the decisive definition.
Responsible is the owner of the defining domain.

Wolfgang



Re: Owning URIs (Was: Yet Another LOD cloud browser)

2009-05-18 Thread Wolfgang Orthuber

Kingsley,

What I wanted to say is that the existing system for registration of domain names can be also efficiently used 
to
initiate well defined task sharing in the definition of a global vocabulary for the semantic web, using a 
simple standard as proposed in my initial email of today.


If there is no explicit official standard and recommendation for such global 
task sharing, there is the
danger that those who need a special vocabulary develop many incompatible 
standards for exchange of machine
readable data.

The earlier such a standard for well defined task sharing is recommended by 
W3C, the easier it can be
introduced.

Wolfgang

- Original Message - 
From: Kingsley Idehen kide...@openlinksw.com

To: Wolfgang Orthuber orthu...@kfo-zmk.uni-kiel.de
Cc: Li Ding di...@cs.rpi.edu; Linked Data community public-lod@w3.org; 
semantic-...@w3.org
Sent: Monday, May 18, 2009 4:01 PM
Subject: Re: Owning URIs (Was: Yet Another LOD cloud browser)



Wolfgang Orthuber wrote:

It is important to track the ownership (further provenance) of the
description of URI. we may want to know who published the definition,
and where the definition is copied from. Being able to connect RDF
triples with authors is an important step towards the social semantic
web.


In my proposal the name of the defining domain is included in the URI. It 
determines the decisive
definition.
Responsible is the owner of the defining domain.

Wolfgang



Wolfgang,

A URL is a URI. It identifies an address.
A URI embodies URLs and URNs.
You can refer to things using HTTP URIs or URNs.
You can refer to things and resolve descriptions of said things via HTTP URIs.

In all cases, URIs inherently posses domain ownership implications via the 
authority component.

--


Regards,

Kingsley Idehen   Weblog: http://www.openlinksw.com/blog/~kidehen
President  CEO OpenLink Software Web: http://www.openlinksw.com










Re: numeric data on the web, numeric web search

2009-04-30 Thread Wolfgang Orthuber

Thanks for the information! I have posted
http://www.thenationaldialogue.org/ideas/numeric-web-search
an hope it helps.

Let me know if you have further ideas and suggestions, I am always interested.

Wolfgang

- Original Message - 
From: Kingsley Idehen kide...@openlinksw.com

To: Semantics-ProjectParadigm metadataport...@yahoo.com
Cc: public-lod@w3.org; Wolfgang Orthuber orthu...@kfo-zmk.uni-kiel.de; 
semantic-web
semantic-...@w3.org
Sent: Wednesday, April 29, 2009 9:35 PM
Subject: Re: numeric data on the web, numeric web search



Semantics-ProjectParadigm wrote:

Excellent and timely starting point. Will definitely look into sending in ideas 
about making available data
that will help stimulate green revolution and generate green jobs.


Even if the Govts. of the world simply publish XML based structured data, that 
alone would deliver full
employment and lots of follow-on  opportunities for the RDFization technology 
developers  :-)

So as long as the data is structured, there will be huge opportunities for 
those that grok Linked Data and
the process of RDFization, esp. those that  generate wrapper/proxy based Linked 
Data URIs  :-)


Kingsley


December 2009 in Denmark follow up to Kyoto is coming up. Oceans of raw data 
waiting to be processed to
come up with policies that both address climate change issues AND generate new 
jobs!

Milton Ponson
GSM: +297 747 8280
Rainbow Warriors Core Foundation
PO Box 1154, Oranjestad
Aruba, Dutch Caribbean
www.rainbowwarriors.net
Project Paradigm: A structured approach to bringing the tools for sustainable 
development to all
stakeholders worldwide
www.projectparadigm.info
NGO-Opensource: Creating ICT tools for NGOs worldwide for Project Paradigm
www.ngo-opensource.org
MetaPortal: providing online access to web sites and repositories of data and 
information for sustainable
development
www.metaportal.info
SemanticWebSoftware, part of NGO-Opensource to enable SW technologies in the 
Metaportal project
www.semanticwebsoftware.info


--- On *Wed, 4/29/09, Kingsley Idehen /kide...@openlinksw.com/* wrote:


From: Kingsley Idehen kide...@openlinksw.com
Subject: Re: numeric data on the web, numeric web search
To: Semantics-ProjectParadigm metadataport...@yahoo.com
Cc: public-lod@w3.org, Wolfgang Orthuber
orthu...@kfo-zmk.uni-kiel.de, semantic-web semantic-...@w3.org
Date: Wednesday, April 29, 2009, 7:50 PM

Semantics-ProjectParadigm wrote:
 See

http://googleblog.blogspot.com/2009/04/adding-search-power-to-public-data.html.

 This is the first intent at making large amounts of data
available in structured formats.

 Although it is not linked data in all conceivable formats from
all sources on the web, the fact that the E-Government Act is
forcing US federal agencies public data to make their data more
accessible could be the push required to get linked data
initiatives to the next level.

 Time for a Semantic Web/Linked Data lobby in DC to make funding
available to expand to all public domains.\


We can start here:

http://www.thenationaldialogue.org/ideas/make-collecting-recovery-data-agile-using-semantic-web-technology
:-)


Kingsley

 Milton Ponson
 GSM: +297 747 8280
 Rainbow Warriors Core Foundation
 PO Box 1154, Oranjestad
 Aruba, Dutch Caribbean
 www.rainbowwarriors.net
 Project Paradigm: A structured approach to bringing the tools
for sustainable development to all stakeholders worldwide
 www..projectparadigm.info
 NGO-Opensource: Creating ICT tools for NGOs worldwide for
Project Paradigm
 www.ngo-opensource.org
 MetaPortal: providing online access to web sites and
repositories of data and information for sustainable development
 www.metaportal.info
 SemanticWebSoftware, part of NGO-Opensource to enable SW
technologies in the Metaportal project
 www.semanticwebsoftware.info


 --- On *Wed, 4/29/09, Wolfgang Orthuber
/orthu...@kfo-zmk.uni-kiel.de
/mc/compose?to=orthu...@kfo-zmk.uni-kiel.de/* wrote:


 From: Wolfgang Orthuber orthu...@kfo-zmk.uni-kiel.de
/mc/compose?to=orthu...@kfo-zmk.uni-kiel.de
 Subject: numeric data on the web, numeric web search
 To: public-lod@w3.org /mc/compose?to=public-...@w3.org
 Date: Wednesday, April 29, 2009, 3:25 PM

 Hello!
  We know that quantifiable objects play a central role
in daily
 life. Nevertheless up to now quantifiable objects have in
general
 no well defined globally machine readable and precise
 representation on the web. The following concept proposes a
simple
 data structure called pattern for such representation of
 quantifiable objects in general which also allows their
similarity
 search:
 
  * Numeric web search *
  Web search is up to now word

numeric data on the web, numeric web search

2009-04-29 Thread Wolfgang Orthuber
Hello!

We know that quantifiable objects play a central role in daily life. 
Nevertheless up to now quantifiable objects have in general no well defined 
globally machine readable and precise representation on the web. The following 
concept proposes a simple data structure called pattern for such 
representation of quantifiable objects in general which also allows their 
similarity search:


* Numeric web search *

Web search is up to now word based. Additionally language independent 
similarity search of quantifiable objects is desirable. For well defined 
numeric representation of quantifiable objects a simple data structure called 
pattern is proposed, which contains a feature vector (a sequence of numbers) 
for representation of the object, and a pattern name which is a URI which 
uniquely identifies the kind of object which is represented by the feature 
vector.

Pattern:  Pattern name   +feature vector  
(+ auxilliary data)

Patterns with the same pattern name represent the same kind of object. Because 
the number of possible pattern names is not limited*, infinitely* many 
different kinds of quantifiable objects can be represented by patterns.  (*only 
physically limited by finite time and energy)

So the search terms are not words, but feature vectors in patterns which allow 
quantification of similarity. Feature vectors of patterns with the same pattern 
name are directly comparable using a given metric. At this similarities of the 
original quantifiable objects are mapped to spatial similarities of the feature 
vectors. So similarity search is possible by calculating distances: Objects are 
the more similar, the smaller the distance between the feature vectors of the 
representing patterns is.

Due to the multitude of different kinds of quantifiable objects the work for 
development of efficient pattern resp. feature vector definitions for their 
representation is open ended. Global task sharing has the greatest potential: 
According to this suggestion every owner of an internet domain name abc.xyz 
gets the right to define feature vectors of all patterns with names abc.xyz/* 
(in well defined location abc.xyz/pat/*).

Patterns are machine readable, uniformly comparable and searchable. They allow 
to search with the same search engine not only for text, but also for an 
increasing number of well-defined quantifiable objects on the web. This 
bundling of the search activity into one crawler and web database for all 
quantifiable objects is much more efficient than building and managing a 
database and a crawler for every kind of object.

Numeric similarity search could be efficiently combined with conventional word 
based search. Details are described in http://www.orthuber.com/wpa.htm , don't 
hesitate to ask me further questions.


It seems clear that introduction of the above conventions would have relevant 
advantages. Can this get support that we can step by step realize this?

Regards

Wolfgang Orthuber   (Mathematician and Orthodontist at University of Kiel / 
Germany)