ANN: BauDataWeb: The European Building and Construction Materials Dataset for the Semantic Web
Dear all: I am glad to announce the official release of BauDataWeb, the European building and construction materials database for the Semantic Web: URI: http://semantic.eurobau.com/ With this project, we expose a major dataset reflecting the European building and construction materials market on the basis of the GoodRelations Web Vocabulary for E-Commerce. This allows for the fine-grained search for products, suppliers, and warehouses for any building-related sourcing need. BauDataWeb is one of the densest and richest public datasets for a well-defined vertical business sector that is available on the Semantic Web. It covers a major share of the European market for construction and building materials. Key distinctions from other datasets are: 1. The market for building materials shows a very high item specificity, which makes it very interesting for new types of search. 2. Transportation costs for building materials are usually very significant, which makes the distance from the warehouse to the point of consumption a critical dimension of search. 3. A large part of the items includes a rich, machine-readable description of product features using the FreeClassOWL ontology. We expect that the data can be very well combined with other related datasets on the Web of Linked Data, e.g. * dbPedia information about population or transportation infrastructure, * governmental information, or * real estate offers. Features * Over 60 million triples of real business data with a high domain density * Fully GoodRelations-compliant * Fully W3C-compliant * Geo data for warehouse locations * FreeClassOWL product classes and properties for a majority of the products Components == 1. Dataset: The full data is available in RDF. The data consists of ca. 1.5 million individual RDF/XML files plus a few large data dumps in N-Triples syntax that simplify the crawling of all data at once. For fetching the dataset, please use the sitemap at http://semantic.eurobau.com/sitemap.xml. 2. FreeClassOWL: A GoodRelations-compliant ontology for describing construction and building materials and services * HTML: http://www.freeclass.eu/freeclass_v1.html * OWL in RDF/XML: http://www.freeclass.eu/freeclass_v1.owl 3. The Eurobau Utility Ontology, which defines a few extensions to GoodRelations for the particular vertical domain * OWL in RDF/XML: http://semantic.eurobau.com/eurobau-utility.owl (no other representation available at the moment) 4. A (simple) demo application that demonstrates queries combining product features and warehouse distance * http://www.ebusiness-unibw.org/tools/freeclass-search/ SPARQL Access = Currently, the data is available for SPARQL queries via the OpenLink Software Virtuoso repositories at * http://lod.openlinksw.com/sparql and * http://linkeddata.uriburner.com/sparql Statistics == * 81 Manufacturers / Brands * 19 Resellers * 183 Warehouse locations in Austria * 56.360 Product types (including variants) * 1.783.798 Offerings * 95 % of the product models include rich FreeClassOWL descriptions, e.g. class membership and properties Acknowledgments === BauDataWeb is a joint project by inndata Datentechnik GmbH and the E-Business Web Science Research Group at Universität der Bundeswehr München. The data conversion and implementation was carried out by Andreas Radinger and Martin Hepp at the E-Business Web Science Research Group at the Universität der Bundeswehr München, Germany. The underlying relational database has been designed by Otto Handle and is being maintained and operated by inndata Datentechnik GmbH. The work on BauDataWeb was partially funded by the Austrian FFG under the project grant icontent.document (grant no. 819773). Thanks go also to Kingsley Idehen from OpenLink Software for hosting the data. Best wishes Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology
Support for Linked Data for E-Commerce in DotNetNuke Shop Software / GoodRelations
Dear all: The latest release of the NB_Store module [1] for e-commerce sites based on DotNetNuke CMS [2] seems to support GoodRelations in RDFa. See here for details: http://nbstore.codeplex.com/releases/view/45017 The underlying DotNetNuke CMS is said to power over 600,000 production web sites. I would be very interested in URIs of shops that are using this latest version of NB_Store or experience reports. Best wishes Martin Hepp [1] http://nbstore.codeplex.com/ [2] http://www.dotnetnuke.com/ -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
Please stop massive crawling against http://openean.kaufkauf.net/id/
Dear all: The volunteer who is hosting http://openean.kaufkauf.net/id/, a huge set of GoodRelations product model data, is experiencing a problematic amount of traffic from unidentified crawlers located in Ireland (DERI?), the Netherlands (VUA?), and the USA. The crawling has been so intense that he had to temporarily block all traffic to this dataset. In case you are operating any kind of Semantic Web crawlers that tried to access this dataset, please 1. check your crawler for bugs that create excessive traffic (e.g. by redundant requests), 2. identify your crawler agent properly in the HTTP header, indicating a contact person, and 3. implement some bandwidth throttling technique that limits the bandwidth consumption on a single host to a moderate amount. Note that the full dataset is always up to date in the LOD SPARQL endpoint at http://lod.openlinksw.com/sparql Thus, there is rarely a need to crawl the complete dataset. Thanks for your consideration. Best wishes Martin Hepp -- -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
GoodRelations vs. Google RDFa vs. Open Graph vs. hProduct/hListing: Using GoodRelations in 10 Triples
Dear all: Some people think that the GoodRelations ontology for e-commerce (http://purl.org/goodrelations/) is powerful, but complex. I think it is important for everybody in the community to know that GoodRelations can be as simple (or simpler) than any more lightweight approach for product markup, as long as you compare the same level of granularity. Below, please find an example of offering a car for sales in just ten (!) triples. Of course, you can do more with GoodRelations than just encoding a price and carrying the semantics of the product itself in a string. The important message in here is that simple chunks of data are as simple in GoodRelations as they are in hProduct/hListing microformats, Google's RDFa vocabulary, or the Open Graph product markup. The key difference is that GoodRelations has a much more extensible and, in my biased ;-) judgement: cleaner, conceptual model so that IF you have more granular data available, THEN you can expose it, and make your products more findable on the Web. For example, GoodRelations distinguishes between products and product models / datasheets. That allows for powerful linking between individual items and rich technical specifications from the manufacturer's page. The Open Graph approach seems to use a plain topic semantics, which mixes items, datasheets, and offers. Please keep that in mind and spread the word. Here is the markup. Best wishes Martin Turtle/N3: = @prefix foo: http://www.example.com/xyz# . @prefix gr: http://purl.org/goodrelations/v1# . @prefix xsd: http://www.w3.org/2001/XMLSchema# . @prefix rdfs: http://www.w3.org/2000/01/rdf-schema# . foo:company a gr:BusinessEntity; gr:legalName Hepp Space Ventures Inc.; gr:offers foo:offering. foo:offering a gr:Offering; rdfs:label Volkswagen Station Wagon, 4WD, 400 $@en; rdfs:description I sell my old Volkswagen Station Wagon, 4WD, for 400 $@en; gr:hasPriceSpecification [ a gr:UnitPriceSpecification; gr:hasCurrencyValue 400^^xsd:string; gr:hasCurrency USD^^xsd:string. ]. RDFa: div xmlns:rdf=http://www.w3.org/1999/02/22-rdf-syntax-ns#; xmlns:rdfs=http://www.w3.org/2000/01/rdf-schema#; xmlns=http://www.w3.org/1999/xhtml; xmlns:gr=http://purl.org/goodrelations/v1#; xmlns:foo=http://www.example.com/xyz#; xmlns:xsd=http://www.w3.org/2001/XMLSchema#; div about=http://www.example.com/xyz#company; typeof=gr:BusinessEntity div property=gr:legalName content=Hepp Space Ventures Inc./div div rel=gr:offers div about=http://www.example.com/xyz#offering; typeof=gr:Offering div property=rdfs:label content=Volkswagen Station Wagon, 4WD, 400 $ xml:lang=en/div div property=rdfs:description content=I sell my old Volkswagen Station Wagon, 4WD, for 400 $ xml:lang=en/div div rel=gr:hasPriceSpecification div typeof=gr:UnitPriceSpecification div property=gr:hasCurrency content=USD datatype=xsd:string/div div property=gr:hasCurrencyValue content=400 datatype=xsd:string/div /div /div /div /div /div /div RDF/XML: === ?xml version=1.0? rdf:RDF xmlns:gr=http://purl.org/goodrelations/v1#; xmlns:rdf=http://www.w3.org/1999/02/22-rdf-syntax-ns#; xmlns:foo=http://www.example.com/xyz#; xmlns:xsd=http://www.w3.org/2001/XMLSchema#; xmlns:rdfs=http://www.w3.org/2000/01/rdf-schema#; gr:BusinessEntity rdf:about=http://www.example.com/xyz#company; gr:legalNameHepp Space Ventures Inc./gr:legalName gr:offers gr:Offering rdf:about=http://www.example.com/xyz#offering; rdfs:label xml:lang=enVolkswagen Station Wagon, 4WD, 400 $/rdfs:label rdfs:description xml:lang=enI sell my old Volkswagen Station Wagon, 4WD, for 400 $/rdfs:description gr:hasPriceSpecification gr:UnitPriceSpecification gr:hasCurrencyValue rdf:datatype=http://www.w3.org/2001/XMLSchema#string;400/gr:hasCurrencyValue gr:hasCurrency rdf:datatype=http://www.w3.org/2001/XMLSchema#string;USD/gr:hasCurrency /gr:UnitPriceSpecification /gr:hasPriceSpecification /gr:Offering /gr:offers /gr:BusinessEntity /rdf:RDF -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology
Re: GoodRelations vs. Google RDFa vs. Open Graph vs. hProduct/hListing: Using GoodRelations in 10 Triples
Apologies - There were a few minor bugs in the initial markup: - I forgot the business function. - The datatype for the price was xsd:string instead of xsd:float. - The legal name had no language tag. The correct examples are at http://ebusiness-unibw.org/pipermail/goodrelations/2010-May/000215.html On 03.05.10 10:38, Martin Hepp (UniBW) wrote: Dear all: Some people think that the GoodRelations ontology for e-commerce (http://purl.org/goodrelations/) is powerful, but complex. I think it is important for everybody in the community to know that GoodRelations can be as simple (or simpler) than any more lightweight approach for product markup, as long as you compare the same level of granularity. Below, please find an example of offering a car for sales in just ten (!) triples. Of course, you can do more with GoodRelations than just encoding a price and carrying the semantics of the product itself in a string. The important message in here is that simple chunks of data are as simple in GoodRelations as they are in hProduct/hListing microformats, Google's RDFa vocabulary, or the Open Graph product markup. The key difference is that GoodRelations has a much more extensible and, in my biased ;-) judgement: cleaner, conceptual model so that IF you have more granular data available, THEN you can expose it, and make your products more findable on the Web. For example, GoodRelations distinguishes between products and product models / datasheets. That allows for powerful linking between individual items and rich technical specifications from the manufacturer's page. The Open Graph approach seems to use a plain topic semantics, which mixes items, datasheets, and offers. Please keep that in mind and spread the word. Here is the markup. Best wishes Martin -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
Re: GoodRelations vs. Google RDFa vs. Open Graph vs. hProduct/hListing: Using GoodRelations in 10 Triples
Hi Henry, Thanks for your feedback! I wonder if the following could make it even simpler though: First I think, why not make the currency a literal? foo:offering a gr:Offering; rdfs:label Volkswagen Station Wagon, 4WD, 400 $@en; rdfs:description I sell my old Volkswagen Station Wagon, 4WD, for 400 $@en; gr:hasPriceSpecification [ a gr:UnitPriceSpecification; gr:value 400^^gr:USD; ] . When you have a relation from a string to a value, you have a literal. monetary values are good examples it seems to me of this. It would remove at least three of your URL usages, as you had to specify the xsd:string twice, and also the type of the currency. There are several reasons for using the current GoodRelations pattern. The most important one is that GoodRelations represents all point values internally as intervals, in order to support reasoning over ranges with simple RDFS-style reasoners. As for prices, that has been an explicit request from the hotel domain. So gr:hasCurrencyValue is an rdfs:subPropertyOf gr:hasMaxCurrencyValue AND of gr:hasMinCurrencyValue. This gives a nice coherent model for point values and intervals, without the need for a SPARQL endpoint and either a bit of RDFS-style reasoning or a bit of SPARQL query expansion. Example: # 3..4 USD per kilogram, e.g. depending on the size of the potatoes foo:price a gr:UnitPriceSpecification ; gr:hasCurrency USD^^xsd:string ; gr:hasMinCurrencyValue 3.0^^xsd:float ; gr:hasMaxCurrencyValue 4.0^^xsd:float ; gr:hasUnitOfMeasurement KGM^^xsd:string ; gr:valueAddedTaxIncluded true^^xsd:boolean . # 4 USD per kilogram foo:price a gr:UnitPriceSpecification ; gr:hasCurrency USD^^xsd:string ; gr:hasCurrencyValue 4.0^^xsd:float ; gr:hasUnitOfMeasurement KGM^^xsd:string ; gr:valueAddedTaxIncluded true^^xsd:boolean . Both will work with the following query: # find something that has a price or upper price limit of less than 4 USD / kg SELECT ?offer WHERE { ?offer a gr:Offering . ?offer gr:hasPriceSpecification ?p . ?p gr:hasMaxCurrencyValue ?v . ?p gr:hasCurrency USD^^xsd:string . ?p gr:UnitOfMeasurement KGM^^xsd:string . FILTER (?p = 4) } The important message is that, without any domain-specific tweak of the GoodRelations vocabulary, you can model things as simple as this foo:offering a gr:Offering; rdfs:label Volkswagen Station Wagon, 4WD, 400 $@en; rdfs:comment I sell my old Volkswagen Station Wagon, 4WD, for 400 $@en; gr:hasBusinessFunction gr:Sell; gr:hasPriceSpecification [ a gr:UnitPriceSpecification; gr:hasCurrencyValue 400^^xsd:float; gr:hasCurrency USD^^xsd:string. ]. and as fine-granular as this: # Boat Rental: 24 USD / 6 hours for 0-3 days, 75 USD per day for any longer rental foo:company a gr:BusinessEntity ; gr:legalName Hepp Space Ventures, Inc.@en ; gr:offers foo:boat_offer. foo:boat_offer a gr:Offering ; rdfs:label Boat rental@en ; rdfs:comment We rent out boats@en ; gr:hasBusinessFunction gr:LeaseOut ; gr:hasPriceSpecification foo:rental_price1, foo:rental_price2 . foo:rental_price1 a gr:UnitPriceSpecification ; gr:hasCurrency USD^^xsd:string ; # 24 USD per 6 hours = 4 USD / hour gr:hasCurrencyValue 4^^xsd:float ; gr:hasUnitOfMeasurement HUR^^xsd:string ; gr:billingIncrement 6^^xsd:float gr:valueAddedTaxIncluded true^^xsd:boolean ; gr:hasEligibleQuantity [ a gr:QuantitativeValueInteger ; gr:hasUnitOfMeasurement DAY^^xsd:string ; gr:hasMinValueInteger 0^^xsd:int ; gr:hasMaxValueInteger 3^^xsd:int . ]. foo:rental_price2 a gr:UnitPriceSpecification ; gr:hasCurrency USD^^xsd:string ; gr:hasCurrencyValue 75^^xsd:float ; gr:hasUnitOfMeasurement DAY^^xsd:string ; gr:valueAddedTaxIncluded true^^xsd:boolean ; gr:hasEligibleQuantity [ a gr:QuantitativeValueInteger ; gr:hasUnitOfMeasurement DAY^^xsd:string ; gr:hasMinValueInteger 4^^xsd:int . ]. And both use the same set of conceptual elements, and will basically trigger the very same SPARQL queries. That is a major difference to any other conceptual approach for product / offer data on the Web - to keep the simple things simple while supporting the very complex cases off the box. Also in your example you could put the company in a different file, and refer to it by reference. Yes, I agree. In particular, in RDFa one will use the rev attribute to do that. Best Martin -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail:h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www:http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp
Re: GoodRelations vs. Google RDFa vs. Open Graph vs. hProduct/hListing: Using GoodRelations in 10 Triples
FYI - This may also be of interest for anybody of you working on linked data for e-commerce. Best Martin Hepp Original Message Subject: More examples of modeling price information with GoodRelations Date: Mon, 03 May 2010 17:47:48 +0200 From: Martin Hepp (UniBW) martin.h...@ebusiness-unibw.org Reply-To: martin.h...@ebusiness-unibw.org Organization: http://www.heppnetz.de To: goodrelations goodrelati...@ebusiness-unibw.org Dear all: I just completed a new GoodRelations recipe that describes how GoodRelations can be used to model price information in more detail, e.g. - quantity discounts, - seasonal discounts and special offers, - billing increments, - price ranges, - combining flat and metered price components (25 USD / day plus 1 USD per mile), and - cancellation policies. Please check http://www.ebusiness-unibw.org/wiki/GoodRelationsPricing Best wishes Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
www.shopforia.com exposes GoodRelations in RDFa for 104,000 items / 5 mio. triples
Dear all: I am happy to announce one of many new major sources of GoodRelations data: http://www.shopforia.com/ recently added GoodRelations in RDFa to their 104,000 items pages in 26 categories: Example: http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?Operation=ItemLookupItemId=B003DRBA3S Sitemap: There seems to be no sitemap for the site, but you can reconstruct it by 1. crawling http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?SearchIndex=CategoryItemPage=n with n = 1...400 for Category = Apparel, Beauty etc. and 2. extracting all item links, which follow the pattern http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?Operation=ItemLookupItemId=ASIN, with ASIN = e.g. B0009RL86E Many items have EAN/UPC codes, which allows for powerful linking with product model data (features etc.). I guess that the total amount of triples will be in the order of magnitude of 50 triples per page x 104,000 pages = 5,200,000 triples. Best wishes Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
Re: www.shopforia.com exposes GoodRelations in RDFa for 104,000 items / 5 mio. triples
Hi Constatine, Yes, I consider that good practice. I can't see the benefit this might give to any agent making use of the triples. owl:imports means that the model shall include all axioms from the imported ontologies. Often (but not always), the same effect can be achieved by dereferencing the URIs of all conceptual elements mentioned in a document. I know that some people in the LOD community don't think owl:imports is necessary, but I do. There have been lengthy discussions about that, but basically RDFa is just one of many syntactical variants for RDF, so there is no strong argument for arbitrarily omitting anything. BTW, I just found that there are some other quirks in the markup. We already contacted the site owner and hope he will fix them shortly. Best Martin On 28.04.10 12:57, Hondros, Constantine wrote: Looking at the example source, I'm curious what's going on here: div class=description about=http://www.shopforia.com/; typeof=owl:Ontology div rel=owl:imports resource=http://www.w3.org/2006/vcard/ns;/div div rel=owl:imports resource=http://purl.org/goodrelations/v1;/div Is this considered good practise? I can't see the benefit this might give to any agent making use of the triples. -Original Message- From: public-lod-requ...@w3.org [mailto:public-lod-requ...@w3.org] On Behalf Of Martin Hepp (UniBW) Sent: 28 April 2010 11:07 To: public-lod@w3.org Subject: www.shopforia.com exposes GoodRelations in RDFa for 104,000 items / 5 mio. triples Dear all: I am happy to announce one of many new major sources of GoodRelations data: http://www.shopforia.com/ recently added GoodRelations in RDFa to their 104,000 items pages in 26 categories: Example: http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?Operation=ItemLookupItemId=B003DRBA3S Sitemap: There seems to be no sitemap for the site, but you can reconstruct it by 1. crawling http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?SearchIndex=CategoryItemPage=n with n = 1...400 for Category = Apparel, Beauty etc. and 2. extracting all item links, which follow the pattern http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?Operation=ItemLookupItemId=ASIN, with ASIN = e.g. B0009RL86E Many items have EAN/UPC codes, which allows for powerful linking with product model data (features etc.). I guess that the total amount of triples will be in the order of magnitude of 50 triples per page x 104,000 pages = 5,200,000 triples. Best wishes Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009 This email and any attachments may contain confidential or privileged information and is intended for the addressee only. If you are not the intended recipient, please immediately notify us by email or telephone and delete the original email and attachments without using, disseminating or reproducing its contents to anyone other than the intended recipient. Wolters Kluwer shall not be liable for the incorrect or incomplete transmission of of this email or any attachments, nor for unauthorized use by its employees. Wolters Kluwer nv has its registered address in Alphen aan den Rijn, The Netherlands, and is registered with the Trade Registry of the Dutch Chamber of Commerce under number 33202517. -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de
GoodRelations Ontology: Update + New Features + New Documentation
Dear all: We just released a service update to the GoodRelations ontology for e-commerce. The update is backwards-compatible to previous releases and mainly adds additional properties that are important for the rental and accommodation business. We also redesigned the complete language reference. Please find the latest resources from the following URIs: * http://purl.org/goodrelations/v1 (RDF/XML or HTML via content negotiation) * http://purl.org/goodrelations/v1.owl (RDF/XML) * http://purl.org/goodrelations/v1.html (HTML) A change log is at * http://bit.ly/dhXqY9 == !!! Please refresh your caches !!! == Highlights == - New gr:owns property for indicating the products that you own, which allows exposing ownership data, e.g. to feed for future recommender systems. So companies and individuals can now express offers (gr:offers), demand (gr:seeks), and ownership (gr:owns). - Ordering relations for gr:QualitativeValues, which may be useful e.g. for modeling garment sizes (XLLMS) (gr:greater, gr:lesser, gr:greaterOrEqual, gr:lesserOrEqual, gr:equal, and gr:nonEqual) - gr:validFrom and gr:validThrough can now be applied to opening hours and payment / delivery charge specifications. This allows for modeling e.g. longer opening hours in the holiday season, or free shipping in a certain period. - New properties gr:eligibleDuration, gr:eligibleTransactionVolume, gr:advanceBookingRequirement, gr:billingIncrement, gr:deliveryLeadTime, gr:availabilityStarts, and gr:availabilityEnds for being more specific about the offer or pricing. * Examples: o „Free shipping starting from 20 EURO“ (attached to a gr:DeliveryChargeSpecification) o „Minimal order volume 10 Euro“ (attached to a gr:Offering) o „Service fee for credit card payments below 10 Euro“ (attached to a gr:PaymentChargeSpecification) - New property gr:serialNumber - Ordering relations between gr:DayOfWeek, useful for sorting them in user interfaces and for including adjacent days (gr:hasPrevious and gr:hasNext). - New properties gr:successorOf and gr:predecessorOf for linking newer product models (gr:ProductOrServiceModel) to discontinued ones. Backward Compatibility Issues = The only minimal changes that may (but should not) affect existing data or code are the following: 1. We removed the rdfs:range statement for gr:legalName. It used to be xsd:string, but that prevented someone from indicating the natural language, since the language tag can only be attached to untyped RDF literals. 2. We changed the domain of gr:validFrom and gr:validThrough to the union of gr:Offering, gr:OpeningHoursSpecification, and gr:PriceSpecification instead of just gr:Offering and gr:UnitPriceSpecification. This allows specifying the validity interval of offers, opening hours, payment charge specifications, and delivery charge specifications. All existing data remains valid. It is possible that you have to relax any domain validation heuristics that you apply. Acknowledgments: === Special thanks go to Andreas Radinger for reviewing the specification and handling the deployment, and to Andreas Stolz for writing the Python script and templates for the language reference. Our script uses the RDFlib library for parsing and handling RDF and the Jinja template engine for generating the HTML document. The service update in particular and the current state of GoodRelations in general would have been impossible without the valuable feedback and suggestions from many individuals. In alphabetical order, we would like to thank Daniel Bingel, Andreas Harth, Kingsley Idehen, Jay Myers, Markus Linder, Peter Mika, Andreas Radinger, Martin Schliefnig, Alex Stolz, Jamie Taylor, Giovanni Tummarello, Jon Udell, and Andreas Wechselberger. Best Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe:
eClassOWL 5.1.4 Ontology for Products and Services
Dear all: eClassOWL, initially released in 2004, has since been the most comprehensive and mature Web ontology for types of products and services. It defines classes for more than 30,000 product types and more than 5,000 properties for product features. Under the hood, eClassOWL is a non-trivial transformation of the e...@ss standard [1]. eClassOWL complements the GoodRelations ontology [2] in the sense that - eClassOWL can be used to describe the product type and features, and - GoodRelations can be used to describe the offer and other commercial aspects. We just replaced the pretty dated documentation and primer Web page at http://www.heppnetz.de/projects/eclassowl/ so that it better explains the interplay with GoodRelations etc. and is based on the current state of Web engineering. The full theoretical background is described in [3] and [4]. Notes: = - eClassOWL 5.1.4 is based on eClass 5.1.4, while the latest version of e...@ss is now 6.1. - We are discussing an official release of eClassOWL 6.x these days, but it may take a while. - Due to backward compatibility issues, eClassOWL 5.1.4 still uses hash URIs and is pretty large (in total 60,000 classes, 30 - 70 MB file size depending on the format). As per the nature of hash URI-based ontologies, any application dereferencing a single element will always request the full file. This will change in any future release, but is a limitation of the current state. - Due to licensing issues, eClassOWL 5.1.4 is available freely for research purposes only. Commercial usages will require a bilateral agreement with e...@ss e.V. Acknowledgments: === We would like to thank eClass e.V., in particular Friedhelm Hausmann and Thomas Einsporn, and Jos de Bruijn, doug foxvog, Axel Polleres, and Amit Sheth for their kind support and feedback over the seven years of the project. The work on eClassOWL was partly funded by Florida Gulf Cost University, by the European Commission under the project DIP (FP6 - 507483), and by the Trans IT Entwicklungs- und Transfercenter at the University of Innsbruck. Best wishes Martin Hepp and Andreas Radinger [1] http://www.eclass-online.com/ [2] http://purl.org/goodrelations/ [3] Hepp, Martin; de Bruijn, Jos: GenTax: A Generic Methodology for Deriving OWL and RDF-S Ontologies from Hierarchical Classifications, Thesauri, and Inconsistent Taxonomies, Proceedings of the 4th European Semantic Web Conference (ESWC 2007), June 3-7, 2007, Innsbruck, Austria, Springer LNCS, Vol. 4519, pp. 129-144, 2007. PDF: http://www.heppnetz.de/files/hepp-de-bruijn-ESWC2007-gentax-CRC.pdf [4] Hepp, Martin: Products and Services Ontologies: A Methodology for Deriving OWL Ontologies from Industrial Categorization Standards, in: Int'l Journal on Semantic Web Information Systems (IJSWIS), Vol. 2, No. 1, pp. 72-99, January-March 2006. PDF: http://www.heppnetz.de/files/IJSWIS-eclassOWL-APA-Style-2005-final-11-17-Web.pdf -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
Preview of the New GoodRelations Language Reference
Dear all: Over the last weeks, we have been working on a much improved documentation of the GoodRelations vocabulary for e-commerce [1]. Our main goal was to provide a more readable, more accessible official specification. Please find a preview here: http://www.heppnetz.de/ontologies/goodrelations/20100401/v1.html This will replace the current document at http://purl.org/goodrelations/v1.html shortly. Please note that the ontology itself did not change, we just updated the rendering and organization of the HTML document. Also, this is not meant to replace the GoodRelations Primer [2], which is a tutorial-style introduction to using GoodRelations, and the GoodRelations Cookbook [3], which contains recipes for common scenarios. Any feedback would be greatly appreciated! Thanks to Alex Stolz and Andreas Radinger for their hard work on that! Best wishes Martin Hepp [1] http://purl.org/goodrelations/ [2] http://www.heppnetz.de/projects/goodrelations/primer/ [3] http://www.ebusiness-unibw.org/wiki/GoodRelations#CookBook:_GoodRelations_Recipes_and_Examples -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
Re: Generate RDFa with Epiphany
Dear Benjamin, Excellent! If you want me to review any of the resulting patters, please just send me the URI of a sample result. For training the system, it's maybe better to use a native reference data set, e.g. this one: http://www.ebusiness-unibw.org/wiki/Rdfa4google By the way, you can also use the GoodRelations Validator (beta) to check your output http://www.ebusiness-unibw.org/tools/goodrelations-validator/ It runs about 20 queries (and counting ;-) against the data. It accepts and RDFa or RDF/XML resource with GoodRelations data. Best wishes Martin Benjamin Adrian wrote: Dear Martin, The RDF extraction service SCOOBIE behind Epiphany can be configured or let's say trained with any RDF model that describes a certain domain. I'm working on extedning the Epiphany web service to let user define which model (even their own) to take for annotating web pages with Epiphany. So, to support Good Relations, we just will have to give it an RDF model such as it is available at the Amazon sponger data. But beside Epiphany I will try to configure SCOOBIE with a small GoodRelations training set in the next days and keep you up-to-date. Best regards, Ben Martin Hepp (UniBW) schrieb: Hi Benjamin, Nice - can you create GoodRelations (http://purl.org/goodrelations/) patterns in RDFa for existing shop pages, e.g. identify price and product information? Just spotting the product name, description, EAN/UPC code, and price would already very valuable. Best Martin PS: Did you see the Amazon sponger data, e.g. at http://linkeddata.uriburner.com/about/html/http://linkeddata.uriburner.com/about/id/entity/http/www.amazon.com/exec/obidos/ASIN/0596518552 Benjamin Adrian wrote: Hi everyone! Let me introduce the RDFa annotator Epiphany: It uses configurable domain-specific Linked Data to enrich web pages with RDFa annotation, automatically. These annotations link text passages to instances inside the Linked Data model. Hovering an annotation with your mouse opens a lighting box with additional information from the RDF graph behind the instance's HTTP URI. Epiphany runs at: http://projects.dfki.uni-kl.de/epiphany/ On the top right you'll find an example. Under http://projects.dfki.uni-kl.de/epiphany/form, you can write your own text and receive RDFa content. Currently, the underlying Linked Data model is a subset of DBpedia covering German politics. In later versions you will be able to upload or link your own Linked Data model to annotate web pages with your own domain specific RDFa. Please don't hesitate in giving me your comments :). Twitter hashtag is #RDFEPIPHANY Regards Ben -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
Re: [Freebase-experts] ISBNs, owl:sameAs, etc
hi ross, apologies for the delay! as long as both URIs represent a book title, i have no objections. my comment was a rather generic warning - URNs and non-URN URIs can of course be linked by owl:sameAs. intelligence of a human or machine agent depends on the subtleness and quality of that agent's category system. if everything is sameAs to lots of other things, then the amount of intelligent conclusions you can draw from the LOD cloud will be limited. best martin Ross Singer wrote: Martin, Given that there's apparently considerable disagreement here (and probably plenty of confusion), can you elaborate on what exactly you object to in this request? The first objection I saw raised was by Tom Morris (is a FreeBase topic really a book?) and agree that urn:isbn:9780670063260 is not the same thing as http://rdf.freebase.com/ns/en.on_the_road. However, I'm not sure I see that distinction between urn:isbn:9780670063260 and http://rdf.freebase.com/ns/soft.isbn.9780670063260.best as they're both trying to identify an edition (or, loosely, a frbr:Manifestation) of a work. My questions would be: 1) If the latter relationship is not a good fit for owl:sameAs, why? 2) if #1 is true, what is a better alternative? I understand the rationale of not wanting to water down the semantics of owl:sameAs to the point that it's little more than rdfs:seeAlso with louder voice. At the same time, it's difficult to see if your issue is with a type mismatch between a URN and a URI; how FreeBase models their data; or general mutually assured destruction-ness of owl:sameAs. Thanks, -Ross. On Mon, Jan 4, 2010 at 4:22 PM, Martin Hepp (UniBW) martin.h...@ebusiness-unibw.org wrote: dear all: quite clearly, a book title is not the same as a book, and if the linked data community continues to link apples and oranges via owl:sameAs then either - the socially agreed meaning owl:sameAs is being degraded to untyped href in HTML or - vast amount of data in the linked data cloud turns useless but as a minimum precaution against spoiling the graph of data, put your sameAs statements into separate graphs so that meaningful applications can filter them out easily. best martin Kingsley Idehen wrote: Daniel O'Connor wrote: On Tue, Dec 29, 2009 at 6:02 PM, Tom Morris tfmor...@gmail.com mailto:tfmor...@gmail.com wrote: But is the description of the book (ie Freebase topic) really the same thing as the book? Assuming it was, wouldn't you not only have to make the sameAs assertions and publish them someplace, but also get them loaded into sameas.org http://sameas.org? Tom On Tue, Dec 29, 2009 at 1:35 AM, Shawn Simister narphor...@gmail.com mailto:narphor...@gmail.com wrote: Are you just looking for a script that can make the 'same as' assertions like this? http://rdfbooks.freebaseapps.com/ns/soft.isbn.9780670063260.best Ah hah! Now how to (nicely) get that indexed... Make a linkset and publish it via: 1. A data dump 2. SPARQL endpoint 3. Your Linked Data Space . Then you have your index, and others can make their variants by meshing their data with yours :-) Re. DBpedia, we can add the linkset to its own Named Graph as we've done with other linksets. -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009 -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http
Re: [Freebase-experts] ISBNs, owl:sameAs, etc
dear all: quite clearly, a book title is not the same as a book, and if the linked data community continues to link apples and oranges via owl:sameAs then either - the socially agreed meaning owl:sameAs is being degraded to untyped href in HTML or - vast amount of data in the linked data cloud turns useless but as a minimum precaution against spoiling the graph of data, put your sameAs statements into separate graphs so that meaningful applications can filter them out easily. best martin Kingsley Idehen wrote: Daniel O'Connor wrote: On Tue, Dec 29, 2009 at 6:02 PM, Tom Morris tfmor...@gmail.com mailto:tfmor...@gmail.com wrote: But is the description of the book (ie Freebase topic) really the same thing as the book? Assuming it was, wouldn't you not only have to make the sameAs assertions and publish them someplace, but also get them loaded into sameas.org http://sameas.org? Tom On Tue, Dec 29, 2009 at 1:35 AM, Shawn Simister narphor...@gmail.com mailto:narphor...@gmail.com wrote: Are you just looking for a script that can make the 'same as' assertions like this? http://rdfbooks.freebaseapps.com/ns/soft.isbn.9780670063260.best Ah hah! Now how to (nicely) get that indexed... Make a linkset and publish it via: 1. A data dump 2. SPARQL endpoint 3. Your Linked Data Space . Then you have your index, and others can make their variants by meshing their data with yours :-) Re. DBpedia, we can add the linkset to its own Named Graph as we've done with other linksets. -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
Re: Generate RDFa with Epiphany
Hi Benjamin, Nice - can you create GoodRelations (http://purl.org/goodrelations/) patterns in RDFa for existing shop pages, e.g. identify price and product information? Just spotting the product name, description, EAN/UPC code, and price would already very valuable. Best Martin PS: Did you see the Amazon sponger data, e.g. at http://linkeddata.uriburner.com/about/html/http://linkeddata.uriburner.com/about/id/entity/http/www.amazon.com/exec/obidos/ASIN/0596518552 Benjamin Adrian wrote: Hi everyone! Let me introduce the RDFa annotator Epiphany: It uses configurable domain-specific Linked Data to enrich web pages with RDFa annotation, automatically. These annotations link text passages to instances inside the Linked Data model. Hovering an annotation with your mouse opens a lighting box with additional information from the RDF graph behind the instance's HTTP URI. Epiphany runs at: http://projects.dfki.uni-kl.de/epiphany/ On the top right you'll find an example. Under http://projects.dfki.uni-kl.de/epiphany/form, you can write your own text and receive RDFa content. Currently, the underlying Linked Data model is a subset of DBpedia covering German politics. In later versions you will be able to upload or link your own Linked Data model to annotate web pages with your own domain specific RDFa. Please don't hesitate in giving me your comments :). Twitter hashtag is #RDFEPIPHANY Regards Ben -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
ANN: New GoodRelations and Generic RDF/RDFa Tools
(Apologies for cross-posting) Dear all: In the past year, we released a lot of tools in the context of the GoodRelations vocabulary (http://purl.org/goodrelations/) that may be of wider interest. All tools are available on-line and most are also released as sourcecode under LPGL. 1. Generic Tools a) RDF2DataRSS: === Yahoo accepts RDF only as either RDFa embedded into existing Web pages or via their proprietary DataRSS feed format. The RDF2DataRSS tool turns any RDF data submitted as RDF/XML or Turtle/N3 into such Yahoo feeds and can be used to submit RDF/XML into Yahoo's SearchMonkey index. http://www.ebusiness-unibw.org/tools/rdf2datarss/ b) RDF2RDFa converter: == This tool creates handy snippets of invisible RDFa from RDF/XML content such that the RDF can be pasted easily into any existing page without hard-wiring it with the HTML page mark-up. http://www.ebusiness-unibw.org/tools/rdf2rdfa/ 2. Tools for Creating GoodRelations Data a) Google Product Feed Converter: = Many shops are already exposing their product and price information in one of the proprietary Google feed formats (RSS 1.0, 2.0 / CSV-based). With our tool, you can generate GoodRelations RDF/XML data from that data. That can be particularly interesting for adding RDF export to existing shops easily. http://www.ebusiness-unibw.org/tools/google-product-feed-converter/ b) BMEcat2GoodRelations converter: == BMEcat is a popular XML Schema Definition (and DTD) for exchanging catalog data between enterprises in B2B settings. Many PIM, shop, and ERP software packages can create BMEcat XML documents. This tool transforms a BMEcat catalog into a GoodRelations-variant in RDF/XML. It can reuse / preserve more features than any other tool. http://www.ebusiness-unibw.org/tools/bmecat2goodrelations/ By the way, a free tool for creating BMEcat catalogs is at http://www.eclass.de/user/software/democat_2_0.zip (in German only, unfortunately). c) GoodRelations Annotator: === A straightforward form-based Web application that creates a detailed company / business profile in RDFa and RDF/XML. http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ d) Plug-ins / Extensions for Shop Software: === * osCommerce Shop Software: http://code.google.com/p/goodrelations-for-oscommerce/ * Joomla/Virtuemart CMS/Shop combo: http://code.google.com/p/goodrelations-for-joomla/ * Magento shop software: Contact Uwe Stoll, http://www.semantium.de/ (demo shop at http://www.la-mousson.de/) 3. GoodRelations-compliant Domain Ontologies for Vertical Industries a) ClassOWL 5.1.4 Products and Services Ontology * OWL or HTML: http://www.ebusiness-unibw.org/ontologies/eclass/5.1.4/ (Warning: The file is very large - 38.3 MB for RDF/XML and 71.8 MB for HTML): * ZIP: OWL+HTML: http://www.ebusiness-unibw.org/ontologies/eclass/5.1.4/eclass_514en.zip (eClassOWL 5.1.4 ontology and documentation, compressed - OWL + HTML, zip, 4.7 MB): b) freeClassOWL - Ontology of Construction and Building Materials = * http://www.freeclass.eu/freeclass_v1 (add .html / .owl if content negotiation should fail) c) Consumer Electronics Ontology (CEO) == * http://www.ebusiness-unibw.org/ontologies/consumerelectronics/v1 (add .html / .owl if content negotiation should fail) 3. Applications for Inspecting and Consuming GoodRelations Data a) iGoogr: Imagine Google was using the GoodRelations vocabulary for e-commerce = A quick demo of how Google search results could be improved by GoodRelations meta-data; also useful for inspecting your own page, because it fetches and parses RDFa in real-time. http://igoogr.appspot.com/ b) GoodRelations Statistics === A tool that collects and exposes detailed data on the popularity of GoodRelations elements in the wild (with RDFa export of the stats!). http://goodrelations-stats.appspot.com/ c) GoodRelations Validator (alpha) == A tool that spots semantic inconsistencies in GoodRelations data beyond the axioms of the ontology. http://www.ebusiness-unibw.org/tools/goodrelations-validator/ Much more is on our agenda for 2010, so please stay tuned by * bookmarking http://www.ebusiness-unibw.org/wiki/GoodRelations and * subscribing to http://ebusiness-unibw.org/cgi-bin/mailman/listinfo/goodrelations . Best wishes Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax:
Re: Ontology Wars? Concerned
There are lots of trade-offs when designing an ontology, e.g. specificity vs. size of the target user community - this has e.g. been discussed in Hepp, Martin: Possible Ontologies: How Reality Constrains the Development of Relevant Ontologies, in: IEEE Internet Computing, Vol. 11, No. 1, pp. 90-96, Jan-Feb 2007. A PDF is at http://www.heppnetz.de/files/IEEE-IC-PossibleOntologies-published.pdf Martin Paul Houle wrote: On Wed, Nov 18, 2009 at 2:23 PM, Nathan nat...@webr3.org wrote: I'm finding the path to entry in to the linked open data world rather difficult and confusing, and only for one specific reason - ontologies; it /feels/ like there are some kind of ontology wars going on and I can never get a definitive clear answer. An ontology war is preferable to the alternative: the one ring that rules them all. If you're trying to develop an ontology for topic X, it's usually easy to make one that's good but obviously not perfect: let's say, 95% correct. You need to cross an uncanny valley in the attempt to go from 95% to 100%, and often things get worse rather than better. This is one of the reasons why Cyc is perceived as a failure: although it was trying to model the common sense knowledge that we all share, the actual structures in Cyc that try to represent everything in a consistent way are bizzare, counterintuitive and certainly not representative of how people think, no matter how correct they may be. People don't have a completely consistent taxonomy of the world either; they have models of different parts of reality that they'll mesh when they need to mesh them. My 94% correct version of topic X might be great for what I'm doing w/ topic X and your 96% version is great for what you're doing. Trying to build one system that's perfect might result in something that's not as good for what we're doing... But in the long term we do need tools that let us mesh these easily. SPARQL + OWL can take us part of the way in that direction, but really, we need something better in that direction, largely because of the many almost the same as relationships that are out there... -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
ANN: GoodRelations Annotator 2.0 + New Webcast
Dear all: We are happy to release a significantly improved version of the GoodRelations Annotator, a tool that helps any business in the world to create a rich description of its line of business, opening hours, payment options, and the like in RDF. The new version creates a handy RDFa snippet of the data. Now, it is sufficient to copy and paste that snippet right before the closing body elements of the main Web page. This will work for anybody who can see and access a page's source code,i.e. even in Content Management Systems or Wikis, etc. To our knowledge, this is the easiest approach of creating rich business descriptions for the Web of Linked Data. Please check it out! And please spread the word to any site owner in your address book: http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ A step-by-step tutorial of 13 minutes complements the tool and is available at http://vimeo.com/7583816 By the way, once you will have published respective meta-data, you can check the improved appearance in future search engines immediately using the http://igoogr.appspot.com/ application. Simply enter search terms that will bring up the respective page among the top five hits in Google, and you will see the effect of GoodRelations within seconds. Best wishes Martin Hepp -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009 attachment: martin_hepp.vcf
Pre-release: New GoodRelations Annotator - Simple RDFa snippets for copy-and-paste
Dear all: We are in the process of finalizing an improved version of the GoodRelations Annotator Tool. With that tool, any company can quickly create a rich description of its line of business, opening hours, payment options, and the like. In the past, publishing the resulting RDF data proved to be too difficult for many users - or impossible due to the lack of access privileges. Now, based on our RDF2RDFa component, the resulting rich meta-data is turned into a simple 20+ lines RDFa snippet that can be easily copied into your main Web page, directly before the closing /body element. This will work for anybody who can see and access a page's source code, i.e. even in Content Management Systems or Wikis, etc. Please check it out and provide feedback: http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ By the way, once you have published respective meta-data, you can check the improved appearance in future search engines directly using the http://igoogr.appspot.com/ application. Simply enter search terms that bring up your respective page among the top five hits in Google, and you will see the effect of GoodRelations within seconds. Best wishes Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_IEEE_CEC%2709
Re: Need help mapping two letter country code to URI
Hi Aldo, Note that there are multiple branches of the ISO 3166 familiy of codes. See pages 23 and 24 of the GoodRelations Technical Report (http://www.heppnetz.de/projects/goodrelations/GoodRelations-TR-final.pdf) for a more detailed discussion. I am still not aware of any authoritative URI schema for ISO 3166, which is why GoodRelations uses string literals for that code. The key ISO page http://www.iso.org/iso/country_codes.htm does also not refer to any established http or URN URI schema for the ISO 3166 family of codes. I assume that dbPedia URIs may be well suited, but they are not as authoritative. If they have ISO 3166 codes attached via properties, entity consolidation on that basis may be relatively simple. Below, please find an excerpt from the discussion re identifiers for countries in the GoodRelations Technical Report: Country or Region ... GoodRelations could reuse several approaches for ontologies of regions and places for specifying Countries and Regions. However, we suggest a more pragmatic approach of reusing the ISO Standard 3166, in particular ISO 3166-1 (ISO, 2006) and ISO 3166-2 (ISO, 1998). The first defines 2- or 3-letter identifiers for existing countries and a few independent geopolitical entities. ISO 3166-1 alpha-2 defines 2-letter codes for most countries. There exist alternative standards with 3-letter codes and a numerical representation. For the following reasons, we suggest using the 2-letter codes: First, they are well established and people are likely more familiar with them (they are also used for most top-level domains). Second, and more important, the 2-letter variant is the basis for ISO 3166-2, which breaks down the countries from ISO 3166-1 into administrative subdivisions (ISO, 1998). The code elements used in ISO 3166-2 consist of “the alpha-2 code element from ISO 3166-1 followed by a separator and a further string of up to three alphanumeric characters e. g.” (from: http://www.iso.org/iso/en/prods- services/iso3166ma/04background-on-iso-3166/iso3166-2.html). This allows using simple string operations on the respective ISO 3166 codes in order to handle administrative subdivisions. For example, if a certain Offering is said to be valid for Canada (ISO 3166-1 two-letter code “CA”), then one can infer that any longer search string specifying an administrative subdivision of Canada (e.g. British Columbia, ISO 3166-2 “CA-BC”) is also an eligible region. Examples: Canada (CA), Austria (AT), Canada: British Columbia (CA-BC), Italy (IT), Italy: Province of Milano (IT-MI) Note: More complex modeling of Countries and Regions may be useful in some scenarions, and GoodRelations can be imported and extended if necessary. However, most offerings on the Web contain statements on the level of countries only, for which ISO 3166-1 is sufficient and very common. Martin Aldo Bucchi wrote: Hi, I found a dataset that represents countries as two letter country codes: DK, FI, NO, SE, UK. I would like to turn these into URIs of the actual countries they represent. ( I have no idea on whether this follows an ISO standard or is just some private key in this system ). Any ideas on a set of candidata URIs? I would like to run a complete coverage test and take care I don't introduce distortion ( that is pretty easy by doing some heuristic tests against labels, etc ). There are some border cases that suggest this isn't ISO3166-1, but I am not sure yet. ( and if it were, which widely used URIs are based on this standard? ). Thanks! A -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_IEEE_CEC%2709 attachment: martin_hepp.vcf
Re: The Power of Virtuoso Sponger Technology
Guys, the Web of Data cannot rely on mass data crawling of the whole Web but must combine cached data with federated on-demand queries. Structured data requires much faster update cycles than typical text-based Web indices. For example, Google and Yahoo can rely on the fact that http://www.cnn.com; is relevant for news. Such will not change within minutes. And both Google and Yahoo need up to several weeks to visit your page again. When it comes to structured price and availability information, your data may become outdated within hours, if not seconds. Think of eBay auctions, hotel or flight availability, etc. So it will boil down to technology that combines (1) crawling and caching rather stable data sets with (2) distributing queries and parts of queries among the right SPARQL endpoints (whatever actual DB technology they expose). You can keep a text index of the whole Web, if crawling cycles in the order of magnitude of weeks are fine. For structured, linked data that exposes dynamic database content, dumb crawling and caching will not scale. If the DB technology is able to involve the right set of endpoints for parts of the query, why would you need a complete replication of all databases in the world inside one huge repository? That repository will be a million-node cluster anyway. Why not directly use the millions of nodes that provide the data and cache just the endpoint meta-data? Martin Giovanni Tummarello wrote: With respect to crawling and scraping or sponging or .. trying to guess based on partial fragments of structured information i can say 3 thngs a) No, we're not doing it at the moment, we are only covering those who chose to put structured semantics. Some book stuff shows up in Sig.ma .. e.g. http://sig.ma/search?q=frank+van+harmelensources=100 bookfinder, our jerome digital library installation, but the triplees they provide are scarce and dont contribute much. It would take so little for this to improve on their side i believe. b) No, we are not religious about this. We have talked about it several times, it might make sense to try to understand as much as the web as possible and index it. Maybe we'll do it in the future for selected fractions of the web to show how it looks c) crawling should be just one mean of acquiring the semantic web. in case of bestbuy or other large retailers where prices change possibly everyday crawling as a mean to emulate a simple.. call to a web service seems really not the smart thing to do. Will data providers really support with data dumps? cheers Giovanni On Sat, Oct 17, 2009 at 3:32 PM, Juan Sequeda juanfeder...@gmail.com wrote: But Sindice could at least crawl Amazon. It would be great to use sig.ma to create a meshup with the amazon data. Juan Sequeda, Ph.D Student Dept. of Computer Sciences The University of Texas at Austin www.juansequeda.com www.semanticwebaustin.org On Sat, Oct 17, 2009 at 9:28 AM, Martin Hepp (UniBW) h...@ebusiness-unibw.org wrote: I don't think so, because this would require that Sindice crawled the whole regular web and checked the Spongers for each URL (sic!). Juan Sequeda wrote: Does Sindice crawl this (or any other semantic web search engines)? Juan Sequeda, Ph.D Student Dept. of Computer Sciences The University of Texas at Austin www.juansequeda.com www.semanticwebaustin.org On Sat, Oct 17, 2009 at 4:24 AM, Martin Hepp (UniBW) h...@ebusiness-unibw.org wrote: Dear all: I just found out that the Virtuoso Sponger technology is even more powerful than I thought. Briefly: Spongers create rich GoodRelations (and other RDF) meta-data for existing Web pages on-the-fly. Other than traditional screen-scraping approaches, Spongers reuse public APIs and other techniques, so the data is of unprecedented degree of structure. Now, this can be directly used in arbitrary queries... by simply using the URI of the *existing* HTML Web page in the FROM clause of a SPARQL query. Example: http://www.amazon.com/Semantic-Web-Real-World-Applications-Industry/dp/0387485309 is a Web page in plain HTML offering a book. Amazon does not yet produce GoodRelations meta-data on their pages. If you go to http://uriburner.com/sparql and paste the URI in the Default Graph URI field and select Retrieve remote RDF for all missing source graphs, then a query like SELECT * WHERE {?s ?p ?o} LIMIT 50 returns a fully-fledged GoodRelations description for that page - as if Amazon was already supporting GoodRelations for each of its 4 million items! There are spongers for BestBuy, eBay, Zillow, and many other types of resources. Wow! Congrats to Kingsley and his team! Best wishes Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness
Re: Breaking News: GoodRelations data now shows up in Yahoo!
Hi David, Daniel O'Connor wrote: http://goodrelations.doconnor.user.dev.freebaseapps.com/ Freebase data being rendered as Good Relations (Or Barbie and Ken's Semantic Web Playset) thanks for the initiative - very valuable! What's the best way to validate this / check it would show up in Yahoo search results? You can use http://developer.search.yahoo.com/help/objects/product A few comments as for the data: What you find in Freebase are likely gr:ProductOrServiceModel instances, not offers. So you should create instances of gr:ProductOrServiceModel for each Product in Freebase first. Those define the properties of the model - e.g. that what you would usually find in a manufacturer's datasheet: - description - image - EAN/UPC - weight etc. Note that the price is not a feature of the product model, but a property of one specific offer to sell such objects, i.e. a gr:Offering. (I assume there will be way more product models in Freebase than those which you find currently, and it could be that querying for a price is the reason.) If you have a business entity and a price, you could also add a gr:Offering etc. as you are doing right now. But note that Mattel will often not sell individual barbie dolls to end users at the suggested retail price. So the offer must be constrained to resellers. And then you don't have price... This is why I would suggest to limit the export to the model data. Those can be linked in the LOD cloud to actual offers, e.g. from BestBuy or from eBay (via OpenLink's new eBay sponger). So the basic structure should be a) Model data (Datasheets) foo:Barbie1234 a gr:ProductOrServiceModel; rdfs:label blabla@en; rdfs:comment blabla@en; gr:hasEAN_UCC-13 1234567890123^^xsd:string; gr:hasManufacturer foo:Mattel. #etc. foo:Mattel a gr:BusinessEntity. gr:legalName Mattel Toys Inc.@en. #etc. You can add the statement that Mattel also offers individuals of that type: foo:Offer a gr:Offering; gr:includes foo:SomeBarbie1234s. foo:Mattel gr:offers foo:Offer. foo:SomeBarbie1234s a gr:ProductOrServicesSomeInstancesPlaceholder; rdfs:label blabla@en; rdfs:comment blabla@en; gr:hasEAN_UCC-13 1234567890123^^xsd:string; gr:hasMakeAndModel foo:Barbie1234. #etc. But then you should not attach a UnitPriceSpecification. For your reference, I add a list of properties for gr:Offering, gr:ProductOrServiceModel, and gr:ProductOrServicesSomeInstancesPlaceholder. Also, I recommend the UML diagram at http://www.ebusiness-unibw.org/wiki/File:Goodrelations-UML-2009-07-18.pdf gr:Offering owl:includes owl:hasBusinessFunction owl:availableDeliveryMethods owl:eligibleCustomerTypes owl:includesObject owl:availableAtOrFrom owl:hasPriceSpecification owl:hasWarrantyPromise owl:acceptedPaymentMethods owl:eligibleRegions owl:hasEAN_UCC-13 owl:hasGTIN-14 owl:hasStockKeepingUnit owl:validFrom owl:validThrough gr:ProductOrServiceModel specific: none inherited: owl:isAccessoryOrSparePartFor owl:qualitativeProductOrServiceProperty owl:isSimilarTo owl:isConsumableFor owl:quantitativeProductOrServiceProperty owl:hasManufacturer owl:datatypeProductOrServiceProperty owl:hasEAN_UCC-13 owl:hasGTIN-14 owl:hasStockKeepingUnit gr:ProductOrServicesSomeInstancesPlaceholder specific: owl:hasInventoryLevel owl:hasMakeAndModel inherited: owl:isAccessoryOrSparePartFor owl:qualitativeProductOrServiceProperty owl:isSimilarTo owl:isConsumableFor owl:quantitativeProductOrServiceProperty owl:hasManufacturer owl:datatypeProductOrServiceProperty owl:hasEAN_UCC-13 owl:hasGTIN-14 owl:hasStockKeepingUnit Best Martin -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey
Useful rules axioms for GoodRelations
Dear all: I just finalized a Wiki page on useful rules axioms for GoodRelations: http://www.ebusiness-unibw.org/wiki/GoodRelationsOptionalAxiomsAndLinks The page now contains recommended default rules for - product models, - product model variants, and - the gr:includes shortcut. Any feedback is greatly appreciated. Best wishes Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_IEEE_CEC%2709
OpenLink Software brings full power of GoodRelations BestBuy and others
Dear all: I just found out that Kingsley Idehen released a major update of Virtuoso sponger cartridges that create and collate virtual meta-data for existing Web pages. See http://tr.im/bestbuygr2 for an example. The cool thing is - The data does not come from BestBuy directly; it is collected and collated on the fly from APIs and other resources (i.e. this is not yet the original BestBuy data). - It used the full level of detail of the GoodRelations ontology for e-commerce, http://www.ebusiness-unibw.org/wiki/GoodRelations (not just 1 -2 properties). - It exposes both price information and technical features in GoodRelations. - If you request an XHTML version of the page, you get the meta-data in RDFa for further reuse and combination in novel scenarios. There are a few very minor issues with the data; we already provided respective feedback to Openlink SW. But all should be quick fixes. I am confident that this is a major step for a Web of Linked Data, because the same approach can bring unprecedented amounts of highly detailed GoodRelations data for existing Web content. Well done, Kingsley! Best Martin -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearchMonkey: http://tr.im/rAbN Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Talk at Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://tr.im/grcec09 attachment: martin_hepp.vcf
Re: Yahoo RDFa enahced results example
Hi Eugenio: I think I'm missing the point, Yahoo announced the first support for microformats more or less one year and half ago, and for RDFa one year ago (http://developer.yahoo.net/blog/archives/2008/09/searchmonkey_support_for_rdfa_enabled.htmlhttp://developer.yahoo.net/blog/archives/2008/09/searchmonkey_support_for_rdfa_enabled.html ), am I wrong? Yahoo announced the support for RDFa in the SearchMonkey index last autumn. That meant, they started crawling RDFa on the basis of a set of selected vocabularies (foaf, sioc, goodrelations, ...) and make the resulting meta-data available for Yahoo-internal and other applications via the extra SearchMonkey index. Since then, application developers have been able to access that index for their own software or build own search engines with the Yahoo BOSS framework. For example, we wrote a small demo app that mimics Yahoo search and shows the RDFa meta-data for any page as in the Yahoo index, if any such data is available: http://goodrelations-search.appspot.com/ (Note that the N3 output has quite some quirks as compared to what PyRDFa extracts; there is a bug at Yahoo's side, but they are working on that). In a nutshell: Until recently, SearchMonkey meta-data was visible only - inside BOSS-applications or - if you were signed in to Yahoo AND activated an extension from the Yahoo SearchMonkey gallery or - if you were signed in to Yahoo AND wrote your own extension based on SearchMonkey meta-data. Now, very recently Yahoo announced that SearchMonkey meta-data for a couple of typical page contents, namely GoodRelations-based price information, would be used to improve the rendering of the general Yahoo search results pages - i.e., for anybody, without the need to use any extra software or signing in. That is a major innovation because it creates an incentive for any business in the world to use GoodRelations and other vocabularies for augmenting their pages. I hope that clarifies the issue. Martin Eugenio Tacchini wrote: At 16.15 25/09/2009 +0200, you wrote: Dear Juan: Juan Sequeda wrote: So I guess there isn't an answer to Eugenio's question then. The answer is: I don't know a URI that, as of today, already appears in Yahoo with more details based on RDFa mark-up, but that will change very soon. If Peter has one, I would be happy to kearn of it, of course. Note that - is was only something like five weeks ago that Yahoo officially activated the enhanced display functionality and - that the current re-crawling/update cycles at all major search engines are in the order of magnitude of 2 - 8 weeks, depending on your page popularity. I think I'm missing the point, Yahoo announced the first support for microformats more or less one year and half ago, and for RDFa one year ago (http://developer.yahoo.net/blog/archives/2008/09/searchmonkey_support_for_rdfa_enabled.htmlhttp://developer.yahoo.net/blog/archives/2008/09/searchmonkey_support_for_rdfa_enabled.html ), am I wrong? E. -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearchMonkey: http://tr.im/rAbN Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Talk at Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://tr.im/grcec09 attachment: martin_hepp.vcf
Re: Yahoo RDFa enahced results example
Hi Juan, Juan Sequeda wrote: Gotcha! Now I understand perfectly. I'm trying to get the local businesses in Austin to add RDFa. However, I have nothing tangible to show them. IMHO, the best way to convince business to do this is if you go through the SEO people. But until we don't see Yahoo (and Google) taking advantage of RDFa to enhance results, the business won't go through the hassle of adding RDFa. Yes and no ;-) Yes, as far as the SEOs are concerned: I will be presenting GoodRelations + RDFa at SES 2009 in Chicago, likely one of the most important SEO events, and there is already give some interest among SEO experts in GoodRelations. SEOs will have a huge market opportunity for helping companies optimize their GoodRelations markup. It will turn SEO from the art of improving a rank to the science of minimizing the search effort for a very specific target audience. No, as far as Yahoo and Google are concerned: Their current moves are into the right direction, but in my opinion much too slow and cautious. Additional details in Google and Yahoo search results are nice for convincing a local business to create a bit of GoodRelations markup. But that would be exploiting only 1% of the business potential. Getting the remaining 99% of the cake will require different technology approaches and business models. I guess they will smell the cheese and increase their investment fundamentally very soon. So it may be good for a small business to see a bit of extra data in Google, Yahoo, and Bing. But the real target applications will be more fundamental innovations. *One final question. Yahoo crawls all vocabularies while Google only crawls their vocabulary, right? To my knowledge, both crawl only a predefined list of vocabs. Fortunately, Yahoo crawls standard vocabs, Google invented their own Austin is the live music capital of the world, so imagine the amount of music and event data on websites. If I use the music ontology to mark up the data, will Yahoo crawl this and potentially use it in their search results? You have to ask Yahoo :-) What is the best vocabulary for events (venue, time, description, price)? For events, I don't know. There is an austrian initiative, but it is still pretty much alpha. As for the price: GoodRelations. Because, again an important distinction: It is not the event that has a price - it is a ticket (permission) to attend the event that has a price ;-) There will be a respective recipie at http://www.ebusiness-unibw.org/wiki/GoodRelationsTickets soon; currently it is a stub... Best Martin -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearchMonkey: http://tr.im/rAbN Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Talk at Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://tr.im/grcec09 attachment: martin_hepp.vcf
Re: Yahoo RDFa enahced results example
Dear Juan: Juan Sequeda wrote: So I guess there isn't an answer to Eugenio's question then. The answer is: I don't know a URI that, as of today, already appears in Yahoo with more details based on RDFa mark-up, but that will change very soon. If Peter has one, I would be happy to kearn of it, of course. Note that - is was only something like five weeks ago that Yahoo officially activated the enhanced display functionality and - that the current re-crawling/update cycles at all major search engines are in the order of magnitude of 2 - 8 weeks, depending on your page popularity. Since most early adopters are not among the pages with the highest page-rank, it is quite natural that it will take a few weeks or months to see the real impact of this innovation. I personally think that http://www.heppnetz.de/searchmonkey/product.html will be among the first pages to showcase the effect, because it was one of the first pages with proper mark-up that went online (and still it took me a few hits to get the mark-up fully Yahoo-compliant). As a side comment: I am reading a bit of impatience from in between the lines of your message. After more than 100 Mio euro of European and US research funding, eight years of research, and a large share of all academic papers in the field with rather limited impact, it would be pretty unfair by anyone from the core Semantic Web research cohort to be impatient over delays in the order of magnitude of weeks with Yahoo SearchMonkey or GoodRelations to deliver. Martin Juan Sequeda wrote: So I guess there isn't an answer to Eugenio's question then. Juan Sequeda, Ph.D Student Dept. of Computer Sciences The University of Texas at Austin www.juansequeda.com www.semanticwebaustin.org On Wed, Sep 23, 2009 at 4:45 AM, Eugenio Tacchini euge...@favoriti.itwrote: At 10.24 21/09/2009 +0200, Martin Hepp (UniBW) wrote: Hi Juan, Eugenio: Where can we see search results in Yahoo that come from the RDF of Bestbuy and Goodrelations? First, note that currently, only the standard Yahoo search results are by default enhanced by structured meta-data, i.e., you do not appear in the local business directory by Yahoo automatically. I spoke with Yahoo about that recently and they said that inclusion in those special Yahoo pages would be a future option, assumed that the amount of data out there is of sufficient quantity and quality. You can check the appearance of a particular page using * wwwurl:your uri as the search parameter at http://www.yahoo.comhttp://www.yahoo.com. [...] Hi Martin and thanks for your reply. My question was probably more basic: I just wanted to know some good examples to show during a seminar I have to give in a faculty of economics. So is there a *real* business Web site (not a demo) which provides RDFa data crawled by Yahoo? I just would like to see the RDFa in the source of the site and the enanched result in Yahoo; it would be ok even without a good relation-enanchement. Cheers, Eugenio -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearchMonkey: http://tr.im/rAbN Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Talk at Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://tr.im/grcec09 attachment: martin_hepp.vcf
Re: Yahoo RDFa enahced results example
Hi Juan, Eugenio: Where can we see search results in Yahoo that come from the RDF of Bestbuy and Goodrelations? First, note that currently, only the standard Yahoo search results are by default enhanced by structured meta-data, i.e., you do not appear in the local business directory by Yahoo automatically. I spoke with Yahoo about that recently and they said that inclusion in those special Yahoo pages would be a future option, assumed that the amount of data out there is of sufficient quantity and quality. You can check the appearance of a particular page using * wwwurl:your uri as the search parameter at http://www.yahoo.com. For example, * wwwurl:http://www.heppnetz.de/searchmonkey/company.html * wwwurl:http://www.heppnetz.de/searchmonkey/product.html If you try that example, you already see the reviews meta-data displayed correctly, but the price and other details are missing. The reason is that the SearchMonkey index is not yet up-to-date; we fixed some bugs in the mark-up, but the crawler has not yet come back. One problem with the SearchMonkey index and the underlying crawler is that, depending on the popularity of you page, the update cycles can be in the order of magnitude of weeks or even months. If you want to check what meta-data is already included in the Yahoo! SearchMonkey index, you can use the tool at http://goodrelations-search.appspot.com/ It is basically a variant of the Yahoo! search service that displays all meta-data that is found in the Yahoo! index. Note: At the time of writing, there are some problems in Yahoo's RDF export, so even though my page has been crawled by Yahoo!, the SeachMonkey index does not return fully correct RDF. Yahoo is aware of the bug. For links etc., see http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey#Congratulations.21 As for BestBuy: BestBuy is currently only exposing RDF/XML, while Yahoo crawls only RDFa. So the BestBuy data is not yet in the SearchMonkey index (but in most other SemWeb indices). There are two solutions to that: a) Convert the RDF/XML into the Yahoo RDF2DataRSS tool at http://www.ebusiness-unibw.org/tools/rdf2datarss/ and feed that into Yahoo using OPML. b) Create RDFa snippets using our toll RDF2RDFa at http://www.ebusiness-unibw.org/tools/rdf2rdfa ans insert that into original pages or a second XHTML presence of the catalog. We are in contact with BestBuy regarding that as a major showcase. Best Martin Juan Sequeda wrote: Good question. Where can we see search results in Yahoo that come from the RDF of Bestbuy and Goodrelations? Juan Sequeda, Ph.D Student Dept. of Computer Sciences The University of Texas at Austin www.juansequeda.com www.semanticwebaustin.org On Sun, Sep 20, 2009 at 12:50 PM, Eugenio Tacchini euge...@favoriti.itwrote: Hi all, maybe I'm a bit OT, anyway: could someone give me a good example of yahoo local business RDFa-enanched result? All the examples I read so far have the same problem: the entity is included in local.yahoo.com too. Example: if I search for gary danko san francisco I'll get a nice structured infobox about the Gary Danko restaurant with overview, reviews, map, pics and so on; but that's because that restaurant is also available in local.yahoo.com (and maybe added manually) and not because http://www.garydanko.com/ http://www.garydanko.com/ provided RDFa, am I right? Maybe other sites provide RDFa for that restaurant and they are the source for local.yahoo.com? If yes, which sites for this example? Thanks! Cheers, E. -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearcMonkey: http://tr.im/rAbN Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://tr.im/grcec09 attachment: martin_hepp.vcf
IT Conversations Podcast Features GoodRelations and RDFa
Dear all: IT Conversations, the longest-running podcast on the planet, featured the GoodRelations vocabulary and RDFa. In a recent episode of the Interviews With Innovators track, Jon Udell interviewed Kingsley Idehen on RDFa and Structured Data. For the podcast, see http://itc.conversationsnetwork.org/shows/detail4233.html For a related blog post by Jon, see http://blog.jonudell.net/2009/09/09/talking-with-kingsley-idehen-about-mastering-your-own-search-index/ For Kingsley's blog, see http://www.openlinksw.com/blog/~kidehen/ Best wishes Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearcMonkey: http://tr.im/rAbN Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://tr.im/grcec09 attachment: martin_hepp.vcf
ANN: BestBuy.com starts publishing full catalog as RDF/XML using GoodRelations - 27 million triples
Dear all: BestBuy.com has just started to serve a complete RDF/XML dump of their products and price information to the Web of Linked Data, using the GoodRelations vocabulary for e-commerce. The data dump is updated on a daily basis and contains detailed descriptions for roughly 450,000 individual items. With about 60 triples per item, this totals to about 27 million RDF triples. Semantic Sitemap: http://products.semweb.bestbuy.com/sitemap.xml Examples: a) Software: http://products.semweb.bestbuy.com/products/8182593/semanticweb.rdf b) Hardgoods: http://products.semweb.bestbuy.com/products/8794691/semanticweb.rdf c) Movies: http://products.semweb.bestbuy.com/products/7590289/semanticweb.rdf d) Games: http://products.semweb.bestbuy.com/products/9223752/semanticweb.rdf Other than many existing large RDF transcripts, the data very dynamic, holding the daily prices for all items. According to Wikipedia, BestBuy.com is the largest specialty retailer of consumer electronics in the United States accounting for 19% of the market. It is likely the first Fortune 500 company to start publishing offer details on the Web of Linked Data. Congratulations to Jay Myers from BestBuy.com for this excellent contribution, and a big thanks to Andreas Radinger and Alex Stolz for their support, Best wishes Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearcMonkey: http://tr.im/rAbN Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://tr.im/grcec09 attachment: martin_hepp.vcf
Re: ANN: BestBuy.com starts publishing full catalog as RDF/XML using GoodRelations - 27 million triples
Hi Daniel: Daniel Schwabe wrote: That is indeed very good and auspicious news. Are there any SPARQL endpoints available? The data is available as RDF/XML files from BestBuy.com, but it will soon be regularly included in the Linked Open Commerce dataspace at http://loc.openlinksw.com/sparql which itself provides an SPARQL endpoint. Martin -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearcMonkey: http://tr.im/rAbN Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://tr.im/grcec09 attachment: martin_hepp.vcf
ANN: RDF2RDFa: Turning RDF/XML into Snippets for Copy-and-Paste
Dear all: We are proud to announce the first prototype of the RDF2RDFa tool, which aims at simplifying the publication of RDF for Web developers. This tool converts arbitrary RDF/XML documents into a block of div/span elements without human-readable content. Such a block of markup can be simply pasted before the closing /body tag of an XHTML Web resource. This allows embedding even sophisticated RDF models into Web pages (even complex OWL models etc.). The tool is available at http://www.ebusiness-unibw.org/tools/rdf2rdfa/ For background information, please see our Technical Report at http://www.heppnetz.de/files/RDF2RDFa-TR.pdf Acknowledgments: This service is provided by Roberto García, from the GRIHO Human-Computer Interaction and Data Integration Research Group at Universitat de Lleida and by Martin Hepp and Andreas Radinger from the E-Business and Web Science Research Group at Bundeswehr University Munich. We would like to thank Mark Birbeck, Kavi Goel, Othar Hansson, Kingsley Idehen, and everybody who contributed to the original discussion on semantic-...@w3.org, for useful feedback on the initial idea, and Andreas Wechselberger for testing our snippets in MediaWiki. Best wishes Martin -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearcMonkey: http://tr.im/rAbN Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://tr.im/grcec09 -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearcMonkey: http://tr.im/rAbN Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://tr.im/grcec09 begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Weg 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: Two-way Search demo anyone?
Side note: I just put a recipe on-line that shows how the very same GoodRelations elements used for describing offers can be used for describing demand with the gr:seeks property. http://tr.im/grseeks BTW, you can combine that with most other recipes from http://tr.im/cookbook For instance, you could attach an image of a car wreck using foaf:depiction and ask for offers to repair (or to dispose ;-) ) that car. Danny Ayers wrote: 2009/7/28 Kingsley Idehen kide...@openlinksw.com: Please have a look at: 1. http://lod.openlinksw.com - LOD Cloud Cache (5 Billion+ triples from the LOD Cloud Data Set collection plus others) 2. http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VirtuosoFacetsWebService -- Service API I've also posted comments to his blog post at: http://www.scripting.com/stories/2009/07/26/twowaySearch.html?dsq=13437902#comment-13437902 Right, I agree this is the key bit of 'Part 2 of the solution' (looser, Google-like results may also be desirable, but that would be icing). Part 1 is a little more than front end though, IMHO. You'd also need to : * snag the searcher's profile * generate filter clauses based on that (in practice the filtering would probably be better the other way around, i.e. reduce search space using profile facts for triple matching, then use something like FILTER regex(?object, ?searchterms, i) ) Having said that a bit of experimentation is no doubt needed to get a good interaction between the sloppy text bits and the explicit profile stuff - e.g. if I searched for airports (and my profile contained my geo location), behind the scenes you'd probably want airports to map to ?s a x:Airport - something your Entity Search tool presumably can do (alas it keeps timing out for me right now). Cheers, Danny. -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Recipe for Yahoo SearcMonkey: http://tr.im/rAbN Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Tutorial materials: CEC'09 2009 Tutorial: The Web of Data for E-Commerce: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://tr.im/grcec09 begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Recipe for Shops: Showing up in Yahoo and in the Web of Data in One Turn
Dear all: I just completed a recipe meant for larger audiences (Web developers, SEO companies) on how a business can enrich its pages using RDFa+GoodRelations so that the data - shows up in Yahoo AND - it at the same time useful for comprehensive RDF applications. The recipe is at http://tr.im/rAbN It tries to combine pure recipes from the RDF world with the Web developer's how-tos provided by Yahoo. Any feedback is very welcome. Best Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Tool for registering your business: http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page and resources for developers: http://purl.org/goodrelations/ Tutorial materials: Tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/GoodRelations_Tutorial_ESWC2009 begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: Excellent News for LOD: Yahoo Provides Tool for RDFa+GoodRelations for Site Owners
Hi Danny, thanks. Note that the value proposition of more structure and a higher link density in commerce data is not even tougher price comparison shopping, but deep comparison shopping - a better match between the diversity of offers and their individual value proposion on one hand and the diversity of consumer preferences on the other hand. The current web forces us to reduce the search space prematurely to very few makes and models and then watch for the price only, because we cannot search and compare across sites. See the my sketchy Deep Comparison Shopping presentation at http://www.slideshare.net/mhepp/deep-comparison-shopping-1591651 Danny Ayers wrote: Chipping in a little late - yep, this really is excellent news. E-commerce was a huge driver for the Web (/me sidesteps the bust), there's every reason it could be a shot in the arm for the semweb too. Also the lure of shopkeeper $$$s makes this kind of thing great pedagogical material - note the old MS demo Northwind database (btw RDFized by Kingsley co.) and Java EE Pet Store [1]. Speaking of $$$s, I reckon there's a significant market opening for an out-of-the-box semweb-enabled online store 'solution'. Incidentally I've still not got around to figuring out the part-whole product description as mentioned at [2] (s/Tinocaster/Vinocaster - there was a preexisting Tinocaster :) so any suggestions there would still be appreciated. Yep, the fully composition substitution ontology is still on my agenda, but not top priority. Note that GoodRelations already offers basic patterns for modeling - bundles (one bottle of red wine, a pizza, and an iPhone for $500) via gr:includesObject - spare parts via gr:isAccessoryOrSparePartFor - related products via gr:isSimilarTo - consumables via gr:isConsumableFor Cheers, Martin Cheers, Danny. [1] http://java.sun.com/developer/releases/petstore/ [2] http://lists.w3.org/Archives/Public/semantic-web/2007Apr/0024.html -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Tool for registering your business: http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page and resources for developers: http://purl.org/goodrelations/ Tutorial materials: Tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/GoodRelations_Tutorial_ESWC2009 begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
IJSWIS Special Issue on Scalability and Performance of Semantic Web Systems
(apologies for cross-posting) Dear all: We are pleased to announce the latest issue of IJSWIS, which is a special issue on Scalability and Performance of Semantic Web Systems ~~ The contents of the latest issue of: International Journal on Semantic Web and Information Systems (IJSWIS) Volume 5, Issue 2, April-July 2009 Published: Quarterly in Print and Electronically ISSN: 1552-6283 EISSN: 1552-6291 www.ijswis.org www.igi-global.com/ijswis Editor-in-Chief: Amit Sheth, Kno.e.sis Center, Wright State University, USA Associate Editors: Martin Hepp, Bundeswehr University Munich, Germany Gottfried Vossen, University of Muenster, Germany Impact factor of this journal: 1.8 GUEST EDITORIAL PREFACE JSWIS 5(2) Vassilis Christophides, Institute of Computer Science Foundation for Research, Greece Jeff Heflin, Lehigh University, USA Recently, the W3C Linking Open Data effort has boosted the publication and interlinkage of larger amounts of RDF/S datasets on the Semantic Web (SW). Various ontologies and knowledge bases with millions of RDF/ S triplets from Wikipedia and other sources have been created and are available online. It is clear that the increasing number and size of the available SW datasets presents a real challenge for Semantic Web systems in order to cope with scalability and performance concerns. In this special issue, four articles cover a wide range of techniques for benchmarking or enhancing the scalability of Semantic Web systems. The authors build systems that process terabytes of data, have response times on the order of seconds or less, and rely on reasoning to solve problems not easily solved before. To read the guest editorial preface, please consult this issue of IJSWIS in your library. PAPER ONE The Berlin SPARQL Benchmark Christian Bizer, Freie Universität Berlin, Germany Andreas Schultz, Freie Universität Berlin, Germany The SPARQL Query Language for RDF and the SPARQL Protocol for RDF are implemented by a growing number of storage systems and used within enterprise and open Web settings. As SPARQL is taken by the community, there is a growing need for benchmarks to compare the performance of storage systems that expose SPARQL endpoints via the SPARQL protocol. Such systems include native RDF stores as well as systems that rewrite SPARQL queries to SQL queries against non-RDF relational databases. This article introduces the Berlin SPARQL Benchmark (BSBM) for comparing the performance of native RDF stores with the performance of SPARQL-to-SQL rewriters across architectures. This article discusses the design of the BSBM benchmark and presents the results of a benchmark experiment comparing the performance of four popular RDF stores with the performance of two SPARQL-to-SQL rewriters as well as the performance of two relational database management systems. To obtain a copy of the entire article, click on the link below. http://infosci-on-demand.com/content/details.asp?ID=33737 PAPER TWO Learning of OWL Class Descriptions on Very Large Knowledge Bases Sebastian Hellmann, Universität Leipzig, Germany Jens Lehmann, Universität Leipzig, Germany Sören Auer, Universität Leipzig, Germany The vision of the Semantic Web is to make use of semantic representations on the largest possible scale - the Web. Large knowledge bases such as DBpedia, OpenCyc, GovTrack, and others are emerging and are freely available as linked data and SPARQL endpoints. Exploring and analysing such knowledge bases is a significant hurdle for Semantic Web research and practice. As one possible direction for tackling this problem, the authors present an approach for obtaining complex class descriptions from objects in knowledge bases by using machine learning techniques. They describe in detail how to leverage existing techniques to achieve scalability on large knowledge bases available as SPARQL endpoints or linked data. To obtain a copy of the entire article, click on the link below. http://infosci-on-demand.com/content/details.asp?ID=33738 PAPER THREE Scalable Authoritative OWL Reasoning for the Web Aidan Hogan, National University of Ireland, Ireland Andreas Harth, National University of Ireland, Ireland Axel Polleres, National University of Ireland, Ireland In this article, the authors discuss the challenges of performing reasoning on large scale RDF datasets from the Web. Using ter-Horst’s pD* fragment of OWL as a base, the authors compose a rule-based framework for application to Web data; they argue their decisions using observations of undesirable examples taken directly from the Web. The authors further temper their OWL fragment through consideration of “authoritative stheirces,” which counter-acts an observed behavitheir which we term “ontology hijacking”. This article presents a system for performing rule-based forward-chaining reasoning which they call SAOR (scalable authoritative OWL reasoned). Based upon observed characteristics of Web
Excellent News for LOD: Yahoo Provides Tool for RDFa+GoodRelations for Site Owners
Dear all: Great news: ANY site owner in the world has now a clear incentive to add GoodRelations meta-data in RDFa to his/her page: As of now, Yahoo will display price and offering details and other meta-data of any e-commerce Web page if the site owner uses GoodRelations vocabulary elements. Previously, such data was only used within special applications developed in the Yahoo ecosystem. Now, every site owner can add meta-data to enhance the search results of offers in Yahoo. Just go to http://developer.search.yahoo.com/help/objects/product and click on the tab RDFa - GoodRelation. The tool generates mark-up patterns for copy-and paste into xhtml page content. Thanks to Peter Mika and everybody at Yahoo for this initiative! It will for sure help lower the entrance barrier for any business to use Semantic Web technology in general and GoodRelations in particular. Screenshots are at http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Best wishes Martin -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Tool for registering your business: http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page and resources for developers: http://purl.org/goodrelations/ Tutorial materials: Tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/GoodRelations_Tutorial_ESWC2009 -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Tool for registering your business: http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page and resources for developers: http://purl.org/goodrelations/ Tutorial materials: Tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/GoodRelations_Tutorial_ESWC2009 begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: .htaccess a major bottleneck to Semantic Web adoption / Was: Re: RDFa vs RDF/XML and content negotiation
Google has just changed the wording of the documentation: http://knol.google.com/k/google-rich-snippets/google-rich-snippets/32la2chf8l79m/1# The mentioning of cloaking risk is removed. While this is not final clearance, it is a nice sign that our concerns are heard. Best Martin Martin Hepp (UniBW) wrote: Dear all: Fyi - I am in contact with Google as for the clarification of what kind of empty div/span elements are considered acceptable in the context of RDFa. It may take a few days to get an official statement. Just so that you know it is being taken care of... Martin Mark Birbeck wrote: Hi Martin, b) download RDFa snippet that just represents the RDF/XML content (i.e. such that it does not have to be consolidated with the presentation level part of the Web page. By coincidence, I just read this: Hidden div's -- don't do it! It can be tempting to add all the content relevant for a rich snippet in one place on the page, mark it up, and then hide the entire block of text using CSS or other techniques. Don't do this! Mark up the content where it already exists. Google will not show content from hidden div's in Rich Snippets, and worse, this can be considered cloaking by Google's spam detection systems. [1] Regards, Mark [1] http://knol.google.com/k/google-rich-snippets/google-rich-snippets/32la2chf8l79m/1# -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Tool for registering your business: http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page and resources for developers: http://purl.org/goodrelations/ Tutorial materials: Tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/GoodRelations_Tutorial_ESWC2009 begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: .htaccess a major bottleneck to Semantic Web adoption / Was: Re: RDFa vs RDF/XML and content negotiation
Hi Tom: Amen. Thank you for writing this. I completely agree. RDFa has some great use cases but (like any technology) has its limitations. Let's not oversell it. We seem to agree on the observation, but not on the conclusion. What I want and suggest is using RDFa also for exchanging a bit more complex RDF models / data by simply using a lot of div / span or whatever elements that represent the RDF part in the SAME document BUT NOT too closely linked with the presentation level. body h1This is the car I want to sell/h1 Actually, a pretty cool car, for only $1.000. Offer valid through July 31, 2009 span ... my whole RDF in RDFa /span body The advantage of that would be that - you just have to maintain ONE file, - data and metadata are close by, so the likelihood of being up to date increases, and - at the same time, the code does not get too messy. - Also - no problems setting up the server (*). - Easy to create on-line tools that generate RDFa snippets for simple pasting. - Yahoo and Google will most likely honor RDFa meta-data only. Also note that often the literal values will be in content attributes anyway, because the string for the presentation is not suitable as meta-data content anyway (e.g. dates, country codes,...) I think the approach sketched above would be a cheap and useful way of publishing RDF meta-data. It could work with CMS / blogging software etc. Imaging if we were able to allow eBay sellers to put GoodRelations meta-data directly into the open XHTML part of their product description. The main problem with my proposal is that there is the risk that Google considers this cloaking and may remove respective resources from their index (Mark raised that issue). If that risk was confirmed, we would really have a problem. Imagine me selling Semantic Web markup as a step beyond SEO ... and the first consequence of following my advice is being removed from the Google index. A second problem is that if the document contains nodes that have no counterpart on the presentation level (e.g. intermediate nodes for holding n-ary relations), then they will also not be dereferencable. The same holds for URIs or nodes that are outside the scope of the actual RDFa / XHTML document - I see no simple way of serving neither XHTML nor RDF content for those. Best Martin Tom Heath wrote: Martin, 2009/6/27 Martin Hepp (UniBW) martin.h...@ebusiness-unibw.org: So if this hidden div / span approach is not feasible, we got a problem. The reason is that, as beautiful the idea is of using RDFa to make a) the human-readable presentation and b) the machine-readable meta-data link to the same literals, the problematic is it in reality once the structure of a) and b) are very different. For very simple property-value pairs, embedding RDFa markup is no problem. But if you have a bit more complexity at the conceptual level and in particular if there are significant differences to the structure of the presentation (e.g. in terms of granularity, ordering of elements, etc.), it gets very, very messy and hard to maintain. Amen. Thank you for writing this. I completely agree. RDFa has some great use cases but (like any technology) has its limitations. Let's not oversell it. Tom. -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Tool for registering your business: http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page and resources for developers: http://purl.org/goodrelations/ Tutorial materials: Tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/GoodRelations_Tutorial_ESWC2009 begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: .htaccess a major bottleneck to Semantic Web adoption / Was: Re: RDFa vs RDF/XML and content negotiation
Hi Yihong: I am a big fan of Codd's one fact in one place credo. However, in this particular case, that principle is violated anyway, since the literal values are often duplicated for presentation and meta-data prupolses anyway (think of 2009-06-29 vs. June 29, 2009). Second, for dynamic Web apps, it does not really matter whether the same fact is exposed once or twice, since the central location is one place in the database anyway. Third, this is the only way how a tool like the GoodRelations annotator [1] can create RDFa snippets for simple copy-and-paste into existing pages. Also note that in the particular case of RDFa, the principle of one fact in one place clashes with the separation of concerns principle, in particular, that of keeping data and presentation separate. The textbook-style beauty of simplicity of RDFa holds for adding a dc:creator property to a string value that is the same for presentation and at the data level. Beyond that, RDFa can create code that is very hard to maintain. In fact, I know that a large software company dismissed the use of RDFa in their products because of the unmanageable mix of conceptual and presentation layer. As far as security is concerned: I there is no real difference in my proposal, as the content attribute of RDFa allows serving different data to human and to machines, and this is a needed feature anyway. Digital signatures at the document or element level and / or data provenance approached will likely cater for that. Best Martin Yihong Ding wrote: Hi Kingley and Martin, A potential problem of the model Martin suggested is that the same data has to be presented at least TWICE in one document. Although the RDFa portion of the data is supposed to be automatically generated, it, however, does not prohibit anybody from manually revising it. Therefore, it leaves a huge hole for the hackers (or anybody who want to do some deceptive job). In our imperfect world, this problem is severe. Adding an extra layer of data mapping always causes additional work on data maintenance. This time, the extra work could be a nightmare though the architecture is neat. yihong On Mon, Jun 29, 2009 at 8:03 AM, Kingsley Idehen kide...@openlinksw.comwrote: Martin Hepp (UniBW) wrote: Hi Tom: Amen. Thank you for writing this. I completely agree. RDFa has some great use cases but (like any technology) has its limitations. Let's not oversell it. We seem to agree on the observation, but not on the conclusion. What I want and suggest is using RDFa also for exchanging a bit more complex RDF models / data by simply using a lot of div / span or whatever elements that represent the RDF part in the SAME document BUT NOT too closely linked with the presentation level. body h1This is the car I want to sell/h1 Actually, a pretty cool car, for only $1.000. Offer valid through July 31, 2009 span ... my whole RDF in RDFa /span body The advantage of that would be that - you just have to maintain ONE file, - data and metadata are close by, so the likelihood of being up to date increases, and - at the same time, the code does not get too messy. - Also - no problems setting up the server (*). - Easy to create on-line tools that generate RDFa snippets for simple pasting. - Yahoo and Google will most likely honor RDFa meta-data only. Also note that often the literal values will be in content attributes anyway, because the string for the presentation is not suitable as meta-data content anyway (e.g. dates, country codes,...) I think the approach sketched above would be a cheap and useful way of publishing RDF meta-data. It could work with CMS / blogging software etc. Imaging if we were able to allow eBay sellers to put GoodRelations meta-data directly into the open XHTML part of their product description. The main problem with my proposal is that there is the risk that Google considers this cloaking and may remove respective resources from their index (Mark raised that issue). If that risk was confirmed, we would really have a problem. Imagine me selling Semantic Web markup as a step beyond SEO ... and the first consequence of following my advice is being removed from the Google index. A second problem is that if the document contains nodes that have no counterpart on the presentation level (e.g. intermediate nodes for holding n-ary relations), then they will also not be dereferencable. The same holds for URIs or nodes that are outside the scope of the actual RDFa / XHTML document - I see no simple way of serving neither XHTML nor RDF content for those. Martin, If Google doesn't see invisible DIVs as cloaking, the issue vaporizes. Also, if people take the SEO + SDQ (Linked Data Expressed in RDFa) approach they will at least remain in the Google index via usual SEO oriented keyword gimmickry, albeit generally suboptimal. If we make a recipe doc showcasing these issues, we will more than likely get Google to recalibrate back to the Web
Re: .htaccess a major bottleneck to Semantic Web adoption / Was: Re: RDFa vs RDF/XML and content negotiation
Hi Toby, Toby A Inkster wrote: On 25 Jun 2009, at 21:18, Pat Hayes wrote: If [RDF] requires people to tinker with files with names starting with a dot [...] then the entire SWeb architecture is fundamentally broken. RDF doesn't. Apache does. Many hosts do have front ends for configuring Apache, allowing redirects to be set up and content-types configured by filling in simple web forms. But there are such a variety of these tools with different capabilities and different interfaces that it would be difficult to produce advice suitable for them all, so instead .htaccess recipes are provided instead. That said, there are a couple of steps that Martin could remove from his recipe and still be promoting reasonably good practice: Step 5a - this rewrites http://example.org/semanticweb to http://example.org/semanticweb.rdf. Other than aesthetics, there's no real reason to do this. Yes, I've read timbl's old Cool URIs document, and understand about not wanting to include hints of file format in a URI. But realistically, this file is going to always include some RDF - perhaps in a non-RDF/XML serialisation, but I don't see anything inappropriate about serving other RDF serialisations using a .rdf URL, provided the correct MIME type is used. Yes - while it breaks my heart, we will uses URIs including the .rdf extension in the future. Comparing benefits and trouble caused, it is not worth pushing it. Step 5b - the default Apache mime.types file knows about application/rdf+xml, so this should be unnecessary. Perhaps instead have a GoodRelations validator which checks that the content type is correct, and only suggests this when it is found to be otherwise. Well, our experience is that about 30% of the servers don't use the proper mime type by default, which causes trouble with many semweb applications Steps 3 and 4 could be amalgamated into a single validate your RDF file step using the aforementioned validator. The validator would be written so that, upon a successful validation, it offers single-click options to ping semweb search engines, and Yahoo (via a RDF/XML-DataRSS converter). With those adjustments, the recipe would just be: 1. Upload your RDF file. 2. Add a rel=meta link to it. 3. Validate using our helpful tool. Yes, that would be a good option. But actually I am prone to go for a more radical shift, which is offering just three alternative publication mechanisms: a) download RDF/XML or N3 file (for expert users) b) download RDFa snippet that just represents the RDF/XML content (i.e. such that it does not have to be consolidated with the presentation level part of the Web page. c) have us publish it on our servers (this will require some techniques of validating users, update / refresh - requires some more thoughts. Best Martin -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Tool for registering your business: http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page and resources for developers: http://purl.org/goodrelations/ Tutorial materials: Tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/GoodRelations_Tutorial_ESWC2009 begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
.htaccess a major bottleneck to Semantic Web adoption / Was: Re: RDFa vs RDF/XML and content negotiation
Hi all: After about two months of helping people generate RDF/XML metadata for their businesses using the GoodRelations annotator [1], I have quite some evidence that the current best practices of using .htaccess are a MAJOR bottleneck for the adoption of Semantic Web technology. Just some data: - We have several hundred entries in the annotator log - most people spend 10 or more minutes to create a reasonable description of themselves. - Even though they all operate some sort of Web sites, less than 30 % of them manage to upload/publish a single *.rdf file in their root directory. - Of those 30%, only a fraction manage to set up content negotiation properly, even though we provide a step-by-step recipe. The effects are - URIs that are not dereferencable, - incorrect media types and and other problems. When investigating the causes and trying to help people, we encountered a variety of configurations and causes that we did not expect. It turned out that helping people just managing this tiny step of publishing Semantic Web data would turn into a full-time job for 1 - 2 administrators. Typical causes of problems are - Lack of privileges for .htaccess (many cheap hosting packages give limited or no access to .htaccess) - Users without Unix background had trouble name a file so that it begins with a dot - Microsoft IIS require completely different recipes - Many users have access just at a CMS level Bottomline: - For researchers in the field, it is a doable task to set up an Apache server so that it serves RDF content according to current best practices. - For most people out there in reality, this is regularly a prohibitively difficult task, both because of a lack of skills and a variety in the technical environments that turns into an engineering challenge what is easy on the textbook-level. As a consequence, we will modify our tool so that it generates dummy RDFa code with span/div that *just* represents the meta-data without interfering with the presentation layer. That can then be inserted as code snippets via copy-and-paste to any XHTML document. Any opinions? Best Martin [1] http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Danny Ayers wrote: Thank you for the excellent questions, Bill. Right now IMHO the best bet is probably just to pick whichever format you are most comfortable with (yup it depends) and use that as the single source, transforming perhaps with scripts to generate the alternate representations for conneg. As far as I'm aware we don't yet have an easy templating engine for RDFa, so I suspect having that as the source is probably a good choice for typical Web applications. As mentioned already GRDDL is available for transforming on the fly, though I'm not sure of the level of client engine support at present. Ditto providing a SPARQL endpoint is another way of maximising the surface area of the data. But the key step has clearly been taken, that decision to publish data directly without needing the human element to interpret it. I claim *win* for the Semantic Web, even if it'll still be a few years before we see applications exploiting it in a way that provides real benefit for the end user. my 2 cents. Cheers, Danny. -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Tool for registering your business: http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page and resources for developers: http://purl.org/goodrelations/ Tutorial materials: Tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/GoodRelations_Tutorial_ESWC2009 begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: .htaccess a major bottleneck to Semantic Web adoption / Was: Re: RDFa vs RDF/XML and content negotiation
As mostly, recently ;-), I agree with Kingsley - I did not want to say that proper usage of http is bad or obsolete. But it turned out unfeasible for broad adoption my owners of small Web sites. For huge data sources and for vocabularies, the current recipes are fine. But I want every single business in the world to use GoodRelations for publishing at least their opening hours - 19 Million companies in Europe alone. I cannot explain to every single one of them how to configure their server. Another thing that might have gone lost in the discussion: Even though we knew the recipes, helping the site owners was difficult, because we experienced hundreds of different environments - preexisting .htaccess, MS IIS, hoster-specific scenarios, etc. So the problem is really that such a low-level technique is not feasible if you face so much diversity as far as the target system is concerned. Maybe some day a certain LOD/SW package will be installed by default on most servers. But we cannot wait till then. BTW: We did not even require the full beauty of LOD best practices. We simply want them to do as described here: http://www.ebusiness-unibw.org/wiki/GoodRelations_Recipe_8 Best Martin Kingsley Idehen wrote: Giovanni Tummarello wrote: That can then be inserted as code snippets via copy-and-paste to any XHTML document. Any opinions? Great, why bother with any other solution. even talking about any other solution is extraordinarely bad for the public perception of the semantic web community. Giovanni Giovanni, We don't need mutual exclusivity re. Linked Data Deployment. There's nothing wrong with an array of options that cover a broad range of Linked Data deployment circumstances. HTTP is the essence of the Web (what makes it what it is), and Content Negotiation is intrinsic to HTTP. Don't throw the baby out with the bathwater, really. -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Tool for registering your business: http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page and resources for developers: http://purl.org/goodrelations/ Tutorial materials: Tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/GoodRelations_Tutorial_ESWC2009 begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: .htaccess a major bottleneck to Semantic Web adoption / Was: Re: RDFa vs RDF/XML and content negotiation
Hi John: We also thought of hosting meta-data for the users, but I don't like that because I want the shop operators to feel ownership for the data: If the opening hours expressed in RDF are wrong but on the personal Web page of that restaurant, anybody facing closed doors will blame the restaurant. If the outdated opening hours in RDF are on my SW server, the unlucky customer will blame the Semantic Web for having crappy data. So maybe the snippet solution in RDFa is the best. Best Martin John Graybeal wrote: This is a principal reason MMI decided to offer a vocabulary server for its community. The idea that 1000 different providers would all develop a level of web competency (for which there is evidence at only a minority of providers) for serving their RDF and OWL content -- let alone the capability to do versioning, adopt best practices, learn SKOS, and whatever other nuances are called for -- seemed like a non-starter. This is not exactly the same problem you're facing, but something to consider (if the model allows it) is creating a way to serve the annotations from another place than the host institution. The institution can refer to those served files from their own sites, and even update them remotely, but not have to incur all the management overhead as standards improve, files change, authorship changes, etc. (Which is not to disagree with your plan either. That sounds fine.) One other delivery model could be for them to give you an existing HTML, you give them back the modified HTML (saves them cutting and pasting steps?). I'm a little ignorant on your tools and processes, so apologies if these are non-starters. John On Jun 25, 2009, at 9:44 AM, Martin Hepp (UniBW) wrote: Hi all: After about two months of helping people generate RDF/XML metadata for their businesses using the GoodRelations annotator [1], I have quite some evidence that the current best practices of using .htaccess are a MAJOR bottleneck for the adoption of Semantic Web technology. Just some data: - We have several hundred entries in the annotator log - most people spend 10 or more minutes to create a reasonable description of themselves. - Even though they all operate some sort of Web sites, less than 30 % of them manage to upload/publish a single *.rdf file in their root directory. - Of those 30%, only a fraction manage to set up content negotiation properly, even though we provide a step-by-step recipe. The effects are - URIs that are not dereferencable, - incorrect media types and and other problems. When investigating the causes and trying to help people, we encountered a variety of configurations and causes that we did not expect. It turned out that helping people just managing this tiny step of publishing Semantic Web data would turn into a full-time job for 1 - 2 administrators. Typical causes of problems are - Lack of privileges for .htaccess (many cheap hosting packages give limited or no access to .htaccess) - Users without Unix background had trouble name a file so that it begins with a dot - Microsoft IIS require completely different recipes - Many users have access just at a CMS level Bottomline: - For researchers in the field, it is a doable task to set up an Apache server so that it serves RDF content according to current best practices. - For most people out there in reality, this is regularly a prohibitively difficult task, both because of a lack of skills and a variety in the technical environments that turns into an engineering challenge what is easy on the textbook-level. As a consequence, we will modify our tool so that it generates dummy RDFa code with span/div that *just* represents the meta-data without interfering with the presentation layer. That can then be inserted as code snippets via copy-and-paste to any XHTML document. Any opinions? Best Martin [1] http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Danny Ayers wrote: Thank you for the excellent questions, Bill. Right now IMHO the best bet is probably just to pick whichever format you are most comfortable with (yup it depends) and use that as the single source, transforming perhaps with scripts to generate the alternate representations for conneg. As far as I'm aware we don't yet have an easy templating engine for RDFa, so I suspect having that as the source is probably a good choice for typical Web applications. As mentioned already GRDDL is available for transforming on the fly, though I'm not sure of the level of client engine support at present. Ditto providing a SPARQL endpoint is another way of maximising the surface area of the data. But the key step has clearly been taken, that decision to publish data directly without needing the human element to interpret it. I claim *win* for the Semantic Web, even if it'll still be a few years before we see applications exploiting it in a way that provides real benefit for the end user. my 2 cents. Cheers
Re: http://ld2sd.deri.org/lod-ng-tutorial/
Hi Dan: I think Alan already gave examples this morning. An ontology can contain statements about the relationship between conceptual elements - classes, properties, individuals - that (1) influence the result to queries but (2) are not likely retrieved when you just dereference an element from that ontology. The more complex an ontology is, the more difficult is it to properly modularize it. But basically my main point is that the use of owl:imports is defined pretty well in http://www.w3.org/TR/owl-ref/#imports-def and there is no need to deviate from the spec just for the matter of gut feeling and annoyance about the past dominance of DL research in the field. And as the spec says - with a proper owl:imports statement, any application can decide if and what part of the imported ontologies are being included to the local model for the task at hand. Martin Dan Brickley wrote: On 22/6/09 23:16, Martin Hepp (UniBW) wrote: Yves Raimond wrote: Ontology modularization is a pretty difficult task, and people use various heuristics for deciding what to put in the subset being served for an element. There is no guarantee that the fragment you get contains everything that you need. There is no safe way of importing only parts of an ontology, unless you know that its modularization is 100% reliable. Serving fragments of likely relevant parts of an ontology for reducing the network overhead is not the same as proper modularization of the ontology. Can you give a concrete example of the danger described here? ie. the pair of a complete (safe) ontology file and a non-safe subset, and an explanation of the problems caused. I can understand there is no guarantee that the fragment you get contains everything you need, and I also remind everyone that dereferencing is a privilege not a right: sometimes the network won't give you what you want, when you want it. But I've yet to hear of anyone who has suffered due to term-oriented ontology fragment downloads. I guess medical ontologies would be the natural place for horror stories? cheers, Dan -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast: http://www.heppnetz.de/projects/goodrelations/webcast/ Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://tinyurl.com/semtech-hepp Tool for registering your business: http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: http://tinyurl.com/goodrelations-universe Project page and resources for developers: http://purl.org/goodrelations/ Tutorial materials: Tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/GoodRelations_Tutorial_ESWC2009 begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: http://ld2sd.deri.org/lod-ng-tutorial/
Hi Kingsley, You are of course right - I assume that, despite the terminological mess I introduced, you agree with my line of argument; I fully acknowledge it is heavily inspired by our San Jose sushi talk ;-) Martin Kingsley Idehen wrote: Martin, [SNIP] As Kingsley said - deceptively simple solutions are cheap in the beginning but can be pretty costly in the long run. I meant: Deceptively Simple is good. While Simply Simple is bad due to inherent architectural myopia obscured by initial illusion of cheapness etc.. What made the Web so powerful is that its Architecture is extremely well-thought underneath the first cover of simplicity. That's what I meant by: Deceptively Simple, architectural apex is narrow (simple) while the base is broad (a pyramid) :-) Exactly the opposite of I will use this pragmatic pattern until it breaks but instead That's what I meant by: Simple Simple, architectural apex is broad while the base is narrow (think inverted pyramid). architectural beauty for eternity. Yes! That what you get with: Deceptively Simple :-) Kingsley Just look at the http specs. The fact that you can do a nice 303 is because someone in the distant past very cleverly designed a protocol goes well beyond the pragmatic I have a URL (sic!) and want to fetch the Web page in HTML (sic!). So when being proud of being the pragmatic guys keep in mind that nothing is as powerful in practice as something that is theoretically consistent. Best Martin begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
PingTheSemanticWeb and Sitemaps
Dear all: I am sure you all agree that PingTheSemanticWeb is a key service for many of our applications. Unfortunately, it does currently not support the bulk notification for datasets for which a semantic sitemap is available. Since some of you maybe facing the same problem, I just put a short Python script on-line at http://www.ebusiness-unibw.org/wiki/PTSW4Sitemaps which you can use to notify PingTheSemanticWeb of multiple files using a given semantic sitemap. Best Martin -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax:+49-(0)89-6004-4620 www:http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast explaining the Web of Data for E-Commerce: - http://www.heppnetz.de/projects/goodrelations/webcast/ Tool for registering your business: -- http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: - http://tinyurl.com/goodrelations-universe Project page and resources for developers: - http://purl.org/goodrelations/ Upcoming events: --- Full-day tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.eswc2009.org/program-menu/tutorials/70 Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology More information: http://www.semantic-conference.com/session/1881/ Slides: http://tinyurl.com/semtech-hepp begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: Bestbuy.com goes Semantic Web the GoodRelations Way
PS. In some cases, its a good idea to stop at the first kiss... but I think that the prospect of permeating the e-commerce dinosaur is sexy as hell ;) Yes, that is what drives me ;-) Martin PS: Think of the many novel e-business scenarios in affiliate marketing etc. when you can have web-wide search on the product data space. More here: http://tinyurl.com/semtech-hepp Aldo Bucchi wrote: Guys, Let me add: Having real world use cases validate the cause in many levels. In practice, this will generate a second sell, and a third, etc. Domino effect. Or rather, the CYA effect: Until someone else has taken the risk, why put your A on the line? So, I would call this a nice escape goat as well. And that's a LOT. First kiss is most elusive. From then on it's downhill ;) Congratulations to whoever made it happen! Regards A PS. In some cases, its a good idea to stop at the first kiss... but I think that the prospect of permeating the e-commerce dinosaur is sexy as hell ;) On Tue, Jun 9, 2009 at 7:41 AM, Kingsley Idehenkide...@openlinksw.com wrote: Bill Roberts wrote: Yeah, I too think this is a big deal. Semantic web in the commercial world obviously suffers the same chicken and egg problem as elsewhere, but if a big company like BestBuy just does it anyway, then services that consume and aggregate this kind of data are likely to spring up. Semantic SEO industry starts here? Yes, but I would say SDQ in addition to SEO :-) Links: 1. http://tr.im/iv9e -- post about Serendipitous Discovery Quotient (SDQ) -- Regards, Kingsley Idehen Weblog: http://www.openlinksw.com/blog/~kidehen President CEO OpenLink Software Web: http://www.openlinksw.com -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax:+49-(0)89-6004-4620 www:http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast explaining the Web of Data for E-Commerce: - http://www.heppnetz.de/projects/goodrelations/webcast/ Tool for registering your business: -- http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: - http://tinyurl.com/goodrelations-universe Project page and resources for developers: - http://purl.org/goodrelations/ Upcoming events: --- Full-day tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.eswc2009.org/program-menu/tutorials/70 Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology More information: http://www.semantic-conference.com/session/1881/ Slides: http://tinyurl.com/semtech-hepp begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: Bestbuy.com goes Semantic Web the GoodRelations Way
Hi Aldo: Kingsley: Does any of the upper level ontologies or LOD datasets provide industry sectors with a useful level of granularity? Note that GoodRelations provides the gr:hasISICv4 and the gr:hasNAICS properties for linking a gr:BusinessEntity to the proper ISIC/NAICS category code for that company. For a more precise description of the range of skills or products offered, one should use the regular gr:Offering pattern, though, because a one-dimensional industry classification is pretty much limited. http://purl.org/goodrelations/v1#hasISICv4 Definition: The International Standard of Industrial Classification of All Economic Activities (ISIC), Revision 4 code for a particular Business Entity. See http://unstats.un.org/unsd/cr/registry/isic-4.asp for more information. Note: While ISIC codes are sometimes misused for classifying products or services, they are designed and suited only for classifying business establishments. http://purl.org/goodrelations/v1#hasNAICS Definition: The North American Industry Classification System (NAICS) code for a particular Business Entity. See http://www.census.gov/eos/www/naics/ for more details. Note: While NAICS codes are sometimes misused for classifying products or services, they are designed and suited only for classifying business establishments. Best Martin Congratulations to whoever made it happen! Regards A PS. In some cases, its a good idea to stop at the first kiss... but I think that the prospect of permeating the e-commerce dinosaur is sexy as hell ;) On Tue, Jun 9, 2009 at 7:41 AM, Kingsley Idehenkide...@openlinksw.com wrote: Bill Roberts wrote: Yeah, I too think this is a big deal. Semantic web in the commercial world obviously suffers the same chicken and egg problem as elsewhere, but if a big company like BestBuy just does it anyway, then services that consume and aggregate this kind of data are likely to spring up. Semantic SEO industry starts here? Yes, but I would say SDQ in addition to SEO :-) Links: 1. http://tr.im/iv9e -- post about Serendipitous Discovery Quotient (SDQ) -- Regards, Kingsley Idehen Weblog: http://www.openlinksw.com/blog/~kidehen President CEO OpenLink Software Web: http://www.openlinksw.com -- Aldo Bucchi U N I V R Z Office: +56 2 795 4532 Mobile:+56 9 7623 8653 skype:aldo.bucchi http://www.univrz.com/ http://aldobucchi.com/ PRIVILEGED AND CONFIDENTIAL INFORMATION This message is only for the use of the individual or entity to which it is addressed and may contain information that is privileged and confidential. If you are not the intended recipient, please do not distribute or copy this communication, by e-mail or otherwise. Instead, please notify us immediately by return e-mail. INFORMACIÓN PRIVILEGIADA Y CONFIDENCIAL Este mensaje está destinado sólo a la persona u organización al cual está dirigido y podría contener información privilegiada y confidencial. Si usted no es el destinatario, por favor no distribuya ni copie esta comunicación, por email o por otra vía. Por el contrario, por favor notifíquenos inmediatamente vía e-mail. -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax:+49-(0)89-6004-4620 www:http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast explaining the Web of Data for E-Commerce: - http://www.heppnetz.de/projects/goodrelations/webcast/ Tool for registering your business: -- http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: - http://tinyurl.com/goodrelations-universe Project page and resources for developers: - http://purl.org/goodrelations/ Upcoming events: --- Full-day tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.eswc2009.org/program-menu/tutorials/70 Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology More information: http://www.semantic-conference.com/session/1881/ Slides: http://tinyurl.com/semtech-hepp begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
HTTP 200 instead of 303
Dear all, is this an acceptable practice from an LOD point of view or are there technical concerns? Martin Damian Steer wrote: On 20 May 2009, at 18:54, Martin Hepp (UniBW) wrote: Hi Damian: Thanks for your e-mail! I must admit that I don't get what exactly you are proposing with thought entertained=minimal If the location header was set in the response I guess that might help. /thought Best Martin Hi Martin, Apologies. The perils of hasty emails :-) I meant setting the 'Content-Location' header, to indicate where the data came from. It's lighter than a redirect. Example: GET http://logs.jruby.org/jruby/latest = HTTP/1.1 200 OK ... Content-Location: latest.txt Similarly you could try: GET http://openean.kaufkauf.net/id/EanUpc_0001067792600 = HTTP/1.1 200 OK ... Content-Location: EanUpc_range_xx GET http://openean.kaufkauf.net/id/EanUpc_0008811127923 = HTTP/1.1 200 OK ... Content-Location: EanUpc_range_xx Which seems like a reasonable way to indicate they have the same content. Damian -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax:+49-(0)89-6004-4620 www:http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast explaining the Web of Data for E-Commerce: - http://www.heppnetz.de/projects/goodrelations/webcast/ Tool for registering your business: -- http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: - http://tinyurl.com/goodrelations-universe Project page and resources for developers: - http://purl.org/goodrelations/ Upcoming events: --- Full-day tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.eswc2009.org/program-menu/tutorials/70 Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology More information: http://www.semantic-conference.com/session/1881/ Slides: http://tinyurl.com/semtech-hepp begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
ANN: RDF2dataRSS Conversion - Feed RDF/XML into Yahoo SearchMonkey (beta)
Dear all: As you may know, Yahoo SearchMonkey accepts Semantic Web data only if either provided as RDFa or via the Yahoo-specific dataRSS feed format. This means that RDF/XML and other formats are currently not considered. Since it is sometimes desirable to publish data in RDF/XML rather than RDFa, we developed a conversion service RDF2dataRSS that turns RDF/XML into a dataRSS feed, which can then be submitted to Yahoo. The tool is available at http://www.ebusiness-unibw.org/tools/rdf2datarss/ It can also be directly accessed from applications in a REST style by sending an HTTP GET request to the URI http://www.ebusiness-unibw.org/tools/rdf2datarss/RDF2dataRSSServlet?type=xmluri=uri with uri being the URI of an RDF/XML file to be converted, e.g. http://www.ebusiness-unibw.org/tools/rdf2datarss/RDF2dataRSSServlet?type=xmluri=http://www.host.com/rdffile.rdf This returns an XML file containing the RDF content as dataRSS. Disclaimer: The tool is in beta only, so there are no explicit or implicit guarantees given... A big thanks to Thomas Irmscher and Andreas Radinger for their hard work! Best Martin -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax:+49-(0)89-6004-4620 www:http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast explaining the Web of Data for E-Commerce: - http://www.heppnetz.de/projects/goodrelations/webcast/ Tool for registering your business: -- http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: - http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Project page and resources for developers: - http://purl.org/goodrelations/ Upcoming events: --- Full-day tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.eswc2009.org/program-menu/tutorials/70 Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.semantic-conference.com/session/1881/ begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Publishing RDF/XML Data on MS IIS Platforms
Dear all: Here is a brief summary of related resources - thanks to everybody who replied: • Ionics Isapi Rewrite Filter for IIS 5.0 and IIS 6.0 – http://iirf.codeplex.com/ • Microsoft URL Rewrite Module for IIS 7.0 – http://support.microsoft.com/kb/324064 (Thanks to Christophe Debruyne and Robert Meersman for the hint!) • ModRewrite – http://www.micronovae.com/ModRewrite/ModRewrite.html (Thanks to Sergio Fernández for the link!) Best Martin Martin Hepp (UniBW) wrote: Dear all: Are there any recommended Web resources for configuring Microsoft IIS to support redirects and content negotiation? What is easily done via .htaccess on Apache servers seems to be difficult or require commercial add-ons on MS IIS, as far as my quick Google search indicates. Is anybody of you serving RDF/XML data or vocabularies from MS IIS platforms? Thanks for any hints. Best Martin begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: ANN: GoodRelations - E-Commerce on the Web of Data - New Datasets and Applications
Hi Libby, That's rather fabulous! Can you give some information about how often this dataset is updated, and what's its geographical and product type reach? Thanks! This particular data set is a rather static collection and has a bias towards US products. It will soon be complemented by a more dynamic and European-centric second data set. In the long run, we will have to convince professional providers of commodity master data (e.g. GS1) to release their data following our structure. Currently, this is not possible due to licensing restrictions (there are look-up services like GEPIR, but none of them allows redistribution of the data). The upcoming second data set will be based on a community process, i.e., shop owners enter labels for EAN/UPCs in a Wiki. Since EAN/UPCs must (theoretically) not be reused, the current data set should be pretty reliable, though not necessarily very complete. I see the main benefit of the current data set in - using it as a showcase how small businesses can fetch product master data from the Semantic Web and - showing how data on the same commodity from multiple sources can be easily linked on the basis of having the same http://purl.org/goodrelations/v1.html#hasEAN_UCC-13 property value. Individual commodity descriptions can be retrieved as follows: http://openean.kaufkauf.net/id/EanUpc_UPC/EAN Example: http://openean.kaufkauf.net/id/EanUpc_0001067792600 This seems to give me multiple product descriptions - am I misunderstanding? The whole data set is divided in currently 100 (will be changed to 1000 soon) RDF files, which are being served via a bit complicated .htaccess configuration. The reason is that the large number of instance data would otherwise require 1 million very small files (a few triples each), which may cause problems with several file systems. Also, since we want as much of our data as possible to stay within OWL DL (I know not everybody in the community shares that), this would cause a lot of redundancy due to ontology imports / header data in each single file. But as far as I can see, the current approach should not have major side effects - you get back additional triples, but the size of the files being served is limited. Currently, we serve 4 MB file chunks. We will shortly reduce that to 400 - 800 KB. That seems reasonable to me. Best Martin Libby begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: ANN: GoodRelations - E-Commerce on the Web of Data - New Datasets and Applications
Hi Steve, as I replied to Libby (but did not include all mailing lists): The whole data set is served from currently 100 smaller files, which will be broken down to 1000 files shortly. For various reasons however, we don't want to serve one file per element, because that will create a huge overhead - the individual data sets are rather small (a few triples per item). Having one million micro-files is hard to manage. Also, since we want to stay within OWL DL, we would have to duplicate proper ontology header meta-data a million times. Thus, we use a (rather large) set of rules in the .htaccess file to serve that part of the data set that contains the element you are actually looking for. You will receive a few more triples than you need, but simply discard those ;-) Martin Steve Harris wrote: Very cool resource. On 20 May 2009, at 10:18, Libby Miller wrote: Individual commodity descriptions can be retrieved as follows: http://openean.kaufkauf.net/id/EanUpc_UPC/EAN Example: http://openean.kaufkauf.net/id/EanUpc_0001067792600 This seems to give me multiple product descriptions - am I misunderstanding? Yeah, looks like it returns the entire document that the particular EAN appears in. Not very linked data friendly (you'll end up with a large proportion of repeated triples in identical graphs, with different graph URIS), but certainly better than nothing. - Steve -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax:+49-(0)89-6004-4620 www:http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast explaining the Web of Data for E-Commerce: - http://www.heppnetz.de/projects/goodrelations/webcast/ Tool for registering your business: -- http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: - http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Project page and resources for developers: - http://purl.org/goodrelations/ Upcoming events: --- Full-day tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.eswc2009.org/program-menu/tutorials/70 Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.semantic-conference.com/session/1881/ begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
vCard - Old vs. New?
Dear all: As far as I can see, there are now two vCard variants in use - the original http://www.w3.org/2001/vcard-rdf/3.0# and the new one http://www.w3.org/2006/vcard/ns# Now - which one should data providers and application developers use? I see the technical advantages of the new variant and the deficiencies of the old approach. However, a quick SWOOGLE statistics shows that there are only 470 RDF documents using the new version vs. 233,595 documents using the old version. How do current Semantic Web applications handle this issue? Do they honor data expressed in either variant? I see that Yahoo Searchmonkey, for instance, endorses the new namespace - what's with others? We should not irritate potential users of Semantic technology by already confusing them by two vocabularies for such basic data as contact details Best Martin old: http://swoogle.umbc.edu/index.php?option=com_frontpageservice=digestqueryType=digest_nssearchString=http://www.w3.org/2001/vcard-rdf/3.0%23 new: http://swoogle.umbc.edu/index.php?option=com_frontpageservice=digestqueryType=digest_nssearchString=http://www.w3.org/2006/vcard/ns%23 -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: mh...@computer.org phone: +49-(0)89-6004-4217 fax:+49-(0)89-6004-4620 www:http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp Check out the GoodRelations vocabulary for E-Commerce on the Web of Data! Webcast explaining the Web of Data for E-Commerce: - http://www.heppnetz.de/projects/goodrelations/webcast/ Tool for registering your business: -- http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ Overview article on Semantic Universe: - http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Project page and resources for developers: - http://purl.org/goodrelations/ Upcoming events: --- Full-day tutorial at ESWC 2009: The Web of Data for E-Commerce in One Day: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.eswc2009.org/program-menu/tutorials/70 Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.semantic-conference.com/session/1881/ begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: ANN: The GoodRelations Annotator: How any business can get onto the Web of Data - today!
Dear all: Apologies for the late reply.. To my knowledge, there is a pretty mature osCommerce output available at http://code.google.com/p/goodrelations-for-oscommerce/ It was developed by a student of mine. A similar approach for the Joomla/Virtuemart combo by the same student is available at http://code.google.com/p/goodrelations-for-joomla/ Best Martin Daniel O'Connor wrote: On Fri, Apr 10, 2009 at 1:12 PM, Daniel O'Connor daniel.ocon...@gmail.com mailto:daniel.ocon...@gmail.com wrote: Got any plans around baking this into e-commerce software? I'm about 3 hours away from adding rdf/xml output into oscommerce; and i'm sure there are lots of other platforms out there. Oh neat, already done with triplify! begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: ANN: The GoodRelations Annotator: How any business can get onto the Web of Data - today!
Hi all: By the way, any chance of asserting that a gr:BusinessEntity is equivalent to a foaf:Organisation or foaf:Agent? As a statement in a particular data space, I think such a link is pretty accurate and useful. However, we currently prefer to collate such heuristics-based mapping axioms in separate files instead of including them in the vocabulary specification. The reason is that some users of GoodRelations manage clean OWL DL models inside corporate applications; there, importing, or linking to, RDF schema elements has unwanted side-effects. Don't get me wrong: We are very interested and collecting practically useful link statements. But I think they should be managed in a modular fashion. Best Martin Daniel O'Connor wrote: gr:BusinessEntity rdf:ID=BusinessEntity ... owl:sameAs rdf:resource=http://www.3kbo.com/people/richard.hancock/foaf.rdf#i/ ... /gr:BusinessEntity By the way, any chance of asserting that a gr:BusinessEntity is equivalent to a foaf:Organisation or foaf:Agent? begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Hosting Semantic Web data on MS IIS?
Dear all: Are there any recommended Web resources for configuring Microsoft IIS to support redirects and content negotiation? What is easily done via .htaccess on Apache servers seems to be difficult or require commercial add-ons on MS IIS, as far as my quick Google search indicates. Is anybody of you serving RDF/XML data or vocabularies from MS IIS platforms? Thanks for any hints. Best Martin begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
ANN: The GoodRelations Annotator: How any business can get onto the Web of Data - today!
Dear all: We are proud to announce the release of the GoodRelations Annotator, a form-based tool that will help any business in the world to create a description of its offerings suitable for the Web of Data, and that in less than 5 minutes. The tool is available at http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ It creates a straightforward yet complete description of the key aspects of a typical business using the GoodRelations vocabulary and current Semantic Web standards. The resulting RDF/XML file can be either directly published on the company's Web site or used as a skeleton for developing a more fine-grained description with price information etc. The work on the tool has been funded by the Oesterreichische Forschungsfoerderungsgesellschaft GmbH (FFG) and the Austrian Bundesministerium fuer Verkehr, Innovation und Technologie (BMVIT) under the myOntology project in the FIT-IT Semantic Systems program (contract number 812515). Please help spread the word. Best wishes Martin Hepp http://www.heppnetz.de http://www.unibw.de/ebusiness/ Tool: http://www.ebusiness-unibw.org/tools/goodrelations-annotator/ GoodRelations Project: http://www.heppnetz.de/projects/goodrelations/ Webcast (15 Minutes) http://www.heppnetz.de/projects/goodrelations/webcast/ begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
Re: Commonly supported properties
I need to encode postal addresses and phone numbers for businesses (not people) in some RDF I'm working on. Is there any particular existing vocabulary that I should prefer over another for this application? For representing a business, I would also recommend importing the GoodRelations ontology from http://purl.org/goodrelations/v1 and represent the legal entity as an instance of http://purl.org/goodrelations/v1#BusinessEntity You can then attach the address and phone number etc. to that instance. If you have branches or shop locations of the same legal entity, those should be instances of http://purl.org/goodrelations/v1#LocationOfSalesOrServiceProvisioning Also here, you can attach vCard attributes easily. If you need an example, just drop a note. For more info, see http://purl.org/goodrelations/ Best Martin --- martin hepp Richard Cyganiak wrote: On 14 Feb 2009, at 20:01, Bob Wyman wrote: I need to encode postal addresses and phone numbers for businesses (not people) in some RDF I'm working on. Is there any particular existing vocabulary that I should prefer over another for this application? vCard in RDF is the canonical one: http://www.w3.org/2006/vcard/ns# Best, Richard bob wyman On Sat, Feb 14, 2009 at 11:20 AM, Jay Luker lb...@reallywow.com wrote: Hi all, I'm interested in getting people's thoughts on a particular line from the How to publish Linked Data on the Web tutorial. Specifically the following... It is common practice to mix terms from different vocabularies. We especially recommend the use of rdfs:labelhttp://www.w3.org/TR/rdf-schema/#ch_labeland foaf:depiction http://xmlns.com/foaf/0.1/#term_depiction properties whenever possible as these terms are well-supported by client applications. I'm curious about a couple of things in regards to this: a) does anyone know what client applications the authors might be referring to, and b) are there examples of other properties that enjoy similar support? Thanks, --jay begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:mh...@computer.org tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard
LOD Applications / Redland binaries for Mac OS
Dear all: Hopefully not too much off-topic (but I assume there are many Pythonists and Pythonians listening): Does anybody of you have hints on how to install Redland and Python bindings [1] *easily* on Mac OS X? Ideal would be a binary, as available for - I just don't want to dig to deeply into porting and compiling Redland - but it seems there are no binaries for Leopard on the Web, or am I wrong? Any help would be very much appreciated :-) PS: I am posting this here because Redland seems to be the most powerful RDF API for Python, but all the simplicity of Python for developing LOD applications by a broad audience is gone if installing the lib requires a day or so... Best Martin [1] http://sourceforge.net/projects/librdf/ begin:vcard fn:Martin Hepp n:Hepp;Martin org:Bundeswehr University Munich;E-Business and Web Science Research Group adr:;;Werner-Heisenberg-Web 39;Neubiberg;;D-85577;Germany email;internet:[EMAIL PROTECTED] tel;work:+49 89 6004 4217 tel;pager:skype: mfhepp url:http://www.heppnetz.de version:2.1 end:vcard