Hi Sean & Bob,
     Your emails were making it through to the tech list.
see: https://lists.spdx.org/g/Spdx-tech/message/5277

To look and see the ones that have been posted this month
see:  https://lists.spdx.org/g/Spdx-tech/messages?start=8:2023:300

Thanks,
Kate

On Wed, Aug 2, 2023 at 2:51 PM Martin, Robert A <[email protected]> wrote:

> And this one also doesn't seem to have made it to the list.
>
> Bob
>
> *From: *Sean Barnum <[email protected]> <[email protected]>
> *Date: *Tuesday, August 1, 2023 at 2:06 PM
> *To: *[email protected] <[email protected]>
> <[email protected]>
> *Subject: *FW: Some fodder for the discussion of blank nodes
>
> I just noticed that this has still not shown up in my inbox though it
> looks like it made it out of my outbox.
>
> This is the email I was referring to during the call thinking that you all
> had it already.
>
> Sorry about that.
>
>
>
> Hopefully it makes it past the spdx list server this time.
>
>
>
> sean
>
>
>
> *From: *Sean Barnum <[email protected]> <[email protected]>
> *Date: *Tuesday, August 1, 2023 at 11:50 AM
> *To: *[email protected] <[email protected]>
> <[email protected]>
> *Subject: *Some fodder for the discussion of blank nodes
>
> All,
>
>
>
> I apologize for the lateness of this. I threw it together yesterday and
> sent it to the list but just noticed that it never left my outbox so I must
> have messed something up.
>
>
>
> This is a VERY simple overview of some of the aspects of blank nodes we
> should consider when discussing whether they should be used for SPDX 3.0
>
> It is VERY informal and quickly thrown together so please do not interpret
> it as anything too rigorous. Rather than me spending time writing up
> rigorous argumentation I instead took an approach of pulling together
> several reference links addressing various aspects and let those do the
> talking with only a simple summarization of the aspect issue from me.
>
>
>
>    - Some VERY quick and short notes on the question of using blank nodes
>    or not
>    -
>    - The below short outline includes several relevant links to resources
>    on various aspects of this issue. All of these links were found within 20
>    mins of very simple Google querying and all were within the first 5-10
>    results for each Google query.
>    -
>    - There is broad consensus on the existence of significant issues and
>    challenges with using blank nodes. 15-30 mins of googling will yield scores
>    of papers, blog posts, articles, etc. calling out various reasons that
>    blank nodes are problematic and should be avoided wherever possible in the
>    large majority of situations. Defined semantics and specifications
>    regarding Bnodes are inconsistent and contradictory leading to
>    inconsistency between tools, ambiguity in how they will be processed,
>    interpreted or queried.
>
>
>    -
>       http://richard.cyganiak.de/blog/2011/03/blank-nodes-considered-harmful/
>       - https://aidanhogan.com/docs/blank_nodes_jws.pdf
>       - https://marceloarenas.cl/publications/iswc11.pdf
>       - https://terminusdb.com/blog/blank-nodes-in-rdf/
>       - https://terminusdb.com/blog/blank-nodes-in-rdf/
>
>
>    1. When blank nodes are used it is typically for the convenience of
>       the producer but often comes at significant cost to the consumer in the
>       form of ambiguity, uncertainty, complexity, and resources (time and
>       computing resources)
>       2. *IF* they are decided to be used they are *ONLY* for a single
>       scope of a single datastore or single serialized document and *NOT*
>       for global or cross-scope use. This is explicitly stated in all of the 
> W3C
>       specs dealing with Bnodes. Using them for cross-scope use as SPDX 3.0 is
>       intended leads to significant potential data integrity issues.
>
>
>    - https://www.w3.org/wiki/BlankNodes
>          - https://www.w3.org/TR/rdf11-concepts/#section-blank-nodes
>          - https://www.w3.org/TR/rdf-mt/#unlabel
>          - https://en.wikipedia.org/wiki/Blank_node
>
>
>    1. Avoiding these significant potential issues typically requires
>       skolemization (replacing the localized ids with globally unique IRIs) of
>       the Bnodes. This extra effort is forced on the consumer and is often 
> done
>       by processors and graph stores as part of deserialization/ingestion.
>       However, due to the inconsistencies in the specs regarding Bnodes this 
> is
>       not consistent. Some processors and stores do not perform skolemization 
> an
>       simply utilize the localized Bnode ids (especially if they are producer
>       asserted in any way). This leads to significant integrity issues as 
> these
>       ids collide (simple example is even explicitly in some W3C docs/specs 
> and
>       on the Wikipedia page) and increases significantly with the volume of
>       cross-scope content ingested. Skolemization also does not provide any
>       id-related context for the source of the nodes such as that provided by
>       namespaces in producer specified IRIs.
>
>
>    -
>          
> https://www.w3.org/2011/rdf-wg/wiki/Skolemisation#:~:text=Blank%20nodes%20do%20not%20have,are%20known%20as%20Skolem%20IRIs
>          .
>          - https://www.w3.org/wiki/BnodeSkolemization
>
>
>    1. Bnodes also have very significant issues for SPARQL, the definitive
>       standard mechanism for querying rdf graphs. The two do NOT play well
>       together at all due to inherent issues in Bnode design and 
> inconsistencies
>       in the rdf specs related to Bnodes. Many queries can lead to 
> inconsistent
>       and non-integrous results. Various academics and companies have offered
>       workarounds and schemes to attempt to address this disconnect but they 
> all
>       come at significant compute complexity and cost. These issues increase
>       significantly with the volume of Bnodes in the overall graph being 
> queried.
>
>
>    - https://www.w3.org/2009/12/rdf-ws/papers/ws23
>          -
>          https://aidanhogan.com/docs/certain_answers_sparql_blank_nodes.pdf
>          - http://www.jsoftware.us/vol7/jsw0709-09.pdf
>
>
>    1. Bnodes also cause significant issues with semantic entailment
>       (ability to determine full semantic integrity and correctness) of the
>       graph. Entailment is required for any higher-order semantic inferencing 
> and
>       analysis. A couple of academics have offered papers that purport to
>       mathematically prove that entailment using Bnodes is NPComplete though
>       there is broad consensus that while it is likely possible it is almost
>       always impractical and problematic.
>
>
>
>
>
> Sean
>
> 
>
>


-=-=-=-=-=-=-=-=-=-=-=-
Links: You receive all messages sent to this group.
View/Reply Online (#5280): https://lists.spdx.org/g/Spdx-tech/message/5280
Mute This Topic: https://lists.spdx.org/mt/100489649/21656
Group Owner: [email protected]
Unsubscribe: 
https://lists.spdx.org/g/Spdx-tech/leave/2656181/21656/1901338254/xyzzy 
[[email protected]]
-=-=-=-=-=-=-=-=-=-=-=-


Reply via email to