Re: triple Indexing for Apps like Cimba

Brent Shambaugh Sun, 18 Jan 2015 10:56:54 -0800

I was reading through something I wrote and found that SwarmLinda is not P2P. 
(see the aggregation of data section, Tolksdorf ref 117) [1], but that could 
have its benefits.


I think I said it was on the sematic-web list when asking about finding a 
triple on the web (federated SPARQL in a distributed and decentralized way). 
Thanks again for your input (I am still going through it).

[1] http://bshambaugh.org/Master_17.html
[2] http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.110.2876

Sent from my iPhone
> On Jan 18, 2015, at 11:18 AM, Brent Shambaugh <[email protected]> 
> wrote:
> 
> Andrei (and others in the reply all?),
> 
> Last year you gave a talk about cimba.co at MIT. During the Q&A there was 
> some discussion about what sort of index or triple retrieval mechanism there 
> would be. Sandro Hawke put up the talk, which I linked to here [0]. I was 
> wondering if you came up with something.
> 
> Thanks for your time.
> 
> My thoughts:
> 
> From what I have read, it is difficult to index everything. The best you can 
> do is index triples that are "important"that will eventually lead you to less 
> important triples that you might want. 
> 
> Perhaps this is accomplished by some form of semantic clustering? Perhaps 
> this clustering is accomplished by some sort of distributed RDF store, such 
> as Swarm Linda [1]. Or perhaps this clustering is accomplished by only 
> indexing the names of linked data containers with some sort of description 
> about what they are about. Or perhaps, collections, which seem to have less 
> structure defined about what they are about and can exist (iirc) at multiple 
> Network nodes with different ownership, are described in some way and cleaned 
> up to be more query able using swarm intelligence provided by Swarm Linda, or 
> something similar like building a Folksonomy with Twitter tags [2]. I might 
> need to compare these more, but it seems you are looking at semantic and 
> syntactic similarities where the semantic similarities need some sort of 
> global reference to make things more manageable/possible.                     
>  
> For the index you either need some sort of centralized index or decentralized 
> index. If being a purist in decentralization is desired even YaCy won't do 
> since there are 4 nodes that are not decentralized [3]. Not knowing much, 
> there may be times when you want a centralized index. Perhaps P2P would 
> introduce too much latency and use too much bandwidth in the network. Perhaps 
> sometimes you want P2P because you are constructing a Mesh Network where you 
> might even want local versions of some ontologies because you are closed off 
> for some reason.  
> [0] 
> http://adistributedeconomy.blogspot.com/2014/12/links-to-building-social-applications.html?m=1
> [1] 
> http://www.mi.fu-berlin.de/inf/publications/techreports/tr2009/B-09-04/TR-B-09-04.pdf?1346662692
>  [2] 
> http://people.kmi.open.ac.uk/motta/papers/SpeciaMotta_ESWC-2007_Final.pdf     
>                                 
> [3] https://fedcsis.org/proceedings/2011/pliks/237.pdf
> 
>

Re: triple Indexing for Apps like Cimba

Reply via email to