Hi Josh.
We are interested in models for distributed computing that can
easily exist in very heterogeneous environments: from high-performance
computers, web-service servers, and desktop PCs down to phones and
other specialized network devices with few resources but,
interestingly, the lowest latency with regard to the so-called 'user'.
We are currently developing the Fhat processor in Java and for
various triple-store interfaces. What this means is that nothing will
run faster than Java, and on top of that, it will always be
constrained by the read/write speed of the triple-store. For triple-
stores like AllegroGraph, this is fast; for stores like Kowari... I
dunno---we will see when we benchmark the prototype. This is not a
high-performance computing paradigm so much as it is a distributed
computing (Internet computing) paradigm. Let me point you to this
related article by some Carnegie Mellon people:
http://isr.cmu.edu/doc/isr-ieee-march-2007.pdf
Would this new language make managing data and processes on multiple
computers easier to program for in a more general sense? How do we
make a network-based computer that gets us away from having to
worry about where a particular data set is, or where a
particular process is running? I know this is focused on the
Semantic Web, but can this help me deal with managing my many
overlapping data streams that I want available on any computer I
come in contact with, such as model output or, more importantly,
digital photos, MP3s, and videos?
If you do represent your data in RDF (which could be a resolvable URI
to some byte-stream---e.g. music, movies, images, etc.), then your
data is always present/accessible on the "Semantic Web" (in a triple-
store repository or pointed to by a URI in the RDF network that can
be resolved to a "physical" file). Furthermore, the execution of that
data is also on the Semantic Web (the RVM state is represented in
RDF). Let's say you want to move to computer B, but you have something
executing on your current computer A. Well, you just halt the RVM and
it's stored frozen in the Semantic Web (you can halt at the
instruction level, meaning in mid-method). Then you just move to
computer B and start the RVM process up again. It continues at the
last instruction you halted it at. To the RVM, time didn't stop. You
can move to different computers and always have the same applications
running where you left off---no sleeping or shutting down.
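The halt-and-resume idea can be sketched in plain Java (the language Fhat is being built in). Everything here is invented for illustration: the TripleStore, MachineState, and "rvm:" predicate names are stand-ins, not the actual Fhat design.

```java
import java.util.ArrayList;
import java.util.List;

// Toy sketch of halting an RVM into a triple-store and resuming it on
// another machine. No RDF library is used; the triple-store is just a
// list of (subject, predicate, object) rows.
public class FrozenRvm {

    static class TripleStore {
        List<String[]> triples = new ArrayList<>();
        void add(String s, String p, String o) { triples.add(new String[]{s, p, o}); }
        String find(String s, String p) {
            for (String[] t : triples)
                if (t[0].equals(s) && t[1].equals(p)) return t[2];
            return null;
        }
    }

    // The RVM state: a program counter and an operand stack.
    static class MachineState {
        int pc;
        List<Integer> stack = new ArrayList<>();
    }

    // Halt on computer A: write the state into the store at instruction level.
    static void freeze(MachineState m, TripleStore store, String rvmUri) {
        store.add(rvmUri, "rvm:pc", Integer.toString(m.pc));
        store.add(rvmUri, "rvm:stack", m.stack.toString());
    }

    // Resume on computer B: read the state back and continue where it left off.
    static MachineState thaw(TripleStore store, String rvmUri) {
        MachineState m = new MachineState();
        m.pc = Integer.parseInt(store.find(rvmUri, "rvm:pc"));
        String s = store.find(rvmUri, "rvm:stack");  // e.g. "[1, 2]"
        for (String tok : s.substring(1, s.length() - 1).split(", "))
            if (!tok.isEmpty()) m.stack.add(Integer.parseInt(tok));
        return m;
    }

    public static void main(String[] args) {
        TripleStore web = new TripleStore();         // stands in for the Semantic Web
        MachineState a = new MachineState();
        a.pc = 42;
        a.stack.add(1);
        a.stack.add(2);
        freeze(a, web, "urn:rvm:demo");              // halt on computer A, mid-method
        MachineState b = thaw(web, "urn:rvm:demo");  // start again on computer B
        System.out.println(b.pc + " " + b.stack);    // prints "42 [1, 2]"
    }
}
```

To the resumed machine, time never stopped: the program counter and stack are exactly where the halt left them.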
Again, your data, your computing machine (RVM), and your software
(triple-code) are all in the Semantic Web. It doesn't matter which
hardware device you are running the RVM process on (as long as the
RVM process code is written for that machine---that's why we are
building the Fhat process code in Java). Also, check this out. Assume
that your hardware CPU is VERY slow (let's say a mobile device). Well,
you need not use the mobile device's CPU to execute the RVM process.
You can leverage another CPU in the pool of Semantic Web hardware
devices to execute the code while the state changes are read by your
mobile device. Your mobile device is only an I/O device, not a "number
cruncher". You can have your home computer doing all the RVM process
code while your mobile device controls the flow of execution. Your
mobile device leverages the computing power of the desktop machine.
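A toy sketch of this division of labor: a "desktop" thread does the number crunching while the calling "mobile" side acts purely as an I/O device that controls the flow of execution and reads the result. Two threads in one JVM stand in for two machines sharing the Semantic Web; all names here are invented for the example.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

// The mobile side offloads work to a desktop thread and only reads the
// resulting state change; it never does the computation itself.
public class RemoteCpu {

    // Hand n to the desktop thread and wait for the answer.
    static int offload(int n) {
        BlockingQueue<Integer> requests = new LinkedBlockingQueue<>(); // mobile -> desktop
        BlockingQueue<Integer> results  = new LinkedBlockingQueue<>(); // desktop -> mobile
        Thread desktop = new Thread(() -> {
            try {
                int job = requests.take();
                results.put(job * job);  // the "expensive" computation
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        desktop.start();
        try {
            requests.put(n);             // mobile controls the flow of execution
            int out = results.take();    // mobile reads the state change
            desktop.join();
            return out;
        } catch (InterruptedException e) {
            throw new RuntimeException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println("mobile read: " + offload(12)); // prints "mobile read: 144"
    }
}
```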
However, there is a great price to pay for all this. Because
EVERYTHING is represented internal to the triple-store, there is a
great deal of reads/writes. The triple-store is the bottleneck. While
you can federate triple-stores (which is invisible to the end
applications), this is still the limiting factor. However, we are not
only developing Fhat, but also r-Fhat (reduced Fhat), which is an RVM
whose state is not represented in RDF. This does not provide the nice
portability seen with Fhat, but it does greatly reduce the number of
reads/writes to the triple-store. For this reason, I wouldn't pose
this as a "high-performance" computing paradigm.
(I haven't thought much about multi-threading, where you can have
multiple RVM processes executing a single RDF program, but I know it's
possible and will write something up about it as the logistics
solidify in my mind... In such cases you would want your RDF software
distributed across multiple triple-stores so as to avoid the read/
write bottleneck.)
Finally, because everything in the Semantic Web is a URI (of which a
URL is a subclass), the software you write is on the world stage.
This gets into the whole Semantic Web Services concept. You can have
instantiated objects or APIs (objects that need instantiating) that
just exist and can be leveraged by your software. There is no
downloading of .jars and classpath fiddling. It's all in one large
software repository called the Semantic Web.
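As a small aside, the URI/URL distinction can be illustrated with plain java.net.URI: every resource gets a URI name, but only some URIs (the URLs) also carry a network location you can actually dereference. The helper name below is invented for this sketch, not a Semantic Web API.

```java
import java.net.URI;

// Everything on the Semantic Web is named by a URI; a URL is the special
// case of a URI that can also be resolved over the network to bytes.
public class UriDemo {

    // A URI is a locator (a URL, roughly) only if it is absolute and
    // carries a host we could actually open a connection to.
    static boolean dereferenceable(URI u) {
        return u.isAbsolute() && u.getHost() != null;
    }

    public static void main(String[] args) {
        URI name = URI.create("urn:isbn:0451450523");      // a pure name
        URI locator = URI.create("http://www.friam.org/"); // a name and a location
        System.out.println(dereferenceable(name));     // false: nothing to fetch
        System.out.println(dereferenceable(locator));  // true: resolvable to bytes
    }
}
```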
Hope this clears things up... Please ask more questions. It helps me
to clear up my thoughts (again, this is all very new to me too :).
Take care,
Marko.
I think a Wednesday tech talk would be very welcome.
--joshua
---
Joshua Thorp
Redfish Group
624 Agua Fria, Santa Fe, NM
On Apr 26, 2007, at 8:01 AM, Marko A. Rodriguez wrote:
LANL is currently building a compiler and virtual machine that is
compliant with the specification in the paper. If RedFish is
interested, perhaps in a month or two, I could demo this computing
paradigm at a Wednesday tech session.
============================================================
FRIAM Applied Complexity Group listserv
Meets Fridays 9a-11:30 at cafe at St. John's College
lectures, archives, unsubscribe, maps at http://www.friam.org
Marko A. Rodriguez
Los Alamos National Laboratory (P362-proto)
Los Alamos, NM 87545
Phone +1 505 606 1691
http://www.soe.ucsc.edu/~okram