Le dim. 29 août 2021 à 23:59, Linas Vepstas <[email protected]> a écrit : > > Hi Amirouche, > > On Thu, Aug 19, 2021 at 1:20 AM Amirouche Boubekki > <[email protected]> wrote: >> >> >> If you deliver a set of json or sexp files that is relevant to >> opencog, I think about one terabyte or something like that, I can >> demonstrate a JSON / s-exp database. > > > I've been out of town. I can send you two. One will be a dump of (a portion > of) the agi-bio dataset. That dataset is itself just an import into the > atomspace of assorted external gene and protein databases. It's just "pure" > s-expressions, no truth values or counts on them. It's not a terabyte, its > probably much smaller than a gigabyte (I'll find out shortly) > > The other will be a natural language dataset. Here, each s-exp will have a > numerical count on it. It's the counts that matter. I have small, medium, > large versions of this. I'll send the small one, no point in struggling with > something huge.
That is wiser. Let me know where I can fetch the data, and whether the server must be behind a login and password. My server is located in Helsinki in Finland, and it is not encrypted so better keep secrets away from it. I think it will be easier for me to make sense of the natural language data, but anything sexp should do. > > The format will be "Atomese": Atoms in s-expressions are globally unique and > immutable and indexed (thus, searchable). Values in s-expressions are > fleeting, ephemeral, subject to change, and not indexed (thus, not searchable) > > --linas -- Amirouche ~ https://hyper.dev -- You received this message because you are subscribed to the Google Groups "opencog" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/opencog/CAL7_Mo9B-ZQMgCbyTAcQL23PwX50w-qwYqSekdRdaHP0ryGchQ%40mail.gmail.com.
