Re: [agi] Indexing

J. Andrew Rogers Sat, 27 Dec 2008 20:37:16 -0800


On Dec 26, 2008, at 7:40 PM, Jim Bromer wrote:

I noticed that neither linked lists nor arrays were particularly
efficient for general operations that would include insertions,
deletions and searches, which, when you think about it, are pretty
much the norm.  How often do you need a large data index that only
rarely needs to be searched?  The irony is that you cannot combine the
two forms in a simple manner so that you can have a linked list for
fast insertion and deletion and an array for fast searches.

There are data structures and algorithms that offer fast insert/delete and fast search, approximately constant computational complexity for both even. It does require slightly more cleverness than a linked list though since your glorified lookup table will require a space-preserving representation. It is much more common and usually simpler to merely use order-preserving representations like the common B+tree variants unless you have vast quantities of data. Brute-force can be exceedingly efficient in small doses.
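To make the tradeoff concrete, here is a minimal Python sketch, not taken from the original message. It contrasts a hash-based set (approximately constant-time insert, delete and search, but no ordering) with an order-preserving index. The `SortedIndex` class is a hypothetical stand-in for a B+tree: it keeps keys in a sorted list, so searches are logarithmic but inserts and deletes still shift elements, which a real B+tree avoids.

```python
import bisect


class SortedIndex:
    """Order-preserving index kept as a sorted list (a stand-in for a B+tree)."""

    def __init__(self):
        self._keys = []

    def insert(self, key):
        # Binary search for the position, then shift to make room: O(n) worst case.
        bisect.insort(self._keys, key)

    def delete(self, key):
        i = bisect.bisect_left(self._keys, key)
        if i < len(self._keys) and self._keys[i] == key:
            del self._keys[i]
            return True
        return False

    def contains(self, key):
        i = bisect.bisect_left(self._keys, key)
        return i < len(self._keys) and self._keys[i] == key

    def range(self, lo, hi):
        """Return all keys k with lo <= k < hi, in order."""
        i = bisect.bisect_left(self._keys, lo)
        j = bisect.bisect_left(self._keys, hi)
        return self._keys[i:j]


hashed = set()           # fast insert/delete/search, but cannot answer range queries
ordered = SortedIndex()  # slower updates, but keeps keys in order

for k in [42, 7, 19, 88, 3]:
    hashed.add(k)
    ordered.insert(k)

hashed.discard(19)
ordered.delete(19)
```

The range query is the part the hash-based structure cannot answer without scanning everything, which is one reason order-preserving structures remain the common default.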

And with indirect indexes (using a handle or an index to a location
entry) the data requires frequent compression (to squeeze out the gaps
in the data area) if there is heavy insertion and deletion.

An old, solved problem. Well, "solved" in the sense that the tradeoffs and methods for managing this are well-understood.
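One of the well-understood methods can be sketched roughly as follows. This is an illustrative example under assumptions, not something described in the original message: the `SlotTable` class and its method names are hypothetical. Deleted slots go on a free list and are reused by later inserts, which reduces how often compaction is needed. When compaction does run, only the handle-to-slot mapping changes, so handles held by callers stay valid.

```python
class SlotTable:
    """Handle-based storage with a free list and explicit compaction."""

    def __init__(self):
        self._slots = []       # data area; deleted entries leave gaps (None)
        self._free = []        # indices of gaps available for reuse
        self._handles = {}     # handle -> slot index (the indirection layer)
        self._next_handle = 0

    def insert(self, value):
        if self._free:
            idx = self._free.pop()      # reuse a gap instead of growing
            self._slots[idx] = value
        else:
            idx = len(self._slots)
            self._slots.append(value)
        handle = self._next_handle
        self._next_handle += 1
        self._handles[handle] = idx
        return handle

    def delete(self, handle):
        idx = self._handles.pop(handle)
        self._slots[idx] = None
        self._free.append(idx)

    def get(self, handle):
        return self._slots[self._handles[handle]]

    def slot_count(self):
        return len(self._slots)

    def compact(self):
        """Squeeze out gaps; only the handle mapping is rewritten."""
        live = sorted(self._handles.items(), key=lambda kv: kv[1])
        self._slots = [self._slots[idx] for _, idx in live]
        self._handles = {h: new for new, (h, _) in enumerate(live)}
        self._free = []


t = SlotTable()
a = t.insert("a")
b = t.insert("b")
c = t.insert("c")
t.delete(b)
d = t.insert("d")   # reuses the gap left by "b"
t.delete(a)         # leaves a gap at the front of the data area
t.compact()         # data area shrinks to the live entries only
```

The tradeoff is the extra lookup on every access in exchange for being free to move data around behind the handles.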

I believe the problem is directly related to agi because data relevant
to some particular situation will tend to be distributed in a file so
that a lot of relational indexing is needed.

Perhaps the most relevant application to AGI is that it would very significantly improve the computational complexity of representing and manipulating high-dimensionality relationships, particularly in distributed systems. In conventional data-mining and pattern discovery analytics, the lack of scalability of high-dimensionality representations has long been a major limitation on what one could do.

But for AI, consider algorithms like SIFT, which turn massive aggregates of 2-dimensional representations of 3-dimensional space (i.e. "photos") into a virtual model of the 3D space represented. A neat algorithm, but limited by the fact that the algorithm represents the data in a 128-dimensional space before reducing it to 3-dimensional space, limiting the amount of data you could apply as a practical matter.

Since a lot of data can
be described as being analogously similar to other kinds of data and
since many variations in some particular kind of data might already
exist in a database, a great many complicated modifications of
concepts could, hypothetically, be done by modifying the indexes
alone.

In an ideal system, the database relation *is* the index. External indexes are largely a software engineering artifact of only being able to represent one dimension per relation in a scalable manner.
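One well-known way to let a single ordered relation carry more than one dimension is a space-filling curve such as Z-order (Morton) encoding. The sketch below is only an illustration of that general idea and is not necessarily the representation the author had in mind. The function names, the 16-bit width and the sample points are all assumptions.

```python
def interleave2(x, y, bits=16):
    """Z-order (Morton) key: interleave the low `bits` bits of x and y."""
    key = 0
    for i in range(bits):
        key |= ((x >> i) & 1) << (2 * i)        # x bits go to even positions
        key |= ((y >> i) & 1) << (2 * i + 1)    # y bits go to odd positions
    return key


def deinterleave2(key, bits=16):
    """Recover (x, y) from a key produced by interleave2."""
    x = y = 0
    for i in range(bits):
        x |= ((key >> (2 * i)) & 1) << i
        y |= ((key >> (2 * i + 1)) & 1) << i
    return x, y


# A single sorted sequence of keys now stands in for a two-dimensional
# relation: points that are near each other in (x, y) tend to be near
# each other in key order, so no separate per-dimension index is kept.
points = [(3, 5), (0, 0), (2, 2), (7, 1)]
table = sorted(interleave2(x, y) for x, y in points)
```

The locality is only approximate, and it degrades as the number of dimensions grows, which is part of why high-dimensional indexing remains hard.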


J. Andrew Rogers



