Re: Longitude Index: B-Tree or Cluster

Peter Bozek via 4D_Tech Tue, 14 Apr 2020 13:23:13 -0700

John,

 I do not think using text for dat that can be represented by number is a
good idea. It require more memory - do not forget that Unicode uses 2 bytes
for each character - and manipulation of bigger data structures is slower,
but, especially, string manipulation, including comparison, depends on a
locale, and is using a build-in ICU library. It means that, instead of
comparing two numbers, you have all the overhead of calling a library
function that applies potentially multi-pass comparison algorithm using
mapping tables to two strings. Definitely, much slower.

Second, regarding standard and cluster index: as I understand it - and I
may be wrong, 4D in general does not publish technical info - B-Trees
(there are several types of them) offer optimal performance when result of
search contains few records, can well do a range search, as traversing the
tree is reasonably fast, but may be less effective when returning a big
selection, as building a selection can be time consuming. How much slower
depends, but IMHO it will be still pretty fast, unless the selection is
huge.

Cluster index is suitable when the result of search is a (potentially
large) selection. While B-tree node stores a reference to a record, Cluster
index node stores a selection, so no selection building is needed. However,
if I would design such an index, I would use hash table for cluster index.
Hash table is much faster when you need to find one particular value, but
is totally unsuitable for range searches, as it cannot be effectively
traversed. That make a perfect sense for foreign key indexes - when loading
1-record, you have immediately a selection of N-records available, and you
(almost) never do a search for range of foreign keys (with UUID keys, such
search would not make any sense.)

So my advice would be to go for representation of position with reals and
use standard index. Although speed difference may be small for middle-size
data (how many locations there may be?)

HTH,

Peter Bozek

On Tue, Apr 14, 2020 at 9:17 PM John J Foster via 4D_Tech <
4d_tech@lists.4d.com> wrote:

> Hi Arnaud,
>
> I have split out the longitudes into a text and real component. I have
> formatted the longitudes as “000” and if west of GMT prefixed with “-“ and
> if east prefixed with “+“. I'll do something similar with latitudes and
> then try building some radial searches and see.
>
> Since all string is text is there a still a way to tell 4D that a field
> and thus an index is a fixed size?
>
> Anyway we shall see if theory and implementation matches!
>
> Appreciate,
> John…
>
>
>
> > On Apr 14, 2020, at 12:00 PM, 4d_tech-requ...@lists.4d.com wrote:
> >
> > From: Arnaud de Montard <arn...@init5.fr <mailto:arn...@init5.fr>>
> > Subject: Re: Longitude Index: B-Tree or Cluster
> > Date: April 14, 2020 at 2:23:26 AM PDT
> > To: 4D iNug Technical <4d_tech@lists.4d.com <mailto:4d_tech@lists.4d.com
> >>
> >
> >
> >
> >> Le 13 avr. 2020 à 19:36, Bernard Escaich via 4D_Tech <
> 4d_tech@lists.4d.com <mailto:4d_tech@lists.4d.com>> a écrit :
> >>
> >> Hi John,
> >>
> >> From the top of my head, indexes on long are less efficient than on
> text values ; I have a doubt, perhaps only for updating.
> >
> > I'd think the opposite, an index based on a "fixed length" field is
> always more efficient (search, update…) than one based on a "variable
> length" field.
> >
> > --
> > Arnaud de Montard
>
> **********************************************************************
> 4D Internet Users Group (4D iNUG)
> Archive:  http://lists.4d.com/archives.html
> Options: https://lists.4d.com/mailman/options/4d_tech
> Unsub:  mailto:4d_tech-unsubscr...@lists.4d.com
> **********************************************************************

-- 
--

Peter Bozek
**********************************************************************
4D Internet Users Group (4D iNUG)
Archive:  http://lists.4d.com/archives.html
Options: https://lists.4d.com/mailman/options/4d_tech
Unsub:  mailto:4d_tech-unsubscr...@lists.4d.com
**********************************************************************

Re: Longitude Index: B-Tree or Cluster

Reply via email to