<[EMAIL PROTECTED]>
>To: r-help@stat.math.ethz.ch
>Subject: Re: [R] newbie: new_data_frame <- selected set of rows
>Date: Fri, 1 Dec 2006 14:52:25 -0800 (PST)
>
>Two missing things:
>
> >distances
> [1] 13 14 10 11 2 4 6 1 3 9 8 12 7 5
>
>#numbers corres
Hi!
> distances <- order(distancevector(scaled_DB, scaled_DB['query',],
> d="euclid"))
Just compute the distances WITHOUT ordering, here. And then
> 1) create a small top_five frame
top = scaled_DB[rank(distances)<=5, ]
rank() is better for this than order() in case there are ties.
>
Two missing things:
>distances
[1] 13 14 10 11 2 4 6 1 3 9 8 12 7 5
#numbers correspond to rows in my_dataframe
> my_dataframe
V2 V3 V4
V5 V6
ENSP0354687 35660.45 0.04794521 0.05479452
0.06849315 0.07534247
ENSP0355046 38942
Hello,
this is probably trivial but I failed to find this
particular snippet of code.
What I got:
my_dataframe (contains say a 40k rows and 4 columns)
distances (vector with euclidean distances between a
query vector and each of the rows of my_dataframe)
What I do:
after scaling data my_datafram