Pierre Roudier wrote:
Dear list,

I am processing gridded data like DEMs, imported in R from geotiff
files. Of course, such data is regularly gridded, but very often with
NA values. In the framwork of some tests, I have to generate a
neighbourhood of the DEM, so that to extract local extrema of the
layer. For this task, I wrote a function based on  the knearneig()
function from the spdep package, with k=4 or k=8, to generate and
analyse the neighbourhood of each point.

Unfortunately, I often have to process big datasets (let's say grids
with between  50,000 and 350,000 points), and that's working but
knearneigh() takes *hours* to process.

Does anybody would have any suggestion to improve the efficiency of this step ?
the algorithm (knn.c file in the spdep sources) seems to compute all distances between all grid cells, and for each grid cell find the nearest k by some kind of insertion sort. That makes it general for all kinds of spatial data points, including irregularly spread ones, but makes it also inefficient (because O(n^2) with n the number of grid cells) for large data sets.

Suggestions to improve this would be to use an index structure (for the generic case of possibly irregularly spread data) such as the PR-bucket quadtree used in package gstat (nsearch.c), or specifically for gridded data to write a special knearneigh() that exploits the grid indexing already present: you can explicitly address the nearest 4/8/12/... neighbours by using row/col indexes. In this latter case you'd probably want to rewrite the test function for the full grid, instead of looping over the cells.
Thanks,

Pierre

_______________________________________________
R-sig-Geo mailing list
[email protected]
https://stat.ethz.ch/mailman/listinfo/r-sig-geo

--
Edzer Pebesma
Institute for Geoinformatics (ifgi), University of Münster Weseler Straße 253, 48151 Münster, Germany. Phone: +49 251 8333081, Fax: +49 251 8339763 http://ifgi.uni-muenster.de http://www.52north.org/geostatistics [email protected]

_______________________________________________
R-sig-Geo mailing list
[email protected]
https://stat.ethz.ch/mailman/listinfo/r-sig-geo

Reply via email to