Hi,

Currently rebuilding our infrastructure on new servers, I'm contemplating updating our stack to the state of the art (to be defined?).

So far we're using MapServer 7.6/GDAL 2.4 on Debian buster, with MapProxy 1.12 in front of it for some layers (not all). Our 25cm imagery is mostly stored in 4000px TIFFs (YCbCr, TILED, JPEG 90%, 3/4 levels of overviews, about 6-7MB per file); depending on datasets/layers/areas we have between 6,000 and 600,000 files, all stored locally. Many datasets are between 50 and 300GB.
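
For reference, tiles with those characteristics are produced with something along these lines (paths and exact parameters here are placeholders, not our actual scripts):

  gdal_translate -of GTiff \
    -co TILED=YES -co COMPRESS=JPEG -co JPEG_QUALITY=90 -co PHOTOMETRIC=YCBCR \
    source_tile.tif ortho_tile.tif
  gdaladdo -r average \
    --config COMPRESS_OVERVIEW JPEG --config PHOTOMETRIC_OVERVIEW YCBCR \
    --config INTERLEAVE_OVERVIEW PIXEL \
    ortho_tile.tif 2 4 8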

In MapServer, we use a layer GROUP to 'merge' 3 layers (a rough mapfile sketch follows the list):
* a layer using TILEINDEX (pointing at a PostGIS table generated with gdaltindex) below 1:25000, thus directly hitting the original tiles
* for upper scales, two layers pointing at 6m & 24m resamples of the same dataset over the complete area, stored in single-file TIFFs (with the same compression params; those resamples are between 200MB and a few GB per file)
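
For illustration, a minimal sketch of that structure (layer/table names, paths and the upper scale breakpoint are made up, this is not our actual mapfile):

  LAYER
    NAME "ortho_idx"
    TYPE POLYGON
    STATUS OFF
    CONNECTIONTYPE POSTGIS
    CONNECTION "host=localhost dbname=gis"
    # tile index generated with gdaltindex, the 'location' field holds the tile paths
    DATA "wkb_geometry from ortho_tileindex using unique ogc_fid using srid=2154"
  END

  LAYER
    NAME "ortho_tiles"
    GROUP "ortho"
    TYPE RASTER
    STATUS ON
    TILEINDEX "ortho_idx"
    TILEITEM "location"
    MAXSCALEDENOM 25000   # below 1:25000, hit the original 25cm tiles
  END

  LAYER
    NAME "ortho_6m"
    GROUP "ortho"
    TYPE RASTER
    STATUS ON
    DATA "/data/ortho_6m.tif"   # single-file 6m resample
    MINSCALEDENOM 25000
    MAXSCALEDENOM 100000        # made-up breakpoint to the 24m layer
  END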

So far performance is quite acceptable for end users (mostly QGIS consuming MapServer or MapProxy as WMS), but I'd like to eventually get rid of MapProxy (fewer cache handling/recompression/resampling issues, less storage, etc.).

I've of course looked at COG, as I'm able to convert most of my datasets to it. From my limited testing with GDAL 3.1.0 (now available in Debian testing), it only 'reorders' the existing metadata/overviews in a file if it's already compressed as JPEG (and rebuilds the overviews with 512px blocks instead of the default 128px I had so far), so from my understanding it wouldn't lossily 'recompress already-compressed data'.
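
For the record, the kind of conversion I've been testing is roughly this (file names are placeholders; the COG driver defaults to 512px blocks):

  gdal_translate -of COG -co COMPRESS=JPEG -co QUALITY=90 \
    ortho_tile.tif ortho_tile_cog.tif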

But I fail to see in which direction to go for MapServer:
- I've tried keeping the same mechanism with TILEINDEX; it still works and doesn't seem to have an impact on perf. I don't know if it would squeeze some extra perf out of reading the files, as GDAL might read 'less' from the TIFF if the metadata is COG-optimized, even when stored locally?
- I've tried building a huge (7MB) VRT for the dataset and pointing MapServer at it via DATA /path/to/vrt. That works too, and perf seems to be the same. Is it 'cleverer' than using TILEINDEX? I don't know.
- Should I rather build/use a huge single-file COG for the dataset, at its original resolution (25cm), and point MapServer at it like the upper-scale resamples? For a 5800km2 area, a regular JPEG-in-TIFF single file is about 17GB, with 6GB of external overviews.
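
Roughly, the commands behind those options look like this (table/layer/file names and paths are placeholders; with hundreds of thousands of tiles you'd feed a file list rather than a shell glob):

  # 1) regenerate the PostGIS tile index over the tiles
  gdaltindex -f PostgreSQL -lyr_name ortho_tileindex "PG:dbname=gis" /data/ortho/*.tif

  # 2) or build a single VRT over the whole dataset and point DATA at it
  gdalbuildvrt -input_file_list tile_list.txt /data/ortho.vrt

  # 3) or flatten everything into one big COG at native resolution
  gdal_translate -of COG -co COMPRESS=JPEG -co QUALITY=90 /data/ortho.vrt /data/ortho_cog.tif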

And of course, the same questions also apply to a similar dataset at 5cm resolution, hence much larger sizes.

As COG was meant to be used (among other things) via /vsicurl/, is there a point/improvement in pointing MapServer (or the VRT file) at the same files via /vsicurl/ (with a webserver in between, of course) rather than at local files? I.e. is GDAL as efficient at reading a local file header as it is at getting chunks from a /vsicurl/ URL? I've played with that scheme and it works, but I don't know if it really brings an improvement for users.
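
For clarity, the scheme I played with boils down to something like this (hostname/path made up; the CONFIG lines are the GDAL knobs usually recommended for /vsicurl/, not something I've benchmarked):

  MAP
    CONFIG "GDAL_DISABLE_READDIR_ON_OPEN" "EMPTY_DIR"
    CONFIG "CPL_VSIL_CURL_ALLOWED_EXTENSIONS" ".tif .vrt"
    ...
    LAYER
      NAME "ortho_cog_remote"
      TYPE RASTER
      STATUS ON
      DATA "/vsicurl/https://imagery.example.org/ortho/ortho_cog.tif"
    END
  END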

I get that COG + /vsicurl/ allows separating the storage from the actual MapServer process, but in my situation I'm in no hurry to change my infra in that direction unless it really brings perf improvements.

Sure, serving COG files via a webserver also allows nifty things like opening a remote VRT/TIF in QGIS and natively using files on a remote web server, which would be somewhat of an alternative to WMS (bringing all the shinies of having native files in the client), but not all users are ready yet for such modern concepts... and it doesn't allow setting scale limits server-side: if you open a VRT which points at 6000 images and zoom to the dataset extent, you get as many requests as there are files just to fetch their metadata, which is not very efficient.

All that to say: how are people handling large aerial datasets, with many files, served over WMS (because that's still the lowest common denominator) in 2020? Still using tile caches in front of MapServer?

--
Landry Breuil