[ https://issues.apache.org/jira/browse/SDAP-440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Riley Kuttruff updated SDAP-440: -------------------------------- Component/s: analysis > Switch handling of tile data to L2 format > ----------------------------------------- > > Key: SDAP-440 > URL: https://issues.apache.org/jira/browse/SDAP-440 > Project: Apache Science Data Analytics Platform > Issue Type: Task > Components: analysis > Reporter: Riley Kuttruff > Priority: Major > > In our current design, when tiles are loaded the data is formatted to be > shaped like gridded data (L3/L4). This is obviously fine for L4/L3 tiles. The > problem is with L2 (swath) tiles. The swath -> grid-like transform requires > transforming an m x n data array for the L2 tile to an (m * n) x (m * n) > array with the original data values occupying the diagonal of the array and > the rest of the array locations unused. It goes without saying that this is > EXTREMELY inefficient for memory, and L2 tile sizes >15x15 can very easily > consume memory by the gigabyte. > > Proposed solution: Instead of handling loaded tiles in gridded format, handle > them in swath format. This would remove the issues from having to transform > L2 tiles, but would still require expanding the latitude and longitude (and > time?) arrays to match shape with the data array. This would require > SIGNIFICANTLY less extra memory to do (I even believe numpy can do it with > constant extra memory rather than the expected O\(n\)). > The problem with this is that we would need to individually adapt each of > SDAP's algorithms to work with swath formatted data rather than grid > formatted data. The scale of this required change has caused us to hold off > on this implementation. > > Plan: I plan to mitigate that issue by adapting the NexusTileService to be > (temporarily) configurable to allow choice in how returned tile data is > formatted (default will be gridded). We can then roll out the changes for the > various algorithms and switch over their NTS to serve swath data. Upon > completion of the rollout, we can (optionally) remove the configuration > option from the NTS or switch its default to swath. -- This message was sent by Atlassian Jira (v8.20.10#820010)