Add some notes on RasterIndex design #4

dcherian · 2025-04-09T22:25:14Z

benbovy · 2025-04-10T09:27:24Z

design_notes/raster_index.md

+    2. All possible transforms have **offsets** which means they need to be kept up-to-date during slicing.
+    3. Allow extracting metadata necessary to accurately represent the information on disk.
+
+## Some complications


Another complication is the special but very common case where the model space can be represented by orthogonal 1D coordinates (e.g., rectilinear affine with no rotation), which by design is not supported by the xarray.CoordinateTransform base class.

In corteva/rioxarray#846 the workaround consists in introducing an AxisAffineTransform(CoordinateTransform) subclass that wraps a 2D affine but handles one axis, as well as a AxisAffineTransformIndex(CoordinateTransformIndex) subclass. The latter allows providing useful features in the 1D case, such as automatically converting the index into a PandasIndex when indexing at arbitrary locations.

benbovy · 2025-04-10T09:48:44Z

design_notes/raster_index.md

+
+1.  Make it easy to read a GeoTIFF with CRS and raster -> model space transformation information in to Xarray with appropriate indexes. There are at least two indexes: one associated with the CRS; and one with the transformation.
+2.  The raster ↔ model transformation information can be ambiguous, so an explicit API should be provided.
+    1.  [RPCs](http://geotiff.maptools.org/rpc_prop.html):


I'm not familiar with RPCs, but I wonder if this wouldn't allow an Xarray Dataset with, e.g.,

x and y coordinates associated with a RasterIndex[AffineTransformIndex]

and

lat and lon coordinates associated with a RasterIndex[RPCIndex].

which would enable both ds.sel(x=..., y=...) and ds.sel(lat=..., lon=...).

However, I read that

The RPC model allows a row/column location to be computed for a given lat, long and height value. It is not inherently invertable, though it is usually possible to compute at lat,long location from a row, column and height value using iterative methods.

So IIUC CoordinateTransform.forward() would be trickier to implement. Maybe we could make the implementation optional? This would mean that lat, lon are virtual coordinates that cannot be materialized. But since the RPC model allows CoordinateTransform.inverse() it would still allow ds.sel(lat=..., lon=...).

This would mean that lat, lon are virtual coordinates that cannot be materialized.

Yes, that's right. I think this may still work with the current model we have, perhaps we need to override the repr.

benbovy · 2025-04-10T09:53:08Z

design_notes/raster_index.md

+Each of the wrapped index has an associated transform:
+```python
+@dataclass
+class RasterTransform:


Is this class meant to inherit from xarray.CoordinateTransform or is it a different concept?

It's a different concept that represents which of the 5 ways the transformation is recorded in GeoTIFF metadata.

benbovy · 2025-04-10T10:07:41Z

That looks great @dcherian. I left a few comments mainly about how to build RasterIndex on top of Xarray coordinate transform, although I'm not sure that these are really relevant here since you seem to focus the notes on how to pass information from/to metadata.

benbovy · 2025-04-11T12:19:33Z

design_notes/raster_index.md

+2.  ModelTransformationIndex ↔ ModelTransformationTag
+3.  ModelTiepointScaleIndex ↔ ModelTiepointTag + ModelPixelScaleTag
+4.  GCPIndex ↔ Ground Control Points
+5.  RPCIndex ↔ Rational Polynomial Coefficients


Apparently a 6th possible one could be based on subsampled auxiliary coordinates, detailed in CF section 8.3 and equivalent to GDAL's geolocation arrays with PIXEL_STEP and/or LINE_STEP > 1.

I think that a decode/encode workflow strategy makes a lot of sense in this case.

Taking the example 8.3 from CF section 8.3, the decode step may consist in:

turn the tie point coordinate variables lat(tp_yc, tp_xc) and lon(tp_yc, tp_xc) into lat(yc, xc) and lon(yc, xc) Xarray coordinates associated with a custom transformation index that stores only the tie points. In other words, uncompress the dimensions of the lat & lon coordinates without uncompressing their data.

also remove the tie point index variables and the interpolation variable, and track their data / metadata internally in the index

The encode step would then consist in restoring the compressed tie point coordinate & index variables as well as the interpolation variable.

Ah nice, i agree completely

Co-authored-by: Benoit Bovy <[email protected]>

dcherian changed the title ~~Add some notes~~ Add some notes on RasterIndex design Apr 9, 2025

dcherian force-pushed the raster-index-nots branch 11 times, most recently from aa17148 to 3118ca1 Compare April 9, 2025 22:52

Add some notes

07acaf2

dcherian force-pushed the raster-index-nots branch from 3118ca1 to 07acaf2 Compare April 10, 2025 02:31

benbovy reviewed Apr 10, 2025

View reviewed changes

benbovy reviewed Apr 11, 2025

View reviewed changes

dcherian and others added 3 commits April 15, 2025 15:51

Add example notebook

151162b

Add RasterIndex (affine) (#5)

56586cf

Update

e8d21c1

Co-authored-by: Benoit Bovy <[email protected]>

dcherian force-pushed the raster-index-nots branch from 0097974 to e8d21c1 Compare April 15, 2025 21:54

dcherian merged commit 633698f into main Apr 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add some notes on RasterIndex design #4

Add some notes on RasterIndex design #4

Uh oh!

dcherian commented Apr 9, 2025 •

edited

Loading

Uh oh!

benbovy Apr 10, 2025

Uh oh!

benbovy Apr 10, 2025

Uh oh!

dcherian Apr 10, 2025

Uh oh!

benbovy Apr 10, 2025

Uh oh!

dcherian Apr 10, 2025

Uh oh!

benbovy commented Apr 10, 2025

Uh oh!

benbovy Apr 11, 2025

Uh oh!

benbovy Apr 11, 2025 •

edited

Loading

Uh oh!

dcherian Apr 15, 2025

Uh oh!

Uh oh!

Add some notes on RasterIndex design #4

Add some notes on RasterIndex design #4

Uh oh!

Conversation

dcherian commented Apr 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

benbovy Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

benbovy Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

dcherian Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

benbovy Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

dcherian Apr 10, 2025

Choose a reason for hiding this comment

Uh oh!

benbovy commented Apr 10, 2025

Uh oh!

benbovy Apr 11, 2025

Choose a reason for hiding this comment

Uh oh!

benbovy Apr 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dcherian Apr 15, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

dcherian commented Apr 9, 2025 •

edited

Loading

benbovy Apr 11, 2025 •

edited

Loading