You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
(Not strictly a stac-geoparquet issue, but just dumping this here for now)
I extracted a week's worth of sentinel-2 data from the PC's STAC API and wrote it out with deltalake.write_deltalake. This worked great.
Next, I wanted to try writing out something that was spatially partitioned, and that failed:
In [1]: importdeltalake, httpxIn [2]: token=httpx.get("https://planetarycomputer.microsoft.com/api/sas/v1/token/pcstacitems/items").json()["token"]
In [3]: table=deltalake.DeltaTable("az://items/sentinel-2-delta/data.delta", storage_options={"account_name": "pcstacitems", "sas_token": token}) # this is the table with the unpartitioned assets.In [4]: ds=table.to_pyarrow_dataset()
In [5]: dsOut[5]: <pyarrow._dataset.FileSystemDatasetat0x7ff50c54e080>In [6]: pa_table=table.to_pyarrow_table()
In [7]: deltalake.write_deltalake("/tmp/split.delta/", pa_table, engine="rust", partition_by=["s2:mgrs_tile"]) # eventually killed by my OS
I haven't looked into what's going on. There's a couple upstream issues in delta-rs about memory spikes, but nothing definitive.
The text was updated successfully, but these errors were encountered:
(Not strictly a stac-geoparquet issue, but just dumping this here for now)
I extracted a week's worth of sentinel-2 data from the PC's STAC API and wrote it out with
deltalake.write_deltalake
. This worked great.Next, I wanted to try writing out something that was spatially partitioned, and that failed:
I haven't looked into what's going on. There's a couple upstream issues in delta-rs about memory spikes, but nothing definitive.
The text was updated successfully, but these errors were encountered: