API for supporting reef suitability assessments.
- Quick Start
- Setup
- Configuration
- JWT Authentication
- API Usage
- Development
- Performance Notes
- Reef Edge Alignment
- Docker Usage
- Troubleshooting
Start the API server with a configuration file:
using ReefGuideAPI
# To enable debug messages:
# ENV["JULIA_DEBUG"] = "ReefGuideAPI"
# If multiple threads are available, a parallel server will be spun up
ReefGuideAPI.start_server(".config.toml")
Ensure your local stack of reefguide is running. You can then do the below:
Setup env file:
cp .env.local .env
Then run the worker
] add DotEnv
using DotEnv
DotEnv.load!()
using ReefGuideAPI
ReefGuideAPI.start_worker()
The example below points to a local setup.
API_ENDPOINT=http://localhost:5000
AWS_REGION=ap-southeast-2
JOB_TYPES=SUITABILITY_ASSESSMENT,REGIONAL_ASSESSMENT,TEST
[email protected]
WORKER_PASSWORD=password
JULIA_DEBUG=ReefGuideAPI
CONFIG_PATH=config.toml
AWS_ACCESS_KEY_ID=minioadmin
AWS_SECRET_ACCESS_KEY=minioadmin
# For minio dropin
S3_ENDPOINT=http://localhost:9000
MINIO_USERNAME=minioadmin
MINIO_PASSWORD=minioadmin
Typically you would deploy a production worker using a separate process to manage the env variables. But the below shows an example .env file (noting we do not include the minio dropin)
API_ENDPOINT='https://web-api.reefguide.example.com'
AWS_REGION='ap-southeast-2'
JOB_TYPES=SUITABILITY_ASSESSMENT,TEST
[email protected]
WORKER_PASSWORD=<password>
POLL_INTERVAL_MS=5000
IDLE_TIMEOUT_MS=600000
JULIA_DEBUG=ReefGuideAPI
CONFIG_PATH=config.toml
# Run with Docker
docker run -p 8000:8000 -v ./data:/data/reefguide reefguide
# Or with Docker Compose
docker compose up reefguide-src
Initialize the project the usual way:
]instantiate
A TOML file should be defined indicating location of the MPA dataset. These are currently the files/data created in Step/Script 1a in https://github.com/open-AIMS/GBR-reef-guidance-assessment
[prepped_data]
PREPPED_DATA_DIR = "C:/some_path_to_data/MPA/"
[server_config]
TIFF_CACHE_DIR = "<some location to cache geotiffs>"
REGIONAL_CACHE_DIR = "<some location to cache regional datasets>"
DEBUG_MODE = "false" # Optional, disables file caching and displays debug logs
COG_THREADS = "2" # Optional, Number of threads to use when creating COGs (defaults to 1)
TILE_SIZE = "256" # Optional, tile block size to use (defaults to 256)
By convention, this file is named .config.toml
(note the leading .
).
The API can be additionally configured to expect a valid JWT in the Authorisation: Bearer <token>
header format.
Add the following to .config.toml
:
[jwt_auth]
# Enable JWT auth : bool true/false
JWT_ENABLED = true
# Which iss to validate for the JWTs?
JWT_ISS = "https://issuer.com"
# WKT JWKS endpoint where public key can be retrieved
WKT_ENDPOINT = "https://issuer.com/api/.well-known/jwks.json"
Pay attention to the issuer and wkt endpoints. The first should exactly match the expected JWT issuer claim. The second should be web-resolvable and return a WKT JSON which provides the public key.
- When JWT authentication is enabled, most API routes require a valid token
- The health check endpoint (
/health
) remains accessible without authentication - Tokens must be included in the Authorisation header format:
Authorization: Bearer <token>
- For testing purposes, you can disable authentication by setting
JWT_ENABLED = false
- Bump the version in
Project.toml
and commit - Identify the previous version
- Identify the intended version bump (new tag)
- Create tag
git tag v1.x.y -a
# Then fill in description
- Push to remote
git push origin --tags
- On GitHub - draft the new release targeting this as the new version
Example URL:
http://127.0.0.1:8000/assess/Cairns-Cooktown/slopes?Depth=-9.0:0.0&Slope=0.0:40.0
Example URL:
http://127.0.0.1:8000/tile/8/231/139?region=Cairns-Cooktown&rtype=slopes&Depth=-9.0:0.0&Slope=0.0:40.0&Rugosity=0.0:3.0
The steps below assumes you are in the project root.
Create a sandbox environment:
$ mkdir sandbox
$ cd sandbox
$ julia --project=.
(sandbox) julia> ]add Revise Infiltrator
(sandbox) julia> ]dev ..
Copy the quickstart to a file (e.g., dev_server.jl
).
Create the .config.toml
file and save to the sandbox directory.
Assuming VS Code is configured to default to the sandbox environment and start the Julia REPL at project root:
;cd sandbox
include("dev_server.jl")
Note that the server now caches the initially loaded spatial data in between server launches to reduce downtime. It will be necessary to restart the Julia session to reload spatial data.
The config setting COG_THREADS
controls how many threads should be requested when writing
out COGs. Ideally this will be set to at least 2 (preferably 4).
Higher values do seem to reduce write times but with diminishing returns (tested up to 8).
Locally, write times with four threads configured range from 10 to 15 seconds.
identify_edge_aligned_sites()
can be used to identify potential sites that only align with
the nearest reef edge (or specified rotations away from this angle).
This method works by identifying the closest edge of reef polygon geometries that have been
converted into lines.
The following processing is required before use:
- Reef polygons should be simplified (
GO.simplify()
) and buffered to avoid matching possibly inaccurate reef edges. - Simplified reef polygons should be provided as vertex-vertex lines with
polygon_to_lines()
. - Require raster of target pixels to search, and their indices (currently a vector of
CartesianIndices
for identifying search pixels). Usefindall(bool_search_raster)
to return pixel indices. - Raster of search pixels should be masked by reef polygons or simplified reef polygons.
- The target region name should be specified in GBRMPA format.
- E.g. "Townsville/Whitsunday Management Area" rather than "Townsville-Whitsunday".
- A parquet GeoDataFrame must be loaded and filtered for unsuitable pixels based on user criteria thresholds using a Dict and
within_thresholds()
. lons
andlats
columns (FLoat64) must be added to the GeoDataFrame.- E.g.
valid_pixels.lons = first.(GI.coordinates.(valid_pixels.geometry))
The column used for masking should be the same as the column specified as geometry_col inidentify_edge_aligned_sites
(default =:geometry
).
- E.g.
The ReefGuideAPI.jl package has an associated Dockerfile
and build/publish process. This means you can run an instance of the ReefGuideAPI.jl package without needing to compile/build it with a local Julia
installation. You will be able to view the latest published versions of the Docker image on the repository packages page.
The Julia version is specified by the JULIA_VERSION
arg. The full version is specified
to maintain build stability, but should be bumped to the latest version of Julia when a
release is published.
Due to how Julia (particularly v1.11) handles precompilation, it significantly reduces the build time by explicitly installing MKL_jll before installing any explicit project dependencies.
For this reason, the Dockerfile extracts the MKL_jll version from the Manifest file using Pkg.dependency(), precompiles this in an anonymous project, then compiles the main dependencies. This cuts the build time from around 15 minutes down to around 6-7.
To publish a new version of the Docker image:
- Bump version in Project.toml and PublishDockerImage.yml
- Create PR, merge to
main
branch - Publish Release on GitHub (this triggers the PublishDockerImage workflow)
The ReefGuideAPI.jl
package requires:
- a
config.toml
file and - a set of input data files
Please include these in a folder called data
in your working directory.
When running the commands below, it is assumed you have data
available locally with the required files.
Note: Due to how Docker excludes .
files, we have named the config file config.toml
in the data folder (not .config.toml
). This is required to launch the server.
Using Docker:
docker build . --target reefguide-src -t reefguide
Using Docker Compose:
docker compose build reefguide-src
# Or to build and run in one command:
docker compose up --build reefguide-src
Using Docker:
docker run -p 8000:8000 -v ./data:/data/reefguide reefguide
Using Docker Compose:
docker compose up reefguide-src
This will start a Julia shell where ReefGuideAPI
is compiled and ready for use:
docker run --rm --interactive --entrypoint="julia" --tty -v ./data:/data/reefguide reefguide
Then in the Julia REPL:
using ReefGuideAPI
ReefGuideAPI.start_server("/data/reefguide/config.toml")
The ReefGuide Job System is a job processing framework written in Julia. The system is designed to poll for available jobs from the reefguide-web-api, claim them, process them using appropriate handlers, and report results back to the API.
The job system is built around these core components:
- WorkerService: The main orchestrator that manages the job processing lifecycle
- JobHandler: A wrapper around the job registry which manages dispatching jobs to the proper registry handler, and reporting the results
- API Client: For communication with the ReefGuide API, automatically managing logins/token refreshing
- Job Registry: A centralized registry that maps job types to their handlers, including input and output payload configurations
The worker requires the following environment variables to be set:
Variable | Description | Example |
---|---|---|
API_ENDPOINT |
Base URL of the ReefGuide API | "https://api.reefguide.example.com/api" |
JOB_TYPES |
Comma-separated list of job types the worker should handle | "SUITABILITY_ASSESSMENT,TEST" |
WORKER_USERNAME |
API authentication username | "worker-service" |
WORKER_PASSWORD |
API authentication password | "secure-password" |
POLL_INTERVAL_MS |
(Optional) Polling interval in milliseconds | 2000 (default) |
IDLE_TIMEOUT_MS |
(Optional) Idle timeout in milliseconds | 300000 (default) |
AWS_REGION |
AWS region for S3 operations | "ap-southeast-2" |
ECS_CONTAINER_METADATA_URI_V4 |
ECS task metadata endpoint | Automatically set in ECS |
For local development, you can use the DotEnv.jl package to load environment variables from a .env
file:
using DotEnv
# Load environment variables from .env file
DotEnv.load!()
# Create and start worker
using ReefGuideAPI
ReefGuideAPI.start_worker()
Example .env
file:
API_ENDPOINT=http://localhost:8000
JOB_TYPES=SUITABILITY_ASSESSMENT,TEST
WORKER_USERNAME=local-dev
WORKER_PASSWORD=local-password
POLL_INTERVAL_MS=5000
IDLE_TIMEOUT_MS=600000
AWS_REGION=ap-southeast-2
The worker operates in a continuous loop:
- Poll for available jobs matching the configured job types
- If a job is found, claim it to get an assignment
- Process the job with the appropriate handler
- Report the result (success/failure + payload)
- Sleep for the configured poll interval
- Check for idle timeout and shut down if idle too long
The loop includes error handling to ensure that failures in one job don't affect the processing of subsequent jobs.
The worker will automatically shut down after being idle (no jobs processed) for the configured timeout period. This helps manage resources in cloud environments where workers can be dynamically scaled.
To add a new job type:
- Define a new value in the
JobType
enum inJobs.jl
- Create input and output type definitions that extend
AbstractJobInput
andAbstractJobOutput
- Implement a handler that extends
AbstractJobHandler
- Register the handler during application initialization
NOTE: the JobType, input and output should correspond to the typed interfaces in the reefguide-web-api project. This just provides Julia structs around the existing types. The API will reject improperly formed types.
Example:
# 1. Add to enum
@enum JobType begin
SUITABILITY_ASSESSMENT
TEST
# Add another
end
# 2. Define input/output types
struct NewJobInput <: AbstractJobInput
# Define input fields
parameter1::String
parameter2::Int
end
struct NewJobOutput <: AbstractJobOutput
# Define output fields
result::String
end
# 3. Implement handler
struct NewJobHandler <: AbstractJobHandler end
function handle_job(
::NewJobHandler, input::NewJobInput, storage_uri::String
)::NewJobOutput
# Implement job processing logic
result = process_data(input.parameter1, input.parameter2)
# Return result
return NewJobOutput(result)
end
# 4. Register in __init__
function __init__()
# Register existing handlers
register_job_handler!(
TEST,
TestHandler(),
TestInput,
TestOutput
)
# Register new handler
register_job_handler!(
NEW_JOB_TYPE,
NewJobHandler(),
NewJobInput,
NewJobOutput
)
end
In a Julia REPL, you can launch the worker using ReefGuideAPI.start_worker()
.
e.g.
# Set environment variables
export API_ENDPOINT=https://api.reefguide.example.com
export JOB_TYPES=SUITABILITY_ASSESSMENT,TEST
export WORKER_USERNAME=worker-service
export WORKER_PASSWORD=secure-password
# Start the Julia REPL with the ReefGuide module
cd sandbox
julia --project=.
pkg > dev ..
pkg > instantiate
julia > using ReefGuideAPI
julia > ReefGuideAPI.start_worker()
Issue: Server fails to start with configuration error
- Solution: Double-check that your config file has the correct path format for your OS (Windows uses backslashes, Unix uses forward slashes)
- Solution: Ensure all required directories specified in the config actually exist on your system
Issue: Data not found
- Solution: Verify the
PREPPED_DATA_DIR
path is correct and contains the required MPA dataset files - Solution: When using Docker, check that the volume mount path (
-v ./data:/data/reefguide
) is correct and thedata
directory contains your files
Issue: Docker container exits immediately after starting
- Solution: Check Docker logs with
docker logs <container-id>
to see the specific error - Solution: Verify that your
config.toml
is named correctly (not.config.toml
) in the data directory
Issue: "Permission denied" errors when accessing data directory
- Solution: Check file permissions on your data directory and ensure the Docker user has read/write access
Issue: Slow COG generation
- Solution: Increase the
COG_THREADS
value in your config (recommend 2-4) - Solution: Ensure the server has sufficient memory for processing (minimum 4GB recommended)
Issue: JWT token rejected as invalid
- Solution: Verify the token's issuer (
iss
claim) matches exactly what's in your config - Solution: Check that the token is not expired
- Solution: Ensure the
WKT_ENDPOINT
is accessible from the server
If you encounter issues not covered here, please:
- Check the Julia REPL output for specific error messages
- Enable debug logs with
ENV["JULIA_DEBUG"] = "ReefGuideAPI"
- Open an issue on the GitHub repository with a detailed description of the problem