spatial_graph provides a data structure for directed and undirected graphs,
where each node has an nD position (in time or space).
- support for arbitrary number of dimensions
 - typed node identifiers and attributes
- any fixed-length type that is supported by 
numpy 
 - any fixed-length type that is supported by 
 - efficient node/edge queries by
- ROI
 - kNN (by points / lines)
 
 - numpy-like interface for efficient:
- graph population and manipulation
 - query results
 - attribute access
 
 - minimal memory footprint
 - minimal dependencies
cython/witty/cheetah3for runtime compilation- numpy for array interfaces
 
 - PYX API for graph algorithms in C/C++
 
- graph algorithms
 - I/O
 - non-typed arguments
 - non-spatial graphs
 - out-of-memory support
 - networkx compatibility
 
Graph creation:
graph = sg.SpatialGraph(
    ndims=3,
    node_dtype="uint64",
    node_attr_dtypes={"position": "double[3]"},
    edge_attr_dtypes={"score": "float32"},
    position_attr="position",
)Adding nodes/edges:
graph.add_nodes(
    np.array([1, 2, 3, 4, 5], dtype="uint64"),
    position=np.array(
        [
            [0.1, 0.1, 0.1],
            [0.2, 0.2, 0.2],
            [0.3, 0.3, 0.3],
            [0.4, 0.4, 0.4],
            [0.5, 0.5, 0.5],
        ],
        dtype="double",
    ),
)
graph.add_edges(
    np.array([[1, 2], [3, 4], [5, 1]], dtype="uint64"),
    score=np.array([0.2, 0.3, 0.4], dtype="float32"),
)Query nodes/edges in ROI:
# nodes/edges will be numpy arrays of dtype uint64 and shape (n,)/(n, 2)
nodes = graph.query_nodes_in_roi(np.array([[0.0, 0.0, 0.0], [0.25, 0.25, 0.25]]))
edges = graph.query_edges_in_roi(np.array([[0.0, 0.0, 0.0], [0.25, 0.25, 0.25]]))Query nodes/edges by position:
nodes = graph.query_nearest_nodes(np.array([0.3, 0.3, 0.3]), k=3)
edges = graph.query_nearest_edges(np.array([0.3, 0.3, 0.3]), k=3)Access node/edge attributes:
node_positions = graph.node_attrs[nodes].position
edge_scores = graph.edge_attrs[edges].scoreDelete nodes/edges:
graph.remove_nodes(nodes[:1000])A SpatialGraph consists of three data structures:
- The 
Graphitself, holding nodes, edges, and their attributes (graphlite). - Two R-trees for spatial node and edge queries (based on rtree.c). We modified the original code to also include a fast kNN search.
 
spatial_graph compiles C/C++ code at runtime, and as such needs access to a
compiler. If you already have one, great! You can use the PyPI package.
If you (or your users) don't have a compiler installed, you either need to
- Install a compiler. This might be weird for non-technical users.
 - Install 
spatial_graphfromconda-forge, where we include a compiler (clang) in its dependencies. 
There is no cross-platform C/C++ compiler that we can install using pip.
numba is maybe the closest to having solved
that problem: numba does compile during runtime even if you don't have a
compiler locally installed. This works because numba is generating LLVM IR,
an intermediate representation language that LLVM can compile into machine
code. numba depends on llvmlite, which
provides a subset of the LLVM API, statically linked into the binaries in that
package. This is just enough to compile the numba generated LLVM IR into
machine code. We can't use this strategy, because we compile general C/C++
code. Converting that into LLVM IR is exactly what we need a compiler for.
To create a new release, tag the current commit with a
version number and push it to the upstream remote:
git tag -a "vX.Y.Z" -m "vX.Y.Z"
git push upstream --follow-tagsThis will trigger the CI workflow, which will build the package and upload it to PyPI.
To simulate a naive user environment, with no assumptions made about the
availability of a C/C++ compiler, you can run the included Dockerfile
(where the key part of the conda env is the compilers package):
docker build -t spatial_graph .
docker run --rm spatial_graph