Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Performance playground #183

Merged
merged 132 commits into from
Sep 20, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
132 commits
Select commit Hold shift + click to select a range
a21597c
Add a basic simple slice implementation with cython to compare perfor…
asmeurer Jul 3, 2024
6fa0f53
Ignore .c files in .gitignore
asmeurer Jul 3, 2024
a318185
Improve the performance of simple_slice_cython
asmeurer Jul 3, 2024
7d5e7ac
Fix time script to actually time cython
asmeurer Jul 3, 2024
a459c41
Print as soon as the times are available
asmeurer Jul 3, 2024
8bac965
Handle None again in simple_slice_cython
asmeurer Jul 3, 2024
762f6dd
Use timeit in time_slice
asmeurer Jul 3, 2024
38670d8
Optimize the cython implementation
asmeurer Jul 3, 2024
e0ac418
Don't include the overhead of generating random numbers in the timings
asmeurer Jul 3, 2024
930ec4f
Add raw, start, stop, and step to SimpleSlice
asmeurer Jul 3, 2024
7c1df0b
Add raw to Cython SimpleSlice
asmeurer Jul 3, 2024
7645d80
Add __repr__, __str__, and __eq__ to SimpleSlice
asmeurer Jul 3, 2024
0234f21
Rename the Cython SimpleSlice to CythonSimpleSlice
asmeurer Jul 3, 2024
744970f
Fix code
asmeurer Jul 3, 2024
5d349ec
Add __eq__ to SimpleSliceCython
asmeurer Jul 3, 2024
8abf6ea
Fix timing script
asmeurer Jul 3, 2024
b68cb05
Fix type checking in SimpleSliceCython
asmeurer Jul 3, 2024
e75376d
Add a test script for the simple slice implementations
asmeurer Jul 3, 2024
f3eb8da
Improve the performance of the type checks
asmeurer Jul 3, 2024
08821c0
Link directly against NumPy
asmeurer Jul 3, 2024
dc1ab69
Revert "Link directly against NumPy"
asmeurer Jul 3, 2024
70ec38f
Add pybind11 implementation of SimpleSlice
asmeurer Jul 3, 2024
608ed32
Add a SimpleSliceRust implementation
asmeurer Jul 3, 2024
15a9989
Fix __module__ for SimpleSliceRust
asmeurer Jul 3, 2024
f44b9d7
Use better time formatting in time_slice.py
asmeurer Jul 3, 2024
9079a1a
Some improvements to the pybind11 implementation
asmeurer Jul 3, 2024
a7e51d7
Add Cargo.lock to .gitignore
asmeurer Jul 3, 2024
60a2368
Update pybind11 implementation to use has_start trick
asmeurer Jul 11, 2024
94425d5
Use the has_start trick in the rust implementation
asmeurer Jul 11, 2024
d1ad217
Test the performance of adding reduce to a subclass
asmeurer Jul 11, 2024
ea72f73
Better class order in the timing script
asmeurer Jul 11, 2024
904a021
Better formatting in time_slice output
asmeurer Jul 11, 2024
890dfe3
Make pybind11 implementation faster
asmeurer Jul 11, 2024
f3090d2
Fix bool type checking in the pybind11 implementation
asmeurer Jul 11, 2024
cad231c
Implement != correctly in SimpleSliceCython
asmeurer Jul 11, 2024
3523097
Use SimpleSliceCython in Slice
asmeurer Jul 11, 2024
d7bde48
Remove unused code
asmeurer Jul 11, 2024
455db8d
Allow SimpleSliceRust to be subclassed
asmeurer Jul 11, 2024
b12dae3
Test that the slice classes can create objects from slices
asmeurer Jul 12, 2024
0385bcc
Use PyNumber_AsSsize_t instead of PyIndex_Check
asmeurer Jul 12, 2024
dd099a0
Compare against np.bool_ more efficiently in pybind11
asmeurer Jul 12, 2024
809fc49
Re-add missing methods
asmeurer Jul 12, 2024
47f1042
Compute args once in pybind11
asmeurer Jul 12, 2024
9bd0f00
Add versioned_hdf5_benchmark.py
asmeurer Jul 17, 2024
5bfd96e
Add cythonized version of crt.py
asmeurer Jul 23, 2024
d3ee33c
Update setup.py to compile Cython code in ndindex/
asmeurer Jul 23, 2024
65aab68
Restrict the size of inputs in test_crt
asmeurer Jul 25, 2024
d03e563
WIP subindex_helpers_cython work
asmeurer Jul 25, 2024
8b516a3
Add _crt_cython.pxd
asmeurer Jul 25, 2024
1f5ad86
Fix the subindex_slice ufunc to properly return three arrays
asmeurer Jul 26, 2024
893d46d
Remove cython imports in Python subindex code
asmeurer Jul 26, 2024
6210e71
Use cython subindex_slice
asmeurer Jul 26, 2024
4e187da
Avoid recomputing reduce() on already reduced (with shape=None) slices
asmeurer Jul 31, 2024
fc9f85d
Split the subindex_slice ufunc into a separate function
asmeurer Aug 1, 2024
a6f63bc
Add early exits for some common cases in _crt_cython.pyx
asmeurer Aug 2, 2024
9077a61
Debugging comment
asmeurer Aug 2, 2024
92e3744
Add pure Python SimpleTuple for testing
asmeurer Aug 5, 2024
a9322cb
Allow specifying a different conversion function in check_same()
asmeurer Aug 5, 2024
7da0c27
Add a Cython version of SimpleTuple and update tests
asmeurer Aug 5, 2024
751187c
Add time_tuple.py
asmeurer Aug 5, 2024
3195006
Add simple_tuple_cython.pyx
asmeurer Aug 5, 2024
84d9d13
Move simple_tuple_cython to ndindex/ and integrate it with Tuple
asmeurer Aug 5, 2024
6751213
Fix setup_simple_slice.py
asmeurer Aug 6, 2024
0f8ec04
Faster version of simple_tuple_cython that uses lazy imports
asmeurer Aug 9, 2024
fc395cf
Remove code from Tuple that should come from SimpleTupleCython
asmeurer Aug 9, 2024
82634b2
Add a benchmark for Tuple.raw
asmeurer Aug 9, 2024
a72bf95
Print generating inputs in time_tuple.py
asmeurer Aug 9, 2024
e171d23
Remove C array args handling from simple_tuple_cython
asmeurer Aug 9, 2024
a1f39ea
Correct SimpleTupleCython.__eq__
asmeurer Aug 9, 2024
63c44f2
Remove unnecessary numpy imports
asmeurer Aug 9, 2024
c66c321
Fix bug in simple_tuple_cython
asmeurer Aug 9, 2024
6e04761
Make subindex cython code use native types better
asmeurer Aug 9, 2024
823898c
Remove simple_slice_pybind11 and simple_slice_rust
asmeurer Aug 21, 2024
fef8a34
Remove time_slice and time_tuple and move them to the benchmarks
asmeurer Aug 21, 2024
297fa50
Remove test_slice and test_tuple
asmeurer Aug 21, 2024
06b5314
Remove Cargo.toml
asmeurer Aug 21, 2024
57557b0
Update .gitignore
asmeurer Aug 21, 2024
374e5c3
Move simple_slice_cython into ndindex/
asmeurer Aug 21, 2024
86378e8
Remove versioned_hdf5_benchmark.py
asmeurer Aug 21, 2024
7b20c31
Add Cython as a dev requirement
asmeurer Aug 21, 2024
b61e91e
Update tests GitHub Actions to build the package
asmeurer Aug 21, 2024
9cd83f3
Fix building the package
asmeurer Aug 21, 2024
1de1371
Enable verbosity on CI
asmeurer Aug 21, 2024
beaafbe
Restore comments to SimpleTupleCython
asmeurer Aug 21, 2024
c13c32b
Add more tests for the Tuple constructor
asmeurer Aug 21, 2024
e74f8d3
Disable the Cython subindex code for now
asmeurer Aug 21, 2024
6eea59d
Remove --full-trace from pytest.ini
asmeurer Aug 21, 2024
8fcd086
Fix detection of multiple ellipses in the Tuple constructor
asmeurer Aug 21, 2024
aa4ddab
Fix slotscheck
asmeurer Aug 23, 2024
4e5dd2c
Add sympy as a dev dependency
asmeurer Aug 23, 2024
bf93957
Remove Cythonized subindex code
asmeurer Aug 23, 2024
69c01eb
Fix pyflakes errors
asmeurer Aug 23, 2024
d4390f5
Merge branch 'main' into performance-playground
asmeurer Sep 9, 2024
0f32e93
Fix docs builds on CI
asmeurer Sep 9, 2024
0766c9a
Fix documentation cross-references to NDIndex.method
asmeurer Sep 9, 2024
ff4cf47
Rename NDIndexBase to NDIndexCommon and add a short docstring
asmeurer Sep 9, 2024
8252a92
Fix docs CI build
asmeurer Sep 9, 2024
67c5808
Fix docs CI build
asmeurer Sep 9, 2024
d275080
Add an example for coverage
asmeurer Sep 10, 2024
3720fa6
Avoid generating arrays with more than 32 dimensions in the tests
asmeurer Sep 10, 2024
a993ba1
Rename SimpleSliceCython and SimpleTupleCython to just _Slice and _Tuple
asmeurer Sep 10, 2024
974a0bf
Remove docs for experimental cythonization
asmeurer Sep 10, 2024
4e6dbd0
Remove unnecessary Cython compilation flags
asmeurer Sep 10, 2024
26f2178
Remove nogil compiler warning
asmeurer Sep 10, 2024
a927157
Reinstate some docstrings that were removed in Cythonization
asmeurer Sep 10, 2024
53a80d4
Clean up some comments
asmeurer Sep 10, 2024
ac75e2a
Merge branch 'main' into performance-playground
asmeurer Sep 10, 2024
934eac0
Make Slice.raw a little bit faster
asmeurer Sep 10, 2024
9f7262a
Small Cython optimization
asmeurer Sep 10, 2024
dd860ee
Fix Cython Slice __eq__ not being used
asmeurer Sep 10, 2024
be8e5f0
Enable coverage for the Cython files
asmeurer Sep 10, 2024
b5c8093
Add cibuildwheel action
asmeurer Sep 16, 2024
c38ea75
Add functioning setup.py clean
asmeurer Sep 16, 2024
ca97105
Test that importing ndindex works when building wheels
asmeurer Sep 16, 2024
06c47cd
Use consistent file naming
asmeurer Sep 16, 2024
7590669
Remove defunct CYTHONIZE_NDINDEX from publish-package.yml
asmeurer Sep 16, 2024
a0e42a6
Skip building wheels for pp39-manylinux_i686
asmeurer Sep 16, 2024
0114550
Use wheels from cibuildwheel in publish-package
asmeurer Sep 16, 2024
aec5612
NumPy should not be a build-time dependency
asmeurer Sep 16, 2024
a18ac66
Fix cibuildwheel test command
asmeurer Sep 17, 2024
a4c84d1
Try inverting the quotes in the test command
asmeurer Sep 17, 2024
75174bd
Fix publish-package
asmeurer Sep 17, 2024
fe3db77
Try to fix cython coverage on CI
asmeurer Sep 17, 2024
0565fcd
Move cibuildwheel into the publish-package action
asmeurer Sep 17, 2024
72963c9
Print something when Cython coverage support is enabled
asmeurer Sep 17, 2024
8045b9e
Fix setting Cython coverage on GitHub Actions
asmeurer Sep 17, 2024
3f4a4aa
Fix coverage testing in Python 3.12 on CI
asmeurer Sep 17, 2024
9edf699
Don't install dev versions of every dependency in the CI dev build
asmeurer Sep 17, 2024
65a5191
Do not download the wheel artifacts that should already be there
asmeurer Sep 17, 2024
0d57827
Fix publish-package syntax
asmeurer Sep 17, 2024
30b32e5
Handle test examples that index too many dimensions in a better way
asmeurer Sep 20, 2024
96e8e05
Fix coverage
asmeurer Sep 20, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 4 additions & 0 deletions .circleci/config.yml
Original file line number Diff line number Diff line change
Expand Up @@ -26,6 +26,10 @@ jobs:
command: |
cd docs
pip install -r requirements.txt
- run:
name: In-place build
command: |
python setup.py build_ext --inplace
- run:
name: Build docs
no_output_timeout: 25m
Expand Down
1 change: 1 addition & 0 deletions .coveragerc
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,7 @@ omit =
ndindex/_version.py
ndindex/tests/doctest.py
ndindex/tests/test_no_dependencies.py
plugins = Cython.Coverage

[report]
exclude_lines =
Expand Down
3 changes: 3 additions & 0 deletions .github/workflows/docs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,9 @@ jobs:
set -x
set -e
python -m pip install -r docs/requirements.txt
- name: In-place build
run: |
python setup.py build_ext --inplace

- name: Build Docs
run: |
Expand Down
34 changes: 27 additions & 7 deletions .github/workflows/publish-package.yml
Original file line number Diff line number Diff line change
Expand Up @@ -25,7 +25,7 @@ concurrency:
cancel-in-progress: true

jobs:
build:
build_sdist:
name: Build Python distribution
runs-on: ubuntu-latest

Expand All @@ -48,29 +48,49 @@ jobs:
- name: Install dependencies
run: python -m pip install -r requirements-dev.txt

- name: Build a wheel and a sdist
- name: Build an sdist
run: |
CYTHONIZE_NDINDEX=0 PYTHONWARNINGS=error,default::DeprecationWarning python -m build .
PYTHONWARNINGS=error,default::DeprecationWarning python -m build --sdist .

- name: Verify the distribution
run: twine check --strict dist/*

- name: List contents of sdist
run: python -m tarfile --list dist/ndindex-*.tar.gz

- name: List contents of wheel
run: python -m zipfile --list dist/ndindex-*.whl

- name: Upload distribution artifact
uses: actions/upload-artifact@v4
with:
name: dist-artifact
path: dist

build_wheels:
name: Build wheels on ${{ matrix.os }}
runs-on: ${{ matrix.os }}
strategy:
matrix:
# macos-13 is an intel runner, macos-14 is apple silicon
os: [ubuntu-latest, windows-latest, macos-13, macos-14]

steps:
- uses: actions/checkout@v4

- name: Build wheels
uses: pypa/[email protected]
with:
output-dir: dist/
env:
CIBW_TEST_COMMAND: 'python -c "import ndindex"'

- uses: actions/upload-artifact@v4
with:
name: cibw-wheels-${{ matrix.os }}-${{ strategy.job-index }}
path: ./wheelhouse/*.whl

publish:
name: Publish Python distribution to (Test)PyPI
if: github.event_name != 'pull_request' && github.repository == 'Quansight-Labs/ndindex'
needs: build
needs: [build_sdist, build_wheels]
runs-on: ubuntu-latest
# Mandatory for publishing with a trusted publisher
# c.f. https://docs.pypi.org/trusted-publishers/using-a-publisher/
Expand Down
23 changes: 19 additions & 4 deletions .github/workflows/tests.yml
Original file line number Diff line number Diff line change
Expand Up @@ -25,20 +25,35 @@ jobs:
if [[ ${{ matrix.numpy-version }} == 'latest' ]]; then
python -m pip install --upgrade numpy
elif [[ ${{ matrix.numpy-version }} == 'dev' ]]; then
python -m pip install --pre --upgrade --extra-index https://pypi.anaconda.org/scientific-python-nightly-wheels/simple numpy -r requirements-dev.txt
python -m pip install --pre --upgrade --extra-index https://pypi.anaconda.org/scientific-python-nightly-wheels/simple numpy
else
python -m pip install --upgrade numpy==${{ matrix.numpy-version }}.*
fi
- name: Test Installation
run: |
python -m pip install --verbose .
# COVERAGE_CORE=sysmon does not work with Cython. Since coverage is
# really slow without it (see
# https://github.com/nedbat/coveragepy/issues/1665), just omit Cython
# coverage in 3.12. Hopefully by the time it is our minimal version it
# will work with Cython, or trace coverage will be faster again.
- name: Set CYTHON_COVERAGE=1
run: echo "CYTHON_COVERAGE=1" >> $GITHUB_ENV
if: matrix.python-version != 3.12
- name: Disable Coverage Plugin
run: sed -i '/plugins = Cython.Coverage/d' .coveragerc
if: matrix.python-version == 3.12
- name: In-place build
run: |
python setup.py clean
python setup.py build_ext --inplace
- name: Run Doctests
run: |
./run_doctests
# A NumPy 2.0 compatible skimage doesn't support 3.9, and we need a
# version of matplotlib that requires NumPy 1.23. Easiest to just
# skip this for now.
if: matrix.numpy-version != 'dev' && matrix.python-version != '3.9' && matrix.numpy-version != '1.22'
- name: Test Installation
run: |
python -m pip install .
- name: Run Slotscheck
run: |
python -m slotscheck ndindex
Expand Down
4 changes: 4 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -146,3 +146,7 @@ dmypy.json
/scratch/

.DS_Store

# Cython
ndindex/*.c
ndindex/*.html
1 change: 1 addition & 0 deletions MANIFEST.in
Original file line number Diff line number Diff line change
@@ -1,5 +1,6 @@
include versioneer.py
include ndindex/_version.py
include ndindex/*.pyx
include LICENSE
include pytest.ini
include conftest.py
Expand Down
12 changes: 12 additions & 0 deletions benchmarks/slice.py
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,18 @@ def time_constructor_invalid(self):
except TypeError:
pass

def time_args(self):
self.s1.args
self.s2.args

def time_start(self):
self.s1.start
self.s2.start

def time_raw(self):
self.s1.raw
self.s2.raw

def time_reduce(self):
self.s1.reduce()
self.s2.reduce()
Expand Down
10 changes: 10 additions & 0 deletions benchmarks/tuple.py
Original file line number Diff line number Diff line change
Expand Up @@ -30,6 +30,16 @@ def time_constructor_invalid_array_broadcast(self):
except IndexError:
pass

def time_args(self):
self.t.args
self.t_arrays.args
self.t_boolean_scalars.args

def time_raw(self):
self.t.raw
self.t_arrays.raw
self.t_boolean_scalars.raw

def time_reduce(self):
self.t.reduce()

Expand Down
1 change: 1 addition & 0 deletions docs/api/internal.rst
Original file line number Diff line number Diff line change
Expand Up @@ -21,6 +21,7 @@ Base Classes

.. autoclass:: ndindex.ndindex.NDIndex
:members:
:inherited-members:

.. autoclass:: ndindex.array.ArrayIndex
:members:
Expand Down
21 changes: 0 additions & 21 deletions docs/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -77,27 +77,6 @@ their limitations:
ndindex can be installed using pip (`pip install ndindex`) or conda (`conda
install -f conda-forge ndindex`).

### Experimental Cythonization

When installing ndindex from source (using `python setup.py install`) all
Python modules (except tests) will be Cythonized when Cython and a working
compiler are installed. This can improve the general performance of ndindex.
The environment variable `CYTHONIZE_NDINDEX` is used to explicitly control
this default behavior:

- `CYTHONIZE_NDINDEX=0`: disables Cythonization (even if a
working Cython environment is available).

- `CYTHONIZE_NDINDEX=1`: force Cythonization (will fail when Cython or a
compiler isn't present).

- `CYTHONIZE_NDINDEX` not set: the default behavior (Cythonize if Cython is
installed and working).

Cythonization is still experimental, and is only enabled for direct source
installations. The pip and conda packages are not Cythonized. Future versions
of ndindex may enable Cythonization for all installs.

## Features

ndindex is still a work in progress. The following things are currently
Expand Down
6 changes: 4 additions & 2 deletions docs/requirements.txt
Original file line number Diff line number Diff line change
@@ -1,10 +1,12 @@
Cython
furo
linkify-it-py
matplotlib @ git+https://github.com/asmeurer/matplotlib.git@output-base-name
myst-parser
scikit-image
setuptools
sphinx
sphinx-autobuild
sphinx-copybutton
sphinx-reredirects
sphinx-design
sphinx-autobuild
sphinx-reredirects
11 changes: 2 additions & 9 deletions ndindex/_crt.py
Original file line number Diff line number Diff line change
Expand Up @@ -44,9 +44,7 @@
DAMAGE.
"""

from functools import reduce
from operator import mul
from math import gcd
from math import gcd, prod

def gcdex(a, b):
"""Returns x, y, g such that g = x*a + y*b = gcd(a, b).
Expand Down Expand Up @@ -93,11 +91,6 @@ def gcdex(a, b):

return (x*x_sign, y*y_sign, a)


# np.prod has overflow and math.prod is Python 3.8+ only
def prod(seq):
return reduce(mul, seq, 1)

def _crt(U, M):
"""
Chinese Remainder Theorem.
Expand Down Expand Up @@ -183,7 +176,7 @@ def combine(c1, c2):
a1, m1 = c1
a2, m2 = c2
a, b, c = m1, a2 - a1, m2
g = reduce(gcd, [a, b, c])
g = gcd(a, b, c)
a, b, c = [i//g for i in [a, b, c]]
if a != 1:
inv_a, _, g = gcdex(a, c)
Expand Down
Loading
Loading