update 2 #2

Open
wants to merge 88 commits into lv-ext2

Conversation

jonas208 (Owner)

No description provided.

wsmoses and others added 30 commits September 26, 2023 17:43
* Fix grad conv im2col

* Also fix depthwise

* Enable previously broken tests

* Revert "Enable previously broken tests"

This reverts commit d648fdd.

* Add explicit im2col test

* Fix and test third case

* More tests now pass
* unbreak gpu conv test

* cleanup
* Add EnzymeRule for conv

* Fix

* Add missing file

* Change to enzymecore ext

* attempt fix

* Finish enzymecore rewrite

* Add missing file

* Also add gather

* Additional functions, tests, and fixes

* minor fixup

* Add pooling

* Add dropout

* Fix scatter bug

* fix pool

* More fixups

* fix up pool

* split conv/depth

* Cleanup rule

* Fix minor test bug

* Fix depthwise conv

* Fix typo

* Bound tests

* Failing to extension

* Add file

* Address review

* Remove inlining
* Fix typo in version comparison

* Allow non-Int64 indices in scatter

* Disable Enzyme for AMDGPU

* Refactor
… existing compat) (#549)

Co-authored-by: CompatHelper Julia <[email protected]>
* AbstractGPUArray -> AnyGPUArray

* tests

* don't test Enzyme

* add test on discontinuous view

* Update test/runtests.jl
* Set compatibilities for standard packages

* Update Project.toml

* Update Project.toml

* Update Project.toml

* Update Project.toml

* Uncomment Random

---------

Co-authored-by: Anton Smirnov <[email protected]>
* Bump AMDGPU compat to 0.8

* Bump AMDGPU CI to 1.10
* Fixes #555

This fixes issue #555, where an `Int`-typed dimension argument needs to be converted to a `Tuple` (the forwarding idiom is sketched below).

* Add test for PR #556
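For context, the forwarding idiom behind that fix, as a hypothetical sketch (the function here is a placeholder, not the actual code from #556):

```julia
# Hypothetical example of the Int -> Tuple pattern: define the Tuple method
# once, and forward a scalar dimension as a 1-tuple.
reduce_dims(x, dims::Tuple) = dropdims(sum(x; dims); dims)
reduce_dims(x, dims::Int)   = reduce_dims(x, (dims,))

reduce_dims(ones(2, 3), 1)  # same result as reduce_dims(ones(2, 3), (1,))
```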
* Add rrule for `oftf`

Otherwise diffing with Zygote is type unstable.
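A minimal sketch of such a rule, assuming NNlib's definition `oftf(x, y) = oftype(float(x), y)`; the rrule actually added in this commit may handle tangent projection differently.

```julia
using ChainRulesCore

oftf(x, y) = oftype(float(x), y)   # NNlib's helper: convert y to x's float type

# Sketch: the conversion is linear in y, so the pullback passes the
# cotangent straight through and reports no tangent for x.
function ChainRulesCore.rrule(::typeof(oftf), x, y)
    oftf_pullback(Δ) = (NoTangent(), NoTangent(), Δ)
    return oftf(x, y), oftf_pullback
end
```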
* Use julia-actions/cache

This does everything automatically for us and should speed up CI times.
* Add dependencies

* Add code

* Add test

* clear docs

* Fix FiniteDifferences.fdm_central

* Include Refactoring

* Code cleaning and fix adjoint for trivial rotations

* Code cleaning

* Fix bug with even and odd arrays and trivial rotations

* First parts of test are generalized, not gradients yet

* Add gradtests, some fail

* Tests working and subtle bug fixed for trivial rotations

* Fix space before && [skip ci]

* Add documentation and fix issue with FillArrays

* Fix function name

* Fix size(arr)

* Relax some tests since they failed on CUDA

* Test with rel error of 1f-2

* Refine rotation tests

* Introduce show statement for buildkite

* Remove show statement, introduce even test case again

* Rename midpoint to rotation_center and change rounding (see the usage sketch after these commits)

* Add more tests, nearest neighbour fails sometimes

* Lower tolerance tests

* Proper error handling

* Add underscore _ to internal methods. Clean docs

* Change to Float64 test

* Lower testing accuracy for Float64

* Revert "Lower testing accuracy for Float64"

This reverts commit fcba1c3.

* Rerun CI

* Fix typo, rerun CI

* Improve docstring [skip ci]
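A hedged usage sketch of the rotation feature these commits build up. The function name `imrotate`, the `:nearest` method and the `rotation_center` keyword follow the commit messages; the exact released signature may differ.

```julia
using NNlib

arr = zeros(Float32, 6, 6, 1, 1)   # WHCN layout
arr[3, 4, 1, 1] = 1f0

rot  = NNlib.imrotate(arr, deg2rad(90))   # default interpolation
near = NNlib.imrotate(arr, deg2rad(90); method = :nearest,
                      rotation_center = size(arr)[1:2] ./ 2 .+ 0.5)
```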
In test mode, the CUDA cuDNN implementation of batchnorm did not
match the CPU batchnorm in Flux. In Flux, with track_stats=false, the
mean and variance of the current batch are used. Here, mean and variance
were initialized to 0 and 1, respectively, and passed to
cudnnBatchNormalizationForwardInference.

To fix this, we need to compute the mean and variance over the current
batch to match the CPU implementation. Unfortunately,
cudnnBatchNormalizationForwardInference requires a trained running mean
and variance. However, without tracked stats, batchnorm in train and test
mode should be identical, since both normalize over the current batch. As
a result, we can use cudnnBatchNormalizationForwardTraining in test mode
as well, which works without a running mean and variance.
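A minimal CPU sketch of the behaviour being matched (a hypothetical reference helper, not the cuDNN code path): with track_stats=false, test mode normalizes with the statistics of the current batch, exactly as training does.

```julia
using Statistics

# Hypothetical reference for a 4D WHCN input: with track_stats=false,
# train and test mode both use batch statistics, so they agree.
function batchnorm_notrack(x::AbstractArray{T,4}, γ, β; ϵ = T(1e-5)) where {T}
    dims = (1, 2, 4)                                  # reduce over all but channels
    μ  = mean(x; dims)                                # batch mean, no running mean
    σ² = var(x; dims, mean = μ, corrected = false)    # batch variance
    x̂  = (x .- μ) ./ sqrt.(σ² .+ ϵ)
    return reshape(γ, 1, 1, :, 1) .* x̂ .+ reshape(β, 1, 1, :, 1)
end
```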
avik-pal and others added 30 commits August 28, 2024 23:27
remove `NNPACK` and move `ForwardDiff` to an extension
* Bump patch version to release new Enzyme support

* chore: force latest Enzyme install [DON'T MERGE]

* revert: "chore: force latest Enzyme install [DON'T MERGE]"

This reverts commit 16f8075.

---------

Co-authored-by: Avik Pal <[email protected]>
* Fix backtick

* Update language highlight tags for fenced code samples

* Update whitespace

* Use TeX primitives

* Update reference links

* Remove duplicate backticks

* Fix admonition block

* Add backticks
- Use `1.10` in Buildkite CI (and `nightly` for CUDA).
- Use `lts`, `1` and `pre` in GitHub CI.
- Add compat GPUArraysCore for `0.2`.
- Bump ChainRulesCore to `0.25`.
* Add SpecialFunctions as dependency

* add full `gelu`, change old `gelu` -> `gelu_fast`

* Add tests and docs

* Change names: `gelu_fast` -> `gelu`, `gelu` -> `gelu_full`

* Rename gelus and use OpenLibm_jll instead of SpecialFunctions.jl for erf

* Create NNlibSpecialFunctionsExt for gelu_erf (see the sketch after these commits)

* specific import in SpecialFunctions extension

* add gelu export to list of aliases
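For reference, a hedged sketch of the two variants named here; the released implementations (and the OpenLibm_jll binding for `erf`) may differ in detail.

```julia
using SpecialFunctions: erf

gelu_erf(x)  = x * (1 + erf(x / sqrt(2))) / 2                          # exact GELU
gelu_tanh(x) = x * (1 + tanh(sqrt(2 / π) * (x + 0.044715 * x^3))) / 2  # tanh approximation

gelu_erf(1.0)   # ≈ 0.8413, i.e. 1 * Φ(1)
```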
We mistakenly did not register v0.9.28.
…01-01-26-30-687-03248299932

CompatHelper: add new compat entry for SpecialFunctions in [weakdeps] at version 2, (keep existing compat)
Add type alias for gelu -> gelu_tanh
#633)

* don't spawn if only one job

* fix what appears to be a typo

* revertme: temporary test

* fix check

* rm test

* add NNlib.ALLOW_THREADING control (see the sketch after these commits)

* use ScopedValues

* use `@with` to avoid new scope

* rename do_work functions

* add note

* v0.9.29
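A hedged sketch of the threading switch described in these commits, assuming Julia 1.11's `Base.ScopedValues` (the ScopedValues.jl package provides the same API on older versions); the names follow the commit messages, not necessarily the released code.

```julia
using Base.ScopedValues: ScopedValue, @with

const ALLOW_THREADING = ScopedValue(true)

function maybe_threaded(f, n)
    # Don't spawn if threading is disabled or there is only one job.
    if ALLOW_THREADING[] && n > 1 && Threads.nthreads() > 1
        Threads.@threads for i in 1:n
            f(i)
        end
    else
        for i in 1:n
            f(i)
        end
    end
end

# `@with` rebinds the ScopedValue for the dynamic extent of one call,
# without introducing a new hard scope:
@with ALLOW_THREADING => false maybe_threaded(println, 4)
```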