Skip to content

Conversation

@lhk
Copy link
Member

@lhk lhk commented Apr 15, 2024

Adding 3 different weight freezing approaches in an example script under: notebooks/embedding_wrapper.py.

The first wrapper uses a mask and ended up being equivalent to the code that Chris already had set up.
The second wrapper uses a hook. That's only recommended for debugging, so ultimately not useful for us. It helped me spot an error in the other code though, so I think the pattern is useful and I'm keeping it in for reference.
The third wrapper uses a custom autograd function. That may be interesting, since it's more lowlevel than the masking code.

Finally, this code contains a test setup which doesn't pull in hf / transformers/ etc. That's convenient for quick debugging.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants