gau-nernst

Thien Tran gau-nernst

87 followers · 74 following

Singapore
@gaunernst

Achievements

x3 x2

Achievements

x3 x2

gau-nernst/README.md

Hi there 👋

Pinned Loading

learn-cuda Public

Learn CUDA with PyTorch

Cuda 19 2
quantized-training Public

Explore training for quantized models

Python 17 1

908 contributions in the last year

Learn how we count contributions

Less

Activity overview

Contributed to pytorch/ao, gau-nernst/quantized-training, gau-nernst/rectified-flow and 59 other repositories

Contribution activity

March 2025

Created 17 commits in 4 repositories

Created 3 repositories

gau-nernst/gemma3-int4
This contribution was made on Mar 23
gau-nernst/vllm Python
This contribution was made on Mar 10
gau-nernst/gguf-pytorch Python
This contribution was made on Mar 5

Created a pull request in vllm-project/vllm that received 4 comments

Mar 12

[Bugfix][Kernel][CPU] Fix num_tokens in CPU rotary embedding kernel

num_tokens was computed incorrectly in the CPU kernel of rotary embedding. I found this issue while working on MLA kernel for CPU backend, which le…

+1 −1 lines changed • 4 comments

Opened 9 other pull requests in 3 repositories

menloresearch/cortex.cpp 4 merged 1 open

bugfix: add more stringent out-of-bounds checks for GGUF parser
This contribution was made on Mar 21
chore: remove python engine
This contribution was made on Mar 18
chore: small refactor for /models/pull
This contribution was made on Mar 18
chore: delete unused old download functions
This contribution was made on Mar 17
chore: Remove supported_engines check in CLI
This contribution was made on Mar 17

vllm-project/vllm 1 open 1 merged

[Kernel][CPU] CPU MLA
This contribution was made on Mar 13
[Bugfix][IPEX] Add VLLM_CPU_MOE_PREPACK to allow disabling MoE prepack when CPU does not support it
This contribution was made on Mar 12

menloresearch/ichigo 2 merged

fix: update config name in api/asr.py
This contribution was made on Mar 3
fix: Explicitly load checkpoint to CPU to avoid CUDA error
This contribution was made on Mar 3

Reviewed 6 pull requests in 2 repositories

menloresearch/cortex.cpp 4 pull requests

fix: prevent unlimited loop due to invalid filename in path
This contribution was made on Mar 20
feat: allow to configure api_keys by cli
This contribution was made on Mar 20
chore: remove python engine (#2146)
This contribution was made on Mar 20
fix: crash if invalid url is set
This contribution was made on Mar 18

vllm-project/vllm 2 pull requests

[Kernel][CPU] CPU MLA
This contribution was made on Mar 22
[Bugfix][IPEX] Add VLLM_CPU_MOE_PREPACK to allow disabling MoE prepack when CPU does not support it
This contribution was made on Mar 13

Created an issue in google-deepmind/gemma that received 6 comments

Mar 17

`gemma3-4b-it-int4` is does not contain `"mm_input_projection"`

I downloaded gemma3-4b-it-int4 from Kaggle https://www.kaggle.com/models/google/gemma-3/flax/gemma3-4b-it-int4 and load the checkpoint with the fol…

6 comments

Opened 5 other issues in 1 repository

menloresearch/cortex.cpp 4 open 1 closed

idea: Use llama.cpp HIP build for AMD GPUs
This contribution was made on Mar 22
chore: Retrieve engine from DB instead of model.yml
This contribution was made on Mar 21
idea: CLI should not access backend services directly
This contribution was made on Mar 21
idea: /v1/models/import should be file upload
This contribution was made on Mar 21
bug: /v1/files/ invalid filename will hang the server
This contribution was made on Mar 20

3 contributions in private repositories Mar 7 – Mar 12

	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec	Jan	Feb	Mar
Sun
Mon
Tue
Wed
Thu
Fri
Sat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Thien Tran gau-nernst

Achievements

Achievements

Block or report gau-nernst

Hi there 👋

Pinned Loading

908 contributions in the last year

Activity overview

Contribution activity

March 2025

Created a pull request in vllm-project/vllm that received 4 comments

[Bugfix][Kernel][CPU] Fix num_tokens in CPU rotary embedding kernel

Created an issue in google-deepmind/gemma that received 6 comments

`gemma3-4b-it-int4` is does not contain `"mm_input_projection"`

	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec	Jan	Feb	Mar
Sun
Mon
Tue
Wed
Thu
Fri
Sat

	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec	Jan	Feb	Mar
Sun
Mon
Tue
Wed
Thu
Fri
Sat

Thien Tran gau-nernst

Achievements

Achievements

Hi there 👋

Pinned Loading

908 contributions in the last year

Activity overview

Contribution activity

March 2025

Created a pull request in vllm-project/vllm that received 4 comments

[Bugfix][Kernel][CPU] Fix num_tokens in CPU rotary embedding kernel

Created an issue in google-deepmind/gemma that received 6 comments

gemma3-4b-it-int4 is does not contain "mm_input_projection"

`gemma3-4b-it-int4` is does not contain `"mm_input_projection"`

	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec	Jan	Feb	Mar
Sun
Mon
Tue
Wed
Thu
Fri
Sat