- Singapore
- @gaunernst
Pinned Loading
908 contributions in the last year
Day of Week | March Mar | April Apr | May May | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | March Mar | ||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More
Activity overview
Contributed to
pytorch/ao,
gau-nernst/quantized-training,
gau-nernst/rectified-flow
and 59 other
repositories
Loading
Contribution activity
March 2025
Created 17 commits in 4 repositories
Created 3 repositories
-
gau-nernst/gemma3-int4
This contribution was made on Mar 23
-
gau-nernst/vllm
Python
This contribution was made on Mar 10
-
gau-nernst/gguf-pytorch
Python
This contribution was made on Mar 5
Created a pull request in vllm-project/vllm that received 4 comments
[Bugfix][Kernel][CPU] Fix num_tokens in CPU rotary embedding kernel
num_tokens
was computed incorrectly in the CPU kernel of rotary embedding. I found this issue while working on MLA kernel for CPU backend, which le…
+1
−1
lines changed
•
4
comments
Opened 9 other pull requests in 3 repositories
menloresearch/cortex.cpp
4
merged
1
open
-
bugfix: add more stringent out-of-bounds checks for GGUF parser
This contribution was made on Mar 21
-
chore: remove python engine
This contribution was made on Mar 18
-
chore: small refactor for
/models/pull
This contribution was made on Mar 18 -
chore: delete unused old download functions
This contribution was made on Mar 17
-
chore: Remove supported_engines check in CLI
This contribution was made on Mar 17
vllm-project/vllm
1
open
1
merged
-
[Kernel][CPU] CPU MLA
This contribution was made on Mar 13
-
[Bugfix][IPEX] Add
VLLM_CPU_MOE_PREPACK
to allow disabling MoE prepack when CPU does not support itThis contribution was made on Mar 12
menloresearch/ichigo
2
merged
-
fix: update config name in
api/asr.py
This contribution was made on Mar 3 -
fix: Explicitly load checkpoint to CPU to avoid CUDA error
This contribution was made on Mar 3
Reviewed 6 pull requests in 2 repositories
menloresearch/cortex.cpp
4 pull requests
-
fix: prevent unlimited loop due to invalid filename in path
This contribution was made on Mar 20
-
feat: allow to configure api_keys by cli
This contribution was made on Mar 20
-
chore: remove python engine (#2146)
This contribution was made on Mar 20
-
fix: crash if invalid url is set
This contribution was made on Mar 18
vllm-project/vllm
2 pull requests
-
[Kernel][CPU] CPU MLA
This contribution was made on Mar 22
-
[Bugfix][IPEX] Add
VLLM_CPU_MOE_PREPACK
to allow disabling MoE prepack when CPU does not support itThis contribution was made on Mar 13
Created an issue in google-deepmind/gemma that received 6 comments
gemma3-4b-it-int4
is does not contain "mm_input_projection"
I downloaded gemma3-4b-it-int4 from Kaggle https://www.kaggle.com/models/google/gemma-3/flax/gemma3-4b-it-int4 and load the checkpoint with the fol…
6
comments
Opened 5 other issues in 1 repository
menloresearch/cortex.cpp
4
open
1
closed
-
idea: Use llama.cpp HIP build for AMD GPUs
This contribution was made on Mar 22
-
chore: Retrieve
engine
from DB instead ofmodel.yml
This contribution was made on Mar 21 -
idea: CLI should not access backend services directly
This contribution was made on Mar 21
-
idea:
/v1/models/import
should be file uploadThis contribution was made on Mar 21 -
bug:
/v1/files/
invalid filename will hang the serverThis contribution was made on Mar 20
3
contributions
in private repositories
Mar 7 – Mar 12