Skip to content
View pei0033's full-sized avatar

Block or report pei0033

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
pei0033/README.md

Eunik Park | ML Engineer

LinkedIn Email

About Me

I am a 3-year experienced ML Engineer specializing in making AI models lightweight, fast, and efficient. My primary focus is on Efficient AI, particularly Quantization. I optimize various models (Vision, Audio, LLM) for mobile, GPU, and NPU platforms.

Skills

Python PyTorch TFLite ONNX
C++ vLLM TensorRT
TensorFlow Triton CUDA Android

Work Experience

ML Engineer @ SqueezeBits SqueezeBits Logo

06/2022 - Present

  • Optimizing models for target hardware & platforms
  • Enhancing performance-speed trade-offs through PTQ and QAT
  • Conducted benchmarking of vLLM and TensorRT-LLM serving

Internship @ LG CNS LG CNS Logo

07/2021 - 08/2021

  • Built AWS 3-tier web service using Terraform

Projects

owlite_logo OwLite

08/2023 - Present

[Website] [Github] [OwLite Examples]

  • Developed a framework for easy model quantization from PyTorch to TensorRT
  • Implemented various quantization algorithms and simulations
  • Produced various examples and identified optimization patterns

fistonchips_logo Fits-on-Chips

02/2024 - 06/2024

[Website]

Efficient Keyword Spotting Research

02/2024 - 06/2024

Education

POSTECH

  • Bachelor's in IT Convergence Engineering
  • 03/2016 - 09/2023

Changwon Science High School

  • 03/2014 - 02/2016

Pinned Loading

  1. SqueezeBits/owlite SqueezeBits/owlite Public

    OwLite is a low-code AI model compression toolkit for AI models.

    Python 45 5