Skip to content

rppadmakumar3/MachineHack_LLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 

Repository files navigation

MachineHack_LLM

Results

The project's optimization efforts have yielded impressive results:

  • 65% Speedup: The model's response time is significantly faster, making it suitable for real-time applications.

  • Improved Accuracy: The model's answers are more precise and contextually relevant.

Technologies Used

  • DistilBERT
  • Intel Extension for PyTorch
  • Intel Neural Compressor
  • Python

Demo Application

DEMO

Demo Video

Click Here to Watch Demo Video

Process Flow

TPF1RXen48Rl-nHpgGHI8b6fxG4E3G4YeJQ107a0rpiKArvxQu_JRf--7iSBc-xcjZl-v_Tyl-qRJy9Hg7IHFefYkh1QeoHO2X8m-ZIvcamcS7126ML-mXxf2Zxc8dhAjV6iK4SOfPx78BIY1ZRVlew1JcXWrA0V5m3JUrkYZdhUG5apuzfIHHSDjwlT0T8wLzdiJXcyff2sVK0iiVpVBtQ

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published