This is the central repository for the OSSBIG AI engineering team.
- Benchmark reference: https://nicholas.carlini.com/writing/2024/my-benchmark-for-large-language-models.html
- New publication: https://opencodeinterpreter.github.io/#example
- GPT from NumPy: https://jaykmody.com/blog/gpt-from-scratch/
Very informative videos about the internals of an LLM: a real step-by-step walkthrough where everything is explained in detail.
- Let's build the GPT Tokenizer: https://youtu.be/zduSFxRajkE?si=oiY3IYLJjNB3nR86
- The spelled-out intro to language modeling: building makemore: https://youtu.be/PaCmpygFfXo?si=Xo-7vP22l-I6l028
- Building makemore Part 2: MLP: https://youtu.be/TCH_1BHY58I?si=Ozu9nNz6EAiBMAFP
- Building makemore Part 3: Activations & Gradients, BatchNorm: https://youtu.be/P6sfmUTpUmc?si=qNfH2TI5QzDh4gVa
- Building makemore Part 4: Becoming a Backprop Ninja: https://youtu.be/q8SA3rM6ckI?si=_50lnWJpjG4iwknc
A collection of links to the code and data generated during the OSSBIG research.
- Modified IntelliJ IDEA plugin (branch KRM) to work with LLMs. Built versions are in the OSSBIG cloud: https://github.com/mankra/llm-intellij
- Source of the following experiments: https://huggingface.co/blog/personal-copilot
- Finetuning code repository (data preparation, training): https://github.com/mankra/finetune
- Full finetuning notebook for endless-sky: https://colab.research.google.com/drive/1d99cUGdeHpu06pCMT-ioapw97EkR7ha1#scrollTo=zqtjrruXXKu9
- Full finetuning notebook for fltk: https://colab.research.google.com/drive/1u7PLPCE3bKHzhkSdxhcrFqko-k26pPfw#scrollTo=zqtjrruXXKu9
- PEFT finetuning notebook for endless-sky (a minimal LoRA setup is sketched after this list): https://colab.research.google.com/drive/18XqYBVkJv9lqwrnWuoI06ZUxQUuwiR1U#scrollTo=4AXJGt-ag4l5
- PEFT finetuning notebook for fltk: https://colab.research.google.com/drive/1dXruWXemEYI7srTbNAjEe_ESEvRFGBBT#scrollTo=v8MG0g-eEW9X
- Merge PEFT layer with the base model (see the merge sketch after this list): https://colab.research.google.com/drive/1GWCLdRmZNXh6CbOMAwou4FRQCe19K-8x#scrollTo=dOIeFZfbiQmz
- Weights & Biases (wandb) finetuning results (the manfredkral9 at gmail account is needed to view them): https://wandb.ai/ossbig/huggingface
- CUDA acceleration in C: https://colab.research.google.com/drive/1N50mFGvlg0CFGDONdsS0ZQL2NyJEAMMt#scrollTo=HLdoj4cz-xal
- CUDA code repository (branch CUDA): https://github.com/mankra/llama2.c
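
For orientation, the LoRA setup used in the PEFT notebooks has roughly the shape sketched below. The hyperparameters and target modules are assumptions; the exact values live in the Colab notebooks and the finetune repository.

```python
# Minimal LoRA finetuning sketch (assumed hyperparameters; the real values
# are in the Colab notebooks linked above).
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base_model = "bigcode/starcoderbase-1b"  # assumed base model of the experiments
model = AutoModelForCausalLM.from_pretrained(base_model)
tokenizer = AutoTokenizer.from_pretrained(base_model)

lora_config = LoraConfig(
    r=8,                        # assumed rank; check the notebook for the real value
    lora_alpha=32,
    lora_dropout=0.1,
    target_modules=["c_attn", "q_attn", "c_proj"],  # assumed StarCoder attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
# ... training itself is done in the notebooks (Trainer / SFTTrainer) ...
```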
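Merging a trained adapter back into the base weights is a short operation with `peft`. This is only a sketch; the merge notebook above is authoritative, and the base model name here is an assumption.

```python
# Sketch of merging a trained LoRA adapter into the base model.
from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("bigcode/starcoderbase-1b")  # assumed base model
peft_model = PeftModel.from_pretrained(base, "ossbig2024/starcoder-LORA-endless-sky")
merged = peft_model.merge_and_unload()  # folds the LoRA deltas into the base weights
merged.save_pretrained("starcoder-LORA-endless-sky-merged")
# If only the adapter (a few MB) ends up being saved, the result would match the
# observation below that the merged model is still far too small.
```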
The datasets and models are set to private, so logging in with the OSSBIG Hugging Face account (Franz's OSSBIG email) is required; see the access sketch after the list below.
- endless-sky dataset for finetuning: https://huggingface.co/datasets/ossbig2024/endless-sky-master
- fltk dataset for finetuning: https://huggingface.co/datasets/ossbig2024/fltk-1.4.x
- Fully finetuned StarCoder model on endless-sky: https://huggingface.co/ossbig2024/starcoderbase1b-personal-copilot-A100-40GB-colab-endless-sky
- Fully finetuned StarCoder model on fltk: https://huggingface.co/ossbig2024/starcoderbase1b-personal-copilot-A100-40GB-colab-fltk-1.4.x
- PEFT layer finetuned on endless-sky. A custom handler (https://huggingface.co/ossbig2024/starcoder-LORA-endless-sky/blob/main/handler.py) was necessary to load it as an inference endpoint; the general handler shape is sketched after this list. The tokenizer seems not to be configured correctly: https://huggingface.co/ossbig2024/starcoder-LORA-endless-sky
- PEFT layer merged with the base model (this apparently did not work, since the resulting model is still far too small): https://huggingface.co/ossbig2024/starcoder-LORA-endless-sky-merged
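
A minimal sketch of authenticated access to these private repositories. The token must come from the OSSBIG account; the split name is an assumption.

```python
# Log in with the OSSBIG Hugging Face account, then load the private assets.
from huggingface_hub import login
from datasets import load_dataset
from transformers import AutoModelForCausalLM

login(token="hf_...")  # OSSBIG account token, or run `huggingface-cli login` instead

ds = load_dataset("ossbig2024/endless-sky-master", split="train")  # assumed split name
model = AutoModelForCausalLM.from_pretrained(
    "ossbig2024/starcoderbase1b-personal-copilot-A100-40GB-colab-endless-sky"
)
```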
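The custom handler linked above follows the interface that Hugging Face Inference Endpoints expect from a handler.py. The sketch below shows only the general shape, not the actual handler; the base model name is an assumption.

```python
# General shape of a custom handler.py for an inference endpoint (sketch only;
# the real handler for the LoRA model is linked above).
from typing import Any, Dict
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

class EndpointHandler:
    def __init__(self, path: str = ""):
        # `path` points at the model repository contents on the endpoint machine
        self.tokenizer = AutoTokenizer.from_pretrained(path)
        base = AutoModelForCausalLM.from_pretrained("bigcode/starcoderbase-1b")  # assumed base
        self.model = PeftModel.from_pretrained(base, path)

    def __call__(self, data: Dict[str, Any]) -> Dict[str, Any]:
        inputs = self.tokenizer(data["inputs"], return_tensors="pt")
        output = self.model.generate(**inputs, max_new_tokens=64)
        return {"generated_text": self.tokenizer.decode(output[0], skip_special_tokens=True)}
```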
- List of all OSSBIG inference endpoints (a query example follows this list): https://ui.endpoints.huggingface.co/ossbig2024/endpoints/dedicated
- starcoder (not modified): https://dg6m2lge10naythk.us-east-1.aws.endpoints.huggingface.cloud
- starcoder finetuned with fltk: https://wzggtcyqoo8yqru1.us-east-1.aws.endpoints.huggingface.cloud
- starcoder finetuned with endless-sky: https://qoge3l92yjotbvkl.us-east-1.aws.endpoints.huggingface.cloud
- starcoder base model: https://dq9aq8vpwch1ox86.us-east-1.aws.endpoints.huggingface.cloud
- Code Llama 7B: https://w31t8m73lcjkl8in.us-east-1.aws.endpoints.huggingface.cloud
- Code Llama 7B Instruct: https://aar1w1cxytqfah8l.us-east-1.aws.endpoints.huggingface.cloud
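
Any of the dedicated endpoints above can be queried with a plain HTTP POST. The sketch below uses the fltk-finetuned StarCoder endpoint URL from this list; the token must belong to the OSSBIG account, and the prompt and generation parameters are illustrative.

```python
# Sketch of querying one of the dedicated inference endpoints.
import requests

API_URL = "https://wzggtcyqoo8yqru1.us-east-1.aws.endpoints.huggingface.cloud"
headers = {
    "Authorization": "Bearer hf_...",  # token from the OSSBIG account
    "Content-Type": "application/json",
}
payload = {
    "inputs": "int main(int argc, char **argv) {",   # example code-completion prompt
    "parameters": {"max_new_tokens": 64},
}
response = requests.post(API_URL, headers=headers, json=payload)
print(response.json())
```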