This my work from the book "LLMs-from-scratch", here I am trying to implement the code from the book. In the ch-5, I have only included code before the loading of weights from OPEN-AI GPT2 due to limited resources.