
Bigram-Level Language Model: Nano_GPT

This model is trained on a Shakespeare dataset using a decoder-only transformer architecture, implemented in the PyTorch framework, and generates random Shakespeare-style text. The project is for educational purposes: it offers a close look at the inner workings of the transformer architecture used in GPT-3.5 and other LLMs.
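Below is a minimal sketch of what one such decoder block looks like in PyTorch: masked (causal) self-attention followed by a feed-forward network, each wrapped in a residual connection. The class and parameter names (`DecoderBlock`, `n_embd`, `n_head`) are illustrative assumptions, not taken from this repository's code.

```python
import torch
import torch.nn as nn

class DecoderBlock(nn.Module):
    """One decoder block: causal self-attention + feed-forward, with residuals."""

    def __init__(self, n_embd, n_head, block_size):
        super().__init__()
        self.ln1 = nn.LayerNorm(n_embd)
        self.attn = nn.MultiheadAttention(n_embd, n_head, batch_first=True)
        self.ln2 = nn.LayerNorm(n_embd)
        self.ffwd = nn.Sequential(
            nn.Linear(n_embd, 4 * n_embd),
            nn.ReLU(),
            nn.Linear(4 * n_embd, n_embd),
        )
        # Causal mask: True above the diagonal means "position j is in the
        # future of position i, do not attend to it".
        mask = torch.triu(torch.ones(block_size, block_size, dtype=torch.bool), diagonal=1)
        self.register_buffer("mask", mask)

    def forward(self, x):
        T = x.size(1)
        h = self.ln1(x)
        attn_out, _ = self.attn(h, h, h, attn_mask=self.mask[:T, :T])
        x = x + attn_out                 # residual around attention
        x = x + self.ffwd(self.ln2(x))   # residual around feed-forward
        return x
```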

Model Specifications:

  • Parameters: 408,897
  • Training dataset size: 1.06 MB text file
  • Context length used for predictions in the self-attention block: 32
  • Multi-Head Attention blocks: 16
  • Layers: 8 (decoder blocks)
  • Learning rate: 0.02
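As a rough usage example, the specifications above can be wired into the `DecoderBlock` sketch as follows. The embedding width `n_embd = 64` is an assumption (chosen because it puts the parameter count near the ~409k listed), and "16" is read here as the number of attention heads; neither detail is spelled out above.

```python
block_size = 32       # context length from the spec list
n_head = 16           # assumed: reading "16" as heads per attention block
n_layer = 8           # decoder blocks
learning_rate = 0.02  # learning rate from the spec list

n_embd = 64           # assumed embedding dimension; must divide evenly by n_head

# Stack the decoder blocks and push a dummy batch through them.
blocks = nn.Sequential(*[DecoderBlock(n_embd, n_head, block_size) for _ in range(n_layer)])
x = torch.randn(1, block_size, n_embd)  # (batch, time, channels)
print(blocks(x).shape)                  # torch.Size([1, 32, 64])
```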