
Text-Generation-with-Decoder-Architecture

Code to implement a decoder-only transformer model that predicts the next sentence from a given input phrase

Overview

This code implements a decoder-only transformer model with a multi-head attention mechanism. The model is trained on the TinyStories dataset and produces a 20-word generative output which follows on semantically and syntactically from a given input phrase.
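
To make the architecture concrete, here is a minimal sketch of how a decoder-only model of this kind is typically composed: token embeddings plus positional information feed a stack of causally masked self-attention layers, with a final linear layer mapping back to vocabulary logits. This assumes PyTorch; the class and parameter names are hypothetical and the repository's own implementation is split across the files listed below.

```python
import torch
import torch.nn as nn

class TinyDecoderLM(nn.Module):
    """Illustrative decoder-only language model: token embedding + positional
    information + a stack of causally masked self-attention layers + a final
    projection to vocabulary logits."""

    def __init__(self, vocab_size, d_model=256, n_heads=4, n_layers=4, max_len=512):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        # Learned positions here for brevity; the repo uses a separate positional encoding module.
        self.pos = nn.Parameter(torch.zeros(1, max_len, d_model))
        layer = nn.TransformerEncoderLayer(d_model, n_heads,
                                           dim_feedforward=4 * d_model,
                                           batch_first=True)
        self.layers = nn.TransformerEncoder(layer, n_layers)
        self.out = nn.Linear(d_model, vocab_size)

    def forward(self, tokens):                       # tokens: (batch, seq_len)
        seq_len = tokens.size(1)
        x = self.embed(tokens) + self.pos[:, :seq_len]
        # Causal mask: position i may only attend to positions <= i.
        mask = torch.triu(torch.ones(seq_len, seq_len, dtype=torch.bool), diagonal=1)
        x = self.layers(x, mask=mask)
        return self.out(x)                           # (batch, seq_len, vocab_size)
```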

Project Structure

  1. tokens.py - creates and trains a SentencePiece tokeniser

  2. dataset.py - implements tokenised dataset

  3. positional_encoding.py - implements the positional encoding from the decoder architecture (see the positional-encoding sketch after this list)

  4. multi_head_attention.py - implements the multi-head attention mechanism from the decoder architecture (see the attention sketch after this list)

  5. position_wise_feed_forward.py - implements the position-wise feed-forward layers from the decoder architecture

  6. decoder_layer.py - implements the decoder layer from the previously defined building blocks

  7. transformer.py - implements the decoder-only transformer from the decoder layer and positional encoding building blocks

  8. train.py - trains the transformer on the TinyStories dataset

  9. sentence_completer.py - generates output text from an input phrase (see the generation sketch after this list)

  10. server.py - runs a server so the model can be accessed and interacted with from a website

  11. constants.py - contains constants for the project

  12. utilities.py - contains helper functions for accessing and loading the latest trained transformer models
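
For reference on the positional encoding step, below is a minimal sketch of the standard sinusoidal encoding from "Attention Is All You Need"; the exact variant implemented in positional_encoding.py may differ.

```python
import math
import torch

def sinusoidal_positional_encoding(max_len: int, d_model: int) -> torch.Tensor:
    """Standard sinusoidal positional encoding:
    PE[pos, 2i]   = sin(pos / 10000^(2i / d_model))
    PE[pos, 2i+1] = cos(pos / 10000^(2i / d_model))"""
    position = torch.arange(max_len).unsqueeze(1)                    # (max_len, 1)
    div_term = torch.exp(torch.arange(0, d_model, 2) * (-math.log(10000.0) / d_model))
    pe = torch.zeros(max_len, d_model)
    pe[:, 0::2] = torch.sin(position * div_term)                     # even dimensions
    pe[:, 1::2] = torch.cos(position * div_term)                     # odd dimensions
    return pe                                                        # added to token embeddings
```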
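The mechanism implemented in multi_head_attention.py is causally masked multi-head self-attention. The sketch below shows the idea; the function name, shapes, and the assumption that q, k, v come from learned linear projections of the input are illustrative rather than the repository's actual API.

```python
import math
import torch
import torch.nn.functional as F

def causal_self_attention(q, k, v, n_heads):
    """Scaled dot-product self-attention split across n_heads, with a causal
    mask so each position only attends to itself and earlier positions.
    q, k, v: (batch, seq_len, d_model); returns (batch, seq_len, d_model)."""
    batch, seq_len, d_model = q.shape
    d_head = d_model // n_heads
    # Reshape to (batch, n_heads, seq_len, d_head)
    split = lambda x: x.view(batch, seq_len, n_heads, d_head).transpose(1, 2)
    q, k, v = split(q), split(k), split(v)
    scores = q @ k.transpose(-2, -1) / math.sqrt(d_head)         # (batch, heads, seq, seq)
    mask = torch.triu(torch.ones(seq_len, seq_len, dtype=torch.bool), diagonal=1)
    scores = scores.masked_fill(mask, float("-inf"))             # hide future positions
    weights = F.softmax(scores, dim=-1)
    out = weights @ v                                            # (batch, heads, seq, d_head)
    return out.transpose(1, 2).reshape(batch, seq_len, d_model)
```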
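Finally, a sketch of the kind of greedy generation loop sentence_completer.py performs: the growing token sequence is repeatedly fed back through the model and the highest-probability next token is appended. The model and tokenizer interfaces here are assumptions, not the actual objects defined in this repository.

```python
import torch

@torch.no_grad()
def complete_sentence(model, tokenizer, prompt: str, max_new_tokens: int = 30) -> str:
    """Greedy decoding from a trained decoder-only model. `tokenizer` is assumed
    to be a SentencePiece-style object with encode()/decode(); subword tokens
    are not words, so max_new_tokens is set a little above the target length."""
    model.eval()
    tokens = torch.tensor([tokenizer.encode(prompt)])            # (1, prompt_len)
    for _ in range(max_new_tokens):
        logits = model(tokens)                                   # (1, seq_len, vocab)
        next_token = logits[0, -1].argmax().item()               # most likely next token
        tokens = torch.cat([tokens, torch.tensor([[next_token]])], dim=1)
    return tokenizer.decode(tokens[0].tolist())
```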

Author

Louis Chapo-Saunders
