Python library to easily build and train an encoder-decoder model based on the Transformer architecture outlined in "Attention is All You Need" by Vaswani et al. [1]. I made this library for educational purposes, as I wanted to understand in depth how the Transformer model works. The model was written from scratch using PyTorch, and experiment tracking was integrated with MLflow.
The model.py file defines each "block" of the network as a custom PyTorch Module subclass. Here's the diagram from the original paper for reference.
The Transformer model from the paper "Attention is All You Need" [1].
There is a custom module for the Embeddings, PositionalEncoding, Multi-Head Attention, Feed Forward Network, EncoderLayer, DecoderLayer, and the overall Transformer. The model's hyperparameters are retrieved from the config.py file, including the parameters related to the dataset as well as training-related configuration.
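As an illustration of how each block is written as a Module subclass, here is a minimal sketch of a position-wise feed-forward block. The class and argument names (FeedForward, d_model, d_ff, dropout) are illustrative and may not match the exact definitions in model.py.

import torch
import torch.nn as nn

class FeedForward(nn.Module):
    # Position-wise feed-forward network: Linear -> ReLU -> Dropout -> Linear.
    def __init__(self, d_model: int, d_ff: int, dropout: float = 0.1):
        super().__init__()
        self.linear1 = nn.Linear(d_model, d_ff)   # expand to the inner dimension
        self.dropout = nn.Dropout(dropout)
        self.linear2 = nn.Linear(d_ff, d_model)   # project back to the model dimension

    def forward(self, x):
        # x: (batch, seq_len, d_model) -> (batch, seq_len, d_model)
        return self.linear2(self.dropout(torch.relu(self.linear1(x))))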
The dataset is based on the iwslt2017 dataset and works with all of its language pair variations. To train on other datasets, the dataset.py file will have to be adapted for that particular dataset, or the data will have to be reformatted into the following format:
[
    {
        'input': 'hello',
        'output': 'bonjour'
    },
    ...
]
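As a sketch, converting raw sentence pairs into this structure could look like the following; the variable names here are illustrative, not part of the library.

# Convert raw (source, target) sentence pairs into the expected format.
pairs = [("hello", "bonjour"), ("thank you", "merci")]
data = [{'input': src, 'output': tgt} for src, tgt in pairs]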
Training the model simply requires passing the config dictionary to the train function:
from train import train
from config import get_config
config = get_config()
train(config)
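Since experiment tracking is integrated with MLflow, each run's metrics and parameters can be inspected in the MLflow UI. Assuming runs are logged to the default local ./mlruns directory, the UI can be launched from the project root:

mlflow ui

The UI is then available at http://localhost:5000 by default.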
Running inference with a trained model follows the same pattern, passing an input string to the inference function:
from inference import inference
text = "hello"
output = inference(text)
The library depends on the following pinned packages:
- datasets==2.14.6
- mlflow==2.8.1
- tokenizers==0.15.0
- torch==2.1.0
- tqdm==4.64.0
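The pinned versions above can be installed directly with pip:

pip install datasets==2.14.6 mlflow==2.8.1 tokenizers==0.15.0 torch==2.1.0 tqdm==4.64.0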
[1] - A. Vaswani et al., "Attention is All You Need," arXiv preprint arXiv:1706.03762, 2017. [Online]. Available: https://arxiv.org/abs/1706.03762.
[2] - J. Alammar, "The Illustrated Transformer," 2023. [Online]. Available: https://jalammar.github.io/illustrated-transformer/.
[3] - U. Jamil, "Attention is all you need (Transformer) - Model explanation (including math), Inference and Training," YouTube, 2023. [Online]. Available: https://www.youtube.com/watch?v=bCz4OMemCcA.