LM1B: Jax #621

Open
@dvsaisurya

Description

Workload

LM1B

Task

Transformer language model training, similar to the Flax LM1B example (https://github.com/google/flax/tree/main/examples/lm1b).
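
As a rough illustration of the language-modeling objective this task implies, here is a minimal JAX sketch of a next-token cross-entropy loss. The function name `next_token_loss`, the array shapes, and the padding id `0` are assumptions made for this example; they are not taken from the reference implementation.

```python
import jax
import jax.numpy as jnp


def next_token_loss(logits, tokens, pad_id=0):
    """Mean cross-entropy of predicting tokens[t + 1] from logits[t].

    logits: [batch, length, vocab] float array.
    tokens: [batch, length] int array; pad_id marks padding (assumed to be 0).
    """
    # Position t predicts token t + 1, so drop the last logit and the first token.
    pred = logits[:, :-1, :]
    targets = tokens[:, 1:]
    log_probs = jax.nn.log_softmax(pred, axis=-1)
    # Select the log-probability assigned to each target token.
    target_log_probs = jnp.take_along_axis(
        log_probs, targets[..., None], axis=-1
    )[..., 0]
    # Padded target positions do not contribute to the average.
    weights = (targets != pad_id).astype(log_probs.dtype)
    return -(target_log_probs * weights).sum() / jnp.maximum(weights.sum(), 1.0)
```

With uniform logits the loss equals the log of the vocabulary size, which makes a convenient sanity check for a new input pipeline.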

Dataset

LM1B (One Billion Word Benchmark) dataset, as listed in the TensorFlow Datasets catalog: https://www.tensorflow.org/datasets/catalog/lm1b

Model

Decoder-only Transformer and encoder-decoder Transformer.
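
To make the decoder-only variant concrete, the following is a minimal JAX sketch of single-head causal self-attention, in which each position may attend only to itself and earlier positions. The function name `causal_self_attention`, the single-head layout, and the `-1e9` masking constant are assumptions made for this example; the reference implementation may structure this differently.

```python
import jax
import jax.numpy as jnp


def causal_self_attention(q, k, v):
    """Single-head scaled dot-product attention with a causal mask.

    q, k, v: [length, depth] float arrays.
    Returns the attended values [length, depth] and the attention
    weights [length, length].
    """
    length, depth = q.shape
    scores = q @ k.T / jnp.sqrt(depth)
    # Lower-triangular mask: position i may see positions 0..i only.
    mask = jnp.tril(jnp.ones((length, length), dtype=bool))
    # Masked positions get a large negative score so softmax sends them to ~0.
    scores = jnp.where(mask, scores, -1e9)
    weights = jax.nn.softmax(scores, axis=-1)
    return weights @ v, weights
```

Because the first position can only attend to itself, its output equals its own value vector, which is a quick way to confirm the mask is applied in the right direction.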

Reference Implementation

https://github.com/google/flax/tree/main/examples/lm1b


ToDo

  • Implement data input pipeline
    • Document specific dataset version in workload-specific README
  • Add model
    • Document model in workload-specific README
  • Provide sample submission (and sample tuning search space)
    • Document results of sample submission in workload-specific README (how long it took to reach the target performance)
