Introduce toyllm, we can do much more things based on this gpt-2 implementation, such as speculative sampling, kv cache and so on #610
shenxiangzhuang
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi, I' here to share my project named toyllm which is based on the gpt-2 implementation on this book. Take the gpt-2 implementation as a start point, with a 16GB GPU, I implemented some interesting algorithms from scratch, such as speculative sampling, kv cache.
Which is a wonderful journey and learn a lot from it. So I decide to share it here, help this can help you too: https://github.com/ai-glimpse/toyllm
Beta Was this translation helpful? Give feedback.
All reactions