Introduce toyllm, we can do much more things based on this gpt-2 implementation, such as speculative sampling, kv cache and so on #610

shenxiangzhuang · 2025-04-09T08:52:13Z

shenxiangzhuang
Apr 9, 2025

Hi, I' here to share my project named toyllm which is based on the gpt-2 implementation on this book. Take the gpt-2 implementation as a start point, with a 16GB GPU, I implemented some interesting algorithms from scratch, such as speculative sampling, kv cache.

Which is a wonderful journey and learn a lot from it. So I decide to share it here, help this can help you too: https://github.com/ai-glimpse/toyllm

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Introduce toyllm, we can do much more things based on this gpt-2 implementation, such as speculative sampling, kv cache and so on #610

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Introduce toyllm, we can do much more things based on this gpt-2 implementation, such as speculative sampling, kv cache and so on #610

Uh oh!

Uh oh!

shenxiangzhuang Apr 9, 2025

Replies: 0 comments

shenxiangzhuang
Apr 9, 2025