Actions: rasbt/LLMs-from-scratch

Spell Check

876 workflow runs

Handle other Qwen3 tokenizer settings (#716)
Spell Check #872: Commit 0405b0c pushed by rasbt
June 30, 2025 22:49 45s main
Handle other Qwen3 tokenizer settings
Spell Check #871: Pull request #716 opened by rasbt
June 30, 2025 22:43 43s qwen3-tokenizer
Fix d_out code comment in bonus materials (#715)
Spell Check #870: Commit 4e61dc4 pushed by rasbt
June 28, 2025 15:07 49s main
Fix d_out code comment in bonus materials
Spell Check #869: Pull request #715 opened by rasbt
June 28, 2025 15:07 43s d_out
Support different Qwen3 sizes in pkg (#714)
Spell Check #868: Commit c4ec55e pushed by rasbt
June 28, 2025 13:00 44s main
Support different Qwen3 sizes in pkg
Spell Check #867: Pull request #714 opened by rasbt
June 28, 2025 12:52 45s qwen3-sizes
Use test mode arg in ch07 (#713)
Spell Check #866: Commit ddbaf0d pushed by rasbt
June 28, 2025 00:28 47s main
Use test mode arg in ch07
Spell Check #865: Pull request #713 opened by rasbt
June 28, 2025 00:05 44s ch07-testmode
fix: embed_dim -> d_out
Spell Check #864: Pull request #711 opened by d-kleine
June 25, 2025 19:33 50s d-kleine:emb_dim_d_out
Remove unused params for hparam script (#710)
Spell Check #863: Commit 8b3e4b2 pushed by rasbt
June 25, 2025 17:50 44s main
Remove unused params for hparam script
Spell Check #862: Pull request #710 opened by rasbt
June 25, 2025 17:41 44s unused-args
Add Qwen3 1.7, 4B, 8B, and 32B support to from-scratch nb (#709)
Spell Check #861: Commit 190c66b pushed by rasbt
June 25, 2025 13:53 46s main
Add Qwen3 1.7, 4B, 8B, and 32B support to from-scratch nb
Spell Check #860: Pull request #709 opened by rasbt
June 25, 2025 13:45 46s qwen3-larger
Link the other KV cache sections (#708)
Spell Check #859: Commit 2f53bf5 pushed by rasbt
June 24, 2025 21:52 43s main
Link the other KV cache sections
Spell Check #858: Pull request #708 opened by rasbt
June 24, 2025 19:50 47s kv-cache-resources
Add link to free exercise PDF (#706)
Spell Check #857: Commit 47a7500 pushed by rasbt
June 24, 2025 13:24 48s main
Add link to free exercise PDF
Spell Check #856: Pull request #706 opened by rasbt
June 24, 2025 13:12 45s exercise-pdf
Update Llama 3 table for consistency with Qwen3
Spell Check #855: Commit 3bdf18a pushed by rasbt
June 23, 2025 23:33 42s main
Improve KV cache code for torch.compile (#705)
Spell Check #854: Commit 81eda38 pushed by rasbt
June 23, 2025 23:08 43s main
Improve KV cache code for torch.compile
Spell Check #853: Pull request #705 synchronize by rasbt
June 23, 2025 23:02 44s kvcache-alt
Improve KV cache code for torch.compile
Spell Check #852: Pull request #705 synchronize by rasbt
June 23, 2025 23:00 46s kvcache-alt
Improve KV cache code for torch.compile
Spell Check #851: Pull request #705 opened by rasbt
June 23, 2025 22:49 48s kvcache-alt
Fix bug in masking when kv cache is used. (#697)
Spell Check #850: Commit 6522be9 pushed by rasbt
June 23, 2025 18:12 48s main
Fix bug in masking when kv cache is used.
Spell Check #849: Pull request #697 synchronize by rasbt
June 23, 2025 17:35 53s martinzwm:fix_masking_kv_cache
Fix bug in masking when kv cache is used.
Spell Check #848: Pull request #697 synchronize by rasbt
June 23, 2025 17:30 52s martinzwm:fix_masking_kv_cache