Skip to content

Commit 16964a6

Browse files
authored
Chapter 6 ablation studies (#127)
* Chapter 6 ablation studies * add table * formatting * formatting * formatting
1 parent 0bd2608 commit 16964a6

File tree

5 files changed

+848
-0
lines changed

5 files changed

+848
-0
lines changed

ch06/01_main-chapter-code/README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
In progress.
Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
# Additional Experiments
2+
3+
| Model | Trainable token | Trainable layers | CPU/GPU | Training time | Training acc | Validation acc | Test acc |
4+
|--------------------|-----------------|------------------|---------|---------------|--------------|----------------|----------|
5+
| gpt2-small (124M) | last | last_block | V100 | 0.39 min | 96.63% | 97.99% | 94.33% |
6+
| gpt2-small (124M) | first | last_block | V100 | 0.37 min | 78.46% | 80.54% | 75.00% |
7+
| gpt2-small (124M) | last | last_layer | V100 | 0.33 min | 78.65% | 87.25% | 78.33% |
8+
| gpt2-small (124M) | last | all | V100 | 0.94 min | 99.62% | 96.64% | 96.33% |
9+
| gpt2-medium (355M) | last | last_block | V100 | 0.91 min | 87.50% | 51.01% | 56.67% |
10+
| gpt2-large (774M) | last | last_block | V100 | 1.91 min | 99.52% | 98.66% | 96.67% |

0 commit comments

Comments
 (0)