Here are some recent and important revisions. 👉 Complete list of results.
Key: 📄: table, 📈: time plot, 🧠: memory plot
date | fork/ref | hash/flags | vs. 3.11.0: | vs. 3.12.0: | vs. 3.13.0: | vs. base: |
---|---|---|---|---|---|---|
2022-03-23 | python/v3.10.4 | 9d38120 |
date | fork/ref | hash/flags | vs. 3.11.0: | vs. 3.12.0: | vs. 3.13.0: | vs. base: |
---|---|---|---|---|---|---|
2025-10-12 | python/d6dd64ac654898b4ce71 | d6dd64a | ||||
2025-10-04 | python/880c9526f91960b9cba5 | 880c952 | ||||
2025-10-04 | python/main | 9964320 |
*
indicates that the exact same versions of pyperformance was not used.
For the results above, the "faster/slower" result is a geometric mean of each of the benchmarks. The "reliability (rel)" number is the likelihood that the change is faster or slower based on the Hierarchical Performance Testing (HPT) method. For more details, visit each individual result's README.md.
Below are longitudinal timing results. There are also 🧠 longitudinal memory results.
Improvement of the geometric mean of key merged benchmarks, computed with pyperf compare
.
The results have a resolution of 0.01 (1%).
There is also a longitudinal plot by benchmark.
Visit the 🔒 benchmark action and click the "Run Workflow" button.
The available parameters are:
fork
: The fork of CPython to benchmark. If benchmarking a pull request, this would normally be your GitHub username.ref
: The branch, tag or commit SHA to benchmark. If a SHA, it must be the full SHA, since finding it by a prefix is not supported.machine
: The machine to run on. One oflinux-amd64
(default),windows-amd64
,darwin-arm64
orall
.benchmark_base
: If checked, the base of the selected branch will also be benchmarked. The base is determined by runninggit merge-base upstream/main $ref
.pystats
: If checked, collect the pystats from running the benchmarks.
To watch the progress of the benchmark, select it from the 🔒 benchmark action page. It may be canceled from there as well. To show only your benchmark workflows, select your GitHub ID from the "Actor" dropdown.
When the benchmarking is complete, the results are published to this repository and will appear in the complete table. Each set of benchmarks will have:
- The raw
.json
results from pyperformance. - Comparisons against important reference releases, as well as the merge base of the branch if
benchmark_base
was selected. These include- A markdown table produced by
pyperf compare_to
. - A set of "violin" plots showing the distribution of results for each benchmark.
- A set of plots showing the memory change for each benchmark (for immediate bases only, on non-Windows platforms).
- A markdown table produced by
The most convenient way to get results locally is to clone this repo and git pull
from it.
To automate benchmarking runs, it may be more convenient to use the GitHub CLI.
Once you have gh
installed and configured, you can run benchmarks by cloning this repository and then from inside it:
$ gh workflow run benchmark.yml -f fork=me -f ref=my_branch
Any of the parameters described above are available at the commandline using the -f key=value
syntax.
To collect Linux perf sampling profile data for a benchmarking run, run the _benchmark
action and check the perf
checkbox.
Follow this by a run of the _generate
action to regenerate the plots.