Open
Description
Some of our tests are very resource-intensive to run and are marked with an "extra large" annotation. These include our fenced docstring tests and the tests for most of our presets.
We should get automated coverage for these to avoid things like #782. However, these tests are too slow to hold up every PR.
One good solution would be to run these tests on GCP every night, and also to build a way to invoke them manually (e.g. before cutting a new release).
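
As a rough illustration of the manual-invocation half, here is a minimal `conftest.py` sketch that keeps "extra large" tests skipped by default and runs them only when a flag is passed. The marker name `extra_large` and the flag `--run_extra_large` are assumptions made for this example, not necessarily what the repo uses; a nightly job on GCP could simply call pytest with the same flag.

```python
# conftest.py (sketch; marker and flag names are hypothetical)

EXTRA_LARGE_MARKER = "extra_large"  # assumed name of the "extra large" annotation
RUN_FLAG = "--run_extra_large"      # assumed opt-in command-line flag


def should_skip(keywords, run_extra_large):
    """Return True if a test with these keywords should be skipped."""
    return EXTRA_LARGE_MARKER in keywords and not run_extra_large


def pytest_addoption(parser):
    # Register the opt-in flag; it is off by default so PR runs stay fast.
    parser.addoption(
        RUN_FLAG,
        action="store_true",
        default=False,
        help="also run tests marked as extra large",
    )


def pytest_configure(config):
    # Declare the marker so pytest does not warn about an unknown mark.
    config.addinivalue_line(
        "markers", f"{EXTRA_LARGE_MARKER}: very resource-intensive test"
    )


def pytest_collection_modifyitems(config, items):
    # Local import so should_skip() can be imported without pytest installed.
    import pytest

    run_extra_large = config.getoption(RUN_FLAG)
    skip = pytest.mark.skip(reason=f"pass {RUN_FLAG} to run this test")
    for item in items:
        if should_skip(item.keywords, run_extra_large):
            item.add_marker(skip)
```

With something like this in place, a plain `pytest` run skips the marked tests, while `pytest --run_extra_large` (run by hand before a release, or from a scheduled nightly job) includes them.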