
Conversation

@svishnus (Collaborator)

  • Test Runner integration with CI

@svishnus requested a review from @shwestrick on September 10, 2025
@svishnus self-assigned this on September 10, 2025
@shwestrick (Collaborator)

Would it be possible to run the tests by checking out parallel-ml-bench only within the GitHub Action, to avoid duplicating that code here? We can let parallel-ml-bench be the definitive source of the benchmark codes.

@Forthoney (Collaborator)

`make test` runs all the tests with mlton in a push-button manner. The same should apply to `make test SMLC=../build/bin/mpl`, but some tests like "dedup" apparently use deprecated functionality. If @svishnus can look at such tests and replace the deprecated functionality with modern counterparts, I think we would be ready to publish this PR.
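
For concreteness, the two invocations above, assuming the Makefile reads an `SMLC` variable to select the SML compiler (defaulting to mlton):

```sh
# Run the whole test suite with the default compiler (mlton).
make test

# Run the same tests against a locally built mpl by overriding SMLC.
make test SMLC=../build/bin/mpl
```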

@shwestrick (Collaborator)

@svishnus @Forthoney my one request would just be to keep the benchmark sources outside of MPLLang/mpl version control. It would be a bit of a nightmare to have two separate versions of the benchmarks to maintain. We can let MPLLang/parallel-ml-bench be the source of the benchmarks.

@Forthoney (Collaborator)

> @svishnus @Forthoney my one request would just be to keep the benchmark sources outside of MPLLang/mpl version control. It would be a bit of a nightmare to have two separate versions of the benchmarks to maintain. We can let MPLLang/parallel-ml-bench be the source of the benchmarks.

I was actually thinking of freezing the scripts we have in this repo and treating them as rudimentary tests (compiling properly, not segfaulting) rather than benchmarks. It just happens that the benchmarks repo had the most diverse set of realistic programs.

If we add new benchmarks or optimize the algorithms in the benchmarks repo, these frozen copies would not reflect those changes, since they exist only for correctness.

@shwestrick (Collaborator) commented Sep 17, 2025

Gotcha -- this makes sense but I worry that the code duplication will become a problem. Inevitably, updates will happen in one place and we will want to port them across... we'll end up with duplicated work and/or inconsistent versions.

Rather than vendoring the frozen benchmarks, we could accomplish the same thing by hardcoding a particular commit of parallel-ml-bench and git cloning it. This would (I think?) be a fairly small change from what you currently have, and it would allow for updating the commit hash in the future if we update the benchmarks.

And this would move in the direction of making parallel-ml-bench more useful. We can extend it with additional functionality. It is already good for performance tests; we could incrementally make it better for correctness tests, too. (Some of the benchmarks already have a `--check` flag; we could add this to more benchmarks over time.)
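
A minimal sketch of the pinned-clone idea as a shell step in the CI workflow; the commit hash is a placeholder to be filled in with whichever revision of parallel-ml-bench the tests should run against:

```sh
# Clone parallel-ml-bench and pin it to a known commit so CI runs are
# reproducible; bump PMLB_COMMIT whenever the benchmarks are updated.
PMLB_COMMIT=<commit-hash>   # placeholder, not a real hash
git clone https://github.com/MPLLang/parallel-ml-bench.git
git -C parallel-ml-bench checkout "$PMLB_COMMIT"
```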
