Skip to content

[monarch] use spawn as multiprocessing start method #564

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 3 commits into from

Conversation

suo
Copy link
Contributor

@suo suo commented Jul 17, 2025

Stack from ghstack (oldest at bottom):

Always use spawn as a start method, fork is a giant footgun. We were observing hangs in megatron checkpointing, I suspect this is why.

Ref: python/cpython#84559

Differential Revision: D78465499

NOTE FOR REVIEWERS: This PR has internal Meta-specific changes or comments, please review them on Phabricator!

Always use spawn as a start method, fork is a giant footgun. We were observing hangs in megatron checkpointing, I suspect this is why.

Ref: python/cpython#84559

Differential Revision: [D78465499](https://our.internmc.facebook.com/intern/diff/D78465499/)

**NOTE FOR REVIEWERS**: This PR has internal Meta-specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D78465499/)!

[ghstack-poisoned]
suo added a commit that referenced this pull request Jul 17, 2025
Always use spawn as a start method, fork is a giant footgun. We were observing hangs in megatron checkpointing, I suspect this is why.

Ref: python/cpython#84559

Differential Revision: [D78465499](https://our.internmc.facebook.com/intern/diff/D78465499/)

**NOTE FOR REVIEWERS**: This PR has internal Meta-specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D78465499/)!

ghstack-source-id: 296751605
Pull Request resolved: #564
@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jul 17, 2025
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D78465499

Always use spawn as a start method, fork is a giant footgun. We were observing hangs in megatron checkpointing, I suspect this is why.

Ref: python/cpython#84559

Differential Revision: [D78465499](https://our.internmc.facebook.com/intern/diff/D78465499/)

**NOTE FOR REVIEWERS**: This PR has internal Meta-specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D78465499/)!

[ghstack-poisoned]
suo added a commit that referenced this pull request Jul 17, 2025
Pull Request resolved: #564

Always use spawn as a start method, fork is a giant footgun. We were observing hangs in megatron checkpointing, I suspect this is why.

Ref: python/cpython#84559

Differential Revision: [D78465499](https://our.internmc.facebook.com/intern/diff/D78465499/)

**NOTE FOR REVIEWERS**: This PR has internal Meta-specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D78465499/)!
ghstack-source-id: 296751863
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D78465499

Always use spawn as a start method, fork is a giant footgun. We were observing hangs in megatron checkpointing, I suspect this is why.

Ref: python/cpython#84559

Differential Revision: [D78465499](https://our.internmc.facebook.com/intern/diff/D78465499/)

**NOTE FOR REVIEWERS**: This PR has internal Meta-specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D78465499/)!

[ghstack-poisoned]
@facebook-github-bot
Copy link
Contributor

This pull request was exported from Phabricator. Differential Revision: D78465499

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in cef93c7.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CLA Signed This label is managed by the Meta Open Source bot. fb-exported Merged
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants