Adding SFT Notebook #424

HosseinKaviani-H · 2025-10-15T20:31:45Z

Adds an interactive Jupyter notebook and supporting utilities to configure and run SFT training without YAML files, making experimentation more accessible.

What's New

Interactive Configuration Notebook (interactive_config_notebook.ipynb)

Configure training entirely in notebook cells
Step-by-step configuration for model, optimizer, training, parallelism
Two execution modes: simple (await run_actor()) or manual lifecycle control
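
To illustrate how the two execution modes might differ, here is a minimal sketch. It does not use the PR's code: `ToyTrainerActor`, `run_actor_sketch`, and `manual_lifecycle` are hypothetical stand-ins, and the `setup` / `run` / `cleanup` phase names are assumptions. The actual `run_actor()` and `TrainerActor` in this PR may be structured differently.

```python
import asyncio


class ToyTrainerActor:
    """Hypothetical stand-in for a trainer actor; not the PR's TrainerActor."""

    def __init__(self, cfg: dict) -> None:
        self.cfg = cfg
        self.events: list[str] = []

    async def setup(self) -> None:
        # A real trainer would load the model, optimizer, and data here.
        self.events.append("setup")

    async def run(self) -> None:
        # Placeholder training loop: record one event per step.
        for step in range(self.cfg["training"]["steps"]):
            await asyncio.sleep(0)  # yield control, as real async work would
            self.events.append(f"step {step}")

    async def cleanup(self) -> None:
        # A real trainer would release processes and GPU memory here.
        self.events.append("cleanup")


async def run_actor_sketch(actor_cls, cfg: dict):
    """Simple mode: a single call drives the whole lifecycle."""
    actor = actor_cls(cfg)
    try:
        await actor.setup()
        await actor.run()
    finally:
        # Cleanup runs even if setup or training raises.
        await actor.cleanup()
    return actor


async def manual_lifecycle(cfg: dict):
    """Manual mode: the caller decides when each phase runs."""
    actor = ToyTrainerActor(cfg)
    await actor.setup()
    # The caller could inspect or adjust the actor between phases here.
    await actor.run()
    await actor.cleanup()
    return actor


cfg = {"training": {"steps": 2}}
# In a notebook cell, `await run_actor_sketch(ToyTrainerActor, cfg)` works directly.
simple = asyncio.run(run_actor_sketch(ToyTrainerActor, cfg))
manual = asyncio.run(manual_lifecycle(cfg))
```

The simple mode trades control for convenience, while the manual mode exposes each phase so that a notebook user can pause between them.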

Supporting Files

spawn_actor.py - Actor spawning and lifecycle management
trainer_actor.py - Trainer actor implementation
actor.py - Base actor abstractions
utils.py - Helper functions
README.md - Documentation

Example Usage

# Configure in notebook cells
model_config = {"name": "llama3", "flavor": "8B", ...}
training_config = {"local_batch_size": 1, "steps": 1000, ...}

# Run training
await run_actor(TrainerActor, cfg)
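
The example above does not show how `cfg` is built from the per-section dictionaries. One possible approach, sketched here with plain dictionaries, is to merge the sections into a single nested config and check that the expected sections are present. `build_config` and `REQUIRED_SECTIONS` are hypothetical names introduced for illustration; the notebook may use a different config container or helper.

```python
from copy import deepcopy

# Hypothetical list of sections a trainer would need; an assumption, not the PR's rule.
REQUIRED_SECTIONS = ("model", "training")


def build_config(**sections: dict) -> dict:
    """Combine per-section dicts into one nested config dict."""
    missing = [name for name in REQUIRED_SECTIONS if name not in sections]
    if missing:
        raise ValueError(f"missing config sections: {missing}")
    # Deep-copy so later edits to a notebook cell's dict do not
    # silently change a config that was already built.
    return {name: deepcopy(values) for name, values in sections.items()}


model_config = {"name": "llama3", "flavor": "8B"}
training_config = {"local_batch_size": 1, "steps": 1000}

cfg = build_config(model=model_config, training=training_config)

# Editing the cell-level dict afterwards leaves cfg untouched.
training_config["steps"] = 10
```

Copying the sections is a design choice that suits notebooks, where cells are often re-run out of order and shared mutable state is easy to corrupt.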

Benefits

✅ No YAML editing required
✅ Interactive experimentation
✅ Educational with clear documentation
✅ Backward compatible - CLI workflow unchanged
✅ Production-ready

Compatibility

✅ No breaking changes
✅ YAML-based workflow still supported
✅ Works with single-GPU and multi-GPU setups

init27 · 2025-10-16T23:26:46Z

The notebook doesn't render in diffs, so I'm leaving some suggestions here:

Add explanations inline to the notebook
Wrap or shorten the trainer output; right now it's 70% of the notebook
Rename the "Configure" headings; having 7 configure settings is a bit daunting

HamidShojanazeri

Thanks for the PR @HosseinKaviani-H. The notebook is hard to load in the PR view, so I'm leaving my notebook comments below:

Opening message: "This notebook allows you to configure and run SFT training without any YAML files!"

It would be good to align this with the Forge messaging, something along these lines: "This notebook introduces a seamless fine-tuning experience by abstracting away the complexities of distributed training, allowing you to configure and run SFT jobs across multiple nodes"

Please add some explanation at the top/intro of what the user will see here: dataset, hardware requirements, capabilities, etc.
The "Benefits" section may not be necessary; it should be fine to remove it or replace it with some value props of Forge.
Please add a reference to the Forge docs so readers can learn more.
The configuration has 8 steps. Can we please remove "step" from the text? We could keep "Step 1" for configuration, followed by different cells/sections.
The "Alternative: Manual Lifecycle Control" section needs more clarification and explanation of actors and how this separation helps.

HosseinKaviani-H · 2025-10-20T20:34:40Z

@init27 Thanks for your comments Sanyam. I have implemented them.

HosseinKaviani-H · 2025-10-20T21:31:32Z

@HamidShojanazeri Thanks for the helpful comments Hamid. I have addressed your comments and updated the PR.

Adding SFT Notebook

95650dd

meta-cla bot added the CLA Signed label Oct 15, 2025

init27 self-assigned this Oct 16, 2025

HosseinKaviani-H closed this Oct 16, 2025

HosseinKaviani-H reopened this Oct 16, 2025

HamidShojanazeri requested changes Oct 17, 2025

Hossein Kavianihamedani added 3 commits October 20, 2025 09:07

Adding more explanation, narrating the overall story flow, and removing the extra steps

bf0839a

Removing README

42b28ae

Implemented changes in the SFT notebook to have a better flow

815a44f

Adding references

ff6dd19

Update the notebook

dfdcd56
