
feat: Add vLLM V1 support w/Unsloth model service #185


Draft · bradhilton wants to merge 1 commit into main

Conversation

@bradhilton (Collaborator) commented Jul 1, 2025

Migrate the Unsloth model service to also support vLLM V1, which offers performance improvements and is the future of vLLM development.
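
For context, a minimal sketch of how a service might opt into the V1 engine, assuming selection via the `VLLM_USE_V1` environment variable before vLLM is imported; the model name is only an example, not necessarily what this service uses:

```python
import os

# Assumption: the engine version is chosen with VLLM_USE_V1 ("1" selects V1,
# "0" falls back to V0). It must be set before vllm is imported.
os.environ.setdefault("VLLM_USE_V1", "1")

from vllm import LLM, SamplingParams  # noqa: E402

# Example model; the actual service would pass its own checkpoint.
llm = LLM(model="unsloth/Llama-3.2-1B-Instruct")
outputs = llm.generate(["Hello"], SamplingParams(max_tokens=8))
print(outputs[0].outputs[0].text)
```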

@bradhilton (Collaborator, Author)

There are a few current limitations in Unsloth Zoo that prevent V1 support. In general, Unsloth Zoo does not yet support V1's collective RPC pattern. The collective RPC call to fetch the weight IPC handles fails with `CUDA error: invalid argument`. Also, the collective RPC calls do not check whether their results are coroutines, so they fail when invoked from AsyncLLM instances.
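
For illustration, a minimal sketch of the kind of coroutine check that seems to be missing; `resolve_rpc_results` and the simulated worker responses are hypothetical helpers, not Unsloth Zoo or vLLM APIs:

```python
import asyncio
import inspect


async def resolve_rpc_results(results):
    """Await any coroutine results returned by a collective RPC call.

    Hypothetical helper: async engine workers may return awaitables rather
    than plain values, so callers need to await them before use.
    """
    resolved = []
    for result in results:
        if inspect.isawaitable(result):
            result = await result
        resolved.append(result)
    return resolved


async def main():
    # Simulated worker responses: one plain value and one coroutine.
    async def async_worker():
        return "ipc-handle-from-async-worker"

    raw_results = ["ipc-handle-from-sync-worker", async_worker()]
    print(await resolve_rpc_results(raw_results))


if __name__ == "__main__":
    asyncio.run(main())
```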

@corbt (Contributor) commented Jul 2, 2025

I'm not seeing any chatter on the Unsloth side about working towards this. How hard would it be to do it ourselves?

@bradhilton (Collaborator, Author)

Hard to say; it could take a while.

@bradhilton (Collaborator, Author)

Probably will end up closing this if decoupling vLLM and Unsloth works out.
