Skip to content

Releases: kvcache-ai/Mooncake

v0.3.2.post1

26 May 11:51
5451d32
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.3.2...v0.3.2.post1

v0.3.2

25 May 14:35
88290d5
Compare
Choose a tag to compare

Highlights

  • TE supports fault tolerency
  • Store: supports eviction and lease

What's Changed

  • chore(deps): bump golang.org/x/net from 0.36.0 to 0.38.0 in /mooncake-p2p-store/src/p2pstore by @dependabot in #376
  • [Build]: exclude cuda so files in auditwheel. by @doujiang24 in #379
  • [TransferEngine] Remove unused local variable by @jellor in #382
  • [Doc] Add document for integrating Mooncake Store to LMCache V1 by @XucSh in #385
  • [Store] Add features: lease and eviction by @ykwd in #374
  • fix bug in async apis in python, the batch_id's dtype is int64_t, not int by @niqi-lyu in #387
  • [Bugfix] Fix PID retrieval when destroying the vLLM thread in the vllm benchmark demo by @0x777a6c in #389
  • [TE FIX] Updating the outstanding work request counting when closing a QP by @alogfans in #390
  • [TransferEngine] Fix hang problem due to previous failed connection by @alogfans in #392
  • Remove duplicated code between transferSync[Read|Write] and transferSync by @alogfans in #394
  • [FIX] avoid locking for the same spinlock multiple times by @alogfans in #395
  • [Build] do not hard code release build type. by @doujiang24 in #393
  • [TransferEngine] add lock for handle_map_ to avoid segfault. by @doujiang24 in #396
  • [TransferEngine] Update software-based timeout mechanism by @alogfans in #398
  • [TransferEngine] Revert to disable slice timeout by @alogfans in #401
  • chore: bump version to 0.3.2 in pyproject.toml by @ShangmingCai in #402

New Contributors

Full Changelog: v0.3.1...v0.3.2

v0.3.1

19 May 05:36
edff54c
Compare
Choose a tag to compare

Highlights

  • Performance: Optimized local data transfer via memcpy, Enhanced buffer allocation logic and path selection strategy.
  • CI/CD: more build scripts, Docker support for master server and CI testing.
  • Observability: Improved error logging, metrics and null pointer checks
  • Create mooncake website
  • Bug fixes: GCC10 build fixes, Dependency and RDMA transport fixes.

What's Changed

  • docs: add lmcache integration documentation by @xiaguan in #336
  • [Store] Improve: Add RemoveAll rpc for remove all keys by @maobaolong in #327
  • [fix]fix compile error for gcc10 by @qicosmos in #339
  • feat(Mooncake Integration): Support pure client without store mode by @maobaolong in #341
  • ARM build_wheel.sh by @johnnynunez in #344
  • [DOC] Add news about NIXL supports Mooncake as a backend by @alogfans in #348
  • Fix news render in README by @alogfans in #350
  • chore: enhance error logging for tcp transport by @xiaguan in #349
  • add arm compatibility by @johnnynunez in #343
  • gitmodules: use full path instead of relative path. by @doujiang24 in #345
  • feat(transfer-engine): enhance logging for RPC and topology discovery by @xiaguan in #352
  • chore(ci): expand python version testing matrix to include 3.8-3.13 by @xiaguan in #353
  • feat(Mooncake Integration): Supply a MooncakeConfig into whl file by @maobaolong in #338
  • feat(docs): build documentation website for Mooncake using Sphinx by @Risc-lt in #354
  • [Build] feature: start the Mooncake master server through Docker by @Chasing1020 in #320
  • fix wrong command in XpYd by @gujingit in #357
  • fix(master): Fix negative storage size metrics after removeAll by @maobaolong in #355
  • [CI] add dockerfile CI test by @stmatengss in #362
  • [FIX] avoid sending request after setting inactive by @alogfans in #361
  • [Transfer Engine] Check the buffer size before register by @XucSh in #364
  • fix: incorrect urls and update page deployment by @Risc-lt in #363
  • [Build] Call find_package() before using external deps by @tchaikov in #359
  • [BugFix] Buffer Allocation Always Tries on the Same Allocator by @ykwd in #365
  • [TransferEngine] Add sanity check on nullptrs by @alogfans in #366
  • fix(store): the metadata leak after umount segment by @maobaolong in #337
  • store: optimize local data transfer with memcpy fast path by @xiaguan in #340
  • [DOC] update blog url by @stmatengss in #369
  • build(transport): add glog and pthread to rdma target link libraries by @xiaguan in #370
  • Add LRU in MasterService, complexity O(1) by @zhaoyongke in #287
  • [Store] feat: Add a MooncakeStoreService to serve store and rest api by @maobaolong in #328
  • [hotfix] Allow compile flag USE_LRU_MASTER to enable/disable the LRU feature by @ykwd in #372
  • [FIX] update path reselection stragegy to cover all possible available devices by @alogfans in #373
  • feat(py): integrate python http metadata server by @xiaguan in #367
  • chore: bump version to 0.3.1 in pyproject.toml by @xiaguan in #377

New Contributors

Full Changelog: v0.3.0...v0.3.1

v0.3.0

07 May 11:12
7b64c07
Compare
Choose a tag to compare

Highlights

What's Changed

  • fix: when detect async event, stop the outstanding requests & prevent cq burn out leading severe error by @alogfans in #84
  • [TransferEngine] fix: add auto_discover in transfer_engine_c by @chestnut-Q in #87
  • [TransferEngine] feature: registerLocalMemory support the "*" location. by @doujiang24 in #86
  • [Build] feature: add dockerfile to enable use Mooncake in Docker by @stmatengss in #92
  • [Transfer Engine] add USE_ETCD option to retire the etcd dependency. by @doujiang24 in #94
  • [TransferEngine]chore: replace slices vector with slice counter by @RuixiangMa in #93
  • [Build] feature: add dev container. by @doujiang24 in #96
  • [TransferEngine] fix: wrong numa number in example. by @doujiang24 in #99
  • [TransferEngine] fix: use ibv_get_device_list to get the IB devices. by @doujiang24 in #102
  • fix: memcpy direction wrong for read operations by @alogfans in #101
  • [TransferEngine] Build: build/install shared object for transfer engine. by @doujiang24 in #100
  • Fix P2P store bugs and improvement by @alogfans in #80
  • [Bugfix] Add shared and unique lock for local_memory_regions_ (#107) by @power-more in #108
  • [TransferEngine] Correct the Typo by @power-more in #109
  • [TransferEngine] Fix typos by @fengquyoumo in #110
  • [TransferEngine] BenchMark: Introduce report gflags for enhanced configurability by @fengquyoumo in #111
  • [TransferEngine] feature: introduce USE_NVMEOF to enable NVMe-oF separately. by @doujiang24 in #106
  • [Doc] add FAST25 paper and traces by @chestnut-Q in #116
  • [TransferEngine] Suppress gtest type cast compile warning and fix some trivial bugs by @yuan-luo in #115
  • [Doc] update README.md by @chestnut-Q in #128
  • [Bugfix] fix the gid choice of IB device: only choose IBV_GID_TYPE_ROCE_V2 now by @fengquyoumo in #113
  • [MooncakeStore] Introduce MooncakeStore: A new Distributed Object Store for XpYd PD disaggregation by @alogfans in #126
  • [FIX] Fix the content of README.md by @alogfans in #130
  • [DOC] add description about Docker-based deployment by @alogfans in #133
  • Fix compilation error with clang toolchain by @xiaguan in #135
  • [MooncakeStore] Add getSize method to DistributedObjectStore by @xiaguan in #137
  • [DOC] remove useless metadata_type flags in example.md by @cxz66666 in #138
  • [FIX] golang vulnerabilities in P2P store by @alogfans in #140
  • [TransferEngine] Support Status return value by @yuan-luo in #125
  • [Doc] Fix typo in mooncake-store-preview by @simpx in #142
  • [MooncakeStore] Enhance performance in DistributedObjectStore::get by @xiaguan in #141
  • [Doc] Add vllm integration v1 doc by @ShangmingCai in #129
  • [Optimize] Update EndpointStore interface to use const references for… by @xiaguan in #152
  • fix compilation errors by @xhyf77 in #145
  • [MooncakeStore] fix: treat object_not_found/already_exists as info in glog by @stmatengss in #156
  • [Fix]: Handle SIGINT & SIGTERM for graceful shutdown by @xhyf77 in #153
  • [Doc] typo error in README by @Unprincess17 in #158
  • [FEATURE] catch exceptions during allocator creation by @xiaguan in #157
  • [FEATURE] Update MooncakeStroe CI workflow by @xiaguan in #146
  • [DOC] remove MULAN license to ease vllm deployment by @stmatengss in #163
  • [Refactor] Replace rpc logic with MasterClient in Client class by @xiaguan in #164
  • [CI] Add spell check process to CI by @xiaguan in #165
  • [Chore] Disable garbage collection in Mooncake master by @xiaguan in #167
  • [TransferEngine] fix typos for cancel status in transfer engine by @alogfans in #169
  • Extend VLLMAdaptor API by @xhyf77 in #173
  • [MooncakeAdaptor] Init sglang_adaptor to support SGLang using transfer engine by @stmatengss in #181
  • [HOTFIX] add env MC_DISABLE_METACACHE to force pull metadata from etcd by @alogfans in #175
  • [TransferEngine] Fix compilation bugs of nvmeof transport by @alogfans in #174
  • Package mooncake_vllm_adaptor and MooncakeDistributedStore into a wheel by @xhyf77 in #177
  • [Fix] Fix issues with run path and etcd dependency integration by @xhyf77 in #182
  • [WorkFlow] Add a release.yaml to publish the .whl file by @xhyf77 in #186
  • [Chore] Change logging level from INFO to VLOG for object existence by @xiaguan in #185
  • [Refactor] [MooncakeStore] Migrate Master Service from gRPC to coro_rpc by @xiaguan in #179
  • [Doc] Update vllm MooncakeStore integration doc by @ShangmingCai in #191
  • [CI] Refine release workflow to enable publishing to PyPI by @xiaguan in #189
  • [Enhance] Add an RPATH that takes precedence over LD_LIBRARY_PATH and other paths in the wheel package by @xhyf77 in #190
  • [MooncakeStore] fix: python env path in sglang by @stmatengss in #196
  • [CI] Update etcd-cpp CMake policy version in dependencies script by @xiaguan in #199
  • [Feature] use randomly-selected port in TCP-based OOB communication by @alogfans in #202
  • [FIX] prevent allocating the same port by using real random values by @alogfans in #206
  • [TransferEngine] set use http protocol as default by @stmatengss in #208
  • [CI] Update GitHub Actions permissions for build-and-release job by @xiaguan in #209
  • Enable ccache in Mooncake compile by @yuan-luo in #203
  • [TransferEngine] fix: avoid leaking Slice memory in RDMA transport by @eryugey in #210
  • [Refactor] Remove the dependency of etcd-cpp-api-v3 by @alogfans in #188
  • [MooncakeIntegration] Refactor py mod arch by @stmatengss in #212
  • feat(ci): enhance build process with new import structure test by @xiaguan in #214
  • chore(scripts): set rpath for shared objects in build script by @xiaguan in #216
  • feat(mooncakestore): add mooncake_master CLI entry point and tests by @xiaguan in #218
  • [TransferEngine] Use RDMA transport to transfer data in local process rapidly by @alogfans in #220
  • [Doc] Refactor dependencies.sh and for improved installation process by @xiaguan in #195
  • [FIX] reduce performance overhead from allocating slices by @alogfans in #223
  • [FIX] bind failure from legacy RPC_PORT_BINDING by @alogfans in #226
  • [Doc] Add sglang integration doc and update vllm integration doc by @ShangmingCai in #227
  • Update go version to 1.23.8 in dependencies.sh and other enhancements by @eryugey in #231
  • chore(ci): integrate auditwheel ...
Read more

v0.3.0-beta

08 Apr 09:23
ed2f4d4
Compare
Choose a tag to compare
v0.3.0-beta Pre-release
Pre-release

Highlights

What's Changed

  • fix: when detect async event, stop the outstanding requests & prevent cq burn out leading severe error by @alogfans in #84
  • [TransferEngine] fix: add auto_discover in transfer_engine_c by @chestnut-Q in #87
  • [TransferEngine] feature: registerLocalMemory support the "*" location. by @doujiang24 in #86
  • [Build] feature: add dockerfile to enable use Mooncake in Docker by @stmatengss in #92
  • [Transfer Engine] add USE_ETCD option to retire the etcd dependency. by @doujiang24 in #94
  • [TransferEngine]chore: replace slices vector with slice counter by @RuixiangMa in #93
  • [Build] feature: add dev container. by @doujiang24 in #96
  • [TransferEngine] fix: wrong numa number in example. by @doujiang24 in #99
  • [TransferEngine] fix: use ibv_get_device_list to get the IB devices. by @doujiang24 in #102
  • fix: memcpy direction wrong for read operations by @alogfans in #101
  • [TransferEngine] Build: build/install shared object for transfer engine. by @doujiang24 in #100
  • Fix P2P store bugs and improvement by @alogfans in #80
  • [Bugfix] Add shared and unique lock for local_memory_regions_ (#107) by @power-more in #108
  • [TransferEngine] Correct the Typo by @power-more in #109
  • [TransferEngine] Fix typos by @fengquyoumo in #110
  • [TransferEngine] BenchMark: Introduce report gflags for enhanced configurability by @fengquyoumo in #111
  • [TransferEngine] feature: introduce USE_NVMEOF to enable NVMe-oF separately. by @doujiang24 in #106
  • [Doc] add FAST25 paper and traces by @chestnut-Q in #116
  • [TransferEngine] Suppress gtest type cast compile warning and fix some trivial bugs by @yuan-luo in #115
  • [Doc] update README.md by @chestnut-Q in #128
  • [Bugfix] fix the gid choice of IB device: only choose IBV_GID_TYPE_ROCE_V2 now by @fengquyoumo in #113
  • [MooncakeStore] Introduce MooncakeStore: A new Distributed Object Store for XpYd PD disaggregation by @alogfans in #126
  • [FIX] Fix the content of README.md by @alogfans in #130
  • [DOC] add description about Docker-based deployment by @alogfans in #133
  • Fix compilation error with clang toolchain by @xiaguan in #135
  • [MooncakeStore] Add getSize method to DistributedObjectStore by @xiaguan in #137
  • [DOC] remove useless metadata_type flags in example.md by @cxz66666 in #138
  • [FIX] golang vulnerabilities in P2P store by @alogfans in #140
  • [TransferEngine] Support Status return value by @yuan-luo in #125
  • [Doc] Fix typo in mooncake-store-preview by @simpx in #142
  • [MooncakeStore] Enhance performance in DistributedObjectStore::get by @xiaguan in #141
  • [Doc] Add vllm integration v1 doc by @ShangmingCai in #129
  • [Optimize] Update EndpointStore interface to use const references for… by @xiaguan in #152
  • fix compilation errors by @xhyf77 in #145
  • [MooncakeStore] fix: treat object_not_found/already_exists as info in glog by @stmatengss in #156
  • [Fix]: Handle SIGINT & SIGTERM for graceful shutdown by @xhyf77 in #153
  • [Doc] typo error in README by @Unprincess17 in #158
  • [FEATURE] catch exceptions during allocator creation by @xiaguan in #157
  • [FEATURE] Update MooncakeStroe CI workflow by @xiaguan in #146
  • [DOC] remove MULAN license to ease vllm deployment by @stmatengss in #163
  • [Refactor] Replace rpc logic with MasterClient in Client class by @xiaguan in #164
  • [CI] Add spell check process to CI by @xiaguan in #165
  • [Chore] Disable garbage collection in Mooncake master by @xiaguan in #167
  • [TransferEngine] fix typos for cancel status in transfer engine by @alogfans in #169
  • Extend VLLMAdaptor API by @xhyf77 in #173
  • [MooncakeAdaptor] Init sglang_adaptor to support SGLang using transfer engine by @stmatengss in #181
  • [HOTFIX] add env MC_DISABLE_METACACHE to force pull metadata from etcd by @alogfans in #175
  • [TransferEngine] Fix compilation bugs of nvmeof transport by @alogfans in #174
  • Package mooncake_vllm_adaptor and MooncakeDistributedStore into a wheel by @xhyf77 in #177
  • [Fix] Fix issues with run path and etcd dependency integration by @xhyf77 in #182
  • [WorkFlow] Add a release.yaml to publish the .whl file by @xhyf77 in #186
  • [Chore] Change logging level from INFO to VLOG for object existence by @xiaguan in #185
  • [Refactor] [MooncakeStore] Migrate Master Service from gRPC to coro_rpc by @xiaguan in #179
  • [Doc] Update vllm MooncakeStore integration doc by @ShangmingCai in #191
  • [CI] Refine release workflow to enable publishing to PyPI by @xiaguan in #189
  • [Enhance] Add an RPATH that takes precedence over LD_LIBRARY_PATH and other paths in the wheel package by @xhyf77 in #190
  • [MooncakeStore] fix: python env path in sglang by @stmatengss in #196
  • [CI] Update etcd-cpp CMake policy version in dependencies script by @xiaguan in #199
  • [Feature] use randomly-selected port in TCP-based OOB communication by @alogfans in #202
  • [FIX] prevent allocating the same port by using real random values by @alogfans in #206
  • [TransferEngine] set use http protocol as default by @stmatengss in #208
  • [CI] Update GitHub Actions permissions for build-and-release job by @xiaguan in #209
  • Enable ccache in Mooncake compile by @yuan-luo in #203
  • [TransferEngine] fix: avoid leaking Slice memory in RDMA transport by @eryugey in #210
  • [Refactor] Remove the dependency of etcd-cpp-api-v3 by @alogfans in #188
  • [MooncakeIntegration] Refactor py mod arch by @stmatengss in #212
  • feat(ci): enhance build process with new import structure test by @xiaguan in #214
  • chore(scripts): set rpath for shared objects in build script by @xiaguan in #216

New Contributors

Full Changelog: v0.2.0...v0.3.0-beta

v0.3.0-alpha

03 Apr 12:46
94e3dc4
Compare
Choose a tag to compare
v0.3.0-alpha Pre-release
Pre-release

Highlights

What's Changed

  • fix: when detect async event, stop the outstanding requests & prevent cq burn out leading severe error by @alogfans in #84
  • [TransferEngine] fix: add auto_discover in transfer_engine_c by @chestnut-Q in #87
  • [TransferEngine] feature: registerLocalMemory support the "*" location. by @doujiang24 in #86
  • [Build] feature: add dockerfile to enable use Mooncake in Docker by @stmatengss in #92
  • [Transfer Engine] add USE_ETCD option to retire the etcd dependency. by @doujiang24 in #94
  • [TransferEngine]chore: replace slices vector with slice counter by @RuixiangMa in #93
  • [Build] feature: add dev container. by @doujiang24 in #96
  • [TransferEngine] fix: wrong numa number in example. by @doujiang24 in #99
  • [TransferEngine] fix: use ibv_get_device_list to get the IB devices. by @doujiang24 in #102
  • fix: memcpy direction wrong for read operations by @alogfans in #101
  • [TransferEngine] Build: build/install shared object for transfer engine. by @doujiang24 in #100
  • Fix P2P store bugs and improvement by @alogfans in #80
  • [Bugfix] Add shared and unique lock for local_memory_regions_ (#107) by @power-more in #108
  • [TransferEngine] Correct the Typo by @power-more in #109
  • [TransferEngine] Fix typos by @fengquyoumo in #110
  • [TransferEngine] BenchMark: Introduce report gflags for enhanced configurability by @fengquyoumo in #111
  • [TransferEngine] feature: introduce USE_NVMEOF to enable NVMe-oF separately. by @doujiang24 in #106
  • [Doc] add FAST25 paper and traces by @chestnut-Q in #116
  • [TransferEngine] Suppress gtest type cast compile warning and fix some trivial bugs by @yuan-luo in #115
  • [Doc] update README.md by @chestnut-Q in #128
  • [Bugfix] fix the gid choice of IB device: only choose IBV_GID_TYPE_ROCE_V2 now by @fengquyoumo in #113
  • [MooncakeStore] Introduce MooncakeStore: A new Distributed Object Store for XpYd PD disaggregation by @alogfans in #126
  • [FIX] Fix the content of README.md by @alogfans in #130
  • [DOC] add description about Docker-based deployment by @alogfans in #133
  • Fix compilation error with clang toolchain by @xiaguan in #135
  • [MooncakeStore] Add getSize method to DistributedObjectStore by @xiaguan in #137
  • [DOC] remove useless metadata_type flags in example.md by @cxz66666 in #138
  • [FIX] golang vulnerabilities in P2P store by @alogfans in #140
  • [TransferEngine] Support Status return value by @yuan-luo in #125
  • [Doc] Fix typo in mooncake-store-preview by @simpx in #142
  • [MooncakeStore] Enhance performance in DistributedObjectStore::get by @xiaguan in #141
  • [Doc] Add vllm integration v1 doc by @ShangmingCai in #129
  • [Optimize] Update EndpointStore interface to use const references for… by @xiaguan in #152
  • fix compilation errors by @xhyf77 in #145
  • [MooncakeStore] fix: treat object_not_found/already_exists as info in glog by @stmatengss in #156
  • [Fix]: Handle SIGINT & SIGTERM for graceful shutdown by @xhyf77 in #153
  • [Doc] typo error in README by @Unprincess17 in #158
  • [FEATURE] catch exceptions during allocator creation by @xiaguan in #157
  • [FEATURE] Update MooncakeStroe CI workflow by @xiaguan in #146
  • [DOC] remove MULAN license to ease vllm deployment by @stmatengss in #163
  • [Refactor] Replace rpc logic with MasterClient in Client class by @xiaguan in #164
  • [CI] Add spell check process to CI by @xiaguan in #165
  • [Chore] Disable garbage collection in Mooncake master by @xiaguan in #167
  • [TransferEngine] fix typos for cancel status in transfer engine by @alogfans in #169
  • Extend VLLMAdaptor API by @xhyf77 in #173
  • [MooncakeAdaptor] Init sglang_adaptor to support SGLang using transfer engine by @stmatengss in #181
  • [HOTFIX] add env MC_DISABLE_METACACHE to force pull metadata from etcd by @alogfans in #175
  • [TransferEngine] Fix compilation bugs of nvmeof transport by @alogfans in #174
  • Package mooncake_vllm_adaptor and MooncakeDistributedStore into a wheel by @xhyf77 in #177
  • [Fix] Fix issues with run path and etcd dependency integration by @xhyf77 in #182
  • [WorkFlow] Add a release.yaml to publish the .whl file by @xhyf77 in #186
  • [Chore] Change logging level from INFO to VLOG for object existence by @xiaguan in #185
  • [Refactor] [MooncakeStore] Migrate Master Service from gRPC to coro_rpc by @xiaguan in #179
  • [Doc] Update vllm MooncakeStore integration doc by @ShangmingCai in #191
  • [CI] Refine release workflow to enable publishing to PyPI by @xiaguan in #189
  • [Enhance] Add an RPATH that takes precedence over LD_LIBRARY_PATH and other paths in the wheel package by @xhyf77 in #190
  • [MooncakeStore] fix: python env path in sglang by @stmatengss in #196
  • [CI] Update etcd-cpp CMake policy version in dependencies script by @xiaguan in #199
  • [Feature] use randomly-selected port in TCP-based OOB communication by @alogfans in #202
  • [FIX] prevent allocating the same port by using real random values by @alogfans in #206
  • [TransferEngine] set use http protocol as default by @stmatengss in #208

New Contributors

Full Changelog: v0.2.0...v0.3.0

What's Changed

  • fix: when detect async event, stop the outstanding requests & prevent cq burn out leading severe error by @alogfans in #84
  • [TransferEngine] fix: add auto_discover in transfer_engine_c by @chestnut-Q in #87
  • [TransferEngine] feature: registerLocalMemory support the "*" location. by @doujiang24 in #86
  • [Build] feature: add dockerfile to enable use Mooncake in Docker by @stmatengss in #92
  • [Transfer Engine] add USE_ETCD option to retire the etcd dependency. by @doujiang24 in #94
  • [TransferEngine]chore: replace slices vector with slice counter by @RuixiangMa in #93
  • [Build] feature: add dev container. by @doujiang24 in #96
  • [TransferEngine] fix: wrong numa number in example. by @doujiang24 in #99
  • [TransferEngine] fix: use ibv_get_device_list to get the IB devices. by @doujiang24 in #102
  • fix: memcpy direction wrong for read operations by @alogfans in https...
Read more

v0.2.0

20 Jan 12:09
ab8021d
Compare
Choose a tag to compare

The first official release of Mooncake!

See our README for details.

What's Changed

  • docs: update README.md by @eltociear in #9
  • fix the p2pstore go module name, make it could be imported in other projects. by @doujiang24 in #7
  • docs: add new vllm-integration guide. by @ShangmingCai in #11
  • fix: remove atty and fix dependency path by @chestnut-Q in #19
  • [DOC] Add contributing guidelines by @stmatengss in #22
  • [Bugfix] cpp std::string by @wxsms in #18
  • [Doc][Integration] Update commands, params description, and benchmark results for vllm integration v2. by @ShangmingCai in #25
  • [Doc] Fix typos in contributing.md by @ShangmingCai in #26
  • [TransferEngine][feature] support multiple metadata servers by @alogfans in #21
  • [CI] add build test on ubuntu by @Ann-1024 in #28
  • [Doc] Add metadata server backend explanation. by @ShangmingCai in #30
  • [Bugfix] build error when USE_CUDA by @wxsms in #32
  • [Bugfix] fix typos and fix rdma device selection when retry_count != 0 by @fengquyoumo in #33
  • [Doc] Update the integration state of Mooncake Transfer Engine with vLLM. by @ShangmingCai in #40
  • [Doc] Update Mooncake Icon by @stmatengss in #41
  • [Doc] Update README to use v0.2 guide and benchmark results. by @ShangmingCai in #42
  • [TransferEngine] Add topology discovery by @wx-csy in #46
  • [TransferEngine][feature] add the http metadata server. by @doujiang24 in #37
  • [TransferEngine] Refactor code to hide transport logics from user APIs by @alogfans in #51
  • [Transfer Engine] fix metadata server connection string builder in vllm intergation by @alogfans in #56
  • Update metadata_server config info for vllm integration doc. by @ShangmingCai in #57
  • Bump golang.org/x/crypto to 0.31.0 & golang.org/x/net to v0.33.0 by @doujiang24 in #58
  • [TransferEngine] fix: selectDevice() should return zero at all case by @alogfans in #60
  • [p2pstore] fix memory leaking in cgo. by @doujiang24 in #61
  • [TransferEngine] test: cmake enable testing. by @doujiang24 in #65
  • [Doc] Re-enable cuda graph to improve inference performance. by @ShangmingCai in #67
  • [TransferEngine] fix: improve error message by @doujiang24 in #63
  • [TransferEngine] adjust transfer_engine_bench: 1.Introduce the gflag buffer_size for enhanced configurability. 2. Utilize uint64_t for block_size to prevent overflow. by @fengquyoumo in #72
  • [TransferEngine] Add tests for Topology by @liusy58 in #74
  • [TransferEngine] Correct the count of slice_quque_ in rdma/worker_pool, remove the repeated check for overlap by @fengquyoumo in #79
  • fix invalid go release url by @gujingit in #83
  • [TransferEngine] change: auto discover topology & install transport. by @doujiang24 in #73

New Contributors

Full Changelog: https://github.com/kvcache-ai/Mooncake/commits/v0.2.0