DOC-12489 magma default storage engine #3813

ggray-cb · 2025-06-06T20:12:51Z

This PR covers the impact of MB-62777 making Magma with 128 vBuckets the default storage backend.

It also tackles some of the work for DOC-12778 Data Settings guidance for reader/writer threads change to 'disk i/o optimized' needs to be revised because it was in the the same area of the doc being updated anyhow. The other areas of that ticket are being handled in the DOC-12485 prevent bucket from running out of space PR.

The following pages were updated for this PR (links lead to preview site. Here's the username/password for the preview)

What's New entry
vBuckets edited for doc standards. Clarfiied the number of vBuckets used by storage backends/platforms. Also expanded on a few thigns, such as active vBuckets, that were glossed over in past versions.
Storage Engines Moved Magma up to top of the topic as it's now the default. Added descriptions for the 128 vBucket version and the new default behavior. Updated the Magma Writer Thread Settings description for DOC-12778.
Create a Bucket updated the screenshot and procedure for creating a bukcet using the GUI. Added explicite statements about teh default storage backend to the two other examples.
Migrate a Bucket’s Storage Backend Added note that you cannot migrate a bucket from 128 vBuckets to 1024 vBuckets using the technique fo changing the storage backend setting. Added section on using XDCR to migrate between buckets with differnt numbers of vBuckets. Added example Python script to duplicate scopes/collection definitions to another bucket.
Cross Data Center Replication(XDCR) added note about not being able to rep;icate 128 vBucket Magma to older clusters.
Prepare for XDCR added same note as above.
Creating and Editing Buckets added the numVBuckets parameter and updated descriptions of the default storage backend.
Getting Bucket Information updated the example output.
Upgrade added section on upgrade implications of new default storage backend.

hyunjuV · 2025-06-19T03:19:20Z

modules/introduction/partials/new-features-80.adoc

+* If you have deployment scripts that create buckets without specifying the storage engine, those scripts  create Magma buckets with 128 vBuckets instead of Couchstore buckets after the upgrade.
+This may affect your deployment if you depend on buckets using the Couchstore storage engine.
+* You cannot use XDCR to replicate Magma buckets using 128 vBuckets with pre-8.0 clusters. 
+XDCR in pre-8.0 clusters only supports replication between buckets that contain the same number of vBuckets.


This comment is for lines 95 and 96.

I think that this level of detail is good for What's New, but the technical detail is:
You can XDCR from a 128 vBucket on 8.0 to a pre-8.0 cluster since 8.0 XDCR supports creating a replication between buckets with different numbers of vBuckets. In XDCR, the source creates the replications. But, I think that "You cannot use XDCR to replicate Magma buckets using 128 vBuckets with pre-8.0 clusters" is fine for What's New since most people would want to replicate from an earlier version to the later version or bi-directionally -- so, it gets the most important point across.

hyunjuV · 2025-06-19T03:26:07Z

modules/install/pages/upgrade.adoc

+
+Another concern is that versions of Couchbase Server earlier than 8.0 do not support XDCR replication between buckets with different numbers of vBuckets. 
+Therefore, you cannot replicate between a bucket you create with the new default backend setting and buckets on an earlier server version. 
+To able to replicate with a bucket on an earlier version of Couchbase Server, explicitly set the new bucket's storage backend to Couchstore or to Magma with 1024 vBuckets during creation.


Typo -- missing a word -- "To able to" should be "To be able to".

hyunjuV · 2025-06-19T03:38:05Z

modules/install/pages/upgrade.adoc

+These behavior changes could cause issues if you rely on the prior behavior, especially if you use deployment scripts.
+If you have deployment scripts that create buckets, review them to determine if you need to make changes.
+
+For example, suppose your deployment script does not specify the storage backend when it creates a bucket that you intend to use with the xref:views/views-mapreduce-intro.adoc[] feature.


Since we are giving a link to the MapReduce Views page here, could we have the same note that we have in Views intro page saying that Views are deprecated and will be removed in a future version in the MapReduce Views page?

Link to Views intro page -- https://docs.couchbase.com/server/current/learn/views/views-intro.html

hyunjuV · 2025-06-19T03:42:46Z

modules/learn/pages/buckets-memory-and-storage/storage-engines.adoc


 [abstract]
 {description}
-It is important to understand which backend storage is best suited to your requirements.
+These storage engines organize the data both on disk ad in memory.


Typo: organize the data both on disk ad in memory
ad should be and

hyunjuV · 2025-06-19T05:30:40Z

modules/learn/pages/buckets-memory-and-storage/storage-engines.adoc


-Magma can work with very low amounts of memory for large datasets: a minimum memory-to-data ratio of 1% is required.
-For example, if a node is holding 5{nbsp}TB of data, Magma can be used with only 64{nbsp}GB RAM.
+Magma using 1024 vBuckets has a minimum memory quota of 1{nbsp}GiB per node.


I liked the "Which Storage Engine Should You Use?", but since that section is further down in the page, I'm thinking that this may be a good place to just note here:

If you can allocate at least 1 GiB memory per node to your bucket, you should choose the 1024 vBucket option for Magma as you will get better performance at scale.

hyunjuV · 2025-06-19T07:31:47Z