Add splitting functionality to curation and SortingAnalyzer #3817


Merged (26 commits) on Jul 21, 2025

Conversation

alejoe91 (Member):

Depends on #3760

Implements splitting functionality in the curation module and the SortingAnalyzer.

@alejoe91 added the curation label (Related to curation module) on Mar 28, 2025
Comment on lines 597 to 599
old_template = arr[self.sorting_analyzer.sorting.ids_to_indices([split_unit_id])[0], ...]
new_indices = np.array([new_unit_ids.index(unit_id) for unit_id in new_splits])
new_array[new_indices, ...] = np.tile(old_template, (len(new_splits), 1, 1))
Member Author:

@samuelgarcia this needs to be discussed. What should we do if the waveforms extension is not there? The current behavior is copying, but we might want to force a recompute here.

Member Author:

Maybe add a warning here!
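
A minimal sketch of such a warning (the placement and message wording are assumptions, not the merged implementation; `self` is taken to be the templates extension being split):

import warnings

if not self.sorting_analyzer.has_extension("waveforms"):
    # warn that templates are copied from the parent unit instead of recomputed
    warnings.warn(
        "Splitting without the 'waveforms' extension: the parent unit's template "
        "is copied to each split unit. Recompute the extension for exact values."
    )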

@samuelgarcia (Member):

This is amazing, comrade.

I would first do a PR focused only on the format, and then merge this one. What do you think?

@alejoe91 (Member Author):

I did, comrade! #3760

@alejoe91 added this to the 0.103.0 milestone on Jun 11, 2025
Comment on lines +71 to +74
full_spike_indices = []
for label in np.unique(self.labels):
label_indices = np.where(self.labels == label)[0]
full_spike_indices.append(label_indices)
Member Author:

Maybe make a helper function that returns a dict of indices given a label array, as sketched below.
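
A possible shape for that helper (the name is hypothetical):

import numpy as np

def get_indices_by_label(labels):
    # map each unique label to the positions where it occurs in the array
    return {label: np.flatnonzero(labels == label) for label in np.unique(labels)}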

# we propagate random spikes so they are not recomputed
extension_dict_ = extension_dict_split.copy()
extension_dict_.pop("random_spikes")
analyzer_hard.extensions["random_spikes"] = analyzer_split.extensions["random_spikes"]
Member Author:

Do a copy here and re-reference it to the analyzer.
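
Something like this, with a shallow copy re-pointed at the new analyzer (assuming `sorting_analyzer` is the extension's back-reference attribute):

import copy

ext_copy = copy.copy(analyzer_split.extensions["random_spikes"])
ext_copy.sorting_analyzer = analyzer_hard  # re-reference the copy to the new analyzer
analyzer_hard.extensions["random_spikes"] = ext_copy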



### SPLITTING ZONE ###
def apply_splits_to_sorting(sorting, unit_splits, new_unit_ids=None, return_extra=False, new_id_strategy="append"):
Member:

Maybe a small docstring, no?
You are becoming lazier than me.

Member Author:

Added.
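
For reference, a docstring along these lines (a sketch: the parameter descriptions are inferred from the signature and the discussion, not quoted from the merged code):

def apply_splits_to_sorting(sorting, unit_splits, new_unit_ids=None, return_extra=False, new_id_strategy="append"):
    """
    Apply splits to a sorting and return a new sorting with the split units.

    Parameters
    ----------
    sorting : BaseSorting
        The sorting object to split.
    unit_splits : dict
        Mapping from unit id to the spike indices of each split. The indices
        must be complete, i.e. cover every spike of the unit.
    new_unit_ids : list | None, default: None
        Ids of the new units. If None, they are generated following `new_id_strategy`.
    return_extra : bool, default: False
        If True, also return the list of new unit ids.
    new_id_strategy : str, default: "append"
        Strategy used to generate new unit ids when `new_unit_ids` is None.
    """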

new_unit_ids.append([str(m + i) for i in range(num_splits)])
else:
# we cannot automatically find new names
new_unit_ids.append([f"split{i}" for i in range(num_splits)])
Member:

Maybe we could also include the old unit id... not sure. Something like the sketch below.
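
For example (a hypothetical variant; `unit_to_split` names the parent unit id):

# unit "7" split in two would become "7-split0", "7-split1"
new_unit_ids.append([f"{unit_to_split}-split{i}" for i in range(num_splits)])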

else:
# dtype int
new_unit_ids.append(list(max(old_unit_ids) + 1 + np.arange(num_splits, dtype=dtype)))
old_unit_ids = np.concatenate([old_unit_ids, new_unit_ids[-1]])
Member:

Using this variable name for the growing set of new unit ids is a bit confusing on first read.

Member:

And the [-1] is not obvious; a more explicit spelling is sketched below.
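
Roughly the same logic with explicit names (variable names are suggestions only):

# ids already taken, including those created by previous splits
ids_for_this_split = list(max(taken_unit_ids) + 1 + np.arange(num_splits, dtype=dtype))
new_unit_ids.append(ids_for_this_split)
# grow the pool of taken ids so the next split starts after these
taken_unit_ids = np.concatenate([taken_unit_ids, ids_for_this_split])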

split_2 = split_2[~np.isin(split_2, split_indices)]
new_split_indices = [split_indices, split_2]
else:
new_split_indices = split_indices
Member:

Do we need to check completeness here?

Member Author:

Hi comrade. I just made some changes and added a completeness check in apply_splits_to_sorting and SortingAnalyzer.split_units.

This forces the user to provide complete spike indices, which is done automatically through the curation module and the apply_curation function.
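
The check could look roughly like this (a sketch, assuming `split_indices` is the list of per-split index arrays and `num_spikes` is the unit's total spike count):

all_indices = np.concatenate(split_indices)
if not np.array_equal(np.sort(all_indices), np.arange(num_spikes)):
    # also catches duplicated or out-of-range indices, not just missing ones
    raise ValueError(
        f"Split indices for unit {unit_id} are incomplete: every spike must "
        "be assigned to exactly one split."
    )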

keep_mask=keep_mask,
verbose=verbose,
**job_kwargs,
)
elif merging_mode == "hard":
recompute_dict[extension_name] = extension.params
else:
# split
try:
Member:

A try is not a good idea here for this new mechanism; we must catch any new bug!

Member Author:

Removed the try, and added a splitting mode.
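
So the split branch now runs bare, roughly like this (an excerpt-style sketch; the handler name is a placeholder, not the real API):

else:
    # splitting mode: no try/except wrapper, so any new bug in the
    # split mechanism surfaces immediately instead of being swallowed
    split_extension(extension)  # hypothetical handler, for illustration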

assert analyzer_curated.sorting.get_property("pyramidal", ids=[unit_id])[0]


def test_apply_curation_with_split_multi_segment():
Member:

I would make only multi-segment tests everywhere, to avoid separate mono/multi-segment tests.
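
E.g. building every test fixture with more than one segment (a sketch using spikeinterface's generate_sorting; the exact parameters are illustrative):

from spikeinterface.core import generate_sorting

# two segments; single-segment behavior is then covered as a special case
sorting = generate_sorting(num_units=5, durations=[10.0, 5.0], seed=0)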

{
"unit_id": 2,
"mode": "labels",
"labels": split_labels.tolist(),
Member:

Should we also test the other representation, by indices?
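
i.e. also exercising something like this (the key names for the indices form are assumed by analogy with the labels form; the index arrays are hypothetical):

{
    "unit_id": 2,
    "mode": "indices",
    "indices": [first_half_indices.tolist(), second_half_indices.tolist()],
}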

@samuelgarcia merged commit 7c91082 into SpikeInterface:main on Jul 21, 2025.
15 checks passed
@samuelgarcia (Member):

yeah!
