Exposing Cf amplitude in st[:,3] #844

ryan-ressmeyer · 2024-12-31T18:21:33Z

Hi! I've been working on some quality control metrics for KS4 and made a small change that I thought others might benefit from. I explain below:

A commonly used quality control metric for previous versions of Kilosort has been to quantify the truncation of the amplitude distribution as an estimate of the percentage of spikes missing in a given cluster (i.e. prevalence of type 2 errors). Such methods are implemented in bombcell and the IBL spike sorting pipeline. In the current version of Kilosort4, amplitudes.npy always returns an approximately gaussian distribution for amplitudes, making it impossible to estimate the degree of truncation. The amplitudes column in st[:,2] often shows a truncation, but occasionally appears to superimpose two truncated distributions with different truncation thresholds points (I believe due to normalization and merges, but I am not certain). This PR makes a small change to template_matching.py to append an extra column to st. This column contains the raw value of Cf for each spike, which is the value used to determine spike inclusion. This is the value that is directly compared to Th_learned, and always shows the truncation clearly. This makes the spike inclusion criteria clear to the user, and allows previously developed QC methods to be used with KS4.

I'm also including a figure below to help demonstrate this point. The top row is the a cluster identified by KS2.5, while the bottom 3 rows show the same cluster (high % of coincident spikes) identified by KS4. The top row plots amplitudes.npy from KS2.5 with a clear truncation after the red line (amplitude histograms in the right column). The second row shows amplitudes.npy from KS4 with no clear truncation. The third row shows st[:,2], which matches KS2.5 amplitudes.npy more closely, but seemly has two superimposed truncated distributions. The final row shows the proposed st[:,3] with a clearly truncated distribution that matches (roughly) the KS2.5 distribution. The truncation is also at the Th_learned = 8 value, which makes it clear to the user that this is the amplitude value used for spike inclusion.

Thanks for your time!
Ryan Ressmeyer

ryan-ressmeyer added 4 commits December 29, 2024 10:01

exposed Cf amplitude in st

4f03405

Merge branch 'MouseLand:main' into main

9d0302b

update comments to match new st

7d74a94

Merge branch 'MouseLand:main' into main

e887904

jacobpennington requested a review from marius10p December 31, 2024 18:31

jacobpennington assigned marius10p Dec 31, 2024

ryan-ressmeyer added 3 commits January 3, 2025 10:55

fixed bug with neg_spikes and th_amps

41fb331

Merge branch 'main' of https://github.com/ryan-ressmeyer/Kilosort

460987c

Merge branch 'MouseLand:main' into main

b82c0f3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Exposing Cf amplitude in st[:,3] #844

Exposing Cf amplitude in st[:,3] #844

ryan-ressmeyer commented Dec 31, 2024

Exposing Cf amplitude in st[:,3] #844

Are you sure you want to change the base?

Exposing Cf amplitude in st[:,3] #844

Conversation

ryan-ressmeyer commented Dec 31, 2024