DEMO: Analysing outputs of t2p

DEMO: Analysing outputs of t2p#

This is a short demo explaining how to explore the output of track2p and use the matched neurons/traces for custom downstream analysis.

The example here is for a 1 plane recording with simultaneous videography (given dataset is jm032).

# imports
import os
from types import SimpleNamespace

import numpy as np
import matplotlib.pyplot as plt

from scipy.stats import zscore

Step by step guide (more detailed explanations below):#

Each point from this list matches one section of this notebook

Load the output of track2p
Find cells that are present in all recordings (‘matched cells’)
Load the data from one example dataset and visualise it
Load the activity of the matched cells
Visualise the activity of matched cells

1) Load the output of track2p#

We will load the .npy files: t2p_output_path/track2p/plane#_match_mat.npy and t2p_output_path/track2p/track_ops.npy. These are the matrix of cell matches for all days and the settings respectively. For more info see the repo readme and documentation.

Note: In this demo a single-plane recording is used, but it can be modified easily for multiplane compatility (just repeat the same procedure while looping through planes)

# this is the directory that contains a /track2p folder that is output by running the track2p algorithm
t2p_save_path = 'data/jm/jm038/' # (change this based on your data)
plane = 'plane0' # which plane to process (the example dataset is single-plane)

# np.load() the match matrix (plane0_match_mat.npy)
t2p_match_mat = np.load(os.path.join(t2p_save_path, 'track2p', f'{plane}_match_mat.npy'), allow_pickle=True)

# np.load() settings (this contains suite2p paths etc.) (track_ops.npy)
track_ops_dict = np.load(os.path.join(t2p_save_path, 'track2p', 'track_ops.npy'), allow_pickle=True).item()
track_ops = SimpleNamespace(**track_ops_dict) # create dummy object from the track_ops dictionary

2) Find cells that are present in all recordings (‘matched cells’)#

Now from this matrix get the matches that are present on all days:

A matrix (plane#_match_mat.npy) containing the indices of matched neurons across the session for a given plane (# is the index of the plane). Since matching is done from first day to last, some neurons will not be sucessfully tracked after one or a few days. In this case the matrix contains None values. To get neurons tracked across all days only take the rows of the matrices containing no None values.

Note: of course we can use cells that are not present on all days, but for now this is the intended use case for downstream analysis.

# get the rows that do not contain any Nones (if track2p doesnt find a match for a cell across two consecutive days it will append a None) -> cells with no Nones are cells matched across all days
t2p_match_mat_allday = t2p_match_mat[~np.any(t2p_match_mat==None, axis=1), :]

print(f'Shape of match matrix for cells present on all days: {t2p_match_mat_allday.shape} (cells, days)')

Shape of match matrix for cells present on all days: (803, 7) (cells, days)

3) Load the data from one example dataset and visualise it#

Note: The track_ops.npy (‘settings file’) contains all the paths to suite2p folders used when running track2p (see cell below)

print('Datasets used for t2p:\n')
for ds_path in track_ops.all_ds_path:
    print(ds_path)

Datasets used for t2p:

/Users/jure/Documents/cossart_lab/data/jm/jm038/2023-04-30_a
/Users/jure/Documents/cossart_lab/data/jm/jm038/2023-05-01_a
/Users/jure/Documents/cossart_lab/data/jm/jm038/2023-05-02_a
/Users/jure/Documents/cossart_lab/data/jm/jm038/2023-05-03_a
/Users/jure/Documents/cossart_lab/data/jm/jm038/2023-05-04_a
/Users/jure/Documents/cossart_lab/data/jm/jm038/2023-05-05_a
/Users/jure/Documents/cossart_lab/data/jm/jm038/2023-05-06_a

Now just to test if the paths work we can try to look at data of one of the recordings (in the case below we use the last one). For this part it is important to know a bit about how the suite2p structures the outputs: https://suite2p.readthedocs.io/en/latest/outputs.html (the important things will be the ops.npy, stat.npy, iscell.npy and the F.npy). There are also separate tutorials and demos for this so we won’t go into so much detail.

# lets take the last dataset
last_ds_path = track_ops.all_ds_path[-1]
print(f'We will look at the dataset saved at: {last_ds_path}')

We will look at the dataset saved at: /Users/jure/Documents/cossart_lab/data/jm/jm038/2023-05-06_a

# load the three files
last_ops = np.load(os.path.join(last_ds_path, 'suite2p', plane, 'ops.npy'), allow_pickle=True).item()
last_f = np.load(os.path.join(last_ds_path, 'suite2p', plane, 'F.npy'), allow_pickle=True)
iscell = np.load(os.path.join(last_ds_path, 'suite2p', plane, 'iscell.npy'), allow_pickle=True)

# we filter the traces based on suite2p's iscell probability (note: it is crucial to use the same probability as in the track2p settings to keep the correct indexing of matches)
iscell_thr = track_ops.iscell_thr

print(f'The iscell threshold used when running track2p was: {iscell_thr}')

if track_ops.iscell_thr==None:
    last_f_iscell = last_f[iscell[:, 0] == 1, :]

else:
    last_f_iscell = last_f[iscell[:, 1] > iscell_thr, :]

The iscell threshold used when running track2p was: 0.5

# now first plot the mean image of the movie (it is saved in ops.npy, for more info see the suite2p outputs documentation)
plt.imshow(last_ops['meanImg'], cmap='gray')
plt.axis('off')
plt.title('Mean image')
plt.show()

plt.figure(figsize=(10, 1))
nonmatch_nrn_idx = 0
plt.plot(last_f[nonmatch_nrn_idx, :])
plt.xlabel('Frame')
plt.ylabel('F')
plt.title(f'Example trace (nrn_idx: {nonmatch_nrn_idx})')
plt.show()

plt.figure(figsize=(10, 3))
plt.imshow(zscore(last_f_iscell, axis=1), aspect='auto', cmap='Greys', vmin=0, vmax=1.96)
plt.xlabel('Frame')
plt.ylabel('ROI')
plt.title('Raster plot')
plt.show()

_images/c42a1a32a339a11b55c160e0aba7f42fb73816b7e5dd4712265bad9e9692ec55.png

_images/a1e61f086293ab571138c51426d67b0cd194cd4b95e4a10e6f960934e568ecb4.png

_images/76b9ac14bef2f24d91af4f2f8592c2cae9bd364d05c5926ae02485ff3332f81c.png

4) Load the activity of the matched cells#

Now that we know how to look at data in one recording we will use the output from track2p to look at activity of the same cells across all datasets.

To do this we need to loop through all datasets and:

load the files described above
filter stat.npy and fluo.npy by the track2p iscell threshold (classical suite2p)
filter stat.npy and fluo.npy by the appropriate indices from the matrix of neurons matched on all days (additional filtering step after track2p)

This will produce a nice data structure where the indices of cells are matched within the stat and fluo objects. Sorting the object in this way allows for very straightforward extraction of matched data (see cells below)

iscell_thr = track_ops.iscell_thr # use the same threshold as when running the algo (to be consistent with indexing)

all_stat_t2p = []
all_f_t2p = []
all_ops = [] # ops dont change

for (i, ds_path) in enumerate(track_ops.all_ds_path):
    ops = np.load(os.path.join(ds_path, 'suite2p', plane, 'ops.npy'), allow_pickle=True).item()
    stat = np.load(os.path.join(ds_path, 'suite2p', plane, 'stat.npy'), allow_pickle=True)
    f = np.load(os.path.join(ds_path, 'suite2p', plane, 'F.npy'), allow_pickle=True)
    iscell = np.load(os.path.join(ds_path, 'suite2p', plane, 'iscell.npy'), allow_pickle=True)
    
    
    if track_ops.iscell_thr==None:
        stat_iscell = stat[iscell[:, 0] == 1]
        f_iscell = f[iscell[:, 0] == 1, :]

    else:
        stat_iscell = stat[iscell[:, 1] > iscell_thr]
        f_iscell = f[iscell[:, 1] > iscell_thr, :]
    
    
    stat_t2p = stat_iscell[t2p_match_mat_allday[:,i].astype(int)]
    f_t2p = f_iscell[t2p_match_mat_allday[:,i].astype(int), :]

    all_stat_t2p.append(stat_t2p)
    all_f_t2p.append(f_t2p)
    all_ops.append(ops)

5) Visualise the ROIs and the activity of (a) matched cell(s)#

This example shows how to extract the information of a ROI from all_stat. We first index by the day to get stat_t2p from all_stat2p (this is the sorted stat object for that day). We can then get the roi information by indexing stat_t2p by the index of the cell match (because of resorting we use the same index across days).

wind = 24
nrn_idx = 0

for i in range(len(track_ops.all_ds_path)):
    mean_img = all_ops[i]['meanImg']
    stat_t2p = all_stat_t2p[i]
    median_coord = stat_t2p[nrn_idx]['med']

    plt.figure(figsize=(1.5,1.5))
    plt.imshow(mean_img[int(median_coord[0])-wind:int(median_coord[0])+wind, int(median_coord[1])-wind:int(median_coord[1])+wind], cmap='gray') # plot a short window around the ROI centroid
    plt.scatter(wind, wind)
    plt.axis('off')
    plt.show()

_images/375ce2a83cf345154889a7f87f5165020281eae541ae0d062070517f49076c9c.png

_images/f1032cb8fd5c22b0ebc865146120e542ef9aab7f6e209946d56cce5dcf54e0d0.png

_images/9557bc47d2122cc95c471639693711ddfa8d4a1217d36f5ec0e2637a7f1d6d56.png

_images/4f84a575257b6d717626dfb199ff9f7b2d36893028620c2d672a7aa9485d3c10.png

_images/15179c17ba55bf8c87e24fbe2c1ff9360aef3a24cdd918bc36c7bc03f3e9665c.png

_images/dd0774521a337658dc1db8ee48843e7434171721052265d2738ce941d13295e6.png

_images/6941c9f0bc6ca41cb169154b4cf55d4df9c6713f12994277b9dcc804ccddc02c.png

# first plot the trace of cell c for all days
nrn_idx = 0 # the activity of the ROI visualised above on all days

for i in range(len(track_ops.all_ds_path)):
    plt.figure(figsize=(10, 1)) # make a wide figure
    plt.plot(all_f_t2p[i][nrn_idx, :])
    plt.xlabel('Frame')
    plt.ylabel('F')
    plt.show()

_images/f9958423ebb7cc37942d2a5e45ce873483a0a4bfa6beb26caedee4ff17a1ca3d.png

_images/695b27308b2fabdf7e6d8cade7d48be8e9a8d7098f1a2da1b5c3788b9a4549ab.png

_images/e1e9838d58286ebd2eb0d11fddaad57983c5e09dcedc0df42ff28dd899f27f91.png

_images/2c4ef408b0441dc5a0cdf7bfaabc7884b45d3dbe7fbe465f2af8edc786cac808.png

_images/5d40c5fa433fc75e53416cbae0caa3e857613b98a5acd8d76b2722445b5fe212.png

_images/5d3b13bda5725a4cce732a22e2be4466af3e85a57250d228a9d30491b111e74a.png

_images/ce26e65de663f0b120b0cbb116818113823b53a49311a5f190c6507330ad6567.png

Now to visualise the rasters its a simple exercise, since they are already sorted in a way that the rows represent the same cell across days we don’t need to do anything other than simply looping through all_f_t2p and plotting each element as we did before.

for i in range(len(track_ops.all_ds_path)):
    plt.figure(figsize=(10, 3)) # make a wide figure
    f_plot = zscore(all_f_t2p[i], axis=1)
    plt.imshow(f_plot, aspect='auto', cmap='Greys', vmin=0, vmax=1.96)
    plt.xlabel('Frame') 

_images/e7cf496cd95cb8770b379089c998d610e4e3d1b4054d1aefc2bb3c969641589a.png

_images/bbb73f79ebd27dc013361fbd1537f740b874edda0dabdfef34590470903120ef.png

_images/c7e124c95f499cc7de9038f25fb1c86032e78ba8e88137d0e58043996ebf3e37.png

_images/f747e11cec0f2572023354a1d84f03caad6a8deaf4f16a0050aad0950ee77b1e.png

_images/a893b020b0927080e5d0d55d7a7c232c8ee395a184427040c4be77ce7a96d6a2.png

_images/d2512ae80ad3f96dc2649d4c20e3c819b36823d9cfce02342389bc45cd055955.png

_images/5f331bdff9c0da38497296baf6dd5865deaabbd744d2d9260ae253fc36df86ba.png

The End!#

Congrats! Hopefully this notebook was a clear and useful way of showing how to interact with the track2p outputs.

From here on custom analysis pipelines can very easily be applied (for example looking at stability of assemblies, representational drift etc etc).

The most straightforward way of doing this is to just run an already implemented pipeline on the data loaded as shown here. Alternatively the loaded match indices can be used to look at already-processed data as a way of post-hoc matching.

Thanks and have fun with analysis :)