MSPSVAE¶

class behavenet.models.vaes.MSPSVAE(hparams)[source]¶

Bases: PSVAE

Partitioned subspace variational autoencoder class for multiple sessions.

Methods Summary

`build_model`()	Construct the model using hparams.
`export_latents`(data_gen[, filename])	Need to create standard data generator in order to export latents.
`forward`(x[, dataset, use_mean])	Process input data.
`get_inverse_transformed_latents`(inputs[, ...])	Return latents after they have been transformed using the diagonal mapping D.
`get_predicted_labels`(x[, dataset, use_mean])	Process input data to get predicted labels.
`get_transformed_latents`(inputs[, dataset, ...])	Return latents after supervised subspace has been transformed to original label space.
`loss`(datas[, dataset, accumulate_grad, ...])	Calculate modified ELBO loss for MSPSVAE.

Methods Documentation

build_model()[source]¶: Construct the model using hparams.

export_latents(data_gen, filename=None)[source]¶: Need to create standard data generator in order to export latents.

forward(x, dataset=None, use_mean=False, **kwargs)[source]¶

Process input data.

Parameters:

x (torch.Tensor object) – input data
dataset (int) – used with session-specific io layers
use_mean (bool) – True to skip sampling step

Returns:

x_hat (torch.Tensor): output of shape (n_frames, n_channels, y_pix, x_pix)
y_hat (torch.Tensor): output of shape (n_frames, n_channels, y_pix, x_pix)
z (torch.Tensor): sampled latent variable of shape (n_frames, n_latents)
mu (torch.Tensor): mean paramter of shape (n_frames, n_latents)
logvar (torch.Tensor): logvar paramter of shape (n_frames, n_latents)

Return type:

tuple

get_inverse_transformed_latents(inputs, dataset=None, as_numpy=True)[source]¶

Return latents after they have been transformed using the diagonal mapping D.

Parameters:

inputs (torch.Tensor object) –
- image tensor of shape (batch, n_channels, y_pix, x_pix)
- latents tensor of shape (batch, n_ae_latents) where the first n_labels entries are assumed to be labels in the original pixel space
dataset (int, optional) – used with session-specific io layers
as_numpy (bool, optional) – True to return as numpy array, False to return as torch Tensor

Returns:

array of latents in transformed latent space

Return type:

np.ndarray or torch.Tensor object

get_predicted_labels(x, dataset=None, use_mean=True)[source]¶

Process input data to get predicted labels.

Parameters:

x (torch.Tensor object) – input data
dataset (int) – used with session-specific io layers
use_mean (bool) – True to skip sampling step

Returns:

x_hat (torch.Tensor): output of shape (n_frames, n_channels, y_pix, x_pix)
y_hat (torch.Tensor): output of shape (n_frames, n_channels, y_pix, x_pix)
z (torch.Tensor): sampled latent variable of shape (n_frames, n_latents)
mu (torch.Tensor): mean paramter of shape (n_frames, n_latents)
logvar (torch.Tensor): logvar paramter of shape (n_frames, n_latents)

Return type:

tuple

get_transformed_latents(inputs, dataset=None, as_numpy=True)[source]¶

Return latents after supervised subspace has been transformed to original label space.

Parameters:

inputs (torch.Tensor object) –
- image tensor of shape (batch, n_channels, y_pix, x_pix)
- latents tensor of shape (batch, n_ae_latents)
dataset (int, optional) – used with session-specific io layers
as_numpy (bool, optional) – True to return as numpy array, False to return as torch Tensor

Returns:

array of latents in transformed latent space

Return type:

np.ndarray or torch.Tensor object

loss(datas, dataset=None, accumulate_grad=True, chunk_size=None)[source]¶

Calculate modified ELBO loss for MSPSVAE.

The batch is split into chunks if larger than a hard-coded chunk_size to keep memory requirements low; gradients are accumulated across all chunks before a gradient step is taken.

Parameters:

datas (list of dict) – batch of data; keys should include ‘images’ and ‘masks’, if necessary
datasets (list of int) – used for embedding loss
accumulate_grad (bool, optional) – accumulate gradient for training step
chunk_size (int, optional) – deprecated

Returns:

‘loss’ (float): full elbo
’loss_ll’ (float): log-likelihood portion of elbo
’loss_kl’ (float): kl portion of elbo
’loss_mse’ (float): mse (without gaussian constants)
’beta’ (float): weight in front of kl term

Return type:

dict