Overview of an hctsa analysis

At its core, *hctsa* analysis involves computing a library of time-series analysis features (which we call *operations*) on a time-series dataset.

The basic sequence of a Matlab-based *hctsa* analysis is to: 1. *Initialize* a

`HCTSA.mat`

file, which contains all of the information about the set of time series and operations in your analysis, as well as the results of applying all operations to all time series, using `TS_Init`

,- 1.These operations can be computed on your time-series data using
`TS_Compute`

. The results are structured in the local`HCTSA.mat`

file containing matrices (that store the results of the computations) and the tables (that store information about the time-series data and operations), as described here. - 2.After the computation is complete, a range of processing, analysis, and plotting functions are provided to understand and interpret the results.

As a quick check of your operation library, you can compute the full default code library on a time-series data vector (a column vector of real numbers) as follows:

x = randn(500,1); % A random time-series

featVector = TS_CalculateFeatureVector(x,false); % compute the default feature vector for x

Suppose you have have a time-series dataset to analyze. You first generate a formatted *hctsa* calculation using the default library of features:

`INP_ts.mat`

input file containing your time series data and associated name and keyword labels, as described here. You then initialize an TS_Init('INP_ts.mat');

This generates a local file,

`HCTSA.mat`

containing the associated metadata for your time series, as well as information about the full time-series feature library (`Operations`

) and the set of functions and code to call to evaluate them (`MasterOperations`

), as described here.Next you want to evaluate the code on all of the time series in your dataset. For this you can simply run:

TS_Compute;

As described here, or, for larger datasets, using a script to regularly save back to the local file (cf.

`sample_runscript_matlab`

).Having run your calculations, you may then want to label your data using the keywords you provided in the case that you have labeled groups of time series:

TS_LabelGroups;

and then normalize and filter the data using the default sigmoidal transformation:

TS_Normalize;

A range of visualization scripts are then available to analyze the results, such as plotting the reordered data matrix:

TS_Cluster; % compute a reordering of data and features

TS_PlotDataMatrix; % plot the data matrix

To inspect a low-dimensional representation of the data:

TS_PlotLowDim;

Or to determine which features are best at classifying the labeled groups of time series in your dataset:

TS_TopFeatures;

Each of these functions can be run with a range of input settings.

Last modified 2yr ago