This workflow executes the following steps:

  • Computes the autosum of the sequences with autoSum.

  • Computes the distance between paired samples and sums it with distancePairedSamples.

  • Computes the dissimilarity measure psi with the function psi.

  • Delivers an output of type "list" (default), "data.frame" or "matrix", depending on the user input, through formatPsi.

workflowPsiPairedSamples(
  sequences = NULL,
  grouping.column = NULL,
  time.column = NULL,
  exclude.columns = NULL,
  method = "manhattan",
  format = "dataframe",
  parallel.execution = TRUE
  )

Arguments

sequences

dataframe with multiple sequences identified by a grouping column generated by prepareSequences.

grouping.column

character string, name of the column in sequences to be used to identify separates sequences within the file.

time.column

character string, name of the column with time/depth/rank data.

exclude.columns

character string or character vector with column names in sequences to be excluded from the analysis.

method

character string naming a distance metric. Valid entries are: "manhattan", "euclidean", "chi", and "hellinger". Invalid entries will throw an error.

format

string, type of output. One of: "data.frame", "matrix". If NULL or empty, a list is returned.

parallel.execution

boolean, if TRUE (default), execution is parallelized, and serialized if FALSE.

Value

A list, matrix, or dataframe, with sequence names and psi values.