The table shows the main model parameters for each experiment.
The tables in this section show the standard association metrics between human scores and different types of machine scores. These results are computed on the evaluation set. The scores for each model have been truncated to [min-0.4998, max+0.4998]. Where indicated, scaled scores are computed by rescaling the predicted scores using the mean and standard deviation of the human scores observed on the training data and the mean and standard deviation of the machine scores predicted for the training set.
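The truncation and rescaling described above can be sketched as follows. This is a minimal illustration, not the tool's actual implementation: the function names `truncate` and `rescale` are hypothetical, and the use of the population standard deviation is an assumption (because both standard deviations are computed over the same training responses, their ratio is the same with either the population or the sample estimator).

```python
from statistics import mean, pstdev

TOLERANCE = 0.4998  # tolerance quoted in the text above


def truncate(score, min_score, max_score, tolerance=TOLERANCE):
    """Clamp a predicted score to [min_score - tolerance, max_score + tolerance]."""
    lower = min_score - tolerance
    upper = max_score + tolerance
    return max(lower, min(upper, score))


def rescale(scores, train_machine, train_human):
    """Map predicted scores onto the human score scale.

    Each score is standardized with the mean and SD of the machine scores
    predicted for the training set, then mapped back using the mean and SD
    of the human scores observed on the training data.
    """
    m_mean, m_sd = mean(train_machine), pstdev(train_machine)
    h_mean, h_sd = mean(train_human), pstdev(train_human)
    return [(s - m_mean) / m_sd * h_sd + h_mean for s in scores]
```

For example, on a 1 to 5 scale, `truncate(6.2, 1, 5)` yields a value just below 5.5, so a later rounding step cannot produce a score outside the scale.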
The table shows distributional properties of human and system scores. SMD values lower than -0.15 or higher than 0.15 are highlighted.
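As a rough illustration of the highlighting rule, the sketch below computes a standardized mean difference and checks it against the ±0.15 thresholds. The text does not define the SMD denominator, so dividing by the sample standard deviation of the human scores is an assumption made here; the function names `smd` and `is_flagged` are hypothetical.

```python
from statistics import mean, stdev

SMD_THRESHOLD = 0.15  # threshold quoted in the text above


def smd(system, human):
    """Standardized mean difference between system and human scores.

    Assumption: the difference in means is divided by the sample SD of the
    human scores. Other definitions (for example, a pooled SD) are possible.
    """
    return (mean(system) - mean(human)) / stdev(human)


def is_flagged(value, threshold=SMD_THRESHOLD):
    """Return True when the value is lower than -threshold or higher than threshold."""
    return value < -threshold or value > threshold
```

A positive value indicates that system scores are, on average, higher than human scores; a value exactly at a threshold is not flagged under this reading of the rule.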
The table shows the standard association metrics between human scores and machine scores. Note that some evaluations are based on rounded (Trim-round) scores, computed by first truncating and then rounding the predicted score.
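The truncate-then-round step can be sketched as below. This is an illustrative sketch under stated assumptions: `trim_round` is a hypothetical name, the 0.4998 tolerance is the truncation tolerance quoted earlier in this section, and rounding halves upward is an assumption (the text does not say how ties are resolved).

```python
import math


def trim_round(score, min_score, max_score, tolerance=0.4998):
    """Truncate a predicted score to the allowed range, then round it.

    Truncating to [min_score - tolerance, max_score + tolerance] first
    guarantees that the rounded result stays within [min_score, max_score].
    """
    truncated = max(min_score - tolerance, min(max_score + tolerance, score))
    # Assumption: halves are rounded up (floor(x + 0.5)).
    return int(math.floor(truncated + 0.5))
```

Because the tolerance is slightly below 0.5, a truncated value at either boundary always rounds back to the nearest valid score point, whichever tie-breaking rule is used.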