Published November 11, 2020 | Version 0.1
Dataset Open

Reinforcement Learning Control of a Biomechanical Model of the Upper Extremity

  • 1. University of Bayreuth, Bayreuth, Germany

Description

This dataset contains evaluation data of the paper "Reinforcement Learning Control of a Biomechanical Model of the Upper Extremity".

Motivation

We address the question whether the assumptions of signal-dependent and constant motor noise in a full skeletal model of the human upper body, together with the objective of movement time minimization, can predict reaching movements.

For evaluation of the learned policy, two tasks are defined in the paper: a Fitts' Law type task and an elliptic via-point task.

General description of the dataset

  • Fitts' Law Type Task
    • This dataset incorporates detailed information of all 6500 synthesized movements generated in the Fitts' Law type task (following the ISO 9241-9 standard).
    • For each of the 10 task conditions differing in distance between targets ("dist<xxx>" in filename)
      and ID ('ID<xxx>' in filename), there are two .csv-files:
      - one with detailed trajectory information on a sample-to-sample basis ("ISO_SAMPLES" in filename), and
      - one with aggregated movement information on an episode basis ("ISO_METRICS" in filename).

    • In addition, for each task condition and each of the 13 movement directions in the Fitts' Law type task,
      we include 6 figures: Position, Velocity, and Acceleration Profiles, as well as 3D movement path, Phasespace, and Hooke plots.
      Apart from the 3D plots, all figures use centroid projections of the respective trajectory onto the vector between initial and target position.
      The first integer in the file name denotes the movement direction number, starting with "0" for movements between the targets 1 and 2, "1" for movements between the targets 2 and 3 etc.
      The file "6_distance0.35_ID2_policy2100000_phasespace.png", e.g., shows velocity plotted againt position for all 50 movements between the targets 7 and 8 in the task condition with ID 2 and 35cm diameter of the target circle (see Fig 2 in Paper).

  • Elliptic Task

    • This dataset also contains two CSV-files with data of the trajectory generated by the final policy in the elliptic task:
      - one with detailed trajectory information on a sample-to-sample basis ("ELLIPSE_SAMPLES" in filename), and
      - one with aggregated movement information on an episode basis, where a new episode starts
      whenever the target on the ellipse given to the policy switches ("ELLIPSE_METRICS" in filename).

Description of the .csv-files

  • SAMPLES Files
    • "time": time after reaching the initial target (target 1 in Fig 2) for the first time (in seconds)
    • "elv_angle_pos" - "flexion_pos": angle of respective independent DOF (in radians) *
    • "elv_angle_vel" - "flexion_vel": angular velocity of respective independent DOF (in radians/s) *
    • "end-effector_xpos_x" - "end-effector_xpos_z": 3D position of end-effector in global coordinates (in meters) *
    • "target_xpos_x" - "target_xpos_z": 3D position of target sphere in global coordinates (in meters) *
    • "end-effector_xvelp_x" - "end-effector_xvelp_z": positional velocities of end-effector (in meters/s) *
    • "target_xvelp_x" - "target_xvelp_z": positional velocities of target sphere (in meters/s) *
    • "accsensor_end-effector_x" - "accsensor_end-effector_z": positional acceleration of end-effector (in meters/s^2) *
    • "E_elv_angle" - "E_flexion": activation of respective independent DOF *
    • "E_elv_angle_derivative" - "E_flexion_derivative": derivative of activation of respective independent DOF *
    • "difference_vec_x" - "difference_vec_z": vector between the end-effector attached to the index finger and the target, pointing towards the target (in meters) *
    • "centroid_vel_projection": projection of end-effector velocity towards target (in meters/s) *
    • "target_width": radius (!) of the target sphere (in meters) *
    • "A_elv_angle" - "A_flexion": action vector
    • "thorax_tx_frc" - "wrist_hand_r3_frc": net external force at respective DOF (including dependent and fixed DOFs such as "thorax_tx" (thorax translation))
    • "reward": reward obtained in this step
    • "step_type": 0=initial step of episode, 1=intermediate step of episode, 2=terminal step of episode
    • "target_switch": whether the target switched in this step
    • "discount": internal value of tf-agents (does not correspond to the discount factor gamma, which is additionally applied!)
    • "thorax_tx_pos" - "wrist_hand_r3_pos": angle of respective dependent DOF (in radians)
    • "thorax_tx_vel" - "wrist_hand_r3_vel": angular velocity of respective dependent DOF (in radians)
  • METRICS Files
    • Index: episode ID
    • "Init_X" - "Init_Z": initial position in global coordinates (in meters)
    • "Init_X" - "Init_Z": target position in global coordinates (in meters)
    • "Init_Distance": distance between last target (i.e., desired initial position) and current target (in meters)
    • "Target_Width_Diameter": target width diameter (in meters)
    • "Movement_ID": Index of Difficulty of current movement (using the Shannon Formulation) (in bits)
    • "target_accuracy": 1 - (<remaining distance to target center at the end of the episode>/<target radius>) if end-effector is inside target, 0 else
    • "movement_time": duration of the episode (in seconds)
    • "episode_successful": whether episode terminated successfully within the permitted 1.5 seconds
    • "dist2target": remaining distance to target center at the end of the episode (in meters)

-------------------------------------
* included in state space

Files

ELLIPSE_METRICS_0.075-0.03radii_60.0s_policy1200000.csv

Files (449.2 MB)

Name Size Download all
md5:65de49c11ed565cf1a12664b5266bc51
117.2 kB Preview Download
md5:678cc44268928c7d50d9fe4c274a8de3
8.5 MB Preview Download
md5:00fa5f0b86a5efef4f9d9d5bcab140c9
79.9 MB Preview Download
md5:70f80cc86bfffd8b21d6168d0121db6d
112.5 kB Preview Download
md5:3147d773706511ff05624b87f3b72d8e
132.0 kB Preview Download
md5:f5cdf0909e65b22de9386f8378565f73
132.9 kB Preview Download
md5:4b144c85e4645b4f34c1d08ac7007d4d
133.5 kB Preview Download
md5:18cf5ead21d1b209bc1e45eb74fb12a5
133.5 kB Preview Download
md5:11bf98da740397bbd2488d3bd567f8f7
134.2 kB Preview Download
md5:491deeacaddd2f64ce0d39da2a4dbd65
133.5 kB Preview Download
md5:acb6574769051d165d903406041ef998
132.4 kB Preview Download
md5:76518cce100e55b2cfab4374d6873963
122.0 kB Preview Download
md5:6b85f46a050c9448b0b9384d2607500f
132.3 kB Preview Download
md5:3b905387c58fb45953278a5916b09422
20.8 MB Preview Download
md5:1697232a5e47adc062bc24cd471c4bff
25.4 MB Preview Download
md5:73e4e5ccce96361f98447c10aa2ac3d4
30.9 MB Preview Download
md5:d0b4b06bc66d51d9ef3553d301f6ee41
37.1 MB Preview Download
md5:6571b15e224652742660cc7f4b35ce58
31.3 MB Preview Download
md5:266b27c5fbff5264982ef1809c711b57
33.5 MB Preview Download
md5:aa9f9e707ba2db74f58674bfda32941b
41.6 MB Preview Download
md5:992ecd4a4a921ba9d3761e34f1d8cc1f
40.1 MB Preview Download
md5:4af847ba532fb3aeeb3a99dc495b7e8e
46.4 MB Preview Download
md5:b69539b14ba0558d4f38cc941e670fef
52.3 MB Preview Download