Using K-means to cluster kinetic features in above-knee amputees

Abstract

The day-to-day mobility of above-knee amputees is related to the design of their prosthesis. Although some studies have examined the biomechanics of this underrepresented population, the kinetics during stand-up and sit-down movements (STS) have not been fully studied. Here we use K-means to evaluate kinetic behavior during STS, extracted from Hunt et al.'s public dataset. We found that unsupervised clustering could separate the kinetic signals of the intact and prosthetic sides, but struggled to separate the different brands of prosthetics. We conclude that there is minimal variance in participants' prosthetic-leg kinetic behavior regardless of brand, but that there is a difference between the intact and prosthetic sides.

Data

Public Data Set

The publicly available dataset used in this project comes from Hunt et al.'s paper "Open dataset of kinetics, kinematics, and electromyography of above-knee amputees during stand-up and sit-down". The dataset includes 3D kinematic and kinetic data for 9 above-knee amputees during stand-up and sit-down with their passive, microprocessor-controlled prostheses. The biomechanics were captured using a 12-camera motion capture system with two force plates and four EMG sensors on the intact lower limb.

In this tutorial, we will use the whole dataset, which can be downloaded from their original website under the name V3D_STS.zip. Within this folder, each participant has a workspace, for a total of 9 workspaces, and these workspaces can be loaded directly into Visual3D and Sift.

In Visual3D you can see the full body model that was built using a modified Plug-In gait model.

Kinetic Data

In this analysis we will focus on the Ground Reaction Force (GRF), Knee Joint Flexion/Extension Moment, and Hip Joint Flexion/Extension Moment, normalized to body weight. These link-model-based items were created using the pipelines available in the dataset; you can find the pipeline files under the path “V3D_STS/V3D_Pipeline_Files”. Below is an example of the pipeline command for generating Knee Joint Moment (Torque).

Compute_Model_Based_Data
/RESULT_NAME=L_knee_torque
/SUBJECT_TAG=ALL_SUBJECTS
/FUNCTION=JOINT_MOMENT
/SEGMENT=LSK
/REFERENCE_SEGMENT=
/RESOLUTION_COORDINATE_SYSTEM=LTH
! /USE_CARDAN_SEQUENCE=FALSE
/NORMALIZATION=TRUE
/NORMALIZATION_METHOD=DEFAULT_NORMALIZATION
! /NORMALIZATION_METRIC=
! /NEGATEX=FALSE
/NEGATEY=TRUE
/NEGATEZ=TRUE
! /AXIS1=X
! /AXIS2=Y
! /AXIS3=Z
! /TREADMILL_DATA=FALSE
! /TREADMILL_DIRECTION=UNIT_VECTOR(0,1,0)
! /TREADMILL_SPEED=0.0
;

Methods

Visual3D Processing

Although the dataset provides completed .cmz workspaces, some processing is required to prepare them for analysis.

1. Download V3D_STS.zip

2. Add Tags for each workspace: Each workspace represents a single participant. Since there are only 9 participants in this dataset, we will add tags manually. The table shown below is from the original paper and includes the demographics and details of all participants. In this tutorial, we will add tags for these two columns: “Prosthesis Side” and “Knee Prosthesis”.

Load workspace: In Visual3D, select File → Open/Add…, and go to the V3D_STS folder downloaded previously. Open a workspace. As an example, open the workspace for the TF01 participant: 20210916_TF01_STS.cmz. You should see the workspace being loaded as follows:

Add tags: Click Add New File Tag, and you will see a pop-up window titled Enter the new file tag. According to the table, TF01 has “Prosthesis Side” as “L”, so we enter “Left” in the pop-up window and click Continue». Similarly, since TF01 uses “C-Leg” as “Knee Prosthesis”, we add another tag named “CLeg”. Make sure both tags are checked.
- Note that there should be no hyphens in the tag!

Now that we have added the tags for TF01, we can repeat the same steps for the other workspaces.
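The tagging rule above (side letter spelled out, hyphens removed from the brand name) can be sketched as a small helper. This is only an illustration for keeping tag names consistent across the 9 workspaces; `make_tag` is a hypothetical name, not part of Visual3D or Sift, and the “Right” tag for right-side prostheses is an assumption that mirrors the “Left” tag used for TF01.

```python
def make_tag(table_value: str) -> str:
    """Convert a value from the participant table into a hyphen-free tag.

    "L" / "R" become "Left" / "Right" ("Right" is assumed by analogy with
    the "Left" tag used for TF01); any other value has its hyphens and
    spaces removed, e.g. "C-Leg" -> "CLeg".
    """
    side_names = {"L": "Left", "R": "Right"}
    if table_value in side_names:
        return side_names[table_value]
    return table_value.replace("-", "").replace(" ", "")


# TF01: Prosthesis Side = "L", Knee Prosthesis = "C-Leg"
tf01_tags = [make_tag("L"), make_tag("C-Leg")]
print(tf01_tags)
```

The tags themselves are still entered by hand in Visual3D; the helper only documents the naming convention so that the later tag-based refinements in Sift match exactly.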

Loading Data into Sift

Within Sift we can visualize and analyze the data.

Select Load Library and click on Browse to go to the V3D_STS folder where all 9 workspaces with updated tags are located. Then select Load and exit the window. Make sure you see the tags showing up on the screen. You could also click on the + button on the left side of each workspace to see whether the corresponding tags are being checked.

Build Queries in Sift

In our analysis we have two main objectives:

To compare the kinetic signals of the intact side vs. the prosthetic side.
To compare the kinetic signals of the three knee prosthetic brands (i.e. C-Leg, Rheo, Plie).

To do so we will focus on Ground Reaction Force, Knee Joint Moment in Flexion/Extension, and Hip Joint Moment in Flexion/Extension.

Queries for Objective 1

We will build queries for Ground Reaction Force as an example, but the steps and logistics for Knee Joint Moment, and Hip Joint Moment are the same.

Open the Query Builder icon on the Explore Page or on the toolbar to prompt the Query Builder Dialog.

Create a new query with Query Name: GRF_intact. This would be the query for all participants' intact-side. Within this query, create the following conditions:

1. Condition Name: left prosthesis CLeg.

This condition includes all participants whose left leg is the prosthetic leg AND who use the C-Leg knee prosthetic brand.
In the Signals tab, select the following settings:
- Since this participant's left leg is the prosthetic, the GRF of the intact leg would be the RIGHT leg. Thus, R_GRF is selected as the Signal Name.
- Since the Z-axis is the vertical axis in the coordinate system in this study, we select Z for the Component, as the ground reaction force is in the vertical direction from the floor up to the leg.

In the Events tab, within the All Events block, you should see two events: Foot Off and Foot Strike. Select Foot Off and click the > button to add it into Event Sequence block. Then select Foot Strike and click the > button to add it into Event Sequence block. If you accidentally selected the wrong event, you can also click the < button to move the event from Event Sequence back to All Events block. You can also use the Up and Down buttons to move the event correspondingly in the sequence.

In the Refinement tab, check Refine using tag and Use AND Logic, and then select the tags CLeg and Left. Then click Save.

2. Condition Name: right prosthesis CLeg.

This name indicates that the participant has the right leg as the prosthesis, so the left leg is the intact leg.
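Signals tab: Select L_GRF as the Signal Name and Z as the Component.

Events tab: Same as previous.
Refinement tab: Check Refine using tag and Use AND Logic, and then select the tags CLeg and Right.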

Click Save.

Up to this point, we have successfully created the conditions for C-Leg. The same steps are repeated for Plie and Rheo, and the only difference would be to select the corresponding tags for the different brands of prosthetics.

After creating the GRF_intact query, we can use the same logic to create the GRF_prosthesis query. Note that in this case, when we create, for example, the left prosthesis CLeg condition, we select L_GRF as the Signal Name, because the left leg is the prosthetic leg.
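The side-selection rule used in these conditions is easy to get backwards, so here is a minimal sketch of it. The function name `grf_signal` is hypothetical and the code is not Sift scripting; it only encodes the rule that the intact-limb signal comes from the side opposite the prosthesis, using the R_GRF and L_GRF signal names from the text.

```python
def grf_signal(prosthesis_side: str, limb: str) -> str:
    """Return the GRF signal name for the requested limb.

    prosthesis_side: "Left" or "Right" (the workspace tag).
    limb: "intact" or "prosthesis" (which query the condition belongs to).
    """
    opposite = {"Left": "Right", "Right": "Left"}
    if prosthesis_side not in opposite:
        raise ValueError(f"unknown side: {prosthesis_side!r}")
    if limb not in ("intact", "prosthesis"):
        raise ValueError(f"unknown limb: {limb!r}")
    # The prosthetic limb is on the tagged side; the intact limb is opposite.
    side = prosthesis_side if limb == "prosthesis" else opposite[prosthesis_side]
    return f"{side[0]}_GRF"


print(grf_signal("Left", "intact"))      # condition "left prosthesis CLeg" in GRF_intact
print(grf_signal("Left", "prosthesis"))  # same condition in GRF_prosthesis
```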

The queries for Hip Moment and Knee Moment have the same logic as above.

Queries for Objective 2

For this objective, we compare the kinetic data of the prosthetic leg across different brands. Thus, we will create queries specifically for each brand: C-Leg, Plie, and Rheo. For illustration purposes, we will take C-Leg as an example, but the idea is the same for the other two brands.

In the Explore Page, click on the Query Builder icon.
Create a new query, and set Query Name as “CLeg GRF_prosthesis” to indicate that this query is for participants who use C-Leg as their prosthetic leg, and we will focus on the Ground Reaction Force of the prosthetic legs.
Within this query, create two conditions:

1. A condition named as left prosthesis for participants with left leg as prosthetic.

Signals :
- Type: LINK_MODEL_BASED
- Folder: ORIGINAL
- Signal Name: L_GRF
- Component: Z
Events Sequence is Foot Off followed by Foot Strike
Refinement:
- Refine using tag: checked
- Use AND Logic: checked
- Tags: CLeg, Left
- Click Save

2. A condition named as right prosthesis for participants with right leg as prosthetic.

Signals :
- Type: LINK_MODEL_BASED
- Folder: ORIGINAL
- Signal Name: R_GRF
- Component: Z
Events Sequence is Foot Off followed by Foot Strike
Refinement:
- Refine using tag: checked
- Use AND Logic: checked
- Tags: CLeg, Right
- Click Save

Now that we have built the C-Leg query for GRF, we can do the same for Knee Joint Moment and Hip Joint Moment. Note that the Component should be set to X for Knee Joint Moment and Hip Joint Moment, because the flexion/extension moments of these joints act about the X-axis, whereas the vertical GRF acts along the Z-axis.

After all queries are created, click on Calculate All Queries at the bottom of the Query Builder Dialog.
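With three brands, three signals, and two prosthesis sides, Objective 2 needs 9 queries with 2 conditions each. The following sketch builds that list as plain Python data so it can be used as a checklist while clicking through the Query Builder. It is not a Sift API; the dictionary keys are made up for illustration, the query names follow the ones used in this tutorial, and the “Right” tag is assumed to have been created in the same way as “Left”.

```python
from itertools import product

BRANDS = ["CLeg", "Plie", "Rheo"]
# (query-name suffix, component): Z for vertical GRF, X for flexion/extension moments
SIGNALS = [
    ("GRF_prosthesis", "Z"),
    ("Knee Moment prosthesis", "X"),
    ("Hip Moment prosthesis", "X"),
]
SIDES = ["Left", "Right"]


def build_query_plan():
    """Return one entry per query, each with a left and a right condition."""
    plan = []
    for brand, (suffix, component) in product(BRANDS, SIGNALS):
        conditions = [
            {
                "condition": f"{side.lower()} prosthesis",
                "signal_side": side[0],  # prosthetic limb: use the L_ or R_ signal
                "component": component,
                "events": ["Foot Off", "Foot Strike"],
                "tags": [brand, side],   # combined with AND logic
            }
            for side in SIDES
        ]
        plan.append({"query": f"{brand} {suffix}", "conditions": conditions})
    return plan


plan = build_query_plan()
for entry in plan:
    print(entry["query"], [c["tags"] for c in entry["conditions"]])
```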

Visualizing Data in Sift

Before going into the analysis we can visualize the data to see if we can identify any patterns between groups.

As an example, we can quickly visualize the GRF difference between the intact sides and the prosthetic sides:

Go to the Explore page
Select both GRF_intact and GRF_prosthesis in the Groups block by pressing Ctrl, and then check Select All Workspaces.
Check Plot Group Mean and Plot Group Dispersion.
A plot like the following is generated:

The X-axis is the normalized time points, with the following events in the corresponding ranges:

Standing Up: Point 0 ~ 40
Standing: Point 40 ~ 60
Sitting Down: Point 60 ~ 100
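To make the normalized time axis concrete, the sketch below resamples a trace onto 101 points (0 to 100) by linear interpolation and labels each point with the phase ranges listed above. Both function names are hypothetical, the 101-point grid is an assumption about how the normalized axis is sampled, and because the listed ranges share their end points, assigning points 40 and 60 to the later phase is an arbitrary choice made here.

```python
def resample(signal, n_points=101):
    """Linearly interpolate a sequence of samples onto n_points evenly spaced points."""
    if len(signal) < 2:
        raise ValueError("need at least two samples")
    last = len(signal) - 1
    out = []
    for i in range(n_points):
        pos = i * last / (n_points - 1)  # fractional index into the raw signal
        lo = int(pos)
        hi = min(lo + 1, last)
        frac = pos - lo
        out.append(signal[lo] * (1 - frac) + signal[hi] * frac)
    return out


def phase(point):
    """Map a normalized time point (0 to 100) to its movement phase."""
    if not 0 <= point <= 100:
        raise ValueError("point must be between 0 and 100")
    if point < 40:
        return "Standing Up"
    if point < 60:
        return "Standing"
    return "Sitting Down"


raw = [0.0, 1.0, 2.0, 3.0]  # toy trace, not real GRF data
normalized = resample(raw)
print(len(normalized), normalized[50], phase(50))
```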

Once we are happy with the data that is visualized we can go ahead with the analysis.

PCA Analysis

Principal Component Analysis (PCA) is a dimensionality reduction technique that transforms high-dimensional data into a lower-dimensional space to reveal patterns and visualize variations within the data. In this project, we will show how to perform PCA to visualize the variations of kinetic features between intact leg and prosthetic leg, as well as across the three prosthetic brands. The PCA scores will then be used to perform K-means clustering.

The following PCA results are generated for further analysis.

For Objective 1

Groups: GRF_intact, GRF_prosthesis
PCA Name: GRF_all_together
Number of PCs: 4
Use Workspace Mean: Unchecked

Groups: Hip_Moment_prosthesis, Hip_Moment_intact
PCA Name: Hip_Moment_all_together
Number of PCs: 4
Use Workspace Mean: Unchecked

Groups: Knee_Moment_intact, Knee_Moment_prosthesis
PCA Name: Knee_Moment_all_together
Number of PCs: 4
Use Workspace Mean: Unchecked

For Objective 2

Groups: CLeg GRF_prosthesis, Plie GRF_prosthesis, Rheo GRF_prosthesis
PCA Name: GRF Prosthesis
Number of PCs: 7
Use Workspace Mean: Unchecked

Groups: CLeg Knee Moment prosthesis, Plie Knee Moment prosthesis, Rheo Knee Moment prosthesis
PCA Name: Knee Moment Prosthesis
Number of PCs: 6
Use Workspace Mean: Unchecked

Groups: CLeg Hip Moment prosthesis, Plie Hip Moment prosthesis, Rheo Hip Moment prosthesis
PCA Name: Hip Moment Prosthesis
Number of PCs: 7
Use Workspace Mean: Unchecked
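For readers who want to see what the PCA step computes, here is a minimal NumPy sketch on synthetic waveforms. It is not Sift's implementation: the function name `pca_scores` and the toy data are assumptions, and the code simply centers the traces, takes a singular value decomposition, and projects each trace onto the leading components. Each row stands for one trace, which corresponds to leaving Use Workspace Mean unchecked.

```python
import numpy as np


def pca_scores(waveforms, n_components):
    """PCA of time-normalized waveforms.

    waveforms: array of shape (n_traces, n_points), one row per trace.
    Returns (scores, components, explained_variance_ratio).
    """
    X = np.asarray(waveforms, dtype=float)
    centered = X - X.mean(axis=0)            # remove the mean waveform
    _, S, Vt = np.linalg.svd(centered, full_matrices=False)
    components = Vt[:n_components]           # principal component waveforms
    scores = centered @ components.T         # one score per trace per PC
    explained = (S ** 2) / np.sum(S ** 2)
    return scores, components, explained[:n_components]


# Synthetic stand-ins for two groups of traces (not real data).
rng = np.random.default_rng(0)
t = np.linspace(0.0, 1.0, 101)
group_a = np.sin(np.pi * t) + 0.05 * rng.standard_normal((10, 101))
group_b = 0.6 * np.sin(np.pi * t) + 0.05 * rng.standard_normal((10, 101))

scores, components, explained = pca_scores(np.vstack([group_a, group_b]), n_components=4)
print(scores.shape, explained.round(3))
```

The rows of `scores` are the inputs that a clustering step would operate on, one point per trace in PC space.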

K-means Analysis

K-means is an unsupervised clustering technique that groups similar data points together to form a “cluster” based on the Euclidean distances between points. In our project, we will use K-means clustering to address our two objectives:

Investigating whether K-means can find the clusters for “Intact” and “Prosthetic” based on the PC scores of kinetic data.
Investigating whether K-means would be able to cluster the three prosthetic brands based on GRF.
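As a rough illustration of the algorithm, the sketch below implements standard K-means (Lloyd's algorithm) with Euclidean distance in plain Python. It is not the implementation used by Sift. The function names and toy points are made up, and the deterministic farthest-first initialization is a choice made here so the example is reproducible; real tools often use random or k-means++ initialization instead.

```python
import math


def farthest_first(points, k):
    """Pick k starting centroids: the first point, then repeatedly the point
    farthest from all centroids chosen so far."""
    centroids = [list(points[0])]
    while len(centroids) < k:
        nxt = max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        centroids.append(list(nxt))
    return centroids


def kmeans(points, k, max_iter=100):
    """Lloyd's algorithm. Returns (labels, centroids)."""
    centroids = farthest_first(points, k)
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # an empty cluster keeps its previous centroid
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return labels, centroids


# Toy 2-D "PC scores" forming two well-separated groups (not real data).
points = [(0.0, 0.1), (0.2, -0.1), (-0.1, 0.0), (5.0, 5.1), (5.2, 4.9), (4.9, 5.0)]
labels, centroids = kmeans(points, k=2)
print(labels)
```

In this tutorial the points would be the PC scores of each trace, with 2 clusters for Objective 1 and 3 clusters for Objective 2.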

The following shows the settings for performing K-means in Sift.

Settings for Objective 1

Go to the Analyse page, select the PCA Results which you would like to perform K-means analysis on.
From the toolbar, click the Outlier Detection Using PCA icon and select K-means from the drop-down menu.
Use the following settings to perform K-means:

Settings for Objective 2

Go to the Analyse page, select the PCA Results which you would like to perform K-means analysis on.
From the toolbar, click the Outlier Detection Using PCA icon and select K-means from the drop-down menu.
Use the following settings to perform K-means. Note that the K-means settings are the same in all cases; we only need to change the “Number of Clusters” parameter accordingly.
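One simple way to quantify how well the clusters match the known groups (intact vs. prosthetic, or the three brands) is cluster purity: the fraction of traces that belong to the majority group of their cluster. The sketch below is a hypothetical helper, not a Sift output, and the labels in the example are invented. Purity tends to rise as the number of clusters grows, so it should only be compared between runs that use the same “Number of Clusters”.

```python
from collections import Counter


def cluster_purity(true_groups, cluster_labels):
    """Fraction of items that fall in the majority group of their cluster."""
    if not true_groups or len(true_groups) != len(cluster_labels):
        raise ValueError("inputs must be non-empty and of equal length")
    counts_by_cluster = {}
    for group, label in zip(true_groups, cluster_labels):
        counts_by_cluster.setdefault(label, Counter())[group] += 1
    majority_total = sum(
        counts.most_common(1)[0][1] for counts in counts_by_cluster.values()
    )
    return majority_total / len(true_groups)


# Invented example: 8 traces, one intact trace lands in the "prosthesis" cluster.
groups = ["intact"] * 4 + ["prosthesis"] * 4
clusters = [0, 0, 0, 1, 1, 1, 1, 1]
purity = cluster_purity(groups, clusters)
print(purity)
```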