sift:tutorials:run_k-means
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
sift:tutorials:run_k-means [2024/07/17 15:22] – sgranger | sift:tutorials:run_k-means [2024/11/05 14:58] (current) – wikisysop | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== Run K-Means ====== | ====== Run K-Means ====== | ||
- | The k-means clustering algorithm is a commonly used method for grouping //n// individual data points into //k// clusters. It is a multi-variate statistical analysis that reduces the high-dimensional matrix of correlated, time-varying signals into a low-dimensional and statistically uncorrelated set of principal components (PCs). These PCs explain the variance found in the original signals and represent the most important features of the data, e.g., the overall magnitude or the shape of the time series at a particular point in the stride cycle. The value of each particular subject’s score for the individual PCs represents how strongly that feature was present | + | The k-means clustering algorithm is a commonly used method for grouping //n// individual data points into //k// clusters. It does so in an unsupervised manner, iteratively selecting cluster centre points and assigning data points to a cluster. Within Sift, this is implemented onto the [[sift: |
==== The utility of clustering ==== | ==== The utility of clustering ==== | ||
- | When analysing biomechanical signals, we often realize that a number of individual traces are similar. It can be useful to describe these traces as belonging to the same group, or cluster. This potentially allows us to simplify our analysis or to pick a single trace as being " | + | When analysing biomechanical signals, we often realize that a number of individual traces are similar. It can be useful to describe these traces as belonging to the same group, or cluster. This potentially allows us to simplify our analysis or to pick a single trace as being " |
==== Tutorial Overview ==== | ==== Tutorial Overview ==== | ||
- | This tutorial works off the Principal Component Analysis Tutorial, and assumes a good understanding of using PCA in Sift. This tutorial uses overground walking data from roughly 100 subjects divided into two conditions, normal control and osteoarthritis (moderate to severe). This data set is included in the Demo folder of your Inspect3D | + | This tutorial works off the [[sift: |
==== Running a K-Means Test ==== | ==== Running a K-Means Test ==== | ||
Line 29: | Line 29: | ||
==== Viewing K-Means Results ==== | ==== Viewing K-Means Results ==== | ||
- | {{:DataDlg.png}} | + | {{:k-means-datadlg.png}} |
Once you have run your K-Means Test and taken a brief look at the cluster' | Once you have run your K-Means Test and taken a brief look at the cluster' | ||
Line 38: | Line 38: | ||
- Navigate to the **Analyse** page and select the **Workspace Scores** tab in your PCA results. | - Navigate to the **Analyse** page and select the **Workspace Scores** tab in your PCA results. | ||
- | Looking at the workspace tab we can select different points and the group and file will be displayed. This allows us to view which data points in a cluster belong to what group. We can clearly see the data points split into to clusters, blue and red, with somewhat of a separation. | + | Looking at the workspace tab we can select different points and the group and file will be displayed. This allows us to view which data points in a cluster belong to what group. We can clearly see the data points split into to clusters, blue and red, with somewhat of a separation. |
- | {{:WorkspaceScores.png}} | + | {{:k-means-workspacescores.png}} |
A K-means test finds the similarity between data points and groups them together into clusters. If you had two groups that were vastly different, the clusters would not have mixed groups. If the data points between groups have similarities the clusters may have data points from different groups. | A K-means test finds the similarity between data points and groups them together into clusters. If you had two groups that were vastly different, the clusters would not have mixed groups. If the data points between groups have similarities the clusters may have data points from different groups. |
sift/tutorials/run_k-means.1721229739.txt.gz · Last modified: 2024/07/17 15:22 by sgranger