I have this data (200 x and y values) in a file that I can read in and plot. I am following https://www.mathworks.com/help/stats/clustering-using-gaussian-mixture-models.html in order to try and determine the mean and covariance of the data. I have run this code. But, I am really lost as to how to use GMM and then determine how many clusters I may need for my data. And then after that, determine the mean and covariance of each of the clusters. It seems like it's in: https://www.mathworks.com/help/stats/clustering-using-gaussian-mixture-models.html. Can anyone offer guidance as to how I can use this code to get to my number of clusters, and the mean and covariance of the clusters?
The number of clusters is the only parameter you need to give tofitgmdistand mean and covariance matrix will be calculated based on the data that is being processed. The documentation you mentioned does slightly advance analysis around thecovariance matrix being full or diagonalized. If you need analysis on youfitting,then I think following the documentation step-by-step will help.
Otherwise,if you only want to do the clusteringthenthese two documentationswill be quick to understand.
For selecting the number ofclusters,you can plot the data to see how many discreteclustersvisually you candepictfrom the data. You can change your initial estimate by +-1 to see which one gives least log-likelihood.
You can also select a web site from the following list:
How to Get Best Site Performance
Select the China site (in Chinese or English) for best site performance. Other MathWorks country sites are not optimized for visits from your location.
0 Comments
Sign in to comment.