mclustpy

mclustpy is a Python function for clustering data using the Mclust algorithm from the R package mclust. The function takes a 2D numpy array of data and returns a dictionary containing various output values computed by the Mclust algorithm.

Installation

mclustpy requires the following dependencies:

numpy
rpy2

To install mclustpy, you can use pip:

pip install mclustpy

Usage

from mclustpy import mclustpy
import numpy as np

data = np.random.rand(1000, 10)
data.shape

res = mclustpy(data, G=9, modelNames='EEE', random_seed=2020)

The mclustpy function takes the following parameters:

data: a 2D numpy array of data to be clustered.
G: an integer specifying the maximum number of mixture components to be considered (default is 9).
modelNames: a string specifying the model types to be considered (default is 'EEE').
random_seed: an integer specifying the random seed for reproducibility (default is 2020).

The function returns a dictionary containing the following output values:

call: the function call used to run the Mclust algorithm.
data: the input data as an R matrix.
modelName: the model name(s) selected by the algorithm.
n: the number of observations in the data.
d: the number of variables in the data.
G: the number of mixture components selected by the algorithm.
BIC: the Bayesian Information Criterion (BIC) value for the selected model.
loglik: the log-likelihood of the selected model.
df: the number of degrees of freedom in the selected model.
bic: the BIC value for each model considered.
icl: the Integrated Completed Likelihood (ICL) value for each model considered.
hypvol: the hypervolume of the cluster tree for each model considered.
parameters: the estimated parameters for each component in the selected model.
z: the posterior probabilities of assignment to each component for each observation.
classification: the classification of each observation under the selected model.
uncertainty: a measure of uncertainty in the classification of each observation.

For more info take a look at the original mclust page

License Notice:

This package, mclustpy, is licensed under the MIT License. However, it depends on the R package mclust, which is licensed under the GNU General Public License (GPL ≥2). Users must ensure compliance with the GPL license when using mclustpy.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.github/workflows		.github/workflows
mclustpy		mclustpy
tests		tests
LICENSE		LICENSE
README.md		README.md
howToMclust.ipynb		howToMclust.ipynb
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

mclustpy

Installation

Usage

The mclustpy function takes the following parameters:

The function returns a dictionary containing the following output values:

License Notice:

About

Uh oh!

Releases 1

Packages

Uh oh!

Languages

License

KalinNonchev/mclustpy

Folders and files

Latest commit

History

Repository files navigation

mclustpy

Installation

Usage

The mclustpy function takes the following parameters:

The function returns a dictionary containing the following output values:

License Notice:

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Languages

Packages