Skip to content

collinleiber/3CPO

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

3CPO

Poisson Subspace Clustering: Focusing on the Essentials in Count Data

Install Requirements

To install the requirements do the following:

pip install numpy

pip install git+https://github.com/collinleiber/ClustPy.git

Datasets

The used datasets are available here:

Comparison Algorithms

The comparison algorithms Spherical-k-Means (SKM), PoissonL and PoissonC were implemented by us and can be found in the competitors.py file.

The co-clustering algorithms CROINFO, CoclustMod, CoclustSpecMod, ELBM, SELBM and TauCC are contained in the coclustering directory and were originally obtained here:

Experiments

You can test 3CPO manually by testing on some dataset.

from threecpo import ThreeCPO
from datasets import load_synth_data

X, L = load_synth_data() # Replace by any other dataset
n_clusters = len(np.unique(L))
threeCPO = ThreeCPO(n_clusters=n_clusters)
threeCPO.fit(X)

Our results and executions can be obtained by running the methods within the experiments.py file. The methods include the whole pipeline (loading datasets, running algorithms, evaluation). Examples:

from threecpo import experiment_table, experiment_text_data, experiment_ablations, experiment_initializations, experiment_robustness_amount_noise_columns, experiment_robustness_maximum_noise_value, experiment_runtime_rows, experiment_runtime_columns, experiment_estiamte_k, load_synth_data

experiment_table()
experiment_text_data()
experiment_ablations()
experiment_initializations()
experiment_robustness_amount_noise_columns()
experiment_robustness_maximum_noise_value()
experiment_runtime_rows()
experiment_runtime_columns()
X, L = load_synth_data(return_X_y=True)
experiment_estimate_k(X, L)

About

3CPO: Clustering and Column selection of Count Data using a Poisson-based Optimization

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages