adasegroup/pyDMHP

Sequence Clusterers

Framework of Methods for Clustering of Heterogeneous Event Sequences.

To train a model:

python3 run.py --config_name config_thp.yaml +task_type=train

To use a pretrained model for inference only:

python3 run.py --config_name config_ts.yaml +task_type=infer_only
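
When launching several runs from a script, the two commands above can be assembled programmatically. The sketch below is only an illustration: `build_run_command` and `KNOWN_TASK_TYPES` are hypothetical names that are not part of the repository, and the only task types assumed are the two shown in this README.

```python
import shlex
from typing import List

# Task types that appear in this README; run.py may accept others.
KNOWN_TASK_TYPES = ("train", "infer_only")


def build_run_command(config_name: str, task_type: str) -> List[str]:
    """Build the argument list for run.py (hypothetical helper)."""
    if task_type not in KNOWN_TASK_TYPES:
        raise ValueError(f"unknown task_type: {task_type!r}")
    return [
        "python3",
        "run.py",
        "--config_name",
        config_name,
        f"+task_type={task_type}",
    ]


if __name__ == "__main__":
    # Print the training command in a form that can be pasted into a shell.
    print(shlex.join(build_run_command("config_thp.yaml", "train")))
```

The returned list can be passed directly to `subprocess.run`, which avoids shell quoting issues.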

To reproduce the experiments, edit the following config files:

- the "data_name" parameter in ~/configs/config_name.yaml, to choose the corresponding dataset;
- the hyperparameters of the chosen method in ~/configs/model/name_of_method.yaml;
- the training parameters (device, number of epochs, etc.) in ~/configs/trainer/default.yaml;
- the data preprocessing hyperparameters (max sequence length, batch size, etc.) in ~/configs/datamodule/name_of_datamodule.yaml.
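
Switching datasets across many runs can be scripted instead of edited by hand. The following is a minimal sketch under stated assumptions: it assumes "data_name" is a top-level `data_name: value` line in the YAML file, the helper `set_data_name` is hypothetical, and the example config text and the value `ATM` are illustrative only (the exact values the repository expects are not given here).

```python
import re

# Matches a top-level `data_name:` line (assumed layout of the config file).
_DATA_NAME_LINE = re.compile(r"^data_name:.*$", flags=re.MULTILINE)


def set_data_name(config_text: str, data_name: str) -> str:
    """Return config_text with the top-level data_name value replaced."""
    # A function replacement avoids backslash-escape handling in data_name.
    new_text, count = _DATA_NAME_LINE.subn(
        lambda _match: f"data_name: {data_name}", config_text
    )
    if count != 1:
        raise ValueError(f"expected exactly one data_name key, found {count}")
    return new_text


if __name__ == "__main__":
    # Illustrative config contents, not copied from the repository.
    example = "data_name: Age\nother_key: 1\n"
    print(set_data_name(example, "ATM"))
```

Working on the text rather than a parsed YAML object keeps comments and key order in the file unchanged.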

Datasets: LinkedIn, Age, ATM, IPTV, and synthetic Hawkes process realizations.

The datasets are taken from a cloud drive.

Overview of the project: The project implements sequence clustering methods behind a common API, with a code structure that follows formatting best practices. It focuses on a proprietary method (Cohortney) and several baselines: Convolutional Autoencoder, Transformer Hawkes Process, and the TsFresh and TsLearn feature extractors. The methods are refactored to fit the PyTorch Lightning framework.
