Predictive-Process-Monitoring

Important note: The number of epochs have been set for both inter-case and intra-case to 200. However in practice, convergence, i.e. when the validation loss stops decreasing, would happen much faster than that (usually within the first 10 epochs). So once you notice that validation loss is no longer decreasing, you can kill the process. The weights for the execution when the validation loss is minimum are saved.

Intra-case

Download the preprocessed logs from here.
Navigate to the Intra-case folder.

Training

Make the following changes in train.py for each log:

Set the paths to the train, validation and test splits at lines 45-52.
Set the value of maxPrefixLength variable located above where the train_model function is called to the corresponding value from MaxPrefixLength.md.
Set the path to output the weights where the train_model function is called.

Run python3 train.py.

Testing

Make the following changes in evaluate.py for each log:

Set the paths to the train, validation and test splits at lines 47-54.
Set the value of maxPrefixLength variable located above where the load_state_dict function is called to the corresponding value from MaxPrefixLength.md.
Set the path to read the weights where the load_state_dict function is called.

Run python3 evaluate.py

Inter-case

Download the datasets from here.
Navigate to the Inter-case folder.

Training

Navigate to the Train folder and make the following changes in the file corresponding to the dataset for each sub-dataset:

Set the path to the dataset at line 12.
For the BPIC 2015 and Hospital Billing datasets, the variable dataset_name needs to be set depending on which sub-dataset is being used since the preprocessing for the sub-datasets is slightly different.
Set the value of maxPrefixLength variable located above where the train_model function is called to the corresponding value from MaxPrefixLength.md. Note that for the datasets that have only one sub-dataset this value may still not be set to the correct value so please check it.
Set the path to output the weights where the train_model function is called.

Run python3 {filename.py} where filename.py is the name of the file.

Testing

Navigate to the Test folder and make the following changes in the file corresponding to the dataset for each sub-dataset:

Set the path to the dataset at line 13.
For the BPIC 2015 and Hospital Billing datasets, the variable dataset_name needs to be set depending on which sub-dataset is being used since the preprocessing for the sub-datasets is slightly different.
Set the value of maxPrefixLength variable located above where the load_state_dict function is called to the corresponding value from MaxPrefixLength.md. Note that for the datasets that have only one sub-dataset this value may still not be set to the correct value so please check it.
Set the path to read the weights where the load_state_dict function is called.

Run python3 {filename.py} where filename.py is the name of the file.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
Inter-case		Inter-case
Intra-case		Intra-case
MaxPrefixLength.md		MaxPrefixLength.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predictive-Process-Monitoring

Intra-case

Training

Testing

Inter-case

Training

Testing

About

Uh oh!

Releases

Packages

Languages

JaiDoshi/Predictive-Process-Monitoring

Folders and files

Latest commit

History

Repository files navigation

Predictive-Process-Monitoring

Intra-case

Training

Testing

Inter-case

Training

Testing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages