1.0.0 SC2DatasetPreparator Release
SC2DatasetPreparator
This repository contains tools which can be used in order to perform the following steps:
- Using
src/directory_flattener.py
Flatten the directory structure and save the old directory tree to a mapping of{"replayUniqueHash": "whereItWasInOldStructure"}
- Using
src/sc2_replaypack_processor
Perform replaypack processing with https://github.com/Kaszanas/SC2InfoExtractorGo
Customization
In order to specify different processing flags for https://github.com/Kaszanas/SC2InfoExtractorGo please modify the src/sc2_replaypack_processor
file directly
Usage
Before using this software please install Python >= 3.7 and requirements.txt
.
Please keep in mind that src/directory_flattener.py
does not contain default flag values and can be customized with the following command line flags:
usage: directory_flattener.py [-h] [--input_path INPUT_PATH]
[--file_extension FILE_EXTENSION]
Directory restructuring tool used in order to flatten the structure, map the
old structure to a separate file, and for later processing with other tools.
optional arguments:
-h, --help show this help message and exit
--input_path INPUT_PATH
Please provide input path to the dataset that is going
to be processed.
--file_extension FILE_EXTENSION
Please provide a file extension for files that will be
moved and renamed.
Please keep in mind that the src/sc2_replaypack_processor.py
does not contain default flag values and can be customized with the following command line flags:
Tool used for processing SC2 datasets. with
https://github.com/Kaszanas/SC2InfoExtractorGo
optional arguments:
-h, --help show this help message and exit
--input_dir INPUT_DIR
Please provide input path to the directory containing
the dataset that is going to be processed.
--output_dir OUTPUT_DIR
Please provide an output directory for the resulting
files.
--number_of_processes NUMBER_OF_PROCESSES
Please provide the number of processes to be spawn for
the dataset processing.