This Repository contains the scripts and instructions about preparing the Pedestrian Situated Intent (PSI) 1.0 & 2.0 dataset.
- PSI 1.0: video_0001 ~ video_0110
- PSI 2.0: video_0001 ~ video_0204
NOTE: You may only need to use PSI 2.0 dataset for the [IEEE ITSS PSI Competition]. But the PSI 1.0 dataset is also provided, and feel free to use it if you would explore additional knowledge beyond the PSI 2.0.
Step 1. Download the PSI 2.0 Dataset videos from [Google Drive] [PSI Homepage]. Move *.zip files to the dataset ROOT_PATH, and unzip them by
cd ROOT_PATH # e.g., root/Dataset
unzip '*.zip' -d .
rm *.zip
The extracted folder contains all videos (Train/Val):
- ROOT_PATH/PSI_Videos/videos.
Step 2 Download PSI 2.0 data annotations from [Google Drive] [PSI Homepage]. Move downloaded *.zip files to the dataset ROOT_PATH.
unzip '*.zip' -d .
rm *.zip
The extracted folder contains all annotations of the PSI 2.0 Dataset (Train/Val)
- ROOT_PATH/PSI2.0_TrainVal/annotations/cognitive_annotation_key_frame
- ROOT_PATH/PSI2.0_TrainVal/annotations/cv_annotation
and the train/val/test splits:
- ROOT_PATH/PSI2.0_TrainVal/splits/PSI2_split.json.
Step 3 Split the videos into frames by
python split_clips_to_frames.py *ROOT_PATH*
and the output frames are saved as:
- ROOT_PATH/frames.
TASK 1 - Pedestrian Intent: The frame at which one annotator explicitly make a cross intent annotation is treated as the "key-frame". Every annotator would give one "intent" annotation about the crossing intent estimation of the target pedestrian, together with one "reansoning/explanation" of the estimation. For these two annotation:
- Crossing Intent: We extend the crossing intent annotation to the frames following the current key-frame, until the next frame that one of the annotators make another estimation (no matter if the two annotations are the same or not).
- Reasoning/Explanation: We extend the reasoning/description of the intent estimation to the frames prior to the current key-frame, until the last key-frame that one of the annotators made another estimation, assuming the description is about the scenes observed by the annotators to support the intent estimation.
Already-crossed: When a pedestrian has already crossed in front of the moving vehicle, we categorize the status after the target pedestrian crosses the middle line of the ego-view as "Already-crossed." In this scenario, there is no necessity to predict any further crossing intent, as the target pedestrian has already safely crossed the road.
python extend_intent_annotation.py *ROOT_PATH*
and the output frames are saved as:
- ROOT_PATH/PSI2.0_TrainVal/annotations/cognitive_annotation_extended.
TASK 2 - Pedestrian Trajectory: Pedestrian trajectory prediction task uses only the visual annotations (bounding boxes) of the target pedestrian, there is no need to exntend the cognitive annotations to all frames.
TASK 3 - Driving Decision: The frame at which one of the annotators explicitly make a driving decision is treated as the "key-frame". Every annotator would give one "decision" at the key-frame, and provide one "reansoning/description" of the decision made. For these two cognitive annotations:
- Driving Decision: We extend the driving decision annotation to the frames following the current key-frame, until the next frame that one of the annotators make another driving decision (e.g., turn or go straight).
- Reasoning/Description: We extend the reasoning/description to the frames prior to the current key-frame, until the last key-frame that one of the annotators made another estimation, assuming the description is about the scenes observed by the annotators to support the driving decision.
python extend_driving_decision_annotation.py *ROOT_PATH*
and the output frames are saved as:
- ROOT_PATH/PSI2.0_TrainVal/annotations/cognitive_annotation_extended.
We provide baselines for all tracks of challenges as hints about using the PSI 2.0 dataset for a quick start.
Track 1 (Pedestrian Intent Prediction (PIP))
Track 2 (Pedestrian Trajectory Prediction (PTP))
Track 3 (Driver Decision Prediction (DDP))
If you would like to use the PSI 1.0 dataset annotations, download PSI 1.0 data annotations from [Google Drive] [PSI Homepage]. Move downloaded *.zip files to the dataset ROOT_PATH.
unzip '*.zip' -d .
rm *.zip
The extracted PSI 1.0 dataset follows the same format as PSI 2.0, so feel free to use the Cognitive Annotation Extention scripts and Baselines prepared for PSI 2.0 to explore the PSI 1.0 knowledge.
(Note: PSI 2.0 and PSI 1.0 share the first 110 videos, but have different annotations. Please check the [paper] and our future udpates for more information. )