This repository contains the official PyTorch implementation of the Position-aware Location Regression Network (PLRN) for temporal video grounding, presented in the paper "Position-aware Location Regression Network for Temporal Video Grounding" (AVSS 2021).
The overall architecture of the proposed network (PLRN) is shown above. To capture comprehensive contexts with only one semantic phrase, PLRN exploits position-aware features of the query and the video. Specifically, PLRN first encodes both the video and the query using the positional information of words and video segments. A semantic phrase feature is then extracted from the encoded query via attention. The semantic phrase feature and the encoded video are merged into a context-aware feature that reflects both local and global contexts. Finally, PLRN predicts the start, end, center, and width values of the grounding boundary.
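For a concrete picture of this pipeline, the following is a minimal PyTorch sketch of the steps described above (positional encoding, attention-based semantic phrase extraction, fusion, local/global context, and boundary regression). All module names, dimensions, and layer choices here are illustrative assumptions, not the actual PLRN implementation in this repository.

```python
import torch
import torch.nn as nn

class PLRNSketch(nn.Module):
    """Hypothetical sketch of the pipeline described above; not the authors' code."""

    def __init__(self, dim=512, max_len=128):
        super().__init__()
        self.video_pos = nn.Embedding(max_len, dim)   # positional info of video segments
        self.query_pos = nn.Embedding(max_len, dim)   # positional info of words
        self.phrase_attn = nn.Linear(dim, 1)          # attention weights over query words
        self.fuse = nn.Linear(2 * dim, dim)           # merge phrase feature with video
        self.context = nn.Conv1d(dim, dim, kernel_size=3, padding=1)  # local context
        self.head = nn.Linear(dim, 4)                 # (start, end, center, width)

    def forward(self, video_feat, query_feat):
        # video_feat: (B, T, dim) segment features, query_feat: (B, L, dim) word features
        B, T, _ = video_feat.shape
        L = query_feat.size(1)
        video_feat = video_feat + self.video_pos(torch.arange(T, device=video_feat.device))
        query_feat = query_feat + self.query_pos(torch.arange(L, device=query_feat.device))

        # semantic phrase feature: attention-weighted sum over query words
        attn = torch.softmax(self.phrase_attn(query_feat), dim=1)        # (B, L, 1)
        phrase = (attn * query_feat).sum(dim=1, keepdim=True)            # (B, 1, dim)

        # merge the phrase feature with every video segment
        fused = self.fuse(torch.cat([video_feat, phrase.expand(-1, T, -1)], dim=-1))

        # context-aware feature: local (convolution) plus global (mean) context
        local = self.context(fused.transpose(1, 2)).transpose(1, 2)
        ctx = local + fused.mean(dim=1, keepdim=True)

        # regress normalized boundary values from the pooled context feature
        out = torch.sigmoid(self.head(ctx.mean(dim=1)))                  # (B, 4) in [0, 1]
        start, end, center, width = out.unbind(dim=-1)
        return start, end, center, width


if __name__ == "__main__":
    model = PLRNSketch()
    s, e, c, w = model(torch.randn(2, 64, 512), torch.randn(2, 10, 512))
    print(s.shape, e.shape, c.shape, w.shape)  # torch.Size([2]) each
```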
- Ubuntu 16.04
- Anaconda 3
- Python 3.6
- CUDA 10.1
- cuDNN 7.6.5
- PyTorch 1.1.0
All data, including annotations, video features (I3D for Charades-STA, C3D for ActivityNet Captions), and pre-processed annotation information, can be downloaded from here.
conda activate plrn
cd PLRN
bash scripts/train_model.sh PLRN plrn charades 0 4 0
conda activate plrn
cd PLRN
bash scripts/eval_model.sh PLRN plrn charades 0
Local-Global Video-Text Interactions for Temporal Grounding was very helpful for our implementation.
If you find our implementation useful, please cite our paper:
@inproceedings{kim2021position,
title={Position-aware Location Regression Network for Temporal Video Grounding},
author={Kim, Sunoh and Yun, Kimin and Choi, Jin Young},
booktitle={2021 17th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)},
year={2021}
}