DRCaptioning

The instruction of the object-centric caption generation.

Preliminaries

Our model is implemented in Torch, and depends on the following packages:

After installing torch, you can install / update these dependencies by running the following:

luarocks install torch
luarocks install nn
luarocks install image
luarocks install lua-cjson
luarocks install https://raw.githubusercontent.com/qassemoquab/stnbhwd/master/stnbhwd-scm-1.rockspec
luarocks install https://raw.githubusercontent.com/jcjohnson/torch-rnn/master/torch-rnn-scm-1.rockspec

Test

To run the model on new images, use the script run_model.lua. To run the model on a test image, use the following command:

th run_model.lua -input_image /path/to/my/image/file -output_vis_dir /path/to/the/output/folder

If you have an entire directory of images on which you want to run the model, use the -input_dir flag instead:

th run_model.lua -input_dir /path/to/my/image/folder -output_vis_dir /path/to/the/output/folder

Results

The resulting output file format is as follows:

[
	{
		"boxes": [
			[9.4456, 46.8276,569.0354, 368.3203],
			[183.6740, 77.7138, 185.4196, 332.1285],
			[403.1037, 77.593994, 323.3377, 334.4553],
			...
			]
		"captions": [
			  'the man wearing black shirt',
			  'the man has head',
			  'the man wearing a white shirt',
			  ...
			  ]

	}
...
]

Acknowledgements

This work was supported by Institute for Information & communications Technology Promotion(IITP) grant funded by the Korea government(MSIT) (2017-0-01780, The technology development for event recognition/relational reasoning and learning knowledge based system for video understanding)

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
densecap		densecap
eval		eval
info		info
license_report		license_report
test		test
Friends_output.json		Friends_output.json
LICENSE		LICENSE
README.md		README.md
add_caption_to_relationship.py		add_caption_to_relationship.py
add_long_caption_to_relationship.py		add_long_caption_to_relationship.py
caption_deviation.py		caption_deviation.py
caption_statistics.py		caption_statistics.py
debugger.lua		debugger.lua
eval.sh		eval.sh
evaluate_caption.lua		evaluate_caption.lua
evaluate_model.lua		evaluate_model.lua
evaluate_model_R.lua		evaluate_model_R.lua
model_name.txt		model_name.txt
models.lua		models.lua
preprocess.py		preprocess.py
preprocess2.py		preprocess2.py
preprocess_VRD.py		preprocess_VRD.py
preprocess_VRD2.py		preprocess_VRD2.py
preprocess_VRD_union.py		preprocess_VRD_union.py
preprocess_densecap.py		preprocess_densecap.py
preprocess_union.py		preprocess_union.py
run_model.lua		run_model.lua
run_retrieval.lua		run_retrieval.lua
test_code.py		test_code.py
train.lua		train.lua
train_R.lua		train_R.lua
train_opts.lua		train_opts.lua
visualize_results.py		visualize_results.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DRCaptioning

Preliminaries

Test

Results

Acknowledgements

About

Releases

Packages

Languages

License

Dong-JinKim/DRCaptioning

Folders and files

Latest commit

History

Repository files navigation

DRCaptioning

Preliminaries

Test

Results

Acknowledgements

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages