Consistent HOST and HIP/pinned buffers for respective API #628

r-abishek · 2025-10-08T06:40:59Z

RPP was originally also responsible for host to hip buffer conversions. This was removed during the course of tensor implementations to ensure all RPP HOST API only have HOST buffers, and GPU API only have HIP buffers (or pinned memory for smaller argument buffers).

The following functionality were still using the old style host->hip memcopy within RPP, and this is now being removed. After this, RPP tensor API will no longer be responsible for any HOST -> HIP buffer copy. The user is responsible to provide HOST buffers for HOST API, and HIP/Pinned memory for GPU API.

copy_param_float(), copy_param_uint() etc perform these copies and are now eliminated.
Just like all other tensor functionalities, pinned memory allocation from test suite is used for samller argument buffers.

These are the changed functionalities:
exposure
blend
brightness
color cast
color twist
constrast
crop mirror normalize
gamma_correction
gaussian_filter
noise
non_linear_blend
resize_mirror_normalize
water

@rrawther Please note equivalent changes in MIVisionX would need to be merged together with this PR.
A patch version change has been done for this tentatively from 2.2.0 to 2.2.1

…rmalize

…normalize

… rcm, color temperature

Mem copy elimination

…y_rm

…memcpy_removal

Mem copy elimination version change and Review comments resolved

LakshmiKumar23 · 2025-10-21T22:06:36Z

@r-abishek please resolve merge conflicts

Update patch version from 2.2.0 to 2.2.1

Copilot

Pull Request Overview

This PR removes internal host-to-HIP buffer copy functionality from RPP to ensure consistent memory management. GPU APIs now require users to provide HIP/pinned memory buffers directly, eliminating the copy_param_float(), copy_param_uint(), and similar helper functions that previously performed host-to-device copies within RPP.

Key changes include:

Memory allocation updated from stack arrays to hipHostMalloc in test suite
API function signatures updated to pass tensor pointers directly to HIP kernels
Memory management responsibility shifted entirely to the user

Reviewed Changes

Copilot reviewed 28 out of 28 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
utilities/test_suite/HIP/Tensor_image_hip.cpp	Updated to use hipHostMalloc for parameter buffers instead of stack arrays; added cleanup code
src/modules/tensor/rppt_tensor_geometric_augmentations.cpp	Removed copy_param calls; parameters now passed directly to kernels
src/modules/tensor/rppt_tensor_filter_augmentations.cpp	Removed copy_param calls for gaussian_filter
src/modules/tensor/rppt_tensor_effects_augmentations.cpp	Removed copy_param calls; added hipHostMalloc for spatter mask arrays
src/modules/tensor/rppt_tensor_color_augmentations.cpp	Removed copy_param calls for all color augmentations
src/modules/tensor/hip/kernel/*.cpp	Updated function signatures to accept tensor pointers directly
src/include/tensor/hip_tensor_executors.hpp	Updated function declarations with new parameters
CMakeLists.txt	Version bump from 2.2.0 to 2.2.1; trailing whitespace cleanup
CHANGELOG.md	Added entry for memory copy elimination

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

utilities/test_suite/HIP/Tensor_image_hip.cpp

CHANGELOG.md

Address copilot comments for HIP HOST consistent allocation

LakshmiKumar23 · 2025-11-25T22:54:44Z

@r-abishek please check and resolve conflicts

LakshmiKumar23 · 2025-11-26T18:42:40Z

@Srihari-mcw @HazarathKumarM please add the doc changes as we discussed offline

HazarathKumarM and others added 23 commits September 17, 2025 08:29

Removed memcpy and used hipHostMalloc for allocation : blend

358b187

Removed memcpy and used hipHostMalloc for allocation : brightness

59376a0

Removed memcpy and used hipHostMalloc for allocation : color cast

d8c6b15

Removed memcpy and used hipHostMalloc for allocation : color twist

ca42f58

Removed memcpy and used hipHostMalloc for allocation : contrast

6969414

Removed memcpy and used hipHostMalloc for allocation : crop mirror no…

29d776b

…rmalize

Removed memcpy and used hipHostMalloc for allocation : Exposure

cca850b

Removed memcpy and used hipHostMalloc for allocation : Gamma correction

fbc525f

Removed memcpy and used hipHostMalloc for allocation : gaussian filter

78405c2

Removed memcpy and used hipHostMalloc for allocation : Noise

9e683bb

Removed memcpy and used hipHostMalloc for allocation : Non linear blend

7d9aaef

Removed memcpy and used hipHostMalloc for allocation : Resize mirror …

5a34ce3

…normalize

Removed memcpy and used hipHostMalloc for allocation : Water

fff9abe

Added hipHostFree for all kernels in test suite

c56182a

Merge branch 'apr/mem_cpy_rm' into apr/mem_cpy_rm_set2

859ce40

Added hipHostFree for all kernels in test suite

8bf07fa

Removed memcpy and used hipHostMalloc for allocation : Flip, spatter,…

82d36fb

… rcm, color temperature

Merge remote-tracking branch 'origin' into apr/mem_cpy_rm

96b828c

Merge remote-tracking branch 'origin/develop' into apr/mem_cpy_rm

5a21572

Resolved copilot review comments

33d8876

Updated version

f61fdf9

Removed unused parameter

b68ee69

Merge pull request #496 from RooseweltMcW/apr/mem_cpy_rm

e5d2750

Mem copy elimination

r-abishek requested a review from a team as a code owner October 8, 2025 06:41

r-abishek added the ci:precheckin label Oct 8, 2025

r-abishek changed the title ~~Ar/device memcpy removal~~ Consistent HOST and HIP/pinned buffers for respective API Oct 8, 2025

Updated version in cmakeList

abce1db

kiritigowda self-assigned this Oct 9, 2025

kiritigowda requested a review from rrawther October 9, 2025 17:56

Merge branch 'develop' into ar/device_memcpy_removal

386bed1

HazarathKumarM added 5 commits October 16, 2025 01:33

removed the host to device mem copies for warp affine and rotate

07fe8b7

Merge branch 'develop' of https://github.com/ROCm/rpp into apr/mem_cp…

e962447

…y_rm

Updated version

1beca06

Removed comment

b56e04d

Updated Chnagelog file

3a98500

SundarRajan28 mentioned this pull request Oct 16, 2025

Update memory allocation for HIP augmentation parameters ROCm/MIVisionX#1570

Open

1 task

r-abishek and others added 3 commits October 16, 2025 14:03

Merge branch 'develop' of https://github.com/ROCm/rpp into ar/device_…

891b0a4

…memcpy_removal

Merge branch 'ar/device_memcpy_removal' into apr/mem_cpy_rm

5f4ea95

Merge pull request #504 from RooseweltMcW/apr/mem_cpy_rm

90b7e94

Mem copy elimination version change and Review comments resolved

r-abishek and others added 9 commits October 21, 2025 17:51

Merge branch 'develop' into ar/device_memcpy_removal

1ed2c09

Merge branch 'develop' into ar/device_memcpy_removal

ffc6678

Merge branch 'develop' into ar/device_memcpy_removal

fcc5958

Merge branch 'develop' into ar/device_memcpy_removal

1b321b6

Update patch version from 2.2.0 to 2.2.1

3f6ea43

Update CHANGELOG

12886aa

Merge pull request #524 from Srihari-mcw/memcpy_version_change

a7a1150

Update patch version from 2.2.0 to 2.2.1

Merge branch 'develop' into ar/device_memcpy_removal

2bee3c4

Merge branch 'develop' into ar/device_memcpy_removal

26ace11

kiritigowda requested a review from Copilot November 17, 2025 18:05

Copilot started reviewing on behalf of kiritigowda November 17, 2025 18:06 View session

Merge branch 'develop' into ar/device_memcpy_removal

b517c40

Copilot finished reviewing on behalf of kiritigowda November 17, 2025 18:07

Copilot AI reviewed Nov 17, 2025

View reviewed changes

utilities/test_suite/HIP/Tensor_image_hip.cpp Outdated Show resolved Hide resolved

utilities/test_suite/HIP/Tensor_image_hip.cpp Outdated Show resolved Hide resolved

CHANGELOG.md Show resolved Hide resolved

Srihari-mcw and others added 2 commits November 18, 2025 07:47

Address copilot comments for HIP HOST consistent allocation

334ef28

Merge pull request #529 from Srihari-mcw/copilot_comments_hip_host_alloc

53d5ebd

Address copilot comments for HIP HOST consistent allocation

rrawther requested a review from AryanSalmanpour November 20, 2025 03:03

Merge branch 'develop' into ar/device_memcpy_removal

8afab8e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Consistent HOST and HIP/pinned buffers for respective API #628

Consistent HOST and HIP/pinned buffers for respective API #628

r-abishek commented Oct 8, 2025 •

edited

Loading

Uh oh!

LakshmiKumar23 commented Oct 21, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

LakshmiKumar23 commented Nov 25, 2025

Uh oh!

LakshmiKumar23 commented Nov 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Consistent HOST and HIP/pinned buffers for respective API #628

Are you sure you want to change the base?

Consistent HOST and HIP/pinned buffers for respective API #628

Conversation

r-abishek commented Oct 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

LakshmiKumar23 commented Oct 21, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

LakshmiKumar23 commented Nov 25, 2025

Uh oh!

LakshmiKumar23 commented Nov 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

r-abishek commented Oct 8, 2025 •

edited

Loading