Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Last Batch Policy changes for file source reader #182

Merged
merged 613 commits into from
Sep 11, 2024
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
613 commits
Select commit Hold shift + click to select a range
44a4d31
Remove the parse_config
swetha097 Apr 8, 2024
876c9ad
Merge remote-tracking branch 'swe_fork/audio_pr4' into swbs/audio/pr5
swetha097 Apr 8, 2024
9445e6c
Adding missed param in python unit tests
SundarRajan28 Apr 8, 2024
12b4801
Merge branch 'swbs/audio/pr5' into swbs/audio/pr6
SundarRajan28 Apr 8, 2024
69d2448
Merge branch 'swbs/audio/pr6' into swbs/audio/pr8
SundarRajan28 Apr 8, 2024
39b5f00
Merge branch 'swbs/audio/pr8' into swbs/audio/pr9
SundarRajan28 Apr 8, 2024
006d23f
Merge branch 'swbs/audio/pr9' into swbs/audio/pr7
SundarRajan28 Apr 8, 2024
a35f3a8
Merge branch 'swbs/audio/pr7' into swbs/audio/pr10
SundarRajan28 Apr 8, 2024
2b2a651
Fix error - Too many open files
swetha097 Apr 9, 2024
4caf56c
Merge branch 'swbs/audio/pr10' of https://github.com/swetha097/rocAL …
swetha097 Apr 9, 2024
6408a68
Fix slice fill values copy issue
fiona-gladwin Apr 9, 2024
e39b59b
Merge branch 'swbs/audio/pr10' of https://github.com/swetha097/rocAL …
fiona-gladwin Apr 9, 2024
5302f9e
Fix file list path in API dataloader
fiona-gladwin Apr 10, 2024
e496f3a
Revert "Add Glob to CMakeLists.txt"
fiona-gladwin Apr 10, 2024
5df0055
Merge branch 'develop' of https://github.com/ROCm/rocAL into swbs/aud…
fiona-gladwin Apr 10, 2024
7dc7092
Fix include headers for Audio files
fiona-gladwin Apr 10, 2024
19e30cf
Fix copy data 2D
fiona-gladwin Apr 10, 2024
34deb3b
Merge branch 'swbs/audio/pr1' into swbs/audio/pr2
fiona-gladwin Apr 10, 2024
4c02dfb
Minor changes
fiona-gladwin Apr 11, 2024
e3f350f
Pass decoded data info to load routine instead of separate vectors
fiona-gladwin Apr 11, 2024
67cda83
Update CHANGELOG.md
fiona-gladwin Apr 11, 2024
d36df07
Merge branch 'swbs/audio/pr1' into swbs/audio/pr2
fiona-gladwin Apr 11, 2024
42c844d
Update CHANGELOG.md
fiona-gladwin Apr 11, 2024
8b1c59f
Change swap_handle_time variable name in loader
fiona-gladwin Apr 11, 2024
241ce67
Merge remote-tracking branch 'swe_fork/swbs/audio/pr2' into swbs/audi…
swetha097 Apr 11, 2024
07ba1f6
Update the changelog.md
swetha097 Apr 11, 2024
83513fb
Update ChangeLog.md
swetha097 Apr 11, 2024
31959c2
Merge branch 'swbs/audio/pr5' of https://github.com/swetha097/rocAL i…
swetha097 Apr 11, 2024
6fd3621
Update ChangeLog.md
swetha097 Apr 11, 2024
abc63c9
Merge branch 'swbs/audio/pr3' into audio_pr4
SundarRajan28 Apr 11, 2024
bb8908b
Update CHANGELOG.md
SundarRajan28 Apr 11, 2024
91fed39
Formatting changes
fiona-gladwin Apr 11, 2024
ee3606b
Merge branch 'audio_pr4' into swbs/audio/pr5
SundarRajan28 Apr 11, 2024
6a80714
Update doxygen comments
fiona-gladwin Apr 11, 2024
a19086b
Merge branch 'swbs/audio/pr5' into swbs/audio/pr6
SundarRajan28 Apr 11, 2024
689985d
Move file source reader from readers/image to readers folder
fiona-gladwin Apr 11, 2024
db758fd
Merge branch 'swbs/audio/pr1' into swbs/audio/pr2
fiona-gladwin Apr 11, 2024
6bf44de
Merge branch 'swbs/audio/pr6' into swbs/audio/pr8
SundarRajan28 Apr 11, 2024
3d0f485
Merge branch 'swbs/audio/pr8' into swbs/audio/pr9
SundarRajan28 Apr 11, 2024
edf81ad
Merge branch 'swbs/audio/pr9' into swbs/audio/pr7
SundarRajan28 Apr 11, 2024
67190bf
Update README and add doxygen description
fiona-gladwin Apr 11, 2024
ffdcb0a
Update CMakeLists and README for audio test
fiona-gladwin Apr 11, 2024
b2de5f4
Merge branch 'swbs/audio/pr1' into swbs/audio/pr2
fiona-gladwin Apr 11, 2024
d000af0
Update README for audio test
fiona-gladwin Apr 11, 2024
7415447
Minor fix
fiona-gladwin Apr 12, 2024
f6bffef
Merge branch 'develop' of https://github.com/ROCm/rocAL into swbs/aud…
fiona-gladwin Apr 12, 2024
cb034b0
Merge branch 'swbs/audio/pr1' into swbs/audio/pr2
fiona-gladwin Apr 12, 2024
6089040
Merge branch 'swbs/audio/pr7' of https://github.com/swetha097/rocAL i…
fiona-gladwin Apr 12, 2024
568ee7e
Fix build errors
fiona-gladwin Apr 12, 2024
2e38233
Fix Copy_Data_2d_ROI
swetha097 Apr 12, 2024
0e51f24
Merge branch 'swbs/audio/pr10' of https://github.com/swetha097/rocAL …
swetha097 Apr 12, 2024
d8031b5
Merge remote-tracking branch 'swe_fork/swbs/audio/pr2' into swbs/audi…
swetha097 Apr 12, 2024
d894aba
Fix merge from PR 2
swetha097 Apr 12, 2024
689c55f
Minor changes shard_count argument name
fiona-gladwin Apr 12, 2024
1079d50
Rename set and get functions of data_info to decoded_data_info
fiona-gladwin Apr 12, 2024
1f63cab
Merge branch 'swbs/audio/pr1' into swbs/audio/pr2
fiona-gladwin Apr 12, 2024
ff75ed9
Fix shard_size and audio source evaluation
Apr 15, 2024
94f6754
Changes in file_source_reader - to minimize the I/O operations
Apr 15, 2024
c082e9d
Changes in the variable name
Apr 15, 2024
5db5535
Changes in the variable names of the audio source evalution
Apr 15, 2024
2967b68
Merge branch 'swbs/audio/pr3' into audio_pr4
SundarRajan28 Apr 16, 2024
27429b3
Use set instead of vector
swetha097 Apr 16, 2024
d45967e
Minor bug fixes
swetha097 Apr 16, 2024
18baa2a
Minor fixes
fiona-gladwin Apr 16, 2024
dd2a7df
Fix drop policy without padding
fiona-gladwin Apr 16, 2024
a8f81d9
Fix pytorch iterator - PARTIAL policy
fiona-gladwin Apr 16, 2024
9dab620
Merge branch 'swbs/audio/pr10' of https://github.com/swetha097/rocAL …
fiona-gladwin Apr 16, 2024
36a9516
Merge branch 'audio_pr4' into swbs/audio/pr5
SundarRajan28 Apr 17, 2024
fb7a52b
Merge branch 'swbs/audio/pr5' into swbs/audio/pr6
SundarRajan28 Apr 17, 2024
b3823c8
Merge branch 'swbs/audio/pr6' into swbs/audio/pr8
SundarRajan28 Apr 17, 2024
4de03a5
Merge branch 'swbs/audio/pr8' into swbs/audio/pr9
SundarRajan28 Apr 17, 2024
0161204
Merge branch 'swbs/audio/pr9' into swbs/audio/pr7
SundarRajan28 Apr 17, 2024
42d1bb1
Merge remote-tracking branch 'upstream/develop' into swbs/audio/pr2
SundarRajan28 Apr 17, 2024
3375f41
Merge branch 'swbs/audio/pr2' into swbs/audio/pr3
SundarRajan28 Apr 17, 2024
d7c8884
Merge branch 'swbs/audio/pr3' into audio_pr4
SundarRajan28 Apr 17, 2024
513fd78
Merge branch 'audio_pr4' into swbs/audio/pr5
SundarRajan28 Apr 17, 2024
44cefd6
Merge branch 'swbs/audio/pr5' into swbs/audio/pr6
SundarRajan28 Apr 17, 2024
c100e80
Merge branch 'swbs/audio/pr6' into swbs/audio/pr8
SundarRajan28 Apr 17, 2024
9698308
Merge branch 'swbs/audio/pr8' into swbs/audio/pr9
SundarRajan28 Apr 17, 2024
23dad87
Merge branch 'swbs/audio/pr9' into swbs/audio/pr7
SundarRajan28 Apr 17, 2024
d928c48
Merge branch 'develop' of https://github.com/ROCm/rocAL into swbs/aud…
fiona-gladwin Apr 17, 2024
c0d2309
Merge branch 'swbs/audio/pr1' into swbs/audio/pr2
fiona-gladwin Apr 17, 2024
c01325d
Revert empty line removed in CMakeLists.txt
fiona-gladwin Apr 17, 2024
549def5
Removed prefix original for audio vectors
fiona-gladwin Apr 17, 2024
b8f90a8
Fix PARTIAL
fiona-gladwin Apr 18, 2024
211c4c9
Reduce overall time for audio source evalution
swetha097 Apr 18, 2024
5a313ec
Fix shard_size and stick to shard issue seen with convergence
swetha097 Apr 18, 2024
c1d9cc5
Resolve PR comments
swetha097 Apr 18, 2024
7874f09
Add @params to all args in pytorch.py
swetha097 Apr 18, 2024
ef9a21b
Fix build issue
swetha097 Apr 18, 2024
0f48da9
Merge branch 'develop' of https://github.com/ROCm/rocAL into swbs/aud…
fiona-gladwin Apr 22, 2024
a4de349
Merge branch 'swbs/audio/pr7' of https://github.com/swetha097/rocAL i…
fiona-gladwin Apr 22, 2024
37921de
Minor changes in unit test
swetha097 Apr 22, 2024
96ace00
Merge branch 'swbs/audio/pr2' of https://github.com/swetha097/rocAL i…
swetha097 Apr 22, 2024
6602895
Minor changes
swetha097 Apr 22, 2024
aa13a35
Change ROCAL instaces to rocAL in pytorch.py
swetha097 Apr 22, 2024
2873d8c
Merge branch 'swbs/audio/pr2' into swbs/audio/pr3
fiona-gladwin Apr 22, 2024
2dd31f8
Resolve the PR comments
swetha097 Apr 23, 2024
1cd9779
Merge branch 'swbs/audio/pr3' of https://github.com/swetha097/rocAL i…
swetha097 Apr 23, 2024
d1d5241
Minor changes in decoders.py - Modify the comment for shard_size
swetha097 Apr 23, 2024
f4bcbca
Merge branch 'swbs/audio/pr2' of https://github.com/swetha097/rocAL i…
fiona-gladwin Apr 23, 2024
d152dca
Merge branch 'swbs/audio/pr3' of https://github.com/swetha097/rocAL i…
fiona-gladwin Apr 23, 2024
e4c5788
Merge branch 'develop' of https://github.com/ROCm/rocAL into swbs/aud…
fiona-gladwin Apr 23, 2024
6a98227
Merge branch 'swbs/audio/pr10_training' into swbs/audio/pr10
swetha097 Apr 24, 2024
6279bb1
Fix shard_size
swetha097 Apr 24, 2024
fb33f06
Merge branch 'swbs/audio/pr3' into audio_pr4
SundarRajan28 Apr 24, 2024
be416ef
Minor changes
swetha097 Apr 24, 2024
d9dbd2c
Changes in pipeline.py and decoders.py
swetha097 Apr 24, 2024
b674564
Merge remote-tracking branch 'origin/swbs/audio/pr10_training' into s…
swetha097 Apr 24, 2024
8a7bb3c
Address the PR comments
swetha097 Apr 25, 2024
2021ab9
Address Review comments
swetha097 Apr 25, 2024
f3a1afa
Remove print statement
swetha097 Apr 30, 2024
08033c7
Merge branch 'swbs/audio/pr10' into lbp_fix_pr10
swetha097 May 3, 2024
33b3681
Fix the count_items
swetha097 May 3, 2024
261b2e7
Make Sharding similar to DALI
swetha097 May 3, 2024
bb2bad2
Fix issues with DROP policy by introducing a new vector for padding
swetha097 May 6, 2024
075882e
Minor fixes
swetha097 May 6, 2024
adbc5fd
Comment out print statements
swetha097 May 7, 2024
c304314
Add changes for shard_size LBP testing
swetha097 May 7, 2024
0524019
Fix DROP Policy with shard_size > 0
swetha097 May 7, 2024
a7eef66
Fix Stick_to_Shard=False
swetha097 May 7, 2024
0395ddd
Fix PARTIAL policy and code clean up
swetha097 May 7, 2024
63aacf6
fix last_batch_padded size when shard_size > 0
swetha097 May 8, 2024
0cb2812
Fix Drop policy - we skip the dropped batch in the next epoch
swetha097 May 8, 2024
296e1c6
Fix single shard outputs
swetha097 May 8, 2024
497cd08
Remove the commented code and fix the padding code in open()
swetha097 May 9, 2024
66addb9
Remove div by num_shards in decoders.py
swetha097 May 9, 2024
0c900a9
Introduce Audio layouts
fiona-gladwin May 9, 2024
e75616c
Add layout changes for spectrogram
fiona-gladwin May 9, 2024
e7ed0d8
Fix the unit tests - c++ & python
swetha097 May 9, 2024
528a87a
Merge branch 'swbs/audio/pr5' of https://github.com/swetha097/rocAL i…
fiona-gladwin May 9, 2024
feff5bd
Code clean up and formatting
swetha097 May 9, 2024
9b206a8
Minor code clean up
swetha097 May 9, 2024
4bc0f0d
code clean up in pytorch.py
swetha097 May 9, 2024
2809761
Add layout changes for spectrogram
fiona-gladwin May 10, 2024
8ff55cb
Pass layouts for MelFilterBank
fiona-gladwin May 10, 2024
0993896
Fix ToDecibels
fiona-gladwin May 10, 2024
79d316c
Fix Normalize
fiona-gladwin May 10, 2024
0822f76
Fix build issue
fiona-gladwin May 10, 2024
120dddc
Merge branch 'swbs/audio/pr5_layout' of https://github.com/swetha097/…
fiona-gladwin May 10, 2024
a6cbbe2
Fix python unit test
fiona-gladwin May 10, 2024
8be961b
Merge remote-tracking branch 'swe_fork/swbs/audio/pr7_layout' into sw…
swetha097 May 10, 2024
ab993d0
Minor fix
fiona-gladwin May 10, 2024
1ddfe34
Pass LBP to decoders instead of the Pipeline creation
swetha097 May 13, 2024
d7764e1
Merge branch 'swbs/lbp_fixes_pr10_pass_readers' into swbs/lbp_fixes_pr10
May 13, 2024
484e1bd
Update pipeline.py - Remove commented code
swetha097 May 13, 2024
924ab79
Update pipeline.py - Remove commented out code
swetha097 May 13, 2024
9757256
Adding changes for spec layout changes
SundarRajan28 May 15, 2024
6b2a06c
Merge branch 'swbs/audio/pr6' into swbs/audio/pr8
SundarRajan28 May 15, 2024
df70d39
Merge branch 'swbs/audio/pr8' into swbs/audio/pr7
SundarRajan28 May 15, 2024
5505ed8
Adding changes to MFB and normalize nodes
SundarRajan28 May 15, 2024
e685c37
Update node_slice.cpp
swetha097 May 16, 2024
fc26afd
Update node_slice.h
swetha097 May 16, 2024
4320399
Resolve PR comments
swetha097 May 17, 2024
ce91644
Merge branch 'swbs/audio/pr5_layout' into swbs/audio/pr5
fiona-gladwin May 17, 2024
60133c6
Merge branch 'swbs/audio/pr5' into swbs/audio/pr6
fiona-gladwin May 17, 2024
b2c40eb
Merge branch 'swbs/audio/pr6' into swbs/audio/pr8
fiona-gladwin May 17, 2024
84db544
Merge branch 'swbs/audio/pr8' into swbs/audio/pr9
fiona-gladwin May 17, 2024
affe8f3
Merge branch 'swbs/audio/pr9' into swbs/audio/pr7
fiona-gladwin May 17, 2024
c41f363
Merge remote-tracking branch 'open_source/develop' into swbs/audio/pr3
swetha097 May 17, 2024
70e12cd
Merge branch 'swbs/audio/pr3' into audio_pr4
swetha097 May 18, 2024
b858b69
Merge branch 'audio_pr4' into swbs/audio/pr5
swetha097 May 18, 2024
5e79034
Merge remote-tracking branch 'origin/swbs/audio/pr5' into HEAD
swetha097 May 18, 2024
66be5a2
Merge branch 'temp_swbs/audio/pr6' into swbs/audio/pr6
swetha097 May 19, 2024
91c4fa1
Merge branch 'swbs/audio/pr6' into swbs/audio/pr8
swetha097 May 19, 2024
750b286
Merge branch 'swbs/audio/pr8' into swbs/audio/pr9
swetha097 May 19, 2024
5276ec2
Merge branch 'swbs/audio/pr9' into swbs/audio/pr7
swetha097 May 19, 2024
4e39f6a
Merge branch 'swbs/audio/pr7' into swbs/lbp_fixes_pr10
swetha097 May 19, 2024
ed9aae2
Fix downmix failing case and resolve the issue with merge
swetha097 May 19, 2024
74759dc
Fix issue with file_source_reader.cpp when file_list is not used
swetha097 May 20, 2024
a417f17
Resolve PR comments - Sundar
swetha097 May 20, 2024
a0242d9
Fix file_source_reader.cpp
swetha097 May 21, 2024
0f4a590
Fix shuffle issues
SundarRajan98 May 23, 2024
9b5fab1
Merge remote-tracking branch 'upstream/develop' into swbs/audio/pr7
SundarRajan28 Jun 5, 2024
328c41a
Adding comments to all if conditions
sbavasab Jun 5, 2024
6237aaf
Merge branch 'swbs/lbp_fixes_pr10' of https://github.com/swetha097/ro…
swetha097 Jun 7, 2024
11b0f96
Merge remote-tracking branch 'upstream/develop' into swbs/audio/pr7
SundarRajan28 Jun 12, 2024
83cecf5
Merge remote-tracking branch 'open_source/develop' into swbs/audio/pr9
swetha097 Jun 12, 2024
6692974
Fix merge conflicts
SundarRajan28 Jun 13, 2024
0cd21de
Merge remote-tracking branch 'upstream/develop' into swbs/audio/pr7
SundarRajan28 Jun 14, 2024
f7e8826
Merge remote-tracking branch 'open_source/develop' into develop
swetha097 Jun 18, 2024
00cdddb
Merge remote-tracking branch 'open_source/develop' into swbs/audio/pr9
swetha097 Jun 18, 2024
0c74d8b
Merge remote-tracking branch 'upstream/develop' into swbs/audio/pr7
SundarRajan28 Jun 20, 2024
20ef6d6
Resolving review comments
SundarRajan28 Jun 21, 2024
7e9b3ce
Merge branch 'develop' into swbs/audio/pr9
swetha097 Jun 21, 2024
7af1c03
Merge remote-tracking branch 'upstream/develop' into swbs/audio/pr9
SundarRajan28 Jun 25, 2024
e2ef16b
Merge branch 'swbs/audio/pr9' into swbs/audio/pr7
SundarRajan28 Jun 25, 2024
c3f5391
Merge remote-tracking branch 'upstream/develop' into swbs/audio/pr7
SundarRajan28 Jun 29, 2024
5b5348a
Merge remote-tracking branch 'swe_fork/swbs/audio/pr7' into swbs/lbp_…
swetha097 Jul 1, 2024
9b5edec
Fix a minor warning in file source reader
swetha097 Jul 1, 2024
5326625
Merge branch 'develop' into swbs/audio/pr7
LakshmiKumar23 Jul 2, 2024
c6fe840
Resolving review comments
SundarRajan28 Jul 4, 2024
5b4ebc5
LBP comments resolution
swetha097 Jul 4, 2024
c8e1791
Merge branch 'develop' into swbs/audio/pr7
LakshmiKumar23 Jul 8, 2024
704badd
Resolving review comments
SundarRajan28 Jul 9, 2024
e521768
Merge remote-tracking branch 'upstream/develop' into swbs/lbp_fixes_pr10
SundarRajan28 Jul 9, 2024
a902dca
Merge branch 'swbs/audio/pr7' into swbs/lbp_fixes_pr10
SundarRajan28 Jul 9, 2024
97d7077
Formatting changes
SundarRajan28 Jul 11, 2024
0ddea59
Resolving Final Set of PR comments
swetha097 Jul 11, 2024
13a105a
Combine with OR condition
swetha097 Jul 11, 2024
f189738
Remove the pad_last_batch_repeated print statement from decoders.py
swetha097 Jul 11, 2024
4e9ff04
Merge branch 'develop' of https://github.com/ROCm/rocAL into swbs/lbp…
fiona-gladwin Jul 12, 2024
5d0decf
Add shard_size and stick_to_shard variables in args
swetha097 Jul 12, 2024
0b8f35c
Merge branch 'develop' into swbs/lbp_fixes_pr10
swetha097 Jul 17, 2024
0e5fa5a
Minor spelling fix
swetha097 Jul 17, 2024
769010c
Merge branch 'develop' into swbs/lbp_fixes_pr10
kiritigowda Jul 24, 2024
615e12d
Make changes to insert the padded data in the file_names vector
swetha097 Jul 24, 2024
cec5e8a
Merge branch 'swbs/lbp_fixes_pr10' of https://github.com/swetha097/ro…
swetha097 Jul 24, 2024
072ee56
Support to pass the variables fo lbp as struct
swetha097 Jul 24, 2024
408091a
Merge branch 'develop' into swbs/lbp_fixes_pr10
kiritigowda Jul 24, 2024
78c2a3d
Fix segmentation fault
swetha097 Jul 29, 2024
4dcc1fc
Merge branch 'ROCm:develop' into swbs/lbp_fixes_pr10_pr_comments
SundarRajan28 Jul 30, 2024
e3ccc27
Merge branch 'swbs/lbp_fixes_pr10_pr_comments' into swbs/lbp_fixes_pr10
swetha097 Aug 5, 2024
1c5398e
Resolve PR comments
swetha097 Aug 5, 2024
4e003c3
Merge branch 'swbs/lbp_fixes_pr10' of https://github.com/swetha097/ro…
swetha097 Aug 5, 2024
4ffb79b
Resolve PR comments
swetha097 Aug 5, 2024
70220c3
Resolve PR comments
swetha097 Aug 5, 2024
c58ffa9
Use PreComputed start and end indices
swetha097 Aug 5, 2024
6803069
Use precomputed shard_idx start and end in initialize
swetha097 Aug 5, 2024
a5797d5
Merge branch 'develop' into swbs/lbp_fixes_pr10
kiritigowda Aug 9, 2024
68cbee9
Merge branch 'develop' into swbs/lbp_fixes_pr10
kiritigowda Aug 20, 2024
4d83a11
Merge remote-tracking branch 'origin/swbs/lbp_fixes_pr10_pr_comments'…
swetha097 Aug 20, 2024
4a5b55f
Merge branch 'develop' into swbs/lbp_fixes_pr10
kiritigowda Aug 27, 2024
99d232d
Initialize the Sharding info using ShardingInfo()
swetha097 Aug 28, 2024
e9b5d63
Merge branch 'swbs/lbp_fixes_pr10' of https://github.com/swetha097/ro…
swetha097 Aug 28, 2024
612b450
convert the signed to int32_t type
swetha097 Aug 28, 2024
85d2c2a
temp commit for struct changes
swetha097 Aug 28, 2024
1c1dfa8
Fix the struct changes - All the test cases passing
Aug 29, 2024
9e1ce0c
Remove any print statements
Aug 29, 2024
85ef52a
Add support to Pass the decode size policy from the user
Aug 29, 2024
6e1b81f
Add support to Pass the decode size policy from the user
Aug 29, 2024
be92088
Merge branch 'swbs/lbp_fixes_pr10' of https://github.com/swetha097/ro…
swetha097 Aug 29, 2024
51259c3
Rename RocalShardingInfo for ShardingInfo and vice-versa
swetha097 Aug 29, 2024
7b4276f
xywh roi copy
swetha097 Aug 29, 2024
8157375
Fix decoders.py for image decoders
swetha097 Aug 29, 2024
673b5d5
Make stick_to_shard True by default
swetha097 Aug 29, 2024
dea2023
Merge branch 'develop' of https://github.com/ROCmSoftwarePlatform/roc…
fgladwin Aug 29, 2024
8ae2e1c
Minor changes to the copy_data function
swetha097 Aug 29, 2024
8e26013
Rename to x_offset and y_offset in copy_data
fgladwin Aug 30, 2024
125c093
Minor changes - remove unused variables
fgladwin Aug 30, 2024
bf6f72f
Minor change - Variable names
fgladwin Aug 30, 2024
4560ffc
Update Doxygen comments and comments of API
fgladwin Sep 3, 2024
81c9d19
Merge branch 'swbs/lbp_fixes_pr10' of https://github.com/swetha097/ro…
fgladwin Sep 3, 2024
f279dc5
Merge branch 'develop' into swbs/lbp_fixes_pr10
LakshmiKumar23 Sep 5, 2024
bf1c23a
Merge branch 'develop' into swbs/lbp_fixes_pr10
kiritigowda Sep 6, 2024
82e276f
Make the rocalShardingInfo as the last param for Audio loaders
Sep 6, 2024
2918a89
Remove unused variables and functions in file_source_reader.cpp & .h …
Sep 6, 2024
0878361
Remove the doctring explanation for unused params
Sep 6, 2024
2c1aee9
Change the explanation according to the newly introduced structure
Sep 6, 2024
753e442
Merge branch 'develop' into swbs/lbp_fixes_pr10
LakshmiKumar23 Sep 6, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
131 changes: 74 additions & 57 deletions rocAL/include/api/rocal_api_data_loaders.h

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion rocAL/include/api/rocal_api_info.h
Original file line number Diff line number Diff line change
Expand Up @@ -133,7 +133,7 @@ extern "C" TimingInfo ROCAL_API_CALL rocalGetTimingInfo(RocalContext rocal_conte
* \brief Retrieves the information about the size of the last batch.
* \ingroup group_rocal_info
* \param rocal_context
* \return The number of samples that were padded in the last batch in adherence with last_batch_policy and last_batch_padded
* \return The number of samples that were padded in the last batch in adherence with last_batch_policy and pad_last_batch_repeated.
*/
extern "C" size_t ROCAL_API_CALL rocalGetLastBatchPaddedSize(RocalContext rocal_context);

Expand Down
1 change: 1 addition & 0 deletions rocAL/include/api/rocal_api_tensor.h
Original file line number Diff line number Diff line change
Expand Up @@ -40,6 +40,7 @@ class rocalTensor {
virtual ~rocalTensor() = default;
virtual void* buffer() = 0;
virtual unsigned copy_data(void* user_buffer, RocalOutputMemType external_mem_type = ROCAL_MEMCPY_HOST) = 0;
virtual unsigned copy_data(void* user_buffer, uint x_offset, uint y_offset, uint max_cols, uint max_rows) = 0; // Copy only the ROI to the user_buffer [The padded region is not copied]
virtual unsigned num_of_dims() = 0;
virtual unsigned batch_size() = 0;
virtual std::vector<size_t> dims() = 0;
Expand Down
22 changes: 20 additions & 2 deletions rocAL/include/api/rocal_api_types.h
Original file line number Diff line number Diff line change
Expand Up @@ -438,7 +438,7 @@ enum RocalMelScaleFormula {
ROCAL_MELSCALE_HTK
};

/*! \brief Tensor Last Batch Policies
/*! \brief Tensor Last Batch Policy Type enum
* \ingroup group_rocal_types
*/
enum RocalLastBatchPolicy {
Expand All @@ -448,9 +448,27 @@ enum RocalLastBatchPolicy {
/*! \brief ROCAL_LAST_BATCH_DROP - The last batch is dropped if there are not enough samples from the current epoch.
*/
ROCAL_LAST_BATCH_DROP = 1,
/*! \brief ROCAL_LAST_BATCH_PARTIAL - The last batch is partially filled with the remaining data from the current epoch, keeping the rest of the samples empty. (currently this policy works similar to FILL in rocAL, PARTIAL policy needs to be handled from python end)
/*! \brief ROCAL_LAST_BATCH_PARTIAL - The last batch is partially filled with the remaining data from the current epoch, keeping the rest of the samples empty. (currently this policy works similar to FILL in rocAL, PARTIAL policy needs to be handled in the python iterator)
*/
ROCAL_LAST_BATCH_PARTIAL = 2
};

/*! \brief rocAL RocalShardingInfo enum
* \ingroup group_rocal_types
*/
struct RocalShardingInfo {
RocalLastBatchPolicy last_batch_policy;
bool pad_last_batch_repeated;
bool stick_to_shard;
int32_t shard_size;

// Constructor with default values
RocalShardingInfo()
: last_batch_policy(RocalLastBatchPolicy::ROCAL_LAST_BATCH_FILL),
pad_last_batch_repeated(false),
stick_to_shard(true),
shard_size(-1)
{}
};

#endif // MIVISIONX_ROCAL_API_TYPES_H
1 change: 1 addition & 0 deletions rocAL/include/loaders/audio/audio_loader.h
Original file line number Diff line number Diff line change
Expand Up @@ -56,6 +56,7 @@ class AudioLoader : public LoaderModule {
void feed_external_input(const std::vector<std::string>& input_images_names, const std::vector<unsigned char*>& input_buffer,
const std::vector<ROIxywh>& roi_xywh, unsigned int max_width, unsigned int max_height, unsigned int channels,
ExternalSourceFileMode mode, bool eos) override { THROW("external source feed is not supported in audio loader") }
size_t last_batch_padded_size() override;

private:
bool is_out_of_data();
Expand Down
1 change: 1 addition & 0 deletions rocAL/include/loaders/audio/audio_loader_sharded.h
Original file line number Diff line number Diff line change
Expand Up @@ -45,6 +45,7 @@ class AudioLoaderSharded : public LoaderModule {
void feed_external_input(const std::vector<std::string>& input_images_names, const std::vector<unsigned char*>& input_buffer,
const std::vector<ROIxywh>& roi_xywh, unsigned int max_width, unsigned int max_height, unsigned int channels,
ExternalSourceFileMode mode, bool eos) override { THROW("external source feed is not supported in audio loader") }
size_t last_batch_padded_size() override;

private:
void increment_loader_idx();
Expand Down
1 change: 1 addition & 0 deletions rocAL/include/loaders/audio/audio_read_and_decode.h
Original file line number Diff line number Diff line change
Expand Up @@ -64,6 +64,7 @@ class AudioReadAndDecode {
const size_t max_decoded_channels);
// returns timing info or other status information
Timing GetTiming();
size_t last_batch_padded_size(); // The number of padded samples in the last batch

private:
std::vector<std::shared_ptr<AudioDecoder>> _decoder;
Expand Down
4 changes: 3 additions & 1 deletion rocAL/include/loaders/audio/node_audio_loader.h
Original file line number Diff line number Diff line change
Expand Up @@ -44,11 +44,13 @@ class AudioLoaderNode : public Node {
/// \param load_batch_count Defines the quantum count of the Audios to be loaded. It's usually equal to the user's batch size.
/// \param mem_type Memory type, host or device
/// \param meta_data_reader Determines the meta-data information
/// \param sharding_info The members of ShardingInfo determines how the data is distributed among the shards and how the last batch is processed by the pipeline.
/// The loader will repeat Audios if necessary to be able to have Audios in multiples of the load_batch_count,
/// for example if there are 10 Audios in the dataset and load_batch_count is 3, the loader repeats 2 Audios as if there are 12 Audios available.
void Init(unsigned internal_shard_count, unsigned cpu_num_threads, const std::string &source_path,
const std::string &file_list_path, StorageType storage_type, DecoderType decoder_type, bool shuffle, bool loop,
size_t load_batch_count, RocalMemType mem_type, std::shared_ptr<MetaDataReader> meta_data_reader);
size_t load_batch_count, RocalMemType mem_type, std::shared_ptr<MetaDataReader> meta_data_reader,
const ShardingInfo& sharding_info);
std::shared_ptr<LoaderModule> GetLoaderModule();

protected:
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -42,11 +42,13 @@ class AudioLoaderSingleShardNode : public Node {
/// \param load_batch_count Defines the quantum count of the Audios to be loaded. It's usually equal to the user's batch size.
/// \param mem_type Memory type, host or device
/// \param meta_data_reader Determines the meta-data information
/// \param sharding_info The members of ShardingInfo determines how the data is distributed among the shards and how the last batch is processed by the pipeline.
/// The loader will repeat Audios if necessary to be able to have Audios in multiples of the load_batch_count,
/// for example if there are 10 Audios in the dataset and load_batch_count is 3, the loader repeats 2 Audios as if there are 12 Audios available.
void Init(unsigned shard_id, unsigned shard_count, unsigned cpu_num_threads, const std::string &source_path,
const std::string &file_list_path, StorageType storage_type, DecoderType decoder_type, bool shuffle,
bool loop, size_t load_batch_count, RocalMemType mem_type, std::shared_ptr<MetaDataReader> meta_data_reader);
bool loop, size_t load_batch_count, RocalMemType mem_type, std::shared_ptr<MetaDataReader> meta_data_reader,
const ShardingInfo& sharding_info);
std::shared_ptr<LoaderModule> GetLoaderModule();

protected:
Expand Down
2 changes: 1 addition & 1 deletion rocAL/include/loaders/image/node_fused_jpeg_crop.h
Original file line number Diff line number Diff line change
Expand Up @@ -42,7 +42,7 @@ class FusedJpegCropNode : public Node {
/// for example if there are 10 images in the dataset and load_batch_count is 3, the loader repeats 2 images as if there are 12 images available.
void init(unsigned internal_shard_count, unsigned cpu_num_threads, const std::string &source_path, const std::string &json_path, StorageType storage_type,
DecoderType decoder_type, bool shuffle, bool loop, size_t load_batch_count, RocalMemType mem_type, std::shared_ptr<MetaDataReader> meta_data_reader,
unsigned num_attempts, std::vector<float> &random_area, std::vector<float> &random_aspect_ratio, std::pair<RocalBatchPolicy, bool> last_batch_info = {RocalBatchPolicy::FILL, true});
unsigned num_attempts, std::vector<float> &random_area, std::vector<float> &random_aspect_ratio, const ShardingInfo& sharding_info = ShardingInfo());

std::shared_ptr<LoaderModule> get_loader_module();

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -39,7 +39,7 @@ class FusedJpegCropSingleShardNode : public Node {
/// for example if there are 10 images in the dataset and load_batch_count is 3, the loader repeats 2 images as if there are 12 images available.
void init(unsigned shard_id, unsigned shard_count, unsigned cpu_num_threads, const std::string &source_path, const std::string &json_path, StorageType storage_type,
DecoderType decoder_type, bool shuffle, bool loop, size_t load_batch_count, RocalMemType mem_type, std::shared_ptr<MetaDataReader> meta_data_reader,
unsigned num_attempts, std::vector<float> &random_area, std::vector<float> &random_aspect_ratio, std::pair<RocalBatchPolicy, bool> last_batch_info = {RocalBatchPolicy::FILL, true});
unsigned num_attempts, std::vector<float> &random_area, std::vector<float> &random_aspect_ratio, const ShardingInfo& sharding_info = ShardingInfo());

std::shared_ptr<LoaderModule> get_loader_module();

Expand Down
2 changes: 1 addition & 1 deletion rocAL/include/loaders/image/node_image_loader.h
Original file line number Diff line number Diff line change
Expand Up @@ -40,7 +40,7 @@ class ImageLoaderNode : public Node {
/// The loader will repeat images if necessary to be able to have images in multiples of the load_batch_count,
/// for example if there are 10 images in the dataset and load_batch_count is 3, the loader repeats 2 images as if there are 12 images available.
void init(unsigned internal_shard_count, unsigned cpu_num_threads, const std::string &source_path, const std::string &json_path, const std::map<std::string, std::string> feature_key_map, StorageType storage_type, DecoderType decoder_type, bool shuffle, bool loop,
size_t load_batch_count, RocalMemType mem_type, std::shared_ptr<MetaDataReader> meta_data_reader, bool decoder_keep_orig = false, std::pair<RocalBatchPolicy, bool> last_batch_info = {RocalBatchPolicy::FILL, true}, const char *prefix = "", unsigned sequence_length = 0, unsigned step = 0, unsigned stride = 0, ExternalSourceFileMode external_file_mode = ExternalSourceFileMode::NONE);
size_t load_batch_count, RocalMemType mem_type, std::shared_ptr<MetaDataReader> meta_data_reader, bool decoder_keep_orig = false, const ShardingInfo& sharding_info = ShardingInfo(), const char *prefix = "", unsigned sequence_length = 0, unsigned step = 0, unsigned stride = 0, ExternalSourceFileMode external_file_mode = ExternalSourceFileMode::NONE);

std::shared_ptr<LoaderModule> get_loader_module();

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -37,7 +37,7 @@ class ImageLoaderSingleShardNode : public Node {
/// The loader will repeat images if necessary to be able to have images in multiples of the load_batch_count,
/// for example if there are 10 images in the dataset and load_batch_count is 3, the loader repeats 2 images as if there are 12 images available.
void init(unsigned shard_id, unsigned shard_count, unsigned cpu_num_threads, const std::string &source_path, const std::string &json_path, StorageType storage_type, DecoderType decoder_type,
bool shuffle, bool loop, size_t load_batch_count, RocalMemType mem_type, std::shared_ptr<MetaDataReader> meta_data_reader, bool decoder_keep_orig = false, std::pair<RocalBatchPolicy, bool> last_batch_info = {RocalBatchPolicy::FILL, true},
bool shuffle, bool loop, size_t load_batch_count, RocalMemType mem_type, std::shared_ptr<MetaDataReader> meta_data_reader, bool decoder_keep_orig = false, const ShardingInfo& sharding_info = ShardingInfo(),
const std::map<std::string, std::string> feature_key_map = std::map<std::string, std::string>(), unsigned sequence_length = 0, unsigned step = 0, unsigned stride = 0, ExternalSourceFileMode external_file_mode = ExternalSourceFileMode::NONE);

std::shared_ptr<LoaderModule> get_loader_module();
Expand Down
1 change: 1 addition & 0 deletions rocAL/include/meta_data/meta_data_reader.h
Original file line number Diff line number Diff line change
Expand Up @@ -100,4 +100,5 @@ class MetaDataReader {
virtual ImgSize lookup_image_size(const std::string& image_name) { return {}; }
virtual void set_aspect_ratio_grouping(bool aspect_ratio_grouping) { return; }
virtual bool get_aspect_ratio_grouping() const { return {}; }
virtual std::vector<std::string> get_relative_file_path() { return {}; } // Returns the relative file_path's of the reader
swetha097 marked this conversation as resolved.
Show resolved Hide resolved
};
2 changes: 2 additions & 0 deletions rocAL/include/meta_data/text_file_meta_data_reader.h
Original file line number Diff line number Diff line change
Expand Up @@ -36,6 +36,7 @@ class TextFileMetaDataReader : public MetaDataReader {
bool set_timestamp_mode() override { return false; }

const std::map<std::string, std::shared_ptr<MetaData>>& get_map_content() override { return _map_content; }
std::vector<std::string> get_relative_file_path() override { return _relative_file_path; }
TextFileMetaDataReader();

private:
Expand All @@ -45,4 +46,5 @@ class TextFileMetaDataReader : public MetaDataReader {
void add(std::string image_name, int label);
std::map<std::string, std::shared_ptr<MetaData>> _map_content;
std::string _path;
std::vector<std::string> _relative_file_path {};
};
4 changes: 2 additions & 2 deletions rocAL/include/pipeline/commons.h
Original file line number Diff line number Diff line change
Expand Up @@ -159,11 +159,11 @@ struct Timing {
long long unsigned video_process_time= 0;
};

/*! \brief Tensor Last Batch Policies
/*! \brief Tensor Last Batch Policy Type enum
These policies the last batch policies determine the behavior when there are not enough samples in the epoch to fill the last batch
FILL - The last batch is filled by either repeating the last sample or by wrapping up the data set.
DROP - The last batch is dropped if it cannot be fully filled with data from the current epoch.
PARTIAL - The last batch is partially filled with the remaining data from the current epoch, and padding the remaining samples with either last image or wrapping up the dataset - the padded images are removed in the python end
PARTIAL - The last batch is partially filled with the remaining data from the current epoch, keeping the rest of the samples empty. (currently this policy works similar to FILL in rocAL, PARTIAL policy needs to be handled in the pytorch iterator)
*/
enum RocalBatchPolicy {
FILL = 0,
Expand Down
2 changes: 1 addition & 1 deletion rocAL/include/pipeline/tensor.h
Original file line number Diff line number Diff line change
Expand Up @@ -326,7 +326,7 @@ class Tensor : public rocalTensor {
#endif
unsigned copy_data(void* user_buffer, RocalOutputMemType external_mem_type) override;
//! Copying the output buffer with specified max_cols and max_rows values for the 2D buffer of size batch_size
unsigned copy_data(void* user_buffer, uint max_rows, uint max_cols);
unsigned copy_data(void* user_buffer, uint x_offset, uint y_offset, uint max_rows, uint max_cols);
//! Default destructor
/*! Releases the OpenVX Tensor object */
~Tensor();
Expand Down
41 changes: 25 additions & 16 deletions rocAL/include/readers/file_source_reader.h
Original file line number Diff line number Diff line change
Expand Up @@ -28,8 +28,8 @@ THE SOFTWARE.
#include <vector>

#include "pipeline/commons.h"
#include "readers/image/image_reader.h"
#include "pipeline/timing_debug.h"
#include "readers/image/image_reader.h"

class FileSourceReader : public Reader {
public:
Expand Down Expand Up @@ -67,9 +67,11 @@ class FileSourceReader : public Reader {

FileSourceReader();

//! Returns the number of images in the last batch
size_t last_batch_padded_size() override;
size_t last_batch_padded_size() override; // The size of the number of samples padded in the last batch

std::string get_root_folder_path() override; // Returns the root folder path

std::vector<std::string> get_file_paths_from_meta_data_reader() override; // Returns the relative file path from the meta-data reader
private:
//! opens the folder containnig the images
Reader::Status open_folder();
Expand All @@ -83,30 +85,37 @@ class FileSourceReader : public Reader {
unsigned _curr_file_idx;
FILE *_current_fPtr;
unsigned _current_file_size;
unsigned _shard_start_idx;
std::vector<unsigned> _shard_start_idx_vector, _shard_end_idx_vector;
std::string _last_id;
std::string _last_file_name, _last_file_path;
std::string _last_file_name, _last_file_path, _absolute_file_path;
size_t _shard_id = 0;
size_t _shard_count = 1; // equivalent of batch size
//!< _batch_count Defines the quantum count of the images to be read. It's usually equal to the user's batch size.
/// The loader will repeat images if necessary to be able to have images available in multiples of the load_batch_count,
/// for instance if there are 10 images in the dataset and _batch_count is 3, the loader repeats 2 images as if there are 12 images available.
size_t _batch_count = 1;
size_t _file_id = 0;
size_t _in_batch_read_count = 0;
int32_t _shard_size = -1;
size_t _batch_size = 1;
size_t _padded_samples = 0;
bool _loop;
bool _shuffle;
int _read_counter = 0;
//!< _file_count_all_shards total_number of files in to figure out the max_batch_size (usually needed for distributed training).
size_t _file_count_all_shards;
void incremenet_read_ptr();
void increment_curr_file_idx();
int release();
size_t get_file_shard_id();
void incremenet_file_id() { _file_id++; }
void fill_last_batch();
void replicate_last_batch_to_pad_partial_shard();
std::shared_ptr<MetaDataReader> _meta_data_reader = nullptr;
//! Pair containing the last batch policy and last_batch_padded values for deciding what to do with last batch
std::pair<RocalBatchPolicy, bool> _last_batch_info;
size_t _last_batch_padded_size = 0;
Reader::Status generate_file_names();
//! Pair containing the last batch policy and pad_last_batch_repeated values for deciding what to do with last batch
ShardingInfo _last_batch_info = ShardingInfo(); // The members of ShardingInfo determines how the data is distributed among the shards and how the last batch is processed by the pipeline.
size_t _last_batch_padded_size = 0; // The size of number of padded samples in the last batch
size_t _num_padded_samples = 0; //! Number of samples that are padded in the last batch which would differ for each shard.
bool _stick_to_shard = false;
bool _pad_last_batch_repeated = false;
Reader::Status generate_file_names(); // Function that would generate _file_names containing all the samples in the dataset
void compute_start_and_end_idx_of_all_shards(); // Start Idx of all the Shards
size_t get_dataset_size(); // DataSet Size
size_t actual_shard_size_without_padding(); // Actual Number of Files present in the shard (without padding)
size_t largest_shard_size_without_padding(); // The size of the shard having largest files (without padding)
//!< Used to advance to the next shard's data to increase the entropy of the data seen by the pipeline>
void increment_shard_id();
};
Loading
Loading