Load bboxes dataset from VIA tracks file (3/4) #229

sfmig · 2024-06-19T10:31:36Z

Rebase after #234

Description

What is this PR

Bug fix
Addition of a new feature
Other

Why is this PR needed?
To be able to load a VIA tracks file with bounding boxes into a movement dataset.

What does this PR do?

Adds a movement.io.load_bboxes module, which follows the equivalent poses module as closely as possible.
Adds corresponding unit tests in test_load_bboxes.

Question: how to make mypy aware of the type transformations that take place in __attrs_post_init__?
For example, the confidence array passed to ValidBboxesDataset can be None, so mypy flags the later use of its .shape attribute, which None doesn't have. But mypy seems to miss that the confidence array is filled with NaNs in __attrs_post_init__ when None is passed as input. Is there a nice way to fix this?
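
One possible approach, sketched here with a stdlib dataclass standing in for the attrs class (`__post_init__` plays the role of `__attrs_post_init__`) and plain lists standing in for numpy arrays. `BboxesInput` and `filled_confidence` are hypothetical names, not part of movement. mypy only looks at the declared attribute type, so the narrowing has to be stated explicitly at the point where the value is used:

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class BboxesInput:
    """Hypothetical stand-in for ValidBboxesDataset (illustrative only)."""

    position: list[list[float]]  # one [x, y] row per frame
    confidence: Optional[list[float]] = None

    def __post_init__(self) -> None:
        # Mirror the behaviour described above: fill with NaN if None is passed.
        if self.confidence is None:
            self.confidence = [float("nan")] * len(self.position)

    @property
    def filled_confidence(self) -> list[float]:
        # The declared type is still Optional, so narrow it explicitly for mypy.
        assert self.confidence is not None
        return self.confidence


ds = BboxesInput(position=[[0.0, 1.0], [2.0, 3.0]])
n_frames = len(ds.filled_confidence)  # type-checks: list[float], not Optional
```

The same `assert ... is not None` narrowing should work inside any method of an attrs class; whether a property like this or an attrs converter fits better here is a judgment call.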

References

This PR would close #167

How has this PR been tested?

Tests pass locally and on CI.

Is this a breaking change?

No.

Does this PR require an update to the documentation?

I updated api_index.rst.

Checklist:

The code has been tested locally
Tests have been added to cover all new functionality
The documentation has been updated to reflect any changes
The code has been formatted with pre-commit

codecov · 2024-06-19T10:39:05Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 99.76%. Comparing base (d10ec20) to head (7c260b3).
Report is 1 commit behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #229      +/-   ##
==========================================
+ Coverage   99.74%   99.76%   +0.02%     
==========================================
  Files          13       14       +1     
  Lines         771      854      +83     
==========================================
+ Hits          769      852      +83     
  Misses          2        2

movement/validators/datasets.py

movement/validators/files.py

sonarqubecloud · 2024-06-21T13:23:09Z

Quality Gate passed

Issues
7 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud

niksirbi

Thanks a lot @sfmig, and well done on closely following the code architecture for loading poses. This makes me hopeful that we will be able to refactor them in future, abstracting away their common bits (to reduce repetition in validators and tests between poses and bboxes). But this shouldn't worry us right now.

I have two substantial comments apart from the cosmetic/trivial issues I've highlighted in specific comments.

I don't fully understand what frame_array is needed for and how it is handled. See my specific comment about this. Perhaps we can discuss it in person together?
The documentation pages for input_output.md and movement_dataset.md need to be updated. Specifically:
- The new format has to be added to the "Supported formats" section of Input/Output.
- A new "Loading tracks of bounding boxes" section has to be added to Input/Output.
- The "Movement Dataset" page also has to include information about the bboxes dataset format, especially where it differs from the poses format.

I'm happy for you to leave the documentation changes for a future PR (just open an issue in that case).

EDIT

Another thought that just came to mind: have you tested whether the existing filtering and kinematics functions work on the new bboxes datasets?

You could define a valid_bboxes_dataset pytest fixture (similar to valid_poses_dataset) and add it to all relevant tests in:

test_integration/test_filtering.py
test_unit/test_filtering.py
test_unit/test_kinematics.py
test_unit/test_move_accessor.py

It's crucial to determine if our existing features work on the new type of dataset.

movement/io/load_bboxes.py

movement/sample_data.py

tests/test_unit/test_load_bboxes.py

movement/io/load_bboxes.py

tests/test_unit/test_load_bboxes.py

movement/io/load_bboxes.py

sfmig · 2024-07-19T17:08:01Z

From chats with @niksirbi:
We agreed to take the frame numbers as specified in the csv file for now (that is, not necessarily starting from 0), but to make the time coordinate have the same origin as the frame number.

sfmig · 2024-07-26T12:17:23Z

Thanks @niksirbi for the feedback!

I think I addressed all the comments, let me know if something is missing.

`frame_array` and frame numbers

I made some changes to hopefully make the behaviour of keeping / resetting the frame numbers easier to understand:

In all the loaders of bboxes data (from_file, from_via_tracks_file, from_numpy), the default behaviour is that the origin of the time dimension is the first loaded (aka tracked) frame, which would be frame number 0 captured at t = 0 seconds. This is to stay consistent with the pose data, and it is likely the most natural for our users.
In from_file and from_via_tracks_file I added a use_frame_numbers_from_file option, which is False by default. If True, it will take the frame numbers from the input file, with whatever assumption for the time origin they have (they may assume for example that the first frame is 0, or 1).
- The caveat is that if you take the frame numbers from the file, and later transform them to seconds, then the time origin is assumed to be at frame number 0 and time = 0 seconds. This is because we simply transform to seconds as frame_number/fps.
- Another option could be to add an extra attribute / argument to set the time origin manually, but I didn't want to complicate it too much. Let me know if it is clear as is; if not, I am happy to give it another go.
- If users want to keep track of the frame relative to the full video at which the analysed clip starts, they could assign a custom attribute. I sneakily added this in PR Getting started docs update for bboxes #245 - let me know your thoughts.
For the from_numpy loader, this is simply done via the frame_array input. If passed, those frame numbers are used, but the input is None by default, which means the frame numbers are assigned from 0 to N-1, with N being the first dimension of the position array.
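
To make the time-origin caveat concrete, here is a minimal sketch of the conversion described above. `time_coords` is a hypothetical helper, not the movement implementation; it only assumes that seconds are computed as frame_number / fps and that missing frame numbers default to 0 to N-1:

```python
from typing import Optional, Sequence


def time_coords(
    n_frames: int,
    fps: Optional[float] = None,
    frame_numbers: Optional[Sequence[int]] = None,
) -> list[float]:
    """Return the time coordinate: in frames if fps is None, else in seconds."""
    if frame_numbers is None:
        # Default: frames are numbered 0 to N-1, so the first loaded frame is t = 0.
        frame_numbers = range(n_frames)
    if len(frame_numbers) != n_frames:
        raise ValueError("expected one frame number per frame")
    if fps is None:
        return [float(f) for f in frame_numbers]
    # Seconds are frame_number / fps, so the origin sits at frame number 0.
    return [f / fps for f in frame_numbers]


default_time = time_coords(3, fps=2)  # first loaded frame is at 0 s
file_time = time_coords(3, fps=2, frame_numbers=[10, 11, 12])  # starts at 5 s
```

With frame numbers taken from a file that starts at frame 10, the first time value is 5 s rather than 0 s, which is the caveat noted above.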

Update Getting started documentation

I moved this to a separate PR Getting started docs update for bboxes #245

Extend tests to ensure postprocessing methods work with bboxes datasets

Working on it in this (currently draft) PR Extend tests for bboxes datasets #246

Question

How do you think we should merge these PRs?
I have never done this but I'm wondering if we should merge PR #245 and #246 into this PR's branch, before merging this into main. Or should we use a dev branch?

niksirbi

Thanks @sfmig, I'm happy with how you've handled things, and the updated docstrings make it super clear what happens to time and frames.

I've only left very few suggestions re docstrings, feel free to adopt or not.

movement/io/load_bboxes.py

movement/validators/datasets.py

movement/io/load_bboxes.py

niksirbi · 2024-07-26T14:41:11Z

> How do you think we should merge these PRs?
> I have never done this but I'm wondering if we should merge PR #245 and #246 into this PR's branch, before merging this into main. Or should we use a dev branch?

I've never done this either. Maybe we should consult with Alessandro or Joe, who have more experience with complex merges.
How about merging each of these 3 PRs separately (and sequentially) to main (with rebases in-between)? Is that not an option? It would be nice to have them as 3 separate commits in main, but if that's painful, feel free to merge any way that's convenient.

sonarqubecloud · 2024-07-31T10:23:48Z

Quality Gate passed

Issues
3 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud

sfmig force-pushed the smg/read-via-file-as-bbox-ds branch from 511b3de to 161bf67 Compare June 19, 2024 17:27

VascoSch92 reviewed Jun 19, 2024

movement/validators/datasets.py (outdated)

movement/validators/files.py (outdated)

This was referenced Jun 20, 2024

Add bboxes sample data #231

Merged

VIA tracks data loader #186

Closed

sfmig changed the title ~~Load VIA data as bboxes dataset~~ Load bboxes dataset from VIA tracks file Jun 20, 2024

sfmig changed the title ~~Load bboxes dataset from VIA tracks file~~ Load bboxes dataset from VIA tracks file (4/4) Jun 20, 2024

sfmig marked this pull request as ready for review June 21, 2024 12:57

sfmig force-pushed the smg/read-via-file-as-bbox-ds branch from 111e599 to 2cf6c80 Compare June 21, 2024 13:13

sfmig requested a review from niksirbi June 21, 2024 15:42

sfmig changed the title ~~Load bboxes dataset from VIA tracks file (4/4)~~ Load bboxes dataset from VIA tracks file (3/4) Jun 27, 2024

This was referenced Jul 18, 2024

Small edits to ValidBboxesDataset (1/4) #230

Merged

Add a ValidVIAtracksCSV class (2/4) #219

Merged

niksirbi requested changes Jul 18, 2024

niksirbi reviewed Jul 18, 2024

movement/io/load_bboxes.py

sfmig force-pushed the smg/read-via-file-as-bbox-ds branch 4 times, most recently from 089a7d1 to 2d9b330 Compare July 24, 2024 09:17

This was referenced Jul 25, 2024

Getting started docs update for bboxes #245

Merged

Extend tests for bboxes datasets #246

Merged

sfmig force-pushed the smg/read-via-file-as-bbox-ds branch from ec714db to 3f7aadf Compare July 25, 2024 17:31

sfmig requested a review from niksirbi July 26, 2024 12:17

niksirbi approved these changes Jul 26, 2024

sfmig force-pushed the smg/read-via-file-as-bbox-ds branch from d0f2326 to d9cf96b Compare July 30, 2024 13:28

Draft load_bboxes.py

1740bd6

sfmig and others added 22 commits July 31, 2024 11:16

Small edits and docstring review

5184751

Update API docs

2883b80

Apply suggestions from code review

7b568f6

Co-authored-by: Niko Sirmpilatze <[email protected]>

Fix rebase artifacts

6cd2354

Edits to docstrings for consistency

83a5902

Modify `_via_attribute_column_to_numpy`, `_extract_frame_number_from_via_tracks_df` and `_extract_confidence_from_via_tracks_df` to return one-dimensional arrays instead of forcing always two-dimensional

7e8e3df

Small edits to docstring & comments

92f7a04

Add "ds_type": "poses" to poses dataset

79ebf7c

Make time origin at frame 0 == 0 seconds

7328f30

Add load_bboxes to API index

4a976a9

Remove pass and refactor dataset fetching if bboxes

81e7e55

Fix test for time coordinates if fps passed

373af2a

Add docstring examples to explain time origin

7b0bab6

Add use_frame_numbers_from_file when loading a bboxes dataset from file or VIA tracks file

9cb8261

Add examples to docstrings of use_frame_numbers_from_file

f1945e7

Change default behaviour to not use frame numbers from file

0032acc

Small edits to docstrings for doc building

2fe6176

Remove ignore comment

971884d

Update one-line summary of load_bboxes module for API reference

2edb5f2

Update one-line summary of load_bboxes module for API reference

c50cd7e

Fix backticks

80506e2

Swap examples in docstrings

6e5420d

sfmig force-pushed the smg/read-via-file-as-bbox-ds branch from d9cf96b to 6e5420d Compare July 31, 2024 10:19

sfmig added 2 commits July 31, 2024 11:20

Delete API index

9ccf40e

Fix from_file example

7c260b3

sfmig added this pull request to the merge queue Jul 31, 2024

Merged via the queue into main with commit 01c3cf6 Jul 31, 2024
17 checks passed

lochhh deleted the smg/read-via-file-as-bbox-ds branch October 25, 2024 16:36
