Added page to show statistics from use of the workshop #669

elisakiv · 2023-08-17T17:38:31Z

Proposed changes

Created a statistics page to view statistics regariding the use of the workshop, such as reservations made, and when the workshop is in use.

Areas to review closely

Checklist

(If any of the points are not relevant, mark them as checked)

Fixed issues with importing charts.

codecov · 2023-08-31T17:50:48Z

Codecov Report

Merging #669 (ab926dc) into dev (6f55b7d) will increase coverage by 0.04%.
The diff coverage is 96.87%.

@@            Coverage Diff             @@
##              dev     #669      +/-   ##
==========================================
+ Coverage   88.16%   88.21%   +0.04%     
==========================================
  Files         152      152              
  Lines        6187     6218      +31     
==========================================
+ Hits         5455     5485      +30     
- Misses        732      733       +1

Files	Coverage Δ
src/makerspace/urls.py	`100.00% <ø> (ø)`
src/makerspace/views.py	`98.70% <96.87%> (-1.30%)`	⬇️

Gunvor4 · 2023-09-28T18:49:57Z

On the third plot, some of the sewing machines appear twice

Gunvor4 · 2023-09-28T18:50:28Z

On a normal screen, with two plots next to each other, the title is above the second plot. It would look better if it was in the middle. The two plots are also not situated in the middle of the page, which is fine, but then the title of the page should be adjusted relative to the plots, and not the page. The same issue is still there when the screen size gets smaller and only one plot is shown per row.

Gunvor4 · 2023-09-28T18:51:07Z

In the two plots counting how many times a 3D printer or a sewing machine has been booked by someone, it looks a bit strange that there are decimals on the y-axis, since it is impossible for a machine to be booked 0.5 times. The two other plot, on the other hand, are counting hours, so there it would have been ok to see decimals. But these plots only have integers on the y-axis. The styles should be switched – the "number of times" plots should not have decimals, and the "how many hours" plots, could.

Gunvor4 · 2023-09-28T18:52:16Z

The last plot doesn't show any numbers on the y-axis, and the axis also doesn't have a label. I guess what is shown here is the result of a calculation? Still – would it be possible to give the y-axis a label or introduce some numbers here, to make it clearer what the plot actually shows? Maybe add a label like "equipment currently in use" or "machines or workstations currently in use"?

Gunvor4 · 2023-09-28T18:52:43Z

On the x-axis on the last plot – the numbers are tilted, like for the names of 3D-printers or sewing machines. But on this plot, the only thing the x-axis shows are time points, which are shorts, so tilting them looks a bit strange.

src/makerspace/templates/makerspace/statistics.html

Gunvor4 · 2023-09-28T19:18:34Z

src/makerspace/static/makerspace/js/statistics_charts.js

+                    label: "How long each printer has been reserved in total",
+                    data: data.map((element)=> (element.len)),
+                    backgroundColor: [
+                        'rgba(249, 65, 68)',


I like the colors of the plots. Still, it might be more logical if they were gradients made from MAKE's official colors, which are MAKE yellow (#F8C811), dark blue (#0C426A) and turquoise (#00EBB3), in addition to two different shades of gray - #4A4B4C and #DCDDDE. We also have an orange variation of the yellow (#F5A13F).
If you decide to keep the colors you already have, maybe the rainbow plot could be smoother, because in the second and fourth bar looks more orange, while the third and the fifth looks more yellow. Also – the ninth bar looks more purple and the tenth looks more blue, while the other plots seem to follow a gradient.
Also – if you have 10 colors in your gradient, and we have more than 10 3D printers, we would go from the 10th color and back to the first, which will not look too nice. Maybe the gradient could be calculated in the JavaScript file, from the first and the last color, and the number of bars? If not, it would also be possible to reverse the gradient, as I've shown in the example. In the example, I used two of the official MAKE colors, and created a gradient using the tool on this page.

Suggested change

'rgba(249, 65, 68)',

'#00EBB3',

'#03C1A1',

'#06978F',

'#096C7C',

'#0C426A',

'#096C7C',

'#06978F',

'#03C1A1',

Gunvor4 · 2023-09-28T19:20:06Z

src/makerspace/views.py

+    get_time_in_hours = 3600000000  # number the reservation length needs to be divided by to get the length in hours
+    filter_reservations = Q(reservations__special=False) & Q(reservations__event__isnull=True)
+
+    reservations = Reservation.objects.prefetch_related(Prefetch('machine', queryset=Machine.objects.


The exprassions for reservations (line 80 – 82) sewingmachines (line 84 – 87) and printers (line 89 – 92) are extremely long and ugly. Would it be possible to refactor these, to make them more readable?

src/makerspace/templates/makerspace/statistics.html

Gunvor4 · 2023-09-28T19:25:36Z

I don't think you need a colon after the title of a plot ;-)

Gunvor4 · 2023-09-28T19:26:13Z

It might be an idea to sort the printers, either alphabetically, or to keep the same order as they appear in on the website. Now I added some fake ultimakers, then some fake raise3D printers, a fake SLA printer, and then some fake ultimakers again, and now I have the ultimakers at both sides of the plot, with raise 3D printers and SLA printers in the middle.

Gunvor4 · 2023-09-28T19:27:00Z

Aaaand now I'm done! :-) Sorry about being nitpicky :-P

ddabble

Amazing job! Properly displaying statistics is usually a pretty complex task, so I'm genuinely impressed by what you've created 😊 However, complexity also necessarily increases the risk of overlooking smaller details, and so I've found quite a few things to provide feedback on 😅 If you'd like some help making any of the suggested changes, or if you'd simply like to offload some work, I'd be happy to help :)

First off, it seems like no parts of the code you've added use any of the things in the makerspace app, and instead only use things in the make_queue app... I guess it makes sense you added it to the former app since all of the statistics relate to the physical makerspace (another reason why the two mentioned apps have pretty terrible names, in my opinion 😅), but the code should really be moved to the make_queue app. This will require a little work, but I'd absolutely recommend you do this, for future maintainability 🙂

It would be great if you could fix the issues detected by Code Climate (link is in the status checks at the bottom of this PR page) 🙂 (It should be mentioned that fixing auto-detected issues like those, might not necessarily make the code any better - it might even make the code worse in some cases - but they're usually a very good indication!) It looks like most of the issues can/will be fixed after addressing some of my comments below, though :)

Lastly, it would be fantastic if you could add some unit tests for the view code, to both make sure that it works the way we think it does, and to provide concrete examples for future developers, which can help them understand the intended function of the code 😊

Btw, well done minimizing the number of DB queries using prefetching! That's often quite a daunting task, but it looks like you've implemented it well! 😄

ddabble · 2023-11-23T19:16:43Z

CHANGELOG.md

@@ -31,6 +33,7 @@ A summary of changes made to the codebase, grouped per deployment.
    - _Examples can be passwords of Google Workspace users that only exist for technical purposes,
      or credentials for the admin page of a physical router_
 - Added an "internal" ribbon label on internal articles and events
+>>>>>>> 66675ec2 (Added page to show statistics from use of the workshop)


Looks like you forgot to remove all the remnants of a merge conflict :p

Suggested change

>>>>>>> 66675ec2 (Added page to show statistics from use of the workshop)

ddabble · 2023-11-23T19:20:50Z

src/locale/nb/LC_MESSAGES/django.po

+msgstr "Inkluderer 3D-printere, Raise3D-printere og SLA 3D-printere"
+
+#: src/makerspace/templates/makerspace/statistics.html:32
+msgid "Number of reservations pr. 3D-printer"


The abbreviation "pr." is not really used in English, as far as I'm aware 🤔 Also, there is no dash between "3D" and "printer" in English 🙂

Suggested change

msgid "Number of reservations pr. 3D-printer"

msgid "Number of reservations per 3D printer"

ddabble · 2023-11-23T19:21:10Z

src/locale/nb/LC_MESSAGES/django.po

+msgstr "Total reservasjonstid for hver symaskin"
+
+#: src/makerspace/templates/makerspace/statistics.html:52
+msgid "Number of reservations pr. sewing machine"


Suggested change

msgid "Number of reservations pr. sewing machine"

msgid "Number of reservations per sewing machine"

ddabble · 2023-11-23T19:21:52Z

src/locale/nb/LC_MESSAGES/django.po

+msgstr "Antall reservasjoner per symaskin"
+
+#: src/makerspace/templates/makerspace/statistics.html:60
+msgid "Longest printer reservations"


Suggested change

msgid "Longest printer reservations"

msgid "Longest 3D printer reservations"

ddabble · 2023-11-23T19:22:03Z

src/locale/nb/LC_MESSAGES/django.po

+
+#: src/makerspace/templates/makerspace/statistics.html:60
+msgid "Longest printer reservations"
+msgstr "Lengste 3D-printer reservasjoner"


Suggested change

msgstr "Lengste 3D-printer reservasjoner"

msgstr "Lengste 3D-printerreservasjoner"

ddabble · 2023-11-26T16:23:38Z

src/makerspace/views.py

+    def get_context_data(self, **kwargs):
+        context = super().get_context_data(**kwargs)
+        span_of_printer_reservations = list(self.get_machines_with_reservations("3D-printere").
+                                            annotate(len=Sum((F('reservations__end_time') - F('reservations__start_time')) / self.get_time_in_hours, output_field=IntegerField())).values('id', 'name', 'len'))


Please use pk instead of id in your code wherever possible, as per the common code smells guide 🙂

Suggested change

annotate(len=Sum((F('reservations__end_time') - F('reservations__start_time')) / self.get_time_in_hours, output_field=IntegerField())).values('id', 'name', 'len'))

annotate(len=Sum((F('reservations__end_time') - F('reservations__start_time')) / self.get_time_in_hours, output_field=IntegerField())).values('pk', 'name', 'len'))

ddabble · 2023-11-26T16:37:43Z

src/makerspace/views.py

+        number_of_printer_reservations = list(self.get_machines_with_reservations("3D-printere").annotate(number_of_reservations=Count('reservations')).values('id', 'name', 'number_of_reservations'))
+        longest_printer_reservations = list(self.get_machines_with_reservations("3D-printere").
+                                            annotate(len=Sum((F('reservations__end_time') - F('reservations__start_time')) / self.get_time_in_hours, output_field=IntegerField())).
+                                            order_by('-len').values('id', 'name', 'reservations', 'len'))[:3]


This 3 is a magic number and should preferably be replaced with a field on the view (e.g. NUM_LONGEST_RESERVATIONS_DISPLAYED = 3) :)

Also, limiting querysets using the list slicing syntax should be done on the queryset itself before converting it to a list, like this:

Suggested change

order_by('-len').values('id', 'name', 'reservations', 'len'))[:3]

order_by('-len').values('id', 'name', 'reservations', 'len')[:3])

Otherwise, the entire queryset (probably thousands of rows) will be converted into a list first, and then sliced 😅

Lastly, correct me if I'm wrong, but doesn't the longest_printer_reservations queryset contain the three machines that have been reserved the longest duration in total, and not the three machines with the longest individual reservations, as suggested by the variable name..? (I haven't run the code locally, so I don't know what the graphs actually display 😅)

Oh wait, after testing some of the code locally, it seems like longest_printer_reservations does indeed contain what the name suggests, and the annotated len attribute's value is, appropriately, the duration of each reservation - in contrast to the renaming suggestion I made in a comment above 😅 The code makes this pretty confusing, since each row/element of the list does not represent each machine - the opposite of what you'd expect, and what the lists above and below contain. (I'm explaining this to avoid any potential misunderstandings:) Instead, due to the way values() works, when providing the name of a many-to-many field or a reverse many-to-one manager ('reservations' in this case), it will return one row/element for each combination the multivalued fields provided (one row per reservation, in this case), as explained in the warning at the bottom of the values() section in the docs. Under the values_list() section, this is mentioned:

values() and values_list() are both intended as optimizations for a specific use case: retrieving a subset of data without the overhead of creating a model instance. This metaphor falls apart when dealing with many-to-many and other multivalued relations (such as the one-to-many relation of a reverse foreign key) because the “one row, one object” assumption doesn’t hold.

So, I think code relying on such an obscure fact of the values() method should be avoided (or at the very least explicitly pointed out and explained in comments). I think I found a fairly easy replacement, however, which also makes the query slightly less complex by going through the reservation table instead of the machine table: 🙂(This contains some elements from other parts of the code and from my other suggestions, as you can see :))

longest_printer_reservations = ( Reservation.objects.filter(event=None, special=False, machine__machine_type__pk=1) .annotate(duration=F('end_time') - F('start_time')) .order_by('-duration') .select_related('machine') [:3] ) longest_printer_reservations_list = [] for reservation in longest_printer_reservations: duration: timedelta = reservation.duration longest_printer_reservations_list.append({ 'machine_name': reservation.machine.name, 'duration_hours': duration.total_seconds() // (60 * 60), })

longest_printer_reservations_list can then be inserted into the context variables 🙂 Any thoughts?

ddabble · 2023-11-26T17:23:56Z

src/makerspace/views.py

+        span_of_printer_reservations = list(self.get_machines_with_reservations("3D-printere").
+                                            annotate(len=Sum((F('reservations__end_time') - F('reservations__start_time')) / self.get_time_in_hours, output_field=IntegerField())).values('id', 'name', 'len'))


The fact that the duration calculated here has a unit of microseconds, is not the case for all DBMSs (see for example the note in the Django docs about PostgreSQL). I think we should write code that is as DBMS-agnostic as practically possible, and so it would be better to remove this calculation from the query...

Suggested change

span_of_printer_reservations = list(self.get_machines_with_reservations("3D-printere").

annotate(len=Sum((F('reservations__end_time') - F('reservations__start_time')) / self.get_time_in_hours, output_field=IntegerField())).values('id', 'name', 'len'))

span_of_printer_reservations = list(self.get_machines_with_reservations("3D-printere").

annotate(len=Sum(F('reservations__end_time') - F('reservations__start_time'))).values('id', 'name', 'len'))

...and instead do it after, using timedelta.total_seconds():

for machine_dict in span_of_printer_reservations: total_reserved_duration: timedelta = machine_dict.pop('len') machine_dict['total_reserved_hours'] = total_reserved_duration.total_seconds() // (60 * 60)

This would be natural to place inside a helper method, of course.

If you decide to implement my suggestion above, then I guess the len attribute could be renamed to
total_reserved_duration (instead of total_reserved_hours, as I suggested above - since that name is inserted into machine_dict in my suggestion). Or any other more fitting name you might come up with 😊

ddabble · 2023-11-26T17:53:42Z

src/makerspace/views.py

+        number_of_sewing_machine_reservations = list(self.get_machines_with_reservations("Symaskiner").
+                                                     annotate(number_of_reservations=Count('reservations')).values('id', 'name', 'number_of_reservations'))
+        get_time_distribution = self.get_time_distribution()
+        context.update({'span_of_printer_reservations': span_of_printer_reservations,
+                        'span_of_sewing_machine_reservations': span_of_sewing_machine_reservations,
+                        'longest_printer_reservations': longest_printer_reservations,
+                        'number_of_printer_reservations': number_of_printer_reservations,
+                        'number_of_sewing_machine_reservations': number_of_sewing_machine_reservations,


The code might be made slightly cleaner if you moved the list() and values() calls (or just one of them) to the context variable dict, like this: 🤔

Suggested change

number_of_sewing_machine_reservations = list(self.get_machines_with_reservations("Symaskiner").

annotate(number_of_reservations=Count('reservations')).values('id', 'name', 'number_of_reservations'))

get_time_distribution = self.get_time_distribution()

context.update({'span_of_printer_reservations': span_of_printer_reservations,

'span_of_sewing_machine_reservations': span_of_sewing_machine_reservations,

'longest_printer_reservations': longest_printer_reservations,

'number_of_printer_reservations': number_of_printer_reservations,

'number_of_sewing_machine_reservations': number_of_sewing_machine_reservations,

number_of_sewing_machine_reservations = self.get_machines_with_reservations("Symaskiner").

annotate(number_of_reservations=Count('reservations'))

get_time_distribution = self.get_time_distribution()

context.update({'span_of_printer_reservations': span_of_printer_reservations,

'span_of_sewing_machine_reservations': span_of_sewing_machine_reservations,

'longest_printer_reservations': longest_printer_reservations,

'number_of_printer_reservations': number_of_printer_reservations,

'number_of_sewing_machine_reservations': list(number_of_sewing_machine_reservations.values('id', 'name', 'number_of_reservations')),

ddabble · 2023-11-26T20:04:47Z

src/makerspace/views.py

+
+    def get_machines_with_reservations(self, machine_type):
+        filter_reservations = Q(reservations__special=False) & Q(reservations__event__isnull=True)
+        return Machine.objects.prefetch_related('machine_type', 'reservations').filter(Q(machine_type__name__icontains=machine_type) & filter_reservations).order_by('machine_type')


Use select_related() for foreign key fields, as explained in one of my comments above 🙂

Suggested change

return Machine.objects.prefetch_related('machine_type', 'reservations').filter(Q(machine_type__name__icontains=machine_type) & filter_reservations).order_by('machine_type')

return Machine.objects.select_related('machine_type').prefetch_related('reservations').filter(Q(machine_type__name__icontains=machine_type) & filter_reservations).order_by('machine_type')

Added page to show statistics from use of the workshop

d7baa9a

elisakiv requested a review from Gunvor4 August 17, 2023 17:38

elisakiv marked this pull request as draft August 17, 2023 17:42

Filtered out MAKE events and special reservations.

c046436

Fixed issues with importing charts.

elisakiv marked this pull request as ready for review August 31, 2023 17:55

Fixed bug with length of sewingmachine reservation chart

0c7a380