-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathNOTES.txt
executable file
·219 lines (206 loc) · 12.1 KB
/
NOTES.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
Unison Audio Baseline
Drum onset F1 (flat): 0.0
Drum onset F1 (full): 0.0
Drum onset F1 (midi_class): 0.0
Drum onset precision (flat): 0.0
Drum onset precision (full): 0.0
Drum onset precision (midi_class): 0.0
Drum onset recall (flat): 0.0
Drum onset recall (full): 0.0
Drum onset recall (midi_class): 0.0
Nondrum onset + offset + program F1 (flat): 0.2306
Nondrum onset + offset + program F1 (full): 0.06584
Nondrum onset + offset + program F1 (midi_class): 0.1194
Nondrum onset + offset + program precision (flat): 0.2456
Nondrum onset + offset + program precision (full): 0.06686
Nondrum onset + offset + program precision (midi_class): 0.1245
Nondrum onset + offset + program recall (flat): 0.2411
Nondrum onset + offset + program recall (full): 0.0683
Nondrum onset + offset + program recall (midi_class): 0.1236
Onset + offset + program F1 (flat): 0.2267
Onset + offset + program F1 (full): 0.06351
Onset + offset + program F1 (midi_class): 0.1164
Onset + offset + program precision (flat): 0.2397
Onset + offset + program precision (full): 0.06383
Onset + offset + program precision (midi_class): 0.1198
Onset + offset + program recall (flat): 0.2411
Onset + offset + program recall (full): 0.0683
Onset + offset + program recall (midi_class): 0.1236
Slakh test (pytorch)
Drum onset F1 (flat): 0.6808
Drum onset F1 (full): 0.6808
Drum onset F1 (midi_class): 0.6808
Drum onset precision (flat): 0.7103
Drum onset precision (full): 0.7103
Drum onset precision (midi_class): 0.7103
Drum onset recall (flat): 0.6651
Drum onset recall (full): 0.6651
Drum onset recall (midi_class): 0.6651
Nondrum onset + offset + program F1 (flat): 0.4218
Nondrum onset + offset + program F1 (full): 0.09942
Nondrum onset + offset + program F1 (midi_class): 0.3847
Nondrum onset + offset + program precision (flat): 0.4778
Nondrum onset + offset + program precision (full): 0.1123
Nondrum onset + offset + program precision (midi_class): 0.4354
Nondrum onset + offset + program recall (flat): 0.3826
Nondrum onset + offset + program recall (full): 0.09017
Nondrum onset + offset + program recall (midi_class): 0.3492
Onset + offset + program F1 (flat): 0.5037
Onset + offset + program F1 (full): 0.2846
Onset + offset + program F1 (midi_class): 0.4781
Onset + offset + program precision (flat): 0.5559
Onset + offset + program precision (full): 0.3138
Onset + offset + program precision (midi_class): 0.5273
Onset + offset + program recall (flat): 0.4651
Onset + offset + program recall (full): 0.2626
Onset + offset + program recall (midi_class): 0.4415
Onset F1: 0.6656
Onset precision: 0.7349
Onset recall: 0.6142
========================
(Thanks to Yun-Ning Hung, the key lies in isses of all_src.mid in Slakh!)
Drum onset F1 (flat): 0.7687
Drum onset F1 (full): 0.7687
Drum onset F1 (midi_class): 0.7687
Drum onset precision (flat): 0.8036
Drum onset precision (full): 0.8036
Drum onset precision (midi_class): 0.8036
Drum onset recall (flat): 0.7501
Drum onset recall (full): 0.7501
Drum onset recall (midi_class): 0.7501
Nondrum onset + offset + program F1 (flat): 0.5256
Nondrum onset + offset + program F1 (full): 0.1494
Nondrum onset + offset + program F1 (midi_class): 0.4957
Nondrum onset + offset + program precision (flat): 0.5808
Nondrum onset + offset + program precision (full): 0.1656
Nondrum onset + offset + program precision (midi_class): 0.5475
Nondrum onset + offset + program recall (flat): 0.4844
Nondrum onset + offset + program recall (full): 0.1373
Nondrum onset + offset + program recall (midi_class): 0.457
Onset + offset + program F1 (flat): 0.6
Onset + offset + program F1 (full): 0.3484
Onset + offset + program F1 (midi_class): 0.5797
Onset + offset + program precision (flat): 0.6527
Onset + offset + program precision (full): 0.3791
Onset + offset + program precision (midi_class): 0.6304
Onset + offset + program recall (flat): 0.5598
Onset + offset + program recall (full): 0.3247
Onset + offset + program recall (midi_class): 0.541
Onset F1: 0.7595
Onset precision: 0.8271
Onset recall: 0.7079
Slakh test (official)
Drum onset F1 (flat): 0.6789
Drum onset F1 (full): 0.6789
Drum onset F1 (midi_class): 0.6789
Drum onset precision (flat): 0.7053
Drum onset precision (full): 0.7053
Drum onset precision (midi_class): 0.7053
Drum onset recall (flat): 0.6662
Drum onset recall (full): 0.6662
Drum onset recall (midi_class): 0.6662
Nondrum onset + offset + program F1 (flat): 0.4225
Nondrum onset + offset + program F1 (full): 0.09958
Nondrum onset + offset + program F1 (midi_class): 0.3856
Nondrum onset + offset + program precision (flat): 0.4784
Nondrum onset + offset + program precision (full): 0.1124
Nondrum onset + offset + program precision (midi_class): 0.4362
Nondrum onset + offset + program recall (flat): 0.3834
Nondrum onset + offset + program recall (full): 0.09033
Nondrum onset + offset + program recall (midi_class): 0.3501
Onset + offset + program F1 (flat): 0.5039
Onset + offset + program F1 (full): 0.2846
Onset + offset + program F1 (midi_class): 0.4784
Onset + offset + program precision (flat): 0.555
Onset + offset + program precision (full): 0.3132
Onset + offset + program precision (midi_class): 0.5266
Onset + offset + program recall (flat): 0.4659
Onset + offset + program recall (full): 0.2631
Onset + offset + program recall (midi_class): 0.4425
Perceiver style (onset-F1 and onset-multi-instrument-F1):
Drum onset F1 (midi_class): 0.6808
Drum onset precision (midi_class): 0.7103
Drum onset recall (midi_class): 0.6651
Nondrum onset + offset + program F1 (midi_class): 0.5908
Nondrum onset + offset + program precision (midi_class): 0.6675
Nondrum onset + offset + program recall (midi_class): 0.5366
Onset + offset + program F1 (midi_class): 0.6208
Onset + offset + program precision (midi_class): 0.6847
Onset + offset + program recall (midi_class): 0.5733
Onset F1: 0.6656
Onset precision: 0.7349
Onset recall: 0.6142
========================
(Thanks to Yun-Ning Hung, the key lies in isses of all_src.mid in Slakh!)
Perceiver style (onset-F1 and onset-multi-instrument-F1, revised):
Drum onset F1 (midi_class): 0.7687
Drum onset precision (midi_class): 0.8036
Drum onset recall (midi_class): 0.7501
Nondrum onset + offset + program F1 (midi_class): 0.7139
Nondrum onset + offset + program precision (midi_class): 0.789
Nondrum onset + offset + program recall (midi_class): 0.6577
Onset + offset + program F1 (midi_class): 0.7296
Onset + offset + program precision (midi_class): 0.7941
Onset + offset + program recall (midi_class): 0.6803
Onset F1: 0.7595
Onset precision: 0.8271
Onset recall: 0.7079
Perceiver style on loop data:
Drum onset F1 (midi_class): 0.0
Drum onset precision (midi_class): 0.0
Drum onset recall (midi_class): 0.0
Nondrum onset + offset + program F1 (midi_class): 0.2104
Nondrum onset + offset + program precision (midi_class): 0.2097
Nondrum onset + offset + program recall (midi_class): 0.2263
Onset + offset + program F1 (midi_class): 0.2023
Onset + offset + program precision (midi_class): 0.1983
Onset + offset + program recall (midi_class): 0.2263
Onset F1: 0.3989
Onset precision: 0.4033
Onset recall: 0.4504
Instrument classification (from hidden state to actual generated output)
- decoder[-1]: 0.83
- decoder[0]: 0.82
- decoder[1]: 0.81
- encoder[-1]: 0.7
clearly decoder is responsible for transcribing instruments from audio.
slakh2100_flac_redux_test_16k_orig|compute_transcription_metrics
on_f1: 0.6656
on_overlap: 0.5312
on_precision: 0.7349
on_recall: 0.6142
onoff_f1: 0.4235
onoff_overlap: 0.7351
onoff_precision: 0.4672
onoff_recall: 0.3912
why ownself train version transcription no drums??? that is weird.
Drum onset F1 (flat): 0.02438
Drum onset F1 (full): 0.02438
Drum onset F1 (midi_class): 0.02438
Drum onset precision (flat): 0.7059
Drum onset precision (full): 0.7059
Drum onset precision (midi_class): 0.7059
Drum onset recall (flat): 0.01325
Drum onset recall (full): 0.01325
Drum onset recall (midi_class): 0.01325
Nondrum onset + offset + program F1 (flat): 0.3413
Nondrum onset + offset + program F1 (full): 0.08837
Nondrum onset + offset + program F1 (midi_class): 0.3093
Nondrum onset + offset + program precision (flat): 0.3209
Nondrum onset + offset + program precision (full): 0.08305
Nondrum onset + offset + program precision (midi_class): 0.2904
Nondrum onset + offset + program recall (flat): 0.372
Nondrum onset + offset + program recall (full): 0.0964
Nondrum onset + offset + program recall (midi_class): 0.3375
Onset + offset + program F1 (flat): 0.2879
Onset + offset + program F1 (full): 0.07846
Onset + offset + program F1 (midi_class): 0.2612
Onset + offset + program precision (flat): 0.3238
Onset + offset + program precision (full): 0.08737
Onset + offset + program precision (midi_class): 0.2935
Onset + offset + program recall (flat): 0.2625
Onset + offset + program recall (full): 0.07188
Onset + offset + program recall (midi_class): 0.2382
Onset F1: 0.6181
Onset precision: 0.6956
Onset recall: 0.5628