You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
While implementing #7638 and attempting to run qs8-dwconv-bench with xnn_qs8_dwconv_minmax_fp32_ukernel_25p8vc the benchmark encounters a malloc error after several tests. Running with 9p8vc is fine.
I noticed that the current qs8-dwconv-bench only uses primary_tile = 9 for its scalar benchmarks. Adding a benchmark test to include the scalar primary_tile=25 kernel results in the same apparent heap corruption.
I think this is a case of invalid parameters for 5x5.
In practice 5x5 is used by mobilenet v3, while 3x3 is used in mobilenet v2.
So you could try mobilenet v3
Taking a quick look at the current
models/benchmark --benchmark_filter=V3
FP32MobileNetV3Large/real_time 5602 us 5601 us 125 cpufreq=3.3723G
FP32MobileNetV3Small/real_time 1722 us 1722 us 405 cpufreq=3.30428G
FP16MobileNetV3Large/real_time 14207 us 14200 us 49 cpufreq=3.4632G
FP16MobileNetV3Small/real_time 4880 us 4879 us 146 cpufreq=3.5469G
The QS8 model is missing.
The old end2end had it if you dig up old versions.
TFLite benchmark_model can do a .tflite file if you can get a mobilenet v3 model.
While implementing #7638 and attempting to run qs8-dwconv-bench with xnn_qs8_dwconv_minmax_fp32_ukernel_25p8vc the benchmark encounters a malloc error after several tests. Running with 9p8vc is fine.
I noticed that the current qs8-dwconv-bench only uses primary_tile = 9 for its scalar benchmarks. Adding a benchmark test to include the scalar primary_tile=25 kernel results in the same apparent heap corruption.
Add to bench/qs8-dwconv.cc
Test result.
I'm not clear if the test case is invalid here, or if there is a bug within DWConvBenchmark.
The text was updated successfully, but these errors were encountered: