
Inference time for EfficientFormerV2 on 2080Ti #51

Open
zcbfpramtqs55675 opened this issue Feb 22, 2023 · 3 comments
@zcbfpramtqs55675

Hi, I tested EfficientFormerV2-S0 and EfficientFormerV2-S2 on a 2080Ti with input size 1x3x224x224 and got the following results:
EfficientFormerV2-S2: about 24 ms per input
EfficientFormerV2-S0: about 22 ms per input
Is this reasonable? It seems quite different from the A100 results in your paper. Any reply is appreciated, thanks.
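
(For reference, a minimal PyTorch timing sketch along these lines; the import path is an assumption about this repo's layout. The CUDA events and explicit `torch.cuda.synchronize()` calls matter: GPU work is asynchronous, so wall-clock timing without synchronization can badly misreport latency.)

```python
import torch
from efficientformer_v2 import efficientformer_v2_s0  # hypothetical import path

model = efficientformer_v2_s0().cuda().eval()
x = torch.randn(1, 3, 224, 224, device="cuda")

with torch.no_grad():
    for _ in range(50):          # warm-up: let cuDNN select kernels first
        model(x)
    torch.cuda.synchronize()

    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(200):
        model(x)
    end.record()
    torch.cuda.synchronize()     # wait for queued kernels before reading the timer

print(f"{start.elapsed_time(end) / 200:.2f} ms per forward pass")
```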

@giantmonkeyTC

I tested V2-S1 on a 3090 and got about 14 ms per sample.
I don't know why either. If you figure it out, could you please reply to this thread?

@alanspike
Collaborator

Hi, we use TensorRT to benchmark the latency. Here is the docker image.
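
(Not the authors' exact pipeline, but a hedged sketch of that flow: export the model to ONNX from PyTorch, then measure with TensorRT's `trtexec` inside the container. The constructor import and file name are placeholders.)

```python
import torch
from efficientformer_v2 import efficientformer_v2_s0  # hypothetical import path

model = efficientformer_v2_s0().eval()
dummy = torch.randn(1, 3, 224, 224)
torch.onnx.export(model, dummy, "effv2_s0.onnx", opset_version=13,
                  input_names=["input"], output_names=["output"])
# Then, inside the TensorRT container:
#   trtexec --onnx=effv2_s0.onnx --fp16
# trtexec reports mean GPU compute time after its own warm-up runs.
```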

@caixiongjiang

caixiongjiang commented Apr 26, 2023

I ran my segmentation model on a GeForce RTX 3060 with an EfficientFormerV2-S0 backbone and with a PoolFormer-S12 backbone; the results were 61 FPS and 108 FPS, respectively. I don't think this backbone's speed transfers well across hardware; it is specially designed for the iPhone. It is similar to segmentation models such as ENet: although the parameter count and the amount of computation are relatively small, the actual computation time is long, which may be because PyTorch has no acceleration for this kind of computation.
