Skip to content

Commit 7c32b68

Browse files
authored
[Frontend] correctly record prefill and decode time metrics (#10853)
Signed-off-by: Tomer Asida <[email protected]>
1 parent 7090c27 commit 7c32b68

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

vllm/engine/metrics.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -599,9 +599,9 @@ def _log_prometheus(self, stats: Stats) -> None:
599599
stats.time_queue_requests)
600600
self._log_histogram(self.metrics.histogram_inference_time_request,
601601
stats.time_inference_requests)
602-
self._log_histogram(self.metrics.histogram_decode_time_request,
603-
stats.time_prefill_requests)
604602
self._log_histogram(self.metrics.histogram_prefill_time_request,
603+
stats.time_prefill_requests)
604+
self._log_histogram(self.metrics.histogram_decode_time_request,
605605
stats.time_decode_requests)
606606
self._log_histogram(self.metrics.histogram_time_in_queue_request,
607607
stats.time_in_queue_requests)

0 commit comments

Comments
 (0)