
where InferenceRequest init? #1650

Open
lanzhiwang opened this issue Mar 27, 2024 · 0 comments

I followed this documentation: https://mlserver.readthedocs.io/en/stable/examples/sklearn/README.html. After starting mlserver, I can get the correct results by following the steps there.
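
For reference, the kind of request I send looks roughly like this (a sketch following that sklearn example; the model name mnist-svm, version v0.1.0, and the input shape come from the tutorial and may differ in other setups):

    import requests

    # V2 inference payload, shaped like the sklearn example's input.
    inference_request = {
        "inputs": [
            {
                "name": "predict",
                "shape": [1, 64],
                "datatype": "FP32",
                "data": [0.0] * 64,  # one flattened 8x8 digit image
            }
        ]
    }

    response = requests.post(
        "http://localhost:8080/v2/models/mnist-svm/versions/v0.1.0/infer",
        json=inference_request,
    )
    print(response.json())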

I took a look at the MLServer source code and found that the relevant routes are registered like this:

    APIRoute(
        "/v2/models/{model_name}/versions/{model_version}/infer",
        endpoints.infer,
        methods=["POST"],
    ),

After the client sends the HTTP request, the endpoints.infer method is invoked, and it in turn calls the model to produce the final result.
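
To illustrate my understanding of that dispatch, here is a standalone sketch (not MLServer's code; dummy_infer is a hypothetical handler) of how an APIRoute entry like the one above wires a URL template to a handler:

    from fastapi import FastAPI
    from fastapi.routing import APIRoute

    async def dummy_infer(model_name: str, model_version: str) -> dict:
        # Path parameters from the URL template are passed in automatically.
        return {"model_name": model_name, "model_version": model_version}

    app = FastAPI()
    app.router.routes.append(
        APIRoute(
            "/v2/models/{model_name}/versions/{model_version}/infer",
            dummy_infer,
            methods=["POST"],
        )
    )

With this, a POST to /v2/models/foo/versions/v1/infer reaches dummy_infer with the path parameters filled in.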

Here is what confuses me:

    async def infer(
        self,
        raw_request: Request,
        raw_response: Response,
        payload: InferenceRequest,
        model_name: str,
        model_version: Optional[str] = None,
    ) -> InferenceResponse:

The endpoints.infer method has a payload parameter of type InferenceRequest, which represents the user's input.

What I want to ask is: where in the code is the HTTP body instantiated into an InferenceRequest object? I can't find the relevant part in the code. Where is this logic implemented?
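
For context, my current guess is that FastAPI itself does this. Here is a generic FastAPI sketch (not MLServer's code; EchoRequest is a made-up stand-in for InferenceRequest) of that mechanism: a handler parameter annotated with a pydantic.BaseModel subclass is treated as the request body, and FastAPI parses and validates the JSON into an instance before calling the handler.

    from fastapi import FastAPI
    from pydantic import BaseModel

    class EchoRequest(BaseModel):
        # Hypothetical stand-in for InferenceRequest.
        inputs: list

    app = FastAPI()

    @app.post("/echo/{name}")
    async def echo(payload: EchoRequest, name: str) -> dict:
        # By the time this runs, FastAPI has already turned the JSON body
        # into an EchoRequest instance and validated it.
        return {"name": name, "inputs": payload.inputs}

Is this the mechanism at work here, and if so, where in MLServer does it happen?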
