This example demonstrates a multi-agent setup for handling different customer service requests in an airline context using the Swarm framework. The agents can triage requests, handle flight modifications, cancellations, and lost baggage cases.
This example uses the helper function run_demo_loop
, which allows us to create an interactive Swarm session.
- Triage Agent: Determines the type of request and transfers to the appropriate agent.
- Flight Modification Agent: Handles requests related to flight modifications, further triaging them into:
- Flight Cancel Agent: Manages flight cancellation requests.
- Flight Change Agent: Manages flight change requests.
- Lost Baggage Agent: Handles lost baggage inquiries.
Once you have installed dependencies and Swarm, run the example using:
python3 main.py
Note
These evals are intended to be examples to demonstrate functionality, but will have to be updated and catered to your particular use case.
For this example, we run function evals, where we input a conversation, and the expected function call ('None' if no function call is expected).
The evaluation cases are stored in eval/eval_cases/
subfolder.
[
{
"conversation": [
{ "role": "user", "content": "My bag was not delivered!" }
],
"function": "transfer_to_lost_baggage"
},
{
"conversation": [
{ "role": "user", "content": "I had some turbulence on my flight" }
],
"function": "None"
}
]
The script 'function_evals.py' will run the evals. Make sure to set n
to the number
of times you want to run each particular eval. To run the script from the root airline folder, execute:
cd evals
python3 function_evals.py
The results of these evaluations will be stored in evals/eval_results/