You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
Currently, there is no centralized mechanism within KaibanJS to evaluate and provide feedback on agents' outputs or the final results of their collective tasks. This makes it challenging to ensure logical correctness (e.g., scheduling conflicts) and meet user preferences (e.g., avoiding certain conditions like heat or snow) in complex workflows.
Describe the solution you'd like
Introduce a "Manager" agent or framework that can:
Evaluate: Assess both individual agents' outputs and the final results of their combined tasks based on two criteria:
Preferences: Allow personalized criteria (e.g., user dislikes certain weather conditions).
Feedback Mechanism: Automatically provide feedback to agents, enabling them to refine their work in subsequent iterations.
Cost Optimization: Enable the use of smaller, narrow-scope models for agents, while the "Manager" leverages larger reasoning models (e.g., self-hosted or local models) for evaluation and feedback tasks.
Describe alternatives you've considered
Using external validation systems to evaluate agent outputs and final results, which increases the complexity of integrations.
Adding manual checkpoints in the workflow to review outputs, which is time-consuming and defeats the purpose of automation.
Additional context
This feature would enable:
Automated quality control for agent workflows, ensuring logical and preference-based correctness.
Advanced use cases such as trip planning, collaborative projects, and multi-agent simulations.
Cost optimization by balancing smaller, task-specific models with larger reasoning models for evaluation.
The text was updated successfully, but these errors were encountered:
Description
Is your feature request related to a problem? Please describe.
Currently, there is no centralized mechanism within KaibanJS to evaluate and provide feedback on agents' outputs or the final results of their collective tasks. This makes it challenging to ensure logical correctness (e.g., scheduling conflicts) and meet user preferences (e.g., avoiding certain conditions like heat or snow) in complex workflows.
Describe the solution you'd like
Introduce a "Manager" agent or framework that can:
Describe alternatives you've considered
Additional context
This feature would enable:
The text was updated successfully, but these errors were encountered: