Skip to content

The Azure AI Assistant Tool is experimental Python application and middleware designed to simplify the development, experimentation, testing, and debugging of OpenAI assistants.

License

Notifications You must be signed in to change notification settings

jhakulin/azureai-assistant-tool

Β 
Β 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ€–πŸ› οΈAzure AI Assistants Tool

License: MIT  Welcome  CrossPlatform

⚑Develop AI agents powered by Azure OpenAI Assistants, Chat Completion and Realtime APIs at lightning speed⚑

Table of Contents

  • πŸ€–πŸ› οΈ Azure AI Assistants Tool
  • πŸ†• Latest News
  • 🧱 What is Assistants from Azure OpenAI Service?
  • πŸš€ How does this Tool help?
  • πŸ”ŠπŸŽ€ OpenAI Realtime Support (Experimental)
  • ✨ Quick Start: Getting Started with the Tool
  • πŸ“– License
  • Contributing
  • Code of Conduct
  • Getting Help

πŸ€–πŸ› οΈ Azure AI Assistants tool

Azure AI Assistants tool is an experimental Python application and middleware designed to simplify the development, experimentation, testing, and debugging of AI agents created with Azure OpenAI Assistants, Chat Completion and/or Realtime API based technologies. Use this powerful, easy-to-setup low-code playground tool to quickly experiment and build AI agents within your application.

Important

The Azure AI Assistant Tool is experimental, created to support your product ideation and experimentation using AI agents. As the tool evolves, expect significant updates and improvements. We welcome feedback and contributions to help shape its future.

πŸ†• Latest News

  • December 25, 2024: Released 0.5.1 version of the tool containing o1 Model Support which allows to use o1 models with ChatAssistant (with limited completion settings) and OpenAI Realtime Support, with real-time audio interaction capabilities. The earlier Azure Cognitive Services for speech input and output has been removed from the tool, however Azure Speech SDK is still used within OpenAI Realtime for keyword based detection. For more detailed information, refer to the OpenAI Realtime Support section below.

🧱 What is Assistants from Azure OpenAI service?

🌟Assistants, API from Azure OpenAI Service, is a stateful evolution of the Chat Completions API. Assistants makes it easier for developers to create applications with sophisticated copilot-like experiences in their applications and enable developer access to powerful tools like Code Interpreter and File Search. Assistants is built on the same capabilities that power OpenAI’s GPT product and offers unparalleled flexibility for creating a wide range of copilot-like applications. Copilots created with Assistants can sift through data, suggest solutions, and automate tasks and use cases span a wide range: AI-powered product recommender, sales analyst app, coding assistant, employee Q&A chatbot, and more.

Features include:

  • Inbuilt thread and memory management
  • Advanced Data Analysis, create data visualizations and solving complex code and math problems with Code Interpreter
  • Retrieval Augmented Generation with File Search tool
  • Build your own tools or call external tools and APIs with Function Calling

Learn more about Assistants on Azure OpenAI Service:

πŸš€ How Does This Tool Help?

  • Enable Rapid AI Agent Prototyping: Rapidly create AI agent prototypes using OpenAI's technologies, Assistants, Chat Completion, and Realtime APIs. This includes user-friendly configurability of different agents, built-in system functions, specific tool configurations, and LLM configurations.

  • Enhance Developer Productivity: Streamline the agent development process through built-in middleware libraries and tools, utilizing tools in prompt engineering to automate your coding tasks and integrate AI capabilities into your copilot applications more effectively.

πŸ”ŠπŸŽ€ OpenAI Realtime Support (Experimental)

This section covers the Realtime capabilities for AI agent prototyping with OpenAI's Realtime APIs, focusing on speech and text input/output through real-time WebSocket communication.

Please note that these capabilities are offered as an experimental feature. They are still under active development and are intended primarily for exploration, demos, or proof-of-concept usage. We do not recommend using these features in production or business-critical applications until further notice.

Key Features

  • Real-time Audio Interaction: Use the Realtime API with speech input using predefined and integrated keyword Computer to trigger the conversation.
    • Using keyword can be helpful to optimize cost and reliability of your application. To create your own keywords, visit Creating the Custom Keyword. Currently only 1 keyword is supported.
  • Real-time Text Interaction: Use the Realtime API with text input. The agent can respond back with audio or text.
  • Local Voice Activity Detection: Efficiently manage audio data by detecting speech activities by using local voice activity detection.
  • Function Calling: Customize the realtime agent with your own functions which runs asynchronously in the background.
  • Configurable AI Options: Fine-tune realtime agent responses and behaviors with different options in Realtime API.

Demo Video

Check out the demo video to see the OpenAI Realtime Support in action!

RealtimeAssistant.mp4

Resources

✨ Quick Start: Getting Started with the Tool

Step 1: Complete Azure prerequisities

  • Create an Azure Subscription for free, if you don't have one already
  • Apply for access to Azure OpenAI Service in this Azure Subscription. Azure OpenAI Service is currently a limited access service so access is granted through an application process. Most applications are processed within a day of applying.
  • Azure OpenAI Assistants is currently available in Sweden Central, East US 2, and Australia East. We are expanding our models and regional availability - see the models guide for the latest model and regional availability for Assistants.
  • Create an Azure OpenAI resource on Azure Portal with one of the Assistants supported models from the models guide page deployed in that region

Step 2: Install Python

The Azure AI Assistant tool requires Python >= 3.8 on your system. You can download Python from the official website. Latest stable Python version is recommended.

Create a new Python virtual environment. Virtual environment is recommended for safe install the SDK packages:

  • On MacOS and Linux run:
    python3 -m venv .venv
    source .venv/bin/activate
    
  • On Windows run:
    py -3 -m venv .venv
    .venv\scripts\activate
    

Step 3: Install the tool and dependencies

Run the following command in your terminal to install all the necessary dependencies as specified in the requirements.txt file.

pip install -r requirements.txt

Step 4: Build and install azure.ai.assistant Python library (Optional)

We have included the azure.ai.assistant library currently within this repository as part of the alpha status of the project. The plan is to release library on PyPI for more convenience installation in future.

Build the wheel for azure.ai.assistant library using the following instructions, or use the wheel package provided under the repo's release tags directly.

  • Python wheel and setuptools packages are required to build the library package. You can install them using the commands: pip install wheeland pip install setuptools
  • Go to thesdk/azure-ai-assistant folder
  • Build the wheel using following command: python setup.py sdist bdist_wheel
  • Go to generated dist folder
  • Install the generated wheel using following command: pip install --force-reinstall azure_ai_assistant-0.x.xa1-py3-none-any.whl
    • This installation will pick the necessary dependencies for the library (openai, python-Levenshtein, fuzzywuzzy, Pillow, requests)

Step 5: Find and copy your Azure OpenAI Service APIkey, endpoint and model deployment version

To successfully make a call against the Azure OpenAI service, you'll need the following:

- ENDPOINT: This value can be found in the Keys and Endpoint section when examining your resource from the Azure portal. Alternatively, you can find the value in Azure OpenAI Studio > Playground > View code. An example endpoint is: https://docs-test-001.openai.azure.com/.

- API-KEY: This value can be found in the Keys and Endpoint section when examining your resource from the Azure portal. You can use either KEY1 or KEY2.

- MODEL DEPLOYMENT-NAME: This value will correspond to the custom name you chose for your deployment when you deployed a model. This value can be found under Resource Management > Model Deployments in the Azure portal or alternatively under Management > Deployments in Azure OpenAI Studio.

Next, go to your resource in the Azure portal. The Keys and Endpoint can be found in the Resource Management section (see image below). Copy your endpoint and access key as you'll need both for authenticating your API calls. You can use either KEY1 or KEY2. Always having two keys allows you to securely rotate and regenerate keys without causing a service disruption. portal keys and endpoint

Step 6: Setup Environment Variables

Create and assign persistent environment variables for your key and endpoint.

⌨️ Command Line (CLI)

  1. Set the Azure OpenAI Service key, endpoint. Version is optional and default currently is 2024-05-01-preview for assistants.

Windows: Use setx or set command depending on your preference.

setx AZURE_OPENAI_API_KEY "Your Azure OpenAI Key"
setx AZURE_OPENAI_ENDPOINT "Your OpenAI Endpoint"
setx AZURE_OPENAI_API_VERSION "Azure OpenAI version"
setx OPENAI_API_KEY "Your OpenAI Key"

Linux/Mac

export AZURE_OPENAI_API_KEY="Your Azure OpenAI Key"
export AZURE_OPENAI_ENDPOINT="Your OpenAI Endpoint"
export AZURE_OPENAI_API_VERSION="Azure OpenAI version"
export OPENAI_API_KEY="Your OpenAI Key"
  1. Set Cognitive Services Speech key (this is optional and if you want to use speech input & output).

Windows:

setx AZURE_AI_SPEECH_KEY "Your Speech Key"
setx AZURE_AI_SPEECH_REGION "Your Speech Region"

Linux/Mac

export AZURE_AI_SPEECH_KEY="Your Speech Key"
export AZURE_AI_SPEECH_REGION="Your Speech Region"

Step 7: Launch the application

⌨️ Command Line (CLI)

In the root of this repository, command:

python main.py

This command will start the Azure AI Assistant Tool and you can interact with it through its user interface which looks something like this:

Ai-Assistant-Tool-screenshot

Tool In Action - Add Functions To Your Assistant

Ai-Assistant-Tool-Functions-screenshot

πŸ“– License

The Azure AI Assistant Tool is licensed under the MIT License. See the LICENSE file for more details.

πŸ“£ Notice Regarding PySide6

The Azure AI Assistant Tool uses PySide6, which is licensed under the GNU Lesser General Public License (LGPL). By using PySide6, you are able to modify and redistribute the library under the same license. For more information on PySide6's license, please visit Qt Licensing.

Contributing

We welcome contributions and suggestions! Please see the contributing guidelines for details.

Code of Conduct

This project has adopted the Microsoft Open Source Code of Conduct. Please see the code of conduct for details.

Getting Help

Issues

If you find a bug in the source code or a mistake in the documentation, feel free to submit bug report. Or even better you could submit a pull request with a fix.

Feature Requests

If there's an feature that you'd like to see added, feel free to file a Feature Request.

If you'd like to implement it yourself, please refer to our contributing guidelines.

About

The Azure AI Assistant Tool is experimental Python application and middleware designed to simplify the development, experimentation, testing, and debugging of OpenAI assistants.

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 100.0%