TypeAgent Shell is an UI entry point to TypeAgent sample code that explores architectures for building interactive agents with natural language interfaces using structured prompting and LLM.
TypeAgent Shell is a personal agent that takes user request and use an extensible set of agents to perform actions, answer questions, and carry a conversation. TypeAgent CLI's interactive mode is a command line version of the app, and both shared the core dispatcher component. Please read dispatcher's README.md on example requests and usage.
TypeAgent shell is built using Electron. Follow these instructions to install the libraries needed to build and use in the Linux/WSL environment.
pnpm run shellOn Windows, if you notice a lag when starting up the shell, you can add the source code folder to the exclusions list for Windows Defender by following these instructions.
Additionally, if you are running MS Graph based sample agents like Calendar and Email, there is an auth token persisted in the identity cache that can occasionally get corrupted. This could also slow down the start up time of the shell, you can delete that by running:
del %LOCALAPPDATA%\.IdentityService\typeagent-tokencacheWhen connected to the agent server (remote mode), TypeAgent Shell supports full multi-conversation management. Conversations persist across sessions and can be created, renamed, switched, and deleted from the UI.
When the Shell connects to the agent server, it automatically joins a conversation named "Shell", creating it if it does not already exist. This conversation persists across restarts — history is replayed on reconnect, with past messages shown in grayscale and a ─── now ─── separator marking where new activity begins.
Use /conversation (or the @conversation alias) to manage conversations from the chat input:
| Command | Description |
|---|---|
/conversation list |
List all conversations |
/conversation new [name] |
Create a new conversation |
/conversation switch <id|name> |
Switch to a conversation |
/conversation rename <id> <name> |
Rename a conversation |
/conversation delete <id|name> |
Delete a conversation |
When you switch to a different conversation:
- The chat area clears and replays that conversation's history, shown in grayscale to distinguish it from new activity.
- A
─── now ───separator marks where new messages will appear. - A status message confirms which conversation you have joined:
Connected to conversation 'X'.
If you have multiple clients connected (e.g., both a Shell and a CLI connected to the same conversation), you will see status messages when clients join or leave:
[A new client has joined this conversation. You are connected to 'my-chat'.]
[A client has left this conversation. You remain connected to 'my-chat'.]
In local mode (no agent server), only a single default conversation is available. Conversation switching and creation are not supported.
Currently, TypeAgent Shell optionally supports voice input via Azure Speech Services or Local Whisper Service in addition to keyboard input.
To set up Azure Speech to Text service, the following variables in the .env are needed:
| Variable | Value |
|---|---|
SPEECH_SDK_ENDPOINT |
Service URL (or speech API resource ID when using Identity based authentication) |
SPEECH_SDK_KEY |
API key |
SPEECH_SDK_REGION |
Region of the service (e.g. westus2) |
If you would like to enable keyless Speech API access you must have performed the following steps:
- Specify
identityas theSPEECH_SDK_KEYin the.envfile. - Replace the
SPEECH_SDK_ENDPOINTvalue with the azure resource id of your cognitive service instance (i.e./subscriptions/<your subscription guid>/resourceGroups/myResourceGroup/providers/Microsoft.CognitiveServices/accounts/speechapi). - Configure your speech API to support Azure Entra RBAC and add the necessary users/groups with the necessary permissions
(typically
Cognitive Services Speech UserorCognitive Services Speech Contributor). More information on cognitive services roles here. - If you are using JIT access elevate prior to calling the speech API. Please refer to the elevate.js script for doing this efficiently.
This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.