22 Dec 08:05

ftian1

a1bca88

Generative AI Examples v1.5 Release Notes Latest

Latest

OPEA Release Notes v1.5

We are excited to announce the release of OPEA version 1.5, which includes significant contributions from the open-source community.

More information about how to get started with OPEA v1.5 can be found on the Getting Started page. All project source code is maintained in the opea-project organization. To pull Docker images, please access the Docker Hub. For instructions on deploying Helm Charts, please refer to the guide.

OPEA Release Notes v1.5

What's New in OPEA v1.5

This release includes new features, optimizations, and user-focused updates.

GenAI Examples

Browser-use Agent: a new use case to empower anyone to automate repetitive web tasks. It controls your web browser to perform tasks like visiting websites and extracting data. (GenAIExamples#2312)
Arbitration Post Hearing Assistant Application: a new use case designed to process and summarize post-hearing transcripts or arbitration-related documents. (GenAIExamples#2309)
Polylingua Translation Service: a new use case for translation. (GenAIComps#2298)
OpenAI-Compatible Endpoint Support: ChatQnA now supports OpenAI API-Compatible endpoints. (GenAIComps#2091)

GenAI Microservices

Text2Query: a specialized, independent service designed to translate natural language queries into structured query languages. (GenAIComps#1931)
Arbitration Post-Hearing: a new microservice for Arbitration Post-Hearing with LLM-Based Entity Extraction. (GenAIComps#1938)
FunASR/paraformer: Add a FunASR toolkit-based backend to ASR microservice to support Paraformer, a non-autoregressive end-to-end speech recognition model. (GenAIComps#1914)
LLM Scaler: Boosted LLM/LVM performance on ARC GPU by llm-scaler-vllm v0.10.0-b4. (GenAIComps#1914)
openEuler OS Support: Enabling openEuler OS Support for OPEA Components. (GenAIComps#1813, GenAIComps#1875, GenAIComps#1879, GenAIComps#1913, GenAIComps#1913)
MCP Compliance: Enabled MCP server for some of the OPEA components. (GenAIComps#1849, GenAIComps#1855)
OPEA Store: Enhanced data access using OPEA Store for ChatHistory, FeedbackManagement, and PromptRegistry. (GenAIComps#1916)

Productization

Enhanced Monitoring: Added monitoring for 8 key GenAI Examples. (GenAIExamples#2316,GenAIExamples#2318,GenAIExamples#2319,GenAIExamples#2322)
GenAIStudio: Added support for drag-and-drop creation of fine-tuning applications. (GenAIStudio#74, GenAIStudio#75)
One-click Deployment: Enabled openEuler OS support for one-click deployment. (GenAIExamples#2267)
Documentation Refinement: Refined READMEs for all the components to help readers easily locate documentation tailored to deployment, customization, and hardware.

Validated Hardware

Intel® Gaudi® AI Accelerators (2nd)
Intel® Xeon® Scalable processor (3rd)

Validated Software

Docker version 28.5.1
Docker Compose version v2.40.3
Intel® Gaudi® software and drivers v1.22.1
TEI v1.7
TGI v2.4.0 (Xeon), v2.3.1 (Gaudi)
Ubuntu 22.04
vLLM v0.10.1 (Xeon), opea/vllm-gaudi:1.22.0 (Gaudi)

Full Changelogs

GenAIExamples: v1.4...v1.5
GenAIComps: v1.4...v1.5
GenAIInfra: v1.4...v1.5
GenAIEval: v1.4...v1.5
GenAIStudio: v1.4...v1.5
docs: v1.4...v1.5

Contributors

This release would not have been possible without the contributions of the following organizations and individuals.

Contributing Organizations

Bud: Polylingua Translation Service, Components as MCP Servers.
Intel: Development and improvements to GenAI examples, components, infrastructure, evaluation, and studio.
openEuler: openEuler OS support.
Zensar: Arbitration Post Hearing Assistant.

Individual Contributors

For a comprehensive list of individual contributors, please refer to the Full Changelogs section.

Assets 2

25 Aug 00:29

ftian1

v1.4

7ecb288

Generative AI Examples v1.4 Release Notes

OPEA Release Notes v1.4

We are excited to announce the release of OPEA version 1.4, which includes significant contributions from the open-source community. This release addresses over 330 pull requests.

More information about how to get started with OPEA v1.4 can be found on the Getting Started page. All project source code is maintained in the opea-project organization. To pull Docker images, please access the Docker Hub. For instructions on deploying Helm Charts, please refer to the guide.

OPEA Release Notes v1.4

What's New in OPEA v1.4

This release includes new features, optimizations, and user-focused updates.

Advanced Agent Capabilities

MCP (Model Context Protocol) Support: The OPEA agent now supports the MCP, allowing for standardized and more efficient integration with external data and services. (GenAIComps#1678, GenAIComps#1810)
Deep Research Agent: The example is designed to handle complex, multi-step research. It leverages langchain-ai/open_deep_research and supports Intel Gaudi accelerators. (GenAIExamples#2117)

Components as MCP Servers

OPEA components can now serve as Model Context Protocol (MCP) servers, allowing external MCP-compatible frameworks and applications to integrate with OPEA seamlessly. (GenAIComps#1652)

KubeAI Operator for OPEA

The KubeAI Operator now features an improved autoscaler, monitoring support, optimized resource placement via NRI plugins, and expanded support for new models on Gaudi. (GenAIInfra#967, GenAIInfra#1052, GenAIInfra#1054, GenAIInfra#1089, GenAIInfra#1113, GenAIInfra#1144, GenAIInfra#1150)

New GenAI Capabilities

Fine-Tuning of Reasoning Models: This feature is compatible with the dataset format used in FreedomIntelligence/medical-o1-reasoning-SFT, enabling you to customize models with your own data. (GenAIComps#1839)
HybridRAG: Combined GraphRAG (knowledge graph-based retrieval) and VectorRAG (vector database retrieval) for enhanced accuracy and contextual relevance. (GenAIExamples#1968)
LLM Router: LLM Router decides which downstream LLM serving endpoint is best suited for an incoming prompt. (GenAIComps#1716)
OPEA Store: Redis and MongoDB have been integrated into OPEA Store. (GenAIComps#1816, GenAIComps#1818)
Guardrails: Added Input/Output Guardrails to enforce content safety and prevent the creation of inappropriate outputs. (GenAIComps#1798)
Language Detection: The microservice is used to ensure the pipeline's response matches the query's language. (GenAIComps#1774)
Prompt Template: The microservice can dynamically generate system and user prompts based on structured inputs and document context. (GenAIComps#1826)
Air-gapped Environment Support: Some OPEA microservices can now be deployed in an air-gapped Docker environment. (GenAIComps#1480)
Remote Inference Endpoints Support: Added support for remote inference endpoints for OPEA examples. (GenAIExamples#1973)

Better User Experience

One-click Deployment: You can now deploy 8 OPEA examples with one click. ChatQnA can deploy in an air-gapped Docker environment. (GenAIExamples#1727)
GenAIStudio: Added support for drag-and-drop creation of documentation summarization and code generation applications. (GenAIStudio#61)
Documentation Refinement: Refined READMEs for key examples and components to help readers easily locate documentation tailored to deployment, customization, and hardware. (GenAIExamples#1673, GenAIComps#1398)

Newly Supported Models

OPEA introduces support for the following models in this release.

Model	TGI-Gaudi	vLLM-CPU	vLLM-Gaudi	vLLM-ROCm	OVMS	Optimum-Habana	PredictionGuard	SGLANG-CPU
meta-llama/Llama-4-Scout-17B-16E-Instruct	-	-	-	-	-	-	-	✓
meta-llama/Llama-4-Maverick-17B-128E-Instruct	-	-	-	-	-	-	-	✓

(✓: supported; -: not validated; x: unsupported)

Newly Supported Hardware

Support for AMD® EPYC™ has been added for 11 OPEA examples. (GenAIExamples#2083)

Newly Supported OS

Support for openEuler has been added. (GenAIExamples#2088, GenAIComps#1813)

Updated Dependencies

Dependency	Hardware	Scope	Version	Version in OPEA v1.3	Comments
huggingface/text-embeddings-inference	all	all supported examples	cpu-1.7	cpu-1.6
vllm	Xeon	all supported examples except EdgeCraftRAG	v0.10.0	v0.8.3

Changes to Default Behavior

CodeTrans: The default model changed from mistralai/Mistral-7B-Instruct-v0.3 to Qwen/Qwen2.5-Coder-7B-Instruct on Xeon and Gaudi.

Validated Hardware

Intel® Gaudi® AI Accelerators (2nd)
Intel® Xeon® Scalable processor (3rd)
Intel® Arc™ Graphics GPU (A770)
AMD® EPYC™ processors (4th, 5th)

Validated Software

Docker version 28.3.3
Docker Compose version v2.39.1
Intel® Gaudi® software and drivers v1.21
Kubernetes v1.32.7
TEI v1.7
TGI v2.4.0 (Xeon, EPYC), v2.3.1 (Gaudi), v2.4.1 (ROCm)
Torch v2.5.1
Ubuntu 22.04
vLLM v0.10.0 (Xeon, EPYC), v0.6.6.post1+Gaudi-1.20.0 (Gaudi)

Known Issues

AvatarChatbot cannot run in a K8s environment due to a functional gap in the wav2clip service. (GenAIExamples#1506)

Full Changelogs

GenAIExamples: v1.3...v1.4
GenAIComps: v1.3...v1.4
GenAIInfra: v1.3...v1.4
GenAIEval: v1.3...v1.4
GenAIStudio: v1.3...v1.4
docs: v1.3...v1.4

Contributors

This release would not have been possible without the contributions of the following organizations and individuals.

Contributing Organizations

AMD: AMD EPYC support.
Bud: Components as MCP Servers.
Intel: Development and improvements to GenAI examples, components, infrastructure, evaluation, and studio.
MariaDB: Added ChatQnA docker-compose example on Intel Xeon using Mari...

Assets 2

14 May 05:10

ftian1

v1.3

e380c18

Generative AI Examples v1.3 Release Notes

OPEA Release Notes v1.3

We are excited to announce the release of OPEA version 1.3, which includes significant contributions from the open-source community. This release addresses over 520 pull requests.

More information about how to get started with OPEA v1.3 can be found on the Getting Started page. All project source code is maintained in the opea-project organization. To pull Docker images, please access the Docker Hub. For instructions on deploying Helm Charts, please refer to the guide.

What's New in OPEA v1.3
Deprecations
Updated Dependencies
Changes to Default Behavior
Validated Hardware
Validated Software
Known Issues
Full Changelogs
Contributors

What's New in OPEA v1.3

This release introduces exciting capabilities, optimizations, and user-centric enhancements:

Advanced Agent Capabilities

Multi-Turn Conversation: Enhanced the OPEA agent framework for dynamic, context-aware dialogues. (GenAIComps#1248)
Finance Agent Example: A financial agent example for automating financial data aggregation and leveraging LLMs to generate insights, forecasts, and strategic recommendations. (GenAIExamples#1539)

Performance and Scalability

vLLM Enhancement: Integrated vLLM as the default LLM serving backend for key GenAI examples across Intel® Xeon® processors, Intel® Gaudi® accelerators, and AMD® GPUs. (GenAIExamples#1436)
KubeAI Operator for OPEA (Alpha release): Simplified OPEA inference operations in cloud environment and enabled optimal out-of-the-box performance for specific models and hardware using profiles. (GenAIInfra#945)

Ecosystem Integrations

Haystack Integration: Enabled OPEA as a backend of Haystack. (Haystack-OPEA#1)
Cloud Readiness: Expanded automated Terraform deployment for ChatQnA to include support for Azure, and enabled CodeGen deployments on AWS and GCP. (GenAIExamples#1731)

New GenAI Capabilities

OPEA Store: Delivered a unified data store access API and a robust data store integration layer that streamlines data store integration. ArangoDB was integrated. (GenAIComps#1493)
CodeGen using RAG and Agent: Leveraged RAG and code agent to provide an additional layer of intelligence and adaptability for CodeGen example. (GenAIExamples#1757)
Enhanced Multimodality: Added support for additional audio file types (.mp3) and supported spoken audio captions with image ingestion. (GenAIExamples#1549)
Struct to Graph: Supported transforming structured data to graphs using Neo4j graph database. (GenAIComps#1502)
Text to Graph: Supported creating graphs from text by extracting graph triplets. (GenAIComps#1357, GenAIComps#1472)
Text to Cypher: Supported generating and executing Cypher queries from natural language for graph database retrieval. (GenAIComps#1319)

Enhanced Evaluation

Enhanced Long-Context Model Evaluation: Supported evaluating long-context model on Intel® Gaudi® with vLLM. (HELMET#20)
TAG-Bench for SQL Agents: Integrated TAG-Bench to evaluate complex SQL query generation (GenAIEval#230).
DocSum Support: GenAIEval now supports evaluating the performance of DocSum. (GenAIEval#252)
Toxicity Detection Evaluation: Introduced a workflow to evaluate the capability of detecting toxic language based on LLMs. (GenAIEval#241)
Model Card: Added a model card generator for generating reports containing model performance and fairness metrics. (GenAIEval#236)

Observability

OpenTelemetry Tracing: Leveraged OpenTelemetry to enable tracing for ChatQnA and AgentQnA along with TGI and TEI. (GenAIExamples#1542)
Application dashboards: Helm installed application E2E performance dashboard(s). (GenAIInfra#800)
E2E (end-to-end) metric improvements: E2E metrics are summed together for applications that use multiple megaservice instances. Tests for the E2E metrics + fixes. (GenAIComps#1301, (GenAIComps#1343)

Better User Experience

GenAIStudio: Supported drag-and-drop creation of agentic applications. (GenAIStudio#50)
Documentation Refinement: Refined READMEs for key examples to help readers easily locate documentation tailored to deployment, customization, and hardware. (GenAIExamples#1741)
Optimized Dockerfiles: Simplified application Dockerfiles for faster image builds. (GenAIExamples#1585)

Exploration

SQFT: Supported low-precision sparse parameter-efficient fine-tuning on LLMs. (GenAIResearch#1)

Newly Supported Models

OPEA introduced the support for the following models in this release.

Model	TGI-Gaudi	vLLM-CPU	vLLM-Gaudi	vLLM-ROCm	OVMS	Optimum-Habana	PredictionGuard
deepseek-ai/DeepSeek-R1-Distill-Llama-8B	✓	✓	✓	✓	-	✓	-
deepseek-ai/DeepSeek-R1-Distill-Llama-70B	✓	✓	✓	✓	-	✓	-
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B	✓	✓	✓	✓	-	✓	-
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B	✓	✓	✓	✓	-	✓	-
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B	✓	✓	✓	✓	-	✓	-
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B	✓	✓	✓	✓	-	✓	-
deepseek-ai/Deepseek-v3	✓	-	✓	✓	-	✓	-
Hermes-3-Llama-3.1-8B	-	-	-	✓	-	-	✓
ibm-granite/granite-3.2-8b-instruct	-	-	✓	✓	-	-	-
Phi-4-mini	x	x	x	✓	x	✓	-
Phi-4-multimodal-instruct	x	x	x	✓	x	✓	-
mistralai/Mistral-Small-24B-Instruct-2501	✓	-	✓	✓	-	✓	-
mistralai/Mistral-Large-Instruct-2411	x	-	✓	✓	-	✓	-

(✓: supported; -: not validated; x: unsupported)

Newly Supported Hardware

AMD® GPU using AMD® ROCm™ for 9 examples. (GenAIExamples#1613 and 8 more.)

Other Notable Changes

Expand the following lists to read:

GenAIExamples

Functionalities
- [AgentQnA] Added web search tool support and simplify the run instructions. (#1656) (e8f2313)
- [ChatQnA] Added support for latest deepseek models on Gaudi (#1491) (9adf7a6)
- [EdgeCraftRAG] A sleek new UI based on Vue and Ant Design for enhanced user experience, supporting concurrent multi-requests on vLLM, JSON pipeline configuration, and API-based prompt modification. (#1665) (5a50ae0)
- [EdgeCraftRAG] Supported multi-card deployment of Intel ARC GPU for vllm inference ([#1729](https://github.com/opea-project/Gen...

Assets 2

27 Jan 02:20

chensuyue

v1.2

c8c6fa2

Generative AI Examples v1.2 Release Notes

OPEA Release Notes v1.2

We are excited to announce the release of OPEA version 1.2, which includes significant contributions from the open-source community. This release addresses over 320 pull requests.

More information about how to get started with OPEA v1.2 can be found at Getting Started page. All project source code is maintained in the repository. To pull Docker images, please access the Docker Hub. For instructions on deploying Helm Charts, please refer to the guide.

What's New in OPEA v1.2

This release focuses on code refactoring for GenAIComps, the epic efforts aimed at reducing redundancy, addressing technical debt, and enhancing overall maintainability and code quality. As a result, OPEA users can expect a more robust and reliable OPEA with clearer guidance and improved documentation.

OPEA v1.2 also introduces more scenarios with general availability, including:

LlamaIndex and LangChain Integration: Enabling OPEA as a backend. LlamaIndex integration currently supports ChatQnA only.
Model Context Protocol(MCP) Support: Experimental support for MCP at Retriever.
Cloud Service Providers(CSP) Support: Supported automated Terraform deployment using Intel® Optimized Cloud Modules for Terraform, available for major cloud platforms, including Amazon Web Services (AWS), Google Cloud Platform (GCP), and Microsoft Azure.
Enhanced Security: Istio Mutual TLS (mTLS) and OIDC (Open ID Connect) based Authentication with APISIX.
Enhancements for GenAI Evaluation: Specialized evaluation benchmarks tailored for Chinese language models, focusing on their performance and accuracy within Chinese dataset.
Helm Charts Deployment: Add supports for the examples Text2Image, SearchQnA and their microservices.

Highlights

Code Factoring for GenAIComps

This is an epic task in v1.2. We refactored the entire GenAIComps codebase. This comprehensive effort focused on reducing redundancy, addressing accumulated technical debt, and enhancing the overall maintainability and code quality. The refactoring not only streamlined the architecture but also laid a stronger foundation for future scalability and development.

At the architecture level, OPEA introduces OpeaComponentRegistry and OpeaComponentLoader. The OpeaComponentRegistry manages the lifecycle of component classes, including their registration and deregistration, while the OpeaComponentLoader instantiates components based on the classes in the registry and execute as needed. Unlike previous implementations, this approach ensures that the lifecycle of a component class is transparent to the user, and components are instantiated only when actively used. This design enhances efficiency, clarity, and flexibility in the system.

At the component level, each OPEA component is structured into two layers: the service wrapper and the service provider (named as integrations in the code). The service wrapper, which is optional, acts as a protocol hub and manages service access, while the service provider delivers the actual functionality. This architecture allows components to be seamlessly integrated or removed without requiring code changes, enabling a modular and adaptable system. All the existing components have ported to the new architecture.

Additionally, we reduced code redundancy, merged overlapping modules, and implemented adjustments to align with the new architectural changes.

Note

We suggest users and contributors to review the documentation to understand the impacts of the code refactoring.

Supporting Cloud Service Providers

OPEA offers automated Terraform deployment using Intel® Optimized Cloud Modules for Terraform, available for major cloud platforms, including AWS, GCP, and Azure. To explore this option, check out the Terraform deployment guide.

Additionally, OPEA supports manual deployment on virtual servers across AWS, GCP, IBM Cloud, Azure, and Oracle Cloud Infrastructure (OCI). For detailed instructions, refer to the manual deployment guide.

Enhanced GenAI Components

vLLM support for embeddings and rerankings: Integrate vLLM as a serving framework to enhance the performance and scalability of embedding and reranking models.
Agent Microservice:
- SQL agent strategy: Take user question, hints (optional) and history (when available), and think step by step to solve the problem by interacting with a SQL database. OPEA currently has two types of SQL agents: sql_agent_llama for using with open-source LLMs and sql_agent: for using with OpenAI models.
- Enabled user-customized tool subsets: Added support for user-defined subsets of tools for the ChatCompletion API and Assistant APIs.
- Enabled persistence: Introduced Redis to persist Agent configurations and historical messages for Agent recovery and multi-turn conversations.
Long-context Summarization: Supported multiple modes: auto, stuff, truncate, map_reduce, and refine.
Standalone Microservice Deployment: Enabled the deployment of OPEA components as independent services, allowing for greater flexibility, scalability, and modularity in various application scenarios.
PDF Inputs Support: Support PDF inputs for dataprep, embeddings, LVMs, and retrievers.

New GenAI Components

Bedrock: OPEA LLM now supports Amazon Bedrock as the backend of the text generation microservice. Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies through a single API, along with a broad set of capabilities you need to build generative AI applications with security, privacy, and responsible AI.
OpenSearch Vector Database: OPEA vectorstores now supports AWS OpenSearch. OpenSearch is an open-source, enterprise-grade search and observability suite that brings order to unstructured data at scale.
Elasticsearch Vector Database: OPEA vectorestores now supports Elasticsearch vector database, Elasticsearch's open source vector database offering an efficient way to create, store, and search vector embeddings.
Guardrail Hallucination Detection: Added the capability of detecting Hallucination which spans a wide range of issues that can impact reliability, trustworthiness, and utility of AI-generated content.

Enhanced GenAI Examples

ChatQnA: Enabled embedding and reranking on vLLM, and Jaeger UI and OpenTelemetry tracing for TGI serving on HPU.
AgentQnA: Added SQL worker agent and introduced a Svelte-based GUI for ChatCompletion API for non-streaming interactions.
MultimodalQnA: Added support for PDF ingestion, and image/audio queries.
EdgeCraftRAG: Supported image/url data retrieval and display, display of LLM-used context sources in UI, pipeline remove operation in RESTful API and UI, RAG pipeline performance benchmark and display in UI. (#GenAIExamples/1324)
DocSum: Added URL summary option to Gradio-based UI.
DocIndexRetriever: Add the pipeline without Reranking.

Enhanced GenAIStudio

In this release, GenAI Studio enables Keycloak for multi-user management, supporting sandbox environment for multi-workflow execution and enables Grafana based visualization dashboards with built-in performance metric on Prometheus for model evaluation and functional nodes performance.

Newly Supported Models

bge-base-zh-v1.5
Falcon2-40B/11B
Falcon3

Newly Supported Hardware

Intel® Gaudi® 3 AI Accelerator
AMD® GPU using AMD® ROCm™ for AgentQnA, [AudioQnA](https://github.com/opea-project/GenA...

Assets 2

26 Nov 00:42

ftian1

v1.1

bbb4e23

Generative AI Examples v1.1 Release Notes

OPEA Release Notes v1.1

We are pleased to announce the release of OPEA version 1.1, which includes significant contributions from the open-source community. This release addresses over 470 pull requests.

More information about how to get started with OPEA v1.1 can be found at Getting Started page. All project source code is maintained in the repository. To pull Docker images, please access the Docker Hub. For instructions on deploying Helm Charts, please refer to the guide.

What's New in OPEA v1.1

This release introduces more scenarios with general availability, including:

Newly supported Generative AI capabilities: Image-to-Video, Text-to-Image, Text-to-SQL and Avatar Animation.
Generative AI Studio that offers a no-code alternative to create enterprise Generative AI applications.
Expands the portfolio of supported hardware to include Intel® Arc™ GPUs and AMD® GPUs.
Enhanced monitoring support, providing real-time insights into runtime status and system resource utilization for CPU and Intel® Gaudi® AI Accelerator, as well as Horizontal Pod Autoscaling (HPA).
Helm Chart support for 7 new GenAIExamples and their microservices.
Benchmark tools for long-context language models (LCLMs) such as LongBench and HELMET.

Highlights

New GenAI Examples

AvatarChatbot: a chatbot that combines a virtual "avatar" that can run on either Intel Gaudi 2 AI Accelerator or Intel Xeon Scalable Processors.
DBQnA: for seamless translation of natural language queries into SQL and deliver real-time database results.
EdgeCraftRAG: a customizable and tunable RAG example for edge solutions on Intel® Arc™ GPUs.
GraphRAG: a Graph RAG-based approach to summarization.
Text2Image: an application that generates images based on text prompts.
WorkflowExecAgent: a workflow executor example to handle data/AI workflow operations via LangChain agents to execute custom-defined workflow-based tools.

Enhanced GenAI Examples

Multi-media support: DocSum, MultimodalQnA
Multi-language support: AudioQnA, DocSum

New GenAI Components

Text-to-Image: add Stable Diffusion microservice
Image-to-Video: add Stable Video Diffusion microservice
Text-to-SQL: add Text-to-SQL microservice
Text-to-Speech: add GPT-SoVITS microservice
Avatar Animation: add Animation microservice
RAG: add GraphRAG with llama-index microservice

Enhanced GenAI Components

Asynchronous support for microservices (28672956, 9df4b3c0, f3746dc8)
Add vLLM backends for summarization, FAQ generation, code generation, and Agents
Multimedia support (29ef6426, baafa402)

GenAIStudio

GenAI Studio, a new project of OPEA, streamlines the creation of enterprise Generative AI applications by providing an alternative UI-based processes to create end-to-end solutions. It supports GenAI application definition, evaluation, performance benchmarking, and deployment. The GenAI Studio empowers developers to effortlessly build, test, optimize their LLM solutions, and create a deployment package. Its intuitive no-code/low-code interface accelerates innovation, enabling rapid development and deployment of cutting-edge AI applications with unparalleled efficiency and precision.

Enhanced Observability

Observability offers real-time insights into component performance and system resource utilization. We enhanced this capability by monitoring key system metrics, including CPU, host memory, storage, network, and accelerators (such as Intel Gaudi), as well as tracking OPEA application scaling.

Helm Charts Support

OPEA examples and microservices support Helm Charts as the packaging format on Kubernetes (k8s). The newly supported examples include AgentQnA, AudioQnA, FaqGen, VisualQnA. The newly supported microservices include chathistory, mongodb, prompt, and Milvus for data-prep and retriever. Helm Charts have now option to get Prometheus metrics from the applications.

Long-context Benchmark Support

We added the following two benchmark kits to response to the community's requirements of long-context language models.

HELMET: a comprehensive benchmark for long-context language models covering seven diverse categories of tasks. The datasets are application-centric and are designed to evaluate models at different lengths and levels of complexity.
LongBench: a benchmark tool for bilingual, multitask, and comprehensive assessment of long context understanding capabilities of large language models.

Newly Supported Models

llama-3.2 (1B/3B/11B/90B)
glm-4-9b-chat
Qwen2/2.5 (7B/32B/72B)

Newly Supported Hardware

Intel® Arc™ GPU: vLLM powered by OpenVINO can perform optimal model serving on Intel® Arc™ GPU.
AMD® GPU: deploy GenAI examples on AMD® GPUs using AMD® ROCm™: CodeTrans, CodeGen, FaqGen, DocSum, ChatQnA.

Notable Changes

GenAIExamples

Functionalities
- New GenAI Examples
  - [AvatarChatbot] Initiate "AvatarChatbot" (audio) example (cfffb4c, 960805a)
  - [DBQnA] Adding DBQnA example in GenAIExamples (c0643b7, 6b9a27d)
  - [EdgeCraftRag] Add EdgeCraftRag as a GenAIExample (c9088eb, 7949045, 096a37a)
  - [GraphRAG] Add GraphRAG example a65640b
  - [Text2Image]: Add example for text2image 085d859
  - [WorkflowExecAgent] Add Workflow Executor Example bf5c391
- Enhanced GenAI Examples
  - [AudioQnA] Add multi-language AudioQnA on Xeon 658867f
  - [AgentQnA] Update AgentQnA example for v1.1 release 5eb3d28
  - [ChatQnA] Enable vLLM Profiling for ChatQnA ([00d9bb6](https://github.com/opea-project...

Assets 2

20 Sep 09:36

kevinintel

v1.0

3c3d0b4

Generative AI Examples v1.0 Release Notes

OPEA Release Notes v1.0

What’s New in OPEA v1.0

Highlights
- Improve the RAG performance through microservice optimizations (e.g., Hugging Face TGI, vLLM) and megaservice tuning
- Provide the experimental LLM model training support, includes full fine-tuning and parameter-efficient fine-tuning (PEFT)
- Improve RAG with Knowledge Graph based on Neo4j
- Improve VisualQnA and provide multi-modality RAG support
- Faster microservice launch through removal of some dispatch overhead
- Enable Gateway with guardrail, and integrate nginx with CORS protection and data preparation
- Enable HorizontalPodAutoscaler (HPA) for better resource management
- Define the metrics of RAG performance and enable accuracy evaluation for more GenAI examples
- Further improvement on documentation and developer experience
Other features
- Enable OpenAI compatible format on applicable microservices
- Support microservice launch from ModelScope to address China ecosystem need
- Support Red Hat OpenShift Container Platform (RHOCP)
- Refactor the code and CI/CD pipeline to provide better support for contributors
- Improve Docker versioning to avoid the potential conflict
- Enhance GenAI Microservice Connector (GMC), including improvements such as router performance optimizations and other updates
- Introduce Memory Bandwidth Exporter that integrates with Kubernetes Node Resource Interface
Learn more about OPEA at
- Getting Started: https://opea-project.github.io/latest/index.html
- Github: https://github.com/opea-project
- Docker Hub: https://hub.docker.com/u/opea
Release Documentation:
- Landing Page: https://opea.dev/
- Release Notes: https://github.com/opea-project/docs/tree/main/release_notes

Details

GenAIExamples

Deployment
- Add ui/nginx support in K8S manifest for ChatQnA/CodeGen/CodeTrans/Docsum(ba94e01)
- K8S manifest: Update ChatQnA/CodeGen/CodeTrans/DocSum(0629696)
- Update mount path in xeon k8s(2a6af64)
- Add Nginx - k8s manifest in CodeTrans(6a679ba)
- Add Nginx - docker in CodeTrans(cc84847)
- watch more docker compose files changes(4b0bc26)
- Add chatQnA UI manifest(758d236)
- Revert the LLM model for kubernetes GMS(f5f1e32)
- [ChatQnA] Update retrieval & dataprep manifests(6730b24)
- [ChatQnA]Update manifests(3563f5d)
- [ChatQnA] Update benchmarking manifests(36fb9a9)
- [ChatQnA] udate OOB & Tuned manifests(ac34860)
- Add nginx and UI to the ChatQnA manifest(05f9828)
- [ChatQnA] Update OOB with wrapper manifests.(933c3d3)
- [Translation] Support manifests and nginx(1e13031)
- update V1.0 benchmark manifest (e5affb9)
- update image name(e2a74f7)
- K8S manifest: Update ChatQnA/CodeGen/CodeTrans/DocSum(0629696)
- Change megaservice path in line with new file structure(5ab27b6)
- Add ui/nginx support in K8S manifest for ChatQnA/CodeGen/CodeTrans/Docsum(ba94e01)
- Add chatQnA UI manifest(758d236)
- Yaml: add comments to specify gaudi device ids.(63406dc)
- add tgi bf16 setup on CPU k8s.(ba17031)
Documentation
- [ChatQnA] Update README for ModelScope(aebc23f)
- Update README.md(4bd7841)
- [ChatQnA] Update README for without Rerank Pipeline(6b617d6)
- [ChatQnA] Update Benchmark README for w/o rerank(4a51874)
- Fix readme for nv gpu(43b2ae5)
- [ChatQnA] Update Benchmark README to Fix Input Length(55d287d)
- Refine ChatQnA README for TGI(afc3341)
- Add default model for VisualQnA README(07baa8f)
- Update readme for manifests of some examples(adb157f)
- doc: use markdown table in supported_examples(9cf1d88)
- doc: remove invalid code block language(c6d811a)
- add AudioQnA readme with supported model(f4f4da2)
- add more code owners(7f89797)
- doc: fix headings(7a0fca7)
- [Codegen] Refine readme to prompt users on how to change the model.(814164d)
- Update README.md and remove some open-source details(2ef83fc)
- Add issue template(84a781a)
- doc: fix headings and indenting(67394b8)
- Add default model in readme for FaqGen and DocSum(d487093)
- Change docs of kubernetes for curl commands in README(4133757)
- Update v0.9 RAG release data(947936e)
- Explain Default Model in ChatQnA and CodeTrans READMEs(2a2ff45)
- Update docker images list.(a8244c4)
- refactor the network port setting for AWS(bc81770)
- Add validate microservice details link(bd811bd)
- [ChatQnA] Add Nginx in Docker Compose and README(6c36448
- [Doc] Update CodeGen and Translation READMEs(a09395e)
- [Doc] Refine READMEs(372d78c)
- Remove marketing materials(d85ec09)
- doc PR to main instead of of v1.0r(dc94026)
- Update README.md for Multiplatforms(b205dc7)
- Refine the quick start of ChatQnA(3b70fb0)
- Update supported_examples(96d5cd9)
- [Doc] doc improvement(e0b3b57)
- Fix README issues(bceacdc)
- doc: fix broken image reference and markdown(d422929)
- doc: give document meaningful title(a3fa0d6)
- doc: fix incorrefine readme for reorg(d2bab99)
- doc: fix incorrect path to png image files (d97882e)
- update doc according to comments(f990f79)
- doc: fix headings and indenting(67394b8)
- Update README.md(4bd7841)
- refine readme for reorg(d2bab99)
- Update README with new examples(2d28beb)
- README: fix broken links(ff6f841)
- Update v0.9 RAG release data([947936e](https://github....

Assets 2

27 Aug 03:07

kevinintel

v0.9

4d59721

Generative AI Examples v0.9 Release Notes

OPEA Release Notes v0.9

What’s New in OPEA v0.9

Broaden functionality
- Provide telemetry functionalities for metrics and tracing using Prometheus, Grafana, and Jaeger
- Initialize two Agent examples: AgentQnA and DocIndexRetriever
- Support for authentication and authorization
- Add Nginx Component to strengthen backend security
- Provide Toxicity Detection Microservice
- Support the experimental Fine-tuning microservice
Enhancement
- Align the Microservice format with the standards of OpenAI (Chat Completions, Fine-tuning... etc)
- Enhance the performance benchmarking and evaluation for GenAI Examples, ex: TGI, resource allocation, ...etc
- Enable support for launching container images as a non-root user
- Use Llama-Guard-2-8B as default Guardrails model and bge-large-zh-v1.5 as default embedding model, mistral-7b-grok as default CodeTrans model
- Add ProductivitySuite to provide access management and maintains user context
Deployment
- Support Red Hat OpenShift Container Platform (RHOCP)
- GenAI Microservices Connector (GMC) successfully tested on Nvidia GPUs
- Add Kubernetes support for AudioQnA and VisualQnA examples
OPEA Docker Hub: https://hub.docker.com/u/opea
GitHub IO: https://opea-project.github.io/latest/index.html
Thanks for the external contribution from Sharan Shirodkar, Aishwarya Ramasethu
, Michal Nicpon and Jacob Mansdorfer

Details

GenAIExamples

ChatQnA
- Update port in set_env.sh(040d2b7)
- Fix minor issue in ChatQnA Gaudi docker README(a5ed223)
- update chatqna dataprep-redis port(02a1536)
- Add support for .md file in file upload in the chatqna-ui(7a67298)
- Added the ChatQnA delete feature, and updated the corresponding README(09a3196)
- fixed ISSUE-528(45cf553)
- Fix vLLM and vLLM-on-Ray UT bug(cfcac3f)
- set OLLAMA_MODEL env to docker container(c297155)
- Update guardrail docker file path(06c4484)
- remove ray serve(c71bc68)
- Refine docker_compose for dataprep param settings(3913c7b)
- fix chatqna guardrails(db2d2bd)
- Support ChatQnA pipeline without rerank microservice(a54ffd2)
- Update the number of microservice replicas for OPEA v0.9(e6b4fff)
- Update set_env.sh(9657f7b)
- add env for chatqna vllm(f78aa9e)
Deployment
- update manifests for v0.9(ba78b4c)
- Update K8S manifest for ChatQnA/CodeGen/CodeTrans/DocSum(01c1b75)
- Update benchmark manifest to fix errors(4fd3517)
- Update env for manifest(4fa37e7)
- update manifests for v0.9(08f57fa)
- Add AudioQnA example via GMC(c86cf85)
- add k8s support for audioqna(0a6bad0)
- Update mainifest for FaqGen(80e3e2a)
- Add kubernetes support for VisualQnA(4f7fc39)
- Add dataprep microservice to chatQnA example and the e2e test(1c23d87)
Documentation
- [doc] Update README.md(c73e4e0)
- doc fix: Update README.md to remove specific dicscription of paragraph-1(5a9c109)
- doc: fix markdown in docker_image_list.md(9277fe6)
- doc: fix markdown in Translation/README.md(d645305)
- doc: fix markdown in SearchQnA/README.md(c461b60)
- doc: fix FaqGen/README.md markdown(704ec92)
- doc: fix markdown in DocSum/README.md(83712b9)
- doc: fix markdown in CodeTrans/README.md(076bca3)
- doc: fix CodeGen/README.md markdown(33f8329)
- doc: fix markdown in ChatQnA/README.md(015a2b1)
- doc: fix headings in markdown files(21fab71)
- doc: missed an H1 in the middle of a doc(4259240)
- doc: remove use of HTML for table in README(e81e0e5)
- Update ChatQnA readme with OpenShift instructions(ed48371)
- Convert HTML to markdown format.(14621f8)
- Fix typo {your_ip} to {host_ip}(ad8ca88)
- README fix typo(abc02e1)
- fix script issues in MD file(acdd712)
- Minor documentation improvements in the CodeGen README(17b9676)
- Refine Main README(08eb269)
- [Doc]Add a micro/mega service WorkFlow for DocSum(343d614)
- Update README for k8s deployment(fbb81b6)
Other examples
- Clean deprecated VisualQnA code(87617e7)
- Using TGI official release docker image for intel cpu(b2771ad)
- Add VisualQnA UI(923cf69)
- fix container name(5ac77f7)
- Add VisualQnA docker for both Gaudi and Xeon using TGI serving(2390920)
- Remove LangSmith from Examples(88eeb0d)
- Modify the language variable to match language highlight.(f08d411)
- Remove deprecated folder.(7dd9952)
- update env for manifest(4fa37e7)
- AgentQnA example(67df280)
- fix tgi xeon tag(6674832)
- Add new DocIndexRetriever example(566cf93)
- Add env params for chatqna xeon test(5d3950)
- ProductivitySuite Combo Application with REACT UI and Keycloak Authen(947cbe3)
- change codegen tgi model(06cb308)
- change searchqna prompt(acbaaf8)
- minor fix mismatched hf token(ac324a9)
- fix translation gaudi env(4f3be23)
- Minor fixes for CodeGen Xeon and Gaudi Kubernetes codegen.yaml (c25063f)
CI/CD/UT
- update deploy_gmc logical in cd workflow(c016d82)
- fix ghcr.io/huggingface/text-generation-inference tag(503a1a9)
- Add GMC e2e in CD workflow(f45e4c6)
- Fix CI test changed file detect issue([5...

Assets 2

29 Jul 02:18

kevinintel

v0.8

a2437e8

Generative AI Examples v0.8 Release Notes

OPEA Release Notes v0.8

What’s New in OPEA v0.8

Broaden functionality
- Support frequently asked questions (FAQs) generation GenAI example
- Expand the support of LLMs such as Llama3.1 and Qwen2 and support LVMs such as llava
- Enable end-to-end performance and accuracy benchmarking
- Support the experimental Agent microservice
- Support LLM serving on Ray
Multi-platform support
- Release the Docker images of GenAI components under OPEA dockerhub and support the deployment with Docker
- Support cloud-native deployment through Kubernetes manifests and GenAI Microservices Connector (GMC)
- Enable the experimental authentication and authorization support using JWT tokens
- Validate ChatQnA on multiple platforms such as Xeon, Gaudi, AIPC, Nvidia, and AWS
OPEA Docker Hub: https://hub.docker.com/u/opea

Details

GenAIExamples

ChatQnA
- Add ChatQnA instructions for AIPC(26d4ff)
- Adapt Vllm response format (034541)
- Update tgi version(5f52a1)
- Update README.md(f9312b)
- Udpate ChatQnA docker compose for Dataprep Update(335362)
- [Doc] Add valid micro-service details(e878dc)
- Updates for running ChatQnA + Conversational UI on Gaudi(89ddec)
- Fix win PC issues(ba6541)
- [Doc]Add ChatQnA Flow Chart(97da49)
- Add guardrails in the ChatQnA pipeline(955159)
- Fix a minor bug for chatqna in docker-compose(b46ae8)
- Support vLLM/vLLM-on-Ray/Ray Serve for ChatQnA(631d84)
- Added ChatQnA example using Qdrant retriever(c74564)
- Update TEI version v1.5 for better performance(f4b4ac)
- Update ChatQnA upload feature(598484)
- Add auto truncate for embedding and rerank(8b6094)
Deployment
- Add Kubernetes manifest files for deploying DocSum(831463)
- Update Kubernetes manifest files for CodeGen(2f9397)
- Add Kubernetes manifest files for deploying CodeTrans(c9548d)
- Updated READMEs for kubernetes example pipelines(c37d9c)
- Update all examples yaml files of GMC in GenAIExample(290a74)
- Doc: fix minor issue in GMC doc(d99461)
- README for installing 4 worklods using helm chart(6e797f)
- Update Kubernetes manifest files for deploying ChatQnA(665c46)
- Add new example of SearchQnA for GenAIExample(21b7d1)
- Add new example of Translation for GenAIExample(d0b028)
Other examples
- Update reranking microservice dockerfile path (d7a5b7)
- Update tgi-gaudi version(3505bd)
- Refine README of Examples(f73267)
- Update READMEs(8ad7f3)
- [CodeGen] Add codegen flowchart(377dd2)
- Update audioqna image name(615f0d)
- Add auto-truncate to gaudi tei (8d4209)
- Update visualQnA chinese version(497895)
- Fix Typo for Translation Example(95c13d)
- FAQGen Megaservice(8c4a25)
- Code-gen-react-ui(1b48e5)
- Added doc sum react-ui(edf0d1)
CI/UT
- Frontend failed with unknown timeout issue (7ebe78)
- Adding Chatqna Benchmark Test(11a56e)
- Expand tgi connect timeout(ee0dcb)
- Optimize gmc manifest e2e tests(15fc6f)
- Add docker compose yaml print for test(bb4230)
- Refactor translation ci test (b7975e)
- Refactor searchqna ci test(ecf333)
- Translate UT for UI(284d85)
- Enhancement the codetrans e2e test(450efc)
- Allow gmc e2e workflow to get secrets(f45f50)
- Add checkout ref in gmc e2e workflow(62ae64)
- SearchQnA UT(268d58)

GenAIComps

Cores
- Support https for microservice(2d6772)
- Enlarge megaservice request timeout for supporting high concurrency(876ca5)
- Add dynamic DAG(f2995a)
LLM
- Optional vllm microservice container build(963755)
- Refine vllm instruction(6e2c28)
- Introduce 'entrypoint.sh' for some Containers(9ecc5c)
- Support llamaindex for retrieval microservice and remove langchain(61795f)
- Update tgi with text-generation-inference:2.1.0(f23694)
- Fix requirements(f4b029)
- Add vLLM on Ray microservice(ec3b2e)
- Update code/readme/UT for Ray Serve and VLLM([dd939c](https://gith...

Assets 2

28 Jun 16:46

kevinintel

v0.7

77ba913

Generative AI Examples v0.7 Release Notes

OPEA Highlight

Add 3 MegaService examples: Translation, SearchQnA and AudioQnA
Add 4 MicroService and LLM supports llamaIndex, vllm, RayServe
Enable Dataprep: extract info from table, image...etc
Add HelmChart and GenAI Microservice Connector(GMC) test

GenAIExamples

ChatQnA
- ChatQnA supports Qwen2(422b4b)
- Add no_proxy in docker compose yaml for micro services(99eb6a, 240587)
- Fix DataPrep image build in ChatQnA(2fb070)
- Add Nvidia GPU support for ChatQnA(e80e56)
- Update ChatQnA docker_compose.yaml to fix downloads failing(e948a7, f2a943)
- Chat QNA React UI with conversation history(b994bc)
- Adapt Chinese characters(2f4723)
Other examples
- Refactor Translation Example(409c723)
- Add AudioQnA with GenAIComps(b4d8e1)
- Add SearchQnA with GenAIComps(6b76a9)
- Add env for searchqna(d9b62a)
- Supports ASR on HPU(2a4860)
- Fix DocSum Gaudi building instructions(29de55)
- Add image build job in docker compose e2e gaudi test in CI(4fecd4)
CI
- Add docker build job in manifest e2e workflow(c5f309)
- Create reuse workflow for get-test-matrix in CI(961abb)
- Enable new CI runner and improve manifest e2e test scripts(26d6ea)
- Enable building latest megaservice image on push event in CI(a0b94b)
- Fix the image build refer(01eed8)
- Add build docker image option for test scripts(e32a51)
- Add e2e test of chatqna(afcb3a), codetrans(295b818), codegen(960cf38), docsum(2e62ecc))

GenAIComps

Cores
- Add aio orchestrator to boost concurrent serving(db3b4f)
- Add microservice level perf statistics(597b3c, ba1d11)
- Add Gateway for Translation(1b654d)
LLM
- Support Qwen2 in LLM Microservice(3f5cde)
- Fix the vLLM docker compose issues(3d134d)
- Enable vLLM Gaudi support for LLM service based on officially habana vllm release(0dedc2)
- Openvino support in vllm(7dbad0)
- Support Ollama microservice(a00e36)
- Support vLLM XFT LLM microservice(2a6a29, 309c2d, fe5f39)
- Add e2e test for llm summarization tgi(e8ebd9)
DataPrep
- Support Dataprep(f7443f), embedding(f37ce2) microservice with Llama Index
- Fix dataprep microservice path issue(e20acc)
- Add milvus microservice(e85033)
- Add Ray version for multi file process(40c1aa)
- Fix dataprep timeout issue(61ead4)
- Add e2e test for dataprep redis langchain(6b7bec)
- Supported image summarization with LVM in dataprep microservice(86412c)
- Enable conditional splitting for html files(e1dad1)
- Added support for pyspark in dataprep microservice(a5eb14)
- DataPrep extract info from table in the docs(953e78)
- Added support for extracting info from image in the docs(e23745)
Other Components
- Add PGvector support in Vectorstores(1b7001) and Retriever(75eff6), Dataprep(9de3c7)
- Add Mosec embedding(f76685) and reranking(a58ca4)
- Add knowledge graph components(4c0afd)
- Add LVMs LLaVA component(bd385b)
- Add asr/tts components for xeon and hpu(cef6ea)
- Add WebSearch Retriever Microservice(900178)
- Add initial pii detection microservice(e38041)
- Pinecone support for dataprep and retrieval microservice(8b6486)
- Support prometheus metrics for opea microservices(758914), (900178)
- Add no_proxy env for micro services(df0c11)
- Enable RAGAS(8a670e)
- Fix RAG performance issues(70c23d)
- Support rerank and retrieval of RAG OPT(b51675)
- Reranking using an optimized bi-encoder(574847)
- Use parameter for retriever(358dbd), reranker(dfdd08)
CI
- CI optimization to support multiple test for single kind of service(38f646)
- Update CI to support dataprep_redis path level change(5c0773)
- Enable python coverage(cd91cf)
- Add codecov(da2689)
- Enable microservice docker images auto build and push(16c5fd)

GenAIEvals

Enable autorag to automatically generate the evaluation dataset and evaluate the RAG system(b24bff)
Support document summar...

Assets 2

01 Jun 09:33

kevinintel

v0.6

aa6b0e8

Generative AI Examples v0.6 Release Notes

OPEA Highlights

Add 4 MegaService examples: CodeGen, ChatQnA, CodeTrans and Docsum, you can deploy them on Kubernetes
Enable 10 microservices for LLM, RAG, security...etc
Support text generation, code generation and end-to-end evaluation

GenAIExamples

Build 4 reference solutions for some classic GenAI applications, like code generation, chat Q&A, code translation and document summarization, through orchestration interface in GenAIComps.
Support seamlessly deployment on Intel Xeon and Gaudi platform through Kubernetes and Docker Compose.

GenAIComps

Activate a suite of microservices including ASR, LLMS, Rerank, Embedding, Guardrails, TTS, Telemetry, DataPrep, Retrieval, and VectorDB. ASR functionality is fully operational on Xeon architecture, pending readiness on Gaudi. Retrieval capabilities are functional on LangChain, awaiting readiness on LlamaIndex. VectorDB functionality is supported on Redis, Chroma, and Qdrant, with readiness pending on SVS.
Added 14 file formats support in data preparation microservices and enabled Safeguard of conversation in guardrails.
Added the Ray Gaudi Supported for LLM Service.

GenAIEvals

Add evaluating the models on text-generation tasks(lm-evaluation-harness) and coding tasks (bigcode-evaluation-harness)
Add end-to-end evaluation with microservice

GenAIInfra

Add Helm Charts redis-vector-db, TEI, TGI and CodeGen for deploying GenAIExamples on Kubernetes
Add Manifests for deploying GenAIExamples CodeGen, ChatQnA and Docsum on Kubernetes and on Docker Compose

Assets 2

Releases: opea-project/GenAIExamples

Generative AI Examples v1.5 Release Notes

OPEA Release Notes v1.5

Table of Contents

What's New in OPEA v1.5

GenAI Examples

GenAI Microservices

Productization

Validated Hardware

Validated Software

Full Changelogs

Contributors

Contributing Organizations

Individual Contributors

Uh oh!

Generative AI Examples v1.4 Release Notes

OPEA Release Notes v1.4

Table of Contents

What's New in OPEA v1.4

Advanced Agent Capabilities

Components as MCP Servers

KubeAI Operator for OPEA

New GenAI Capabilities

Better User Experience

Newly Supported Models

Newly Supported Hardware

Newly Supported OS

Updated Dependencies

Changes to Default Behavior

Validated Hardware

Validated Software

Known Issues

Full Changelogs

Contributors

Contributing Organizations

Uh oh!

Generative AI Examples v1.3 Release Notes

OPEA Release Notes v1.3

Table of Contents

What's New in OPEA v1.3

Advanced Agent Capabilities

Performance and Scalability

Ecosystem Integrations

New GenAI Capabilities

Enhanced Evaluation

Observability

Better User Experience

Exploration

Newly Supported Models

Newly Supported Hardware

Other Notable Changes

Uh oh!

Generative AI Examples v1.2 Release Notes

OPEA Release Notes v1.2

What's New in OPEA v1.2

Highlights

Code Factoring for GenAIComps

Supporting Cloud Service Providers

Enhanced GenAI Components

New GenAI Components

Enhanced GenAI Examples

Enhanced GenAIStudio

Newly Supported Models

Newly Supported Hardware

Uh oh!

Generative AI Examples v1.1 Release Notes

OPEA Release Notes v1.1

What's New in OPEA v1.1

Highlights

New GenAI Examples

Enhanced GenAI Examples

New GenAI Components

Enhanced GenAI Components

GenAIStudio

Enhanced Observability

Helm Charts Support

Long-context Benchmark Support

Newly Supported Models

Newly Supported Hardware

Notable Changes