Generative AI Examples is a collection of GenAI examples such as ChatQnA, Copilot, which illustrate the pipeline capabilities of the Open Platform for Enterprise AI (OPEA) project.

Generative AI Examples


Introduction

GenAIComps-based Generative AI examples offer streamlined deployment, testing, and scalability. All examples are fully compatible with Docker and Kubernetes and support a wide range of hardware platforms, including Intel Gaudi, Xeon, and other hardware.

Architecture

GenAIComps is a service-based tool that provides microservice components such as llm, embedding, and reranking. From these components, the examples in GenAIExamples, such as ChatQnA and DocSum, are constructed.

GenAIInfra, part of the OPEA containerization and cloud-native suite, enables quick and efficient deployment of GenAIExamples in the cloud.

GenAIEval measures service performance metrics such as throughput, latency, and accuracy for GenAIExamples. This feature helps users compare performance across various hardware configurations easily.

Getting Started

GenAIExamples offers flexible deployment options that cater to different user needs, enabling efficient use and deployment in various environments. Here’s a brief overview of the three primary methods: Python startup, Docker Compose, and Kubernetes.

Users can choose the most suitable approach based on ease of setup, scalability needs, and the environment in which they are operating.
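For a sense of what the Docker Compose route looks like, each example ships a compose file that wires its microservices together. The fragment below is a hypothetical sketch, not a real example's compose file; the image names and the `LLM_MODEL_ID` variable are illustrative only:

```yaml
# Hypothetical compose fragment illustrating the shape of an example's
# deployment: an application backend plus a model-serving microservice.
services:
  example-backend:
    image: opea/example-app:latest   # illustrative image name
    ports:
      - "8888:8888"
  llm-service:
    image: opea/example-llm:latest   # illustrative image name
    environment:
      - MODEL_ID=${LLM_MODEL_ID}     # model selected at deploy time
```

The real compose files for each example live in that example's directory and define the full pipeline (embedding, retrieval, reranking, LLM, and UI services as applicable).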

Deployment Guide

Deployments are based on released Docker images by default; check the docker image list for detailed information. You can also build your own images by following the instructions.

Prerequisite

  • For Docker Compose based deployment, you should have Docker Compose installed. Refer to docker compose install.
  • For Kubernetes based deployment, we provide three options, from the simplest (manifests) to the most powerful (GMC-based deployment).
    • You should have a Kubernetes cluster ready for use. If not, you can refer to k8s install to deploy one.
    • (Optional) You should have GMC installed in your Kubernetes cluster if you want to try GMC. Refer to GMC install for more information.
    • (Optional) You should have Helm (version >= 3.15) installed if you want to deploy with Helm Charts. Refer to the Helm Installation Guide for more information.
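The Helm version requirement above can be verified with a small shell snippet. This is a sketch: `version_ge` is a helper defined here (not part of any tool), and it assumes `helm` is on your PATH and that `sort -V` (GNU natural version ordering) is available:

```shell
# version_ge A B: succeed (exit 0) when version A >= version B,
# using sort -V so "3.9" < "3.15" compares numerically, not lexically.
version_ge() {
  [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n1)" = "$2" ]
}

required="3.15.0"
# `helm version --template '{{.Version}}'` prints e.g. "v3.15.2";
# strip the leading "v" before comparing. Falls back to 0 if helm
# is missing so the check fails gracefully.
installed="$(helm version --template '{{.Version}}' 2>/dev/null | tr -d v)"
if version_ge "${installed:-0}" "$required"; then
  echo "Helm $installed satisfies the >= $required requirement"
else
  echo "Helm not found or too old; see the Helm Installation Guide" >&2
fi
```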

Deploy Examples

| Use Case    | Docker Compose: Deployment on Xeon | Docker Compose: Deployment on Gaudi | Kubernetes with GMC  | Kubernetes with Manifests | Kubernetes with Helm Charts  |
| ----------- | ---------------------------------- | ----------------------------------- | -------------------- | ------------------------- | ---------------------------- |
| ChatQnA     | Xeon Instructions                  | Gaudi Instructions                  | ChatQnA with GMC     | ChatQnA with Manifests    | ChatQnA with Helm Charts     |
| CodeGen     | Xeon Instructions                  | Gaudi Instructions                  | CodeGen with GMC     | CodeGen with Manifests    | CodeGen with Helm Charts     |
| CodeTrans   | Xeon Instructions                  | Gaudi Instructions                  | CodeTrans with GMC   | CodeTrans with Manifests  | CodeTrans with Helm Charts   |
| DocSum      | Xeon Instructions                  | Gaudi Instructions                  | DocSum with GMC      | DocSum with Manifests     | DocSum with Helm Charts      |
| SearchQnA   | Xeon Instructions                  | Gaudi Instructions                  | SearchQnA with GMC   | Not Supported             | Not Supported                |
| FaqGen      | Xeon Instructions                  | Gaudi Instructions                  | FaqGen with GMC      | Not Supported             | Not Supported                |
| Translation | Xeon Instructions                  | Gaudi Instructions                  | Translation with GMC | Not Supported             | Not Supported                |
| AudioQnA    | Xeon Instructions                  | Gaudi Instructions                  | AudioQnA with GMC    | Not Supported             | Not Supported                |
| VisualQnA   | Xeon Instructions                  | Gaudi Instructions                  | VisualQnA with GMC   | Not Supported             | Not Supported                |

Supported Examples

Check here for detailed information on supported examples, models, hardware, etc.

Additional Content
