🤖 Multilingual Generative AI Avatar: Local and Cloud-based Real-Time Deployments

📖 About

This repository presents a lightweight, multilingual avatar system for real-time Human-AI interaction in Kazakh, Russian, and English. We compare two deployment architectures developed at ISSAI:

Local: Uses quantized Qolda model (4.3B parameters), Whisper Turbo ASR, and Matcha-TTS Cloud-based: Uses Oylan LLM and MangiSoz APIs

Key Results:

Local deployment is 62% faster (2.20s vs 5.74s end-to-end latency)
LLM inference: 76% faster locally (0.99s vs 4.11s)
ASR: 38% faster locally
Avatar rendering uses only 15-20% GPU at 60 FPS
On-device models enable responsive, offline multilingual interaction

✨ Features

Core Capabilities

Multilingual Support: Kazakh, Russian, and English language processing
Dual Deployment Architectures: Cloud-based and local deployment options
Real-time Human-AI Interaction: Low-latency conversational interface
3D Avatar Interface: Ready Player Me-based avatar rendering at 60 FPS
Speech Processing Pipeline: End-to-end ASR, LLM inference, and TTS synthesis

🎬 Demo

▶️ Watch the video demonstration to see the system in action.

🏗️ System Pipeline

The following diagram illustrates the complete system architecture comparing cloud-based and local deployment approaches:

🚀 Quick Start

Prerequisites

Node.js (v16 or higher)
npm or yarn package manager
Modern browser with microphone access support
MangiSoz API access (STT and TTS services)

Installation

Clone the repository:

git clone <repository-url>
cd r3f-virtual-girlfriend-frontend

Install dependencies:

npm install
# or
yarn install

Start development server:

npm run dev
# or
yarn dev

Open in browser: Navigate to http://localhost:5173/

🎮 Usage Guide

🎤 Voice Interaction

Click the microphone button to start voice recognition
Speak your question to the AI educator
Stop speaking - automatic 2-second countdown begins
Message sends automatically - no button clicking needed!
AI responds - microphone auto-pauses during response
Auto-resumes after AI finishes for seamless conversation

📁 Project Structure

src/
├── components/
│   ├── LandingPage.jsx          # Beautiful landing page
│   ├── ClassroomPage.jsx        # Main classroom wrapper
│   ├── ClassroomUI.jsx          # Zoom-like interface
│   ├── ClassroomExperience.jsx  # 3D classroom environment
│   ├── VoiceRecognition.jsx     # Voice control component
│   ├── Avatar.jsx               # AI educator 3D model
│   ├── Experience.jsx           # Original 3D scene
│   └── UI.jsx                   # Original UI (legacy)
├── hooks/
│   ├── useChat.jsx              # AI conversation management
│   ├── useMangiSozSTT.jsx       # MangiSoz STT integration
│   └── useVoiceRecognition.jsx  # Legacy voice recognition (deprecated)
├── assets/
├── App.jsx                      # Main app with routing
├── main.jsx                     # App entry point
└── index.css                    # Global styles + animations

🎨 Customization

Classroom Themes

Modify src/components/ClassroomExperience.jsx to add new environments:

Change lighting presets
Add new 3D models
Customize classroom layout

AI Educator Personality

Update src/hooks/useChat.jsx to modify:

Educational context
Subject specialization
Response style
Learning level

Voice Settings

Adjust src/hooks/useVoiceRecognition.jsx for:

Silence detection timing (default: 2 seconds)
Language settings
Audio sensitivity

🚀 Deployment

Build for Production:

npm run build
# or
yarn build

Preview Production Build:

npm run preview
# or
yarn preview

📄 License

This project is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).

You are free to:

Share — copy and redistribute the material in any medium or format
Adapt — remix, transform, and build upon the material

Under the following terms:

Attribution — You must give appropriate credit to ISSAI
NonCommercial — You may not use the material for commercial purposes

For more details, see the CC BY-NC 4.0 License.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
public		public
src		src
.gitignore		.gitignore
Colored_merged_pipeline.drawio.svg		Colored_merged_pipeline.drawio.svg
README.md		README.md
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
vite.config.js		vite.config.js
yarn.lock		yarn.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🤖 Multilingual Generative AI Avatar: Local and Cloud-based Real-Time Deployments

📖 About

✨ Features

Core Capabilities

🎬 Demo

🏗️ System Pipeline

🚀 Quick Start

Prerequisites

Installation

🎮 Usage Guide

🎤 Voice Interaction

📁 Project Structure

🎨 Customization

Classroom Themes

AI Educator Personality

Voice Settings

🚀 Deployment

Build for Production:

Preview Production Build:

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🤖 Multilingual Generative AI Avatar: Local and Cloud-based Real-Time Deployments

📖 About

✨ Features

Core Capabilities

🎬 Demo

🏗️ System Pipeline

🚀 Quick Start

Prerequisites

Installation

🎮 Usage Guide

🎤 Voice Interaction

📁 Project Structure

🎨 Customization

Classroom Themes

AI Educator Personality

Voice Settings

🚀 Deployment

Build for Production:

Preview Production Build:

📄 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages