Resilient AI Agent

Resilient AI Agent is a full-stack, production-ready AI Doctor Assistant platform. It features a Rust backend (Axum, MongoDB, Ollama with Llama3 and other models, circuit breaker, retry, logging), a modern Next.js frontend (Tailwind, shadcn/ui), and a Python-based TTS microservice. The system is robust, modular, and cloud-native, with Docker and Kubernetes support for scalable, secure deployments. It is designed for LAN and cloud, with browser and server TTS fallback, and a clean, accessible chat UI.

Project Structure

resilient-ai-agent/
├── backend/      # Rust backend (Axum, MongoDB, Ollama, etc.)
├── frontend/     # Next.js frontend (React, Tailwind, shadcn/ui)
├── tts/          # Python TTS microservice (pyttsx3, Flask)
├── k8s/          # Kubernetes manifests
├── examples/     # Rust example code
└── README.md     # Project documentation

How to Build & Run

Backend

cd backend
cargo build
cargo run

Frontend

cd frontend
npm install
npm run dev

TTS Service

cd tts
./start_tts_server.sh

Kubernetes

All manifests are in the k8s/ folder.

Docker Compose Deployment

You can run the entire stack (backend, frontend, tts, mongo) using Docker Compose:

cd backend
docker-compose up --build

backend: Rust API (Axum, MongoDB, Ollama proxy)
frontend: Next.js UI
tts: Python TTS microservice (Flask, pyttsx3)
mongo: MongoDB database

All services are networked together. The backend and frontend containers will wait for TTS and MongoDB to be ready.

Accessing Services

Frontend: http://localhost:3000
Backend API: http://localhost:3001
TTS API: http://localhost:5002
MongoDB: localhost:27017 (for development)

Kubernetes Deployment

All Kubernetes manifests are in the k8s/ directory. To deploy the stack:

kubectl apply -f k8s/

backend-deployment.yaml: Rust API deployment, service, and secrets
frontend-deployment.yaml: Next.js UI deployment and service
tts-deployment.yaml: Python TTS deployment and service
ingress.yaml: (Optional) Ingress for routing external traffic

Accessing Services in Kubernetes

Use kubectl port-forward or configure an ingress controller to access services from outside the cluster.

Example:

kubectl port-forward svc/resilient-ai-backend 3001:3001
kubectl port-forward svc/frontend 3000:3000
kubectl port-forward svc/tts 5002:5002

Environment Variables

Backend:
- MONGODB_URI (default: mongodb://mongo:27017)
- TTS_URL (default: http://tts:5002)
Frontend:
- NEXT_PUBLIC_BACKEND_URL (default: http://backend:3001)
- NEXT_PUBLIC_TTS_URL (default: http://tts:5002)
TTS:
- No special env vars required (runs on port 5002)

Notes

All build artifacts for Rust are kept in backend/target/.
Each service has its own Dockerfile and can be built/deployed independently.
For production, update image tags and ingress as needed.
For local development, you can run each service individually or use Docker Compose.

Features

Backend (Rust)

Async Rust: Built with tokio for high performance and concurrency.
Ollama Integration: Uses ollama-rs to interact with local Ollama models.
Retry with Exponential Backoff: Automatically retries failed requests with configurable backoff.
Circuit Breaker: Prevents repeated failures from overwhelming the system.
Structured Logging: Uses env_logger and log for clear, timestamped logs.
Model Availability Check: Verifies the requested model is available before making requests.
REST API: /api/chat for chat, /api/health for health check.

Frontend (Next.js)

Next.js 14+ with App Router, TypeScript, and Tailwind CSS (official guide)
shadcn/ui for beautiful, accessible UI components (docs)
Modern chat interface: Responsive, mobile-friendly, and visually appealing
Live connection status: Shows if backend/model is available
Avatar, Card, Input, Button, ScrollArea: All UI built with shadcn/ui

UI/UX Improvements

The chat input is now always visible and fixed at the bottom of the page, just like a real chat app (no need to scroll to reach it).
All footers and branding have been removed for a clean, app-like experience.

Getting Started

1. Backend (Rust)

# In the project root
cargo run

The API will be available at http://localhost:3001
Endpoints:
- POST /api/chat — send chat messages
- GET /api/health — check model/backend status

Requirements:

Rust 1.70+
Ollama running locally
The desired model (default: mistral) installed:
```
ollama pull mistral
```

2. Frontend (Next.js)

cd frontend
npm install
npm run dev

The app will be available at http://localhost:3000
The frontend will communicate with the backend at http://localhost:3001

Example Usage

Ask health questions in the chat UI
The AI doctor will respond using the Ollama model
Connection status is shown at the top of the chat

Customization

Change the model in src/lib.rs (MODEL constant) or pass a different prompt/system prompt.
Adjust retry/circuit breaker settings in src/client.rs or via the respective config structs.
Tweak the chat UI in frontend/src/components/chat/ for your brand/style.

Deployment

Docker Compose (Recommended for Local/Dev)

Build and run both backend and frontend:

docker compose up --build

Access the frontend at http://localhost:3000 and backend at http://localhost:3001.

Docker (Manual)

Backend

docker build -t resilient-ai-backend .
docker run -p 3001:3001 resilient-ai-backend

Frontend

cd frontend
docker build -t resilient-ai-frontend .
docker run -p 3000:3000 resilient-ai-frontend

Kubernetes

Apply manifests in the k8s/ directory:

kubectl apply -f k8s/

Expose services as needed (e.g., via LoadBalancer or Ingress).

Production Environment & Secrets

Backend (Rust)

Configure secrets (e.g., Ollama API key) via environment variables.
In Kubernetes, use a Secret (see k8s/backend-deployment.yaml) and reference it in the deployment.
In Docker, use -e OLLAMA_API_KEY=... or --env-file.

Frontend (Next.js)

Set the backend API URL via NEXT_PUBLIC_API_URL.
In Kubernetes, set as an env var in the deployment.
In Docker, use -e NEXT_PUBLIC_API_URL=... or --env-file.

Example: Kubernetes Secret

apiVersion: v1
kind: Secret
metadata:
  name: ollama-secrets
stringData:
  api-key: "your-ollama-api-key-here"

API Endpoints

Backend (Rust)

POST /api/chat

Request:
```
{ "message": "What is hypertension?" }
```

Response:

{ "response": "Hypertension is...", "success": true }

GET /api/chats

Response:

[ { "id": "...", "content": "...", "role": "user|assistant", "timestamp": "..." }, ... ]

GET /api/health

Response:

{ "status": "ok", "model": "mistral", ... }

GET /api/tts/voices
- Response:
```
["Samantha", "Alex", "Daniel", ...]
```
POST /api/tts
- Request:
```
{ "text": "Hello, this is a test.", "voice": "Samantha" }
```
- Response:
  - On success: WAV audio file
  - On error: { "error": "TTS server unavailable" } (Content-Type: application/json)

TTS Fallback & Robustness

If the backend cannot reach the TTS server, it returns a JSON error. The frontend automatically falls back to browser TTS (using the Web Speech API) so users always hear responses.
TTS settings (enable/disable, mute, select voice) are available in the frontend settings page.

Types of Tests

Unit Tests:
- Test individual functions, methods, or small modules in isolation.
- Fast, do not require external systems (e.g., database, network).
- Example: serialization/deserialization tests for types, logic in a single function.
Integration Tests:
- Test how multiple parts of the system work together.
- May require external systems (e.g., database, TTS server).
- Example: API endpoint tests that hit the running backend and check real responses.
Functional Tests:
- Test a specific feature or user-facing functionality end-to-end.
- May span multiple modules or services, but focus on a user story or workflow.
- Example: Simulate a user asking a question and receiving a spoken answer.
Smoke Tests:
- Basic tests to check that the most important parts of the system work at all.
- Often run after deployment to catch major breakage quickly.
- Example: Check that the backend starts and responds to /api/health.

Running Tests

All tests:
```
cd backend
cargo test
```
This will run all unit, integration, functional, and smoke tests in backend/tests/.

Run a specific test file:

cargo test --test <test_file_name>
# Example:
cargo test --test types
cargo test --test integration_api

Test file organization:
- All test files are in backend/tests/.
- Each .rs file is a separate test crate (e.g., types.rs, integration_api.rs).
- Add new functional or smoke tests as new files (e.g., functional_login.rs, smoke_health.rs).

Credits

Kubernetes Production Deployment (AWS/EKS Example)

The k8s/ directory now contains manifests for a full production deployment:

mongo-statefulset.yaml: MongoDB StatefulSet with persistent EBS storage and secret
ollama-deployment.yaml: Ollama model server with persistent EBS storage and secret (preloads and serves Llama3 model automatically)
backend-deployment.yaml: Rust backend API
frontend-deployment.yaml: Next.js frontend
tts-deployment.yaml: Python TTS microservice
ingress.yaml: Ingress for HTTPS routing (uses cert-manager and nginx)

Prerequisites

An EKS (or other Kubernetes) cluster with storage class gp2 (EBS)
kubectl configured for your cluster
cert-manager and nginx ingress controller installed (for HTTPS)

Deploy the Full Stack

kubectl apply -f k8s/mongo-statefulset.yaml
kubectl apply -f k8s/ollama-deployment.yaml
kubectl apply -f k8s/backend-deployment.yaml
kubectl apply -f k8s/tts-deployment.yaml
kubectl apply -f k8s/frontend-deployment.yaml
kubectl apply -f k8s/ingress.yaml

All persistent data (MongoDB, Ollama models) will be stored on EBS volumes.
Secrets for database and model API keys are managed as Kubernetes secrets.
Ingress provides HTTPS access to the frontend and API (edit ingress.yaml for your domain).

Accessing Services

Frontend: https://ai.example.com/
Backend API: https://ai.example.com/api
TTS API: internal service tts:5002
MongoDB: internal service mongo:27017
Ollama: internal service ollama:11434

Roadmap: Security Features

The following security features are planned or in progress to ensure the safety, privacy, and robustness of the Resilient AI Agent platform:

Authentication & Authorization
- Add user authentication (OAuth2, JWT, or SSO integration)
- Role-based access control for sensitive endpoints (e.g., admin, doctor, patient)
API Security
- Rate limiting and abuse prevention on all public endpoints
- Input validation and sanitization to prevent injection attacks
- CORS policy hardening for production
Data Protection
- Encrypt sensitive data at rest (MongoDB encryption)
- Encrypted secrets management (Kubernetes Secrets, Docker secrets)
- Secure handling of environment variables and API keys
Transport Security
- Enforce HTTPS everywhere (Ingress, frontend, backend)
- Automatic TLS certificate management (cert-manager)
Audit & Monitoring
- Structured, tamper-resistant logging for all services
- Audit trails for user actions and admin operations
- Integration with monitoring/alerting (Prometheus, Grafana, Sentry)
Vulnerability Management
- Automated dependency scanning (GitHub Dependabot, cargo audit, npm audit)
- Regular security updates for all base images and dependencies
Privacy & Compliance
- Data retention and deletion policies (user chat history, logs)
- GDPR/CCPA compliance roadmap
AI Safety
- Prompt injection mitigation and output filtering
- Guardrails to prevent unsafe or non-medical advice

Contributions and suggestions for additional security features are welcome!

Data Protection

MongoDB Encryption at Rest: For production, enable MongoDB's encryption at rest (see MongoDB docs). The provided mongo-statefulset.yaml is ready for use with encrypted storage classes and can be extended to enable encryption options.
Kubernetes Secrets: All sensitive credentials (MongoDB root password, Ollama API key, etc.) are managed via Kubernetes Secrets and never hardcoded. See k8s/mongo-statefulset.yaml and k8s/backend-deployment.yaml for examples.
Environment Variables: The backend loads all secrets and sensitive config from environment variables, never from code or public files. This includes database URIs and API keys.
Docker Secrets: For Docker Compose or Swarm, use Docker secrets for production deployments to avoid exposing secrets in environment variables.

Best Practices:

Always rotate secrets regularly and never commit them to version control.
Use encrypted storage (EBS, etc.) for all persistent data in production.
Review and restrict access to secrets in your cloud provider and Kubernetes cluster.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
backend		backend
frontend		frontend
k8s		k8s
tts		tts
.gitignore		.gitignore
README.md		README.md

philodi-dev/rust-production-ai-assistant-app

Folders and files

Latest commit

History

Repository files navigation

Resilient AI Agent

Project Structure

How to Build & Run

Backend

Frontend

TTS Service

Kubernetes

Docker Compose Deployment

Accessing Services

Kubernetes Deployment

Accessing Services in Kubernetes

Environment Variables

Notes

Features

Backend (Rust)

Frontend (Next.js)

UI/UX Improvements

Getting Started

1. Backend (Rust)

2. Frontend (Next.js)

Example Usage

Customization

Deployment

Docker Compose (Recommended for Local/Dev)

Docker (Manual)

Backend

Frontend

Kubernetes

Production Environment & Secrets

Backend (Rust)

Frontend (Next.js)

Example: Kubernetes Secret

API Endpoints

Backend (Rust)

TTS Fallback & Robustness

Types of Tests

Running Tests

Credits

Kubernetes Production Deployment (AWS/EKS Example)

Prerequisites

Deploy the Full Stack

Accessing Services

Roadmap: Security Features

Data Protection

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages