A high-quality & reliable recognition toolkit for various data types, including Text-to-Speech (TTS) and Speech-to-Text (STT).
🤖 Arietta Recognition is a high-quality and reliable recognition toolkit for various data types, including Text-to-Speech (TTS) and Speech-to-Text (STT). It is developed by [Arietta Studio].
Note
Therefore, we decided to refine our implementation and make it open source, hoping to assist developers who wish to implement TTS. @arietta-studio/recognition is a high-quality TTS toolkit developed in TypeScript, which supports usage both on the server-side and in the browser.
- Server-side: With just 15 lines of code, you can achieve high-quality voice generation capabilities comparable to OpenAI's TTS service. It currently supports EdgeSpeechTTS, MicrosoftTTS, OpenAITTS, and OpenAISTT.
- Browser-side: It provides high-quality React Hooks and visual audio components, supporting common functions such as loading, playing, pausing, and dragging the timeline. Additionally, it offers a very rich set of capabilities for adjusting the audio track styles.
Run the script below using Bun: `bun index.js`
```js
// index.js
import { EdgeSpeechTTS } from '@arietta-studio/recognition';
import { Buffer } from 'buffer';
import fs from 'fs';
import path from 'path';

// Instantiate EdgeSpeechTTS
const tts = new EdgeSpeechTTS({ locale: 'en-US' });

// Create the speech synthesis request payload
const payload = {
  input: 'This is a speech demonstration',
  options: {
    voice: 'en-US-GuyNeural',
  },
};

// Call the create method to synthesize speech
const response = await tts.create(payload);

// Write the synthesized speech to an MP3 file
const mp3Buffer = Buffer.from(await response.arrayBuffer());
const speechFile = path.resolve('./speech.mp3');
fs.writeFileSync(speechFile, mp3Buffer);
```
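The `Buffer.from(await response.arrayBuffer())` step is the standard bridge from a Web `Response` body to a Node.js `Buffer`. A minimal self-contained sketch of the same pattern, using an in-memory `Response` as a stand-in for the TTS service reply (no network or API key required):

```js
import { Buffer } from 'buffer';

// Fake audio payload: the three bytes of an MP3 "ID3" tag header,
// standing in for what the TTS service would return.
const fakeAudioBytes = new Uint8Array([0x49, 0x44, 0x33]);
const response = new Response(fakeAudioBytes);

// Same conversion as in the example above:
// Web ArrayBuffer -> Node.js Buffer, ready for fs.writeFileSync
const buffer = Buffer.from(await response.arrayBuffer());

console.log(buffer.length); // 3
console.log(buffer.toString('latin1')); // "ID3"
```

`Response` is available as a global in Node.js 18+ and in Bun, so this runs without any third-party dependency.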
Important
Run on Node.js
As the Node.js environment lacks a built-in `WebSocket` instance, we need to polyfill `WebSocket`. This can be done by importing the `ws` package.
```js
// import at the top of the file
import WebSocket from 'ws';

global.WebSocket = WebSocket;
```
```tsx
import { AudioPlayer, AudioVisualizer, useAudioPlayer } from '@arietta-studio/recognition/react';

// `url` is the audio source URL to play;
// `Flexbox` is a layout component (e.g. from react-layout-kit)
export default () => {
  const { ref, isLoading, ...audio } = useAudioPlayer(url);

  return (
    <Flexbox align={'center'} gap={8}>
      <AudioPlayer audio={audio} isLoading={isLoading} style={{ width: '100%' }} />
      <AudioVisualizer audioRef={ref} isLoading={isLoading} />
    </Flexbox>
  );
};
```
Important
This package is ESM only.
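Because the package is ESM only, a consuming Node.js project must itself be able to load ES modules, either by using the `.mjs` file extension or by opting in at the package level, for example:

```json
{
  "type": "module"
}
```

With `"type": "module"` set in your `package.json`, plain `.js` files are treated as ES modules, so the `import` statements in the examples above work as written.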
To install `@arietta-studio/recognition`, run the following command:
```bash
$ pnpm i @arietta-studio/recognition
$ bun add @arietta-studio/recognition
```
Note
To work correctly with Next.js SSR, add `transpilePackages: ['@arietta-studio/recognition']` to `next.config.js`. For example:
```js
// next.config.js
const nextConfig = {
  transpilePackages: ['@arietta-studio/recognition'],
};

module.exports = nextConfig;
```
You can use GitHub Codespaces for online development:
Or clone it for local development:
```bash
$ git clone https://github.com/arietta-studio/arietta-recognition.git
$ cd arietta-recognition
$ bun install
$ bun dev
```
Contributions of all types are more than welcome. If you are interested in contributing code, feel free to check out our GitHub Issues and show us what you're made of.
Every bit counts, and your one-time donation sparkles in our galaxy of support! You're a shooting star, making a swift and bright impact on our journey. Thank you for believing in us; your generosity guides us toward our mission, one brilliant flash at a time.
Copyright © 2024 Arietta Studio.
This project is MIT licensed.