
New Experimental Training Features, Providers Refactor #1155

Merged
110 commits merged into main from lean-agixt on Apr 2, 2024

Conversation

Josh-XT
Owner

@Josh-XT Josh-XT commented Mar 29, 2024

Providers Refactor

  • Removed many providers and extensions during transition.
  • Added vision for Claude provider.
  • Automated model selection for vision for OpenAI and Claude.
  • Added a default provider that uses gpt4free for LLM, faster-whisper for audio transcription/translation, streamlabs for text-to-speech, ONNX all-MiniLM-L6-v2 embedder (256 chunk size), and stable diffusion on Hugging Face for image generation (Requires HUGGINGFACE_API_KEY).

Refactor TTS, Audio to Text, Embeddings, and Image Generation to Providers

Capabilities like TTS, audio to text, embeddings, and image generation are now exposed as provider services, instead of being implemented as separate extensions for each provider.

Each provider now has a services property, which is a list of the services available from that provider. Providers with an embeddings service also have a chunk_size property for the embedder.

For example, the OpenAI provider has:

self.chunk_size = 1024

@staticmethod
def services():
    return [
        "llm",            # Language model
        "tts",            # Text to speech
        "image",          # Image generation
        "embeddings",     # Embeddings creation
        "transcription",  # Audio transcription to text
        "translation",    # Audio translation to text in English
    ]
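A provider's services() declaration can then drive provider selection at runtime. The sketch below is illustrative only: the classes are hypothetical stand-ins, not AGiXT's actual provider classes, and only the embedding-capable one carries chunk_size.

```python
# Hedged sketch: stand-in provider classes, not AGiXT's actual implementation.

class OpenAIProvider:
    chunk_size = 1024  # only embedding-capable providers expose this

    @staticmethod
    def services():
        return [
            "llm",
            "tts",
            "image",
            "embeddings",
            "transcription",
            "translation",
        ]


class StreamlabsProvider:
    @staticmethod
    def services():
        return ["tts"]


def providers_for(service, providers):
    """Return every provider class that advertises the given service."""
    return [p for p in providers if service in p.services()]


embedders = providers_for("embeddings", [OpenAIProvider, StreamlabsProvider])
chunk_sizes = {p.__name__: p.chunk_size for p in embedders}
```

Filtering on services() like this is one way a caller could route an embeddings request only to providers that can actually serve it.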

New Experimental Training Features

These new training features require some testing and will improve as better training methods become available. The first training implementation I have built in is DPO. Open to feedback and improvements.

DPO, CPO, and ORPO style Dataset Creation Functionality Created

  • AGiXT can now take all memories created and turn them into a synthetic question / good answer / bad answer dataset in DPO / CPO / ORPO format, to be used in Transformers (or the solution of your choice) to fine-tune models.
  • API endpoint: /api/agent/{agent_name}/memory/dataset
  • Once the dataset has been created, it can be found at AGiXT/agixt/WORKSPACE/{dataset_name}.json.
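For reference, one record in a DPO-style dataset typically pairs a prompt with a preferred and a rejected answer. The field names below (prompt/chosen/rejected) follow the common DPO convention and are an assumption about the output shape, not taken from AGiXT's source:

```python
import json

# Hypothetical DPO-format record; the prompt/chosen/rejected field names are
# an assumption based on DPO convention, not AGiXT's confirmed schema.
record = {
    "prompt": "What is the services() property on a provider?",
    "chosen": "A list of the services that provider makes available.",
    "rejected": "A list of the provider's API keys.",
}

# The generated {dataset_name}.json would hold many such records.
serialized = json.dumps([record], indent=2)
```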

Example with Python SDK

The example below will consume the AGiXT GitHub repository into the agent's memory, then create a synthetic dataset with the learned information.

from agixtsdk import AGiXTSDK

agixt = AGiXTSDK(base_uri="http://localhost:7437", api_key="Your AGiXT API Key")

# Define the agent we're working with
agent_name="gpt4free"

# Consume the whole AGiXT GitHub Repository to the agent's memory.
agixt.learn_github_repo(
    agent_name=agent_name,
    github_repo="Josh-XT/AGiXT",
    collection_number=0,
)

# Create a synthetic dataset in DPO/CPO/ORPO format.
agixt.create_dataset(
    agent_name=agent_name, dataset_name="Your_dataset_name", batch_size=5
)

Model Training Based on Agent Memories

Training is finally a full process instead of stopping at the memories. After your agent learns from a GitHub repo, files, arXiv articles, websites, or YouTube captions, you can use the new training endpoint to:

  • Turn all of the agent's memories into a synthetic DPO/CPO/ORPO-format dataset
  • Turn the dataset into a DPO QLoRA with unsloth
  • Merge it into the model of your choosing to make your own model from the data you trained your AGiXT agent on
  • Upload your new model to Hugging Face once complete (public or private via the private_repo boolean), if your agent has a HUGGINGFACE_API_KEY in its config

from agixtsdk import AGiXTSDK

agixt = AGiXTSDK(base_uri="http://localhost:7437", api_key="Your AGiXT API Key")

# Define the agent we're working with
agent_name="gpt4free"

# Consume the whole AGiXT GitHub Repository to the agent's memory.
agixt.learn_github_repo(
    agent_name=agent_name,
    github_repo="Josh-XT/AGiXT",
    collection_number=0,
)

# Train the desired model on a synthetic DPO dataset created from the agent's memories.
agixt.train(
    agent_name=agent_name,
    dataset_name="dataset",
    model="unsloth/mistral-7b-v0.2",
    max_seq_length=16384,
    huggingface_output_path="JoshXT/finetuned-mistral-7b-v0.2",
    private_repo=True,
)

Chat Completions endpoint modifications

Several modifications have been made to the Chat Completions endpoint to bring it more in line with the OpenAI endpoints. These modifications were in addition to the changes in #1154.
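As a sketch of what OpenAI-style compatibility implies, the request body below follows the OpenAI chat-completions schema. Treating the model field as the AGiXT agent name and the /v1/chat/completions path are assumptions based on this PR, not confirmed routes:

```python
import json

# OpenAI-style chat-completions payload. Using the agent name as "model" and
# posting to /v1/chat/completions are assumptions, not confirmed from the PR.
payload = {
    "model": "gpt4free",  # AGiXT agent name (assumed mapping)
    "messages": [
        {"role": "user", "content": "Summarize the providers refactor."}
    ],
}
body = json.dumps(payload)

# POST `body` to http://localhost:7437/v1/chat/completions with your AGiXT
# API key in the Authorization header (request not executed here).
```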

@Josh-XT Josh-XT changed the title Refactor TTS, Audio to Text, and Image Generation to Providers Refactor TTS, Audio to Text, Embeddings, and Image Generation to Providers Mar 29, 2024
@Josh-XT Josh-XT changed the title Refactor TTS, Audio to Text, Embeddings, and Image Generation to Providers Providers Refactor, Dataset Creation Functionality Created Mar 29, 2024
@Josh-XT Josh-XT merged commit 409909b into main Apr 2, 2024
8 checks passed
@Josh-XT Josh-XT deleted the lean-agixt branch April 2, 2024 18:05