Skip to content
Chris Booth edited this page Feb 21, 2024 · 34 revisions

Welcome to the InferGPT wiki!

North Star of how it will (probably) work:

ArchitectureFlowOverview

High level Sequence

  • User utterance sent to Director Supervisor

  • Director Supervisor reviews the current ‘world state’ by reviewing canvas, previous actions and conversation history.

  • Director Supervisor predicts the next step and picks the most suitable supervisor to handle the next step.

  • Planner Agent queries data science algorthims based on graph to make predictions based on given plan. If confidence on next steps/response is low, questions are returned to gather further information and goes in a loop until confidence high.

  • Memory agent captures every action and conversation history. And is used to retrain the long term memory based how close its initial guesses were. New information if useful to predictions is stored

Predict Next Step: For each iteration of the execution loop, Evo starts by making an informed prediction about what the best-next-step should be. Select Best Agent: Based on this prediction, Evo selects a best-fit agent persona. Contextualize Chat History: Based on the prediction from step 1, and the agent persona in step 2, the complete chat history is "contextualized" and only the most relevant messages are used for the final evaluation step. Evaluate and Execute: A final evaluation step is run to determine what agent function is executed to try and further achieve the user's goal.

Why InferGPT?

Language models are great a predicting the next token - as they are designed to. The issue though, compared to humans, is when one human requests another, we very rarely just spew out a response. Instead, we usually ask a question back. For example, if I ask you for a film recommendation, if you know me well, you would think: "Chris loves Marvel, and I know there's been a recent film released" - so you would ask: "Have you seen the latest Ant-Man?"

Alternatively, if you didn't know me well, you would ask things such as: "What genre of films do you like?"

We believe knowledge graphs are the solution to the above issue; to understand the user's current profile and ask questions based on missing context needed to solve their issue. It can then also store conversations, context and new information as time goes on - always remaining contextually updated.

Why a graph?

Graphs are great at this sort of task. They infer fast and they carry deep context with their edges. Most excitingly they also:

  1. Act as super-vector stores with Neo4j's cypher language, providing better performance vs cosine similarity methods.
  2. Make great recommendation models - graphs could even start to predict what you want to do next!

For a deeper dive I highly recommend the Neo4j Going Meta YouTube series

Clone this wiki locally