Echo is a tool that enables you to retrieve information faster and ask questions about your documents with the power of AI. This project is built using Retrieval Augmented Generation (RAG) techniques. This project is built on the Next.js framework and uses LlamaIndex for RAG, Supabase for authentication and file storage, and Pinecone for storing vector embeddings.
This project uses the Meta Llama 3 model as the underlying LLM via the Hugging Face Inference API (rate limits apply).
To run this project locally, make sure all environment variables are setup properly (see .env.example). You will need a Supabase account, a Pinecone account and a Hugging Face API token. All API keys and tokens can be found in the account settings of the respective services. See individual documentation for more help.
- Go to Supabase website and create a free account. Then create a new project.
- In Settings -> Configuration -> Authentication, make sure "Allow anonymous sign-ins" is turned on.
- In Storage, create a new bucket, "echo-bucket".
- In Storage -> Configuration -> Policies, create 4 new policies as follows:
CREATE POLICY "Give users access to own folder SELECT"
ON storage.objects FOR select {USING | WITH CHECK} (
-- restrict bucket
((bucket_id = 'echo-bucket'::text)
-- only allow if authenticated user's id matches the folder name
AND (( SELECT (auth.uid())::text AS uid) = (storage.foldername(name))[1]))
);
CREATE POLICY "Give users access to own folder INSERT"
ON storage.objects FOR insert {USING | WITH CHECK} (
-- restrict bucket
((bucket_id = 'echo-bucket'::text)
-- only allow if authenticated user's id matches the folder name
AND (( SELECT (auth.uid())::text AS uid) = (storage.foldername(name))[1]))
);
CREATE POLICY "Give users access to own folder UPDATE"
ON storage.objects FOR update {USING | WITH CHECK} (
-- restrict bucket
((bucket_id = 'echo-bucket'::text)
-- only allow if authenticated user's id matches the folder name
AND (( SELECT (auth.uid())::text AS uid) = (storage.foldername(name))[1]))
);
CREATE POLICY "Give users access to own folder DELETE"
ON storage.objects FOR delete {USING | WITH CHECK} (
-- restrict bucket
((bucket_id = 'echo-bucket'::text)
-- only allow if authenticated user's id matches the folder name
AND (( SELECT (auth.uid())::text AS uid) = (storage.foldername(name))[1]))
);
- Lastly, you need to set the following environment variables in your Next.js project's .env file:
NEXT_PUBLIC_SUPABASE_URL=https://your-project-id.supabase.co
NEXT_PUBLIC_SUPABASE_ANON_KEY=your-anon-key
STORAGE_BUCKET=your-bucket-name
- Setup a Pinecone account and create a new project in the account dashboard. Copy the API key value from the dashboard to your .env file as
PINECONE_API_KEY=your-api-key
. - Create a new index by running the following command in your Next.js project directory:
npx tsx app/lib/generate-index.ts
- Go to meta-llama/Meta-Llama-3-8B-Instruct on Hugging Face. This is a gated model, so you will need to apply for access. Don't worry, access is approved almost instantly once you provide your information.
- Next, go to your HF account settings and create a new access token from Account -> Settings -> Access Token. Copy the value of this access token in your .env file as
HF_TOKEN=your-token
Firstly,
npm install
then...
npm run dev
Open http://localhost:3000 with your browser to see the result.
For more information, take a look at the following resources:
- Next.js Documentation - learn about Next.js features and API.
- Learn Next.js - an interactive Next.js tutorial.
- LlamaIndex Documentation
- Supabase Documentation
- Pinecone Documentation
- Parsing takes too long on PDFs with large number of pages
- LLM response truncates if it’s long
Please report any issues you come across while using this tool - your feedback and contributions are welcome!