Capybara: Browser-based Phishing Detection
Just as the Catholic Church once classified capybaras as "fish" for Lent (despite them being decidedly not fish), our Chrome extension and (soon-to-be-released) AI model help users identify what is and isn't a phish.
The capybara's historical misclassification makes it the perfect mascot for anti-phishing software!
Capybara is a Chrome extension that leverages advanced AI to protect users from phishing attempts and malicious emails in Gmail. By analyzing email content in real-time, it provides instant risk assessments and security recommendations.
- 🤖 Advanced AI-powered email analysis using GPT-4
- 🔒 Privacy-focused with minimal data collection
- ⚡ Instant risk assessment for incoming emails
- 🎯 Context-aware analysis of business communication
- 🎨 Clean, unobtrusive UI integration with Gmail
- 📱 Responsive design that works across devices
- 🔐 Secure by design - your API key stays local
- 💻 Optional local LLM mode in production builds
- Product Specification - Detailed product requirements and specifications
- ML Architecture - Technical design document for the ML model
- WebLLM Integration - Guide for WebLLM integration and local inference
Common Requirements:
- Node.js (v18 or higher)
- Chrome Browser (v88 or higher)
- ImageMagick (for icon generation)
For OpenAI API Mode (Default):
- OpenAI API key
- Internet connection
For Local LLM Mode:
- NVIDIA GPU requirements:
- Minimum: NVIDIA GPU with 6GB VRAM
- Recommended: NVIDIA GPU with 8GB+ VRAM and CUDA support
- Best Performance: NVIDIA RTX 3070 or better
- Chrome with WebGPU enabled (see the availability check after this list)
- Model compatibility varies by GPU:
- Entry GPUs (6GB VRAM): Limited to heavily quantized models
- Mid-range GPUs (8GB VRAM): Can run most quantized models
- High-end GPUs (12GB+ VRAM): Can run larger, more accurate models
- Not all models will work on all GPUs due to:
- VRAM limitations
- Compute capabilities
- Quantization requirements
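Whether Chrome actually exposes WebGPU on your machine can be verified before a local model is loaded, either from extension code or the DevTools console. A minimal sketch using the standard WebGPU API:

```js
// Check that WebGPU is available before attempting local inference.
async function checkWebGPU() {
  if (!('gpu' in navigator)) {
    console.warn('WebGPU is not supported in this browser.');
    return false;
  }
  // requestAdapter() resolves to null when no suitable GPU is found.
  const adapter = await navigator.gpu.requestAdapter();
  if (!adapter) {
    console.warn('No WebGPU adapter found - check chrome://gpu.');
    return false;
  }
  return true;
}
```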
🔬 Coming Soon: We're training a specialized, state-of-the-art model for email security analysis. This model will:
- Be optimized for phishing detection using a large corpus of real-world data
- Require less VRAM through advanced quantization techniques
- Provide better instruction following for security analysis
- Offer improved accuracy while maintaining small size
- Support more GPUs through various quantization options
- Copy the template configuration file:

  ```bash
  cp config.template.js config.js
  ```

- Edit `config.js` with your settings:

  ```js
  export const CONFIG = {
    // OpenAI API mode:
    OPENAI_API_KEY: 'your-api-key-here',
    provider: 'openai',           // This is the default
    model: 'gpt-4',               // Primary model to use
    backupModel: 'gpt-3.5-turbo', // Fallback model for rate limits/overload

    // Or for WebLLM local mode:
    // provider: 'webllm',
    // model: 'SmolLM2-360M-Instruct-q4f16_1-MLC', // Local model to use
  };
  ```
Available configuration options:

- `provider`: Either 'openai' (default) or 'webllm'
- `OPENAI_API_KEY`: Your OpenAI API key (required for OpenAI mode)
- `model`: The primary model to use
  - For OpenAI: 'gpt-4' (default), 'gpt-3.5-turbo', etc.
  - For WebLLM: Model options depend on your GPU:
    - 6GB VRAM: 'Llama-3.2-3B-Instruct-q4f16_1-MLC' (heavily quantized; we recommend using the latest MLC Llama model)
- `backupModel`: Fallback model for OpenAI mode when rate limited (defaults to 'gpt-3.5-turbo')
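To illustrate how `model` and `backupModel` interact, here is a sketch of the intended rate-limit fallback. It mirrors the behavior described above but is not the extension's actual code:

```js
import { CONFIG } from './config.js';

// Minimal OpenAI chat call; throws with the HTTP status on failure.
async function callOpenAI(model, prompt) {
  const res = await fetch('https://api.openai.com/v1/chat/completions', {
    method: 'POST',
    headers: {
      'Content-Type': 'application/json',
      'Authorization': `Bearer ${CONFIG.OPENAI_API_KEY}`,
    },
    body: JSON.stringify({ model, messages: [{ role: 'user', content: prompt }] }),
  });
  if (!res.ok) throw Object.assign(new Error(`HTTP ${res.status}`), { status: res.status });
  const data = await res.json();
  return data.choices[0].message.content;
}

// Retry once with the backup model on rate limiting or overload.
async function analyzeWithFallback(prompt) {
  try {
    return await callOpenAI(CONFIG.model, prompt);
  } catch (err) {
    if (err.status === 429 || err.status === 503) {
      return callOpenAI(CONFIG.backupModel, prompt);
    }
    throw err;
  }
}
```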
⚠️ Local Model Note: Model compatibility is highly dependent on your GPU's capabilities. Some models may fail to load or run slowly if your GPU doesn't meet the requirements. We recommend starting with the default model and adjusting based on your GPU's performance.
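WebLLM exposes an OpenAI-compatible API, so a quick smoke test of whether a given model loads on your hardware is straightforward. A sketch assuming the `@mlc-ai/web-llm` package, with the default local model from the configuration above:

```js
import { CreateMLCEngine } from '@mlc-ai/web-llm';

// Try to load a model and run one tiny completion; returns false on failure.
async function smokeTestModel(modelId) {
  try {
    const engine = await CreateMLCEngine(modelId, {
      initProgressCallback: (report) => console.log(report.text),
    });
    const reply = await engine.chat.completions.create({
      messages: [{ role: 'user', content: 'Reply with the word "ok".' }],
    });
    console.log(reply.choices[0].message.content);
    return true;
  } catch (err) {
    console.warn(`Model ${modelId} failed to load or run:`, err);
    return false;
  }
}

smokeTestModel('SmolLM2-360M-Instruct-q4f16_1-MLC');
```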
⚠️ IMPORTANT NOTE: This extension requires a production build to run (`npm run build`). Development mode is not supported due to Chrome's Content Security Policy restrictions. After any configuration changes, you'll need to rebuild the extension and refresh it in Chrome.
⚠️ SECURITY NOTE: Never commit your `config.js` file or share your API keys. The file is listed in `.gitignore` to prevent accidental commits.
For CI/CD pipelines, set the following secrets in your GitHub repository:

- `OPENAI_API_KEY`: Your OpenAI API key
- `CODECOV_TOKEN`: Your Codecov token
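Because `config.js` is gitignored, a CI build has to generate it from those secrets before running `npm run build`. A hypothetical Node helper (the script name and location are assumptions, not part of this repo):

```js
// scripts/write-config.js - hypothetical CI helper that generates config.js
// from environment secrets so the gitignored file never enters the repo.
import { writeFileSync } from 'node:fs';

const key = process.env.OPENAI_API_KEY;
if (!key) throw new Error('OPENAI_API_KEY is not set');

writeFileSync(
  'config.js',
  `export const CONFIG = {
  OPENAI_API_KEY: '${key}',
  provider: 'openai',
  model: 'gpt-4',
  backupModel: 'gpt-3.5-turbo',
};
`
);
```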
- Clone the repository:

  ```bash
  git clone https://github.com/yourusername/capybara.git
  cd capybara
  ```

- Install dependencies:

  ```bash
  npm install
  ```

- Set up configuration (see Configuration section above)

- Generate extension icons:

  ```bash
  ./create_icons.sh
  ```

- Build the extension:

  ```bash
  npm run build
  ```

- Load the extension in Chrome:
  - Open Chrome and navigate to `chrome://extensions`
  - Enable "Developer mode"
  - Click "Load unpacked" and select the `dist` directory
Build the extension:

```bash
npm run build
```

In Chrome:

- Go to `chrome://extensions`
- Enable "Developer mode"
- Click "Load unpacked" and select the `dist` directory
- After making changes, run `npm run build` again
- Click the refresh icon on Chrome's extensions page
Run tests:

```bash
npm test
```
Note: Due to Chrome's Content Security Policy restrictions and WebLLM's requirements, we can only run the extension from production builds. The development server (`npm run dev`) is not supported.
We maintain high test coverage and quality standards:
- Unit tests using Jest
- Integration tests for Chrome API interactions
- End-to-end tests for Gmail integration
- Coverage thresholds: 95% statements, 90% branches
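Thresholds like these are typically enforced through Jest's `coverageThreshold` option in `jest.config.js`. A sketch matching the numbers above (this repo's exact config may differ):

```js
// jest.config.js - fail the test run if coverage drops below the gates.
module.exports = {
  collectCoverage: true,
  coverageThreshold: {
    global: {
      statements: 95,
      branches: 90,
    },
  },
};
```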
Run the full test suite:

```bash
npm run test:all
```
Generate a test coverage report:

```bash
npm run coverage
```
The extension is built with a modular architecture:

- `src/content/` - Gmail integration and UI components
- `src/background/` - Chrome extension background scripts
- `src/ml/` - Machine learning model integration
- `src/utils/` - Shared utilities and helpers
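The `src/content/` and `src/background/` layers talk through standard Chrome extension message passing. A simplified sketch, where the message type and the `analyzeEmail` helper are illustrative rather than the repo's actual names:

```js
// src/content/ - ask the background script to analyze an opened email.
function requestAnalysis(email) {
  chrome.runtime.sendMessage(
    { type: 'ANALYZE_EMAIL', ...email },
    (result) => console.log('Risk assessment:', result)
  );
}

// src/background/ - route analysis requests to the configured provider.
chrome.runtime.onMessage.addListener((msg, sender, sendResponse) => {
  if (msg.type === 'ANALYZE_EMAIL') {
    analyzeEmail(msg).then(sendResponse); // illustrative wrapper over src/ml/
    return true; // keep the message channel open for the async response
  }
});
```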
We welcome contributions! Please see our Contributing Guide for details. Key points:
- Fork the repository
- Create a feature branch
- Commit your changes
- Push to your branch
- Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
Capybara is designed with privacy and security in mind:
- Two operation modes:
  - OpenAI API Mode (Default, Recommended):
    - Email content is sent to OpenAI's API for analysis
    - The following data is sent:
      - Email subject
      - Email body content
      - Attachment names and types (but not the files themselves)
    - The data is sent as part of a structured prompt that asks for phishing analysis (see the sketch after this list)
    - All API calls use encrypted HTTPS
    - Your OpenAI API key is stored locally and never shared
  - Local LLM Mode (Privacy-Focused):
    - Uses WebLLM to run a smaller model locally on your GPU
    - No data leaves your computer
    - Performance and compatibility depend on your GPU:
      - Higher-end GPUs: Better accuracy, more model options
      - Mid-range GPUs: Good balance with quantized models
      - Entry-level GPUs: Limited to smaller, quantized models
    - Model selection impacts:
      - Analysis accuracy
      - Processing speed
      - Memory usage
      - GPU compatibility
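For a sense of what that structured prompt contains, here is a hypothetical assembly of the extracted fields. The wording and field names are illustrative, not the extension's exact prompt:

```js
// Hypothetical prompt assembly - the extension's actual wording may differ.
function buildPrompt(email) {
  const attachments = email.attachments
    .map((a) => `${a.name} (${a.type})`)
    .join(', ') || 'none';
  return `Analyze this email for phishing indicators.
Subject: ${email.subject}
Body: ${email.body}
Attachments: ${attachments}

Respond with a risk level (low/medium/high) and a short explanation.`;
}
```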
- No data persistence:
  - Analysis results are shown in the UI but not stored
  - No tracking or analytics
  - No user behavior monitoring
- Local processing where possible:
  - UI rendering and DOM manipulation happen locally
  - Email parsing is done in the browser
  - Results caching is temporary and in-memory only
- When an email is opened, the extension extracts:
  - Subject line
  - Email body content
  - List of attachments (names and types only)
- This content is analyzed either:
  - By OpenAI's GPT-4 API (default mode) with a prompt that asks for:
    - Phishing risk assessment
    - Analysis of business communication patterns
    - Evaluation of attachment risk
    - Context-aware false positive reduction
  - Or by WebLLM locally (if configured) with a simpler analysis model
- The API/local model returns a structured analysis with risk levels and explanations
- The assessment is displayed in the UI
- All data is cleared from memory when you close the email
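The "structured analysis" above can be pictured as a small object the UI renders directly. A hypothetical shape (the real field names may differ):

```js
// Hypothetical shape of a returned assessment.
const exampleAssessment = {
  riskLevel: 'high',   // 'low' | 'medium' | 'high'
  confidence: 0.92,    // model confidence in the verdict
  reasons: [
    'Sender domain does not match the claimed organization',
    'Urgent payment request with an unexpected attachment',
  ],
  recommendation: 'Do not open the attachment; verify the sender out-of-band.',
};
```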
⚠️ Note: While we take security seriously, using this extension means either trusting OpenAI with your email content (in default mode) or accepting reduced accuracy with local processing (in WebLLM mode). Please review OpenAI's privacy policy and make an informed decision. If you handle sensitive information, consider using the local LLM mode despite its limitations.
- 📫 For bugs and features, open an issue
- 💬 For questions, start a discussion
- 🤝 For contributing, see CONTRIBUTING.md
If you find this project useful, please consider giving it a star on GitHub!
- WebLLM for the local ML processing
- All our amazing contributors
- The open-source community
Made with ❤️ by the Capybara team