PrivacyBench

This repository benchmarks the performance of generative AI models on legal and privacy-related tasks, with the aim of enabling appropriate and effective use of AI in legal and compliance matters.

Purpose of PrivacyBench

  • Existing industry benchmarks, such as MMLU, give a strong indication of general language understanding.
  • They are not, however, designed to measure performance on legal, privacy, and compliance tasks specifically.

Proposed Solution

  • Develop a testing method for benchmarking performance in personal data redaction.
  • Measure and report on LLM performance.
  • Identify, in particular, LLMs that can be deployed locally and efficiently, for maximum privacy and security at the lowest cost.
  • Encourage the community development of better tools through benchmarking.
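
As an illustration of what a testing method for personal data redaction could look like, here is a minimal sketch in Python. It is not the method used in this repository. It assumes personal data is annotated as character-offset spans and scores a model's predicted spans against a gold annotation by exact match, using precision, recall, and F1. The names `Span`, `RedactionScore`, `score_redaction`, and `redact` are hypothetical and introduced only for this example.

```python
from typing import Iterable, NamedTuple, Set, Tuple

# Assumption: a span is (start, end) in character offsets, end exclusive.
Span = Tuple[int, int]


class RedactionScore(NamedTuple):
    precision: float
    recall: float
    f1: float


def score_redaction(gold: Iterable[Span], predicted: Iterable[Span]) -> RedactionScore:
    """Score predicted personal-data spans against gold spans by exact match."""
    gold_set: Set[Span] = set(gold)
    pred_set: Set[Span] = set(predicted)
    true_positives = len(gold_set & pred_set)

    # Guard against division by zero when either side is empty.
    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return RedactionScore(precision, recall, f1)


def redact(text: str, spans: Iterable[Span], mask: str = "[REDACTED]") -> str:
    """Replace each span with a mask.

    Assumes spans do not overlap. Spans are applied right to left so that
    earlier offsets stay valid after each replacement.
    """
    for start, end in sorted(set(spans), reverse=True):
        text = text[:start] + mask + text[end:]
    return text


if __name__ == "__main__":
    text = "Contact Jane Doe at jane@example.com."
    gold = [(8, 16), (20, 36)]   # name and email address
    predicted = [(8, 16)]        # a model that found only the name

    print(score_redaction(gold, predicted))
    print(redact(text, predicted))
```

Exact-match scoring is the strictest choice. A real benchmark might instead credit partial overlaps or weight recall more heavily, since a missed item of personal data is usually costlier than an over-redaction.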

Call for Contributions

  • This repository is open source under the MIT License. The code and testing process are free to use with appropriate attribution (subject to third-party licenses).

Specific Tasks

  • The first use case selected for benchmarking is personal data detection and redaction; see this task in this repo for additional details.

Trademark

  • PrivacyBench is a trademark of Alex J. Wall.