Skip to content

A curated repository of resources on bias and machine unlearning in Large Language Models.

Notifications You must be signed in to change notification settings

fabianumfalco/llm-bias-unlearning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

57 Commits
 
 
 
 

Repository files navigation

Biases and Machine Unlearning in Language Models (LLMs)

GitHub last commit Github repo stars Github watchers Github forks Hits Downloads

This is curated repository of resources on bias and machine unlearning in Large Language Models. Its primary purpose is to provide a comprehensive list of relevant resources related to these topics that can support my research and dissertation, under the advisorship of PhD Edna Dias Canedo, in the Professional Graduate Program in Electrical Engineering (PPEE) at the University of Brasília (UnB), Department of Electrical Engineering (ENE).

avatar

Table of Contents

Paper List

Fairness and Bias in AI

2025

Date Author(s) Title Keywords Venue Bib Source Code

2024

Date Author(s) Title Keywords Venue Bib Source Code
2024.08 D. Bouchard An actionable framework for assessing bias and fairness in large language model use cases Arxiv Bias, Evaluation Metrics, Fairness, Framework, Large Language Models, LLMs arXiv GitHub GitHub
2024.04 S. Caton and C. Haas Fairness in machine learning: A survey Open Fairness, accountability, transparency, machine learning ACM Comput. Surv. dblp

2023

Date Author(s) Title Keywords Venue Bib Source Code
2023.12 A. F. Oketunji, M. Anas, and D. Saina Large language model (LLM) bias index - LLMBI Arxiv Large Language Model, LLM, Model Calibration, Bias Quantification, Bias Mitigation, Algorithmic Fairness, Algorithmic Governance arXiv dblp
2023.11 E. Ferrara Should ChatGPT be biased? Challenges and risks of bias in large language models Open Artificial Intelligence, Generative AI, Bias, Large Language Models, OpenAI, ChatGPT, GPT-3, GPT-4 First Monday dblp

2022

Date Author(s) Title Keywords Venue Bib Source Code
2022.11 N. Mehrabi, F. Morstatter, N. Saxena, K. Lerman, and A. Galstyan A survey on bias and fairness in machine learning ACM Fairness and Bias in Artificial Intelligence, Machine Learning, Deep Learning, Natural Language Processing, Representation Learning ACM Comput. Surv. dblp

≤ 2021

Date Author(s) Title Keywords Venue Bib Source Code

Machine Unlearning

2025

Date Author(s) Title Keywords Venue Bib Source Code

2024

Date Author(s) Title Keywords Venue Bib Source Code
2024.06 J. Xu, Z. Wu, C. Wang, and X. Jia Machine unlearning: Solutions and challenges IEEE Machine Unlearning; Machine Learning Security; the Right to be Forgotten IEEE Trans. Emerg. Top. Comput. Intell. dblp
2024.05 A. Oesterling, J. Ma, F. P. Calmon, and H. Lakkaraju Fair machine unlearning: Data removal while mitigating disparities Arxiv Open Data Privacy; Fair Machine Learning; Fairness; Machine Unlearning; Right to Be Forgotten PMLR dblp GitHub
2024.05 H. Hu, S. Wang, T. Dong, and M. Xue Learn what you want to unlearn: Unlearning inversion attacks against machine unlearning IEEE Machine Unlearning, Privacy Vulnerability, Right to be Forgotten, Unlearning Inversion Attacks SP 2024 dblp GitHub
2024.05 M. Bertrán, S. Tang, M. Kearns, J. Morgenstern, A. Roth, and Z. S. Wu Reconstruction attacks on machine unlearning: Simple models are vulnerable Arxiv Data Privacy, Machine Unlearning, Privacy Risks in AI, Reconstruction Attacks arXiv dblp
2024.04 Z. Liu, H. Ye, C. Chen, and K.-Y. Lam Threats, attacks, and defenses in machine unlearning: A survey Arxiv Machine unlearning, threats, attacks, defenses arXiv GitHub
2024.03 N. Li et al. Machine unlearning: Taxonomy, metrics, applications, challenges, and prospects Arxiv Machine learning, machine unlearning, data privacy, federated learning arXiv dblp
2024.03 J. Foster, S. Schoepf, and A. Brintrup Fast machine unlearning without retraining through selective synaptic dampening Open Machine Unlearning, Model Performance, Retrain-Free, Selective Synaptic Dampening (SSD) AAAI 2023 dblp GitHub
2024.02 L. Wang, X. Zeng, J. Guo, K.-F. Wong, and G. Gottlob Selective forgetting: Advancing machine unlearning techniques and evaluation in language models Arxiv Machine Unlearning, Language Model, Selective Unlearning arXiv dblp

2023

Date Author(s) Title Keywords Venue Bib Source Code
2023.12 M. Kurmanji, P. Triantafillou, J. Hayes, and E. Triantafillou Towards unbounded machine unlearning Arxiv Bias Removal, Machine Unlearning, Model Utility, Right to Be Forgotten, Unlearning Algorithm NeurIPS 2023 dblp GitHub
2023.08 H. Xu, T. Zhu, L. Zhang, W. Zhou, and P. S. Yu Machine Unlearning: A Survey ACN Machine learning, deep learning, machine unlearning, sample removal, data privacy, model usabilit ACM Comput. Surv. dblp

2022

Date Author(s) Title Keywords Venue Bib Source Code

≤ 2021

Date Author(s) Title Keywords Venue Bib Source Code

Other Related

2025

Date Author(s) Title Keywords Venue Bib Source Code

2024

Date Author(s) Title Keywords Venue Bib Source Code
2024.01 B. C. Das, M. H. Amini, and Y. Wu Security and privacy challenges of large language models: A survey Arxiv Large Language Models, Security and Privacy Challenges, Defense Mechanisms. arXiv dblp

2023

Date Author(s) Title Keywords Venue Bib Source Code
2023.12 Y. Yao, J. Duan, K. Xu, Y. Cai, E. Sun, and Y. Zhang A survey on large language model (LLM) security and privacy: The good, the bad, and the ugly Arxiv Large Language Model (LLM), LLM Security, LLM Privacy, ChatGPT, LLM Attacks, LLM Vulnerabilities arXiv GitHub

2022

Date Author(s) Title Keywords Venue Bib Source Code

≤ 2021

Date Author(s) Title Keywords Venue Bib Source Code

Venues

In the context of a research paper, "venue" refers to the specific conference, journal, symposium, or workshop where the paper was published or presented.

Acronym Venue
AAAI Association for the Advancement of Artificial IntelligenceSponsorship
ACM Comput. Surv. ACM Computing Surveys
IEEE Trans. Emerg. Top. Comput. Intell. IEEE Transactions on Emerging Topics in Computational Intelligence
NeurIPS Annual Conference on Neural Information Processing Systems
PMLR Proceedings of Machine Learning Research
SP IEEE symposium on security and privacy

Related Awesome Lists

Title User GitHub Topics
Awesome Attacks on Machine Learning Privacy stratosphereips Machine-Unlearning Privacy
Awesome Bias and Fairness Datasets and Benchmarks in Language Models richhh520 Bias-AI Fairness-AI
Awesome-GenAI-Unlearning franciscoliu Generative-AI Machine-Unlearning
Awesome Large Language Model Unlearning chrisliu298 Machine-Unlearning LLM-Unlearning
Awesome Machine Unlearning tamlhp Machine-Unlearning
Awesome Trustworthy Deep Learning MinghuiChen43 Trustworthy-AI
LLM-Unlearning-Paper-List KID-22 Machine-Unlearning LLM-Unlearning
Machine Unlearning Papers jjbrophy47 Machine-Unlearning

Toolboxes

Seminar

Workshops

Talks

Blogs

Tutorials

Other Resources

About

A curated repository of resources on bias and machine unlearning in Large Language Models.

Topics

Resources

Stars

Watchers

Forks