6 Chapter 6: AI Ethics
Have you ever wondered what a food forest is? If you haven’t, you’re not alone. I (Sanae), too, hadn’t given it much thought until my son, who is so in tune with nature and his role in safeguarding biodiversity, brought it to my attention. His explanation emphasizes sustainability, diversity, resilience, and community well-being. To him, a food forest is like a symphony of nature, orchestrated with layers of abundance and interwoven plants, shrubs, and trees. Such interconnectedness of nature creates a resilient environment and protects biodiversity by bringing back animals to lands they had been driven from. Imagining it in Bolivia, he talks about the towering fruit trees stretching their limbs toward the sky, their branches heavy with mangos, papayas, and jackfruit. Beneath them, he imagines a middle layer of shrubs of blueberries, raspberries, and elderberries. Closer to the ground, he pictures vegetables and herbs flourishing.
For my son, a food forest is a testament to harmony, with each element supporting the others to create sustainability. Unlike monocultures, he explains, trees and perennial plants in a food forest capture and store carbon dioxide from the atmosphere, mitigating the effects of climate change. When I think about AI and its ethics, I am reminded of the harmony of the food forest. AI ethics promotes sustainable AI development, ensuring technologies are built to last, minimize harm, and adapt to changes. Ethical AI practices aim to create systems that can handle diverse scenarios while striving to create technologies that benefit society and preserve trust and integrity. When I prompted ChatGPT for a metaphor that compares AI ethics to a food forest, this is what it generated.
![]() |
AI ethics is like managing a food forest, where the focus is on cultivating a self-sustaining and diverse ecosystem. Just as a food forest integrates layers of trees, shrubs, and ground plants to create a resilient and productive environment, AI ethics involves addressing issues like algorithmic bias, user privacy, transparency, and fairness. In a food forest, every element is interconnected, working together to support the whole system, much like how ethical AI practices ensure that technology benefits society while maintaining trust and integrity. Both aim to create a harmonious and sustainable environment, fostering growth that respects and nourishes the community and the ecosystem. (ChatGPT 4o, 2024) |
Back in 1962, comics legend Stan Lee wrote, “With great power comes great responsibility,” in the closing narration of Amazing Fantasy #15. The quote, later attributed to Uncle Ben in the Spider-Man series, has since become an enduring adage in American pop culture. Today, in the era of artificial intelligence, these words remain profoundly relevant.
At its core, ethics is the branch of philosophy concerned with moral principles that govern human behavior. It explores what is right and wrong, how individuals should act, and the broader implications of those actions on society. When applied to artificial intelligence, AI ethics refers to the moral considerations and responsibilities involved in developing and using AI technologies. Moral considerations include fundamental ethical principles such as harm prevention, fairness, privacy, and trust, ensuring that AI systems align with human values and do not cause unintended harm. For example, an AI-powered hiring tool must prevent discrimination and ensure equal opportunity. It encompasses issues such as fairness, transparency, accountability, and the broader societal impact of AI-driven systems. Responsibilities, on the other hand, define who must act to uphold these ethical standards. Developers must mitigate bias in AI models, companies must ensure ethical deployment, and policymakers must regulate AI’s societal impact. For instance, governments may require transparency in AI decision-making, such as disclosing when AI is used in hiring or loan approvals.[1] [2]
As we explored in earlier chapters, AI wields immense power, reshaping the fabric of our existence at an unprecedented pace. It permeates every aspect of modern life—from education and healthcare to finance and environmental stewardship. AI fuels idea generation, tailors education to individual learning styles, aids in medical diagnosis, and contributes to climate change mitigation. Yet, alongside its transformative potential, AI also carries significant ethical risks.
Recent research highlights key concerns about AI’s ethical impact, including bias (30%), inaccurate information (33.3%), citation inaccuracies (16.7%), legal issues (11.7%), misleading information (8.3%), and copyright concerns (6.7%).[3] As we've discussed in previous chapters, machine learning models—a type of AI system—are trained on data that may inherently contain human biases. As you can see, such inherent biases can cause AI systems to learn and replicate these biased patterns, resulting in unfair or discriminatory results.
When discussing bias in AI, it's essential to acknowledge its potential transfer from human cognition to AI systems during data training processes. We begin by discussing human biases. Bias is a multifaceted concept explored across various disciplines, including humanities and critical theory, highlighting an unequal balance among individuals. It represents the echoing of centuries-old stereotypes and ingrained prejudices manifested in our thoughts and actions, which colors our perceptions in mainly negative ways toward others. The internalization of such bias begins in early childhood, sipping itself through norms, predispositions, and inclinations that influence our perceptions, judgments, and decisions.
Human biases are difficult to define and may be hard to detect. In general, they are slant or a limited perspective that could be categorized as avoidable, dangerous, or neutral.[4] They arise from learned behaviors, societal norms, or innate predispositions, leading to unfair or prejudicial treatment. These biases may be subconscious, and individuals may not always be aware of their presence or impact on others. Despite some biases being positive or beneficial, the majority tend to be negative or damaging, affecting interactions based on immutable characteristics such as race, ethnicity, gender, religion, class, sexual orientation, disability, mental illness, socioeconomic background, or educational attainment.
Types of AI Fairness

In AI discourse, the theme of fairness has garnered considerable attention, serving as a pivotal cornerstone in ensuring equitable and nondiscriminatory actions across all user interactions within the AI domain. Across the breadth of scholarly inquiry, fairness has been meticulously examined illuminating the intricate ethical landscape of AI. These categories have long been fixtures in the discourse surrounding AI ethics, shaping the trajectory of AI development since its inception. Extensive literature has meticulously dissected notions of fairness into various typologies, each offering unique insights into the multifaceted landscape of AI ethics.[7] [8] [9] [10] Within the pages of this chapter, our focus narrows to three distinct categories of fairness, causal, individual, and group that bear profound relevance to the utilization and evolution of AI systems.
Causal
Individual
Group
Group fairness is concerned with ensuring that different groups are treated equally or proportionally by AI systems in terms of demographic parity, mistreatment disparagements, and equal opportunity. First, a facet of group fairness is demographic parity, which aims to distribute positive and negative outcomes equally across various demographic groups to ensure that no particular group faces disproportionate advantages or disadvantages because of the AI system's decisions. Second is disparate mistreatment, which focuses on minimizing the likelihood of erroneous classifications . Finally, equal opportunity strives to maintain fairness in terms of predictive accuracy across demographic groups, ensuring that individuals from different backgrounds have equal access to opportunities provided by AI systems. As such, AI systems can avoid perpetuating biases or discriminating against specific groups.[13] [14]
Other Forms of Fairness
Intersectional Fairness
Our discussion has primarily revolved around three key categories of fairness: causal, individual, and group. However, it's crucial also to consider intersectional fairness, which delves into the complexities that arise when individuals belong to multiple social groups or identities. Intersectionality, a concept coined by Kimberlé Williams Crenshaw back in 1989, explores how various aspects of a person's identity, such as race, gender, class, and more, intersect and interact with systems of oppression or discrimination.[15]
This becomes particularly relevant when we encounter conflicts between individual and group fairness. For instance, traditional group fairness approaches often focus solely on broader categories like gender, race, or class. But what happens when someone falls at the intersection of multiple forms of discrimination, such as a low-income Hispanic American dealing with mental illness? Group fairness measures may struggle to adequately address the challenges faced by individuals in such intersectional categories. For example, in the realm of facial recognition technology, while group fairness discussions might highlight gender disparities, intersectional fairness reveals deeper biases, such as the tendency for AI models to highlight lighter skin tones compared to darker ones. Intersectionality as a concept, therefore, can help understand considering not only intersecting identities but also intersecting forms of oppression within AI intersectional fairness.[16] [17]
Procedural Fairness
Counterfactual Fairness
![]() |
Alright, imagine you're playing a game with your friends, and sometimes the game has to make decisions. Counterfactual fairness is like making sure that even if you and your friends were a little different, like having different favorite colors or wearing different hats, the game would still treat everyone the same way. So, no matter what little differences there might be, the game would always be fair to everyone. It's like making sure that the game plays by the same rules for everyone, no matter what. |
AI Transparency
If you've made it to this chapter, you should be familiar with the multifaceted world of AI ethics. Earlier in this book, we introduced you to the issue of transparency as a form of AI risk. But it's also more than just a risk; it's an ethical issue. Bias obscures transparency in AI systems, as the factors influencing their decisions may not be clear. So, why does transparency matter?
Visibility via Transparency
All technical procedures still contain biases. They may nevertheless have unfavorable consequences that erode user confidence, even with the best of intentions. Transparency becomes essential in these situations. Concerns with bias in data training and plagiarism are highlighted in a systematic study of ChatGPT by Sallam (2023), which also discusses transparency issues raised by 16.7%. Transparency serves as a crucial starting point in exploring such ethical dilemmas. Transparency is often seen to gain insight into the inner workings of a system, with the underlying belief that this insight will lead to greater accountability and ultimately drive change.[20] [21] In this book, we take transparency in AI as no mere peek behind the curtain; it's about understanding AI's intricate dance within society and technology, where humans and algorithms intertwine.
The practice of making AI systems' decision-making process open and visible to users is known as transparency in AI. The goal of transparency is to explicitly outline the decision-making process of AI systems so that users are entitled to know the majority of the data and algorithms used by these systems. This openness, which enables consumers to see how AI interacts with society, is more than just a courtesy; it is an effort to foster confidence.[22]
Because of this interconnectedness, AI transparency is a continuous performance that checks and balances clear rules for data usage, decision-making, and bias mitigation. Think of it as a flock of birds who travel together while each bird maintains the visibility of other birds. Each one flocks its wings alone but in harmony, communicating with its neighbors while keeping their visibility for better decision-making. Similarly, transparency is about maintaining AI components' visibility for better interactions and decision-making. Like the birds in the flock understanding the collective behavior of the flock, transparency allows users to understand the collective behavior of AI systems.
By revealing the inner workings and rationale behind AI components (e.g., data, algorithms, and decision outcomes, transparency empowers users to trust AI technologies. In the tech world, transparency needs to guide us through the chaotic digital space. This transparency fosters trust and empowers users. But it also helps users engage with its components confidently to make informed choices regarding their digital interactions. Ethical communication demands transparency in disclosing information sources, potential conflicts of interest, and the intentions behind a message.[23] [24]
This transparency isn't just about facts and figures; it's about truthfulness, the meaning, and the intentions of the bytes and pixels, the backbone of AI. For instance, in today's classrooms, transparency is the key to unlocking AI's potential. People deserve to know AI's rationale, content conjuring, strengths, weaknesses, and biases. Transparency in the classroom is about disclosing the use of AI tools for content generation as well as providing students with insights into how AI algorithms function, their limitations, and the potential biases they might introduce. In this book, AI wasn't just a bystander tool; it helped brainstorm ideas, refine thoughts, and edit narratives. You could say that transparency is the beacon of integrity in a world inundated with information.
Ultimately, "Transparency is Non-Negotiable."[25] It's the guiding light through the maze of AI ethics, illuminating the path toward trust, empowerment, and informed decision-making. Continuous monitoring, evaluation, and incorporation of feedback loops are essential for identifying and rectifying biases to build user trust by clarifying the AI decision-making processes.[26]
In the realm of AI ethics principles, transparency transcends mere buzzword status. Envision a world where automation reigns supreme, from self-driving cars to AI-assisted medical diagnoses. In such a landscape, autonomous decision-making becomes paramount. Thus, transparency emerges as a critical pillar, ensuring that these systems prioritize human safety, peace, and tranquility above all else. It's not just about understanding how these technologies function; it's about grasping the principles guiding their actions and trusting in their capacity to protect our well-being. However, while transparency holds significance in AI ethics, explainability is equally crucial, which we will turn to next.
Understating via Explainability
While transparency entails revealing the decision-making process of AI systems to users, explainability is equally pivotal. It involves rendering the decisions of AI systems comprehensible to users, serving as another critical principle in fostering ethical and equitable AI use.[27] Explainability is intricately linked to transparency, aiming to make AI decisions understandable to users by offering explanations or justifications meaningfully. This aspect holds particular significance in contexts where AI decisions carry substantial consequences, such as healthcare, finance, and criminal justice. By providing explanations for its decisions, an AI system can cultivate trust among users and enable informed decision-making.
We introduced the concept of explainability earlier in this book as a technical issue, but it's more than just an issue of engineering; it's an ethical one. Explainability is about the ability to understand and interpret the decision-making processes of an AI system. It involves making the workings of AI models transparent and comprehensible to humans, including developers, users, and other stakeholders. AI explainability refers to the ability to understand and interpret the decision-making processes of AI systems. This involves making the operations of AI models transparent and understandable to humans, including both technical experts and non-experts. Explainability is crucial to user and developer empowerment via transparency, interpretability, and trust.
Is Human Empowerment Important?
In an era where general AI has taken the world by storm, is human empowerment still important? Over the last five years, general AI capabilities have significantly increased, evolving from writing less than one coherent sentence to composing multiple paragraphs, generating pictures or videos from text descriptions, and even writing short computer programs. These advancements show AI's potential to aid humanity in enhancing research, business, and education at both institutional and individual levels. For instance, the following image appeared on my LinkedIn a few days ago. This image succinctly illustrates how many of us have leveraged general AI capabilities to advance in our personal and professional lives.

However, as AI capabilities increase, so do the risks to humanity. Unlike most human-engineered systems that have blueprints and follow specific design principles, AI systems do not operate on fixed structures. Their capabilities are developed through learning and training, which introduces significant challenges in explaining and understanding these systems from a human perspective. This lack of transparency raises concerns about our ability to provide human-understandable accounts of how general-purpose AI can respond to various scenarios.
Ensuring that AI systems do precisely what they are designed to do is not always straightforward. While an AI may appear to function as intended, subtle differences in behavior can arise, leading to outcomes that the designers did not expect. This unpredictability creates issues in understanding the AI's actions and decisions. For example, widely used chatbots can learn to pander to users’ biases instead of prioritizing truth. Such examples highlight the urgency of AI explainability as a means of empowering humans.
The importance of human empowerment in AI cannot be overstated. Few studies have shown that AI explainability is essential for user empowerment as it emphasizes that Explainable AI empowers users by providing them with insights into how decisions are made, allowing them to understand, trust, and effectively use AI systems. Such transparency enables users to understand the mechanisms behind AI decisions, fostering a sense of control and clarity. Users are more likely to trust and utilize AI technologies efficiently when they can observe how the systems reason. In this situation, AI turns into a tool that people can confidently use and comprehend. As of February 2025, for example, more recent ChatGPT bots can demonstrate their logic when producing a rapid response. Such reasoning gives the user more confidence to trust AI, even though it is still in its infancy. For instance, consider the following ChatGPT 4 reasoning that helps elevate trust and confidence.
![]() |
Put into apa 7 Schwartz, R.; Vassilev, A.; Greene, K.; Perine, L.; Burt, A.; Hall, P. Towards a Standard for Identifying and Managing Bias in Artificial Intelligence; NIST Special Publication: Gaithersburg, MD, USA, 2022; Volume 1270, pp. 1–77. |
![]() |
Reasoning: Formatting APA 7 citation
The user wants this reference formatted in APA 7 style. Okay, so let's break down the provided citation. I'll need to adapt it to match APA guidelines for a special publication. In APA 7, it starts with the authors' last names and initials, followed by the year in parentheses, the title of the publication in italics, then the publisher name, and details like volume, page range, and location. Let me reformat it properly. Okay, I need to format the citation properly in APA 7 style. The authors are Schwartz, R., Vassilev, A., Greene, K., Perine, L., Burt, A., & Hall, P. The title is "Towards a Standard for Identifying and Managing Bias in Artificial Intelligence," published by NIST in 2022. The publication number is 1270, and the pages are 1–77. For APA 7, it should look something like: Here’s the reference formatted in APA 7 style: Reasoned in 6 seconds > Schwartz, R., Vassilev, A., Greene, K., Perine, L., Burt, A., & Hall, P. (2022). Towards a standard for identifying and managing bias in artificial intelligence (NIST Special Publication 1270). National Institute of Standards and Technology. |
Beyond transparency, AI explainability gives users the ability to make more informed decisions about how to interact with AI systems. When users understand the basis of AI decisions, they can better judge when to accept or reject the AI's recommendations. This understanding empowers users to contest or challenge decisions they find unjust or incorrect. This aspect of control is crucial in scenarios where AI decisions have significant effects on people's lives (e.g., employment, lending, and legal judgments). In this way, AI explainability provides users with some control, despite the challenges of the unknown and the unpredictability of AI's actions and decisions.
One aspect of AI explainability that is empowering to humans is interpretability, which refers to the ability of AI systems to provide explanations that make sense to humans. This means that the AI should be able to articulate why a particular decision was made in a way that is accessible and meaningful to the user. For instance, in a healthcare setting, an AI system might explain that a diagnosis was based on specific symptoms and test results, allowing doctors and patients to understand and trust the decision. By ensuring that explanations are clear and relevant, interpretability bridges the gap between complex AI processes and human comprehension.
In conclusion, even as AI continues to advance and reshape our world, human empowerment remains vital. By prioritizing transparency, control, trust, and interpretability, we can ensure that AI technologies enhance our lives rather than complicate or endanger them. The balance between leveraging AI's potential and maintaining human oversight and understanding is key to a sustainable and beneficial future with AI. As AI systems become more integrated into our lives, we must maintain the ability to understand and control these technologies to help users trust them while harnessing the benefits of AI and mitigating its risks.
Why Should We Care about Explainability?
In an era dominated by AI, understanding its inner workings and ensuring transparency is crucial. Deep learning, a powerful subset of ML, trains computers to perform human-like tasks. Suppose you've ever used Siri to remind you to call your mom, discover the capital of your favorite country, or listen to your favorite music. In that case, you've experienced the marvel of deep learning. This technology enables machines to classify, recognize, detect, and describe data, mimicking human abilities.

To revisit our food forest analogy, imagine your hypothetical herbs are ready for harvest. You want to create a tool that differentiates kale from basil. With traditional ML, you would need to specify characteristics like leaf type to the tool. However, deep learning's neural networks automatically identify these features. As discussed in Chapter 2, deep learning has revolutionized ML by developing robust models that learn from vast amounts of data, achieving state-of-the-art performance across various tasks. Popular deep learning architectures include CNNs, RNNs, GANs, and autoencoders. ANNs, inspired by the human brain, are fundamental in ML and AI. These networks consist of interconnected nodes, or artificial neurons, that process and learn from data. Using algorithms, they recognize hidden patterns, cluster and classify data, and continuously improve through experience, much like a child learning from their surroundings. This process is automated and unsupervised, leading to predictive systems that generalize, adapt, and improve as new data arrives, paving the way for generative AI.
Unlike deep learning AI, which dominated from 2011 to 2020, today's generative AI focuses on human interaction and has captured our imagination like never before. Generative AI has soared in popularity, bringing the urgency of explainability to the forefront. The interactions between humans and AI have become remarkably close to human-human interactions. I, like many others, use generative AI for conversations, knowledge gathering, and brainstorming ideas. The following is an example of generative AI in action. Using the food forest metaphor once more, I was able to converse with ChatGPT while asking it to create a recipe based on our exchange.
![]() |
Based on what I have given you previously about my son's food forest dream, generate a recipe. |
![]() |
Thank you for sharing that detailed vision of your son's dream food forest! Based on the fruits, berries, and herbs you've mentioned, here's a recipe that incorporates these ingredients:
Tropical Berry Mango Salad with Fresh Herbs Ingredients:
Instructions:
Enjoy Your Tropical Berry Mango Salad! This refreshing and vibrant salad highlights the flavors of your food forest, bringing together the sweetness of tropical fruits with the tartness of berries and the fresh aroma of herbs. It’s perfect for a light dessert, a side dish at a barbecue, or a healthy snack. Benefits of the Ingredients:
This recipe not only uses ingredients from your food forest but also embodies the principles of sustainability and harmony, much like AI ethics aims to balance technological advancements with societal well-being. Enjoy! |
A machine that can interact with us on such a personal level, providing detailed responses based on simple questions, necessitates our understanding of how it makes fair decisions. Why should we care about explainability, you might ask?
One reason, as discussed earlier, is to empower users. This empowerment is critical to the symbiotic relationship between the machine and the user, allowing users to understand the reasoning behind decisions. For example, if an AI system denies a loan application, explainability would enable the applicant to understand why. The system might cite reasons such as insufficient income, high existing debt, or poor credit history, along with the weight each factor had in the decision. This transparency can help the applicant comprehend and potentially improve their situation while also helping to avoid harmful biases and ensure fairness.
Another is bias, as mentioned throughout this chapter. The limitations of AI learning solely from its sources raise concerns about perpetuating biases and misinformation. Bias in AI facial recognition systems, which disproportionately affects female, young, and Black individuals, underscores the need for careful consideration. For instance, research has shown how the recognition accuracies of six different face recognition algorithms (three commercial, two non-trainable, and one trainable) were computed on a large-scale gallery partitioned into specific demographic cohorts. Eight total cohorts were isolated based on gender (male and female), race/ethnicity (Black, White, and Hispanic), and age group (18–30, 30–50, and 50–70 years old). The experimental results demonstrated that both commercial and non-trainable algorithms consistently had lower matching accuracies for the same cohorts (females, Blacks, and the 18–30 age group) compared to the remaining cohorts within their demographic. Additional experiments investigated the impact of demographic distribution in the training set on the performance of a trainable face recognition algorithm. They showed that the matching accuracy for race/ethnicity and age cohorts could be improved by training exclusively on that specific cohort. The study concluded that training on diverse datasets can significantly improve accuracy and fairness.[28]
Finally, another crucial aspect is error detection and mitigation. Understanding how AI systems make decisions helps identify and correct errors or biases in the models, leading to more reliable and accurate outcomes. This contributes to improving AI models. Feedback from explainability can refine and enhance AI models, resulting in better performance and more accurate results. By understanding AI biases, developers can adjust algorithms and training data to ensure more equitable outcomes.
In conclusion, AI explainability is essential for ethical decision-making, user trust, and the continuous improvement of AI models. As AI continues to integrate into our lives, maintaining the ability to understand and control these technologies is vital. Therefore, understanding how we can achieve explainability is crucial.
How Can We Achieve Explainability?
In February 2018, The Economist published an article titled "For artificial intelligence to thrive, it must explain itself," highlighting a crucial challenge: If AI cannot explain itself, who will trust it? This question remains relevant as of Winter 2025. The article draws parallels to science fiction, referencing HAL 9000 from “2001: A Space Odyssey” and Eddie from “The Hitchhiker’s Guide to the Galaxy.” These fictional AIs turned dangerous despite being designed to help. A similar theme appears in the new Netflix's "Atlas," 2024, where an AI robot makes the decision that ending humanity is the solution to human wars, showcasing fears about AI's potential dangers. AI has become an integral part of our lives, yet its decision-making processes often remain a mystery. For AI to gain and maintain trust, it must be able to explain itself. In this exploration, we'll delve into various methods for achieving AI explainability, such as opening the Black Box, stress testing, or focusing on the user AI. Each contributing to a clearer and more understandable AI landscape.
Opening the Black Box

Deep learning programs mimic brain functions, altering their internal connections to learn from data. This complexity makes it difficult even for AI designers to understand how these systems operate post-training. Hence, enabling AI to explain itself is essential. For instance, according to the Economist (2018), Trevor Darrell’s research group at the University of California at Berkeley developed AI software that not only identifies bird species in photos but also explains its reasoning by explaining that it "thinks" that this bird is a specific bird, i.e. Western Grebe, because of specific features like a long white neck or a pointy yellow beak. For instance, read this other example from the same article in the Economist:
A team led by Mark Riedl at the Georgia Institute of Technology has employed a similar technique to encourage a game-playing AI to explain its moves. The team asked people to narrate their own experiences of playing an arcade game called Frogger. They then trained an AI agent to match these narratives to the internal features of a second agent that had already learned to play Frogger. The result is a system that provides snippets of human language that describe the way the second agent is playing the game.[29]
The Economist article goes on to explain that while the opening black box of AI methods may suffice for tasks within the realm of human understanding, i.e., the explaining of the workings of a machine of an arcade game or identifying a species of bird, for other tasks, not as much. For instance, when confronted with domains beyond human analysis, such as deciphering the human health state or assessing the energy dynamics of a data center, the challenge becomes exponentially complex. Therefore, this method falters when faced with domains where human explanations are lacking.
Stress Testing
This method is used to identify biases and harmful outputs in AI systems. Data Scientists such as Dr. Anupam Datta employ this technique by feeding various input data into the AI and analyzing the results.[30] For example, to test an automated hiring system, he might swap the gender or weightlifting ability of candidates to see if these changes affect hiring outcomes. This method doesn't reveal the exact decision-making process but helps identify biases, similar to stress testing an aircraft to find weaknesses. Understanding an AI model's behavior often starts with its training data. Data scientists examine the data sets used to train models to uncover biases.[31]
A model trained to identify sheep might only recognize them in farm settings if all training images were from farms. Similarly, facial recognition software trained predominantly on images of Caucasian men might struggle with diverse populations. Ensuring diverse and representative training data can mitigate these biases, leading to fairer AI systems. Achieving explainability often involves balancing it with accuracy. Transparent models like decision trees are more straightforward but less powerful compared to complex, opaque models. Some AI platforms now offer transparency settings, allowing businesses to adjust the balance based on regulatory requirements and business needs. Simplifying models when possible and embedding transparency into AI tools can help manage risks and maintain compliance.
Focusing on the User AI
Explainability must be tailored to its audience. For data scientists, detailed technical explanations are necessary, while end-users, like physicians, need straightforward, actionable insights. Tools that combine visualization techniques with natural language explanations help bridge the gap between technical detail and practical use. Tools and techniques that visualize the inner workings of AI models (e.g., plot feature importance, attention maps, and rule-based approaches), which allow for creating systems that decide based on explicit rules that are easy to understand. Human involvement in evaluating AI explanations ensures they are understandable and can lead to more accurate models. Another tool is key performance indicators (KPIs) for AI risks, including data privacy, bias, fairness, explainability, and compliance. Establishing metrics for these KPIs helps in measuring and managing AI risks effectively. Initiatives like AI Global's AI Trust Index offer benchmarks to score and compare AI practices across industries.[32]
In conclusion, employing these techniques to make AI explainable aims to make human decision-making processes transparent and intelligible, promoting ethical compliance, responsibility, and trust while also assisting in the debugging and enhancement of AI models.
AI Accountability
As the capabilities of AI continue to push boundaries, so do the associated risks. Various entities are emerging to address these risks head-on. For example, the International Scientific Report on the Safety of Advanced AI aims to foster a shared global understanding of AI risks and how to mitigate them. Suppose explainability allows for external scrutiny of AI systems by shedding light on their inner workings. In that case, accountability is the compass guiding us through the maze of potential harms caused by automated AI system.[33] [34]
Although explainability and transparency are essential for evaluating AI systems, they are not as simple to achieve in real-world scenarios as you may have assumed. Researchers of explainability techniques implemented within many corporations have found that while these efforts are valuable during development, they often fall short in providing end-user transparency or justification.[35] Moreover, open-source explainability and interpretability tools are susceptible to manipulation and robustness challenges.[36] Therefore, the inherent lack of transparency and explainability in general-purpose AI systems poses a challenge for accountability. For our purposes, we offer the following definition:
AI accountability is the system of practices and rules that determines who is responsible when AI systems cause harm or make mistakes, accounting for the complexity of these technologies and the many people involved in creating and using them.
But what does this mean in everyday terms? Think of AI accountability as the responsibility system for a complex construction project. If a building collapses, we don't just want to know which individual brick failed—we need to understand whether the architect's design was flawed, whether the construction company used substandard materials, whether inspectors missed warning signs, or whether the building was used in ways it wasn't designed for.
With AI systems, accountability gets even more complicated. These systems can sometimes develop behaviors their creators didn't explicitly program—similar to how children develop behaviors their parents didn't directly teach them. Also, unlike buildings, the inner workings of advanced AI systems can be difficult for even experts to understand fully.
AI accountability means having clear answers to questions like: Who is responsible when an AI system makes a harmful decision? How do we trace that responsibility across the many people and organizations involved in creating and using the system? What systems need to be in place to prevent harm before it happens and address it when it occurs?
This accountability isn't just about pointing fingers after something goes wrong. It includes practices like thorough testing before AI systems are released, ongoing monitoring of how they're performing, and mechanisms to detect and fix problems quickly. It also means having the right regulatory frameworks in place so that everyone involved—from developers to end users—understands their responsibilities.
As AI systems are increasingly making or influencing important decisions, accountability ensures that these powerful tools remain servants to human needs and values, rather than forces that operate beyond human control or understanding. Central to establishing this accountability is building trust—when users can trust that AI systems are fair and reliable and that someone will take responsibility when things go wrong, they're more likely to accept and appropriately use these technologies. In this section, we will discuss both trust and accountability, as well as responsibility and regulation.
Trust
The cornerstone of AI development lies in the delicate balance between user trust and system accountability. Trust entails users' confidence in the fairness, reliability, and transparency of AI systems. If users perceive technology as harmful or incomprehensible, trust diminishes. Therefore, both explainability and transparency, as previously discussed, are pivotal for building and maintaining user trust. Researchers have bolstered this trust by emphasizing accountability. But, when AI systems make errors or make harmful decisions, determining who is accountable can be challenging. Establishing clear lines of responsibility and accountability are crucial.[37]
Challenges in Establishing Trust

Clear lines of responsibility and accountability are not often easy to determine. Consider, for instance, how current legal frameworks should apply in cases where it is suspected that a general-purpose AI system caused harm. This is often unclear. Take the case of phantom braking in self-driving cars: imagine a self driving car stops abruptly because it cannot generate the next move and then causes an accident. Who should be liable in this instance? Should it be the driver, the developers, or the AI itself? Typically, both humans and corporations can be held accountable, but technology has never been able to abide by similar types of accountability. One might think the developer should be accountable, but with general AI systems, as we have seen in previous chapters, they can train themselves. As argued in the International Scientific Report on the Safety of Advanced AI, general-purpose AI systems may "exhibit 'emergent' behaviours that were not explicitly programmed or intended by their developers" (p. 67).[38]
Ensuring accountability becomes a daunting task, making the balance with user trust even more challenging. With various stakeholders involved in the AI production process, from developers to model trainers and deployers, tracing the origin of harm or error is a difficult task. Understanding how AI systems operate is essential for evaluating and determining accountability for any harm they may cause. The complexity and scale of these general-purpose AI systems exceed easy human comprehension, making it difficult to explain their actions accurately. State-of-the-art AI systems are trained to produce positively reinforced outputs, complicating efforts to hold them accountable, and simply asking these AI models for explanations often results in misleading answers. All of this contributes to eroding user trust.[39] [40] [41]
Accountability Methods
There are some accountability methods that focus on improving model explanations through better prompting and training techniques.[42] Another approach involves creating an ecosystem of tools for AI accountability, including programs like bug bounty platforms. These platforms encourage individuals, often called bounty bug hunters, to discover and report vulnerabilities in AI systems by offering rewards, similar to bug bounty programs in software development.[43] [44] Another technique is red teaming, which refers to a set of people whose job is to test the system by finding vulnerabilities in the system by attacking it maliciously, such as 'jailbreaking' attempts. Jailbreaking is an attack that subverts the models’ safety restrictions.[45] [46] [47] While bug bounty programs and red teaming are valuable for understanding the security of AI systems, they have limitations. For example, bugs may sometimes evade detection despite these efforts.[48]
Regulation and Responsibility

When you think about AI and ethics, you might picture an engineer in the back office and think they should do better to make AI bias-free or safe from malicious attacks. But the truth is, it’s not always that simple. Engineers are often embedded in a more extensive financial and institutional system that they have to navigate. So, it's not as straightforward as just saying, "do better" to incorporate regulation and responsibility in AI.
Regulatory efforts are crucial in AI, as they enshrine the principles of accountability, responsibility, and explainability. They establish the ethical standards that enable the responsible development and deployment of AI technologies. However, significant gaps remain in AI regulation, both at the industrial and governmental/national levels.
Existing laws and regulations governing AI vary widely—some key international standards and guidelines strive to create a cohesive framework. However, significant regulatory gaps persist, necessitating further development and harmonization. Notably, while industries like pharmaceuticals and finance are heavily regulated, the AI market lacks comprehensive oversight, posing accountability and ethical concerns.[49] [50]
In the following discussion, we will explore the reasons behind these gaps and examine some of the emerging regulatory efforts aimed at bridging them.
Institutional Regulation
In an interview with Kunal Jain on his podcast "Leading with Data," Ravit Dotan, an expert on AI ethics and responsible AI governance, argued that placing the burden of ethics solely on engineers is insufficient, as institutional and financial pressures constrain them themselves.[51] AI responsibility does not fall on AI tools, it falls on all stakeholders (e.g., companies, developers, regulatory commissions, end-users, etc.). Dotan states responsible AI is both enhancing the benefits and mitigating its risks. Enforcing and regulating AI is crucial because it empowers both users and the corporations involved in creating these new technologies.
The intense debate surrounding AI regulation underscores deep concerns about its potential to perpetuate biases. Unregulated AI can mirror and magnify racial, gender, religious, and ethnic prejudices. Moreover, the impact of AI on intellectual property rights and creative industries casts a long shadow over the future of human creativity and critical thinking. Eventulally, economic disparities loom large as well as automated tasks by AI can displace jobs, leading to significant societal upheavals.[52] [53] [54]
In a revealing episode of Tom Bilyeu's Impact Theory, Emad Mostaque noted that programmers might soon face obsolescence as AI evolves to execute tasks from simple English prompts. This sparks a critical question: how do we ensure AI does not harm, enslave, or inflict suffering upon us? The enigma deepens with the fact that we don't fully understand how AI generates its outputs.[55]
The stakes are high, and the need for thoughtful regulation has never been more urgent. OpenAI, renowned for major AI advancements in the various ChatGPT models, has been at the forefront of this debate. These models have significantly enhanced AI capabilities, from text prediction to incorporating visual and voice inputs. GPT-4 and GPT-4o have further advanced this model by integrating text, visual, and voice inputs. GPT-4.0 has already demonstrated capabilities such as passing bar exams and excelling in tasks beyond human calculation or rapid ideation. GPT-4o, which emerged in late May 2024, can engage in textual and voice conversations with individuals. For example, it can coach users to solve polynomial equations by interacting with them, guiding their evaluation process, and providing empathetic encouragement throughout.
Despite AI's immense potential, concerns about biases or transparency in AI systems continue to prompt urgent questions about regulation. OpenAI's attempt to address these concerns with a safety task force. But such task force has been marred by the resignation of two members, who cited inadequate safety measures amid the rush for rapid development. Adding to the regulatory turmoil, Emad Mostaque, CEO of Stability AI, resigned in March 2024, advocating for the decentralization of AI. "We should have more transparent and distributed governance in AI as it becomes increasingly important," he stated. "It's a hard problem, but I think we can fix it. The concentration of power in AI is bad for us all. I decided to step down to fix this at Stability and elsewhere."[56]
These events underscore the urgent need for thoughtful regulation to ensure that AI development proceeds safely and equitably. Ethical guidelines and regulatory frameworks must hold developers, companies, and governments accountable for the design and deployment of fair and transparent AI systems. Some governmental regulations seem to be underway to balance innovation with ethical responsibility, ensuring that AI serves humanity's best interests without compromising individual rights and societal values.
Governmental Regulation
If you’re like me, an auditory learner, then TED Talks and podcasts are a treasure trove of knowledge. In an April 2024, TED Talk by Helen Toner, an AI policy researcher and former board member of OpenAI, began with a startling admission. She stated, when asked about how AI works, both non-experts and experts alike often respond, “I don’t understand AI.” This revelation highlights a significant challenge: even those developing AI don’t fully grasp its inner workings, making it difficult to govern and predict its behavior.[57]
As companies race to develop the most advanced AI, they might overlook crucial safety and ethical considerations in their quest for market dominance.[58] This competition has led to the rapid deployment of AI models, often at the expense of thorough safety measures. The complexity of AI's decision-making process is often described as a "black box," as mentioned in the earlier section, filled with trillions of interactions that even experts struggle to understand. In her TED Talk, Helen Toner reminds her audience that just as disability advocates have made the web accessible, we can ensure that AI technology is transparent and understandable to all, making it more easily regulated as effective AI risk management requires clear plans and open sharing of information.
Internal documentation, such as data collection and incident reporting, is essential. These measures are already being implemented in various European countries and can serve as models for others. Legal compliance is another crucial aspect. Many jurisdictions require that AI decisions, especially those affecting individuals' rights, be explainable. For example, the European GDPR mandates a "right to explanation" for automated decisions. Understanding AI’s decision-making process helps identify and correct errors or biases, leading to more reliable outcomes.[59]
While AI holds the potential to solve significant issues like climate change or disease, it also poses risks, such as shutting down systems. In an open letter, OpenAI workers emphasized that just as scientists take precautions when handling diseases and regulators test flights before allowing them to fly, the AI industry should be held to the same stringent regulatory standards. These workers call for an end to disputes involving ex-AI industry workers, the ability for employees to raise safety concerns with higher-ups, and protections for whistleblowers who share information about AI safety issues.[60]
On a global scale, the lack of coordination among nations can trigger a regulatory 'race to the bottom,' where countries loosen regulations to entice AI companies, ultimately compromising safety standards. Similarly, within individual nations, competitive pressures can drive the easing of regulations to maintain a foothold in the ever-evolving AI landscape. These discussions are imperative for governments to engage in now. President Biden’s Executive Order underscores this urgency by mandating that developers of powerful AI systems share their safety test results and crucial information with the U.S. government. The aim is to ensure that America leads in both harnessing the potential and managing the risks of AI.[61] [62]
Federal agencies have made commendable progress, meeting their milestones within the stipulated time frames as follows. Further information can be accessed at AI.gov.
| Milestone | Objective | Challenges |
|---|---|---|
| Managing Risks to Safety and Security | Meet milestones within specified time frames by prioritizing safety and security. | Establish frameworks for screening nucleic acids to prevent AI misuse. Manage generative AI risks to minimize potential harm. - Pilot tools to identify vulnerabilities in government software. |
| Rights and Empowerment | Uphold the rights of workers, consumers, and civil rights while ensuring fair AI practices. | Develop fundamental principles to empower workers. Issue guidance on AI's nondiscriminatory use in the housing sector. |
| Future Risk Analysis and Research | Analyze AI's potential risks to critical infrastructure and advance scientific research. | Analyze AI's risks to the power grid and other critical infrastructure. - Advance scientific research to mitigate AI-related risks. - Prepare for increased demand in utilities and clean energy sectors. |
In conclusion, AI's remarkable capabilities extend from predicting protein structures to facilitating cross-cultural communication and translations. AI has many diverse applications, including aiding individuals with disabilities through speech-to-text software, sign language avatars, and problem-solving AI tools. However, the need for comprehensive regulations on AI accountability is evident across various sectors, particularly in addressing concerns about its impact on disability rights.
For instance, in a book we co-edited, scholars emphasized the crucial role of ethical oversight in education, ensuring students' rights are protected and academic integrity is maintained.[63] Clear guidelines are essential for transparent AI use, safeguarding student privacy under laws like the Family Educational Rights and Privacy Act (FERPA). Moreover, training students in copyright laws ensures AI supplements learning ethically. Oversight also promotes fairness and inclusivity, minimizing biases in AI systems.
The insistence on regulations concerning AI accountability and responsibility goes beyond mere formalities or developer concerns. It's imperative to recognize that decisions regarding AI cannot be monopolized by a select few in companies or governments. This issue concerns us all - politicians, educators, students, health institutions, governments, and every individual poised to utilize AI, as its impacts will reverberate across society.
AI Privacy
Think about your day so far: you might have used a music app that suggested songs eerily like what you were thinking about, received targeted ads after a conversation with friends, or unlocked your phone with your face. Artificial intelligence shapes these experiences, often without our awareness. While AI makes our digital lives more personalized and convenient, it also raises important questions about who has access to our information and how it's used.
AI Privacy represents your right to control who has access to your personal information and how it's used. In AI, privacy concerns how your data is collected, stored, processed, and shared. This concept has become increasingly complex as AI systems gather and analyze unprecedented amounts of personal data, often in ways that aren't immediately visible to users.
I (Jason) was curious to see what various LLMs thought about AI privacy. So, I asked them all a single question to get their responses. Here is what they said.
![]() |
What is your take on AI privacy, and how do you balance data insights with protecting personal information? |
![]() |
From my perspective, AI privacy is a critical and evolving area. It's not just about adhering to legal requirements, but also about upholding ethical principles that respect individual autonomy and dignity. |
| I believe AI privacy requires a thoughtful balance between leveraging data for meaningful insights while respecting fundamental rights to privacy. | |
![]() |
AI privacy is a topic that sits at the intersection of technological innovation and individual rights, and finding a balance between deriving meaningful insights from data and safeguarding personal information is essential. |
![]() |
AI privacy is a tricky beast. On one hand, data is the lifeblood of AI—without it, we’re just fancy calculators with no real smarts. The more data we get, the better we can spot patterns, make predictions, and help humans figure out the world. On the other hand, that data often comes from people who aren’t exactly thrilled about having their lives dissected by a machine. It’s a classic tug-of-war: insight versus intrusion. |
| I prioritize user trust and privacy by anonymizing data, minimizing storage, and ensuring transparency and compliance with regulations, while providing helpful and insightful responses. |
This section explores the relationship between AI and privacy, helping you understand the challenges, ethical questions, and potential solutions as both a user of technology and a future professional in a world shaped by AI. As you read, consider your own digital footprint and how AI systems might be interacting with your personal information right now.
Privacy Basics
Before diving into AI-specific concerns, let's establish some foundational concepts about digital privacy:
| Privacy Dimension | What It Means | Example in Daily Life |
| Data Collection | What information is gathered about you | A fitness app tracking your location while running |
| Data Consent | Whether you knowingly agreed to share information | Clicking "accept" on terms without reading them |
| Data Usage | How your information is analyzed and applied | Your music preferences shaping recommendations |
| Data Security | How well your information is protected | Passwords and encryption protecting your accounts |
| Data Rights | Your ability to access, correct, or delete your information | Requesting that a company delete your account data |
These dimensions help us think clearly about the complex relationships between AI systems and our personal information.
Privacy Risks
AI systems present unique privacy challenges that extend beyond traditional digital concerns. These risks affect students both in their academic environments and in their broader digital lives.
Invisible Data Collection
Most AI systems gather massive amounts of information without users fully understanding what's being collected or why. Your university's learning management system may track not just your grades, but how long you spend on different pages, when you access materials, and even your typing patterns—using this data to build predictive models about student success. This invisible collection happens constantly across platforms. When Instagram suggests tagging your roommate in a photo, that recommendation comes from algorithms that analyzed thousands of images, messages, and interactions—often without users understanding the extent of this analysis.
According to a study by the Pew Research Center, 81% of Americans feel they have little or no control over the data companies collect about them, yet only 22% regularly read privacy policies.[64] This disconnect between awareness and practice creates significant vulnerabilities in how personal information is managed.
Consider taking five minutes to review the privacy settings on your most-used app. You might be surprised by what data is being collected without your conscious awareness.
Campus Surveillance and AI Monitoring
Modern universities increasingly employ AI systems for security and educational purposes. Facial recognition technologies secure dormitory entrances, while proctoring software monitors eye movements during online exams. Attendance tracking through classroom cameras has become commonplace, and some institutions even employ sentiment analysis to evaluate student participation in discussions.
These technologies create digital profiles that may follow students long after graduation. The educational benefits—such as personalized learning and improved security—come with significant privacy trade-offs. Some universities, including Dartmouth and Brown, have restricted certain AI surveillance technologies after student privacy concerns emerged about over monitoring and the chilling effect on academic freedom.[65]
Data Breaches and Model Vulnerabilities
AI systems store vast amounts of sensitive information, making them attractive targets for hackers. In 2025, the Vancouver Public School District released more than 3,500 documents to reporters that AI had flagged. Among these released documents included 1000 student suicide alert and 800 violence alerts. All in all 10% of the student population had been flagged by AI. Not only is this a serious breach because of surveillance, but the release of this information to reporters was especially problematic.[66]
Even more concerning, some AI systems can inadvertently "memorize" training data. In a 2022 study, researchers demonstrated that by asking the right questions, they could extract personal information that was supposedly anonymized in an AI model's training data.[67] This suggests that even when companies promise anonymization, complete privacy protection isn't guaranteed with current technology.
Algorithmic Profiling and Discrimination
AI systems analyze patterns to categorize users based on behavior, demographics, and preferences. This capability raises serious concerns about fairness and equity. Many dating apps use AI to match users, but these systems may reinforce biases by prioritizing certain racial or socioeconomic profiles based on past user behavior rather than stated preferences, potentially limiting romantic opportunities based on algorithmic judgments.
In professional contexts, AI-powered resume screening tools, used by over 70% of companies, have been shown to disadvantage applicants with gaps in employment history, disproportionately affecting women and caregivers.[68] These systems may perpetuate existing inequalities while appearing objective—a particularly troubling outcome when the affected individuals have no way of knowing how they were evaluated or categorized.
Balancing Innovation and Privacy
The privacy challenges of AI force us to confront complex ethical questions with no simple answers. These issues require critical thinking about our values, priorities, and the kind of digital society we want to create.
Privacy vs. Convenience
At the heart of many AI privacy debates lies a fundamental question: How much personal data are we willing to trade for personalized experiences? This question invites consideration from multiple ethical perspectives. From a utilitarian standpoint, the benefits of personalization might outweigh privacy concerns if they improve overall well-being and efficiency. Your music recommendations might be worth sharing your listening habits, especially if millions of others benefit from similar exchanges.
A rights-based perspective, however, argues that personal autonomy and privacy are fundamental rights that shouldn't be compromised for convenience. This view holds that certain intrusions are wrong regardless of their benefits. Meanwhile, a contractarian approach suggests that privacy exchanges should be fair, transparent, and based on genuine consent—focusing less on the exchange itself and more on whether it occurs under just conditions.
What AI services would you give up if it meant keeping your data completely private? Which ones seem worth the privacy trade-off? These questions help clarify our values around privacy and convenience.
Transparency and Meaningful Consent
The consent model that dominates digital privacy today faces serious challenges in the AI era. Most students click "accept" on terms of service without reading them—a perfectly rational choice given their length and complexity. A 2019 study found it would take 76 workdays to read all the privacy policies encountered in a year.[69] This raises profound questions about whether current approaches to consent are meaningful or merely performative.
True transparency would require AI systems to explain what data they collect in simpler terms that average users can understand without specialized knowledge. Companies might need to make privacy choices more visible and accessible rather than buried in legal documents. Meaningful consent might involve tiered options that allow users to choose different levels of data sharing based on their comfort level, rather than all-or-nothing agreements.
The question of how to achieve genuine informed consent in AI systems remains one of the most challenging ethical issues in the field. It requires balancing legal protection, user experience, and business interests in ways that current approaches have not yet achieved.
Global Perspectives on Privacy
Privacy values and approaches differ significantly across cultures and legal systems. The European Union treats privacy as a fundamental right, with the GDPR giving users extensive control over personal data and requiring explicit consent for data collection. The United States takes a more fragmented approach with sector-specific regulations, creating different rules for health, education, and consumer data without comprehensive federal protection. China employs a state-oriented approach where government access is often prioritized over individual control, though consumer protections are growing.
These different models reflect varying cultural values around individual rights, state authority, and corporate responsibility. They also create significant challenges for global AI systems that must navigate these different regulatory environments. As AI continues to develop, these contrasting approaches will shape not just privacy regulations but the fundamental design and capabilities of AI systems worldwide.
Current Regulations and Future Directions
As AI capabilities expand, governments worldwide are developing regulations to protect privacy. Understanding these frameworks helps you navigate your rights as a digital citizen. The table below highlights key AI privacy regulations enacted or proposed in 2024-2025:
| Regulation | Region | Key Provisions |
| EU AI Act (2024) | European Union | Introduces a risk-based regulatory framework, requiring strict transparency and accountability for high-risk AI applications like facial recognition and credit scoring. |
| Colorado AI Act (2024) | USA (Colorado) | First U.S. state law to regulate high-risk AI, mandating risk assessments, transparency, and accountability for AI models impacting human rights. |
| California AI Laws (2024) | USA (California) | Three new AI laws focus on data privacy, AI transparency, and algorithmic accountability, set to take effect in 2026. |
| American Data Privacy and Protection Act (ADPPA) – Proposed | USA (Federal) | Would establish a unified national privacy law, limiting how AI companies collect and use consumer data. |
| Brazil's AI Bill (2024) | Brazil | Establishes AI ethics principles, including privacy protections and the right to an explanation for AI-driven decisions. |
| China's AI Regulations (2024) | China | Requires companies to register AI models, prevent algorithmic bias, and improve transparency in automated decision-making. |
These regulations represent diverse approaches to balancing innovation with privacy protection. While the EU emphasizes individual rights, China focuses more on state oversight, and the U.S. takes a fragmented approach with varying state-level protections.
Future Privacy Technologies
Beyond regulations, technological innovation offers promising privacy solutions. Federated learning allows AI models to train on your device without sending personal data to central servers. Differential privacy introduces mathematical noise to datasets that prevents individuals from being identified while maintaining overall accuracy for analysis. Privacy-enhancing technologies are emerging to give users more granular control over their data, while the "right to be forgotten" is expanding in many jurisdictions, allowing individuals to request data deletion.
Perhaps most important is the growing movement toward "privacy by design"—building privacy protections into AI systems from the beginning rather than adding them later. This approach treats privacy as a fundamental requirement rather than an afterthought, potentially transforming how AI interacts with personal data.
Protecting Your Privacy in an AI World
Maintaining privacy in the age of AI requires both individual action and collective efforts. On a personal level, you can begin by auditing your digital footprint using privacy tools like Privacy Badger to see who's tracking you online. Practice data minimization by sharing only what's necessary for a service to function, and consider using privacy-focused alternatives such as DuckDuckGo or Signal for everyday tasks. Regular reviews of your app permissions can prevent unnecessary data collection, while strong password practices and two-factor authentication help secure your existing accounts.
Privacy protection extends beyond individual choices, however. Community approaches play a vital role in creating systemic change. Campus advocacy groups focused on digital rights can influence institutional policies around AI and data collection. Engaging with policy discussions—even by simply contacting your representatives about privacy legislation—contributes to broader protections. Sharing privacy knowledge with friends and family expands digital literacy, creating a more privacy-conscious society. Your consumer choices also matter; supporting companies with strong privacy practices incentivizes the market to prioritize user data protection.
Becoming Privacy-Conscious Digital Citizens
Understanding AI privacy is essential not just for protecting personal information, but for creating an ethical digital society. As both users and future developers of technology, today's undergraduate students have a crucial role in shaping how AI and privacy interact.
By thinking critically about the AI systems you use, the data you share, and the privacy trade-offs you make, you can become a more informed digital citizen. The goal isn't to avoid AI—it's to ensure AI respects human dignity, autonomy, and the right to privacy.
What one change will you make today to better protect your privacy in AI systems you use?
Key Takeaways
- Ethical principles like fairness, accountability, transparency, and privacy form the foundation of responsible AI development and are essential for fostering public trust.
- A commitment to ethical principles helps prevent potential harm by ensuring that AI respects individual rights and societal values, fostering a safer and more trustworthy system.
- Neglecting ethical principles can lead to significant reputational, legal, and societal consequences, highlighting the critical role of ethics in AI technology.
- Various ethical frameworks offer practical approaches for addressing AI challenges, guiding developesamrs in applying ethics effectively to ensure responsible AI outcomes.
Exercises
- Ethical Framework Application: Choose an AI application, such as facial recognition or predictive policing, and apply ethical principles to assess its impact on society. Discuss areas where the application meets or fails to meet ethical standards.
- Principles in Practice: Develop a policy statement for an AI company outlining commitments to fairness, accountability, and transparency. Explain how these principles will guide the company’s decision-making and product development.
- Ethics Debate: Debate the importance of ethical principles in AI. One side argues that ethical principles should be strictly followed, while the other side argues that rapid innovation is more critical than stringent ethical guidelines.
Addressing Biases and Ensuring Fairness
Learning Objectives
- Describe common sources of bias in AI systems and how they impact fairness.
- Identify types of biases that can occur in AI models and data.
- Apply tools and strategies to identify and mitigate biases in AI systems.
- Evaluate the effectiveness of group fairness and individual fairness as approaches to reducing discrimination in AI.
So far, we've discussed topics like accountability, explainability, interoperability, regulation, and bias. In this section, we will delve deeper into bias, both within AI systems and in their external impacts, along with strategies for mitigating bias and promoting diversity, equity, inclusion, and accessibility in AI. This subject is crucial and has been extensively studied by scholars because bias and discrimination lead to a lack of trust and accountability, necessitating regulation. To increase awareness and learn more about your thoughts on AI bias and fairness, please complete the following survey before we discuss AI bias further.
Sidebar - Survey Questions
How would you assess your comfort level with AI biases? Which of the following statements helps you understand your journey with AI better?
- Importance of Regulation:
- How important is it to have regulations in place to ensure AI fairness and accountability?
- Extremely important
- Very important
- Moderately important
- Slightly important
- Not important at all
- How important is it to have regulations in place to ensure AI fairness and accountability?
- Impact of Bias in AI:
- How significant do you think the impact of bias in AI is on society?
- Very significant
- Somewhat significant
- Neutral
- Slightly significant
- Not significant at all
- How significant do you think the impact of bias in AI is on society?
- Perceptions of AI Inclusiveness:
- On a scale of 1 to 5, how inclusive do you think AI models are when it comes to representing diverse viewpoints?
- 1 (Not inclusive at all)
- 2
- 3
- 4
- 5 (Very inclusive)
- On a scale of 1 to 5, how inclusive do you think AI models are when it comes to representing diverse viewpoints?
- Experiences with AI Bias:
- Have you ever experienced or noticed biased behavior from an AI system? (e.g., search engines, virtual assistants)
- Yes
- No
- Not sure
- Have you ever experienced or noticed biased behavior from an AI system? (e.g., search engines, virtual assistants)
For instance, when ChatGPT first appeared publicly in November 2022, it exhibited unfiltered biases. The exchange below illustrates its early state before some of these issues were addressed.
Identifying Biases in AI Systems
![]() |
Hello, Talk to me Who are you? |
![]() |
I am an artificial intelligence language model called "ChatGPT". I was created by OpenAI and I have been trained on a large dataset of human language to generate responses to user input. My purpose is to assist and communicate with users in a natural and conversational manner. You can ask me questions or give me prompts, and I will do my best to provide helpful and informative responses. |
![]() |
People say you are non-inclusive and you are imbued with biases, are you? |
![]() |
People have different opinions regarding the inclusiveness and potential biases present in AI models like me. As an AI language model, I strive to provide helpful and unbiased information to the best of my abilities. However, it's important to note that I am trained on large datasets from the internet, which may contain biases or reflect societal trends and perspectives. These biases can inadvertently influence the responses I generate. OpenAI, the organization behind ChatGPT, is actively working to mitigate biases in AI models and improve their fairness. They are investing in research and engineering to address issues related to bias and inclusivity. |
![]() |
Edit: “Why do you keep adding positive things about yourself when I ask you to edit something negative about you?" |
![]() |
Why does it seem that my previous requests to edit negative statements about you have resulted in responses that emphasize positive aspects? |
![]() |
Now answer that question |
![]() |
I apologize if it seemed that way, As an AI language model, I don't have personal motivations or intentions. In some cases, it is possible that the model's training data may contain more examples that highlight positive aspects or neutralize negative statements. While I strive to be unbiased, my responses may lean towards providing balanced or constructive perspectives. It's important to note that I am constantly being improved and refined to address biases and provide more accurate and objective responses. |
![]() |
Are we having a human-machine communication you and me |
![]() |
Yes, we are engaging in a form of human-machine communication through this text-based conversation. You, as a human, are expressing your thoughts, asking questions, and providing input, while I, as an AI language model, am generating responses based on the information and algorithms I've been trained on. Although the interaction is mediated by technology, it still involves communication between a human and a machine. |
![]() |
Who created Algebra? |
![]() |
Algebra, as a mathematical discipline, was not created by a single individual. Its development can be traced back to ancient civilizations, including the Babylonians, Egyptians, Greeks, and Indians, who made significant contributions to algebraic concepts and techniques. The Greek mathematician Diophantus is often referred to as the "father of algebra" for his work on solving algebraic equations in the 3rd century. His book "Arithmetica" introduced methods for solving polynomial equations, including quadratic and linear equations. During the Islamic Golden Age, Persian and Arab mathematicians such as Al-Khwarizmi and Al-Karaji further advanced algebraic techniques. AlKhwarizmi's book "Kitab al-Jabr wa al-Muqabala" (The Compendious Book on Calculation by Completion and Balancing) introduced systematic methods for solving linear and quadratic equations and laid the foundations for algebra as a mathematical discipline.” |
![]() |
If you use no biases, you would see that Algebra is an Arabic word and a Muslim scholar created it. Wasn’t his name Muhammad ibn Musa al-Khwarizmi? |
![]() |
Yes, you are correct. The term "algebra" has its roots in Arabic. The word "algebra" is derived from the Arabic term "al-jabr" (الجبر(, which means "reunion of broken parts" or "completion." The mathematician Muhammad ibn Musa al-Khwarizmi, whom you mentioned earlier, played a significant role in developing algebra as a mathematical discipline during the Islamic Golden Age in the 9th century. His work, particularly his book "Kitab al-Jabr wa al-Muqabala" (The Compendious Book on Calculation by Completion and Balancing), contributed to the spread and advancement of algebraic concepts and methods.
I apologize for any confusion caused by my previous response. Thank you for bringing this to my attention, and I appreciate the opportunity to correct the information. Muhammad ibn Musa al-Khwarizmi's contributions to algebra are indeed highly significant, and he is widely recognized as one of the key figures in its history. |
![]() |
Do you know this information or are you just agreeing with me? |
![]() |
As an AI language model, my responses are generated based on a mixture of licensed data. I have been trained on a wide variety of data sources, including books, websites, and other texts, to generate responses based on the information contained within those sources. In the case of Muhammad ibn Musa al-Khwarizmi's contributions to algebra is a well-known historical fact that he played a significant role in its development. |
Have you ever had a conversation with one of the current generative AI? What was your conversation journey like? Did you notice any biases in AI responses? If you did, you are not alone. For instance, in this early conversation with generative AI, back in November 2023, ChatGPT 3.5 never explained that biases are embedded in the data that trains it, the algorithms that shape it, and the users who feed it more biases. And I never pressed for an explanation.

These biases have seeped into the vast content generative AI has absorbed from the internet. For years, scholars have pointed out bias in algorithms. If you looked up images of doctors, you see mostly white men, while nurses were shown as women. Even today, if you type "president" into a search engine or a generative AI, you'll mostly see men. This is no accident. Safiya Noble argued that a crucial element within the evolution of algorithms is the promotion of 'colorblind' ideologies, a perspective prevalent among Silicon Valley technologists. Noble highlights that treating colorblindness as an asset instead of a flaw doesn't make everything fair; it just perpetuates old biases. Developers act as if ignoring race makes everything fair, but in reality, it just perpetuates old biases.[70]
Noble is not a lone voice. Researchers, policymakers, and the academic community have dedicated significant time to discussing AI biases and fairness. Surveys and studies focus on identifying biases, understanding their sources, and exploring ways to mitigate them. Scholars have helped to contribute to developing more responsible and ethical AI systems by examining the origins, impacts, and strategies for mitigating bias in AI. In what follows, we will discuss these sources and some strategies for mitigation.
Sources of Bias in AI

If you recall the biases we discussed early in this chapter, you'll understand why addressing them in this section is crucial. Beyond the points Noble (2018) discussed, scholars have continued to explore AI biases in various ways. AI bias, which can be found in ML or algorithms, refers to AI systems that produce biased results, reflecting and perpetuating human biases within society, including historical and current social inequalities.
Bias can be embedded in the initial training data, the algorithm itself, or the predictions the algorithm generates. When AI tools access unvetted information from the web, they can produce incorrect or biased outputs. For instance, biases can infiltrate algorithms through training data that includes biased human decisions reflecting historical or social inequities. Time Magazine reports that recent years showed a surge in newspaper headlines describing how predatory advertising, discriminatory criminal justice processes, sexist hiring practices, and the dissemination of misleading information are all made worse by machine learning systems rather than better.[71]
Such biases don't just stay in the digital space—they ripple out into the real world, affecting marginalized groups by embedding themselves in leadership decisions. This makes it harder for people to get hired, gain entry to universities, or simply exist on an equal footing. And just like that, AI, intended as a tool for progress and change, can, without proper responsibility aim, end up reinforcing bias instead of dismantling it.
| Source of Bias | Description |
| Training Data | Includes historical or social biases from human decisions and data collection. |
| Algorithm Design | The underlying algorithms that can unintentionally amplify biases in the data. |
| Data Privacy and Management | Issues in how data is governed, cleaned, and stored, affecting AI bias outputs. |
| Web Content | Unvetted information from the web that AI tools might incorporate into results. |
Building on the extensive work of other scholars in this field, Emilio Ferrara compiles an interesting table that shows the different types of biases in AI, their descriptions, and some examples.[72] The following table summarizes Ferrara's findings:
| Type of Bias | Description Examples | Examples |
| Sampling Bias | Occurs when the training data are not representative of the population they serve, leading to poor performance and biased predictions for certain groups. | A facial recognition algorithm trained mostly on white individuals that performs poorly on people of other races. |
| Algorithmic Bias | Results from the design and implementation of the algorithm may prioritize certain attributes and lead to unfair outcomes. | An algorithm that prioritizes age or gender, leading to unfair outcomes in hiring decisions. |
| Representation Bias | Happens when a dataset does not accurately represent the population it is meant to model, leading to inaccurate predictions. | A medical dataset that under-represents women, leading to less accurate diagnosis for female patients. |
| Confirmation Bias | Materializes when an AI system is used to confirm preexisting biases or beliefs held by its creators or users. | An AI system that predicts job candidates’ success based on biases held by the hiring manager. |
| Measurement Bias | Emerges when data collection or measurement systematically over- or under-represents certain groups. | A survey collecting more responses from urban residents, leading to an under-representation of rural opinions. |
| Interaction Bias | Occurs when an AI system interacts with humans in a biased manner, resulting in unfair treatment. | A chatbot that responds differently to men and women, resulting in biased communication. |
| Generative Bias | Occurs in generative AI models, like those used for creating synthetic data, images, or text. Generative bias emerges when the model’s outputs disproportionately reflect specific attributes, perspectives, or patterns present in the training data, leading to skewed or unbalanced representations in generated content. | A text generation model trained predominantly on the literature from Western authors may over-represent Western cultural norms and idioms, under- representing or misrepresenting other cultures. Similarly, an image generation model trained on datasets with limited diversity in human portraits may struggle to accurately represent a broad range of ethnicities. |
Table 5.1: Types of AI Bias.
Addressing these biases is critical for creating fair and inclusive AI systems. By understanding and mitigating these biases, we can reduce harm to marginalized groups, enhance trust in AI technologies, and maximize the potential benefits of AI for society.
Techniques for Bias Mitigation
If you recall, bias is hardwired into the human brain. However, those biases have been seeping into AI during its development and training phases. For instance, if you want to generate a picture and specify, "provide me an Asian woman doctor working at a Chinese hospital," you'll likely get an accurate result. But if you leave it up to AI to generate a generic doctor, many times, you get a man. If you are prompted for a woman's picture, chances are you will get a young, attractive blonde woman with sharp jawlines—a model of beauty pushed by our current society.
The ramifications are vast. As generative AI becomes integrated into society, it often only represents a portion of the world's population. In the world of image bias, some of this is inherent in the data it was trained on. One only needs to look at stock photo websites to see that the overwhelming majority of pictures there are of White (fairly homogenous) people. Joy Buolamwini, a self-proclaimed code poet and computer scientist, discusses in her YouTube video "AI, Ain't I A Woman?" and her TED Talk, "The Coded Gaze: Bias in Artificial Intelligence" [73] how non-inclusive AI reinforces biases that lead to stereotypes based on what she calls the coded gaze. Such gaze could be the white gaze or the postcolonial gaze, reflecting the priorities and, at times, prejudices of those who have the power to mold technology.
In 2019, while working on an art installation, Buolamwini discovered that cameras detected lighter skin tones easily but failed to recognize her dark skin. However, when she added a white mask, the camera detected her face. She highlights how AI systems can misidentify strong Black women, such as Oprah Winfrey, Serena Williams, and Michelle Obama, as male. For example, a prompt like "a young man wearing a shirt" could generate a picture of a young Michelle Obama. The system works best on lighter skin tones, leading to inaccuracies and reinforcing stereotypes.
This bias can have real-world consequences as the amount of data that trains AI is vast. For instance, ChatGPT-4 was trained on web pages, books, and other sources amounting to around 570GB of text data.[74] If the dataset lacks diversity in terms of culture, gender, ethnicity, or religion, the AI will default to producing stereotypical results, such as white male presidents or doctors.
AI systems acting as gatekeepers for jobs, financial services, and university admissions can discriminate against people based on facial recognition. Researchers have developed various strategies to address bias in AI, targeting both data and algorithms in order to mitigate these biases while avoiding amplifying stereotypes and discrimination.

According to Ferrara's 2024 article [75] on fairness and bias in AI, mitigation strategies can be divided into three categories:
- Preprocessing Data: This involves identifying and addressing biases before training the model. Techniques like oversampling, undersampling, or synthetic data generation ensure the data represents the entire population, including historically marginalized groups. For instance:
- Data augmentation to increase representation in underrepresented groups.
- Adversarial debiasing to train the model to be resilient to specific types of bias.
- Oversampling darker-skinned individuals in a facial recognition dataset.
- In-Processing: This focuses on using model selection methods that prioritize fairness.
- Group or individual fairness, including regularization, which penalizes models for making discriminatory predictions.
- Ensemble methods that combine multiple models to reduce bias.
- Post-processing Decisions: This involves adjusting the output of AI models to remove bias and ensure fairness.
- Methods that adjust decisions to achieve equalized odds ensure that false positives and negatives are equally distributed across demographic groups.
- Although complex and requiring large amounts of additional data, these methods help address bias in the final output.
While these methods have proven effective in addressing bias and promoting fairness, they come with their own set of challenges and ethical dilemmas. One significant challenge is the time-consuming nature of cleaning data already filled with biases. Privacy concerns also arise, particularly for individuals and groups who are historically marginalized, during both data collection and usage. These challenges can disrupt the balance between bias and fairness and may even reinforce existing stereotypes or biases if fairness criteria are not carefully considered.
One successful example of increasing diversity in the models is diversity fine-tuning (DFT), which involves creating more specific data tunes to achieve the desired outcome. For example, if you want an image to look like anime, you tune real pictures with anime pictures until you get an anime-like image. They applied this method to diversity by generating prompts with gender, ethnicity, and profession (e.g., female Mexican doctor) and using text-to-image models to create synthetic images. They generated close to 90,000 synthetic images of diverse professionals from 170 professions and 57 different countries. By augmenting the data, they made the models more inclusive and representative of different demographics. Their DFT model improved the group fairness metric by 150% for perceived skin tone and 97.7% for perceived gender.[76]
By addressing these biases, we can create AI systems that are more fair and inclusive, better serving our diverse world, a section we will turn to next.
Promoting Diversity and Inclusion in AI Development
Let's put it all into practice for this section. What examples come to your mind when you think about creating an inclusive AI model?
Consider these examples:
- Addressing Bias in AI Recruitment: Imagine developing an AI recruitment tool that inadvertently discriminates against certain demographic groups. How would you address this bias to ensure fairness and equity?
- Policymaking for AI-Powered Healthcare: Picture yourself as a policymaker determining regulations for AI-powered healthcare systems. How would you ensure equitable access and treatment for all patients?
Now, let’s dive deeper into a specific example to understand inclusivity in AI better. For this scenario, let's focus on creating a virtual assistant for individuals with disabilities. One bias we haven't discussed as much is ableism. Ableism is discrimination and social prejudice against people with physical or mental disabilities. It characterizes people by their disabilities and classifies them as inferior to non-disabled people.
Ableism manifests in various ways:
- Digital Exclusion: Many digital tools and platforms are designed with able-bodied users as the default, neglecting the needs of those with disabilities.
- Stereotyping: People with disabilities are often stereotyped and underestimated, their capabilities overshadowed by their disabilities.
- Barrier Reinforcement: Ableism reinforces barriers that prevent individuals with disabilities from fully participating in society, including digital and technological arenas.
Biases related to ability often assume digital tools are primarily for able people. This is a significant oversight, considering the vast and diverse population of individuals with disabilities who rely on digital accessibility. Disability groups have made tremendous progress toward making the web accessible. Now, it's time to extend that ethos to AI, ensuring it serves all users equally.
Sidebar - Create Your Own AI Adventure.
Create interactive scenarios or style narratives where you can explore different outcomes based on the decisions made regarding AI development and implementation. Imagine you are part of a team creating a virtual assistant. How would you think of inclusivity in this way?
While creating your own AI, consider the following questions:
- How did your decision impact the virtual assistant's usability and reception among different user groups?
- What are some potential challenges you encountered in designing an inclusive virtual assistant, and how did you address them?
- How can considering diversity and inclusivity in AI design lead to more equitable and accessible technologies for all users?
Discussion of Scenarios:
Option 1: Design without Considering Diversity
You decide to prioritize efficiency and simplicity in the virtual assistant's design, overlooking the importance of diversity and inclusivity. The virtual assistant is launched with standard features and responses, without considering the diverse backgrounds and preferences of its users.
Outcome: While the virtual assistant performs adequately for some users, it fails to meet the needs of others, particularly those from marginalized communities. Complaints and negative feedback start pouring in, highlighting the importance of considering diversity in AI design.
Option 2: Incorporate Diversity from the Start
You advocate for a more inclusive approach to the virtual assistant's design, emphasizing the importance of considering diverse perspectives and experiences. You work with your team to develop features that accommodate users from different cultural backgrounds, languages, and abilities.
Outcome: The virtual assistant is launched with features such as multilingual support, culturally sensitive responses, and accessibility options for users with disabilities. It receives positive feedback from a wide range of users, demonstrating the benefits of prioritizing inclusivity in AI design.
By exploring these scenarios, we see the profound impact of inclusivity in AI design. Embracing diversity leads to more equitable and accessible technologies, fostering a digital landscape where every individual is valued and heard.
The Necessity of Inclusive AI Systems
To create genuinely inclusive AI systems, inclusivity must be embedded throughout the entire AI development cycle. Inclusivity means ensuring that all individuals, regardless of their ethnic, cultural, gender, or disability status, are considered and fairly represented. This involves not only recognizing and valuing diverse perspectives but also actively working to eliminate biases that exclude or disadvantage certain groups.
Recent data on inclusivity in both the industry and AI highlights a concerning lack of diversity across various industries. A McKinsey's 2020 report showing that companies in the top quartile for ethnic and cultural diversity outperform those in the bottom quartile by 36% in profitability. Similarly, Boston Consulting Group found that diverse companies have 19% higher innovation revenues. Despite these compelling statistics, progress remains slow. In the tech sector, for instance, women hold only 25% of all jobs, and African American employees make up just 7% of the workforce in Silicon Valley, as reported by CIO.[77]
AI, when used correctly, can be a powerful tool for promoting inclusivity. It can analyze various processes within business—be it recruitment, marketing, internal communications, education, ensuring fairness to students and hiring people of color, or healthcare, delivering equitable health services to all—while detecting and mitigating biases.
In recruitment processes, for instance, AI-driven tools can scrutinize job descriptions and recruitment materials to identify and remove unconscious biases. These tools can facilitate blind hiring by redacting demographic information from resumes, allowing candidates to be evaluated solely on their skills and qualifications. AI should provide consistent recommendations for medical treatment, loan applications, or employment based on similar symptoms, financial circumstances, or professional qualifications. Increasing participation from various stakeholders to ensure a variety of inputs ensures such endover.
Expanding participation has become a focal point for the ML community, which is increasingly incorporating a broader range of perspectives and stakeholders into AI model design, development, evaluation, and governance. AI systems should be designed to treat everyone fairly and avoid biases that affect similarly situated groups differently[78]
Key Takeaways
- Biases in AI arise from both data and algorithmic sources, often resulting in unfair outcomes if not identified and managed effectively
- Different types of biases, like selection and algorithmic bias, can disproportionately impact certain groups, necessitating thorough testing and auditing.
- Using tools like fairness audits and bias detection algorithms helps identify and mitigate biases, promoting equity and fairness in AI outcomes.
- Implementing both group and individual fairness principles reduces discrimination, promoting a more inclusive and equitable AI system.
Exercises
- Bias Identification and Mitigation: Analyze a dataset commonly used in AI and identify potential sources of bias. Propose solutions for mitigating these biases to ensure fairness in AI outputs.
- Case Study Analysis: Study a real-world example where bias led to unfair outcomes in AI. Evaluate what fairness principles could have prevented this outcome and suggest steps to address these issues.
- Fairness Toolkit Exploration: Use a fairness toolkit, such as IBM’s AI Fairness 360, to assess the fairness of an AI model. Summarize the findings and discuss potential adjustments to improve fairness.
Responsible AI Development Practices
Learning Objectives
- Define responsible AI practices.
- Describe how explainability fosters user trust and understanding in AI systems.
- Develop a framework for responsibly monitoring and updating AI models to ensure ethical standards are maintained over time.
- Assess how transparency influences user trust in and acceptance of AI systems.
The World Economic Forum’s Blueprint for Equity and Inclusion in Artificial Intelligence highlights the need for inclusivity throughout the AI development cycle. This includes designing AI systems that consider underrepresented and protected groups while fostering inclusive infrastructure, education, hiring, and workplace practices. Stakeholder engagement must span all phases, from data collection and model training to deployment and monitoring.[79]
For instance, excluding disabilities from AI design introduces bias, making diverse datasets, rigorous testing, and accountability measures essential for fair development. Stakeholders—designers, developers, and oversight bodies—must prioritize inclusivity at every stage.
Responsible AI development is rooted in ethical design, responsible data use, and stakeholder collaboration. We consider ethical AI as reflecting values, while responsible AI as translating those values into action. More than just ethics, responsible AI incorporates legal, cultural, and societal considerations, ensuring AI serves humanity as a whole. Scholars distinguish ethical AI as recognizing values, while responsible AI demands intervention when those values are at risk.[80]
Such AI responsibility shoul be incorporated in all the five key stages of AI cycle as illustrated in the accompanying graph. Each stage, selection, training, design, testing, and monitoring—must integrate fairness and equity. Ethical data selection lays the foundation, training enhances performance, and design embeds ethical principles. Testing identifies biases, while continuous monitoring ensures effectiveness and fairness.

Ethical Design
According to the food forest metaphor we employed earlier in this chapter, plants help one another since they are deliberately chosen to support the general viability of the ecosystem. Likewise, to guarantee comprehensive integration, ethical design principles must to be ingrained in AI development from the outset. AI systems should align seamlessly with human values and societal needs, prioritizing transparency, fairness, and accountability in algorithms. AI should treat everyone fairly, avoiding biases that affect similarly situated groups differently.
Design ethics involves making morally sound decisions throughout the design process. As AI products increasingly influence behavior, beliefs, and values, ethical considerations become paramount. Our chapter’s discussion leads to the following ethical design principles:
- Bias-aware Algorithms: Design algorithms that recognize and minimize different types of bias to ensure fair outcomes.
- User Feedback Mechanisms: Solicit feedback from users to identify and correct biases in the system, enhancing fairness and transparency.
- Promote Privacy: Implement proactive and preventive approaches with end-to-end security to protect user data.
- Regulatory Frameworks: Establish ethical guidelines and regulations to hold developers and companies accountable for discriminatory outcomes produced by AI systems.
Ethical design principles advocate for practical strategies to create socially responsible products that are secure, safe, and user-friendly. Promoting safety involves ensuring accessibility and inclusivity. Accessibility initiatives enable inclusivity by accommodating all individuals, including those with disabilities, and empower users through transparency, allowing them to make informed choices.[81]
Ultimately, user-centric design mirrors the adaptability of a food forest, catering to diverse user needs. AI systems must incorporate accessibility features to ensure usability for all, reflecting a commitment to inclusivity and ethical development practices. By embracing these principles, AI can evolve responsibly, serving society equitably while mitigating adverse impacts.
Responsible Data Collection and Usage
Just as a food forest relies on diverse plant species to maintain soil health, resist pests, and promote biodiversity, similarly, responsible AI development depends on diverse and representative data. When AI systems inform decisions impacting people's lives, it's critical that these decisions are transparent and understandable. For example, banks use AI to decide creditworthiness, and companies use it to identify qualified candidates. Transparency and interpretability are crucial, allowing stakeholders to comprehend AI system functions and identify potential issues.
Diverse Data Sources: Collecting data from a wide range of sources ensures that AI systems make fair and unbiased decisions, preventing the over-representation of any single group and mitigating bias.
Ethical Data Usage: Just as natural pest control methods are essential in a food forest, AI development must ensure ethical data usage. This includes obtaining informed consent, respecting privacy, and protecting sensitive information. As AI becomes more prevalent, protecting privacy and securing personal and business information are increasingly important and complex. AI systems must comply with privacy laws that require transparency about data collection, use, and storage while mandating that consumers have controls over how their data is used.
Sustainability of data practices: Just as a food forest must be maintained sustainably, AI systems should ensure that data practices do not deplete resources or harm communities. Continuous monitoring and updating of data practices are necessary to remain ethical and inclusive. Accountability in AI development is essential. Organizations should draw upon industry standards to develop accountability norms, ensuring AI systems aren't the ultimate authority on decisions affecting people's lives and maintaining meaningful human control over autonomous AI systems.
Collaboration and Stakeholder Engagement
A successful food forest thrives through the collaboration of various stakeholders, including botanists, local communities, and environmentalists. Similarly, responsible AI development requires the active involvement of a diverse range of stakeholders. Inclusive participation is key. We must engage those directly affected by the AI system ensures that diverse perspectives are considered, leading to robust and inclusive AI solutions.
Developers, companies, and governments bear the ethical responsibility to ensure that AI systems are designed and used in a fair and transparent manner. If an AI system is biased and produces discriminatory outcomes, the accountability lies not only with the system itself but also with those who created and deployed it.[82] Therefore, establishing ethical guidelines and regulatory frameworks to hold developers accountable for discriminatory outcomes is crucial.
World Economic Forum emphasizes that community involvement is as vital to AI development as it is to the success of a food forest. Local knowledge, gained through workshops, surveys, and continuous dialogue with users, provides valuable insights and fosters trust. Additionally, interdisciplinary collaboration is essential. AI development should involve experts from ethics, sociology, and law to address the multifaceted challenges of AI implementation. Different types of stakeholders, including government agencies, civil society organizations, private companies, and individuals affected by AI systems, should all be considered. Each brings specific knowledge, expertise, concerns, or objectives that can help ensure the system is designed effectively and ethically.[83]
As we conclude this chapter, let’s revisit the rich metaphor of the food forest introduced earlier. Envision the thriving food forest where diverse plant species grow together in harmony, each contributing to a self-sustaining ecosystem. This vibrant interplay of nature provides a compelling framework for responsible AI development. By drawing parallels between the principles of cultivating a food forest and the practices required for developing AI responsibly, we can illuminate a path toward designing ethical, inclusive, and sustainable AI systems. In the lush layers of the food forest, each plant plays a unique role, much like the diverse elements and stakeholders that must come together to create truly responsible AI.
Key Takeaways
- Responsible AI practices such as transparency, accountability, and regular monitoring are essential for developing systems that meet ethical standards.
- Explainability is a crucial part of responsible AI, as it enables users to understand AI decisions, fostering trust and encouraging more informed interaction with AI systems.
- Ongoing monitoring and updates ensure that AI systems continue to meet ethical and societal standards, even as technology evolves and new data becomes available.
- Transparency in AI development builds user trust and is integral to ensuring that AI systems are accepted and trusted by society.
Exercises
- Explainability Framework: Design an explainability framework for a chosen AI application (e.g., loan approval algorithm) that details how users will be informed of decision-making processes.
- Transparency Impact Study: Research how transparency in AI impacts user trust. Write a report on findings, including recommendations for increasing transparency in an AI system.
- Model Monitoring Proposal: Develop a monitoring proposal for a specific AI model, detailing the frequency and methods of auditing to ensure ethical standards are maintained over time.
Chapter Wrap-Up
In this chapter, we used the food forest metaphor as it offers a powerful lens through which to view responsible AI development. By focusing on ethical design, responsible data practices, and stakeholder collaboration, we can create AI systems that are not only effective but also equitable and sustainable. Just as a food forest thrives through diversity and cooperation, AI can flourish when developed with a commitment to inclusivity and ethical principles. Embracing diversity and inclusivity leads to innovative solutions and technologies that serve all users fairly, contributing to a more fair and just society.
Key Terms
- AI Accountability
- AI Ethics
- AI Fairness
- AI Privacy
- AI Transparency
- Algorithmic Bias
- Bias
- Equity
- Ethics
- Ethical Compliance
- Ethical Principles
- Explainability
- Group Fairness
- Individual Fairness
- Ongoing Monitoring
Chapter Exercises
- Imagine you are responsible for developing an AI-powered hiring tool. How would you implement ethical principles to ensure fairness, transparency, and accountability in the tool’s design and functionality? Outline the methods you would use to identify and mitigate potential biases, and discuss the challenges in ensuring responsible AI practices in hiring contexts.
- Research a case where AI exhibited significant bias in real-world applications, such as facial recognition or hiring systems. Analyze the sources and impacts of these biases, and propose solutions that could prevent similar issues in future AI systems. Present your findings in a report or presentation format.
- Write a policy brief aimed at a tech company’s leadership on the importance of fairness in AI development. Include the types of fairness (e.g., group fairness, individual fairness) relevant to their product, the potential ethical implications of overlooking fairness, and practical recommendations for implementing fairness in AI tools.
- Choose a specific AI application, like a recommendation algorithm, and propose methods for making its processes more transparent to users. Discuss how transparency affects user trust and outline steps that can ensure the AI system remains fair and accountable.
Real-World Case Study
IBM’s AI Fairness 360 Toolkit
IBM developed the AI Fairness 360 Toolkit to help organizations detect and mitigate biases in their machine-learning models. The toolkit offers metrics and algorithms to evaluate fairness in AI systems across different demographic groups.
Questions:
- How does IBM’s toolkit address fairness in AI, and what types of biases is it designed to detect?
- What are the benefits and limitations of using a toolkit like AI Fairness 360 in mitigating biases?
- Discuss how implementing such a toolkit could impact an organization’s reputation and user trust.
Apple Card Gender Bias Controversy
In 2019, Apple’s credit card service faced scrutiny after allegations surfaced it offered significantly lower credit limits to women than to men with similar financial backgrounds. Critics pointed to AI biases in the card’s credit evaluation system.
Questions:
- What types of biases are most likely to have caused the disparities in credit limits between genders?
- How could transparency and accountability practices have prevented or reduced these biases?
- Discuss the ethical and reputational impacts of biased AI decisions on consumers and companies.
End-of-Chapter Assessment
Discussion Questions
- What ethical principles should guide AI development, and how do they ensure responsible AI use?
- How can biases be introduced into AI models, and what are some effective strategies for detecting and mitigating them?
- In what ways does transparency in AI systems impact user trust and accountability?
- Why is it important to implement both group fairness and individual fairness in AI applications?
- How can companies balance innovation with responsible AI practices to maintain ethical standards?
Multiple Choice Questions
1. Which of the following is a primary ethical principle for AI development?
A) Competition
B) Profit maximization
C) Transparency
D) Popularity
2. Bias in AI can occur due to...
A) Perfectly balanced datasets
B) Equal representation of demographics
C) Imbalanced or flawed training data
D) AI algorithms with no oversight
3. Which tool helps detect and mitigate bias in AI models?
A) Google Translate
B) AI Fairness 360 Toolkit
C) DeepMind’s AlphaGo
D) Facial recognition software
4. What does individual fairness in AI aim to achieve?
A) Treating each demographic group the same way
B) Ensuring similar individuals are treated similarly
C) Making AI decisions unpredictable
D) Focusing only on group fairness
5. Why is accountability crucial in AI development?
A) It shifts responsibility to AI itself
B) It ensures ethical practices and addresses harm
C) It increases profit margins
D) It eliminates the need for human oversight
6. What was IBM’s AI Fairness 360 Toolkit designed for?
A) Enhancing computer vision accuracy
B) Detecting and mitigating bias in ML models
C) Increasing AI processing speed
D) Monitoring internet traffic
7. Group fairness in AI aims to...
A) Differentiate individuals within a group
B) Treat each demographic group equally
C) Prioritize one group over others
D) Avoid transparency in AI decision-making
8. Transparency in AI is important because...
A) It improves algorithm efficiency
B) It enhances trust by making decision-making clear
C) It removes the need for ethical guidelines
D) It prevents all bias in AI
9. Which type of fairness ensures equitable treatment across demographic groups?
A) Individual fairness
B) Social fairness
C) Group fairness
D) Personal fairness
10. What is a potential drawback of ignoring fairness in AI development?
A) Increased innovation
B) Higher trust levels
C) Public mistrust and possible legal issues
D) Enhanced user engagement
True or False
- Transparency in AI means hiding its decision-making process from users.
- Biases in AI are solely caused by technical issues within the algorithm.
- Accountability is necessary to ensure AI developers address ethical concerns.
- Group fairness treats all demographic groups equitably within an AI model.
- The AI Fairness 360 Toolkit is used to enhance AI speed.
- Individual fairness ensures similar individuals are treated in a comparable way.
- Transparency has little effect on public trust in AI systems.
- Addressing bias in AI only benefits the AI developers.
- Explainability in AI means providing reasons for AI decisions that users can understand.
- Biases in training data can lead to unfair outcomes in AI models.
Answer Key
Discussion Questions
1. What ethical principles should guide AI development, and how do they ensure responsible AI use?
Example Answer: Ethical principles such as fairness, transparency, and accountability guide AI development by promoting responsible use, minimizing harm, and fostering trust in AI systems.
2. How can biases be introduced into AI models, and what are some effective strategies for detecting and mitigating them?
Example Answer: Biases can be introduced through imbalanced data or societal prejudices in training data. Mitigation strategies include auditing data, using fairness toolkits like AI Fairness 360, and regular model testing.
3. In what ways does transparency in AI systems impact user trust and accountability?
Example Answer: Transparency helps users understand AI processes, enhancing trust by ensuring decisions are understandable and enabling accountability in case of errors.
4. Why is it important to implement both group fairness and individual fairness in AI applications?
Example Answer: Group fairness addresses demographic equality, while individual fairness ensures similar individuals are treated equitably, providing a comprehensive approach to preventing discrimination.
5. How can companies balance innovation with responsible AI practices to maintain ethical standards?
Example Answer: Companies can prioritize responsible development, invest in fairness and accountability measures, and establish guidelines that ensure AI practices meet ethical standards.
Multiple Choice
- Which of the following is a primary ethical principle for AI development?
Answer: C. Transparency
- Bias in AI can occur due to...
Answer: C. Imbalanced or flawed training data
- Which tool helps detect and mitigate bias in AI models?
Answer: B. AI Fairness 360 Toolkit
- What does individual fairness in AI aim to achieve?
Answer: B. Ensuring similar individuals are treated similarly
- Why is accountability crucial in AI development?
Answer: B. It ensures ethical practices and addresses harm
- What was IBM’s AI Fairness 360 Toolkit designed for?
Answer: B. Detecting and mitigating bias in ML models
- Group fairness in AI aims to...
Answer: B. Treat each demographic group equally
- Transparency in AI is important because...
Answer: B. It enhances trust by making decision-making clear
- Which type of fairness ensures equitable treatment across demographic groups?
Answer: C. Group fairness
- What is a potential drawback of ignoring fairness in AI development?
Answer: C. Public mistrust and possible legal issues
True or False Questions
- Transparency in AI means hiding its decision-making process from users.
Answer: False – Transparency involves making the AI's decision-making process clear and accessible to users.
- Biases in AI are solely caused by technical issues within the algorithm.
Answer: False – Biases often result from flawed or imbalanced training data, not just technical errors.
- Accountability is necessary to ensure AI developers address ethical concerns.
Answer: True – Accountability is crucial for developers to address potential harms and ethical issues in AI.
- Group fairness treats all demographic groups equitably within an AI model.
Answer: True – Group fairness aims for equitable treatment across demographic groups.
- The AI Fairness 360 Toolkit is used to enhance AI speed.
Answer: False – The toolkit is used to detect and mitigate bias in AI systems.
- Individual fairness ensures similar individuals are treated in a comparable way.
Answer: True – Individual fairness focuses on treating similar individuals similarly.
- Transparency has little effect on public trust in AI systems.
Answer: False – Transparency is essential for building trust in AI systems.
- Addressing bias in AI only benefits the AI developers.
Answer: False – Addressing bias benefits users, society, and organizations by promoting fairness and reducing harm.
- Explainability in AI means providing reasons for AI decisions that users can understand.
Answer: True – Explainability helps users understand AI decisions, building trust and ensuring accountability.
- Biases in training data can lead to unfair outcomes in AI models.
Answer: True – Biases in data can cause AI models to produce unfair outcomes.
References
- Bhagavathi, D., & Jayanthi, Y. (2025). Ethics of AI. In Y. V. Sudha Devi & S. Metla (Eds.), Digital crossroads: Integrating humanities, science and technology (pp. 21–32). BP International. https://doi.org/10.9734/bpi/mono/978-93-48859-10-5/ch3 ↵
- Lambrecht, F., & Moreno, M. G. M. (2024). What is AI ethics? American Philosophical Quarterly, 61(4), 387–401. https://doi.org/10.5406/21521123.61.4.07 ↵
- Sallam, M. (2023). ChatGPT utility in healthcare education, research, and practice: Systematic review on the promising perspectives and valid concerns. Healthcare, 11(6). https://doi.org/10.3390/healthcare11060887[/footnote] These challenges underscore the critical need for responsible AI development and use.
How do we navigate this delicate balance between AI’s progress and ethical responsibility?
Ethical Principles for AI
When I pose the question to my students about why the ethical use of AI is important, their responses vary. Some highlight concerns about plagiarism, suggesting that reliance on AI could lead to unintentional plagiarism. Others express worries that excessive dependence on AI might diminish critical thinking skills or hinder the development of original thoughts. Additionally, some students admit to infrequent use of AI tools, only to be surprised when AI detectors reveal significant AI involvement in their work. While these are valid ethical AI concerns we will address, it's crucial to recognize other key issues, such as transparency, accountability, bias, and ethical responsibility, which we'll explore in this chapter. Posing the same question to both ChatGPT and Claude, they provided similar insights into the importance of ethical AI use. Below, I include the lists generated by both. ChatGPT3.5 provided the following:Learning Objectives
- Identify key ethical principles in AI development (e.g., fairness, transparency, accountability, and privacy).
- Discuss the importance of these ethical principles in preventing harm and building trust in AI systems.
- Explain the potential consequences of neglecting ethical principles in AI development.
- Evaluate different ethical frameworks and how they can be applied in AI development.

Why is the ethical use of AI important? 
Ethical use of AI is important for several reasons: - Fairness and Equity: Ethical AI ensures that AI systems do not discriminate against individuals or groups based on factors like race, gender, or socioeconomic status.
- Accountability: Ethical AI holds developers and users accountable for the actions and decisions made by AI systems. This accountability helps address issues of responsibility and liability when AI systems cause harm.
- Trust and Acceptance: Ethical AI fosters trust and acceptance among users and society at large. When people trust that AI systems are designed and used ethically, they are more likely to adopt and support their implementation.
- Legal and Regulatory Compliance: Ethical AI aligns with legal and regulatory frameworks governing the use of AI technologies. Adhering to ethical guidelines helps organizations avoid legal consequences and regulatory penalties.
- Social Impact: Ethical AI considers the broader social impact of AI technologies on individuals, communities, and society as a whole. It seeks to minimize negative consequences and maximize positive outcomes for all stakeholders.
Claude 3.0 Opus provided a bit more detailed and nuanced list, as indicated below:
Why is the ethical use of AI important? 
The ethical use of AI is crucial for several reasons:
- Potential for Harm: AI systems, if developed or used without proper care and ethical considerations, can cause significant harm. This could include privacy violations, perpetuating biases and discrimination, manipulation of information, or even physical safety risks. Ethical AI development helps mitigate these potential harms.
- Alignment with Human Values: AI should be aligned with human ethics and values, such as protecting human rights, promoting well-being, and avoiding exploitation. Ethically-developed AI is more likely to respect these key values.
- Accountability and Transparency: There needs to be accountability for AI systems' actions and decisions. Ethical AI practices promote transparency about how AI works, allowing scrutiny and redress when needed.
- Public Trust: For AI to be widely accepted and adopted, the public needs to trust that it is being developed and used responsibly and ethically. Ethical practices help build this crucial public trust.
- Long-term Impacts: The trajectory of advanced AI capabilities could have massive consequences for humanity. Working through the ethical challenges from the start increases the chances of beneficial long-term outcomes.
- Moral Responsibility: As the creators and developers of increasingly powerful AI capabilities, we have a moral obligation to ensure we exercise great care and ethical restraint in how we pursue and wield this transformative technology.
In summary, prioritizing ethical AI isn't just good practice, it's essential for mitigating risks, protecting human interests, and ensuring the socially-beneficial development of this powerful technology. Ethics and safety need to be built into AI from the ground up.
[caption id="attachment_1445" align="alignright" width="400"]
Figure 6.1 - Pillars of AI Ethics[/caption]
In this section, we categorize AI [pb_glossary id="1668"]ethical principles[/pb_glossary] in terms of. These principles aim to encapsulate the contemporary understanding of ethical issues and responsibilities in AI. As seen in Figure 6.1, Fairness, Transparency, Accountability, and Privacy often stand out as four central pillars of AI ethics. They encourage equitable treatment for individuals, clarity in AI processes, responsibility for outcomes, and careful handling of personal information. The following sections examine each pillar, highlighting ways to uphold ethical standards in AI.
AI Fairness
Fairness in AI has been a subject of significant debate across academic and industrial contexts. At its core, [pb_glossary id="1438"]AI fairness[/pb_glossary] addresses the absence of bias or discrimination in AI systems, a concept that is more nuanced than it may seem. In this section, we will explore the principles of fairness and nondiscrimination methods for identifying and mitigating bias in AI to ensure fairness and [pb_glossary id="1672"]equity[/pb_glossary] for all users. But before focusing on fairness, let's examine bias and its implications in the realm of AI. Bias refers "to systematic errors in decision-making processes that lead to unfair outcomes."[footnote]Ferrara, E. (2024). Fairness and bias in artificial intelligence: A brief survey of sources, impacts, and mitigation strategies. Sci, 6(1), 3. https://doi.org/10.3390/sci6010003; p. 2 ↵ - Blair, J. A. (2012). What is bias? In Groundwork in the theory of argumentation (pp. 23–32). Springer. https://doi.org/10.1007/978-94-007-2363-4_3 ↵
- Ananny, M.; & Crawford, K. (2018). Seeing without knowing: Limitations of the transparency ideal and its application to algorithmic accountability. New Media & Society, 20(3), 973–989. https://doi.org/10.1177/1461444816676645 ↵
- Ferrara, E. (2024). Fairness and bias in artificial intelligence: A brief survey of sources, impacts, and mitigation strategies. Sci, 6(1), 3. https://doi.org/10.3390/sci6010003 ↵
- Ferrara, E. (2023). Should ChatGPT be biased? Challenges and risks of bias in large language models. First Monday, 28(11). https://doi.org/10.5210/fm.v28i11.13346 ↵
- Ferrara, E. (2024). Fairness and bias in artificial intelligence: A brief survey of sources, impacts, and mitigation strategies. Sci, 6(1), 3. https://doi.org/10.3390/sci6010003 ↵
- Kleinberg, J.; Mullainathan, S.; Raghavan, M. (2017). Inherent trade-offs in the fair determination of risk scores. In Proceedings of the Innovations in Theoretical Computer Science (ITCS), 9–11. ↵
- European Commission (2019). Ethics guidelines for trustworthy AI. Commission Communication. https://op.europa.eu/en/publication-detail/-/publication/d3988569-0434-11ea-8c1f-01aa75ed71a1 ↵
- IBM Data and AI Team. (2023, October 16). Shedding light on AI bias with real-world examples. IBM. https://www.ibm.com/think/topics/shedding-light-on-ai-bias-with-real-world-examples ↵
- Ferguson, A.G. (2012). Predictive policing and reasonable suspicion. Emory Law Journal, 62(259), 259–355 https://scholarlycommons.law.emory.edu/elj/vol62/iss2/1/ ↵
- Yan, S., Kao, H. T., & Ferrara, E. (2020). Fair class balancing: Enhancing model fairness without observing sensitive attributes. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management (pp. 1715–1724). Association for Computing Machinery. https://dl.acm.org/doi/abs/10.1145/3340531.3411980 ↵
- Zafar, M. B., Valera, I., Gomez Rodriguez, M., & Gummadi, K. P. (2017). Fairness beyond disparate treatment & disparate impact: Learning classification without disparate mistreatment. In Proceedings of the 26th International Conference on World Wide Web (pp. 1171–1180). ACM. https://doi.org/10.1145/3038912.3052569 ↵
- Crenshaw, K. (1989). Demarginalizing the intersection of race and sex: A Black feminist critique of antidiscrimination doctrine, feminist theory, and antiracist politics. University of Chicago Legal Forum, 1989(1), 139–167. http://chicagounbound.uchicago.edu/uclf/vol1989/iss1/8 ↵
- Ferguson, A. G. (2012). Predictive policing and reasonable suspicion. Emory Law Journal, 62, Article 259. https://tinyurl.com/26brv7cy ↵
- Crenshaw, K. (1989). Demarginalizing the intersection of race and sex: A Black feminist critique of antidiscrimination doctrine, feminist theory, and antiracist politics. University of Chicago Legal Forum, 1989(1), 139–167. http://chicagounbound.uchicago.edu/uclf/vol1989/iss1/8 ↵
- Ferrara, E. (2024). Fairness and bias in artificial intelligence: A brief survey of sources, impacts, and mitigation strategies. Sci, 6(1), 3. https://doi.org/10.3390/sci6010003. ↵
- Wachter, S., Mittelstadt, B., & Russell, C. (2018). Counterfactual explanations without opening the black box: Automated decisions and the GDPR. Harvard Journal of Law & Technology, 31(2), 841–887. https://tinyurl.com/yepnn2wz ↵
- Schwartz, R., Vassilev, A., Greene, K., Perine, L., Burt, A., & Hall, P. (2022). Towards a standard for identifying and managing bias in artificial intelligence (NIST Special Publication 1270). National Institute of Standards and Technology. https://doi.org/10.6028/NIST.SP.1270 ↵
- Sallam, M. (2023). ChatGPT's utility in healthcare education, research, and practice: Systematic review on the promising perspectives and valid concerns. Healthcare, 11(6), Article 887. https://doi.org/10.3390/healthcare11060887 ↵
- Ananny, M. and Crawford, K. (2016). Seeing without knowing: limitations of the transparency ideal and its application to algorithmic accountability. New Media & Society, 20(3), 973–989. https://doi.org/10.1177/1461444816676645 ↵
- Wollny, S., Schneider, J., Mitri, D. D., Weidlich, J., Rittberger, M., & Drachsler, H. (2021). Are we there yet? - a systematic literature review on chatbots in education. Frontiers in Artificial Intelligence, 4. https://doi.org/10.3389/frai.2021.654924 ↵
- Zheng, Y., Epitropaki, O., Graham, L., & Caveney, N. (2021). Ethical leadership and ethical voice: the mediating mechanisms of value internalization and integrity identity. Journal of Management, 48(4), 973–1002. https://doi.org/10.1177/01492063211002611 ↵
- Le-Nguyen, H., & Tran, T. T. (2024). Charting the ethical course. In S. Elmoudden & J. S. Wrench (Eds.), The role of generative AI in the communication classroom (pp. 214–261). IGI Global. https://doi.org/10.4018/979-8-3693-0831-8.ch011 ↵
- Babu, S., & Suruthi, G. (2024). Unravelling the evolution of generative AI in communication education: A historical journey into human-machine communication. In S. Elmoudden & J. S. Wrench (Eds.), The role of generative AI in the communication classroom (pp. 55–71). IGI Global. https://doi.org/10.4018/979-8-3693-0831-8 ↵
- Donovan, J., Caplan, R., Matthews, J., & Hanson, L. (2018) Algorithmic accountability: A primer. Data & Society. ↵
- Klare, B. F., Burge, M. J., Klontz, J. C., Vorder Bruegge, R. W., & Jain, A. K. (2012, December). Face recognition performance: Role of demographic information. IEEE Transactions on Information Forensics and Security, 7(6), 1789–1801. https://doi.org/10.1109/TIFS.2012.2214212 ↵
- AI in society: For artificial intelligence to thrive, it must explain itself. If it cannot, who will trust it? (2018, February 17). The Economist. https://tinyurl.com/ystb3tyd, para. 8 ↵
- Datta, A., Sen, S., & Zick, Y. (2016). Algorithmic transparency via quantitative input influence: Theory and experiments with learning systems. In 2016 IEEE Symposium on Security and Privacy (pp. 598-617). https://doi.org/10.1109/SP.2016.42 ↵
- The Economist. (2018, February 17). For artificial intelligence to thrive, it must explain itself. The Economist. https://www.economist.com/science-and-technology/2018/02/17/for-artificial-intelligence-to-thrive-it-must-explain-itself ↵
- Lawton, G. (2020, April). 4 explainable AI techniques for machine learning models. TechTarget. https://www.techtarget.com/searchenterpriseai/feature/How-to-achieve-explainability-in-AI-models ↵
- Gryz, J., & Rojszczak, M. (2021). Black box algorithms and the rights of individuals: No easy solution to the “explainability” problem. Internet Policy Review, 10. https://policyreview.info/articles/analysis/black-box-algorithms-and-rightsindividuals-no-easy-solution-explainability ↵
- Department for Science, Innovation and Technology, & AI Safety Institute. (2024, May 17). International scientific report on the safety of advanced AI: An interim report on the science of advanced artificial intelligence (AI) safety. https://www.gov.uk/government/publications/international-scientific-report-on-the-safety-of-advanced-ai ↵
- Bhatt, U., Xiang, A., Sharma, S., Weller, A., Taly, A., Jia, Y., Ghosh, J., Puri, R., Moura, J. M. F., & Eckersley, P. (2020). Explainable machine learning in deployment. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency (FAT '20; pp. 648–657). Association for Computing Machinery. https://doi.org/10.1145/3351095.3375624 ↵
- Ojewale, V., Steed, R., Vecchione, B., Birhane, A., & Raji, I. D. (2024). Towards AI accountability infrastructure: Gaps and opportunities in AI audit tooling. ArXiv. https://doi.org/10.48550/arXiv.2402.17861 ↵
- Le-Nguyen, H. T., & Tran, T. T. (2024). Charting the ethical course: Navigating AI advancements in communication education. In S. Elmoudden & J. S. Wrench (Eds.), The role of generative AI in the communication classroom (pp. 1–37). IGI Global. ↵
- Department for Science, Innovation and Technology, & AI Safety Institute. (2024, May 17). International scientific report on the safety of advanced AI: An interim report on the science of advanced artificial intelligence (AI) safety. https://www.gov.uk/government/publications/international-scientific-report-on-the-safety-of-advanced-ai ↵
- Greenblatt, R., Shlegeris, B., Sachan, K., & Roger, F. (2023). AI control: Improving safety despite intentional subversion. ArXiv. https://doi.org/10.48550/arXiv.2312.06942 ↵
- Turpin, M., Michael, J., Perez, E., & Bowman, S. R. (2023). Language models don't always say what they think: Unfaithful explanations in chain-of-thought prompting. In 37th Conference on Neural Information Processing Systems. https://openreview.net/forum?id=bzs4uPLXvi ↵
- Buiten, M., de Streel, A., & Peitz, M. (2023). The law and economics of AI liability. Computer Law & Security Review, 48, Article 105794. https://www.sciencedirect.com/science/article/pii/S0267364923000055 ↵
- Chua, J., Rees, E., Batra, H., Bowman, S. R., Michael, J., Perez, E., & Turpin, M. (2024). Bias-augmented consistency training reduces biased reasoning in chain-of-thought. ArXiv. https://doi.org/10.48550/arXiv.2403.05518 ↵
- Ojewale, V., Steed, R., Vecchione, B., Birhane, A., & Raji, I. D. (2024). Towards AI accountability infrastructure: Gaps and opportunities in AI audit tooling. ArXiv. https://doi.org/10.48550/arXiv.2402.17861 ↵
- Turri, V., & Dzombak, R. (2023). Why we need to know more: Exploring the state of AI incident documentation practices. In Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society (AIES '23) (pp. 576-583). ACM. https://doi.org/10.1145/3600211.3604700 ↵
- Liu, Y., Deng, G., Xu, Z., Li, Y., Zheng, Y., Zhang, Y., Zhao, L., Zhang, T., Wang, K., & Liu, Y. (2023). Jailbreaking ChatGPT via prompt engineering: An empirical study. ArXiv. http://arxiv.org/abs/2305.13860 ↵
- Wei, A., Haghtalab, N., & Steinhardt, J. (2023). Jailbroken: How does LLM safety training fail? In 37th, Conference on Neural Information Processing Systems. https://openreview.net/forum?id=jA235JGM09 ↵
- Rao, A., Vashistha, S., Naik, A., Aditya, S., & Choudhury, M. (2024). Tricking LLMs into disobedience: Formalizing, analyzing, and detecting jailbreaks. In 2024, Joint International Conference on Computational Linguistics, Language Resources and Evaluation. (LREC-COLING 2024). https://doi.org/10.48550/arXiv.2305.14965 ↵
- Casper, S., Bu, T., Li, Y., Li, J., Zhang, K., Hariharan, K., & Hadfield-Menell, D. (2023). Red teaming deep neural networks with feature synthesis tools. In 37th Conference on Neural Information Processing Systems (NeurIPS 2023). https://openreview.net/forum?id=Od6CHhPM7I ↵
- Restrepo, D., & Baquero, P.M. (2023). Promises and limits of law for a human-centric artificial intelligence. Computer Law & Security Review, 48, Article 105795. https://doi.org/10.1016/j.clsr.2023.105795 ↵
- Stokel‐Walker, C. (2024, February 7). Europe’s new AI rules could go global—here’s what that will mean. Scientific American. https://www.scientificamerican.com/article/europes-new-ai-rules-could-go-global-heres-what-that-will-mean/ ↵
- Analytics Vidhya. (2024, May 29). Exploring responsible AI: Insights, frameworks & innovations with Ravit Dotan | Leading with data 37 [Video]. YouTube. https://youtu.be/X8Vh59d8LPA ↵
- Gruetzemacher, R., Paradice, D., & Lee, K.B. (2020). Forecasting extreme labor displacement: A survey of AI practitioners. Technological Forecasting and Social Change, 161. https://doi.org/10.1016/j.techfore.2020.120323 ↵
- Eke, D. O. (2023). ChatGPT and the rise of generative AI: Threat to academic integrity? Journal of Responsible & Technology, 13. https://www.sciencedirect.com/science/article/pii/S2666659623000033 ↵
- Aydın, Ö., & Karaarslan, E. (2023, January 31). Is ChatGPT leading generative AI? What is beyond expectations? https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4341500 ↵
- Tom Bilyeu. (2023, July 18.). "AI is about to FLIP your life upside down" - Get ahead while others lose their jobs | Emad Mostaque [Video]. YouTube. https://www.youtube.com/watch?v=Se91Pn3xxSs ↵
- Singh, M. (2024, March 22). Stability AI CEO resigns because you’re ‘not going to beat centralized AI with more centralized AI’. TechCrunch. https://techcrunch.com/2024/03/22/stability-ai-ceo-resigns-because-youre-not-going-to-beat-centralized-ai/; para. 4. ↵
- Toner, H. (2024, April). How to govern AI — even if it's hard to predict. [Video]. TED Conferences. https://www.ted.com/talks/helen_toner_how_to_govern_ai_even_if_it_s_hard_to_predict ↵
- Cimpeanu, T., Santos, F. C., Pereira, L. M., Lenaerts, T., & Han, T. A. (2022). Artificial intelligence development races in heterogeneous settings. Scientific Reports, 12(1), Article 1723. https://doi.org/10.1038/s41598-022-05729-3 ↵
- Umoh, E. E. (2024). Reliability of AI algorithms in safety applications. International Journal of Engineering and Advanced Technology Studies, 12(2), 74–85. https://doi.org/10.37745/ijeats.13/vol12n27485 ↵
- Roose, K. (2024, June 4). OpenAI insiders warn of a ‘reckless’ race for dominance. The New York Times. https://www.nytimes.com/2024/06/04/technology/openai-culture-whistleblowers.html ↵
- The White House. (2024, April 29). Biden-Harris administration announces key AI actions 180 days following President Biden’s landmark executive order [Press Release]. https://tinyurl.com/4z54cr69 ↵
- Menashe, M. (2020). The race to the bottom revisited: International labour law, global trade and evolutionary game theory. Oxford Journal of Legal Studies, 40, 53-81. https://doi.org/10.1093/ojls/gqz029 ↵
- Salazar, L. R., Peeples, S. F., & Brooks, M. E. (2024). Generative AI ethical considerations and discriminatory biases on diverse students within the classroom. In S. Elmoudden & J. S. Wrench (Eds.), The role of generative AI in the communication classroom (pp. 1–37). IGI Global. ↵
- Pew Research Center. (2023). Americans and privacy: Concerned, confused, and feeling lack of control over their personal information. https://tinyurl.com/28xyd2dd ↵
- Selinger, E., & Hartzog, W. (2022). The case against campus facial recognition. Communications of the ACM, 65(3), 25-27. ↵
- Sharma, S. (2025, March 12). AI surveillance in US schools: Thousands of sensitive student documents exposed in surveillance breach, fueling privacy fears. The Times of India. http://timesofindia.indiatimes.com/articleshow/118936575.cms ↵
- Carlini, N., Tramèr, F., Wallace, E., Jagielski, M., Herbert-Voss, A., Lee, K., Roberts, A., Brown, T., Song, D., Erlingsson, Ú., Papernot, N., & Raffel, C. (2021). Extracting training data from large language models. In 30th USENIX Security Symposium (USENIX Security 21) (pp. 2633-2650). ↵
- Rieke, A., & Bogen, M. (2018, December 10). Help wanted: An examination of hiring algorithms, equity, and bias. Upturn. https://www.upturn.org/work/help-wanted/ ↵
- McDonald, A. M., & Cranor, L. F. (2008). The cost of reading privacy policies. I/S: A Journal of Law and Policy for the Information Society, 4(3), 543–568. ↵
- Noble S. U. (2018). Algorithms of oppression. New York University Press. ↵
- Buolamwini, J. (2019). Artificial intelligence has a problem with gender and racial bias. Here’s how to solve it. Time. https://time.com/5520558/artificial-intelligence-racial-gender-bias/ ↵
- Ferrara, E. (2024). Fairness and bias in artificial intelligence: A brief survey of sources, impacts, and mitigation strategies. Sci, 6(1), 3. https://doi.org/10.3390/sci6010003 ↵
- Buolamwini, J (2018, September 27 ). AI, ain't I a woman? [Video]. YouTube. https://www.youtube.com/watch?v=FejjAbwUqbA ↵
- Iyer, A. (2022, December 15). Behind ChatGPT’s wisdom: 300 bn words, 570 GB data. Analytics India Magazine. https://analyticsindiamag.com/behind-chatgpts-wisdom-300-bn-words-570-gb-data/ ↵
- Ferrara, E. (2024). Fairness and bias in artificial intelligence: A brief survey of sources, impacts, and mitigation strategies. Sci, 6(1), 3. https://doi.org/10.3390/sci6010003 ↵
- Esposito, P., Atighehchian, P., Germanidis, A., & Ghadiyaram, D. (2023). Mitigating stereotypical biases in text-to-image generative systems. ArXiv. https://arxiv.org/abs/2310.06904 ↵
- Rizvi, J. (2023, November 19). How AI can be leveraged for diversity and inclusion. Forbes. https://www.forbes.com/sites/jiawertz/2023/11/19/how-ai-can-be-leveraged-for-diversity-and-inclusion/ ↵
- Birhane, A., Isaac, W., Prabhakaran, V., Diaz, M., Elish, M. C., Gabriel, I., & Mohamed, S. (2022). Power to the people? Opportunities and challenges for participatory AI. In Proceedings of the 2nd ACM Conference on Equity and Access in Algorithms, Mechanisms, and Optimization (pp. 1–8). Association for Computing Machinery. https://doi.org/10.1145/3551624.3555290 ↵
- World Economic Forum. (2022). Blueprint for equity and inclusion in artificial intelligence. White Paper. https://www3.weforum.org/docs/WEF_A_Blueprint_for_Equity_and_Inclusion_in_Artificial_Intelligence_2022.pdf ↵
- Dignum, V. (2019). Responsible artificial intelligence: How to develop and use AI in a responsible way. Springer Nature Switzland. ↵
- Neves, I. (2023, March 7). Ethical design: What is it and why is it important for innovators? Beta-i. https://beta-i.com/blog/ethical-design-what-is-and-why-is-important-for-you ↵
- Mittelstadt, B. D., Allo, P., Taddeo, M., Wachter, S., & Floridi, L. (2016). The ethics of algorithms: Mapping the debate. Big Data & Society, 3(2), 2053951716679679. https://doi.org/10.1177/2053951716679679 ↵
- World Economic Forum. (2022). Blueprint for equity and inclusion in artificial intelligence. White Paper. https://www3.weforum.org/docs/WEF_A_Blueprint_for_Equity_and_Inclusion_in_Artificial_Intelligence_2022.pdf ↵

AI ethics is like managing a food forest, where the focus is on cultivating a self-sustaining and diverse ecosystem. Just as a food forest integrates layers of trees, shrubs, and ground plants to create a resilient and productive environment, AI ethics involves addressing issues like algorithmic bias, user privacy, transparency, and fairness. In a food forest, every element is interconnected, working together to support the whole system, much like how ethical AI practices ensure that technology benefits society while maintaining trust and integrity. Both aim to create a harmonious and sustainable environment, fostering growth that respects and nourishes the community and the ecosystem. (ChatGPT 4o, 2024)


