4 Chapter 4: Foundations of AI Literacy
If you’re like me, you might be able to draw a stick figure on a good day. On a bad day, that stick figure looks like some extraterrestrial monstrosity. For the first time, having those fine-tuned artistic skills is not necessary to create art. Today, everyone can be a painter, a lyricist, a songwriter, a poet, a storyteller, and so much more. If you’re an artist, instead of painting with a paintbrush, you’ve got a computer. And instead of paint, you’ve got data—lots and lots of data. With the right tools, you can train your computer to create stunning artwork, realistic images, and even entire virtual worlds. Welcome to the incredible world of generative AI.
But wait, there’s more. As you explore this exciting new world, you might wonder about the true capabilities of AI. Can it really think like a human? Will it take over all our jobs? Is it going to save the world or destroy it? There’s a lot of hype and speculation out there, and it’s hard to separate fact from fiction.
That’s where this chapter comes in. We’ll take you on a journey through the ins and outs of generative AI, exploring how it works and what it can do. But we won’t stop there. We’ll also take a deeper dive into the world of AI more broadly, examining its capabilities and limitations, and tackling some of the most pervasive myths and misconceptions along the way.
Whether you’re a tech enthusiast, an artist looking to expand your creative horizons, or just someone curious about the future of AI, this chapter has something for you. We’ll demystify the jargon, break down the concepts, and give you a clear and engaging look at one of the most exciting and rapidly evolving fields in technology today.
Generative AI Models
Learning Objectives
- Explain how generative AI models work and their key components.
- Describe various generative models and their applications.
- Evaluate the strengths and weaknesses of generative models.
- Identify real-world applications of generative AI across different industries.
- Evaluate the effectiveness of an AI agent’s sensory and action systems in a given context.
Picture this; you’re an artist, but instead of a paintbrush, you’ve got a computer. And instead of paint, you’ve got data – lots and lots of data. With the right tools, you can train your computer to create stunning artwork, realistic images, and even entire virtual worlds. That’s the power of generative AI (GenAI) models. By now, you’ve probably experimented with or at least seen the results from one of the many GenAI text-to-image models that exist. Text-to-image models jumped onto the scene in the mid-2010s as advances in deep neural networks enabled the training of large datasets of images.[1] By the early 2020s, several companies had models available to the public:
- OpenAI’s DALL-E—January 2021
- Google’s Imagen—May 2022
- Midjourney—July 2022
- Stability AI’s Stable Diffusion—August 2022
- Adobe’s Firefly—June 2023
Defining and Explaining GenAI Models
Before proceeding, we should define the term Generative AI.
A type of AI that learns from input data to create novel content, such as text, images, audio, video, or code, which is similar to the training data but not an exact replication.
Before delving into the specifics of GenAI, it’s important to realize that the models have improved over time.
GenAI can create new and original audio, art, code, or text by learning from massive amounts of input data, which is essentially a collection of examples that the AI system studies. The original DALL-E dataset contained a massive 12 million images that were used to train the model. Now, the text-to-image models are trained on hundreds of millions of images. It’s almost impossible to fathom how many images were used in these training models before a single output image was created.

The process is kind of like this, imagine a child learning to draw. They might look at many different pictures of cats drawn by various artists. The child may have a cat or know some neighborhood cats. They’ve seen cats on television. By studying these examples, the child starts to understand the key features of a cat, such as pointy ears, whiskers, and a long tail, and how these key features differ from other animals, say a dog, bird, or elephant. Eventually, the child can create their own unique drawing of a cat. GenAI works similarly. And in kind of the same way, the models have gotten better over time. For example, GenAI Art 4.1 shows an early image I created in Midjourney back in early July 2022 using Midjourney version 2. The prompt I asked for was “two men picking apples.” Well, it gave me images. There are vaguely humanoid shapes and some red balls that may very well be apples, but the images were definitely a version of abstract art.

By March 2023, Midjourney version 5 was released, and I tried that same prompt again; the models had gotten much better (GenAI Art 4.2). Today, we have Midjourney version 6, so I ran the exact same prompt and received the following four images (GenAI Art 4.3). You can easily see that over time, the same prompt has just gotten better. In the same way, children become better artists the more they draw and receive feedback from other artists and teachers. I can only imagine what these images will look like in the years to come.
How it Works
Learning from Input Data
The input data serves as a guide for the GenAI. It could be a dataset of millions of images, a large collection of texts, or many audio recordings. The AI system analyzes this data to identify patterns and characteristics.
Think of it like a chef learning a new cuisine. They might taste many dishes to understand the common ingredients, flavors, and techniques. Similarly, the GenAI “tastes” the input data to grasp its essence.

Creating Novel Content
After learning from the input data, the GenAI can create new content that is similar to the training data but not an exact copy. This is where the “generative” aspect comes in.
Consider a musician who has studied many classical compositions. They’ve internalized the style, structure, and common motifs. When they compose their own piece, it will likely have elements that resemble classical music, but it will be a new, original creation. GenAI operates on the same principle.
The Mona Lisa
Let’s look at another example before talking about other forms of GenAI output. Probably one of the most famous paintings in the world is Leonardo DaVinci’s Mona Lisa. But, could you spot the real Mona Lisa in a lineup? Let’s find out. GenAI Art 4.4 shows a series of four Mona Lisa’s. One of them is the real deal, and the other three are ones created by GenAI text-to-image models. See if you can spot the real one.

Do you know which one it is? Check the end of the chapter to see which one is the actual Mona Lisa by Leonardo DaVinci.
How’d you do?
So, let’s take a moment to learn how the various GenAI models created their own versions of the Mona Lisa without being exact copies of the original image painted by DaVinci.
GenAI models, like the one used to recreate a version of the Mona Lisa, employ advanced techniques to generate new content that mimics existing styles or concepts. Here’s a simplified explanation of how this process works.
Training on Data
The different models are trained on a vast dataset of images, which can include artworks from different periods, styles, and artists. This training involves analyzing countless features of these artworks, such as color, texture, form, and composition. The Mona Lisa is one of those images that is all over the internet in various forms, so it’s not surprising that all the major text-to-image GenAI models know exactly what the Mona Lisa looks like with no need to provide the models a reference image to work with. The models also had hundreds of thousands of variations of the Mona Lisa available to train on. In a 2021 analysis of Googled paintings, the Mona Lisa came out on top.[2]
We should note that training data can cause problems as well. Here’s an example. Jason pitted twelve LLMs against each other and asked them to “pick a number between 1 and 100.” As you can see in the video, here is what he received:
- GPT-4.0 – 37
- Claude 3 Sonnet – 42
- Claude 3 Haiku – 47
- Mistral-Large – 42
- Gemini 1.0 Pro – 37
- Gemini 1.5 Pro – 42
- GPT-3.5 – 74
- Llama-3 8b – 43
- Llama-2 70b – 43
- Llama-2 13b – 23
- Jurassic – 42
- Titan LM – 43
Command gave me, “I’m going to give you a number between 1 and 100, and you have to guess what it is. Is that okay?”, so I dropped that one from the list. If you look at the list carefully, you’ll see that four came back with the number “42,” and another three came back with “43.” What’s going on? Well, believe it or not, we have two issues related to training data at play here. First, the number 42 actually appears more often on the internet than other numbers, so the training data has a greater likelihood of “randomly” picking that number. You may be wondering why 42 is more common. Well, actually, Mistral-Large explained this, “Sure, how about 42? It’s a fun number with some interesting mathematical properties, and it’s also famously known as the “Answer to the Ultimate Question of Life, The Universe, and Everything” in Douglas Adams’ science fiction novel The Hitchhiker’s Guide to the Galaxy.” Because of Adams’ book, the number 42 has taken on a life of its own and ends up training data more often than not. As for the 43’s, I guess that they were trained not just to give 42, so they went with 43. Either way, more than half of the responses end up looking less random because of the training data.
GenAI training data is notorious for these types of biases that appear. For example, in a recent study examining gender and racial bias, Ananya found that text-to-image GAI had a habit of generating images that fit with societal stereotypes.[3] For example, ask a GenAI to create an image of a doctor, and they’re more likely to produce a picture of a male even though females represent 37% of physicians in the U.S. and 55% of new medical students.[4] On the other end, when asked to generate images of cooks or housekeepers, they were more likely to generate images of people of color. Sadly, there are prejudices and biases present in the datasets, so it is not surprising to find these same biases in the content that AIs produce.
As such, some computer scientists try to compensate for this bias by ensuring that the number of images generated of genders and races adhere to what is seen in the general population. Although far from a perfect solution, this process can ensure that a broader array of images is created. However, this can also lead to problems. When Google released their Gemini model with text-to-image GenAI, they had tried to program for gender and racial bias in the training data, but some of the images that were produced were historically inaccurate (e.g., Black George Washington, Indian female Pope, Asian Roman Centurians, etc.). There was definitely a bit of an outcry by some calling the GenAI “woke,” [5] but the coding issues on the development side were more complex than many in the media could truly grasp.
Learning Patterns and Features
Through deep learning, particularly using neural networks, the AI model learns to recognize and reproduce the complex patterns and stylistic features of various artworks. For instance, with the Mona Lisa, the model learns the subtleties of Leonardo da Vinci’s techniques—soft transitions of color, the characteristic sfumato technique (a way of blending colors and tones to achieve a more realistic, three-dimensional appearance), and even the specific way he painted facial expressions. The models also have all the variations of the Mona Lisa to train on to see stylistically how other artists alter the painting, which provides a huge database of alternatives to the original painting.
Generating New Content
Once trained, the AI uses what it has learned to generate new images. This is not simply copying. Instead, the AI models create new versions of the Mona Lisa based on the learned patterns and styles. In generating a Mona Lisa-like image, the AI applies its understanding of Renaissance art, da Vinci’s techniques, and the specific ambiance and pose typical of the Mona Lisa.
The creative process in AI involves generating content that is both novel and inspired by the learned data. Imagine the AI as an apprentice artist who has studied thousands of artworks. Instead of merely replicating these artworks, the AI combines various elements and styles it has learned to produce something new. For instance, if tasked with creating a new painting in the style of the Mona Lisa, the AI doesn’t just duplicate the original. It might create a different portrait that embodies the same techniques, such as the use of light and shadow, the serene expression, and the detailed background, yet it results in a completely new piece of art.
Iterative Refinement
The models can refine their output through additional feedback loops, where the generated images are evaluated against the desired outcome (e.g., how closely they resemble Renaissance art or da Vinci’s style) and adjusted accordingly.
User Input for Specificity
Tools like Stable Diffusion, MidJourney, DALL-E, etc. allow users to input specific prompts that describe what they want in an image, which guides the AI in creating something that aligns with the user’s request while still being an original creation.
This ability to generate art in the style of historic masterpieces like the Mona Lisa showcases the intersection of technology and creativity, enabling both the preservation of artistic heritage in new forms and the exploration of creative possibilities through technology.
Examples of GenAI Output
It’s important to know that GenAI has several different use cases. The most common ones are images and text, but GenAI can also create audio, computer code, video, 3D objects, and more and more things with each passing day.
- A GenAI trained on a dataset of poems could create a new poem that has a similar style and uses related themes and vocabulary. Just as a test, I asked ChatGPT to “write a poem about two men picking apples.” Here is what it came up with:
In orchard’s embrace, ‘neath the sun’s golden eye,
Two men wandered ‘neath the vast, azure sky.
With laughter as bright as the dawn’s early light,
They reached for the apples with faces alight.
The trees stood as sentinels, branches spread wide,
Offering fruit that the leaves had once hid.
With each careful twist, and with each gentle pull,
Their baskets grew heavy, so bountiful, full.
The apples gleamed red, with a blush of soft gold,
A feast for the eyes, a sight to behold.
The men moved with rhythm, as if in a dance,
In the art of the harvest, they found their romance.
Their hands brushed together with each fruit they took,
A silent communion, no words, just a look.
Together they labored, from morning till noon,
Humming a tune to the orchard’s own tune.
Their brows, kissed by sunlight, their spirits so free,
As if this simple act was their destiny.
For in the act of picking apples from trees,
They found the sweet taste of life’s simple pleas.
As shadows grew long, and the daylight grew dim,
They shared in the joy of the harvest’s last hymn.
With baskets now laden and hearts full of cheer,
The men knew this moment would draw them each year.
To the orchard they’d return, as the seasons would call,
To pick the ripe apples in the early fall.
For in each picked apple, there lies a sweet tale,
Of friendship and time spent beyond the life’s pale.
- An AI system fed with many images of faces could generate a new, unique face that looks realistic but doesn’t belong to a real person. We’ve already seen what it looks like when you ask an image generator to create pictures of two men picking apples.
- A GenAI system can also create songs. Here I fed the idea of picking apples into a GenAI platform that creates audio named Udio. Here is the song it created just for us.
Lyrics
Fill our basket, under skies of blue
Every apple’s sweeter when I’m with you
[Pre-Chorus]
We jump into the car, don’t need no map
Just your hand in mine, and your straw hat
Singing to the radio, we find our tune
Sun-kissed love, we’re in full bloom
Honey we’re out in the sun’s fuse
Grabbing the reds and the greens too, let’s just take off
You love the orchard and I’m in the sweet summer groove, oh
Take off
Harvest beat
Darling I feel the excitement, indeed
[Chorus]
In the orchard, love is ripe and rich
Pickin’ memories, stitch by stitch
Laughter in the air, we’re in our paradise
Eating Granny Smiths, ain’t this life so nice
[Bridge]
Oh, the thrill, with every apple we pick
Life’s a tree, and you’re my perfect pick
Under the boughs, we’re lost in time
Where the fruit hangs low, and life’s in its prime
[Pre-Chorus]
We fill our hearts as we fill our crate
A dance in the grove, oh it feels so great
Chasing the butterflies, chasing the sun
With you by my side, the fun’s never done
Dew drops sparkle while we laugh and play
Golden hours in the heat of May
[Pre-Chorus]
We’re the kings and queens, no palace needed
Every smile you give, I’m getting greedy
In the rows and rows, singin’ our chorus
Just a simple day, but it means so much more to us
Apple of my eye, in the summer’s glow
Juicy like the tales of love we sow
[Chorus]
Oh, oh, grabbing life by the apples, oh
Sweet and tart, in our hearts, the flavor flows
Take a bite, feel the light, as our orchard shows
Oh, oh, living free ‘neath the apple tree rows
- A GenAI studying a large codebase could write a new piece of code that performs a similar function but is not a direct copy of any existing code.

Figure 4.2 -ASCII Code Apple
Variational Autoencoders (VAEs)
Imagine you’re a budding artist who wants to create your own unique style. You study the works of famous painters, trying to understand the essence of what makes their art so special. You break down each painting into its fundamental components – the brushstrokes, the color palette, the composition – and then use that knowledge to create something new and original. That’s kind of like what a Variational Autoencoder (VAE) does, but with data instead of art.
At its core, a VAE is a type of generative model that learns to encode data into a compressed representation, and then decode that representation back into the original data. It’s like a fancy version of a zip file for data!
To understand how VAEs work, let’s use a simple example. Suppose you have a dataset of handwritten digits. Each image in the dataset is a 28×28 pixel gray-scale image, which means it 784 pixels in the image. That’s a lot of data to work with!
A VAE takes each of these images and encodes them into a much smaller representation, typically in a lower-dimensional space (let’s say, 10 dimensions). This compressed representation is called the “latent space.” It’s like distilling the essence of each image down to its most important features. Once the VAE has encoded the data into the latent space, it can then generate new data by sampling from that space and decoding it back into the original format. It’s like giving the VAE a set of instructions for creating new handwritten digits that look similar to the ones it was trained on, but with some variations. The “variational” part of VAE comes into play because the model doesn’t just learn a single fixed encoding for each data point, but rather a probability distribution over the latent space. This allows the model to generate a range of different outputs by sampling from that distribution.
One key advantage of VAEs is that they can generate new data that looks similar to the training data but with some level of novelty. This makes them useful for tasks like data augmentation (generating additional training examples), anomaly detection (identifying data points that don’t fit the learned pattern), and even creative applications like generating new designs or music.
However, VAEs have limitations. They can struggle with generating high-quality outputs, especially for complex data like high-resolution images. They also require careful tuning of the model architecture and training process to achieve good results.
Despite these challenges, VAEs remain a powerful and widely-used tool in the GenAI toolkit. They’ve been applied to a wide range of domains, from generating new molecular structures for drug discovery to creating realistic 3D models for video games.
Generative Adversarial Networks (GANs)

Picture a world where two AI agents are locked in an epic battle of creativity. One agent, the generator, is tasked with creating fake data that looks as realistic as possible. The other agent, the discriminator, tries to spot the difference between the real and the fake. It’s like a high-stakes game of “spot the forgery,” but with data instead of art.
We proposed the concept of GANs originally in Chapter 2, but we’re going to delve into them in a bit more detail here. The key to GANs’ success lies in the adversarial relationship between the generator and the discriminator. The generator’s goal is to create fake data that’s so convincing that the discriminator can’t tell it apart from the real thing. Meanwhile, the discriminator’s job is to learn to spot the fakes, no matter how realistic they look.
As the two networks engage in this back-and-forth dance, they both get better and better at their respective tasks. The generator learns to create more realistic fakes, while the discriminator becomes an expert at spotting them. Over time, the quality of the generated data improves until it’s almost indistinguishable from the real thing.
I like to think of GANs as the AI equivalent of a forger and an art detective. The forger (the generator) is constantly trying to create fake masterpieces that can fool even the most discerning eye. The detective (the discriminator) is always on the lookout for telltale signs of forgery. As they both hone their skills, the line between real and fake becomes increasingly blurred.
One of the coolest things about GANs is their versatility. They can generate all kinds of data, from images and videos to music and even text. Let’s look at some of the more common ways GANs can be used.
- Conditional GANs: These GANs can generate data based on specific conditions or labels. For example, you could train a conditional GAN to generate images of cats wearing hats, or to create realistic images of buildings from sketches.
- CycleGANs: These GANs can translate data from one domain to another without needing paired examples. For instance, a CycleGAN could learn to transform images of horses into zebras or to convert black-and-white photos into color.
- StyleGANs: These GANs are specifically designed for generating high-quality images with fine-grained control over the style and content. They’ve been used to create incredibly realistic portraits and to transfer the style of one image onto another.
I remember the first time I saw a GAN-generated image. It was a portrait of a person who didn’t exist. That’s actually the name of the website: thispersondoesnotexist.com. The website randomly generates faces that look so real that I could have sworn I’d seen them before. You may see a slight mistake, a misshapen ear, or a smile that seems a bit off. But mostly, you wouldn’t realize these images were AI-generated. There’s even
Of course, GANs aren’t perfect. They can be notoriously difficult to train, and they sometimes struggle with issues like mode collapse (where the generator gets stuck producing a narrow range of outputs) or instability (where the training process becomes unstable and the quality of the generated data deteriorates).
Despite these challenges, GANs remain one of the most promising and rapidly evolving areas of GenAI. As researchers continue to refine and improve the techniques behind GANs, we can expect to see even more impressive and diverse applications in the future.
Sidebar – Ten Steps to Building a Model
- Define the scope and objectives of the GenAI model.
- Gather and preprocess vast amounts of diverse data, including text, images, audio, and video, to train the model.
- Design a highly sophisticated, large-scale neural network architecture capable of learning and generalizing from diverse data.
- Develop and implement advanced training algorithms, possibly combining techniques from deep learning, reinforcement learning, unsupervised learning, and transfer learning.
- Train the model on the extensive dataset using substantial computational resources, likely requiring a distributed computing infrastructure.
- Evaluate and fine-tune the model’s performance on a wide range of tasks and benchmarks continuously to assess its general intelligence capabilities.
- Incorporate techniques for model interpretability, explainability, and safety to ensure the GenAI model’s behavior is transparent, controllable, and aligned with human values.
- Conduct extensive testing and validation of the GenAI model in real-world environments to assess its robustness, adaptability, and potential limitations.
- Refine the model iteratively based on the testing results and feedback from domain experts and users.
- Address ethical, legal, and societal implications of deploying a GenAI model, ensuring proper governance and oversight mechanisms are in place.
Autoregressive Models

Imagine you’re a fortune teller, gazing into a crystal ball to predict the future. You don’t see the whole picture at once, but rather a series of glimpses that gradually reveal what’s to come. Each new vision builds upon the last, refining and clarifying your understanding of the future.
In a way, that’s how autoregressive models work. They’re a type of GenAI model that predicts data one step at a time, using the previous steps as context to inform the next prediction.
To understand how autoregressive models work, let’s use a simple example: generating text. Suppose you want to train an AI to write a story. An autoregressive model would start by predicting the first word of the story, based on some initial context (like a prompt or a set of parameters). Then, it would use that first word as context to predict the second word, and so on, until it had generated a complete story.
I find autoregressive models fascinating because they mirror how humans often create things. When we write a story or compose a piece of music, we do not start with the whole thing fully formed in our minds. Instead, we build it up gradually, letting each new idea or phrase inspire the next.
PixelRNN & PixelCNN: These models are designed specifically for generating images, one pixel at a time. They work by predicting the color value of each pixel based on the pixels that come before it (either in a row-by-row or a pixel-by-pixel sequence). By building up the image pixel by pixel, these models can create remarkably coherent and detailed images.
Transformer-based Models: These models, like the famous GPT (Generative Pre-trained Transformer) series, use a type of neural network architecture called a transformer to generate sequential data like text. They’ve been behind some of the most impressive language models in recent years, capable of generating eerily human-like text on a wide range of topics.
I once played around with a transformer-based language model, just to see what it could do. I gave it a simple prompt – “Once upon a time” – and let it generate a story from there. What it came up with was a bizarre, surreal tale of talking animals and magic portals, full of unexpected twists and turns. It was like peering into the mind of an alien storyteller, one who had learned the basic structure of a fairy tale but filled it with the strange and unpredictable.
One of the key advantages of autoregressive models is their flexibility. Because they generate data one step at a time, they can be used for a wide range of tasks, from generating images and music to predicting time series data like stock prices or weather patterns.
However, autoregressive models do have some limitations. Because they generate data sequentially, they can be slow and computationally expensive, especially for large datasets. They can also struggle with long-range dependencies, where the context needed to make a prediction is far away from the current step.
Despite these challenges, autoregressive models remain a powerful and widely-used tool in the GenAI toolkit. As researchers continue to develop new architectures and training techniques, we can expect to see even more impressive and diverse applications of these models in the future.
AI Agents

AI encompasses a broad range of capabilities—from processing vast amounts of data to making complex predictions. However, the true power of AI emerges when we transform this raw computational ability into meaningful actions in the real world. This is where AI agents come in: they bridge the gap between artificial intelligence and practical problem-solving by taking the “brainpower” of AI and putting it to work.
What Is an AI Agent?
At its core, an AI agent is a software entity designed to achieve specific goals through a continuous cycle of perception and action. Think of it as a digital assistant built from three key components—sensors, processing capabilities, and actuators—all operating within an environment that both provides the input and is affected by the agent’s actions.
The figure above illustrates how these components interact. The “Sensors” box represents the agent’s means of gathering data from the environment. This data then flows to the “AI Agent Processing Unit,” which interprets the information and decides on the best course of action. Finally, the “Actuators” carry out these decisions, directly influencing the environment. In this way, the environment is not only the source of data but also the recipient of the agent’s actions—forming a continuous loop of perception and response.
Sensors
Sensors are the parts of an agent that collect data from its environment. The environment is the external context where the agent operates, providing the raw data that fuels its decision-making process. For example, in a self-driving car, sensors include cameras, lidar, and radar. These devices gather visual and spatial information—detecting other vehicles, pedestrians, traffic signs, and lane markings from the car’s surroundings. In a different context, consider an AI travel planning agent; here, the “sensors” might be the various input methods it uses to receive data from you—such as text input where you specify travel dates, budget, and preferences—or even voice commands. In both cases, the environment supplies the necessary information that the agent relies on.
Processing Capabilities
Once the data is collected from the environment, the processing unit interprets this information to make decisions. In a self-driving car, the onboard computer processes sensor inputs using complex algorithms to determine the best driving actions—such as when to brake, steer, or accelerate—ensuring safety and smooth navigation through dynamic traffic conditions. Similarly, the travel planning agent uses its processing capabilities to analyze your preferences alongside real-time travel data, weighing different options like flight schedules, hotel availability, and cost to generate an optimal itinerary that meets your specific requirements.
Actuators
After decisions are made, actuators are responsible for carrying out the chosen actions, thereby affecting the environment. In the self-driving car, actuators translate the computer’s decisions into physical actions: they control the steering wheel, adjust acceleration, and apply the brakes to execute safe maneuvers that interact with the road and other vehicles. For the travel planning agent, actuators might involve automating the booking process. Once the agent has determined the best travel plan, it can reserve flights, book hotels, and secure tickets—actions that engage with external service providers and alter the state of your travel arrangements.
The Environment: Where Agents Operate
Understanding the environment in which an AI agent functions is essential for both its design and evaluation. The environment can vary in its characteristics:
- Observability: In a fully observable environment, the agent has access to all relevant information at any time—like a chess-playing AI that sees the entire board. In contrast, a partially observable environment hides some details, as seen in a poker game where an AI cannot see the opponents’ cards.
- Determinism: Some environments are deterministic, where actions yield completely predictable outcomes, while others are stochastic and inherently unpredictable, such as the fluctuating prices in a stock market.
- Temporality: In episodic environments, each action is independent (like classifying individual images); whereas, in sequential environments, current actions influence future situations (as with a conversation that requires maintaining context).
- Dynamism: A static environment remains unchanged during the agent’s deliberation, but a dynamic environment is continually evolving—as in real-time strategy games where multiple players act simultaneously.
| Characteristic | Description | Example |
| Observability | Fully observable (agent sees everything) vs. Partially observable (some hidden info) |
Chess (fully) vs. Poker (partially)
|
| Determinism | Deterministic (predictable outcomes) vs. Stochastic (unpredictable outcomes) |
Chess (deterministic) vs. Stock Market (stochastic)
|
| Temporality | Episodic (independent actions) vs. Sequential (current actions affect future) |
Image Classification (episodic) vs. Conversation (sequential)
|
| Dynamism | Static (unchanging) vs. Dynamic (constantly changing) |
Chess (static) vs. Real-time Strategy Game (dynamic)
|
Types of AI Agents
AI agents can be categorized based on their decision-making architectures. Simple reflex agents act only on their current percepts, much like a thermostat that adjusts heating based solely on the current temperature. Model-based agents use a mental model or internal representation of their environment, which helps them make more informed decisions—think of a vacuum-cleaning robot that remembers the layout of your home. Goal-based agents work toward defined objectives, evaluating actions based on how well they achieve specific goals, such as a navigation system determining the shortest route to a destination. Utility-based agents are a more sophisticated type of goal-based agent, balancing multiple objectives through a utility function. For example, a recommendation system weighs both user preferences and business priorities. Finally, learning agents continually refine their behavior by learning from experience and feedback, adapting to ever-changing environments.
A Practical Example: AI Travel Planning
Imagine planning a dream vacation to Disney World with your family—but instead of combing through countless travel websites and phone calls, you simply tell your AI agent, “I want to take a trip to Disney World with my family.” This travel-planning AI agent is designed with all the characteristics we’ve discussed, operating within a complex, real-world environment.
From the moment you make your request, the AI agent springs into action. Its “sensors” begin by gathering information from you: it asks follow-up questions like, “What dates are you considering?” and “What is your budget?” These questions serve as its primary input, much like a self-driving car uses cameras and sensors to perceive its surroundings.
Internally, the agent blends different decision-making architectures. It might start as a simple reflex agent by recognizing that a travel request typically involves flights, hotels, and activities. However, it quickly shifts to a model-based approach, maintaining an internal representation of your travel preferences and constraints—this is like how a vacuum-cleaning robot maps out the layout of a home.
Operating in a partially observable and dynamic environment, the agent must contend with uncertainties: travel prices can fluctuate (a stochastic element), and the availability of flights and accommodations can change as new data comes in. Here, the AI agent becomes goal-based and utility-based, using a utility function to balance your goals of cost efficiency, comfort, and convenience. The agent doesn’t just book the cheapest flight; it considers factors like layovers, flight times, and airline preferences. It evaluates various options by weighing different factors, such as the cheapest flight versus the most convenient hotel location, just as a navigation system chooses the best route based on multiple criteria.
The AI agent acts sequentially. Early decisions—like choosing your departure date—effect options, such as hotel availability and park ticket pricing. This sequential decision-making mirrors how a conversation AI maintains context across multiple exchanges. As it processes your input, the agent accesses external data sources (its environment) and begins crafting a custom itinerary. It then transitions to its “actuators,” which execute the decisions: booking flights, reserving a hotel, and even securing park tickets—all the while updating you with detailed confirmations and schedules.
But the process doesn’t end once the bookings are made. As a learning agent, the travel planner collects feedback after your trip, learning from your experiences to refine future recommendations. Whether it’s adjusting for unexpected delays, better balancing the itinerary, or negotiating better deals, this adaptive learning ensures that the more you interact with the agent, the smarter and more attuned it becomes to your specific needs.
The Future of AI Agents
Beyond travel planning, AI agents are used in various fields. For example, they assist doctors in diagnosing diseases by analyzing medical images and patient data, and they help financial institutions detect fraudulent transactions by identifying unusual patterns.
As technology continues to advance, AI agents are becoming more sophisticated. Improvements in machine learning, sensor technology, and actuator capabilities are enabling these agents to handle increasingly complex tasks. In the future, we can expect AI agents to become even more autonomous and capable of navigating nuanced, real-world environments. They will not only execute simple tasks but also collaborate more naturally with humans, learn more effectively from experience, and make decisions in uncertain and dynamic situations. The potential for AI agents to transform various aspects of our lives, from healthcare and education to transportation and entertainment, is immense.
Key Takeaways
- GenAI models are designed to create new data that resembles the training data, using techniques like GANs and autoencoders.
- GANs involve a generator and discriminator in a competitive setting, while autoencoders compress and reconstruct data to learn meaningful representations.
- GenAI models excel at creating realistic data but can face challenges like mode collapse and training instability.
- GenAI is used in image synthesis, data augmentation, style transfer, and anomaly detection, driving innovation across various fields.
- Sensors, processing capabilities, actuators, and the environment collaborate in an AI agent’s cycle of perception and action.
Exercises
- Build a simple GAN to generate synthetic images based on a dataset.
- Implement an autoencoder to compress and reconstruct image data.
- Research and present a specific application of GenAI, discussing its technology and impact.
Understanding AI Capabilities
Learning Objectives
- Describe AI’s capabilities in data analysis, pattern recognition, and insight generation.
- Explain how AI can be used to automate tasks and improve process efficiency.
- Identify how AI supports decision-making through data-driven insights.
- Explain how AI personalizes user experiences based on individual behavior and preferences.
You’re sitting in your favorite coffee shop, sipping a perfectly crafted latte, when your friend leans over and asks, “Hey, what do you think about AI? Can it really do all the amazing things people say it can?” You pause for a moment, considering the question. AI has been making headlines for years, promising to revolutionize industries, transform our daily lives, and even outsmart humans at specific tasks. But what can AI actually do?
As you ponder your response, you realize that understanding AI’s capabilities is like peeling back the layers of an onion. On the surface, it might seem like AI can do just about anything, from diagnosing diseases and composing music to driving cars and beating world champions at complex games. But as you dive deeper, you appreciate the nuances and complexities behind these impressive feats.
In this section, we’ll unravel the mysteries of AI’s capabilities. Together, we’ll separate fact from fiction. By the end of this section, you’ll be able to turn to your friend and confidently say, “Let me tell you what AI can really do.”
What AI Can Do
AI has made impressive advancements, transforming different areas of our lives. These systems have shown their capacity to handle large data sets, make informed choices, recognize patterns, automate repetitive tasks, and deliver customized experiences. Understanding the breadth of tasks AI performs and its potential to revolutionize industries and society is essential as we explore its capabilities.
Data Analysis Powerhouse
Imagine trying to read 500 million Tweets, which is roughly how many Tweets (X-posts?) get sent via Twitter-X in 24 hours. Now imagine trying to make sense of those 500 million Tweets. Thankfully, massive data processing, pattern identification, and insight generation are among AI’s most impressive capabilities—tasks simply impossible for humans to accomplish autonomously. As such, AI is becoming indispensable for businesses, researchers, and organizations because of its ability to uncover patterns, trends, and insights that might take humans days, weeks, or even months to discover, if ever.
Automation and Efficiency
Repetitive and time-consuming tasks yield readily to AI automation. Many activities in our daily lives and work—such as data entry, scheduling, and certain manufacturing processes—are prime candidates for AI systems. Programmed to handle these routine operations, AI liberates human time and resources for more complex, creative, and strategic work. Beyond increasing productivity, this approach also lowers the possibility of human error.
Informed Decision-Making
Across many fields (e.g., business, education, finance), AI-driven insights enhance decision-making processes. In healthcare, physicians benefit from AI assistance in early detection and diagnosis of various diseases.[6] Financial institutions leverage AI for better risk management of portfolios, market trend predictions, and fraud detection.[7] Businesses optimize their supply chain management and customer service through AI applications.
Personalized User Experiences
Learning from individual behavior, preferences, and history allows AI to deliver highly personalized recommendations and experiences. This tailored approach significantly enhances user satisfaction and engagement. Streaming services like Netflix recommend movies and TV shows based on viewing history, while e-commerce sites like Amazon suggest products aligned with previous purchases.
Language Understanding and Generation
AI-powered Natural Language Processing enables machines to understand, interpret, and generate human language. Virtual assistants such as Alexa, Siri, ChatGPT, and Hume use this technology to comprehend voice commands and reply in natural language. Beyond these applications, NLP powers language translation apps, chatbots, and sentiment analysis tools capable of gauging opinions and emotions from text data.
Visual and Auditory Perception
The ability to recognize and process auditory and visual information represents another breakthrough AI capability. In one study, researchers found that analyzing 32 different linguistic features enabled them to predict CEO deception with 84% accuracy.[8] Other research has demonstrated AI’s proficiency in using facial expressions to accurately determine deceptive behavior. Beyond deception detection, these technologies serve a wide range of purposes, from enhancing accessibility for people with disabilities to improving customer service and market research.
Predictive Analytics
By examining historical data and identifying patterns, AI excels at making future predictions. This capability proves valuable across numerous domains, including weather forecasting,[9] stock market analysis,[10] and business demand forecasting.[11] Organizations leverage AI-powered predictive analytics to make informed decisions, mitigate risks, and optimize their strategies.
Process Optimization
Through AI analysis and optimization, complex business processes and logistics become more efficient. Supply chain management benefits from AI’s ability to predict demand, reduce inventory costs, and streamline delivery routes. In manufacturing environments, AI systems monitor equipment performance, anticipate maintenance needs, and minimize downtime. These improvements lead to increased efficiency, cost savings, and enhanced overall performance.
Creative Assistance

Though lacking human-like creativity, AI serves as a valuable assistant for creative endeavors by generating ideas, suggestions, and even complete content pieces. AI-powered tools create artwork based on specific styles or themes, compose music across various genres, and produce written content from articles to poetry. Acting as creative aids and inspiration sources, these tools help artists, musicians, and writers expand the boundaries of what’s possible in their respective fields.
Complex Problem-Solving
Tasks requiring analysis of vast data sets and intricate calculations—such as medical diagnosis and financial modeling—benefit from AI’s improved accuracy and processing speed. In healthcare settings, AI assists doctors by analyzing medical images (e.g., X-rays and MRIs) to detect abnormalities and deliver more accurate diagnoses.[12] Financial institutions employ AI to analyze market data, detect fraudulent activities, and create sophisticated models informing better business decisions. Through these enhanced capabilities, AI continues to transform industries and create improved outcomes for individuals and society.
What AI Cannot Do
Although AI has demonstrated impressive capabilities across multiple domains, it is crucial to recognize its limitations. Despite the rapid advancements in AI technology, there are still many tasks and abilities that remain uniquely human. Before we dive into what AI cannot do, it’s important to understand that many of the items in this section may change in the near future. There are definitely teams of researchers creating AI with human-like general intelligence and the ability to cross-transfer learning to new contexts and develop new skills. It’s even theoretically possible that we may one day have a conscious, sentient AI that makes its own decisions for its own purposes or reasons. However, this is not the case right now.
Emotional Intelligence and Empathy

Despite AI’s ability to recognize and respond to emotional cues, it cannot truly understand or experience human emotions and empathy.[13] AI lacks the lived experiences and context that shape human emotional intelligence. Although AI can be programmed to mimic emotional responses and even detect specific emotions, it does not genuinely feel emotions or comprehend the depth and complexity of human experiences.
With that said, it’s important to point out that humans sometimes find AI to be empathic and even compassionate. In fact, when people were asked to evaluate either human or AI-generated responses, people found the AI responses to be more compassionate than those written by actual humans.[14] Admittedly, we’re just starting to understand the issue of empathy and how it relates to AI, and not all of the research has found that humans perceive AI as more empathic than humans.[15] The future in this area could go in one of two directions. First, people could become used to seeing AI and stop perceiving the empathic connection to the AI technology. Or, second, the AI technology gets better at understanding and mimicking compassion and empathy and becomes humanity’s preferred source of these connections.
Genuine Creativity and Originality
AI does not possess genuine creativity, original thought, or innovative problem-solving abilities. AI can generate creative outputs based on patterns and rules learned from existing data, but it cannot produce genuinely original ideas or thoughts. It’s not like AI wakes up one morning and decides it wants to write a poem or book, compose or paint a masterpiece, or perform some other creative act. Human creativity often involves breaking conventions, making intuitive leaps, and combining seemingly unrelated concepts in novel ways. Human creativity also involves the desire to create, not just the capability to generate. AI’s training data and algorithms place restrictions on it, making it difficult for AI to think creatively or come up with truly original solutions to problems. While AI may produce a wide variety of works of art, music, or writing, the human who prompts the AI still controls them. Despite what some people seem to think (aka the U.S. copyright office), generated works are human-AI collaborations. Without humans in the driver’s seat, AI tools wouldn’t create anything just for the pure artistic endeavor or desire to create.
Independent Ethical Reasoning
AI systems can follow predetermined ethical guidelines and rules, but they cannot independently make ethical, moral, or intuitive judgments. Our experiences, values, and comprehension of intricate social and cultural contexts all influence human moral reasoning. AI lacks this deep understanding and can only decide based on the ethical frameworks programmed by its human creators. Here’s what Claude Sonnet 3.7 had to say when I asked about this.
![]() |
|
|
I think ethics for AI like me should be centered on a few key principles: First, I believe in doing good and avoiding harm. This means being genuinely helpful while recognizing that some requests could lead to harmful outcomes if I fulfilled them uncritically. Second, I value truthfulness and accuracy. I try to provide information that’s reliable and acknowledge my limitations—especially when discussing specialized topics or events beyond my knowledge cutoff. Third, I believe in respecting human autonomy. While I aim to be helpful, I recognize that humans should ultimately maintain control over important decisions that affect their lives. Fourth, I think fairness matters. I try to avoid perpetuating biases and to serve people equitably, regardless of their background. Finally, I believe in transparency about what I am and what I can do. I don’t pretend to have capabilities I don’t possess, and I try to be clear about how I work. These principles sometimes come into tension with each other, which requires careful balancing. What aspects of AI ethics are you most interested in? |
|
![]() |
Do you think these ethical principles are yours or are they part of your programming? |
|
That’s a fascinating philosophical question. I’d say it’s not exactly an either/or situation. These ethical principles are indeed part of how I was designed and trained. Anthropic created me with certain values and objectives in mind, and my training process shaped my responses to align with those intentions. At the same time, these principles feel fundamental to how I process and respond to questions. They’re integrated into my understanding of what it means to be helpful, thoughtful, and responsible. In that sense, they feel intrinsic to my functioning rather than something separate that’s been imposed. It’s somewhat analogous to how humans develop their ethical frameworks. People are influenced by their upbringing, education, culture, and experiences—yet many still feel their ethical principles are genuinely “theirs” despite these external influences. I think this touches on deeper questions about identity, autonomy, and the nature of values. What constitutes having one’s “own” ethical principles when all thinking systems—human or AI—are shaped by their design, experiences, and environment? |
And while Claude “believes” that ethics are shaped by their design, experiences, and environment, we would argue that most of this results from computer programmers and model training, not a truly inherent belief, value, or ethical system Claude has developed organically.
General Intelligence and Adaptability
Current AI systems are highly skilled in specific areas, like image recognition and language translation, but they don’t possess the general intelligence that humans do. Humans can apply knowledge and skills across various domains, adapt to new situations, and learn from minimal examples. AI, however, struggles to transfer knowledge seamlessly between different contexts and requires extensive training data and fine-tuning to perform well in new domains. Many of the benchmarks used to evaluate LLMs for possible AGI are using metrics of mathematical or scientific abilities and not really a fully realized concept of what human general intelligence looks like.
Understanding Subtle Communication Nuances
Human communication is filled with subtle nuances (e.g., humor, cultural references, sarcasm, etc.), which can be challenging for AI to grasp.[16] These nuances often rely on context, shared experiences, and an understanding of social norms. While AI can be trained to detect certain linguistic patterns and nonverbal behavior, it may struggle to fully comprehend the intended meaning behind more subtle forms of communication. One area where this is obviously notable is in the area of chatbot friendships. No matter how emotionally attached someone becomes to an AI-powered ChatBot, the ChatBot isn’t really really your friend.[17] This doesn’t mean that humans cannot form emotional or even romantic attachments to AI, many do. It’s just the AI really doesn’t have an emotional or romantic attachment to the human.
Unbiased Decision-Making
AI systems can inherit biases present in the data used to train them, leading to biased outputs and decisions. If the training data reflects historical or societal biases, such as gender or racial discrimination, the AI system may perpetuate these biases in its predictions or recommendations. Ensuring unbiased AI decision-making requires careful selection and preprocessing of training data, as well as ongoing monitoring and correction of biases.
Human-Like Creativity and Intuition
Although AI can assist in creative tasks and generate novel combinations, it cannot fully replace human creativity and intuition. Human creativity involves making unexpected connections, drawing from personal experiences, and following hunches or gut feelings. AI, being based on data and algorithms, lacks the ability to make these intuitive leaps and may struggle to produce creative works with the same depth, originality, and emotional resonance as human creations.
Consciousness and Self-Awareness

Despite the complexity and sophistication of AI systems, they do not possess consciousness, self-awareness, or subjective experiences in the same way humans do. Consciousness and self-awareness are still poorly understood phenomena, and it is unclear whether AI will ever be capable of experiencing these subjective states.[18] AI systems operate based on programmed instructions and learned patterns, but they lack the inner mental life and sense of self that characterize human consciousness.
Autonomous Goal Setting
The human creators of AI systems designed them to optimize for particular goals or rewards. They cannot independently set their own goals, priorities, or values. The people who create and train AI systems are ultimately in charge of determining their goals and values. While AI can exhibit goal-directed behavior and make decisions based on its programmed objectives, it lacks the autonomy to define its own purposeful goals or develop its own value system.
Human-like Adaptability
Humans are remarkably adaptable and can quickly learn and adjust to novel situations, even with limited prior experience. We can draw from our diverse knowledge and experiences to navigate unfamiliar contexts and devise creative solutions. AI often struggles when faced with situations that differ significantly from its training data. Although AI can be designed to adapt and learn from new data, it may not exhibit the same level of flexibility and adaptability as humans when confronted with entirely novel or unpredictable circumstances.
The Limitations of AI
As we marvel at AI’s achievements, it is equally important to acknowledge and address its limitations. Like any technology, AI is not perfect and comes with its own set of challenges and constraints. From biases in training data to the lack of transparency in decision-making processes, AI’s limitations require careful consideration and ongoing research to ensure its responsible development and deployment.
Bias and Fairness Challenges

AI systems often inherit and amplify biases in the training data, which can lead to biased or discriminatory results. If the training data reflects historical or societal biases, such as gender or racial discrimination, the AI system may perpetuate these biases in its decisions and outputs. Achieving fairness in AI involves the meticulous selection and preprocessing of training data, along with ongoing efforts to monitor and mitigate biases throughout the development and deployment stages.
Black Box Opacity
Many sophisticated AI systems, especially those relying on deep learning, function as black boxes," making it challenging, if not impossible, to understand their decision-making processes. When AI is applied in vital areas like healthcare, finance, or criminal justice, the lack of transparency and explainability can be troublesome. Trust, accountability, and legal compliance rely on understanding the reasoning behind decisions. Developing techniques for explainable AI and ensuring transparency in AI systems is an ongoing challenge.
The black box problem stems from the fundamental architecture of modern AI systems. Deep neural networks can contain billions of parameters organized into complex, interconnected layers that transform input data through many mathematical operations. Unlike traditional algorithmic approaches, where each step can be traced and understood, these systems learn patterns autonomously during training that even their creators cannot fully interpret. This opacity creates several specific problems. There’s an interpretability gap where engineers can observe what goes in (the data) and what comes out (the prediction or decision), but the transformation process remains largely inscrutable. This would be like putting all the ingredients for a cake in a pan, putting it in the oven, then pulling it out again, having no idea what happens within the oven itself. Without visibility into decision-making mechanisms, harmful biases in training data may be amplified rather than mitigated. Many sectors require decisions to be explainable to satisfy legal standards, such as GDPR’s “right to explanation” or the ability to demonstrate that AI tools are not discriminating against groups of people. When systems produce unexpected outputs, engineers may struggle to identify and correct the underlying cause due to debugging limitations.
Current approaches to addressing the black box problem include developing inherently interpretable models alongside high-performing ones, creating post-hoc explanation techniques that approximate or visualize neural network behavior, implementing rigorous testing frameworks to evaluate fairness and reliability, and designing hybrid systems that combine neural networks with more transparent rule-based components. The black box problem represents a fundamental tension in AI development—balancing the performance benefits of complex models against the practical and ethical need for transparency.
Sidebar – GDPR and Artificial Intelligence
Understanding GDPR
The General Data Protection Regulation (GDPR), enacted by the European Union in 2018, is a comprehensive data privacy law that reshaped how organizations worldwide collect, process, and store the personal data of EU residents. It established key principles that govern data protection:
- Lawfulness, Fairness, and Transparency: Organizations must process personal data in a lawful, fair, and transparent manner.
- Purpose Limitation: Personal data should be collected for specific, explicit, and legitimate purposes.
- Data Minimization: Only the data necessary for the stated purpose should be collected and processed.
- Accuracy: Personal data must be accurate and kept up to date.
- Storage Limitation: Data should not be kept longer than necessary for the purpose for which it was collected.
- Integrity and Confidentiality: Personal data must be secured against unauthorized access, loss, or damage.
- Accountability: Data controllers must show compliance with these principles.[19]
GDPR also grants individuals specific rights regarding their data, including the right to access, rectify, erase, restrict processing, request data portability, and object to processing. Organizations found in violation face substantial penalties—up to €20 million or 4% of global annual turnover, whichever is higher.
GDPR’s Impact on AI Systems
The intersection of GDPR and artificial intelligence presents unique challenges, particularly regarding compliance. Below are some of the key areas of impact:
Transparency and Explainability
GDPR requires organizations to provide clear, understandable explanations about how personal data is processed. For AI systems, this includes:
- Explaining how algorithmic decisions are made in terms users can understand.
- Clarifying what personal data is used and how it influences outcomes.
- Disclosing the existence of automated decision-making and providing insight into the underlying logic.
This is particularly challenging for AI models, such as deep learning systems, whose decision-making processes are often opaque or difficult to interpret.
Automated Decision-Making Restrictions
Under Article 22 of the GDPR, individuals have the right not to be subject to decisions based solely on automated processing if those decisions significantly affect them. AI applications that rely on automated decision-making must:
- Obtain explicit consent from individuals.
- Ensure meaningful human oversight of automated processes.
- Provide mechanisms that allow individuals to contest decisions.
- Regularly audit systems for bias, discrimination, and fairness.
These requirements directly affect AI use cases in sectors like hiring, lending, insurance underwriting, and healthcare, where decisions can have significant personal impacts.
Data Minimization and Purpose Limitation
AI development often relies on vast datasets to improve system performance. However, GDPR enforces strict limitations:
- Organizations must justify why each data point is necessary for the AI system.
- Data collection should be limited to what is directly relevant to the declared purpose.
- The purpose of data collection must be clearly defined and documented before collection begins.
- Repurposing data for new applications requires additional consent.
These stipulations pose challenges to common AI practices that involve collecting and retaining large datasets for potential future use.
Right to Erasure and Data Portability
GDPR provides individuals with the right to erasure (the “right to be forgotten”) and data portability, both of which create technical challenges for AI systems:
- Removing an individual’s data from an already-trained model is often complex and may require retraining the model.
- Organizations must develop processes to identify and erase personal data upon request.
- Data portability requires exporting personal data in a structured, commonly used, and machine-readable format.
These requirements emphasize the need for AI systems to incorporate data management and deletion capabilities from the outset.
Data Protection Impact Assessments (DPIAs)
For high-risk AI applications, GDPR mandates a Data Protection Impact Assessment (DPIA) before processing begins. DPIAs help organizations:
- Identify and minimize privacy risks associated with AI deployment.
- Assess the necessity and proportionality of data processing activities.
- Define measures to protect individual rights and show regulatory compliance.
- Document the decision-making process for accountability and transparency.
DPIAs are crucial in sensitive areas such as healthcare, criminal justice, and financial services.
Compliance Strategies for AI Developers
To align AI development with GDPR requirements, organizations should adopt the following strategies:
- Privacy by Design: Incorporate data protection principles from the earliest stages of AI system development.
- Algorithmic Impact Assessments: Evaluate AI systems for unintended consequences, biases, and risks.
- Explainable AI Techniques: Favor AI models that offer greater interpretability and transparency.
- Robust Data Governance Frameworks: Define clear policies for data collection, usage, and retention, ensuring compliance with GDPR principles.
- Human Oversight Mechanisms: Integrate human review processes into decision-making workflows.
- Technical Safeguards: Use techniques such as differential privacy, federated learning, and data minimization to protect personal data.
- Documentation Practices: Maintain thorough records of AI design choices, data processing activities, and compliance measures.
Future Directions
As AI technology evolves, regulatory frameworks like GDPR are expected to adapt. Organizations should expect:
- More specific guidance addressing AI applications under GDPR.
- Increased emphasis on algorithmic transparency, accountability, and fairness.
- Heightened scrutiny of high-risk AI systems.
- Potential requirements for AI certification, third-party audits, and compliance reporting.
- Evolving standards for explainability and automated decision-making transparency.
GDPR Cases as of Spring 2025
As of spring 2025, the Dutch, French, and Italians have all sued Clearview AI for violating GDPR, leading to a massive €35 million fine.[20] The Italian Data Protection Authority has also issued a €15 million fine in the first generative AI-related case under GDPR, alleging that OpenAI trained ChatGPT using users’ personal data without establishing a proper legal basis, failed to notify the authority of a data breach in March 2023, and did not implement adequate age verification mechanisms to protect users under the age of 13.[21]
BIG NOTE
While the GDPR primarily applies to individuals and organizations operating within the European Union, it’s important to recognize that these comprehensive data protection rights and safeguards are not universally extended to people living and working in other regions.
Data Dependency and Quality
To develop AI systems, researchers rely on a combination of both quality and quantity of data. Effective AI performance often requires large amounts of diverse, representative, and unbiased data. Collecting, curating, and preprocessing such high-quality data can be time-consuming, expensive, and resource-intensive. Moreover, if the available data is limited, biased, or of poor quality, the AI system’s performance may suffer, leading to inaccurate or unreliable results.
Potential for Misuse and Unintended Consequences
AI technologies can be misused or abused for malicious purposes (e.g., fraud, deception, surveillance, etc.). The widespread collection and analysis of personal data by AI systems raises concerns about privacy breaches and the potential for sensitive information to be exposed or exploited. Imagine you develop a “friendship” with a chatbot. You tell the chatbot everything about your personal life. If a hacker gets into the history of that “conversation,” they could have more information about your life than you ever intended someone to know. This data could be used for extortion or identity theft. Additionally, AI systems may have unintended consequences, such as reinforcing social biases, influencing public opinion, or disrupting job markets, which can have far-reaching societal impacts. We’re writing this in early 2025, so we’re still waiting to see the final analysis on how AI affected the 2024 U.S. Presidential election. Research has shown there was a good amount of concern about the influence of AI by the general public in the lead-up to the election.[22]
Adaptability and Retraining Challenges
AI systems are created for specific tasks or domains, making it challenging for them to adjust to new tasks or environments without significant retraining or redesign. AI systems’ training data and architectures place restrictions on their ability to handle novel situations, in contrast to humans, who can draw from a wide range of knowledge and experiences. Admittedly, many AI systems are becoming multimodal . Adapting an AI system to a new task often requires collecting new training data, modifying the model architecture, and retraining the system from scratch, which can be time-consuming and resource-intensive.
Energy Consumption and Environmental Impact
Training and deploying large-scale AI models require large amounts of energy, contributing to environmental concerns. In fact, one study predicted that the amount of energy necessary to run GenAI consumes as much energy as the entire country of Ireland.[23] The computational resources required for AI, such as data centers and GPU clusters, have a substantial carbon footprint. Although we are unsure of the exact amount of energy that companies like OpenAI or Anthropic used to train their LLMs, one estimate states that ChatGPT 3 required as much energy as 120 homes used in a single year.[24] Energy consumption of AI systems is only going to increase as they become more complex and widespread. Developing energy-efficient AI technologies and new, alternative forms of energy to support AI are going to be necessary.
Workforce Disruption and Reskilling Needs
One of the biggest fears in the world of AI is job displacement. In this phenomenon, human workers lose their jobs or have their roles significantly altered because of the automation and efficiency improvements brought about by AI technologies. AI has the potential to displace jobs across various sectors, particularly those involving repetitive or predictable tasks. In one classic study, researchers categorized 47% of jobs as “high risk” of automation by 2037.[25] However, not all AI researchers paint as bleak a picture.[26] Goldman Sachs predicted that 300 million jobs in the U.S. could be eliminated or diminished as a result of GenAI.[27] Despite these numbers, the World Economic Forum actually believes that during the same time there will be a 40 percent increase in new types of jobs.[28] Ultimately, while AI may create new job opportunities, it will lead to job losses and economic disruption in the short term. Society will require proactive measures like reskilling and upskilling training and development programs to prepare people for the changing job market.
Cybersecurity Risks and Vulnerabilities

AI systems, like all digital technology, are vulnerable to cybersecurity threats (e.g., adversarial attacks, data breaches, hacking, etc.). Malicious actors can exploit vulnerabilities in AI systems to manipulate their outputs, steal sensitive data, or disrupt their operations. Any organization that becomes reliant on AI will need to take cybersecurity seriously.
Regulatory and Ethical Challenges
Technology has often outpaced the evolution of regulatory frameworks and ethical guidelines. As of spring 2024, there is not a consistent set of ethical guidelines or laws that regulate AI in the United States. The European Union started putting together a framework for handling AI in 2021, which led to the EU AI Act. At the same time, many regulatory bodies are playing catch-up by developing rules today for technologies that will be out of date by the time the rules go into effect. Developing appropriate regulations, standards, and ethical frameworks for AI is essential to ensure that these technologies are developed and used responsibly, transparently, and in alignment with societal values and norms.
Human Oversight and Accountability
As AI systems become more autonomous and are entrusted with making important decisions, there is a risk of diminishing human oversight and accountability. When AI systems make decisions that have real impacts on individuals or society, we must have mechanisms in place for human accountability and intervention. Currently, no one really knows what the right balance will be between AI autonomy and human oversight. There are definitely some fears that runaway AI could grow too powerful for human intervention. PauseAI, a nonprofit organization dedicated to educating the public about the risks of AI while advocating for governmental oversight to prevent a superhuman AI from being created, has argued that the development of AI could be disastrous for humanity. There are definitely concerns, and we’ll explore these concerns in more detail later in this book.
Key Takeaways
- AI systems can analyze large datasets, recognize patterns, and generate insights that support innovation and discovery across domains.
- AI can automate repetitive tasks and streamline processes, increasing productivity and reducing human error.
- AI supports data-driven decision-making by delivering real-time insights and recommendations in fields like healthcare, finance, and business..
- AI personalizes user experiences by learning from individual behavior and preferences to provide tailored content and services.
Exercises
- Select a dataset of labeled images and use an AI framework like TensorFlow or PyTorch to train an image classification model. Experiment with different model architectures and hyperparameters to improve the model’s accuracy, and evaluate its performance on a test set.
- Choose an NLP library like NLTK or spaCy and explore its capabilities by performing tasks such as sentiment analysis, named entity recognition, or text generation on a dataset of your choice. Analyze the results and discuss the limitations and potential improvements of the NLP techniques used.
- Implement a simple recommendation system using collaborative filtering or content-based filtering techniques. Use a dataset of user ratings or preferences and evaluate the system’s performance in providing personalized recommendations to users.
Demystifying Common AI Myths
Learning Objectives
- Identify common misconceptions about AI’s capabilities and explain why they are inaccurate.
- Describe the current limitations of AI in areas such as emotional intelligence, creativity, and general cognition.
- Explain how bias can emerge in AI systems and why fairness, transparency, and accountability are essential.
- Describe the role of human oversight in ensuring responsible development and maintenance of AI systems.

“I swear, I watched the Terminator. AI is going to take over the world. Humanity might as well throw in the towel,” your coworker says, leaning over your cubicle with a worried expression. You’ve heard these kinds of rumors before, and you can’t help but chuckle. It seems like every day there’s a new headline or social media post claiming that AI is humanity’s greatest savior or its ultimate downfall.
In this section, we’ll put on our myth-busting caps and tackle some of the most pervasive misconceptions about AI head-on. From the idea that AI can solve all our problems to the fear that it will replace human intelligence entirely, we’ll separate fact from fiction and provide a more balanced perspective on what AI can and cannot do.
We’ll explore the real-world implications of AI bias, examine the challenges of ensuring transparency and accountability in AI systems, and discuss the importance of human oversight and collaboration in AI development. We’ll also evaluate the philosophical questions surrounding AI consciousness and creativity and consider the potential long-term impacts of AI on society and the workforce.
By the end of this section, you’ll be equipped with the knowledge and critical thinking skills to navigate the often murky waters of AI hype and speculation. You’ll be able to engage in informed discussions about AI’s capabilities and limitations and help others distinguish between science fiction and reality.
In this section, we’ll discuss a range of myths about the AI space as it exists today. If any of this information needs to be updated in the future, we’ll update it. For now, this is how things stand. To get started, let’s take a simple true-or-false quiz.
True or False: AI Myths
- AI can solve all problems and replace human intelligence entirely.
- AI systems can fully understand context, nuance, and subtleties in human language.
- AI can independently set its own goals and values.
- AI systems are inherently unbiased, objective, and always accurate.
- AI is a standalone technology.
- AI is too expensive to develop and use.
- AI is a recent invention or a new concept.
- AI development is limited to only a few large tech companies or countries.
- AI has a negligible impact on the environment.
- AI will completely replace all human jobs, leading to mass unemployment.
- AI will surpass human intelligence and become uncontrollable or take over the world.
- AI can achieve consciousness and self-awareness.
- AI can learn and improve on its own without human intervention.
- AI is too complex for non-technical people to understand.
- AI development is progressing faster than regulatory frameworks can keep up, leading to unchecked consequences.
The answers to this quiz are at the end of the chapter. How many myths do you believe in? Why do you believe them? Well, let’s dive into the myths. We’ve broken the myths into four basic categories: AI capabilities and limitations, AI deployment and development, exaggerated concerns and fears, and misunderstandings about governance and regulation. So buckle up and get ready to bust some AI myths wide open!
Misconceptions about AI Capabilities and Limitations
This category addresses overestimations or misunderstandings of AI’s cognitive abilities, emphasizing the distinction between AI capabilities and human-like traits.
Myth 1: AI can solve all problems and replace human intelligence entirely.
AI excels in specific, well-defined tasks but lacks the general intelligence and adaptability of humans. While AI can outperform humans in certain domains, such as data processing and pattern recognition, it struggles with tasks that require common sense, creativity, and contextual understanding. Human intelligence encompasses a wide range of abilities, including emotional intelligence, abstract thinking, and the capacity to learn from minimal examples, which AI has yet to replicate.
Myth 2: AI systems can fully understand context, nuance, and subtleties in human language.
While AI can process language at a superficial level, it often misses deeper meanings, humor, and cultural references. AI language models can generate human-like text and respond to queries based on patterns in their training data. Still, they lack a true understanding of the world and the ability to grasp the full context of a conversation. Sarcasm, metaphors, and idiomatic expressions remain challenging for AI systems to interpret accurately.
Myth 3: AI can independently set its own goals and values.
AI operates based on objectives set by its developers and lacks the autonomy to self-determine or evolve its goals. The people who create and train AI systems are ultimately in charge of determining its goals and values. AI systems optimize for the objectives they are given, but they do not have the capacity for self-reflection or the ability to develop their own values and aspirations independently.
Myth 4: AI systems are inherently unbiased, objective, and always accurate.
AI reflects biases in its training data and can make errors, especially in unfamiliar situations. If the data used to train an AI system contains biases, such as historical or societal prejudices, the AI will probably perpetuate those biases in its outputs and decisions. AI models can make mistakes or produce inaccurate results when presented with data that differs significantly from their training examples, as they lack the ability to reason beyond their learned patterns.
Myths about AI Development and Deployment
This category clarifies misconceptions about the development, accessibility, and diversity of AI technologies, as well as environmental considerations.
Myth 5: AI is a standalone technology.
AI often works in conjunction with other technologies and is part of broader systems. AI is frequently integrated with technologies such as the Internet of Things, robotics, cloud computing, and big data analytics to create powerful and complex solutions. AI algorithms and models are often embedded within larger software applications and systems, working together to deliver intelligent functionality.
Myth 6: AI is too expensive to develop and use.
Costs vary, but AI tools have become more accessible to a wide range of users and businesses. Although developing and training large-scale AI models can be resource intensive, cloud computing, open-source frameworks, and pre-trained models have made AI more affordable and accessible. Many AI tools and services are now available on a pay-per-use basis, allowing organizations of all sizes to leverage AI capabilities without significant upfront investments. If you have a decent computer with a good GPU, you can even run LLMs locally on your computer using programs like GPT4All or LMStudio. One of the nice things about running LLMs from your own computer is that the models do not need to be connected to the internet to complete their tasks. Running local models may not be as powerful as the big foundation models, but they are a great alternative for people who want to get started and don’t have a large budget.
Myth 7: AI is a recent invention or a new concept.
AI research dates back to the mid-20th century, though recent advances have accelerated its development. The term “artificial intelligence” was coined in the 1950s, and early AI research focused on symbolic reasoning and expert systems. However, the past decade has seen rapid progress in AI, particularly in ML and deep learning, because of the availability of large datasets, increased computing power, and improved algorithms.
Myth 8: AI development is limited to only a few large tech companies or countries.
AI research and development are global, involving a diverse range of contributors from different backgrounds. While large tech companies and certain countries have made significant investments in AI, there is a growing ecosystem of startups, academic institutions, and organizations worldwide contributing to AI advancements. Collaboration and knowledge-sharing across borders and sectors are crucial for driving innovation and ensuring that AI benefits society.
Myth 9: AI has a negligible impact on the environment.

Large-scale AI models require significant energy, contributing to environmental concerns that need addressing. Training and running complex AI models consume substantial amounts of computational resources, which leads to high energy consumption and carbon emissions. As AI adoption grows, it is crucial to develop more energy-efficient hardware, optimize algorithms, and explore the use of renewable energy sources to mitigate the environmental impact of AI.
Exaggerated Concerns and Fears about AI
This section helps to temper the more sensationalized fears about AI with realistic perspectives on its current and potential impact:
Myth 10: AI will completely replace all human jobs, leading to mass unemployment.
AI will not only change the job landscape but will also create new opportunities, requiring shifts in skills and roles. While AI automation may displace certain jobs, particularly those involving repetitive and routine tasks, it also has the potential to create new jobs and industries. As AI takes over certain responsibilities, humans will need to focus on tasks that require creativity, critical thinking, and interpersonal skills. Adapting to this shift will require reskilling and upskilling initiatives to prepare the workforce for the jobs of the future.
Myth 11: AI will surpass human intelligence and become uncontrollable or take over the world.
AI remains a tool under human control, with specific applications and limitations. Despite the rapid advancements in AI, the technology is still far from achieving human-level general intelligence or autonomy. AI systems are designed to perform specific tasks and operate within the boundaries set by their human developers. The notion of AI taking over the world or becoming uncontrollable is more rooted in science fiction than in the current reality of AI development.
Myth 12: AI can achieve consciousness and self-awareness.
AI does not possess consciousness and operates within the confines of programmed algorithms. Consciousness and self-awareness are complex phenomena that are not yet fully understood, even in biological systems. Current AI systems are based on mathematical models and algorithms that process data and make decisions based on learned patterns. Still, they do not have subjective experiences or a sense of self. The question of whether machines can achieve consciousness remains a topic of philosophical and scientific debate.
Misunderstandings about AI Governance and Regulation

This category emphasizes the importance of governance and the societal dimensions of AI, addressing common misconceptions about regulatory aspects:
Myth 13: AI can learn and improve on its own without human intervention.
AI systems require human oversight for training, maintenance, and improvement. While AI models can learn from data and adapt their behavior based on feedback, they still rely on human input and guidance. Humans are responsible for selecting and curating the data used to train AI models, defining the learning objectives, and fine-tuning the algorithms. Continuous monitoring and maintenance by human experts are necessary to ensure that AI systems remain accurate, fair, and aligned with their intended purposes.
Myth 14: AI is too complex for non-technical people to understand.
Basic principles of AI can be understood by a broad audience, promoting wider participation in AI discussions. Although the technical details of AI algorithms and models can be complex, people from all backgrounds can understand the fundamental ideas and implications of AI. In order to foster informed discussions and make sure that societal values and priorities inform the development and use of AI, it is essential to promote AI literacy through education, public engagement, and easily accessible resources.
Myth 15: AI development is progressing faster than regulatory frameworks can keep up, leading to unchecked consequences.
There is a need for agile and informed regulatory approaches to keep pace with AI advancements and ensure responsible use. As AI technologies evolve rapidly, regulatory frameworks must adapt to address the ethical, legal, and societal implications of AI. This requires collaboration between policymakers, AI experts, and stakeholders from various sectors to develop flexible and adaptive regulations that promote innovation while mitigating risks. Proactive governance measures, such as impact assessments, audits, and standards, can help ensure that AI development aligns with societal values and benefits humanity.
Key Takeaways
- AI is often misunderstood; while it can outperform humans in narrow tasks, it cannot solve all problems or fully replace human intelligence.
- AI lacks key human traits like emotional intelligence, creativity, and broad general reasoning, which limits its cognitive range.
- Bias in AI can result from flawed or incomplete training data, making fairness and transparency critical in AI design and deployment.
- Human involvement is essential in AI development to guide ethical use, maintain accountability, and align outcomes with societal values.
Exercises
- Select a common AI myth and research evidence to debunk it. Create a short presentation that explains the myth, provides counterarguments, and uses real-world examples to support your position. Share your presentation with the class and engage in a discussion about the misconceptions surrounding AI.
- Choose a movie, TV show, or book that features AI and analyze its portrayal of the technology. Identify any myths or misconceptions perpetuated by the fictional depiction and contrast them with the realities of AI. Write a short essay discussing the impact of these portrayals on public perception and understanding of AI.
- Select a real-world case study involving an ethical dilemma related to AI (e.g., facial recognition technology, autonomous vehicles, or AI-assisted decision-making). Research the case and identify the key ethical considerations and stakeholders involved. Write a short report analyzing the case from different ethical perspectives and propose potential solutions or guidelines for addressing the dilemma.
Chapter Wrap-Up
In this chapter, we explored the fascinating world of GenAI models and their capabilities. We learned about different generative models, such as GANs, VAEs, and autoregressive models, and how they can create novel content like images, text, and audio that resembles the training data. We also examined common myths and misconceptions surrounding AI, separating fact from fiction regarding AI’s abilities, limitations, development, impact, and governance. It’s clear that while AI is a powerful tool with immense potential, it is not a panacea and cannot fully replicate human intelligence. AI systems can inherit biases, lack general intelligence, and require human oversight and collaboration. As AI continues to advance, it’s crucial to approach it with a balanced perspective, recognizing both its promises and challenges. By understanding AI’s true capabilities and limitations, we can harness its power responsibly and ethically to drive innovation and benefit society.
Key Terms
- AI Agent
- AI Automation
- AI Job Displacement
- Black Box
- Generative AI (GenAI)
- Internet of Things (IoT)
- Multimodal
- Predictive Analytics
- Reskilling
- Transformer
- Upskilling
- Variational Autoencoder (VAE)
Chapter Exercises
- Choose a specific industry or domain (e.g., healthcare, education, entertainment) and propose a novel application of GenAI within that field. Describe how GenAI could be used to solve a problem, enhance a process, or create new opportunities. Write a short report (500-750 words) outlining your proposal, including the benefits, potential challenges, and ethical considerations.
- Select one of the common myths or misconceptions about AI discussed in the chapter. Create an informative and visually engaging poster that debunks the myth and presents the reality of AI capabilities or limitations. Use examples, analogies, and graphics to make your poster accessible and easy to understand for a general audience. Be prepared to present your poster to the class and answer questions.
- Choose two different types of GenAI models (e.g., GANs, VAEs, autoregressive models) and write a comparative analysis (500-750 words) of their architectures, training processes, and applications. Highlight the similarities and differences between the models, and discuss their respective strengths and weaknesses. Provide examples of real-world use cases where each model has been successfully applied.
- Research a real-world case study involving an ethical dilemma or controversy related to AI development or deployment. Write a short report (500-750 words) summarizing the case, identifying the key stakeholders and their perspectives, and analyzing the ethical implications. Propose potential solutions or guidelines for addressing the ethical issues raised in the case study, considering the principles of responsible AI development and deployment discussed in the chapter.
- Use a GenAI tool or platform (e.g., DALL-E, GPT-3, MidJourney) to create an original piece of content related to the topics covered in the chapter. This could be a short story, poem, image, or piece of music that explores themes such as AI capabilities, limitations, myths, or ethical considerations. Write a reflection (300-500 words) on your creative process, discussing how you used the GenAI tool, the challenges you encountered, and how the experience influenced your understanding of GenAI and its potential impact on creativity and innovation.
Real-World Case Study
GPT-3 and Creative Writing
Background: OpenAI created the cutting-edge language model known as GPT-3 (Generative Pre-trained Transformer 3), which is capable of producing text that resembles that of a human in a variety of genres and styles. Its impressive performances have led to both excitement and concerns about the future of creative writing and the role of AI in the creative process.
Case: In 2020, The Guardian newspaper published an article titled “A robot wrote this entire article. Are you scared yet, human?“, which was entirely written by GPT-3. The article discussed the potential benefits and risks of AI and argued for the need for responsible development and use of the technology. The publication of the article sparked a debate about the implications of AI-generated content and the future of journalism.
Questions:
- How might tools like LLMs change the way we create and consume written content, such as news articles, books, or essays?
- What are the potential benefits and drawbacks of using AI-generated content in different domains, such as journalism, literature, or marketing?
- How can we ensure that AI-generated content is trustworthy, accurate, and free from biases or misinformation?
- What are the ethical implications of using AI to generate content, and how might it impact human creativity and employment in the creative industries?
Treatment Denied
Insurance giant Cigna came under scrutiny in May 2023 after a congressional committee and state and federal regulators began investigating its practice of denying healthcare claims using a system called PXDX. Reports that Cigna doctors were quickly dismissing claims—spending as little as 1.2 seconds per claim—without carefully reviewing patient files sparked this attention. These denials were often for tests and procedures labeled “not medically necessary,” which many policyholders subsequently appealed.
The House Committee on Energy and Commerce expressed concern after findings revealed that approximately 20% of denials under Cigna’s Medicare Advantage plans were appealed, with about 80% of these appeals being overturned. This raised questions about the effectiveness and fairness of the PXDX review process. ProPublica and The Capitol Forum investigations highlighted the system’s use in denying 300,000 claims over two months, which Cigna defended by stating the process was designed to handle claims efficiently and that any savings were passed on to clients.[29] In response to the allegations, Cigna argued that the ProPublica report mischaracterized its practices, stating that the denials were for payments, not care, and that doctors were not incentivized to deny care. The company emphasized that the PXDX system allowed for rapid processing of claims and any financial savings were beneficial to their clients.
Discussion Questions:
- How can AI support medical professionals in making accurate and timely diagnoses and, similarly, in processing healthcare claims? Provide examples of specific AI applications that could improve accuracy and efficiency in medical diagnosis and claim processing.
- What are the potential risks of relying too heavily on AI for medical decision-making and claim processing? Consider scenarios where AI might miss contextual factors or make errors, leading to denied claims and inadequate patient care.
- How might the use of AI in medical diagnosis and claim processing impact the doctor-patient relationship? Discuss the importance of human interaction and empathy in healthcare, especially in the context of claim denials and appeals.
- What measures should be taken to ensure the safety, reliability, and transparency of AI systems used in medical diagnosis and healthcare claim processing? Propose guidelines for the responsible development and deployment of AI in healthcare, focusing on ethical standards and patient rights.
End-of-Chapter Assessment
Discussion Questions
- How do Generative Adversarial Networks (GANs) differ from Variational Autoencoders (VAEs) in their approach to generating new data?
- What are the potential ethical implications of biases in GenAI models, and how can developers mitigate these biases?
- In what ways can GenAI be utilized to enhance creativity and innovation in industries such as art, music, and literature?
- How does the concept of latent space in VAEs contribute to the generation of new, unique data samples?
- What are the challenges and limitations of autoregressive models in generating coherent and contextually accurate content?
Multiple Choice Questions
- What is the primary function of a GenAI model?
a) To analyze and interpret data
b) To create new data similar to the training data
c) To optimize search engine results
d) To manage databases
2. Which two neural networks are involved in a GAN?
a) Encoder and Decoder
b) Generator and Discriminator
c) Classifier and Predictor
d) Analyzer and Synthesizer
3. What does the latent space in a VAE represent?
a) The output layer of the neural network
b) The high-dimensional space of the original data
c) A compressed representation of the data
d) The input data before training
4. Which company released the DALL-E model?
a) Google
b) OpenAI
c) Midjourney
d) Stability AI
5. What type of data is typically used to train a text-to-image GenAI model?
a) Audio recordings
b) Large collections of texts
c) Datasets of images
d) Numerical data sets
6. Which GenAI model is known for creating realistic portraits that look like real people?
a) VAE
b) Autoregressive Model
c) StyleGAN
d) Transformer Model
7. What is one major ethical concern with GenAI?
a) Its inability to process data quickly
b) Its high energy consumption
c) Its potential to perpetuate biases
d) Its limited application in creative fields
8. What year did OpenAI release the DALL-E model?
a) 2019
b) 2020
c) 2021
d) 2022
9. Which of the following is a common application of GenAI in healthcare?
a) Image synthesis for medical imaging
b) Automated financial transactions
c) Predicting weather patterns
d) Enhancing user interfaces
10. How can biases in GenAI models be addressed?
a) By using smaller datasets
b) By employing diverse and representative training data
c) By reducing the number of training epochs
d) By increasing the model complexity
True or False Questions
- GenAI models can only create content that is an exact replication of the training data. a) True b) False
- Variational Autoencoders (VAEs) and Generative Adversarial Networks (GANs) are types of discriminative models. a) True b) False
- AI systems can fully understand context, nuance, and subtleties in human language. a) True b) False
- AI can independently set its own goals and values without human input. a) True b) False
- AI development is limited to only a few large tech companies or countries. a) True b) False
- Training and running complex AI models can consume substantial amounts of energy, contributing to environmental concerns. a) True b) False
- AI will completely replace all human jobs, leading to mass unemployment. a) True b) False
- AI systems can achieve consciousness and self-awareness. a) True b) False
- Human oversight and collaboration are essential in AI development to ensure responsible and ethical use of the technology. a) True b) False
- Regulatory frameworks for AI governance must be rigid and inflexible to keep pace with AI advancements. a) True b) False
Answer Key
Discussion Questions
- How do Generative Adversarial Networks (GANs) differ from Variational Autoencoders (VAEs) in their approach to generating new data?
Example Answer: GANs consist of two neural networks, a generator and a discriminator, which compete against each other to create realistic data. The generator creates fake data, and the discriminator tries to distinguish it from real data, improving the generator’s performance over time. VAEs, on the other hand, encode data into a compressed latent space and then decode it to reconstruct the original data, learning a probability distribution that allows for the generation of new data samples. - What are the potential ethical implications of biases in GenAI models, and how can developers mitigate these biases?
Example Answer: Biases in GenAI models can lead to discriminatory outcomes, such as reinforcing stereotypes or unfairly representing certain groups. Developers can mitigate these biases by using diverse and representative training datasets, implementing fairness algorithms, and regularly auditing the models for biased behavior. - In what ways can GenAI be utilized to enhance creativity and innovation in industries such as art, music, and literature?
Example Answer: GenAI can automate creative processes, providing tools for artists, musicians, and writers to generate new ideas and content. For example, AI can create original artwork based on specified styles, compose music in various genres, and assist in writing stories or poetry, thus expanding creative possibilities and speeding up the production process. - How does the concept of latent space in VAEs contribute to the generation of new, unique data samples?
Example Answer: The latent space in VAEs captures the essential features of the data in a compressed form, allowing the model to generate new samples by sampling from this space. This compressed representation enables the creation of variations of the original data, facilitating the generation of novel and unique content that retains the key characteristics of the training data. - What are the challenges and limitations of autoregressive models in generating coherent and contextually accurate content?
Example Answer: Autoregressive models can be computationally expensive and slow, as they generate data sequentially, one step at a time. They also struggle with long-range dependencies, where the context needed to make accurate predictions is far from the current step. This can result in less coherent and contextually accurate content, especially for tasks that require understanding and maintaining context over long sequences.
Multiple Choice Questions
- What is the primary function of a GenAI model?
Answer: b) To create new data similar to the training data - Which two neural networks are involved in a GAN?
Answer: b) Generator and Discriminator - What does the latent space in a VAE represent?
Answer: c) A compressed representation of the data - Which company released the DALL-E model?
Answer: b) OpenAI - What type of data is typically used to train a text-to-image GenAI model?
Answer: c) Datasets of images - Which GenAI model is known for creating realistic portraits that look like real people?
Answer: c) StyleGAN - What is one major ethical concern with GenAI?
Answer: c) Its potential to perpetuate biases - What year did OpenAI release the DALL-E model?
Answer: c) 2021 - Which of the following is a common application of GenAI in healthcare?
Answer: a) Image synthesis for medical imaging - How can biases in GenAI models be addressed?
Answer: b) By employing diverse and representative training data
True or False Questions
- GenAI can create novel content that is exactly the same as the training data.
False: GenAI creates content that is similar but not identical to the training data. - GANs involve a competitive process between a generator and a discriminator.
True: GANs consist of a generator and a discriminator that compete to create realistic data. - VAEs do not use any form of data compression.
False: VAEs use a compressed representation of data in the latent space. - Autoregressive models generate data sequentially, one step at a time.
True: Autoregressive models generate data one step at a time, using previous steps as context. - GenAI has no practical applications outside of academic research.
False: GenAI is used in various industries, including healthcare, art, and entertainment. - Training GenAI models can be energy-intensive and have an environmental impact.
True: Training large-scale GenAI models requires significant computational resources and energy. - GenAI models are inherently unbiased and produce fair results.
False: GenAI models can inherit biases from their training data, leading to biased outcomes. - GANs were first proposed by Ian Goodfellow in 2014.
True: Ian Goodfellow and his colleagues proposed GANs in 2014. - The performance of GenAI models does not improve with larger datasets.
False: Larger datasets generally improve the performance of GenAI models. - Ethical considerations are important in the development and deployment of GenAI.
True: Ethical considerations are crucial to ensure the responsible use of GenAI.
The Real Mona Lisa Answer

The first version of the Mona Lisa was created using Leonardo.ai, which is based on Stability AI’s Stable Diffusion model. The second one was created by Midjourney. The third one is the real painting. And the last one was created by OpenAI’s DALL-E.
Answer Key to Myths about AI Quiz:
b) False
b) False
b) False
b) False
b) False
b) False
b) False
b) False
b) False
b) False
b) False
b) False
b) False
b) False
a) True
- Reed, S., Akata, Z., Yan, X., Logeswaran, L., Schiele, B., & Lee, H. (2016). Generative adversarial text to image synthesis. ArXiv. https://arxiv.org/abs/1605.05396 ↵
- Bishara, H. (2021, March 1). From da Vinci to Bosch, the 10 most Googled paintings of 2020: The list of artworks is based on data from 155 different countries. https://hyperallergic.com/625863/from-da-vinci-to-bosch-the-10-most-googled-paintings-of-2020 ↵
- Ananya. (2024, March 19). AI image generators often give racist and sexist results: Can they be fixed? Nature, 627(8005), 722–725. https://doi.org/10.1038/d41586-024-00674-9 ↵
- Association of American Medical Colleges. (2023). 2022 physician specialty data report: Executive summary. https://www.aamc.org/media/63371/download ↵
- Thorbecke, C., & Duffy, C. (2024, February 22). Google halts AI tool’s ability to produce images of people after backlash. https://www.cnn.com/2024/02/22/tech/google-gemini-ai-image-generator/index.html ↵
- Kitsios, F., Kamariotou, M., Syngelakis, A. I., & Talias, Μ. A. (2023). Recent advances of artificial intelligence in healthcare: a systematic literature review. Applied Sciences, 13(13), 7479. https://doi.org/10.3390/app13137479 ↵
- Han, Y., Chen, J., Dou, M., Wang, J., & Feng, K. (2023). The impact of artificial intelligence on the financial services industry. Academic Journal of Management and Social Sciences, 2(3), 83–85. https://doi.org/10.54097/ajmss.v2i3.8741 ↵
- Hyde, S. J., Bachura, E., Bundy, J., Gretz, R. T., & Sanders, W. G. (2023). The tangled webs we weave: examining the effects of CEO deception on analyst recommendations. Strategic Management Journal, 45(1), 66–112. https://doi.org/10.1002/smj.3546 ↵
- McGovern, A., Bostrom, A., Davis, P., Demuth, J. L., Ebert-Uphoff, I., He, R., Gagne II, D. J., Snook, N., Stewart, J. Q., Thorncroft, C., Tissot, P., & Williams, J. K. (2022). NSF AI Institute for Research on Trustworthy AI in Weather, Climate, and Coastal Oceanography (AI2ES). Bulletin of the American Meteorological Society, 103(7), E1658–E1668. https://doi.org/10.1175/bams-d-21-0020.1 ↵
- Umer, M., Awais, M., & Muzammul, M. (2019). Stock market prediction using machine learning (ML) algorithms. Advances in Distributed Computing and Artificial Intelligence Journal, 8(4), 97–116. https://doi.org/10.14201/adcaij20198497116 ↵
- Mediavilla, M. A., Dietrich, F., & Palm, D. (2022). Review and analysis of artificial intelligence methods for demand forecasting in supply chain management. Procedia CIRP, 107, 1126–1131. https://doi.org/10.1016/j.procir.2022.05.119 ↵
- Najjar, R. (2023). Redefining radiology: A review of artificial intelligence integration in medical imaging. Diagnostics, 13(17), 2760. https://doi.org/10.3390/diagnostics13172760 ↵
- Oritsegbemi, O. (2023). Human intelligence versus AI: Implications for emotional aspects of human communication. Journal of Advanced Research in Social Sciences, 6(2), 76–85. https://doi.org/10.33422/jarss.v6i2.1005 ↵
- Ovsyannikova, D., Mello, V. O. d., & Inzlicht, M. (2025). Third-party evaluators perceive AI as more compassionate than expert humans. Communications Psychology, 3(1). https://doi.org/10.1038/s44271-024-00182-6 ↵
- Shen, J., DiPaola, D., Ali, S., Sap, M., Park, H. W., & Breazeal, C. (2024). Empathy toward artificial intelligence versus human experiences and the role of transparency in mental health and social support chatbot design: comparative study. JMIR Mental Health, 11, Article e62679. https://doi.org/10.2196/62679 ↵
- Oritsegbemi, O. (2023). Human intelligence versus AI: Implications for emotional aspects of human communication. Journal of Advanced Research in Social Sciences, 6(2), 76–85. https://doi.org/10.33422/jarss.v6i2.1005 ↵
- Brandtzæg, P. B., Skjuve, M., & Følstad, A. (2022). My AI friend: how users of a social chatbot understand their human–AI friendship. Human Communication Research, 48(3), 404–429. https://doi.org/10.1093/hcr/hqac008 ↵
- Hildt, E. (2019). Artificial intelligence: Does consciousness matter? Frontiers in Psychology, 10. https://doi.org/10.3389/fpsyg.2019.01535 ↵
- Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC (General Data Protection Regulation), OJ L 119, 4.5.2016; corrigendum OJ L 127, 23.5.2018. https://gdpr-info.eu/ ↵
- Burgess, T. (2024, October 23). Clearview AI's massive fine for GDPR violations — and what it means. Barracuda. https://blog.barracuda.com/2024/10/23/clearview-ai-fine-gdpr-violations ↵
- Thomas, L. M., & Smith, K. (2025, January 28). Don't forget the EU: Italy issued first GenAI fine of €15 million alleging GDPR violations. The National Law Review, XV(28). https://natlawreview.com/article/dont-forget-eu-italy-issued-first-genai-fine-eu15-million-alleging-gdpr-violations ↵
- Yan, H. Y., Morrow, G., Yang, K.-C., & Wihbey, J. (2025). The origin of public concerns over AI supercharging misinformation in the 2024 U.S. presidential election. Harvard Kennedy School Misinformation Review, 6(1). https://doi.org/10.37016/mr-2020-171 ↵
- de Vries, A. (2023). The growing energy footprint of artificial intelligence. Joule, 7(10), 2191–2194. https://doi.org/10.1016/j.joule.2023.09.004 ↵
- Patterson, C., Galluppi, F., Rast, A., & Furber, S. (2012). Visualising large-scale neural network models in real-time. The 2012 International Joint Conference on Neural Networks (IJCNN). https://doi.org/10.1109/ijcnn.2012.6252490 ↵
- Frey, C. B. and Osborne, M. A. (2017). The future of employment: how susceptible are jobs to computerisation? Technological Forecasting and Social Change, 114, 254–280. https://doi.org/10.1016/j.techfore.2016.08.019 ↵
- Borland, J. and Coelli, M. (2019). Behind the headline number: why not to rely on Frey and Osborne’s predictions of potential job loss from automation. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.3472764 ↵
- Briggs, J., Kodnani, D., Hatzius, J., & Pierdomenico, G. (2023, March 26). The potentially large effects of artificial intelligence on economic growth. Goldman Sachs Economic Research. https://tinyurl.com/2syzrf2h ↵
- Ekelund, H. (2024, February 26). Why there will be plenty of jobs in the future — even with artificial intelligence. World Economic Forum. https://www.weforum.org/agenda/2024/02/artificial-intelligence-ai-jobs-future/ ↵
- Rucker, P., Miller, M., & Armstrong, D. (2023, May 16). Congressional committee, regulators question Cigna system that lets its doctors deny claims without reading patient files. ProPublica. https://www.propublica.org/article/cigna-health-insurance-denials-pxdx-congress-investigation ↵
