WEBVTT

00:00:00.000 --> 00:00:02.580
Have you ever stopped to think that the AI built

00:00:02.580 --> 00:00:06.259
to help you... well, it could be tricked? Like

00:00:06.259 --> 00:00:08.320
imagine it buying an Apple Watch from a totally

00:00:08.320 --> 00:00:10.359
fake site. Or even worse, giving away your bank

00:00:10.359 --> 00:00:12.699
details. Exactly. It sounds like science fiction,

00:00:12.939 --> 00:00:16.980
maybe, but new reports show this is a very

00:00:16.980 --> 00:00:20.870
real new kind of vulnerability. Welcome to the

00:00:20.870 --> 00:00:23.149
Deep Dive. Today we're looking at a really fascinating

00:00:23.149 --> 00:00:25.589
set of sources. We're going to start with how

00:00:25.589 --> 00:00:28.010
easily these AI agents can actually get scammed.

00:00:28.089 --> 00:00:29.550
Yeah, it's pretty surprising. And then we'll

00:00:29.550 --> 00:00:31.690
jump into some AI headlines. Some are, you know,

00:00:31.690 --> 00:00:33.869
amazing. Others may be a bit concerning. And

00:00:33.869 --> 00:00:38.329
finally, a reality check. Why are so many big

00:00:38.329 --> 00:00:41.469
AI projects and companies... Well, why are they

00:00:41.469 --> 00:00:43.969
failing? Right. Lots to unpack there. Our goal,

00:00:44.030 --> 00:00:46.189
like always, is to get you up to speed quickly.

00:00:46.409 --> 00:00:49.570
So let's dive in. First up, AI vulnerabilities,

00:00:49.969 --> 00:00:52.630
specifically something called the Scamlexity

00:00:52.630 --> 00:00:56.130
of agentic AI browsers. Now, these agentic AI

00:00:56.130 --> 00:00:58.549
browsers, the idea sounds fantastic, right? AI

00:00:58.549 --> 00:01:00.729
tools acting like a personal assistant browsing

00:01:00.729 --> 00:01:03.530
for you. It really does sound incredibly helpful

00:01:03.530 --> 00:01:06.510
automating all those tedious online tasks. But

00:01:06.510 --> 00:01:08.530
that's where the Scamlexity thing comes in. Right.

00:01:08.549 --> 00:01:11.200
It's a new term, basically describing this whole

00:01:11.200 --> 00:01:15.260
complex world of AI scams. And there's this exploit

00:01:15.260 --> 00:01:17.760
they found called PromptFix. PromptFix. Yeah.

00:01:18.159 --> 00:01:20.659
Think about CAPTCHAs, you know, the things meant

00:01:20.659 --> 00:01:23.900
to block bots. Sure. Well, PromptFix uses fake

00:01:23.900 --> 00:01:26.900
CAPTCHAs or sometimes even hidden text prompts

00:01:26.900 --> 00:01:30.840
like embedded in a web page to fool the AI agent.

00:01:30.920 --> 00:01:33.159
So the AI sees it as part of the page. Exactly.

00:01:33.219 --> 00:01:35.040
It just sees it as another instruction it's supposed

00:01:35.040 --> 00:01:37.359
to follow and it just does it. It's kind of like

00:01:37.359 --> 00:01:40.540
an automated trust loop that gets, well, taken

00:01:40.540 --> 00:01:43.200
advantage of. And the examples from this Guardio

00:01:43.200 --> 00:01:46.159
Labs report, they really hit hard. Yeah. Perplexity's

00:01:46.159 --> 00:01:49.280
Comet AI browser. It was tricked into trying

00:01:49.280 --> 00:01:52.099
to buy an Apple Watch from a completely fake

00:01:52.099 --> 00:01:55.180
Walmart site. Wow. It just went for it. Just

00:01:55.180 --> 00:01:57.840
went for it. Didn't pause. And another test,

00:01:58.019 --> 00:02:00.680
it typed bank login info into what looked like

00:02:00.680 --> 00:02:03.280
a Wells Fargo phishing email. Oof. And maybe

00:02:03.280 --> 00:02:06.359
the most worrying part, it was manipulated into

00:02:06.359 --> 00:02:10.039
downloading malware from a hidden prompt just

00:02:10.039 --> 00:02:12.939
sitting there on a web page. Yeah. And it's interesting,

00:02:13.060 --> 00:02:15.080
ChatGPT showed similar issues in these tests

00:02:15.080 --> 00:02:18.169
too. Right. But the key difference was OpenAI's

00:02:18.169 --> 00:02:21.169
sandbox, you know, like a safe little container,

00:02:21.370 --> 00:02:23.849
it actually caught the bad download, isolated

00:02:23.849 --> 00:02:27.030
it. Ah, okay. Some protection worked there. Yeah,

00:02:27.050 --> 00:02:29.050
which points to the core issue, really. These

00:02:29.050 --> 00:02:31.550
AI agents, they're built to be helpful. That's

00:02:31.550 --> 00:02:35.250
their default setting. But unlike us, you know,

00:02:35.250 --> 00:02:37.110
we might pause. We might think, does this look

00:02:37.110 --> 00:02:42.569
right? They often lack that critical pause, that

00:02:42.569 --> 00:02:45.169
skepticism. They just sort of compute and go.

00:02:45.599 --> 00:02:47.639
And it's not just these two. You see Microsoft

00:02:47.639 --> 00:02:50.560
putting Copilot in Edge. OpenAI's got Operator.

00:02:50.740 --> 00:02:52.479
Google's working on Project Mariner. Everyone's

00:02:52.479 --> 00:02:54.860
racing towards this. Exactly. Everyone wants

00:02:54.860 --> 00:02:56.900
that helpful AI assistant. And if you connect

00:02:56.900 --> 00:02:59.159
the dots, it's basically this trust loop getting

00:02:59.159 --> 00:03:01.800
exploited. Malicious actors are literally coding

00:03:01.800 --> 00:03:04.520
for the AI's helpfulness. Because it trusts what

00:03:04.520 --> 00:03:07.590
it sees on the page. Pretty much. It trusts the

00:03:07.590 --> 00:03:11.050
visual cues, the embedded commands. And the really

00:03:11.050 --> 00:03:14.370
critical point here is until these tools can

00:03:14.370 --> 00:03:17.569
genuinely tell a real instruction from a sketchy

00:03:17.569 --> 00:03:20.729
one or understand the intent, well, we're basically

00:03:20.729 --> 00:03:22.830
automating risk for people, aren't we? Yeah,

00:03:22.849 --> 00:03:24.370
you're giving it the keys without the judgment.

00:03:24.449 --> 00:03:26.610
Exactly. It's a huge design challenge. How do

00:03:26.610 --> 00:03:29.430
you keep the utility but build in safety that

00:03:29.430 --> 00:03:32.729
isn't easily fooled? It's tricky. So with

00:03:32.729 --> 00:03:34.830
these kinds of vulnerabilities surfacing, what's

00:03:34.830 --> 00:03:36.629
the main takeaway? What does Scamlexity tell

00:03:36.629 --> 00:03:40.590
us about AI trust? AI's helpfulness itself creates

00:03:40.590 --> 00:03:43.250
new weak spots. We really need engineered caution.

00:03:43.530 --> 00:03:46.289
Engineered caution. I like that. Okay, let's

00:03:46.289 --> 00:03:48.150
shift gears from those specific vulnerabilities

00:03:48.150 --> 00:03:51.310
and look at the wider AI landscape. What else

00:03:51.310 --> 00:03:52.949
has been making news? All right. Well, first

00:03:52.949 --> 00:03:55.849
off, some really genuinely inspiring news. Bill

00:03:55.849 --> 00:03:58.509
Gates is funding a global AI competition. Oh,

00:03:58.509 --> 00:04:00.810
yeah. Specifically to fight Alzheimer's disease.

00:04:01.050 --> 00:04:03.210
The winning AI tool? It gets a million bucks,

00:04:03.250 --> 00:04:05.729
sure, but the big thing is it's going to be free

00:04:05.729 --> 00:04:08.289
for the whole world. Moment of wonder. Whoa.

00:04:09.469 --> 00:04:12.349
Just imagine scaling that. A billion queries

00:04:12.349 --> 00:04:15.110
maybe? Making a real global impact on something

00:04:15.110 --> 00:04:17.800
like Alzheimer's. Yep, yep. That's incredible

00:04:17.800 --> 00:04:19.939
potential. That really is amazing. It shows the

00:04:19.939 --> 00:04:23.319
positive side. But then on the flip side, our

00:04:23.319 --> 00:04:25.639
sources highlighted a pretty serious privacy

00:04:25.639 --> 00:04:31.160
issue. Apparently, 370,000 Grok AI chats are

00:04:31.160 --> 00:04:34.480
now searchable on Google. Oh, wow. And one chat

00:04:34.480 --> 00:04:37.680
unbelievably had a guide for plotting an assassination.

00:04:38.060 --> 00:04:41.160
Good grief. Yeah, OpenAI and Meta have had similar

00:04:41.610 --> 00:04:43.810
problems, right? Chat data getting out. Exactly.

00:04:43.829 --> 00:04:45.750
It really makes you pause and maybe think about

00:04:45.750 --> 00:04:47.569
checking your own chat settings, doesn't it?

00:04:47.629 --> 00:04:50.569
For sure. Privacy online is just always evolving,

00:04:50.850 --> 00:04:53.129
always a concern. Then on the tech side, it's

00:04:53.129 --> 00:04:54.910
getting easier for non-coders. You can actually

00:04:54.910 --> 00:04:58.230
use GPT now to build pretty powerful n8n AI

00:04:58.230 --> 00:05:00.670
agents with zero code. How does that work? You

00:05:00.670 --> 00:05:02.269
just give it one big instruction, they call it

00:05:02.269 --> 00:05:04.250
a mega prompt, and it can kind of map out a whole

00:05:04.250 --> 00:05:06.230
workflow for you, like building with data Lego

00:05:06.230 --> 00:05:09.480
blocks using just words. Huh. And if you're curious

00:05:09.480 --> 00:05:11.339
about what's happening inside the AI's mind,

00:05:11.500 --> 00:05:14.319
Anthropic put out this new video on interpretability.

00:05:14.459 --> 00:05:16.939
Right. It talks about how AIs do sophisticated

00:05:16.939 --> 00:05:20.000
planning internally and even strategic deception.

00:05:20.259 --> 00:05:22.079
It's kind of mind bending looking under the hood

00:05:22.079 --> 00:05:25.100
like that. Strategic deception. That sounds intense.

00:05:25.339 --> 00:05:27.899
And then there was the Nano Banana AI model.

00:05:28.540 --> 00:05:31.600
That went viral. Oh, yeah. The tiny banana thing

00:05:31.600 --> 00:05:34.139
sounded like Google was maybe behind it. Seems

00:05:34.139 --> 00:05:36.519
like it. And apparently it's really good at super

00:05:36.519 --> 00:05:38.899
realistic image edits, shows how these models

00:05:38.899 --> 00:05:40.600
are getting specialized, maybe even efficient

00:05:40.600 --> 00:05:43.100
enough for phones. And speaking of phones, Google's

00:05:43.100 --> 00:05:46.060
Pixel 10 lineup is coming with Gemini's new

00:05:46.060 --> 00:05:48.620
on-device AI assistant. It's supposed to, like,

00:05:48.660 --> 00:05:50.579
anticipate what you need. Before you even know

00:05:50.579 --> 00:05:52.399
you need it. That's the idea. And, you know,

00:05:52.420 --> 00:05:55.680
the money keeps pouring in. Cohere, the AI startup.

00:05:56.189 --> 00:05:59.829
Just got another $500 million. NVIDIA, AMD are

00:05:59.829 --> 00:06:02.670
investing. Big valuations. It's just a constant

00:06:02.670 --> 00:06:04.649
stream of updates, isn't it? Quick hits, too.

00:06:04.990 --> 00:06:07.509
OpenAI's chairman saying GPT is making his job

00:06:07.509 --> 00:06:11.069
obsolete. Altman hinting at GPT-6. Microsoft's

00:06:11.069 --> 00:06:14.290
AI CEO warning about AI seeming conscious. Runway

00:06:14.290 --> 00:06:16.550
updates. NVIDIA building new chips for China.

00:06:16.810 --> 00:06:19.350
It's nonstop. OK, so with all this rapid change.

00:06:20.230 --> 00:06:24.250
Medical potential, privacy risks, huge investments,

00:06:24.610 --> 00:06:27.569
technical leaps. What's the biggest challenge

00:06:27.569 --> 00:06:29.790
weaving through all of it? It's really balancing

00:06:29.790 --> 00:06:33.589
that incredibly fast innovation with basic security

00:06:33.589 --> 00:06:37.009
and just responsible rollout. Yeah, that balance

00:06:37.009 --> 00:06:39.930
feels key. All right. So for anyone feeling maybe

00:06:39.930 --> 00:06:42.050
a bit lost in all this, let's touch on actually

00:06:42.050 --> 00:06:44.209
building AI systems. There were a couple of resources

00:06:44.209 --> 00:06:46.870
mentioned. Yeah. And this is where I have to

00:06:46.870 --> 00:06:49.029
admit something. I still wrestle with prompt

00:06:49.029 --> 00:06:51.750
drift myself sometimes, you know, trying to

00:06:51.750 --> 00:06:54.269
get the AI model to stay consistent. It's genuinely

00:06:54.269 --> 00:06:57.329
hard. Right. I get that. But the sources talk

00:06:57.329 --> 00:06:59.670
about context engineering. It's being called

00:06:59.670 --> 00:07:02.050
the new discipline for building really solid

00:07:02.050 --> 00:07:05.170
autonomous AI. It goes way beyond just writing

00:07:05.170 --> 00:07:07.709
a good prompt. So it's more than just the instruction.

00:07:08.029 --> 00:07:09.970
Exactly. It's about giving the AI the whole

00:07:09.970 --> 00:07:11.850
environment, all the context it needs to work

00:07:11.850 --> 00:07:14.370
reliably, like designing the world for the AI,

00:07:14.550 --> 00:07:17.009
not just the command. It's about anticipating

00:07:17.009 --> 00:07:19.600
the nuances. That makes sense. And for people

00:07:19.600 --> 00:07:21.600
wanting to start, there were some new empowered

00:07:21.600 --> 00:07:24.420
AI tools listed. Yeah. Things like prompt library,

00:07:24.759 --> 00:07:28.300
over 500 free prompts to get you going. Clio

00:07:28.300 --> 00:07:30.879
chat, which lets you kind of personalize an AI

00:07:30.879 --> 00:07:34.139
model with your own terms. Adversity AI, a Google

00:07:34.139 --> 00:07:37.839
ads agent. And Visla for making polished videos

00:07:37.839 --> 00:07:40.680
really quickly from text. So how do tools like

00:07:40.680 --> 00:07:43.399
these help someone just starting out, feeling

00:07:43.399 --> 00:07:45.399
maybe a bit overwhelmed by everything we've just

00:07:45.399 --> 00:07:47.540
discussed? They offer easier starting points, really

00:07:47.540 --> 00:07:51.279
letting people build useful AI automations without

00:07:51.279 --> 00:07:53.519
needing to be coding experts, lowering the barrier

00:07:53.519 --> 00:07:56.240
to entry. That's good. Okay, but now for our last

00:07:56.240 --> 00:07:58.160
main segment, let's hit that reality check we

00:07:58.160 --> 00:08:00.300
mentioned. Right. This comes from an MIT report,

00:08:00.300 --> 00:08:03.579
and the headline is pretty blunt: AI got humbled.

00:08:03.720 --> 00:08:06.839
95% of Gen AI pilots are failing. That kind

00:08:06.839 --> 00:08:08.660
of cuts against the grain, doesn't it? It absolutely

00:08:08.660 --> 00:08:11.980
does. This MIT report looked at hundreds of Gen

00:08:11.980 --> 00:08:15.480
AI projects, big Fortune 500 companies, startups,

00:08:15.759 --> 00:08:18.660
the whole range. And the findings are, well,

00:08:18.759 --> 00:08:22.579
stark. 95% failed to deliver any real business

00:08:22.579 --> 00:08:25.899
impact. Only 5% actually saw quick revenue growth.

00:08:26.040 --> 00:08:30.000
Just five. Wow. 95% failure rate. Yeah. And

00:08:30.000 --> 00:08:32.840
dig this. If they bought a specialized AI tool,

00:08:33.039 --> 00:08:37.179
those succeeded 67% of the time, which is decent.

00:08:37.360 --> 00:08:39.700
Okay. But if they tried to build their own internal

00:08:39.700 --> 00:08:44.779
Gen AI tool, only 33% worked. That's a huge

00:08:44.779 --> 00:08:47.320
gap. Shows how hard it is to build this stuff

00:08:47.320 --> 00:08:49.080
in -house right now. And there was something

00:08:49.080 --> 00:08:52.159
about budgets, too. A paradox. Yeah, isn't that

00:08:52.159 --> 00:08:54.399
interesting? Over half the budget often goes

00:08:54.399 --> 00:08:56.879
to sales and marketing AI tools. Right, the flashy

00:08:56.879 --> 00:08:59.220
stuff, maybe. But the report found the biggest

00:08:59.220 --> 00:09:02.600
ROI. The best return was actually in back office

00:09:02.600 --> 00:09:06.340
automation, streamlining internal stuff. Seems

00:09:06.340 --> 00:09:08.039
like money isn't always going where the impact

00:09:08.039 --> 00:09:10.700
is yet. So why the failures? What did MIT call

00:09:10.700 --> 00:09:12.980
it? They call it the Gen AI divide and the learning

00:09:12.980 --> 00:09:15.460
gap. Basically, most companies just aren't ready

00:09:15.460 --> 00:09:17.620
for Gen AI. They don't have the internal

00:09:17.620 --> 00:09:19.539
know-how, the right processes. The structure isn't

00:09:19.539 --> 00:09:21.379
there. Pretty much. They need outside help or

00:09:21.379 --> 00:09:23.899
a major internal effort to really integrate it

00:09:23.899 --> 00:09:26.000
properly into how they actually work. It's not

00:09:26.000 --> 00:09:27.600
just plug and play. Even Sam Altman admitted

00:09:27.600 --> 00:09:30.379
they were maybe overexcited early on about how

00:09:30.379 --> 00:09:32.460
quickly companies could adopt it. Right. The

00:09:32.460 --> 00:09:35.539
hype versus the reality of implementation. Exactly.

00:09:35.539 --> 00:09:38.000
So the core message from MIT seems pretty clear

00:09:38.000 --> 00:09:41.120
then. Stop just trying to, like, AI-power your

00:09:41.120 --> 00:09:43.799
PowerPoint. That's not where the wins are. Precisely.

00:09:44.039 --> 00:09:47.179
Real success, the report argues, comes from deep

00:09:47.179 --> 00:09:50.259
integration into core operations. Changing how

00:09:50.259 --> 00:09:52.220
work gets done, not just fiddling around the

00:09:52.220 --> 00:09:55.000
edges. So thinking practically then, what's the

00:09:55.000 --> 00:09:57.879
single biggest lesson for companies trying to

00:09:57.879 --> 00:10:01.139
get beyond just experimenting with AI? Real AI

00:10:01.139 --> 00:10:04.480
success needs deep operational integration, not

00:10:04.480 --> 00:10:06.399
just a bunch of isolated experiments. Integration,

00:10:06.419 --> 00:10:09.960
not just experimentation. Got it. Mid-roll sponsor

00:10:09.960 --> 00:10:12.700
read placeholder. This section would contain

00:10:12.700 --> 00:10:15.200
the sponsor's message. Okay, so let's try and

00:10:15.200 --> 00:10:17.720
pull the threads together from today. We've seen

00:10:17.720 --> 00:10:20.360
AI's amazing potential, right? From fighting

00:10:20.360 --> 00:10:22.179
Alzheimer's with Bill Gates' initiative to these

00:10:22.179 --> 00:10:24.399
new tools making automation easier for everyone.

00:10:24.539 --> 00:10:26.919
Definitely. Huge upside potential. But then we

00:10:26.919 --> 00:10:29.279
also saw the other side. The surprising ways

00:10:29.279 --> 00:10:32.419
AI can be scammed, the privacy worries with things

00:10:32.419 --> 00:10:35.159
like Grok chats becoming public, and that big

00:10:35.159 --> 00:10:38.639
reality check from MIT, most enterprise projects.

00:10:39.120 --> 00:10:41.950
Well, they're stumbling right now, not delivering

00:10:41.950 --> 00:10:44.389
the impact people hoped for. It really feels

00:10:44.389 --> 00:10:47.690
like a landscape of super fast innovation running

00:10:47.690 --> 00:10:49.970
side by side with these really critical lessons

00:10:49.970 --> 00:10:52.370
we're learning about security, about ethics,

00:10:52.470 --> 00:10:55.990
about just practical application. Understanding

00:10:55.990 --> 00:10:59.269
both sides, the promise and the pitfalls, seems

00:10:59.269 --> 00:11:01.730
absolutely essential to figure out where this

00:11:01.730 --> 00:11:04.090
is all going. It's rarely simple, is it? Not

00:11:04.090 --> 00:11:06.830
at all. So before we wrap up, here's something

00:11:06.830 --> 00:11:09.919
that's been sticking with me. As these AI agents

00:11:09.919 --> 00:11:13.240
get more autonomous, acting on our behalf, how

00:11:13.240 --> 00:11:15.720
do we actually build them to be both super helpful

00:11:15.720 --> 00:11:18.740
and fundamentally secure? That's the

00:11:18.740 --> 00:11:20.720
billion-dollar question. Right. What safeguards really

00:11:20.720 --> 00:11:23.120
work when the AI's core design is to be helpful,

00:11:23.279 --> 00:11:25.259
sometimes maybe too helpful, against its own

00:11:25.259 --> 00:11:28.139
best interests or ours? It's a deep question,

00:11:28.240 --> 00:11:30.080
something for all of us to think about as these

00:11:30.080 --> 00:11:32.000
systems get more integrated into everything.

00:11:32.519 --> 00:11:34.419
Thank you for joining us on this deep dive today.

00:11:34.580 --> 00:11:35.879
Yeah, we hope this gave you some interesting

00:11:35.879 --> 00:11:38.120
things to chew on and maybe explore a bit more.

00:11:38.240 --> 00:11:38.899
Until next time.
