WEBVTT

00:00:00.000 --> 00:00:03.759
For years, the browser was this perfectly safe

00:00:03.759 --> 00:00:07.299
sandbox. We kept our AI models neatly contained,

00:00:07.500 --> 00:00:10.300
you know, behind digital screens. Yeah, totally

00:00:10.300 --> 00:00:13.580
contained. But right now, the tech is aggressively

00:00:13.580 --> 00:00:16.239
stepping out. It's literally taking physical

00:00:16.239 --> 00:00:19.320
control of the real world. We're watching it

00:00:19.320 --> 00:00:22.519
run the back end of major banks. We're even seeing

00:00:22.519 --> 00:00:26.420
it... um control human hand muscles directly

00:00:26.420 --> 00:00:28.859
which is just crazy i mean we are talking about

00:00:28.859 --> 00:00:31.660
raw electrical signals bypassing our brains entirely

00:00:31.660 --> 00:00:35.340
so welcome to today's deep dive i'm really glad

00:00:35.340 --> 00:00:37.000
you're here with us yeah thanks for having me

00:00:37.000 --> 00:00:40.659
today we're looking at the landscape of may 2026.

00:00:41.100 --> 00:00:43.840
we are unpacking a massive shift happening right

00:00:43.840 --> 00:00:46.890
now ai labs are just Well, they're done waiting

00:00:46.890 --> 00:00:48.969
for users to sign up. They're completely done.

00:00:49.090 --> 00:00:50.950
Right. They're forcibly embedding themselves

00:00:50.950 --> 00:00:53.609
into the global economy. They're entirely rebuilding

00:00:53.609 --> 00:00:57.030
their infrastructure just to handle real time

00:00:57.030 --> 00:00:59.829
human interaction and, you know, causing real

00:00:59.829 --> 00:01:02.149
world hardware shortages in the process. It's

00:01:02.149 --> 00:01:04.390
an incredibly wild time to be watching this space.

00:01:04.549 --> 00:01:06.709
The whole landscape is shifting right under our

00:01:06.709 --> 00:01:09.650
feet. We're seeing this rapid transition from

00:01:09.650 --> 00:01:13.739
like digital toys to. physical reality. And it's

00:01:13.739 --> 00:01:15.900
happening much faster than anyone modeled. Exactly.

00:01:16.140 --> 00:01:18.420
So let's start by just following the money. Always

00:01:18.420 --> 00:01:20.540
a good idea to understand where AI is going.

00:01:20.680 --> 00:01:22.340
We first have to look at the enterprise side.

00:01:22.480 --> 00:01:24.939
The biggest labs in the world are forcing their

00:01:24.939 --> 00:01:27.620
way in. They're actively drilling into the corporate

00:01:27.620 --> 00:01:30.859
bedrock of our economy because they have these

00:01:30.859 --> 00:01:34.040
massive IPO goals they need to hit. And to do

00:01:34.040 --> 00:01:35.719
that, they basically need to be indispensable.

00:01:35.900 --> 00:01:39.599
Right. And. Subscriptions alone just aren't enough

00:01:39.599 --> 00:01:44.079
anymore. OpenAI and Tropic realized a very hard

00:01:44.079 --> 00:01:46.959
financial truth recently. Yeah. You simply can't

00:01:46.959 --> 00:01:48.799
wait for individual people to buy a plus plan.

00:01:49.379 --> 00:01:51.659
Consumer subscriptions are just historically

00:01:51.659 --> 00:01:55.099
fickle. Users churn all the time. They do. If

00:01:55.099 --> 00:01:57.590
you want to hit... multi -billion dollar IPO

00:01:57.590 --> 00:02:00.590
numbers, you need guaranteed revenue. You need

00:02:00.590 --> 00:02:03.290
deep institutional contracts that span years.

00:02:03.530 --> 00:02:05.250
You essentially have to physically go into their

00:02:05.250 --> 00:02:07.390
offices. You have to custom build the intelligence

00:02:07.390 --> 00:02:10.030
for them. OpenAI just made a structural move

00:02:10.030 --> 00:02:13.150
here. They quietly created something called the

00:02:13.150 --> 00:02:15.590
deployment company. Oh, yeah. It's already valued

00:02:15.590 --> 00:02:20.050
at. $10 billion. Which is staggering. Right.

00:02:20.150 --> 00:02:22.669
OpenAI owns the vast majority of the equity,

00:02:22.810 --> 00:02:25.729
but they've got serious heavy hitters backing

00:02:25.729 --> 00:02:28.270
them. TPG, Brookfield, SoftBank, they're all

00:02:28.270 --> 00:02:30.169
getting involved. That is undeniably serious

00:02:30.169 --> 00:02:33.590
institutional capital entering the chat. And

00:02:33.590 --> 00:02:37.150
their deployment strategy is, frankly, highly

00:02:37.150 --> 00:02:40.530
aggressive. They're targeting over 2 ,000 major

00:02:40.530 --> 00:02:43.949
portfolio companies. The ultimate goal is integrating

00:02:43.949 --> 00:02:47.349
GPT 5 .4 and Codex directly into their systems.

00:02:47.550 --> 00:02:50.289
They want to fundamentally rewire how these companies

00:02:50.289 --> 00:02:52.430
operate. It's not a tool anymore. It's a completely

00:02:52.430 --> 00:02:54.629
new operating system. A totally new operating

00:02:54.629 --> 00:02:56.830
system. Yeah. And Anthropic certainly isn't sitting

00:02:56.830 --> 00:02:59.050
on the sidelines here. They formed their own

00:02:59.050 --> 00:03:02.110
private equity coalition to compete. Right. Blackstone,

00:03:02.289 --> 00:03:04.430
Goldman Sachs, Hellman and Friedman. Anthropic

00:03:04.430 --> 00:03:06.789
is aggressively going after the midsize market.

00:03:06.969 --> 00:03:08.949
But, you know, their actual integration method

00:03:08.949 --> 00:03:11.599
is fascinating to me. It really is. They aren't

00:03:11.599 --> 00:03:14.060
just sending a download link. They're literally

00:03:14.060 --> 00:03:16.460
embedding their own flesh and blood engineers

00:03:16.460 --> 00:03:18.759
into these companies. Right. And that is a crucial

00:03:18.759 --> 00:03:20.900
distinction. They aren't just selling an API

00:03:20.900 --> 00:03:24.139
key and hoping for the best. No. They're sending

00:03:24.139 --> 00:03:27.659
highly paid human engineers physically into the

00:03:27.659 --> 00:03:30.960
building. These engineers are actively rewriting

00:03:30.960 --> 00:03:34.580
entrenched legacy workflows. They're using Claude

00:03:34.580 --> 00:03:37.080
to rebuild operations from scratch. So they're

00:03:37.080 --> 00:03:39.580
basically hiring a totally new kind of worker,

00:03:39.659 --> 00:03:41.620
not just developers who sit in code. They need

00:03:41.620 --> 00:03:44.860
engineers who can persuasively talk to a CEO.

00:03:45.120 --> 00:03:47.500
Yeah, translators, basically. Exactly. The biggest

00:03:47.500 --> 00:03:50.180
example of this is Anthropic's partnership with

00:03:50.180 --> 00:03:54.659
FIS. FIS is a staggeringly massive deal. For

00:03:54.659 --> 00:03:57.240
context, it's a software running a huge chunk

00:03:57.240 --> 00:03:59.879
of global banks. We're talking about the core

00:03:59.879 --> 00:04:02.659
financial plumbing of the world. Anthropic is

00:04:02.659 --> 00:04:04.680
bringing Claude directly into that sensitive

00:04:04.680 --> 00:04:08.680
ecosystem. By late 2026, it's going to be widely

00:04:08.680 --> 00:04:10.819
available across the sector. Institutions like

00:04:10.819 --> 00:04:13.159
BMO and Amalgamated Bank will be running on it.

00:04:13.240 --> 00:04:15.219
It's kind of like stacking Lego blocks of data

00:04:15.219 --> 00:04:17.279
directly into the foundation of Wall Street.

00:04:17.439 --> 00:04:19.199
That's a great way to put it. But these blocks

00:04:19.199 --> 00:04:21.819
are constantly thinking. They're actively adapting.

00:04:22.600 --> 00:04:24.959
If one block decides to change its shape, the

00:04:24.959 --> 00:04:29.300
whole tower shakes. Beat. It's honestly a little

00:04:29.300 --> 00:04:31.839
terrifying. Oh, absolutely. I mean, I still wrestle

00:04:31.839 --> 00:04:34.379
with prompt drift myself. We all do. Getting

00:04:34.379 --> 00:04:37.319
a model to just stay on track is genuinely hard.

00:04:37.540 --> 00:04:40.860
Yet they're trusting AI to run global banks seamlessly.

00:04:41.300 --> 00:04:43.779
Well, that underlying risk is exactly why they

00:04:43.779 --> 00:04:46.600
embed human engineers. They absolutely cannot

00:04:46.600 --> 00:04:49.939
afford any prompt drift at a major bank. Right.

00:04:50.040 --> 00:04:52.319
Joint ventures secure that guaranteed revenue

00:04:52.319 --> 00:04:55.319
stream. When you're deeply integrated into a

00:04:55.319 --> 00:04:57.899
bank's ledger, you're incredibly sticky. You

00:04:57.899 --> 00:04:59.740
don't just get canceled like a Spotify subscription.

00:04:59.939 --> 00:05:01.920
No, you become a permanent utility. Exactly.

00:05:03.439 --> 00:05:05.699
Sierra is another perfect example of this enterprise

00:05:05.699 --> 00:05:08.180
demand. Yeah. Sierra's wild. They just raised

00:05:08.180 --> 00:05:11.139
$950 million. Yeah. That puts their valuation

00:05:11.139 --> 00:05:14.500
at $15 billion. They already serve 40 % of the

00:05:14.500 --> 00:05:17.819
entire Fortune 50. Their annual recurring revenue

00:05:17.819 --> 00:05:20.740
jumped from $100 million to $150 million. Almost

00:05:20.740 --> 00:05:23.759
overnight. It's crazy fast. It clearly proves

00:05:23.759 --> 00:05:26.600
the corporate market is starving for this infrastructure.

00:05:26.980 --> 00:05:29.660
It effectively shows that enterprise AI is not

00:05:29.660 --> 00:05:33.529
a bubble. It's a structural rewiring of how corporate

00:05:33.529 --> 00:05:36.970
infrastructure operates. The raw demand for automated

00:05:36.970 --> 00:05:40.449
workflows is just staggering. Companies are panicking

00:05:40.449 --> 00:05:42.410
that they'd be left behind. Okay, let me pause

00:05:42.410 --> 00:05:44.550
and get this straight. If they are embedding

00:05:44.550 --> 00:05:47.990
AI into the core of global banking, how does

00:05:47.990 --> 00:05:49.870
the infrastructure keep up without crashing?

00:05:50.089 --> 00:05:52.449
Well, they realized they couldn't use the old

00:05:52.449 --> 00:05:54.370
Internet backbone. They had to completely rip

00:05:54.370 --> 00:05:56.870
out the old web architecture. They built a totally

00:05:56.870 --> 00:06:00.670
new split brain system for real time scale. So

00:06:00.670 --> 00:06:02.449
they basically built a brand new Internet just

00:06:02.449 --> 00:06:05.189
to handle the load. Exactly. And that naturally

00:06:05.189 --> 00:06:07.290
brings us to the voice infrastructure miracle.

00:06:07.959 --> 00:06:10.180
You simply can't rewrite global bank operations

00:06:10.180 --> 00:06:13.019
without flawless systems. You need split -second

00:06:13.019 --> 00:06:15.779
reliability at a massive scale. OpenAI recently

00:06:15.779 --> 00:06:18.199
released an engineering deep dive detailing this

00:06:18.199 --> 00:06:20.579
exact flex. And the raw numbers are absolutely

00:06:20.579 --> 00:06:24.000
staggering. As of May 2026, OpenAI officially

00:06:24.000 --> 00:06:27.680
handles 900 million weekly active users. Think

00:06:27.680 --> 00:06:30.120
about the physical volume of data moving there.

00:06:30.259 --> 00:06:33.339
To make a digital conversation feel human, it

00:06:33.339 --> 00:06:36.720
has to be fast. The AI constantly has to handle

00:06:36.720 --> 00:06:39.300
unpredictable human interruptions. It has to

00:06:39.300 --> 00:06:41.560
manage conversational turn -taking seamlessly.

00:06:41.980 --> 00:06:44.019
And traditional web setups just couldn't handle

00:06:44.019 --> 00:06:45.879
it. that kind of dynamic load. Yeah. Standard

00:06:45.879 --> 00:06:48.899
HTTP requests naturally have way too much inherent

00:06:48.899 --> 00:06:51.819
lag. It breaks the illusion of a conversation

00:06:51.819 --> 00:06:54.740
instantly. Yeah, it does. So OpenAI just ditched

00:06:54.740 --> 00:06:57.160
the old setups entirely. They moved to a custom

00:06:57.160 --> 00:07:00.910
WebRTC on Kubernetes stack. Let's define that

00:07:00.910 --> 00:07:03.670
jargon quickly. That's just tech that keeps live

00:07:03.670 --> 00:07:06.410
audio streams stable without crashing. Spot on.

00:07:06.550 --> 00:07:08.329
For you listening, think of the old internet

00:07:08.329 --> 00:07:10.930
like sending letters. There's always a noticeable

00:07:10.930 --> 00:07:14.029
delay waiting for a response. WebRTC is like

00:07:14.029 --> 00:07:16.850
keeping an unclosable phone line directly open.

00:07:17.110 --> 00:07:19.610
That's a perfect visual. And to actually make

00:07:19.610 --> 00:07:22.129
it work, they created a brilliant split -brain

00:07:22.129 --> 00:07:24.350
infrastructure. They didn't just shove everything

00:07:24.350 --> 00:07:27.129
onto one overheating server. They divided the

00:07:27.129 --> 00:07:29.269
computational labor to mass. maximize speed.

00:07:29.529 --> 00:07:31.850
Right. They built a very lightweight component

00:07:31.850 --> 00:07:34.490
they call a relay. The relay just handles the

00:07:34.490 --> 00:07:36.769
fast moving data packets. Its only job is to

00:07:36.769 --> 00:07:41.019
move raw audio quickly. Then. They built a much

00:07:41.019 --> 00:07:43.459
heavier stateful transceiver. The transceiver

00:07:43.459 --> 00:07:45.720
does all the actual heavy cognitive lifting.

00:07:45.779 --> 00:07:48.579
It handles the complex AI thinking and all the

00:07:48.579 --> 00:07:50.959
deep encryption. And they aggressively pushed

00:07:50.959 --> 00:07:53.579
this entire architecture out to the edge. They

00:07:53.579 --> 00:07:55.879
systematically deployed global relays at the

00:07:55.879 --> 00:07:58.180
Internet's physical edge. Which is wildly expensive.

00:07:58.699 --> 00:08:01.060
Incredibly expensive. It effectively means your

00:08:01.060 --> 00:08:04.079
voice hits a server almost instantly. The millisecond

00:08:04.079 --> 00:08:06.850
a sound leaves your lips, it... It aggressively

00:08:06.850 --> 00:08:09.709
cuts down the jitter and the conversational lag.

00:08:09.889 --> 00:08:12.689
It happens before the data even reaches the main

00:08:12.689 --> 00:08:14.689
model. And, you know, the core models themselves

00:08:14.689 --> 00:08:17.009
are entirely different now. We're talking about

00:08:17.009 --> 00:08:21.649
GPT 5 .5 and GPT real -time 1 .5. They are entirely

00:08:21.649 --> 00:08:25.069
audio native from the ground up. This is a profoundly

00:08:25.069 --> 00:08:28.170
deep shift in computer science. They aren't translating

00:08:28.170 --> 00:08:30.550
your natural speech into text anymore. Instead

00:08:30.550 --> 00:08:32.409
of reading a transcript, they're processing the

00:08:32.409 --> 00:08:34.720
pure sound waves. They can actually hear the

00:08:34.720 --> 00:08:36.960
hesitation or excitement in your voice. They

00:08:36.960 --> 00:08:39.740
process pure sound and speak with raw emotion

00:08:39.740 --> 00:08:42.460
directly. The median latency is currently sitting

00:08:42.460 --> 00:08:45.919
under 500 milliseconds. That is literally faster

00:08:45.919 --> 00:08:49.720
than human reaction time. Whoa. Imagine scaling

00:08:49.720 --> 00:08:55.120
to a billion queries. Two sec silence. It's genuinely

00:08:55.120 --> 00:08:57.299
hard to even wrap your head around the physics

00:08:57.299 --> 00:09:00.580
of that. Managing millions of concurrent stateful

00:09:00.580 --> 00:09:04.230
audio sessions is computationally brutal. Stateful

00:09:04.230 --> 00:09:06.409
means the server constantly remembers the entire

00:09:06.409 --> 00:09:09.090
conversation. It holds the context open. Exactly.

00:09:09.250 --> 00:09:11.210
It holds the context completely open while you

00:09:11.210 --> 00:09:14.210
talk. Doing that seamlessly for 900 million people

00:09:14.210 --> 00:09:16.570
without dropping context is brilliant engineering.

00:09:16.850 --> 00:09:18.490
But hold on. Think about the real world impact

00:09:18.490 --> 00:09:21.409
here. With 900 million people talking to audio

00:09:21.409 --> 00:09:24.389
native AI in real time, isn't that putting insane

00:09:24.389 --> 00:09:27.769
stress on actual physical hardware? Oh, it absolutely

00:09:27.769 --> 00:09:30.379
is. It's heavily straining the global power grid.

00:09:30.480 --> 00:09:32.259
It's gotten to the point that developers are

00:09:32.259 --> 00:09:35.019
panic buying consumer hardware to keep up. Wow.

00:09:35.240 --> 00:09:37.500
The cloud alone simply can't handle everything

00:09:37.500 --> 00:09:40.320
locally anymore. Right. The digital boom is creating

00:09:40.320 --> 00:09:43.960
a massive physical supply chain bottleneck. Exactly.

00:09:44.179 --> 00:09:47.279
And that seamlessly leads us to the profound

00:09:47.279 --> 00:09:49.720
fiction we're seeing. We are actively moving

00:09:49.720 --> 00:09:52.039
from digital infrastructure into physical reality.

00:09:52.379 --> 00:09:56.580
The scale of pure audio models. hits the real

00:09:56.580 --> 00:09:59.919
world hard. So if OpenAI is processing 900 million

00:09:59.919 --> 00:10:02.759
live audio streams, they're shifting a massive

00:10:02.759 --> 00:10:06.500
compute burden onto the grid. And that macroeconomic

00:10:06.500 --> 00:10:09.259
cloud strain is why we're seeing a microeconomic

00:10:09.259 --> 00:10:12.480
panic at the consumer level. Specifically, we

00:10:12.480 --> 00:10:14.340
are seeing this happen with Apple hardware. Yeah,

00:10:14.440 --> 00:10:16.299
Apple didn't foresee this. They didn't completely

00:10:16.299 --> 00:10:18.960
foresee this sudden crunch. Right now, Mac Mini

00:10:18.960 --> 00:10:21.299
and Mac Studio stock is running critically low.

00:10:21.720 --> 00:10:24.120
Software developers are aggressively hoarding

00:10:24.120 --> 00:10:26.450
these specific desks. cop machines because they

00:10:26.450 --> 00:10:28.990
desperately need them to run their local AI agents.

00:10:29.250 --> 00:10:31.710
Prices on the secondary market are already rising

00:10:31.710 --> 00:10:34.769
significantly. People intensely want to run AI

00:10:34.769 --> 00:10:37.049
locally for strict data privacy. They want to

00:10:37.049 --> 00:10:39.230
avoid that frustrating cloud latency entirely.

00:10:39.789 --> 00:10:42.490
So they're literally buying up every single Mac

00:10:42.490 --> 00:10:45.570
they can find. Industry experts think this severe

00:10:45.570 --> 00:10:48.649
shortage could easily hit iPhones next. If the

00:10:48.649 --> 00:10:51.409
iPhone supply chain actually gets hit, that is

00:10:51.409 --> 00:10:54.330
massive. It changes consumer tech availability

00:10:54.330 --> 00:10:56.809
overnight. It really does. And it's definitely

00:10:56.809 --> 00:10:58.789
not just physical supply chain friction we're

00:10:58.789 --> 00:11:01.110
dealing with. We're actively seeing major regulatory

00:11:01.110 --> 00:11:03.789
friction popping up, too. The White House is

00:11:03.789 --> 00:11:07.169
currently weighing some very serious new AI oversight

00:11:07.169 --> 00:11:09.730
rules. Yeah, they urgently want early access

00:11:09.730 --> 00:11:13.129
to new foundational models. The stated rationale

00:11:13.129 --> 00:11:15.289
from the administration is actually pretty straightforward.

00:11:15.750 --> 00:11:18.490
They want to manage the societal risks if things

00:11:18.490 --> 00:11:21.850
go completely wrong. It isn't really about broadly

00:11:21.850 --> 00:11:24.389
blocking AI development. They just want a thorough

00:11:24.389 --> 00:11:26.429
look under the hood before public deployment.

00:11:26.470 --> 00:11:28.840
Wait, hold on. If I'm understanding this dynamic

00:11:28.840 --> 00:11:31.019
right, the White House isn't trying to shut these

00:11:31.019 --> 00:11:32.860
local agents down. They just want a backdoor

00:11:32.860 --> 00:11:34.860
view into them because they're nervous. They're

00:11:34.860 --> 00:11:37.559
realizing an uncensored AI running freely on

00:11:37.559 --> 00:11:41.259
a local Mac studio is a wild card. Is that basically

00:11:41.259 --> 00:11:44.059
what's driving this sudden oversight push? Yeah,

00:11:44.139 --> 00:11:46.539
because the sheer lack of visibility is deeply

00:11:46.539 --> 00:11:49.340
concerning to regulators. If we connect the dots,

00:11:49.399 --> 00:11:52.679
the entire trend makes sense. The intense consumer

00:11:52.679 --> 00:11:55.480
desire for privacy and edge computing is driving

00:11:55.480 --> 00:11:58.500
this shift. Right. Everyday people want autonomous

00:11:58.500 --> 00:12:01.220
agents running locally in their own homes. That

00:12:01.220 --> 00:12:04.360
desire directly drives the massive Apple hardware

00:12:04.360 --> 00:12:08.360
shortage. And that decentralized deployment immediately

00:12:08.360 --> 00:12:10.539
prompts governments to seek visibility. They

00:12:10.539 --> 00:12:12.299
desperately want to know what is actually running

00:12:12.299 --> 00:12:14.159
on those machines. But let's bring this down

00:12:14.159 --> 00:12:15.700
to the listener for a second. If governments

00:12:15.700 --> 00:12:18.190
and hardware can barely keep up. How is this

00:12:18.190 --> 00:12:20.309
rapid integration actually showing up for everyday

00:12:20.309 --> 00:12:22.950
users right now? It's actively bleeding into

00:12:22.950 --> 00:12:26.169
highly personal physical applications at an unprecedented

00:12:26.169 --> 00:12:29.309
pace. We're seeing it fundamentally shift physical

00:12:29.309 --> 00:12:32.450
wearables and daily digital tools. So the tech

00:12:32.450 --> 00:12:35.049
is already fundamentally altering our daily physical

00:12:35.049 --> 00:12:38.009
and digital routines. Completely. We're rapidly

00:12:38.009 --> 00:12:40.269
moving from macroeconomic shortages down to the

00:12:40.269 --> 00:12:42.789
micro level. This is fundamentally about what

00:12:42.789 --> 00:12:46.250
you can actually use today. The sheer speed of

00:12:46.250 --> 00:12:47.559
practical applications application development

00:12:47.559 --> 00:12:50.019
is staggering right now. Let's actually look

00:12:50.019 --> 00:12:52.019
at the wildest edge case first, because this

00:12:52.019 --> 00:12:54.179
one honestly blew my mind when I read about it.

00:12:54.320 --> 00:12:57.759
There's a brand new AI wearable device out there

00:12:57.759 --> 00:13:00.019
right now. Well, this is crazy. It literally

00:13:00.019 --> 00:13:02.679
controls your human hands using direct electrical

00:13:02.679 --> 00:13:05.779
signals. It actively lets you perform physical

00:13:05.779 --> 00:13:08.669
skills you never learned. It's fascinating. It

00:13:08.669 --> 00:13:11.450
sends calibrated impulses straight to your hand

00:13:11.450 --> 00:13:14.269
muscles. It's almost literally like downloading

00:13:14.269 --> 00:13:17.610
physical abilities directly into your body. Yeah.

00:13:18.000 --> 00:13:20.259
It bypasses your brain's slow motor learning

00:13:20.259 --> 00:13:23.600
process entirely. It feels very sci -fi. It feels

00:13:23.600 --> 00:13:25.620
exactly like that scene in The Matrix. Totally.

00:13:25.779 --> 00:13:27.919
You just plug a cable in and suddenly you magically

00:13:27.919 --> 00:13:29.980
know Kung Fu. Or, you know, maybe you instantly

00:13:29.980 --> 00:13:31.740
know how to play the piano. You strap it on.

00:13:31.820 --> 00:13:34.919
Yeah. Your fingers are doing complex tasks. It's

00:13:34.919 --> 00:13:38.080
fascinating, but it's also profoundly weird to

00:13:38.080 --> 00:13:40.779
think about. It definitely bridges a crazy gap

00:13:40.779 --> 00:13:44.000
between biology and machine. But we also have

00:13:44.000 --> 00:13:47.379
very practical everyday tools bridging a similar

00:13:47.379 --> 00:13:50.779
gap. Look at what XAI just launched into the

00:13:50.779 --> 00:13:54.139
market this week. They released a wildly powerful

00:13:54.139 --> 00:13:58.179
new API specifically for voice cloning. It's

00:13:58.179 --> 00:14:01.429
wild. It lets you create. hyper -realistic custom

00:14:01.429 --> 00:14:04.029
voices. You can use them for podcasts, autonomous

00:14:04.029 --> 00:14:06.610
agents, synthetic videos. You can freely pick

00:14:06.610 --> 00:14:10.590
from over 80 distinct voices. They cover 28 different

00:14:10.590 --> 00:14:13.330
global languages with perfect accents. And the

00:14:13.330 --> 00:14:15.570
actual pricing model is what's truly disruptive.

00:14:15.830 --> 00:14:18.590
It starts at literally just $3 an hour. It completely

00:14:18.590 --> 00:14:20.870
commoditizes high -end voice production. Yeah.

00:14:20.950 --> 00:14:23.309
Anyone with a laptop can instantly spin up a

00:14:23.309 --> 00:14:25.590
multilingual ad campaign now. You don't need

00:14:25.590 --> 00:14:27.789
a massive recording studio or professional actors

00:14:27.789 --> 00:14:30.230
anymore. Right. Speaking of the advertising world,

00:14:30.370 --> 00:14:32.789
Meta just made a huge integration play. Meta

00:14:32.789 --> 00:14:35.570
now actively lets you connect AI straight into

00:14:35.570 --> 00:14:38.169
your ad account. You can directly plug in ChatGP

00:14:38.169 --> 00:14:41.250
to your cloud to run campaigns. The AI natively

00:14:41.250 --> 00:14:43.049
talks to your potential customers directly in

00:14:43.049 --> 00:14:45.750
the chat. The rapid adoption rate there is absolutely

00:14:45.750 --> 00:14:48.879
staggering. Weekly automated conversations jumped

00:14:48.879 --> 00:14:51.120
from 1 million to 10 million almost overnight.

00:14:51.360 --> 00:14:53.860
Wow. Small businesses are entirely letting the

00:14:53.860 --> 00:14:56.320
AI handle their sales funnels. It's negotiating,

00:14:56.620 --> 00:14:59.039
answering questions and closing deals in real

00:14:59.039 --> 00:15:02.379
time. We're also seeing an absolute flood of

00:15:02.379 --> 00:15:06.139
rapid fire. Daily tool updates, just highly practical

00:15:06.139 --> 00:15:08.879
tools you might use every single day. There's

00:15:08.879 --> 00:15:11.100
a new one called Avatar getting a lot of traction.

00:15:11.419 --> 00:15:14.220
It perfectly removes complex image backgrounds

00:15:14.220 --> 00:15:16.980
in one single click. It automatically balances

00:15:16.980 --> 00:15:20.340
colors flawlessly. It even intelligently restores

00:15:20.340 --> 00:15:22.740
missing parts of an old image. Then you have

00:15:22.740 --> 00:15:25.379
highly creative niche things like codex pets.

00:15:25.620 --> 00:15:28.039
I love them. Right? They're totally optional,

00:15:28.240 --> 00:15:31.419
small, animated companions specifically for Codex.

00:15:31.460 --> 00:15:33.639
They just sit quietly on your screen and show

00:15:33.639 --> 00:15:36.440
your thread status. They physically reflect whether

00:15:36.440 --> 00:15:38.940
Codex is actively running or just waiting. It

00:15:38.940 --> 00:15:40.799
has an interesting bit of personality to the

00:15:40.799 --> 00:15:44.230
coding process. There's also Droppy, which I

00:15:44.230 --> 00:15:46.750
think is a massive shift in retail power. It

00:15:46.750 --> 00:15:48.710
isn't just a basic price tracker. It's an autonomous

00:15:48.710 --> 00:15:51.289
agent fighting massive retail pricing algorithms.

00:15:51.509 --> 00:15:55.370
It constantly tracks Amazon, eBay, and AliExpress

00:15:55.370 --> 00:15:57.629
silently in the background. You just get a notification

00:15:57.629 --> 00:16:00.690
the exact millisecond a price drops. It shifts

00:16:00.690 --> 00:16:02.950
the power completely back to the consumer. And

00:16:02.950 --> 00:16:04.909
for independent website owners, there's a huge

00:16:04.909 --> 00:16:07.809
shift with sleek analytics. It's a completely...

00:16:08.419 --> 00:16:11.080
Privacy -first encrypted alternative to Google

00:16:11.080 --> 00:16:13.679
Analytics. It aggressively offers real -time

00:16:13.679 --> 00:16:16.379
data, entirely cookie -less tracking, and fast

00:16:16.379 --> 00:16:19.759
dashboards. All these diverse tools share one

00:16:19.759 --> 00:16:23.080
major common thread. They're taking highly complex

00:16:23.080 --> 00:16:25.179
AI architecture and making it utterly simple

00:16:25.179 --> 00:16:27.320
to use. Let's pause on that idea for a second

00:16:27.320 --> 00:16:29.399
because the societal implications are massive.

00:16:29.460 --> 00:16:31.559
With AI literally moving our hands and cloning

00:16:31.559 --> 00:16:33.980
our voices, where does the human element fit

00:16:33.980 --> 00:16:36.360
in? I think the human element fundamentally shifts

00:16:36.360 --> 00:16:39.710
up the cognitive chain entirely. As AI flawlessly

00:16:39.710 --> 00:16:42.690
handles the raw execution of tasks, we elevate.

00:16:42.970 --> 00:16:46.289
Our primary role permanently becomes one of deep

00:16:46.289 --> 00:16:49.649
curation. We handle the broad strategy and the

00:16:49.649 --> 00:16:52.129
complex intention behind the action. We decide

00:16:52.129 --> 00:16:54.570
what needs doing and the AI natively does it.

00:16:54.669 --> 00:16:57.049
Basically, we stop doing the heavy lifting and

00:16:57.049 --> 00:16:59.450
become the directors. Exactly. We confidently

00:16:59.450 --> 00:17:01.669
call the strategic shots. We architect the vision

00:17:01.669 --> 00:17:04.349
and the machine executes the labor. Let's pull

00:17:04.349 --> 00:17:06.250
all of this incredible information together.

00:17:06.829 --> 00:17:09.509
We've covered a truly massive amount of ground

00:17:09.509 --> 00:17:12.990
today. Mid -roll sponsor Red Placeholder. We

00:17:12.990 --> 00:17:15.650
started by looking at AI labs infiltrating major

00:17:15.650 --> 00:17:19.269
banks. They launched a massive $10 billion integration

00:17:19.269 --> 00:17:21.910
strategy to do it. They're actively securing

00:17:21.910 --> 00:17:23.869
guaranteed revenue from the global corporate

00:17:23.869 --> 00:17:25.869
economy. Then we look deep into the underlying

00:17:25.869 --> 00:17:28.730
infrastructure, making it possible. They absolutely

00:17:28.730 --> 00:17:30.950
had to rebuild internet audio architecture from

00:17:30.950 --> 00:17:33.309
scratch. Yeah. They're serving 900 million users

00:17:33.309 --> 00:17:36.630
seamlessly with a split brain system. They completely

00:17:36.630 --> 00:17:40.069
achieved sub -500 millisecond latency for pure

00:17:40.069 --> 00:17:43.430
emotional audio processing. And that massive

00:17:43.430 --> 00:17:47.309
digital explosion inevitably caused serious physical

00:17:47.309 --> 00:17:51.309
friction. We saw severe Apple hardware shortages

00:17:51.309 --> 00:17:54.470
hitting the secondary market hard. Software developers

00:17:54.470 --> 00:17:57.490
are aggressively hoarding Macs to run local agents.

00:17:57.630 --> 00:18:00.529
And the White House is pushing hard for new regulatory

00:18:00.529 --> 00:18:03.119
oversight. And it all eventually culminates in

00:18:03.119 --> 00:18:06.519
the most personal edge cases imaginable. Wearables

00:18:06.519 --> 00:18:09.019
that literally control your human muscles with

00:18:09.019 --> 00:18:12.980
electrical signals. XAI flawlessly cloning your

00:18:12.980 --> 00:18:16.940
exact voice for just $3 an hour. The great integration

00:18:16.940 --> 00:18:19.140
is officially no longer just a pending software

00:18:19.140 --> 00:18:22.420
update. It is a tangible, rapidly accelerating.

00:18:23.099 --> 00:18:25.039
physical reality happening right now. It really

00:18:25.039 --> 00:18:26.940
truly is. Thank you so much for joining us on

00:18:26.940 --> 00:18:29.079
this deep dive today. I highly encourage you

00:18:29.079 --> 00:18:31.119
to actively check out some of the specific tools

00:18:31.119 --> 00:18:33.460
we mentioned today. See exactly how they might

00:18:33.460 --> 00:18:35.519
seamlessly fit into your own daily workflow.

00:18:35.839 --> 00:18:38.000
And please be sure to subscribe for our next

00:18:38.000 --> 00:18:40.940
deep dive into this rapidly changing world. It's

00:18:40.940 --> 00:18:42.700
always an absolute pleasure to critically unpack

00:18:42.700 --> 00:18:44.400
this landscape with you. There's literally always

00:18:44.400 --> 00:18:46.500
something entirely new and profound to learn.

00:18:46.829 --> 00:18:48.769
I want to leave you with one final lingering

00:18:48.769 --> 00:18:51.589
thought today. If AI can perfectly replicate

00:18:51.589 --> 00:18:54.609
our voices with XAI and literally control our

00:18:54.609 --> 00:18:57.509
physical movements with new wearables, how long

00:18:57.509 --> 00:18:59.089
until we can't tell the difference between a

00:18:59.089 --> 00:19:01.509
skill we genuinely learned and a skill we simply

00:19:01.509 --> 00:19:04.089
rented for the afternoon? Beat. Let that sink

00:19:04.089 --> 00:19:04.289
in.
