WEBVTT

00:00:00.000 --> 00:00:02.700
Okay, let's just run a quick simulation. You

00:00:02.700 --> 00:00:06.719
spend an entire week, I mean a full week, grinding

00:00:06.719 --> 00:00:08.960
on a gaming video. Right. You capture all the

00:00:08.960 --> 00:00:11.240
footage, you edit the highlights, you're obsessing

00:00:11.240 --> 00:00:13.980
over every single word of commentary, you upload

00:00:13.980 --> 00:00:17.179
it. And by some miracle, the algorithm loves

00:00:17.179 --> 00:00:20.260
you. It hits a million views. Which is massive.

00:00:20.359 --> 00:00:22.160
I mean, that's a career moment for most creators.

00:00:22.460 --> 00:00:24.739
It's huge. You're popping the champagne, but

00:00:24.739 --> 00:00:29.850
then the check clears. And it's for like... $3,000.

00:00:29.850 --> 00:00:33.350
Which, look, isn't bad for a week's work.

00:00:33.509 --> 00:00:35.929
But now take that exact same energy, same hours,

00:00:35.950 --> 00:00:38.329
same effort. But instead of gaming, you make

00:00:38.329 --> 00:00:40.590
a video about personal finance. A million views.

00:00:40.869 --> 00:00:43.649
The check. The check is $20,000. That's the

00:00:43.649 --> 00:00:45.429
revenue gap. And honestly, that's the entire

00:00:45.429 --> 00:00:47.810
reason we're digging into this today. It basically

00:00:47.810 --> 00:00:50.509
turns content creation from like an art project

00:00:50.509 --> 00:00:53.219
into an arbitrage opportunity. Welcome back to

00:00:53.219 --> 00:00:55.420
the Deep Dive. Today we are unpacking a really

00:00:55.420 --> 00:00:57.259
fascinating and maybe a little controversial

00:00:57.259 --> 00:01:01.060
guide from early 2026 by a guy named Max Ann.

00:01:01.259 --> 00:01:04.680
It's titled The $20,000 Faceless Strategy. A

00:01:04.680 --> 00:01:07.900
bold title. Very bold. The whole premise is building

00:01:07.900 --> 00:01:10.519
a finance video empire with nothing but free

00:01:10.519 --> 00:01:14.379
AI tools. But the key constraint is... No camera,

00:01:14.459 --> 00:01:16.900
no microphone, and absolutely no showing your

00:01:16.900 --> 00:01:19.260
face. Which is, I mean, that's the holy grail

00:01:19.260 --> 00:01:21.439
for introverts, right? Or anyone who just wants

00:01:21.439 --> 00:01:23.700
a side hustle that doesn't require being on all

00:01:23.700 --> 00:01:25.840
the time. But what really caught my eye isn't

00:01:25.840 --> 00:01:28.280
just the tools. It's the philosophy behind it.

00:01:28.519 --> 00:01:31.599
It treats YouTube less like, you know, a creative

00:01:31.599 --> 00:01:34.099
outlet and more like a math problem. It is extremely

00:01:34.099 --> 00:01:36.200
industrial. So today we're going to walk through

00:01:36.200 --> 00:01:39.480
the economics of why this one niche pays so ridiculously

00:01:39.480 --> 00:01:42.040
well. This concept of retention engineering for

00:01:42.040 --> 00:01:44.659
scripts, which I think is the real secret here.

00:01:44.780 --> 00:01:46.920
Yeah. And the specific tech stack. We're talking

00:01:46.920 --> 00:01:50.159
ElevenLabs, Gemini, Grok, all the big ones.

00:01:50.260 --> 00:01:52.200
But we also have to talk about the cost. Because

00:01:52.200 --> 00:01:54.260
as we dug into this, it became really clear that

00:01:54.260 --> 00:01:57.140
free tools does not mean free time. There is

00:01:57.140 --> 00:02:00.420
a brutal reality check waiting at the end of

00:02:00.420 --> 00:02:02.379
this whole roadmap. Let's start with the money,

00:02:02.420 --> 00:02:05.640
though. Section two of the guide is called Why

00:02:05.640 --> 00:02:09.139
Finance Videos Print Money. We kind of touched

00:02:09.139 --> 00:02:12.509
on the... $3,000 versus $20,000 difference.

00:02:13.189 --> 00:02:15.449
Can you break that down for us? Why is the gap

00:02:15.449 --> 00:02:17.569
so huge? A million views is a million views,

00:02:17.710 --> 00:02:19.810
isn't it? You'd think so. But to an advertiser,

00:02:19.930 --> 00:02:22.750
those views are just radically different commodities.

00:02:22.949 --> 00:02:25.909
It all comes down to RPM, revenue per mille. Which

00:02:25.909 --> 00:02:28.629
is just industry speak for what you get paid

00:02:28.629 --> 00:02:30.729
for a thousand views. Exactly. It's the price

00:02:30.729 --> 00:02:32.810
of attention. Think about the gaming audience.

00:02:33.009 --> 00:02:36.680
It's broad. It's younger. Kids, students, you

00:02:36.680 --> 00:02:38.360
know, people with limited disposable income.

00:02:38.900 --> 00:02:40.819
Advertisers know this. They're not going to bid

00:02:40.819 --> 00:02:43.060
high for that inventory. So they'll pay maybe

00:02:43.060 --> 00:02:45.319
$3 to get in front of a thousand of those eyeballs.

00:02:45.560 --> 00:02:47.800
That makes perfect sense. Low purchasing power,

00:02:48.039 --> 00:02:50.900
low bid. Now flip that. Who's watching videos

00:02:50.900 --> 00:02:54.039
on how to invest or the millionaire mindset?

00:02:54.360 --> 00:02:57.300
People with money or people who want money. Right.

00:02:57.379 --> 00:02:59.979
So the advertisers bidding for that slot are

00:02:59.979 --> 00:03:03.960
banks, trading platforms, fintech companies.

00:03:05.000 --> 00:03:07.580
Their customers have a high lifetime value, so

00:03:07.580 --> 00:03:10.180
they are more than willing to pay a premium to

00:03:10.180 --> 00:03:12.780
get those customers. We're talking $12, maybe

00:03:12.780 --> 00:03:16.300
even up to $22 RPM. So purely from a business

00:03:16.300 --> 00:03:19.639
standpoint, a finance viewer is worth what? Four

00:03:19.639 --> 00:03:21.860
to six times more than a gaming viewer? Correct.

00:03:21.939 --> 00:03:24.930
It's high value real estate online. But... And

00:03:24.930 --> 00:03:26.750
this is the huge catch the guide points out right

00:03:26.750 --> 00:03:29.090
away. Traditional finance content is incredibly

00:03:29.090 --> 00:03:30.889
boring. I was going to say, I don't think anyone

00:03:30.889 --> 00:03:32.949
wants to sit through a 15-minute lecture on the

00:03:32.949 --> 00:03:36.210
technical mechanics of compound interest. And

00:03:36.210 --> 00:03:38.469
that's what creates the content gap. The old

00:03:38.469 --> 00:03:40.389
way is a lecture. It's a guy in a suit with a

00:03:40.389 --> 00:03:43.150
whiteboard. The viral strategy, the one Max Anne

00:03:43.150 --> 00:03:45.509
outlines, is all about wrapping those high value

00:03:45.509 --> 00:03:48.150
keywords in storytelling. This is the part I

00:03:48.150 --> 00:03:50.150
found really interesting. We think of finance

00:03:50.150 --> 00:03:53.560
as this cold, logical field: spreadsheets, percentages.

00:03:54.099 --> 00:03:56.520
But this guide suggests you have to lean almost

00:03:56.520 --> 00:03:59.639
entirely into emotion to succeed. Why does that

00:03:59.639 --> 00:04:01.759
emotional angle matter so much if you're just

00:04:01.759 --> 00:04:04.599
chasing ad rates? Because the ad rate only matters

00:04:04.599 --> 00:04:06.900
if people actually watch the video. High RPM

00:04:06.900 --> 00:04:09.979
is useless if your retention is 0%. You have

00:04:09.979 --> 00:04:12.719
to use those rags-to-riches narratives, stories

00:04:12.719 --> 00:04:15.240
about mindset shifts, the psychology of success.

00:04:15.419 --> 00:04:17.439
You basically have to trick the viewer's brain

00:04:17.439 --> 00:04:20.550
into enjoying a finance lesson. So, probing question

00:04:20.550 --> 00:04:23.009
for you: you hook them with the heart and you

00:04:23.009 --> 00:04:25.050
cash out with the keywords. That is the model?

00:04:25.050 --> 00:04:28.269
100%. Emotion drives the view. The keywords drive

00:04:28.269 --> 00:04:30.209
the check. Okay, let's get into the execution then,

00:04:30.209 --> 00:04:33.589
the how. The guide has a complete toolkit and

00:04:33.589 --> 00:04:37.189
a strategy it calls pattern recognition. Walk

00:04:37.189 --> 00:04:39.430
us through the tech stack first. Is this stuff

00:04:39.430 --> 00:04:43.610
really free for 2026? It is surprisingly accessible

00:04:43.610 --> 00:04:46.910
on free tiers if you know how to game them a

00:04:46.910 --> 00:04:49.389
little. You've got ChatGPT for scripting and

00:04:49.389 --> 00:04:51.790
research. ElevenLabs for the voiceover, which is

00:04:51.790 --> 00:04:54.389
totally non-negotiable for quality. Then Gemini

00:04:54.389 --> 00:04:57.370
for making consistent images. Grok Imagine

00:04:57.569 --> 00:04:59.649
to turn those images into short videos. And then

00:04:59.649 --> 00:05:02.930
a basic editor like CapCut or Filmora. Yep. All

00:05:02.930 --> 00:05:04.689
free to start. Okay, so the tools are there.

00:05:05.339 --> 00:05:07.600
But the guide is pretty stern about not just

00:05:07.600 --> 00:05:09.660
jumping in and making whatever you feel like.

00:05:09.779 --> 00:05:12.699
It calls this pattern recognition. Yeah, and

00:05:12.699 --> 00:05:14.500
this is where most beginners just completely

00:05:14.500 --> 00:05:17.300
crash and burn. They wake up and think, I have

00:05:17.300 --> 00:05:19.420
a great idea. I'm going to make a video about

00:05:19.420 --> 00:05:22.899
how to save money on latte art. Which sounds

00:05:22.899 --> 00:05:24.879
like a, you know, a perfectly reasonable, helpful

00:05:24.879 --> 00:05:27.180
video. But in this model, it's a total waste

00:05:27.180 --> 00:05:29.500
of time because you're guessing. The strategy

00:05:29.500 --> 00:05:32.269
here is all about reverse engineering. You go

00:05:32.269 --> 00:05:35.009
to YouTube, you search financial freedom or passive

00:05:35.009 --> 00:05:37.970
income, then you sort by most popular, and you

00:05:37.970 --> 00:05:40.329
just study the top three videos in the last six

00:05:40.329 --> 00:05:42.670
months. So you aren't inventing, you're analyzing.

00:05:43.009 --> 00:05:45.449
Precisely. You look at the titles. What emotions

00:05:45.449 --> 00:05:47.490
are they hitting on? Is it fear? The crash is

00:05:47.490 --> 00:05:50.430
coming. Is it greed? How I made 10 grand in a

00:05:50.430 --> 00:05:52.810
week. You take those winning concepts and you

00:05:52.810 --> 00:05:55.250
bring them right over to ChatGPT. And the guide

00:05:55.250 --> 00:05:58.470
mentions a specific prompt strategy here. You

00:05:58.470 --> 00:06:01.209
don't just ask for ideas. No, absolutely not.

00:06:01.610 --> 00:06:04.870
Generic prompts get generic results. You feed

00:06:04.870 --> 00:06:07.990
ChatGPT the viral titles you actually found.

00:06:08.089 --> 00:06:11.490
You say, analyze these, now generate 20 new ideas

00:06:11.490 --> 00:06:13.810
that appeal to a U.S. audience, include specific

00:06:13.810 --> 00:06:16.149
numbers in the title, and follow this emotional

00:06:16.149 --> 00:06:19.329
style. You're basically telling the AI, here's

00:06:19.329 --> 00:06:21.970
the winning formula, give me 20 variations of

00:06:21.970 --> 00:06:23.850
it. I have to play devil's advocate for a second.

00:06:23.889 --> 00:06:27.329
Is this just plagiarism with extra steps? It's

00:06:27.329 --> 00:06:30.110
a fair question. I'd argue it's market research.

00:06:30.750 --> 00:06:32.589
You're not copying the script word for word.

00:06:32.689 --> 00:06:35.629
You're copying the structure of the demand. If

00:06:35.629 --> 00:06:37.790
thousands of people are clicking on the five

00:06:37.790 --> 00:06:39.670
rules of money, you make a video about the seven

00:06:39.670 --> 00:06:41.689
money rules of the rich. You're building on a

00:06:41.689 --> 00:06:44.329
proven foundation instead of just hoping for

00:06:44.329 --> 00:06:45.970
the best. Right. You're mitigating your risk.

00:06:46.350 --> 00:06:49.329
So probing question. Is it fair to say this is

00:06:49.329 --> 00:06:51.430
less about creativity and more about fitting

00:06:51.430 --> 00:06:53.850
a market signal? Exactly. It's letting the market

00:06:53.850 --> 00:06:56.009
tell you what it wants before you do any of the

00:06:56.009 --> 00:06:57.850
work. Okay. So you have your topic. Now you need

00:06:57.850 --> 00:06:59.790
the actual script. Section five is called retention

00:06:59.790 --> 00:07:02.910
engineering. I love that term. It sounds so industrial.

00:07:03.269 --> 00:07:06.009
It is industrial. It's about manufacturing attention.

00:07:06.370 --> 00:07:09.620
The guide is very clear about this. The modern

00:07:09.620 --> 00:07:13.120
attention span is fragile. It's like trying to

00:07:13.120 --> 00:07:16.060
hold water in your hands. To keep it, you need

00:07:16.060 --> 00:07:18.399
what it calls pattern interrupts. Define that

00:07:18.399 --> 00:07:20.519
for us. A pattern interrupt is basically a reset

00:07:20.519 --> 00:07:23.699
button for the brain. If a video stays the same,

00:07:23.959 --> 00:07:26.800
same tone, same visuals, same pace, for too long,

00:07:26.939 --> 00:07:29.560
the brain just tunes out. The guide says every

00:07:29.560 --> 00:07:32.319
90 seconds something has to change. You drop

00:07:32.319 --> 00:07:34.459
a controversial statement. You ask a rhetorical

00:07:34.459 --> 00:07:37.120
question. You throw in a surprising stat. It's

00:07:37.120 --> 00:07:39.199
like gently shaking the viewer when they start

00:07:39.199 --> 00:07:41.759
to doze off. Exactly. And it starts from the

00:07:41.759 --> 00:07:44.120
very first second. The guide has a super strict

00:07:44.120 --> 00:07:46.959
rule for the first five seconds. No greetings.

00:07:47.220 --> 00:07:49.300
But that's the classic YouTuber opening. Hey

00:07:49.300 --> 00:07:51.259
guys, welcome back to the channel. And in this

00:07:51.259 --> 00:07:53.579
niche, it's a retention killer. Nobody cares

00:07:53.579 --> 00:07:55.660
who you are yet. Instead, you start with a bold statement

00:07:55.660 --> 00:07:59.040
or even a lie. For example: most people will

00:07:59.040 --> 00:08:01.839
never be rich because they believe one dangerous

00:08:01.839 --> 00:08:04.199
lie. OK, I'm listening. I want to know what the

00:08:04.199 --> 00:08:07.399
lie is. That's the point. Then from second six

00:08:07.399 --> 00:08:10.399
to 30, you give them the promise. You say in

00:08:10.399 --> 00:08:12.100
the next 10 minutes, you'll learn exactly how

00:08:12.100 --> 00:08:14.620
to fix that. You hook them, you promise a solution,

00:08:14.860 --> 00:08:17.269
and then you can start the actual video. You

00:08:17.269 --> 00:08:19.389
know, reading through this, I had a bit of a

00:08:19.389 --> 00:08:22.310
vulnerable moment myself. The guide mentions

00:08:22.310 --> 00:08:24.550
you still have to read the AI draft out loud.

00:08:24.790 --> 00:08:27.389
I think I've definitely been guilty of just trusting

00:08:27.389 --> 00:08:30.089
the AI output too much. Oh, we all have, for

00:08:30.089 --> 00:08:32.830
sure. It's called prompt drift or just laziness.

00:08:32.889 --> 00:08:35.429
You get lazy, you paste the script into the voice

00:08:35.429 --> 00:08:37.610
generator, and your video ends up sounding like

00:08:37.610 --> 00:08:40.129
a robot reading a dictionary. You have to humanize

00:08:40.129 --> 00:08:42.250
it. You have to hear where a real person would

00:08:42.250 --> 00:08:44.929
breathe. The AI gets you maybe 80% of the way

00:08:44.929 --> 00:08:48.139
there, but that last 20%? That's on you. So,

00:08:48.299 --> 00:08:51.019
probing question. The biggest mistake beginners

00:08:51.019 --> 00:08:53.120
make in the first 10 seconds is just being too

00:08:53.120 --> 00:08:56.460
polite. Yes. Politeness is for the outro. The

00:08:56.460 --> 00:08:59.019
intro is for the hook. Period. Let's talk visuals.

00:08:59.259 --> 00:09:01.500
Since this is a faceless channel, you aren't

00:09:01.500 --> 00:09:04.320
on screen. But you also can't just use random

00:09:04.320 --> 00:09:07.139
stock footage. The guide has this workflow using

00:09:07.139 --> 00:09:09.879
a consistent character it calls Max. Yeah, this

00:09:09.879 --> 00:09:11.700
is where it gets really technical, but also really

00:09:11.700 --> 00:09:14.259
cool. The biggest problem with AI image generation

00:09:14.259 --> 00:09:17.590
is consistency. You generate a character for

00:09:17.590 --> 00:09:19.710
scene one, let's say an expert in a suit. He

00:09:19.710 --> 00:09:22.309
looks great. You ask for him again in scene two,

00:09:22.389 --> 00:09:24.750
looking at a chart, and suddenly he has a beard.

00:09:25.289 --> 00:09:27.889
Or his tie is different. It kills the immersion

00:09:27.889 --> 00:09:30.649
instantly. It destroys your authority. If your

00:09:30.649 --> 00:09:33.590
expert keeps shapeshifting, the viewer just subconsciously

00:09:33.590 --> 00:09:36.059
checks out. So the solution in the guide is to

00:09:36.059 --> 00:09:37.779
create a base character. We'll call him Max.

00:09:37.980 --> 00:09:40.720
But you use a very specific prompt. Yeah. Simple

00:09:40.720 --> 00:09:43.799
2D, hand-drawn, flat pastel colors, plain T-shirt.

00:09:43.980 --> 00:09:47.740
But why simple? Why not aim for, like, photorealism?

00:09:47.820 --> 00:09:50.539
Because simple lines are way easier for the AI

00:09:50.539 --> 00:09:53.539
to replicate consistently. A complex human face

00:09:53.539 --> 00:09:56.500
has too many variables. Pores, lighting, individual

00:09:56.500 --> 00:09:59.299
hairs. A doodle of a guy in a green shirt? The

00:09:59.299 --> 00:10:01.279
AI can draw that a thousand times and it'll look

00:10:01.279 --> 00:10:03.340
the same every time. And there's a specific trick

00:10:03.340 --> 00:10:06.220
mentioned, the reference image hack. Right. So

00:10:06.220 --> 00:10:08.720
you generate that one perfect base image of Max.

00:10:09.120 --> 00:10:11.500
Then you upload it back into a new Gemini chat

00:10:11.500 --> 00:10:14.690
and you tell the AI. This is Max. Use him for

00:10:14.690 --> 00:10:17.110
future reference. Now, every time you ask for

00:10:17.110 --> 00:10:19.649
a scene, Max at a desk, Max looking stressed,

00:10:19.909 --> 00:10:22.750
it uses that original image as its source data.

00:10:22.909 --> 00:10:25.230
You're basically training your own personal animator.

00:10:25.309 --> 00:10:27.789
That is incredibly clever, but static images

00:10:27.789 --> 00:10:30.210
are still boring for YouTube. Which brings us

00:10:30.210 --> 00:10:33.350
to Grok. And this, for me, was the real moment

00:10:33.350 --> 00:10:35.649
of wonder. You take that static doodle of Max,

00:10:35.769 --> 00:10:38.210
you put it into Grok Imagine, and you just add

00:10:38.210 --> 00:10:40.289
subtle motion. We're not talking about making

00:10:40.289 --> 00:10:42.289
an action movie. You're just making him blink.

00:10:42.669 --> 00:10:44.909
Or making his chest kind of rise and fall like

00:10:44.909 --> 00:10:47.269
he's breathing. Just enough movement to trick

00:10:47.269 --> 00:10:49.970
the eye. Yeah. It turns a JPEG into a video clip

00:10:49.970 --> 00:10:52.190
in just a few seconds. And it triggers that part

00:10:52.190 --> 00:10:54.289
of the brain that says, hey, this is high production

00:10:54.289 --> 00:10:56.769
value. Even though it's just a processed doodle.

00:10:56.870 --> 00:11:00.169
So, probing question. We use simple doodles instead

00:11:00.169 --> 00:11:02.990
of photorealism because consistency trumps detail.

00:11:03.450 --> 00:11:06.990
100%. A consistent cartoon is trustworthy. A

00:11:06.990 --> 00:11:09.370
morphing human is just creepy. Okay, we have

00:11:09.370 --> 00:11:12.210
the script. We have the visuals. Now sound. We

00:11:12.210 --> 00:11:15.370
mentioned ElevenLabs. Why that tool specifically?

00:11:15.710 --> 00:11:18.210
There are tons of AI voice generators out there.

00:11:18.409 --> 00:11:20.549
It's currently the gold standard for

00:11:20.549 --> 00:11:23.549
natural intonation. But the guide has a big warning

00:11:23.549 --> 00:11:26.009
about the settings. If you just leave it on default,

00:11:26.129 --> 00:11:29.870
it can sound, well, uncanny. You know that weird,

00:11:30.029 --> 00:11:33.610
hollow AI sound? Oh, yeah. I know it well. The

00:11:33.610 --> 00:11:35.889
guide recommends setting stability to around

00:11:35.889 --> 00:11:38.940
50 or 60 percent. If it's too high, the voice

00:11:38.940 --> 00:11:41.500
is monotone, too perfect. Too low, and it gets

00:11:41.500 --> 00:11:44.799
all crackly and weirdly emotional. 50% seems

00:11:44.799 --> 00:11:46.620
to be that sweet spot where it sounds like a

00:11:46.620 --> 00:11:49.019
confident person just talking to you. And there's

00:11:49.019 --> 00:11:50.700
a little trick in here for the free tier, too,

00:11:50.779 --> 00:11:53.159
because those ElevenLabs credits run out fast. The

00:11:53.159 --> 00:11:55.519
Gmail Plus trick. This is like the resourcefulness

00:11:55.519 --> 00:11:58.799
101. If your email is name@gmail.com, you

00:11:58.799 --> 00:12:01.159
can sign up for a new ElevenLabs account using

00:12:01.159 --> 00:12:03.240
name+1@gmail.com. And that actually works?

00:12:03.500 --> 00:12:07.320
Yeah. ElevenLabs sees it as a new user, but

00:12:07.320 --> 00:12:10.100
Gmail ignores the plus sign and everything after

00:12:10.100 --> 00:12:12.179
it, so the verification email still lands in

00:12:12.179 --> 00:12:14.580
your main inbox. You can do name plus two, name

00:12:14.580 --> 00:12:17.879
plus three, basically infinite trials. That is

00:12:17.879 --> 00:12:20.159
very sneaky. I like it. It's all about hacking

00:12:20.159 --> 00:12:23.379
the system to keep your overhead at zero. Now,

00:12:23.379 --> 00:12:26.200
editing. The guide lists a few specific rules.

00:12:26.700 --> 00:12:29.120
B-roll changes every five seconds, music at

00:12:29.120 --> 00:12:32.840
30%. But one rule seems totally non-negotiable.

00:12:33.309 --> 00:12:35.669
The eight-minute mark. The holy grail of duration.

00:12:35.990 --> 00:12:38.649
Why is eight minutes so critical? Why not seven?

00:12:38.789 --> 00:12:41.870
Why not ten? Mid-roll ads. This is purely a

00:12:41.870 --> 00:12:44.429
business decision. If your video is seven minutes

00:12:44.429 --> 00:12:47.149
and 59 seconds long, YouTube only lets you place

00:12:47.149 --> 00:12:49.350
ads at the beginning and the end. If it's eight

00:12:49.350 --> 00:12:51.029
minutes and one second, you can place ads in

00:12:51.029 --> 00:12:52.850
the middle of the video. So you literally double

00:12:52.850 --> 00:12:55.389
your potential ad inventory. Exactly. If you're

00:12:55.389 --> 00:12:57.710
at 7:30, you drag that intro out a little. You

00:12:57.710 --> 00:13:00.450
speak slower. You add a longer outro. You do

00:13:00.450 --> 00:13:02.149
whatever you have to do to cross that eight minute

00:13:02.149 --> 00:13:04.889
line or you are leaving money on the table. So

00:13:04.889 --> 00:13:08.769
probing question. The eight minute mark isn't

00:13:08.769 --> 00:13:10.909
an artistic choice. It's a revenue multiplier.

00:13:11.690 --> 00:13:14.090
It is the difference between a hobby and a business.

00:13:14.389 --> 00:13:16.649
So we've built the video. Now we have to upload

00:13:16.649 --> 00:13:18.789
it. And this is where the strategy shifts from

00:13:18.789 --> 00:13:22.090
creation to targeting. You can't just upload

00:13:22.090 --> 00:13:24.590
and pray. No, you're just shouting into a void

00:13:24.590 --> 00:13:27.490
unless you tell the algorithm exactly who this

00:13:27.490 --> 00:13:30.090
video is for. And remember, we want that high

00:13:30.090 --> 00:13:33.029
RPM. That means we want viewers in the USA, the

00:13:33.029 --> 00:13:36.289
UK, Canada, and Australia. How do you even force

00:13:36.289 --> 00:13:38.149
that? You can't really control who clicks on

00:13:38.149 --> 00:13:40.629
your video. You can signal it, though. Use American

00:13:40.629 --> 00:13:44.080
spelling. Realized with a Z, not an S. Reference

00:13:44.080 --> 00:13:46.940
U.S. dollars, not euros. Use New York in your

00:13:46.940 --> 00:13:50.120
examples, not, say, London. And crucially, upload

00:13:50.120 --> 00:13:52.820
when they're actually awake. The guide says 6

00:13:52.820 --> 00:13:55.259
to 9 p.m. Eastern Standard Time. You're tailoring

00:13:55.259 --> 00:13:57.500
the metadata to fit the audience profile you

00:13:57.500 --> 00:14:00.120
want to sell to advertisers. 100%. But then comes

00:14:00.120 --> 00:14:02.440
the hard part, section 11, the reality check.

00:14:02.639 --> 00:14:04.360
And I really appreciate that the guide includes

00:14:04.360 --> 00:14:06.679
this because up until this point, it all sounds

00:14:06.679 --> 00:14:09.490
a bit like a magic money printer. Just use AI

00:14:09.490 --> 00:14:12.629
and get rich. It does. It sounds way too easy.

00:14:12.850 --> 00:14:15.090
And this is where most people will fail. The

00:14:15.090 --> 00:14:18.690
reality is the grind. The guide states it clearly.

00:14:19.090 --> 00:14:23.210
This takes 8 to 11 hours of work per video. Wow.

00:14:23.629 --> 00:14:27.450
Let's just pause on that. 11 hours for a 10-minute

00:14:27.450 --> 00:14:30.429
video. I mean, think about it. Researching the

00:14:30.429 --> 00:14:33.710
viral topics, prompting ChatGPT, rewriting the

00:14:33.710 --> 00:14:36.090
script so it isn't robotic, generating maybe

00:14:36.090 --> 00:14:38.710
50 different images of Max, animating them in

00:14:38.710 --> 00:14:41.409
Grok, editing it all together, captioning. It

00:14:41.409 --> 00:14:44.070
is a full day of work. And the kicker? The kicker

00:14:44.070 --> 00:14:45.970
is that for your first five videos, you're going

00:14:45.970 --> 00:14:48.110
to be posting to a ghost town. The ghost town

00:14:48.110 --> 00:14:50.990
phase. You might get 100 views, maybe 50. And

00:14:50.990 --> 00:14:53.049
that's normal. The algorithm has no idea who

00:14:53.049 --> 00:14:55.049
you are yet. It's testing you. It needs data.

00:14:55.210 --> 00:14:58.289
So you're working an 11-hour day for... potentially

00:14:58.289 --> 00:15:01.269
zero dollars in return at first. For 90 days.

00:15:01.470 --> 00:15:03.409
Yeah. That is the commitment the guide asks

00:15:03.409 --> 00:15:05.669
for. Two videos a week for 90 days. That's what,

00:15:05.710 --> 00:15:08.570
24 videos? That is the valley of death you have

00:15:08.570 --> 00:15:10.809
to cross before the algorithm trusts you enough

00:15:10.809 --> 00:15:13.070
to start pushing your content to millions of

00:15:13.070 --> 00:15:16.990
people. It's a test of pure endurance. So probing

00:15:16.990 --> 00:15:20.269
question. What actually separates the winners

00:15:20.269 --> 00:15:23.480
from the losers in this model? Is it the quality

00:15:23.480 --> 00:15:25.980
of the AI art, the cleverness of the script?

00:15:26.200 --> 00:15:27.879
I don't think so. I really think it's psychological.

00:15:28.080 --> 00:15:30.899
It's about dealing with the math of that 90-day

00:15:30.899 --> 00:15:34.480
grind without quitting. The losers quit at video

00:15:34.480 --> 00:15:36.399
number seven because they worked 80 hours and

00:15:36.399 --> 00:15:39.360
only got 200 views. The winners know that video

00:15:39.360 --> 00:15:41.559
number 20 is where the compounding really starts.

00:15:41.899 --> 00:15:44.820
It's just pure, stubborn persistence. We're going

00:15:44.820 --> 00:15:46.779
to take a very short break. When we come back,

00:15:46.779 --> 00:15:49.240
we will recap the big idea and give you one final

00:15:49.240 --> 00:15:53.990
thought to take with you. And we are back. So

00:15:53.990 --> 00:15:57.129
we have unpacked this $20,000 Faceless Strategy.

00:15:57.409 --> 00:16:00.090
We've looked at the high RPM of the finance niche,

00:16:00.269 --> 00:16:02.769
the retention engineering of the scripts, the

00:16:02.769 --> 00:16:05.009
AI tools you need to build consistent characters,

00:16:05.110 --> 00:16:07.250
and the grueling reality of the work involved.

00:16:07.570 --> 00:16:09.750
It's a lot, but it's also a complete ecosystem

00:16:09.750 --> 00:16:11.830
when you look at it. If you had to distill this

00:16:11.830 --> 00:16:14.250
all down, what is the one big idea you think

00:16:14.250 --> 00:16:16.070
people should walk away with? That this isn't

00:16:16.070 --> 00:16:18.889
just about making videos. If you think of it

00:16:18.889 --> 00:16:21.830
that way, you'll probably fail. This is an exercise

00:16:21.830 --> 00:16:24.309
in retention engineering and algorithm training.

00:16:24.549 --> 00:16:27.809
You're engineering a piece of media to hold a

00:16:27.809 --> 00:16:30.429
person's attention, and you're training a machine,

00:16:30.429 --> 00:16:34.230
YouTube, to recognize you as a source of high-value

00:16:34.230 --> 00:16:38.389
inventory. It's a numbers game. High RPM

00:16:38.389 --> 00:16:42.769
niche plus consistent AI characters plus 90 days

00:16:42.769 --> 00:16:46.590
of volume. That equals what the guide calls the

00:16:46.590 --> 00:16:48.889
unfair advantage. It really reminds me of that

00:16:48.889 --> 00:16:50.970
quote at the very end of the guide. The best

00:16:50.970 --> 00:16:53.549
time to plant a tree was 20 years ago. The second

00:16:53.549 --> 00:16:57.149
best time is now. Exactly. The tech is basically

00:16:57.149 --> 00:16:59.710
free. The information is all out there. The only

00:16:59.710 --> 00:17:01.830
variable left is whether you can commit to those

00:17:01.830 --> 00:17:03.929
90 days. So don't just bookmark the strategy.

00:17:04.049 --> 00:17:05.769
If you're going to do it, you have to commit

00:17:05.769 --> 00:17:07.549
to the grind. We'll put a full list of all the

00:17:07.549 --> 00:17:08.849
tools we mentioned in the show notes for you.

00:17:08.890 --> 00:17:11.190
ElevenLabs, Gemini, Grok, all of them. Good

00:17:11.190 --> 00:17:13.470
luck with the pattern recognition. Thanks for

00:17:13.470 --> 00:17:15.430
listening to the Deep Dive. We will see you in

00:17:15.430 --> 00:17:15.849
the next one.
