WEBVTT

00:00:00.000 --> 00:00:02.779
Welcome back to the deep dive. You know, we love

00:00:02.779 --> 00:00:06.540
finding a really simple way to frame an overwhelmingly

00:00:06.540 --> 00:00:10.720
complex system. And today we are appearing deep

00:00:10.720 --> 00:00:12.800
into the architecture of modern communication,

00:00:12.800 --> 00:00:16.399
but we're using a lens that's entirely human.

00:00:16.589 --> 00:00:18.510
We're talking about computers that listen and

00:00:18.510 --> 00:00:21.089
computers that shout. Our mission today is really

00:00:21.089 --> 00:00:23.910
to synthesize that massive evolutionary jump

00:00:23.910 --> 00:00:28.149
in communication. I mean, from simple ephemeral

00:00:28.149 --> 00:00:31.629
sound waves to the instant global electronic

00:00:31.629 --> 00:00:34.009
signals that define our lives in the 21st century.

00:00:34.369 --> 00:00:36.429
When you look at all the complex systems connecting

00:00:36.429 --> 00:00:39.109
us, the networks, the servers, that tiny device

00:00:39.109 --> 00:00:41.560
in your pocket, it... it actually helps to think

00:00:41.560 --> 00:00:44.579
about them in these very human terms. We're contrasting

00:00:44.579 --> 00:00:47.619
the finite limits of our own biology with the

00:00:47.619 --> 00:00:49.939
boundless reach of technology. And to really

00:00:49.939 --> 00:00:52.259
understand the magnitude of that shift, I think

00:00:52.259 --> 00:00:53.880
we have to start with the human baseline. Where

00:00:53.880 --> 00:00:56.039
we came from. Exactly. When you strip away all

00:00:56.039 --> 00:00:58.140
the innovations, all the tech, human communication

00:00:58.140 --> 00:01:00.079
basically rests on two systems. The first one

00:01:00.079 --> 00:01:03.070
is the most basic. speech and hearing. That's

00:01:03.070 --> 00:01:04.950
the default human setting, right? Absolutely.

00:01:05.609 --> 00:01:09.090
Speech is, by its very nature, ephemeral. If

00:01:09.090 --> 00:01:10.629
I say something right now and you don't hear

00:01:10.629 --> 00:01:12.870
it, well, it's just gone forever. It's local.

00:01:13.090 --> 00:01:15.370
It only works over pretty short distances, you

00:01:15.370 --> 00:01:17.590
know, limited by my voice and the room we're

00:01:17.590 --> 00:01:21.030
in. The upside, though, it requires only our

00:01:21.030 --> 00:01:23.969
basic anatomy. It develops with basically no

00:01:23.969 --> 00:01:26.590
special training. It's just foundational. So

00:01:26.590 --> 00:01:29.829
quick, local, and temporary. We needed something

00:01:29.829 --> 00:01:31.430
more permanent, something that could go further,

00:01:31.469 --> 00:01:33.450
and that, I assume, brings us to the second system,

00:01:34.090 --> 00:01:36.750
writing. Writing and reading fundamentally changed

00:01:36.750 --> 00:01:39.329
the game. I mean, this system extended communication

00:01:39.329 --> 00:01:41.930
over distance, and maybe even more crucially,

00:01:42.230 --> 00:01:45.670
over time. Time. Right, messages became permanent,

00:01:45.969 --> 00:01:48.510
repeatable, transportable. One of the most fascinating

00:01:48.510 --> 00:01:51.430
ideas we pulled from the material is this concept

00:01:51.430 --> 00:01:54.209
that writers are, in essence, communicating with

00:01:54.209 --> 00:01:56.930
readers who are situated in the future. Communication

00:01:56.930 --> 00:01:59.200
through time. It's wild to think about that.

00:01:59.280 --> 00:02:02.099
You write down a law, a story, a philosophical

00:02:02.099 --> 00:02:04.379
text. You're basically projecting your voice

00:02:04.379 --> 00:02:06.640
hundreds, maybe thousands of years forward. It's

00:02:06.640 --> 00:02:09.860
the ultimate legacy system. It is. It's how civilizations

00:02:09.860 --> 00:02:14.120
persist. But, and here's the rub, even with writing...

00:02:13.900 --> 00:02:16.580
And later, you know, print, which let us make

00:02:16.580 --> 00:02:19.639
countless identical copies, the speed was still

00:02:19.639 --> 00:02:23.000
fundamentally limited. How so? It was bound by

00:02:23.000 --> 00:02:25.300
the speed at which that physical object, the

00:02:25.300 --> 00:02:28.000
paper or the tablet, could be physically transported.

00:02:28.539 --> 00:02:30.699
Think about that for a moment. For millennia,

00:02:31.139 --> 00:02:33.379
the speed of information was literally the speed

00:02:33.379 --> 00:02:37.319
of a horse or the fastest ship or a runner communicating

00:02:37.319 --> 00:02:39.379
from Rome to London. You're measuring that in

00:02:39.379 --> 00:02:41.780
weeks, maybe months. That governed everything.

00:02:41.979 --> 00:02:45.099
Commerce, politics, war. And this is where that

00:02:45.099 --> 00:02:47.240
speed limit, the speed of transport, imposed

00:02:47.240 --> 00:02:49.979
this profound constraint, both logistically and

00:02:49.979 --> 00:02:52.379
I think psychologically. You had to really commit

00:02:52.379 --> 00:02:55.060
to a message knowing the reply might take an

00:02:55.060 --> 00:02:57.840
eternity. Information traveled at the speed of

00:02:57.840 --> 00:03:00.599
matter. OK, so let's unpack this. If transportation

00:03:00.599 --> 00:03:02.919
was the ultimate speed limit for almost all of

00:03:02.919 --> 00:03:05.400
human history, what happens when time and distance

00:03:05.400 --> 00:03:08.360
aren't just shortened, but effectively annihilated?

00:03:08.639 --> 00:03:10.520
That's the moment the electrons enter the conversation.

00:03:10.719 --> 00:03:13.039
And what's so fascinating here is just how quickly

00:03:13.039 --> 00:03:15.439
the world shrank once we harnessed electrical

00:03:15.439 --> 00:03:18.479
signals. The source material points specifically

00:03:18.479 --> 00:03:21.379
to the telegraph. This machine carried messages

00:03:21.379 --> 00:03:23.800
across continents and eventually under oceans

00:03:23.800 --> 00:03:26.340
with what people at the time called unimaginable

00:03:26.340 --> 00:03:29.460
speed. We have to pause on that word. Unimaginable.

00:03:29.740 --> 00:03:33.400
For someone in, say, 1840, the idea you could

00:03:33.400 --> 00:03:35.900
communicate across 3 ,000 miles in a few seconds

00:03:35.900 --> 00:03:37.930
would have sounded like magic. or maybe something

00:03:37.930 --> 00:03:40.849
demonic. Suddenly, the only delay wasn't the

00:03:40.849 --> 00:03:43.689
journey. It was just a brief moment for a human

00:03:43.689 --> 00:03:46.669
to encode and decode the message. Tap, tap, tap.

00:03:46.969 --> 00:03:49.710
The physical barrier of space was just gone.

00:03:50.189 --> 00:03:52.539
Precisely. And that revolutionary shift, it leads

00:03:52.539 --> 00:03:54.860
directly to this concept of the global village.

00:03:55.020 --> 00:03:57.819
The term really captures this new reality that

00:03:57.819 --> 00:04:00.020
with electronic communication, everyone on the

00:04:00.020 --> 00:04:03.259
planet can effectively be within earshot of everyone

00:04:03.259 --> 00:04:05.300
else. That's a powerful metaphor, within earshot.

00:04:05.560 --> 00:04:07.860
It really is, because it connects that local,

00:04:08.080 --> 00:04:10.780
instinctual feeling of human speech. Being able

00:04:10.780 --> 00:04:14.479
to call out and be heard to a global infrastructure.

00:04:14.909 --> 00:04:17.589
And this applies from the very first telegraphs

00:04:17.589 --> 00:04:20.050
all the way up to our 21st century satellite

00:04:20.050 --> 00:04:23.550
-mediated world. That instant collapse of distance,

00:04:23.930 --> 00:04:26.250
it's foundational to understanding modern life.

00:04:26.389 --> 00:04:29.410
It forces us to deal with events instantly, no

00:04:29.410 --> 00:04:31.850
matter where they happen. So if the telegraph

00:04:31.850 --> 00:04:33.930
was the first electronic whisper across the ocean,

00:04:34.529 --> 00:04:38.189
today, the global shouting, this instantaneous

00:04:38.189 --> 00:04:40.410
capacity, it's become completely commonplace.

00:04:40.649 --> 00:04:42.470
Let's focus on the main instrument for this in

00:04:42.470 --> 00:04:45.529
the 21st century, the smartphone. The data suggests

00:04:45.529 --> 00:04:47.329
there are something like two billion of these

00:04:47.329 --> 00:04:49.110
devices out there. These are our personal tools

00:04:49.110 --> 00:04:51.769
for shouting. The scale is just astronomical.

00:04:52.069 --> 00:04:54.029
And it's a complex shout, too. It's not just

00:04:54.029 --> 00:04:56.829
a single acoustic sound. With these devices,

00:04:56.829 --> 00:04:59.290
you can compose a message in so many formats,

00:04:59.490 --> 00:05:02.449
text, an image, audio, a video, and send it instantly.

00:05:02.829 --> 00:05:05.569
The power isn't just in the speed. It's the reach

00:05:05.569 --> 00:05:08.220
and the modality, like you said. You can send

00:05:08.220 --> 00:05:10.439
that rich message to a few people, or you can

00:05:10.439 --> 00:05:12.459
blast it out to servers on the World Wide Web.

00:05:12.899 --> 00:05:14.699
Share it with anyone in the world who can find

00:05:14.699 --> 00:05:17.800
it. That's why we call it Shouting. It's designed

00:05:17.800 --> 00:05:21.029
for maximum reach with minimal effort. And we

00:05:21.029 --> 00:05:23.250
experience this every day, right? Often, without

00:05:23.250 --> 00:05:25.649
even realizing how truly phenomenal it is when

00:05:25.649 --> 00:05:27.509
you look at it against the backdrop of human

00:05:27.509 --> 00:05:30.709
history. Our sources give some great, really

00:05:30.709 --> 00:05:33.829
specific examples that show just how routine

00:05:33.829 --> 00:05:36.410
this global shouting has become. Okay, think

00:05:36.410 --> 00:05:39.149
about watching a big, broadcasted sporting event.

00:05:39.370 --> 00:05:41.689
You're on the West Coast, your friend is on the

00:05:41.689 --> 00:05:44.370
East Coast, and you're sending real -time texts,

00:05:44.629 --> 00:05:47.649
videos, jokes back and forth as the game unfolds.

00:05:47.709 --> 00:05:49.769
That's shouting across a continent in perfect

00:05:49.769 --> 00:05:52.740
sync. Or, what about the professional absurdities

00:05:52.740 --> 00:05:55.860
we all accept now? A huge team project, and one

00:05:55.860 --> 00:05:57.980
person needs to update the boss on some system

00:05:57.980 --> 00:06:00.000
upgrades. They're in an office in New England

00:06:00.000 --> 00:06:02.500
and the boss is in a hotel room in, I don't know,

00:06:02.660 --> 00:06:05.079
Helsinki, Finland, and they just slumped video

00:06:05.079 --> 00:06:07.779
chat. The physical locations are totally irrelevant.

00:06:08.339 --> 00:06:11.019
The message is instant and rich with detail.

00:06:11.620 --> 00:06:13.860
Or in a meeting, you could be in a conference

00:06:13.860 --> 00:06:16.620
room in Texas talking with people who are simultaneously

00:06:16.620 --> 00:06:19.240
in New York and California and maybe even Sydney.

00:06:19.389 --> 00:06:22.170
Wow. The time zones are the only real friction

00:06:22.170 --> 00:06:24.509
left. And this is where it gets really interesting

00:06:24.509 --> 00:06:27.990
for me. These mundane everyday interactions show

00:06:27.990 --> 00:06:30.949
that our shouting is no longer bound by local

00:06:30.949 --> 00:06:34.730
acoustics or a printed page or even oceans. We

00:06:34.730 --> 00:06:38.009
are constantly tirelessly and instantly shouting

00:06:38.009 --> 00:06:41.069
across the entire planet. The world genuinely

00:06:41.069 --> 00:06:44.069
has become within earshot of every citizen. And

00:06:44.069 --> 00:06:45.709
for every shout, there has to be a listener.

00:06:46.189 --> 00:06:48.910
Which brings us to the second, and I think maybe

00:06:48.910 --> 00:06:51.350
the more profound, half of this whole analogy.

00:06:51.709 --> 00:06:53.709
Exactly. It's easy to focus on the sending, on

00:06:53.709 --> 00:06:55.829
the shouting, but the listening capacity of these

00:06:55.829 --> 00:06:58.350
devices is the real game changer. Just think

00:06:58.350 --> 00:07:01.350
about the sheer volume. Uncountable emails, texts,

00:07:01.769 --> 00:07:03.910
social media posts, all flying across the internet

00:07:03.910 --> 00:07:06.589
at any given moment. The typical user might have

00:07:06.589 --> 00:07:08.790
hundreds of messages meant just for them every

00:07:08.790 --> 00:07:11.550
single day. If we had to rely on human listening...

00:07:11.600 --> 00:07:13.680
I mean, a person sitting there actually monitoring

00:07:13.680 --> 00:07:16.120
all those channels, they would collapse from

00:07:16.120 --> 00:07:18.759
exhaustion in a few hours. This is where the

00:07:18.759 --> 00:07:21.199
device's ability just dramatically surpasses

00:07:21.199 --> 00:07:24.100
our own limitations. Electronic devices listen

00:07:24.100 --> 00:07:27.339
with much more precision and with far, far greater

00:07:27.339 --> 00:07:30.040
patience and energy than any human possibly could.

00:07:30.699 --> 00:07:33.199
Precision is a huge point. If I'm in a loud room

00:07:33.199 --> 00:07:34.959
trying to listen to three different conversations,

00:07:35.300 --> 00:07:37.180
I'm going to miss details, I'll get distracted,

00:07:37.360 --> 00:07:40.040
I'll prioritize poorly. A phone doesn't have

00:07:40.040 --> 00:07:41.819
those problems. It doesn't. And the mechanics

00:07:41.819 --> 00:07:44.459
are really sophisticated. Your phone is configured

00:07:44.459 --> 00:07:47.899
not just to passively receive. It actively listens

00:07:47.899 --> 00:07:50.379
for specific messages meant for you, and then

00:07:50.379 --> 00:07:53.399
it notifies you. And crucially, it's contextual.

00:07:53.839 --> 00:07:56.199
It pays attention to who the sender is, the channel

00:07:56.199 --> 00:07:59.399
email, text, and app, and the nature of the message

00:07:59.399 --> 00:08:02.279
itself. And it varies the notification. A call

00:08:02.279 --> 00:08:04.360
from your kid might get a loud ring overriding

00:08:04.360 --> 00:08:07.300
a simple text from a mailing list. That detail,

00:08:07.660 --> 00:08:10.209
varying the notification. based on context. That's

00:08:10.209 --> 00:08:13.209
key. It's not just a dumb receiver. It's an active

00:08:13.209 --> 00:08:15.629
sorter. It's a prioritizer. We're outsourcing

00:08:15.629 --> 00:08:18.050
our attention filtering to the machine. But the

00:08:18.050 --> 00:08:20.850
real game changer, the profound difference, is

00:08:20.850 --> 00:08:23.509
just the tireless nature of the listening. Absolutely.

00:08:23.870 --> 00:08:26.050
The device listens for messages when the human

00:08:26.050 --> 00:08:28.769
is sound asleep, or busy, or just completely

00:08:28.769 --> 00:08:31.069
disconnected from a conversation. The phone never

00:08:31.069 --> 00:08:33.009
says, you know what, I'm tired. I'll check those

00:08:33.009 --> 00:08:35.250
tomorrow. And it listens to many channels at

00:08:35.250 --> 00:08:37.639
once. When you're asleep, your conscious brain

00:08:37.639 --> 00:08:40.500
is tuned out, but your phone is wide awake. It's

00:08:40.500 --> 00:08:43.519
simultaneously monitoring email, text, five social

00:08:43.519 --> 00:08:46.480
media feeds, app notifications. We've achieved

00:08:46.480 --> 00:08:49.600
simultaneous multi -channel tireless listening.

00:08:49.960 --> 00:08:51.679
And if you connect this to the bigger picture,

00:08:52.000 --> 00:08:54.409
this ability to listen without exhaustion. It

00:08:54.409 --> 00:08:56.309
profoundly changes our relationship with the

00:08:56.309 --> 00:08:59.629
flow of information. A human listener is finite.

00:09:00.230 --> 00:09:02.629
An electronic listener is effectively infinite

00:09:02.629 --> 00:09:04.950
in its capacity for precision and endurance.

00:09:05.450 --> 00:09:07.990
It guarantees our presence, even in our physical

00:09:07.990 --> 00:09:10.889
absence. So we've established the shout instant,

00:09:11.269 --> 00:09:13.929
global, multimodal, and the listen tireless,

00:09:14.269 --> 00:09:17.809
precise, and multi -channel. So what does this

00:09:17.809 --> 00:09:21.009
capacity actually mean in a real world context?

00:09:21.909 --> 00:09:24.870
The source material specifically points out how

00:09:24.870 --> 00:09:27.789
this is leveraged in education. Education is

00:09:27.789 --> 00:09:30.389
the perfect application, really, because it traditionally

00:09:30.389 --> 00:09:33.149
relied so heavily on that localized, ephemeral

00:09:33.149 --> 00:09:35.809
classroom model. The teacher speaking to the

00:09:35.809 --> 00:09:38.389
student face -to -face within a set hour. The

00:09:38.389 --> 00:09:40.669
listening and shouting capabilities just completely

00:09:40.669 --> 00:09:43.009
transcend that. So how does it extend the reach

00:09:43.009 --> 00:09:45.570
of the school? Well, you see immediate value

00:09:45.570 --> 00:09:48.230
in collapsing distance and time. First, educators

00:09:48.230 --> 00:09:50.389
and students can share details of classroom events

00:09:50.389 --> 00:09:52.370
instantly with the whole school community or

00:09:52.370 --> 00:09:54.710
with parents. More fundamentally, they can learn

00:09:54.710 --> 00:09:56.870
about world events just as they're happening,

00:09:57.070 --> 00:09:59.909
not days later. The classroom becomes globally

00:09:59.909 --> 00:10:02.809
responsive. That turns it into a real -time observation

00:10:02.809 --> 00:10:05.149
post, not just a place where history is discussed

00:10:05.149 --> 00:10:08.110
after the fact. Exactly. But maybe the most important

00:10:08.110 --> 00:10:10.309
application is extending the classroom community

00:10:10.309 --> 00:10:13.210
beyond the physical limits of the campus itself.

00:10:13.629 --> 00:10:16.789
Extending it to who? Like mentors or experts?

00:10:17.090 --> 00:10:19.519
Precisely. This technology allows the classroom

00:10:19.519 --> 00:10:22.659
to include vital resources like mentors, experts,

00:10:22.820 --> 00:10:25.279
peers, regardless of their physical location

00:10:25.279 --> 00:10:27.820
or their time zone. A student researching marine

00:10:27.820 --> 00:10:31.320
biology can communicate in real time or asynchronously

00:10:31.320 --> 00:10:34.100
with an expert who's actually studying the Great

00:10:34.100 --> 00:10:37.220
Barrier Reef. The ability to collaborate on projects

00:10:37.220 --> 00:10:39.899
across the globe becomes instantaneous. So instead

00:10:39.899 --> 00:10:42.299
of being limited to the expertise available within,

00:10:42.299 --> 00:10:44.379
you know, a five -mile radius of the school,

00:10:44.840 --> 00:10:47.080
the classroom can tap into the entire World Wide

00:10:47.080 --> 00:10:50.210
Web. Learning is no longer restricted to a building

00:10:50.210 --> 00:10:52.830
between certain hours. It transforms education

00:10:52.830 --> 00:10:55.970
from a finite event into a continuous conversation.

00:10:56.549 --> 00:10:58.990
This capacity to collapse distance and time,

00:10:59.129 --> 00:11:01.330
it completely transforms the traditional structure

00:11:01.330 --> 00:11:03.649
of learning. We're moving from the finite classroom

00:11:03.649 --> 00:11:05.710
to the infinite connected learning community.

00:11:06.039 --> 00:11:08.139
And that brings us to the end of our deep dive

00:11:08.139 --> 00:11:10.799
today. We started with the physical limits of

00:11:10.799 --> 00:11:12.600
the human body, ephemeral speech and written

00:11:12.600 --> 00:11:15.759
print, communication bound by the speed of transport.

00:11:16.059 --> 00:11:18.600
And we finished with 21st century devices that

00:11:18.600 --> 00:11:21.440
are capable of global instantaneous shouting,

00:11:21.980 --> 00:11:24.340
complimented by hyper precise multi -channel

00:11:24.340 --> 00:11:26.850
listening that quite literally never sleeps.

00:11:27.629 --> 00:11:29.730
We've gone from the human voice, which required

00:11:29.730 --> 00:11:32.269
no special training, but was purely local and

00:11:32.269 --> 00:11:34.830
temporary, to a technological extension of our

00:11:34.830 --> 00:11:37.840
senses that is global, immediate, permanent.

00:11:38.059 --> 00:11:40.220
It's a mind -bending shift and it happened in

00:11:40.220 --> 00:11:42.360
less than two centuries. And this raises the

00:11:42.360 --> 00:11:44.659
final question for you, the listener, to mull

00:11:44.659 --> 00:11:47.539
over. If human attention and physical presence

00:11:47.539 --> 00:11:49.899
are inherently finite, you know, we need sleep,

00:11:49.899 --> 00:11:51.960
we get distracted, we can only monitor so much

00:11:51.960 --> 00:11:54.580
at once, what are the true implications when

00:11:54.580 --> 00:11:56.820
technology is configured to possess infinite

00:11:56.820 --> 00:11:59.559
patience and energy for listening? What happens

00:11:59.559 --> 00:12:01.799
to our right to be absent or our expectation

00:12:01.799 --> 00:12:03.779
of silence when the machine never stops paying

00:12:03.779 --> 00:12:06.139
attention? That's the boundary we're all living

00:12:06.139 --> 00:12:06.799
on right now.
