1
00:00:00,000 --> 00:00:02,340
Okay, chat GPT's voice feature.

2
00:00:02,340 --> 00:00:03,840
You sent us quite a bit about this.

3
00:00:03,860 --> 00:00:07,080
So robotic voices, fun accents.

4
00:00:07,320 --> 00:00:09,120
This feels like something bigger, right?

5
00:00:09,280 --> 00:00:10,000
It really is.

6
00:00:10,000 --> 00:00:10,380
Yeah.

7
00:00:10,420 --> 00:00:13,520
It's like how we talk to machines is totally changing.

8
00:00:13,520 --> 00:00:14,640
We're not just typing anymore.

9
00:00:14,640 --> 00:00:18,160
It's becoming a conversation almost like, like we're talking to another person.

10
00:00:18,200 --> 00:00:19,240
Yeah, exactly.

11
00:00:19,260 --> 00:00:21,300
There's nuance now, even emotion.

12
00:00:21,300 --> 00:00:22,280
It's wild.

13
00:00:22,500 --> 00:00:22,760
Okay.

14
00:00:22,760 --> 00:00:23,120
Hold on.

15
00:00:23,120 --> 00:00:28,400
Because the transcript mentions someone using just their voice to browse the web.

16
00:00:28,400 --> 00:00:32,680
Like ordering food, making appointments, even paying bills.

17
00:00:33,120 --> 00:00:34,560
All just by talking.

18
00:00:34,560 --> 00:00:35,360
That's yeah.

19
00:00:35,360 --> 00:00:38,480
And that's a perfect example of how this tech is making things easier.

20
00:00:38,480 --> 00:00:42,120
Imagine tech just gets us, you know, like it just responds to how we naturally speak.

21
00:00:42,120 --> 00:00:43,200
But is that good?

22
00:00:43,200 --> 00:00:43,760
Yeah.

23
00:00:43,760 --> 00:00:46,720
I mean, it's cool, but a little creepy too, right?

24
00:00:46,720 --> 00:00:47,440
Sorry, go on.

25
00:00:47,440 --> 00:00:47,840
You're saying.

26
00:00:47,840 --> 00:00:52,800
Oh, no, it's just, it's definitely both exciting and a bit unsettling, but that's technology, right?

27
00:00:52,800 --> 00:00:53,360
True.

28
00:00:53,360 --> 00:00:56,560
Now back to this transcript, it keeps mentioning custom instructions.

29
00:00:56,560 --> 00:00:57,360
What are those?

30
00:00:57,360 --> 00:00:58,560
They sound important.

31
00:00:58,560 --> 00:00:58,960
Okay.

32
00:00:58,960 --> 00:01:05,680
So custom instructions, think of it like you're telling chat GPT how to be like giving it

33
00:01:05,680 --> 00:01:08,880
guidelines or even a personality, a personality.

34
00:01:08,880 --> 00:01:09,440
Okay.

35
00:01:09,440 --> 00:01:13,760
I need an example because like you could say you're a tutor and you're teaching me business

36
00:01:13,760 --> 00:01:15,920
English and boom, it changes how it talks to you.

37
00:01:15,920 --> 00:01:17,680
So it's way more than just picking an accent.

38
00:01:17,680 --> 00:01:21,280
It's like building the whole vibe of how chat GPT interacts.

39
00:01:21,280 --> 00:01:22,160
Exactly.

40
00:01:22,160 --> 00:01:24,880
Which makes sense why the transcript talks about tutoring so much, right?

41
00:01:24,880 --> 00:01:25,840
Absolutely.

42
00:01:25,840 --> 00:01:28,400
Imagine you need to practice for a big presentation.

43
00:01:28,400 --> 00:01:32,080
You've got chat GPT there 24 seven ready to help.

44
00:01:32,080 --> 00:01:32,400
Okay.

45
00:01:32,400 --> 00:01:36,000
But is it really like having someone else there though, because that's starting to feel

46
00:01:36,000 --> 00:01:40,560
like that movie her, you know, where he falls for his AI.

47
00:01:40,560 --> 00:01:42,560
That's what's so crazy about this tech, right?

48
00:01:42,560 --> 00:01:46,640
It makes you ask those questions like what is intelligence, you know, what separates

49
00:01:46,640 --> 00:01:50,080
us from machines if they can sound and even feel well human.

50
00:01:50,080 --> 00:01:50,400
Okay.

51
00:01:50,400 --> 00:01:53,280
But the transcript also talks about language learning.

52
00:01:53,280 --> 00:01:54,640
Hold on and translate.

53
00:01:54,640 --> 00:01:58,800
Translate and it'll be practice way more than basic translation chat.

54
00:01:58,800 --> 00:02:03,440
GPT can help with pronunciation vocabulary, even like the little cultural things you need

55
00:02:03,440 --> 00:02:03,920
to know.

56
00:02:03,920 --> 00:02:04,880
So I'm going to Paris.

57
00:02:06,160 --> 00:02:09,360
I could have a conversation with chat GPT and it'd be like, I'm actually talking to

58
00:02:09,360 --> 00:02:10,160
a Parisian.

59
00:02:10,160 --> 00:02:10,400
Yeah.

60
00:02:10,960 --> 00:02:12,320
Slang the whole deal.

61
00:02:12,320 --> 00:02:12,960
That's wild.

62
00:02:12,960 --> 00:02:13,200
Okay.

63
00:02:13,200 --> 00:02:14,320
But how good are the accents though?

64
00:02:14,320 --> 00:02:15,040
Really?

65
00:02:15,040 --> 00:02:15,680
Be honest.

66
00:02:15,680 --> 00:02:16,800
Oh, it's getting good.

67
00:02:18,000 --> 00:02:23,360
Really good from standard American British, even to more specific dialects, you know,

68
00:02:23,360 --> 00:02:24,800
but it's not like perfect.

69
00:02:24,800 --> 00:02:29,920
We're just still in development, but it's pretty amazing how well it captures how we

70
00:02:29,920 --> 00:02:30,640
talk.

71
00:02:30,640 --> 00:02:30,960
Okay.

72
00:02:30,960 --> 00:02:32,480
That's kind of freaky really a little bit.

73
00:02:32,480 --> 00:02:32,960
Yeah.

74
00:02:32,960 --> 00:02:34,320
It is kind of weird though, right?

75
00:02:34,320 --> 00:02:39,920
Like hearing a computer talk like a person makes you wonder when does it become, you

76
00:02:39,920 --> 00:02:41,760
know, more than just a computer talking.

77
00:02:41,760 --> 00:02:42,960
It's almost too real.

78
00:02:42,960 --> 00:02:44,160
That's that's kind of creepy.

79
00:02:44,160 --> 00:02:44,800
Exactly.

80
00:02:44,800 --> 00:02:49,360
And that's something we got to think about as this tech gets, you know, even better

81
00:02:49,920 --> 00:02:52,080
because it could be, I don't know, dangerous.

82
00:02:52,080 --> 00:02:55,520
I wouldn't say dangerous, but we got to be careful.

83
00:02:56,160 --> 00:02:58,640
But there's good stuff too, especially for like accessibility.

84
00:02:59,200 --> 00:02:59,520
Okay.

85
00:02:59,520 --> 00:03:00,080
How so?

86
00:03:00,880 --> 00:03:04,320
And the transcript mentions voice API here.

87
00:03:05,200 --> 00:03:06,080
What is that?

88
00:03:06,080 --> 00:03:06,400
Yeah.

89
00:03:06,400 --> 00:03:09,920
So API sounds super technical, but it's actually pretty simple.

90
00:03:10,560 --> 00:03:14,720
Think of it like chat GPT's voice can work with other apps now.

91
00:03:14,720 --> 00:03:16,320
Oh, so other people can use it.

92
00:03:16,320 --> 00:03:16,800
Exactly.

93
00:03:16,800 --> 00:03:18,880
It's like the possibilities are endless now.

94
00:03:18,880 --> 00:03:21,360
We're not just talking to chat GPT on our phones.

95
00:03:21,360 --> 00:03:22,400
It could be in anything.

96
00:03:22,400 --> 00:03:25,440
Wait, so like instead of just talking to my phone, I could talk to like.

97
00:03:25,440 --> 00:03:28,560
Your fridge, your car, basically anything.

98
00:03:28,560 --> 00:03:29,200
Okay.

99
00:03:29,200 --> 00:03:32,160
That's both cool and scary at the same time.

100
00:03:32,160 --> 00:03:34,320
Like what happens when everything has a voice?

101
00:03:34,320 --> 00:03:36,160
It's a whole new world, right?

102
00:03:36,160 --> 00:03:38,480
But there are definitely some ethical things to consider.

103
00:03:38,480 --> 00:03:41,440
Like in the transcript, they mentioned AI companions.

104
00:03:41,440 --> 00:03:41,760
Yeah.

105
00:03:41,760 --> 00:03:42,720
AI friends.

106
00:03:42,720 --> 00:03:46,400
It's like, are we going to start needing robots to cure our loneliness now?

107
00:03:46,400 --> 00:03:47,280
It's a tough one.

108
00:03:47,280 --> 00:03:51,680
Some people might really benefit from that kind of companionship, especially people who

109
00:03:51,680 --> 00:03:54,640
have trouble, you know, connecting with others.

110
00:03:54,640 --> 00:03:57,760
But then it's like, are we just going to become more isolated because of tech?

111
00:03:57,760 --> 00:03:58,960
It's like a double-edged sword.

112
00:03:58,960 --> 00:03:59,360
Right.

113
00:03:59,360 --> 00:03:59,920
For sure.

114
00:03:59,920 --> 00:04:02,000
It's something we're going to have to figure out as we go.

115
00:04:02,000 --> 00:04:05,200
But back to what I was saying about the API in games.

116
00:04:05,200 --> 00:04:05,600
Oh yeah.

117
00:04:06,240 --> 00:04:07,920
AI characters in video games.

118
00:04:07,920 --> 00:04:08,240
Yeah.

119
00:04:08,240 --> 00:04:09,680
That's got to be a game changer.

120
00:04:09,680 --> 00:04:10,080
Right.

121
00:04:10,080 --> 00:04:14,400
Imagine talking to a character and their responses are different every time because

122
00:04:14,400 --> 00:04:15,760
they're learning from you.

123
00:04:15,760 --> 00:04:18,640
So it's not just like the same old dialogue options.

124
00:04:18,640 --> 00:04:20,560
It's like you're having a real conversation.

125
00:04:20,560 --> 00:04:20,800
Yeah.

126
00:04:21,440 --> 00:04:23,600
And it could change how stories are told in games.

127
00:04:23,600 --> 00:04:27,360
Instead of following a script, you're creating a unique experience.

128
00:04:27,360 --> 00:04:28,080
That's amazing.

129
00:04:28,960 --> 00:04:29,200
Okay.

130
00:04:29,200 --> 00:04:32,000
But what about more realistic stuff?

131
00:04:32,000 --> 00:04:35,200
Like the transcript mentioned using it as a universal translator.

132
00:04:35,200 --> 00:04:36,000
Oh yeah.

133
00:04:36,000 --> 00:04:36,640
Think about it.

134
00:04:36,640 --> 00:04:39,840
You could travel anywhere and have a conversation with anyone.

135
00:04:39,840 --> 00:04:41,280
No more language barriers.

136
00:04:41,280 --> 00:04:42,160
That would be incredible.

137
00:04:42,160 --> 00:04:45,520
No more awkward hand gestures or trying to find someone who speaks English.

138
00:04:45,520 --> 00:04:46,320
Exactly.

139
00:04:46,320 --> 00:04:51,280
And think about what that could mean for like global communication and diplomacy.

140
00:04:51,280 --> 00:04:53,920
If we could all just understand each other, imagine the possibility.

141
00:04:53,920 --> 00:04:55,040
Seriously.

142
00:04:55,040 --> 00:04:59,040
But all of that is still using chat GPT through like an app or a website, right?

143
00:04:59,040 --> 00:05:03,040
But this API I was talking about, that's where things get really interesting.

144
00:05:03,040 --> 00:05:05,360
So we could be talking to like anything.

145
00:05:05,360 --> 00:05:07,760
Our houses, our cars, even like our toasters.

146
00:05:07,760 --> 00:05:08,560
Exactly.

147
00:05:08,560 --> 00:05:09,120
Think about it.

148
00:05:09,120 --> 00:05:13,760
Like your refrigerator tells you what to make for dinner or your car warns you about traffic.

149
00:05:13,760 --> 00:05:17,520
But it's like a conversation, not a robot voice.

150
00:05:17,520 --> 00:05:17,840
Okay.

151
00:05:17,840 --> 00:05:21,360
But back to the AI friends thing for a second, because that's still messing with me.

152
00:05:21,840 --> 00:05:28,720
What happens when the tech gets so good, people would rather hang out with their AI buddy than a real person.

153
00:05:28,720 --> 00:05:29,840
It's a valid concern.

154
00:05:29,840 --> 00:05:31,680
And honestly, we don't have the answer yet.

155
00:05:32,640 --> 00:05:36,160
On one hand, imagine someone who's lonely, isolated.

156
00:05:36,160 --> 00:05:38,720
An AI companion could be amazing for them.

157
00:05:38,720 --> 00:05:39,280
Right.

158
00:05:39,280 --> 00:05:42,960
Like company without the pressure of, you know, actual human interaction.

159
00:05:42,960 --> 00:05:43,840
Exactly.

160
00:05:43,840 --> 00:05:46,960
But it could also make some people even more withdrawn, right?

161
00:05:46,960 --> 00:05:51,200
Why bother with real relationships when your AI buddy gets you perfectly?

162
00:05:51,200 --> 00:05:54,240
It's like, does tech bring us together or make us more alone?

163
00:05:54,240 --> 00:05:56,000
It's that age old question.

164
00:05:56,000 --> 00:05:57,600
And it's tougher to answer than ever.

165
00:05:57,600 --> 00:05:58,560
It really is.

166
00:05:58,560 --> 00:06:00,720
Tech, AI, it's all just tools.

167
00:06:00,720 --> 00:06:02,960
It's up to us to use them wisely.

168
00:06:02,960 --> 00:06:07,520
But with AI, especially now that it can sound and feel so real, we got to be careful.

169
00:06:07,520 --> 00:06:10,160
Which reminds me, we didn't touch on the ethical side much.

170
00:06:10,160 --> 00:06:11,200
Oh, right.

171
00:06:11,200 --> 00:06:12,400
The transcript did mention that.

172
00:06:12,400 --> 00:06:17,280
Like, what happens when AI is so good at faking emotions, we can't tell if it's real or not.

173
00:06:17,280 --> 00:06:18,560
It's freaky, right?

174
00:06:18,560 --> 00:06:23,840
Imagine you're pouring your heart out to someone and it turns out to be a computer program the whole time.

175
00:06:23,840 --> 00:06:26,000
What does that even mean for being human, you know?

176
00:06:26,000 --> 00:06:27,680
It's like we're living in a sci-fi movie.

177
00:06:27,680 --> 00:06:30,240
And it's exciting, but also kind of terrifying.

178
00:06:30,240 --> 00:06:30,960
Totally.

179
00:06:30,960 --> 00:06:35,120
But even with all the what-ifs, there's a ton of good that can come from this.

180
00:06:35,120 --> 00:06:35,920
Think about it.

181
00:06:35,920 --> 00:06:42,240
AI could help people who have trouble socializing, give them a safe space to practice, or even in education,

182
00:06:42,240 --> 00:06:47,680
imagine a world where every kid has a personalized tutor that adapts to how they learn best.

183
00:06:47,680 --> 00:06:49,920
OK, now that's the kind of future I get behind.

184
00:06:49,920 --> 00:06:50,400
Yeah.

185
00:06:50,400 --> 00:06:52,160
But it feels like we're only scratching the surface here.

186
00:06:52,160 --> 00:06:52,720
We are.

187
00:06:52,720 --> 00:06:55,360
This tech is so new, we're all figuring it out as we go.

188
00:06:55,360 --> 00:06:56,320
Exactly.

189
00:06:56,320 --> 00:06:58,560
So, listeners, we want to hear from you.

190
00:06:58,560 --> 00:07:03,600
Is ChatGPT's voice feature just a cool party trick, or is this the start of something much bigger?

191
00:07:03,600 --> 00:07:06,000
Let us know.

192
00:07:06,000 --> 00:07:10,000
And that's it for our deep dive into the world of AI and voice technology.

193
00:07:10,000 --> 00:07:34,400
Thanks for listening.

