Search
Search titles only
By:
Search titles only
By:
Log in
Register
Search
Search titles only
By:
Search titles only
By:
Menu
Install the app
Install
Forums
New posts
All threads
Latest threads
New posts
Trending threads
Trending
Search forums
What's new
New posts
New ads
New profile posts
Latest activity
Free Ads
Latest reviews
Search ads
Members
Current visitors
New profile posts
Search profile posts
Contact us
Latest ads
Colombo
Red Hat Certified System Administrator (RHCSA) - RHEL 10
Sanjeewani95
Updated:
Yesterday at 7:43 PM
NURSING , CAREGIVER , HOTEL & BEAUTY COURSES
IVA Para Medical Campus
Updated:
Thursday at 9:24 AM
Handmade Character Soft Toys Peppa Pig Family
anil1961
Updated:
Wednesday at 9:58 PM
Ad icon
Video Content Creator
pramukag
Updated:
Sunday at 6:10 AM
Ad icon
QA Engineer Intern
pramukag
Updated:
Sunday at 6:07 AM
Electronics
Vehicles
Property
Search
Reply to thread
Forums
General
ElaKiri Talk!
Why building GPT-8 is currently impossible?
Get the App
JavaScript is disabled. For a better experience, please enable JavaScript in your browser before proceeding.
You are using an out of date browser. It may not display this or other websites correctly.
You should upgrade or use an
alternative browser
.
Message
<blockquote data-quote="SLHodahitha" data-source="post: 29592163" data-attributes="member: 565060"><p><span style="font-size: 15px"><strong>Training an AI takes three things:</strong></span></p><ul> <li data-xf-list-type="ul"><span style="font-size: 15px">Compute (ie computing power, hardware, chips)</span></li> <li data-xf-list-type="ul"><span style="font-size: 15px">Electricity (to power the compute)</span></li> <li data-xf-list-type="ul"><span style="font-size: 15px">Training data</span><br /> <br /> <em><span style="font-size: 15px"><strong><u>Compute</u></strong></span></em><span style="font-size: 15px"><br /> Compute is measured in floating point operations (FLOPs). GPT-3 took 10^23 FLOPs to train, and GPT-4 plausibly 10^25. <br /> The capacity of all the computers in the world is about 10^21 FLOP/second, so they could train GPT-4 in 10^4 seconds (ie two hours). Since OpenAI has fewer than all the computers in the world, it took them six months. This suggests OpenAI was using about 1/2000th of all the computers in the world during that time.<br /> </span><br /> <br /> <u><strong><span style="font-size: 15px"><em><strong><u>Energy</u></strong></em></span></strong></u><br /> <span style="font-size: 15px">GPT-4 took about <a href="https://www.ri.se/en/news/blog/generative-ai-does-not-run-on-thin-air" target="_blank">50 gigawatt-hours</a> of energy to train. Using our scaling factor of 30x, we expect GPT-5 to need 1,500, GPT-6 to need 45,000, and GPT-7 to need 1.3 million</span><br /> <br /> <br /> <u><strong><span style="font-size: 15px"><em><strong><u>Training Data</u></strong></em></span></strong></u><br /> <span style="font-size: 15px">This is the text or images or whatever that the AI reads to understand how its domain works. <a href="https://lambdalabs.com/blog/demystifying-gpt-3" target="_blank">GPT-3</a> used 300 billion tokens. <a href="https://www.springboard.com/blog/data-science/machine-learning-gpt-3-open-ai/" target="_blank">GPT-4</a> used 13 trillion tokens (another source says 6 trillion).<br /> <br /> <img src="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e29cdd8-3523-4386-b574-7a6f7c7fb0e4_1083x748.png" alt="" class="fr-fic fr-dii fr-draggable " style="" /></span></li> <li data-xf-list-type="ul"><span style="font-size: 15px"><strong>GPT-5</strong> might need about 1% the world’s computers, a small power plant’s worth of energy, and a lot of training data.</span></li> <li data-xf-list-type="ul"><span style="font-size: 15px"><strong>GPT-6</strong> might need about 10% of the world’s computers, a large power plant’s worth of energy, and more training data than exists. Probably this looks like a town-sized data center attached to a lot of solar panels or a nuclear reactor.</span></li> <li data-xf-list-type="ul"><span style="font-size: 15px"><strong>GPT-7</strong> might need all of the world’s computers, a gargantuan power plant beyond any that currently exist, and <em>way</em> more training data than exists. Probably this looks like a city-sized data center attached to a fusion plant.</span></li> <li data-xf-list-type="ul"><span style="font-size: 15px"><strong>Building GPT-8 is currently impossible.</strong> Even if you solve synthetic data and fusion power, and you take over the whole semiconductor industry, you wouldn’t come close. Your only hope is that GPT-7 is superintelligent and helps you with this, either by telling you how to build AIs for cheap, or by growing the global economy so much that it can fund currently-impossible things.</span><span style="font-size: 18px"><br /> <br /> GPT = Generative Pre-trained Transformer = කලින් පුහුනු කල තොරතුරු ඇසුරෙන් දෙයක් මනුස්සයෙක් කරන විදියත විස්තර කරන්න හදන එක වගේ තෙරුමක්</span><span style="font-size: 9px"><br /> <br /> <a href="https://www.cnbc.com/2023/05/10/microsoft-agrees-to-buy-power-from-sam-altman-backed-helion-in-2028.html" target="_blank">https://www.cnbc.com/2023/05/10/microsoft-agrees-to-buy-power-from-sam-altman-backed-helion-in-2028.html</a><br /> <a href="https://www.astralcodexten.com/p/sam-altman-wants-7-trillion" target="_blank">https://www.astralcodexten.com/p/sam-altman-wants-7-trillion</a></span></li> </ul></blockquote><p></p>
[QUOTE="SLHodahitha, post: 29592163, member: 565060"] [SIZE=4][B]Training an AI takes three things:[/B][/SIZE] [LIST] [*][SIZE=4]Compute (ie computing power, hardware, chips)[/SIZE] [*][SIZE=4]Electricity (to power the compute)[/SIZE] [*][SIZE=4]Training data[/SIZE] [I][SIZE=4][B][U]Compute[/U][/B][/SIZE][/I][SIZE=4] Compute is measured in floating point operations (FLOPs). GPT-3 took 10^23 FLOPs to train, and GPT-4 plausibly 10^25. The capacity of all the computers in the world is about 10^21 FLOP/second, so they could train GPT-4 in 10^4 seconds (ie two hours). Since OpenAI has fewer than all the computers in the world, it took them six months. This suggests OpenAI was using about 1/2000th of all the computers in the world during that time. [/SIZE] [U][B][SIZE=4][I][B][U]Energy[/U][/B][/I][/SIZE][/B][/U] [SIZE=4]GPT-4 took about [URL='https://www.ri.se/en/news/blog/generative-ai-does-not-run-on-thin-air']50 gigawatt-hours[/URL] of energy to train. Using our scaling factor of 30x, we expect GPT-5 to need 1,500, GPT-6 to need 45,000, and GPT-7 to need 1.3 million[/SIZE] [U][B][SIZE=4][I][B][U]Training Data[/U][/B][/I][/SIZE][/B][/U] [SIZE=4]This is the text or images or whatever that the AI reads to understand how its domain works. [URL='https://lambdalabs.com/blog/demystifying-gpt-3']GPT-3[/URL] used 300 billion tokens. [URL='https://www.springboard.com/blog/data-science/machine-learning-gpt-3-open-ai/']GPT-4[/URL] used 13 trillion tokens (another source says 6 trillion). [IMG]https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e29cdd8-3523-4386-b574-7a6f7c7fb0e4_1083x748.png[/IMG][/SIZE] [*][SIZE=4][B]GPT-5[/B] might need about 1% the world’s computers, a small power plant’s worth of energy, and a lot of training data.[/SIZE] [*][SIZE=4][B]GPT-6[/B] might need about 10% of the world’s computers, a large power plant’s worth of energy, and more training data than exists. Probably this looks like a town-sized data center attached to a lot of solar panels or a nuclear reactor.[/SIZE] [*][SIZE=4][B]GPT-7[/B] might need all of the world’s computers, a gargantuan power plant beyond any that currently exist, and [I]way[/I] more training data than exists. Probably this looks like a city-sized data center attached to a fusion plant.[/SIZE] [*][SIZE=4][B]Building GPT-8 is currently impossible.[/B] Even if you solve synthetic data and fusion power, and you take over the whole semiconductor industry, you wouldn’t come close. Your only hope is that GPT-7 is superintelligent and helps you with this, either by telling you how to build AIs for cheap, or by growing the global economy so much that it can fund currently-impossible things.[/SIZE][SIZE=5] GPT = Generative Pre-trained Transformer = කලින් පුහුනු කල තොරතුරු ඇසුරෙන් දෙයක් මනුස්සයෙක් කරන විදියත විස්තර කරන්න හදන එක වගේ තෙරුමක්[/SIZE][SIZE=1] [URL]https://www.cnbc.com/2023/05/10/microsoft-agrees-to-buy-power-from-sam-altman-backed-helion-in-2028.html[/URL] [URL]https://www.astralcodexten.com/p/sam-altman-wants-7-trillion[/URL][/SIZE] [/LIST] [/QUOTE]
Insert quotes…
Verification
Dahaya deken beduwama keeyada?
Post reply
Top
Bottom