The Extreme Cost Of Training AI Models
The cost of training AI models has exploded in just the past year, according to data released by the research firm Epoch AI. This development aptly shows how much more complex and capable AI models have gotten in a short time span. Last year saw the release of ChatGPT-4 in March by OpenAI, which kickstarted the global AI hype. Google followed suit with its advanced AI model, Gemini, in December.
Both systems have been much more expensive to train than previous AI models and their development has potentially cost hundreds of millions of dollars, according to the Epoch AI release. The cost of training Gemini, which is a large language model that can be inputted with text, voice commands and images, reportedly stood between $30 and $191 million even before taking staff salaries into consideration. According to Epoch AI, these can make up 29% to 49% of the final price. ChatGPT-4, the latest edition, had a technical creation cost of $41 million to $78 million, according to the source. Sam Altman, CEO of OpenAI, has in the past said that the model has cost more than $100 million, confirming the calculations.
Looking back, the cost of earlier AI models was much lower. ChatGPT-3 cost only around $2 million to $4 million make in 2020, while Gemini’s precursor PaLM in 2022 took between $3 million and $12 million to train when only looking at the cost of computing. Even at these price points, keeping up with cutting-edge AI development might have proved difficult for academic or other public institutions that have traditionally been active in AI research.
At the 2023 cost estimates, it is basically impossible, as Epoch AI notes while mentioning the National AI Research Resource created by the Biden Administration in late 2023 as a potential remedy for this. It would grant researchers and students access to relevant AI tools and give out grants. However, it is still in its pilot phase. The executive order that created the resource mainly focuses on setting standards for AI safety and privacy—for example, strengthening consumer rights opposite algorithms as well as employees’ rights in the face of changes to the workplace.
AI For Consumption?
While it was updated to support voice and images in fall 2023, ChatGPT-4, like its name suggests, started out based around its central text input, while Gemini and its app have been designed as a multimodal LLM from the get-go. This explains why ChatGPT’s initial training cost might have been lower. On the other hand, Gemini’s general focus on app delivery—for example, prompting users to snap pictures with their smartphones, pick out features in them and have them analyzed—could have warranted a higher cost.
Gemini also includes e-commerce-relevant features like showing where to buy something that appears in a picture in a fashion similar to a Google (shopping) search. This shows how Google is applying its brand identity as a search engine to AI models, while AI-first company OpenAI had to forge its identity and strongpoints in the AI sphere from scratch. It also poses the question if in the future, AI will drift more toward commercial support functions, like anticipated by the Biden Administration, instead of original text creation, the most publicized feature of ChatGPT.
OpenAI’s text-to-image model, DALL-E, had a much lower cost in 2021 than LLMs created around this time, including ChatGPT’s version 3 from 2020. It only cost between $118,000 and $335,000 to make, according to Epoch AI. Its latest follow-up version DALL-E 3 is now part of extended ChatGPT versions for paying customers. The price for owned hardware is always lower than the cloud computing approach in this calculation as it uses amortized cost, meaning it only takes into account the share of the total lifetime cost of a hardware component relative to the time used to train the respective AI model.
Charted by Statista