Industry News

Microsoft Developed AI Chip to Reduce the Cost of Running Generative Artificial Intelligence

Views : 15
Update time : 2023-05-03 14:55:41
        Dylan-Patel, principal analyst at global semiconductor research firm SemiAnalysis, recently said that OpenAI could spend up to $700,000 per day to run ChatGPT because it runs on expensive computing infrastructure.
 
 
        Whether it's writing a cover letter, generating a lesson plan, helping users optimize their profiles, or analyzing things based on facts or assumptions, ChatGPT requires a lot of computing power to provide feedback based on user input, which comes from expensive servers, Dylan-Patel said.
        Both Dylan-Patel and his colleague Afzal-Ahmad argue that while it may cost hundreds of millions of dollars to train the big language model behind ChatGPT, its operating costs or the content production behind it will be much higher, even with any reasonable deployment size that far exceeds its training costs.
        Microsoft is rumored to be developing an AI chip codenamed "Athena" to reduce the cost of running generative AI models. The report says the project has been in production since 2019 and is available for testing by a small group of Microsoft and OpenAI employees.
Microsoft previously reached a $1 billion investment agreement with OpenAI that requires OpenAI to run its models only on Microsoft's Azure cloud servers. This follows news that shortages have led Microsoft to ration GPUs for some internal teams, and that NVIDIA's processors sell for a premium, so Microsoft expects to run them more cheaply for the same workload.
        In addition to powerful performance, Nvidia's chips have significant software advantages, with most AI workloads designed for them and decades of developer experience. Microsoft currently has about 300-plus employees working on the chip.
        Sources said the chip could be released as early as next year for internal use by Microsoft and OpenAI, to which Microsoft did not officially respond, but whether it will also be used by Azure customers is still under discussion. Google has developed its own line of AI chips, TPU, and is currently the only competitor chip developing LLM, while Amazon has its own alternative product line, Trainium.

 
Related News
Read More >>
How many chips does a car need? How many chips does a car need?
Sep .19.2024
Automotive chips can be divided into four types according to their functions: control (MCU and AI chips), power, sensors, and others (such as memory). The market is monopolized by international giants. The automotive chips people often talk about refer to
Position and Function of Main Automotive Sensors Position and Function of Main Automotive Sensors
Sep .18.2024
The function of the air flow sensor is to convert the amount of air inhaled into the engine into an electrical signal and provide it to the electronic control unit (ECU). It is the main basis for determining the basic fuel injection volume. Vane type: The
Chip: The increasingly intelligent electronic brain Chip: The increasingly intelligent electronic brain
Sep .14.2024
In this era of rapid technological development, we often marvel at how mobile phones can run various application software smoothly, how online classes can be free of lag and achieve zero latency, and how the functions of electronic devices are becoming mo
LDA100 Optocoupler: Outstanding Performance, Wide Applications LDA100 Optocoupler: Outstanding Performance, Wide Applications
Sep .13.2024
In terms of characteristics, LDA100 is outstanding. It offers AC and DC input versions for optional selection, enabling it to work stably in different power supply environments. The small 6-pin DIP package not only saves space but also facilitates install