Major Outage on ChatGPT on Web
OpenAI recently experienced an issue involving their training infrastructure, resulting in the unavailability of some services. This issue impacted their AI-assisted customer support system and its associated endpoints, as well as the OpenAI Platform and its API endpoints.
The incident was caused by a problem with the OpenAI's infrastructure, which was unable to properly serve connections and requests from users. This caused the platform’s performance to degrade, resulting in a partial outage. The team quickly identified the root cause of the problem and implemented a fix to restore functionality.
Overall, the incident lasted for around 40 minutes, during which time some users were unable to access the AI-assisted customer support system or complete some tasks on the OpenAI Platform. Following the resolution of the incident, the team continued to monitor the platform and related services.
In the future, the OpenAI team plans to improve their monitoring system infrastructure and the scalability of their services to prevent similar incidents from happening again. They have also committed to regularly updating their incident communication channels and providing better transparency to its users about what went wrong and how it was fixed.
Read more here: External Link