All Systems Operational

NGC Core Services Operational
NGC Catalog Operational
Private Registry Operational
Base Command Operational
Fleet Command Operational
Cloud Functions Operational
NVIDIA Remote Attestation Service (NRAS) Operational
RIM Service Operational
OCSP Service Operational
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance

Scheduled Maintenance

Cloud Functions Apr 30, 2025 15:00-20:00 PDT

Cloud Functions must perform critical maintenance on Google H100 clusters, during which all functions deployed on the affected clusters will experience downtime. This is a required action to improve service uptime and reliability. The maintenance window is expected to last 5 hours. This event is scheduled to begin on April 30th, 2025, at 3:00 PM PDT (UTC-7).
Posted on Apr 23, 2025 - 14:57 PDT
Apr 25, 2025

No incidents reported today.

Apr 24, 2025
Resolved - Issues affecting the Base Command service have been resolved and the service is operating normally.
Apr 24, 18:27 PDT
Monitoring - A fix for the Base Command service issues has been applied and we are monitoring the service for stability.
Apr 24, 17:41 PDT
Identified - The cause of issues with Base Command service has been identified and a fix is being applied.
Apr 24, 16:28 PDT
Investigating - The Base Command service is currently experiencing issues which are affecting users.

The issues are being actively investigated and updates will be posted on this status page.

Apr 24, 15:14 PDT
Resolved - Issues affecting the Private Registry service have been resolved and the service is operating normally.
Apr 24, 15:13 PDT
Monitoring - A fix for the Private Registry service issues has been applied and we are monitoring the service for stability.
Apr 24, 12:40 PDT
Identified - The cause of issues with Private Registry service has been identified and a fix is being applied.
Apr 24, 12:02 PDT
Investigating - The Private Registry service is currently experiencing issues which are affecting users.

The issues are being actively investigated and updates will be posted on this status page.

Apr 24, 11:48 PDT
Apr 23, 2025
Resolved - This incident has been resolved.
Apr 23, 09:45 PDT
Identified - We have identified, that the issue is still ongoing. We are investigating it further.
Apr 23, 04:10 PDT
Monitoring - Multiple NVCF function deployment calls were recently failing with error 504 Gateway timeout to NGC API. We have implemented a fix and are monitoring the situation.
Apr 23, 03:59 PDT
Apr 22, 2025

No incidents reported.

Apr 21, 2025

No incidents reported.

Apr 20, 2025

No incidents reported.

Apr 19, 2025

No incidents reported.

Apr 18, 2025

No incidents reported.

Apr 17, 2025

No incidents reported.

Apr 16, 2025
Resolved - This incident has been resolved.
Apr 16, 09:47 PDT
Monitoring - A fix has been implemented and we are monitoring the results.
Apr 16, 09:36 PDT
Investigating - Multiple NVCF function deployment calls are failing with error 504 Gateway timeout to NGC API. Please bear with us as we investigate and address this issue. Thank you.
Apr 16, 07:15 PDT
Apr 15, 2025

No incidents reported.

Apr 14, 2025

No incidents reported.

Apr 13, 2025

No incidents reported.

Apr 12, 2025

No incidents reported.

Apr 11, 2025

No incidents reported.