All Systems Operational

Hive Login Node: Operational (100.0% uptime over the past 90 days)
Hive Storage: Operational (100.0% uptime over the past 90 days)
Compute Nodes: Operational (99.96% uptime over the past 90 days)
GPU Nodes: Operational (100.0% uptime over the past 90 days)

Scheduled Maintenance

Mandatory NVIDIA driver update: May 6, 2025, 08:00-20:00 PDT

NVIDIA has notified users of a high-severity vulnerability in its GPU drivers for Linux that could allow an unprivileged user to escalate privileges (https://nvidia.custhelp.com/app/answers/detail/a_id/5630). Under UCOP IS-3 policy, we are required to patch affected systems as soon as possible.

As a result, we will patch the NVIDIA drivers on all HPC GPU systems and reboot them, starting at 8:00 a.m. on May 6, 2025. Jobs using HPC GPUs at that time will be killed by the reboot. New jobs will not be able to start until patching is complete. We expect the maintenance to last until 6:00 p.m. the same day.

Please email hpc-help@ucdavis with any questions.

Posted on Apr 25, 2025 - 16:04 PDT
Apr 25, 2025

No incidents reported today.

Apr 24, 2025

No incidents reported.

Apr 23, 2025

No incidents reported.

Apr 22, 2025

No incidents reported.

Apr 21, 2025

No incidents reported.

Apr 20, 2025

No incidents reported.

Apr 19, 2025

No incidents reported.

Apr 18, 2025

No incidents reported.

Apr 17, 2025

No incidents reported.

Apr 16, 2025

No incidents reported.

Apr 15, 2025

No incidents reported.

Apr 14, 2025

No incidents reported.

Apr 13, 2025

No incidents reported.

Apr 12, 2025

No incidents reported.

Apr 11, 2025

No incidents reported.