Infrastructure renewal: Difference between revisions

Jump to navigation Jump to search
remove more redundancy
(mention space and power to explain outages)
(remove more redundancy)
Line 4: Line 4:


= What's coming in 2025? =  
= What's coming in 2025? =  
In 2023, The Digital Research Alliance of Canada was given formal approval and funding for a complete replacement of ageing national systems.  
In 2023, The Digital Research Alliance of Canada was given formal approval and funding for a complete replacement of aging national systems.  
The new equipment will offer:
The new equipment will offer:
* Increased processing capacity
* Increased processing capacity
Line 10: Line 10:
* Improved reliability
* Improved reliability


This new infrastructure will better support your research and computational tasks, providing a better-performing and more efficient environment for your work.
This new infrastructure will better support your computational tasks, providing a better-performing and more efficient environment for your research.


The systems being replaced are [[Arbutus]], [[Béluga]], [[Cedar]], [[Graham]] and [[Niagara]]. The new systems will be broadly comparable to the old systems, but with significantly increased capacity.
The systems being replaced are [[Arbutus]], [[Béluga]], [[Cedar]], [[Graham]] and [[Niagara]]. The new systems will be broadly comparable to the old systems, but with significantly increased capacity.
Line 20: Line 20:
   |title=Important information
   |title=Important information
   |content=
   |content=
There will be outages in the winter of 2024/25 and spring of 2025. No details yet, but we recommend that researchers consider the possibility of such outages when planning research programs, graduate examinations, etc., for next winter and spring.
There will be outages in the winter of 2024/25 and spring of 2025. We recommend that researchers consider the possibility of such outages when planning research programs, graduate examinations, etc., for next winter and spring.
}}
}}


Line 28: Line 28:
{| class="wikitable"
{| class="wikitable"
|-
|-
| Current Status || Sep 3, 2024: Currently all sites have completed various Requests for Proposal, and are working with the vendors on deliverables and purchase orders. This means that we do not yet have specific delivery and installation schedules. Current system status will still be available at [https://status.alliancecan.ca/ status.alliancecan.ca] (for old and new systems)
| Current Status || Sep 3, 2024: Currently all sites have completed Requests for Proposals, and are working with the vendors on deliverables and purchase orders.
|-
|-
| Specifications || The sites are negotiating with various vendors, but we expect the new systems to be installed during Winter 2025, with a reasonable expectation that they will be in production and available to users in early Summer 2025. Currently the sites are not yet in a position to provide detailed technical specifications of the new systems. Generally, the new systems will be similar in architecture to the old systems but with considerably increased capacity and performance. For instance, we expect to have fewer compute nodes, but each node will have a very significant increase in the number of cores due to modern multi-core CPUs
| Specifications || The sites cannot yet provide detailed technical specifications of the new systems. Generally, the new systems will be similar in architecture to the old systems but with considerably increased capacity and performance. For instance, we expect to have fewer compute nodes, but each node will have a significant increase in the number of cores due to the increase in the size of multi-core CPUs since 2017.
|-
|-
| Timeline || The sites are negotiating with various vendors, but we are expecting the new systems to be installed during Winter 2025, with a reasonable expectation that they will be in production and available to users in early Summer 2025
| Timeline || We expect the new systems to be installed in the first quarter of 2025, with a reasonable expectation that they will be in production and available to users in early summer 2025. More specific delivery and installation schedules are not yet available.
|}
|}


= RAC allocation policy and renewals =
= Resource Allocation Competition and renewals =
The RAC allocation policy and renewals will be affected by this transition, but we are not changing the normal RAC process. Expect to see the usual announcements for the competition in September 2024. We expect to implement the 2025/26 allocations on the new machines when they become available so there may be some delay in RAC implementation. Detailed updates to follow.  
The Resource Allocation Competition (RAC) and RAC renewals will be affected by this transition, but we are not changing the normal RAC process. Expect to see the usual announcements for the competition in September 2024. We expect to implement the 2025/26 allocations on the new machines when they become available so there may be some delay in RAC implementation. Detailed updates to follow.  
See RAC documentation available [https://www.alliancecan.ca/en/services/advanced-research-computing/accessing-resources/resource-allocation-competition here].
See RAC documentation available [https://www.alliancecan.ca/en/services/advanced-research-computing/accessing-resources/resource-allocation-competition here].
   
   
Line 61: Line 61:


== Will data be copied to the new systems? ==
== Will data be copied to the new systems? ==
Data migration to the new systems is a site responsibility. Each site will let you know exactly what to do once the details are finalized.
Data migration to the new systems is a site responsibility. Each site will let you know what you need to do and what will be done for you once the details are finalized.


== When will outages occur? ==
== When will outages occur? ==
Each site will have their own schedule for outages as the new equipment is installed and transitioned. Specific outages will as usual be described on the status pages (https://status.alliancecan.ca). We will also provide more general updates through this wiki page as we know more, probably in early Autumn 2024.
Each site will have their own schedule for outages as the new equipment is installed and transitioned. Specific outages will as usual be described on the status pages (https://status.alliancecan.ca). We will also provide more general updates through this wiki page as we know more, probably in early autumn 2024.
We will also periodically send system emails with updates and outage notices.
We will also periodically send emails with updates and outage notices.


== Who should I contact for questions about the transition? ==
== Who should I contact for questions about the transition? ==
Contact our [[Technical support]]
Contact our [[Technical support]], but don't expect them to know a great deal more than you read here.


== Will my jobs/applications run without change on the new system? ==
== Will my jobs/applications run without change on the new system? ==
Bureaucrats, cc_docs_admin, cc_staff
2,879

edits

Navigation menu