Lecturer | Prof. Dr. Estela Suarez |
---|---|
Module (BASIS) | MA-INF-1322 |
eCampus | eCampus_MA-INF-1322 |
Type of Lecture | Seminar |
Credits | 4 CP |
Research Area | High Performance Computing |
Language | English |
Max. Number of Participants | 10 |
High Performance Computing (HPC) refers to the use of large cluster computers to solve scientific and technical problems unsolvable on small-scale systems, and doing it so that the hardware resources are efficiently employed. HPC systems are designed to achieve the maximum computing performance at the lowest possible power consumption. They are parallel computers made of hundreds to thousands of compute nodes connected to each other via a high-speed network. Operating HPC systems requires specific software distributions, programming models, and tools. Applications must be parallelized, meaning they must be programmed so that the problem to be solved is split into many individual operations that can be executed in parallel. This seminar addresses important topics and challenges on today’s HPC. It is designed to cover a wide range of areas allowing for a general overview on each of the aspects, as well as for a deep-dive into specific solutions within any of the proposed topics.
Event | Date |
---|---|
Application open | 01.10.2022 |
Term of application | 01.11.2022 |
Preliminary discussion and subject assignment | 04.11.2020 at 10:00-12:00 Seminar-Room 3.035b |
Deadline application finalization (in BASIS) | 16.12.2022 |
Deadline written report (concept) | 16.12.2022 |
Deadline written report (first submission) | 20.01.2023 |
Deadline to review a paper from a colleague | 27.01.2023 |
Deadline written report (final version) | 17.02.2023 |
Deadline presentation slides (structure) | 03.03.2023 |
Deadline presentation slides (full version) | 05.03.2023 |
Presentations | 07.03.2023 |
The topics below are formulated in a general manner. There are two possible ways of approaching them, which lead to two different kinds of seminar report:
# | Topics (some specific aspects as suggestions) | Refs |
---|---|---|
A | HPC computer architectures (basic principles, system-level approaches, homogeneous clusters, heterogeneous systems, modular systems) | [01], [02], [03], [04] |
B | Memory and storage hierarchies (cache levels, memory technologies, storage systems) | [05], [06], [07] |
C | Network interconnect (principles, intra-node and system-wide approaches, topologies, technologies) | [08], [09], [10] |
D | Resource management and job scheduling (principles of multi-user environment, standard approaches, advanced features (e.g. malleability, dynamic resource allocation) | [11], [12] |
E | Programming models for parallel computing (shared memory approaches, distributed memory approaches, hardware-specific, high-level programming models, performance portability) | [13], [14], [15] |
F | Scalability (principles, Amdahl’s and Gustafsons law) | [16], [17], [18] |
G | Energy efficiency and power consumption (challenges, trends, measurement/monitoring techniques) | [19], [20], [21] |
H | Performance prediction and measurement (principles, metrics, classifications, roofline, analysis and modelling tools) | [02], [22], [23], [24], [25], [26] |
I | Co-design (how application requirements are fed into system/component design, principles, approaches, examples) | [27], [28] |
J | Exascale (trends and challenges) | [29], [30], [31], [32] |
A selection of scientific publications / subjects is available above. The subject will be assigned at the preliminary discussion (see section Dates).
During the semester the participants will create their written report and presentation slides. During this period there will be no regular meetings with the other participants. The tutor Prof. Dr. Estela Suarez will be available for any organizational or content-related questions. Feel free to drop an email anytime, also if if you'd like to setup a meeting.
First step: a concept about the written report must be created. This should contain the projected structure of the report as well as a basic description of the contents of each section. Furthermore, it should contain the used literature. To avoid misconceptions and unnecessary work, the conception should be discussed with the tutor before the actual work on the written report is started.
The written report must span 8-10 pages and must be created using LaTeX (report-template.zip). Generally, it is necessary to make a selection and prioritization of topics discussed in the source literature. The content of the written report should match the later presentation, although a different depth, ordering, and prioritization is possible. It is advantageous to incorporate other scientific sources. A scientific complete reference to all used sources is mandatory. It is expected to critically review the subject and literature at hand. A complete and successful written report is necessary to continue with the seminar. Simple rephrasing will be considered unsuccessful.
The tutor will review the written report after the first submission. All participants have the possibility to incorporate desired changes and suggestions from the review and submit a final version afterwards. Only the final version will be subject to grading. Both submissions are mandatory. The projected structure of the presentation slides should be submitted to the tutor in advance to the complete version for discussion. It is possible that the tutor will request further changes to the presentation slides after the submission of the complete slides. Be prepared to incorporate these changes before the seminar takes place.
All participants will present their subject in a 30 minute presentation during the seminar. After each presentation, there is a 15 minute time slot to discuss the presentation and the subject.
To write your report, please use the following LaTeX template: report-template.zip
These are just some possible sources on the topics of the Seminar. In order to get an overview of the topic that you selected, it is a good approach to start by one of the related references and look further into papers that have cited it, and on its own references (especially those given as related work). If you cannot get access to the papers via the UniBonn library licence, please contact Prof. Dr. Estela Suarez.
A - Computer Architectures
B – Memory and storage hierarchies
C – Network and interconnect
D – Resource management and job scheduling
E - Programming models for parallel computing
F – Scalability
G - Energy efficiency and power consumption
H – Performance prediction and measurement
I – Codesign
J – Exascale