Skip to content

UFRGS-CAROL/ipdps2021

Repository files navigation

Demystifying GPU Reliability: Comparing and Combining Beam, Fault Simulation, and Profiling

Authors

  • Fernando Fernandes dos Santos - Universidade Federal do Rio Grande do Sul (UFRGS)
  • Siva Kumar Sastry Hari - NVIDIA Corporation
  • Pedro Martins Basso - UFRGS
  • Luigi Carro - UFRGS
  • Paolo Rech - Politecnico di Torino

About the data

This repository contains almost all the data presented in the paper. In the following section, there is a link to each Jupyter notebook with the data. Beam experiments data are normalized to protect NVIDIA sensitive information. The smallest value normalizes beam data for each device (Kepler and Volta).

Data used in the paper

The data is divided in the following categories:

Final result comparing radiation vs prediction

To obtain the prediction explained in the paper, you have to use the data presented above. The details of the methodologies used in the paper are better explained in the manuscript. Then after the prediction using the proposed methodology the following graphs are calculated using the equation:

As in the paper, the final result is:

  • Kepler - Tesla K40. Figure 6.a in the paper.

  • Volta - Tesla V100. Figure 6.b in the paper.

Used frameworks and codes

We used in this paper the following frameworks:

The codes used in this paper are from:

Citing

@INPROCEEDINGS{ipdps2021,
    author={Fernando Fernandes dos Santos and 
            Siva Kumar Sastry Hari and 
            Pedro Martins Basso and
            Luigi Carro and Paolo Rech},
    booktitle={35th IEEE International Parallel & 
            Distributed Processing Symposium {IPDPS}},
    title={Unveiling GPU Vulnerabilities: 
    Comparing and Combining Beam, Fault Simulation, and Profiling},
    year={2021},
    month={May}
}

About

Repository for the data for IPDPS 2021

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors