Image may be NSFW.
Clik here to view.© RO, RUG CIT
The time spent was significantly more than estimated at the start of this project because a number of unforeseen major hurdles had to be overcome. All critical issues have been addressed and the result is a technically up-to-date “workhorse” that will process LOFAR data for the coming 5 years. We will continue to explore and harvest the new opportunities this new cluster is able to offer to LOFAR and its community (e.g. unlocking the power of the GPU nodes).
Specs for the tech-enthusiasts:
Nodes: 1 management, 2 heads, 2 filesystem meta-data, 18 storage, 4 GPU compute and 50 regular CPU compute.
Tflops (theoretical): 96 from CPUs, 68 from GPUs, Total 164 (CEP2 was 20)
Filesystem: 3.5 PB
New techniques and frameworks that have been incorporated in the LOFAR system as part of the migration to CEP4 are a.o.: Lustre (cluster file system), Docker (containerized applications), SLURM (batch scheduling), Qpid (message broker infrastructure), Ganglia (scalable distributed monitoring system), Spacewalk (systems management), Robin Hood (policy engine and reporting tool for large file systems) and a new standard for an OS: CentOS 7.