The Joint Accelerator Conferences Website (JACoW) is an international collaboration that publishes the proceedings of accelerator conferences held around the world.
TY  - CONF
AU  - Masetti, L.
AU  - Andre, J.M.
AU  - Andronidis, A.
AU  - Behrens, U.
AU  - Branson, J.
AU  - Chaze, O.
AU  - Cittolin, S.
AU  - Darlea, G.L.
AU  - Deldicque, C.
AU  - Dobson, M.
AU  - Dupont, A.D.
AU  - Erhan, S.
AU  - Gigi, D.
AU  - Glege, F.
AU  - Gomez-Ceballos, G.
AU  - Hegeman, J.
AU  - Holme, O.
AU  - Holzner, A.
AU  - Janulis, M.
AU  - Jiménez Estupiñán, R.J.
AU  - Meijers, F.
AU  - Meschi, E.
AU  - Mommsen, R.K.
AU  - Morovic, S.
AU  - Nunez-Barranco-Fernandez, C.
AU  - O'Dell, V.
AU  - Orsini, L.
AU  - Paus, C.
AU  - Petrucci, A.
AU  - Pieri, M.
AU  - Racz, A.
AU  - Roberts, P.
AU  - Sakulin, H.
AU  - Schwick, C.
AU  - Stieger, B.
AU  - Sumorok, K.
AU  - Veverka, J.
AU  - Zaza, S.
AU  - Zejdl, P.
ED  - Corvetti, Lou
ED  - Riches, Kathleen
ED  - Schaa, Volker RW
TI  - Increasing Availability by Implementing Software Redundancy in the CMS Detector Control System
J2  - Proc. of ICALEPCS2015, Melbourne, Australia, 17-23 October 2015
C1  - Melbourne, Australia
T2  - International Conference on Accelerator and Large Experimental Physics Control Systems
T3  - 15
LA  - english
AB  - The Detector Control System (DCS) of the Compact Muon Solenoid (CMS) experiment ran with high availability throughout the first physics data-taking period of the Large Hadron Collider (LHC). This was achieved through the consistent improvement of the control software and the provision of a 24-hour expert on-call service. One remaining potential cause of significant downtime was the failure of the computers hosting the DCS software. To minimize the impact of these failures after the restart of the LHC in 2015, it was decided to implement a redundant software layer for the control system where two computers host each DCS application. By customizing and extending the redundancy concept offered by WinCC Open Architecture (WinCC OA), the CMS DCS can now run in a fully redundant software configuration. The implementation involves one host being active, handling all monitoring and control tasks, with the second host running in a minimally functional, passive configuration. Data from the active host is constantly copied to the passive host to enable a rapid switchover as needed. This paper describes details of the implementation and practical experience of redundancy in the CMS DCS.
PB  - JACoW
CP  - Geneva, Switzerland
SP  - 717
EP  - 720
KW  - controls
KW  - software
KW  - detector
KW  - hardware
KW  - status
DA  - 2015/12
PY  - 2015
SN  - 978-3-95450-148-9
DO  - 10.18429/JACoW-ICALEPCS2015-WEPGF013
UR  - http://jacow.org/icalepcs2015/papers/wepgf013.pdf
ER  -