Paper | Title | Other Keywords | Page |
---|---|---|---|
MOMPR007 | Scalable High Demand Analytics Environments with Heterogeneous Clouds | experiment, scattering, operation, software | 171 |
|
|||
Funding: UK Research and Innovation - Science & Technology Facilities Council (UK SBS IT18160) The Ada Lovelace Centre (ALC) at STFC provides on-demand, data analysis, interpretation and analytics services to scientists using UK research facilities. ALC and Tessella have built software systems to scale analysis environments to handle peaks and troughs in demand as well as to reduce latency by provision environments closer to scientists around the world. The systems can automatically provision infrastructure and supporting systems within compute resources around the world and in different cloud types (including commercial providers). The system then uses analytics to dynamically provision and configure virtual machines in various locations ahead of demand so that users experience as little delay as possible. In this poster, we report on the architecture and complex software engineering used to automatically scale analysis environments to heterogeneous clouds, make them secure and easy to use. We then discuss how analytics was used to create intelligent systems in order to allow a relatively small team to focus on innovation rather than operations. |
|||
![]() |
Poster MOMPR007 [1.650 MB] | ||
DOI • | reference for this paper ※ https://doi.org/10.18429/JACoW-ICALEPCS2019-MOMPR007 | ||
About • | paper received ※ 30 September 2019 paper accepted ※ 09 October 2019 issue date ※ 30 August 2020 | ||
Export • | reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml) | ||
MOPHA086 | The Design of Experimental Performance Analysis and Visualization System | experiment, data-management, database, laser | 409 |
|
|||
The analysis of experimental performance is an essential task to any experiment. With the increasing demand on experimental data mining and utilization. methods of experimental data analysis abound, including visualization, multi-dimensional performance evaluation, experimental process modeling, performance prediction, to name but a few. We design and develop an experimental performance analysis and visualization system, consisting of data source configuration component, algorithm management component, and data visualization component. It provides us feasibilities such as experimental data extraction and transformation, algorithm flexible configuration and validation, and multi-views presentation of experimental performance. It will bring great convenience and improvement for the analysis and verification of experimental performance. | |||
![]() |
Poster MOPHA086 [0.232 MB] | ||
DOI • | reference for this paper ※ https://doi.org/10.18429/JACoW-ICALEPCS2019-MOPHA086 | ||
About • | paper received ※ 30 September 2019 paper accepted ※ 10 October 2019 issue date ※ 30 August 2020 | ||
Export • | reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml) | ||
MOPHA160 | Enabling Data Analytics as a Service for Large Scale Facilities | simulation, distributed, software, experiment | 614 |
|
|||
Funding: UK Research and Innovation - Science & Technology Facilities Council (UK SBS IT18160) The Ada Lovelace Centre (ALC) at STFC is an integrated, cross-disciplinary data intensive science centre, for better exploitation of research carried out at large scale UK Facilities including the Diamond Light Source, the ISIS Neutron and Muon Facility, the Central Laser Facility and the Culham Centre for Fusion Energy. ALC will provide on-demand, data analysis, interpretation and analytics services to worldwide users of these research facilities. Using open-source components, ALC and Tessella have together created a software infrastructure to support the delivery of that vision. The infrastructure comprises a Virtual Machine Manager, for managing pools of VMs across distributed compute clusters; components for automated provisioning of data analytics environments across heterogeneous clouds; a Data Movement System, to efficiently transfer large datasets; a Kubernetes cluster to manage on demand submission of Spark jobs. In this paper, we discuss the challenges of creating an infrastructure to meet the differing analytics needs of multiple facilities and report the architecture and design of the infrastructure that enables Data Analytics as a Service. |
|||
![]() |
Poster MOPHA160 [1.665 MB] | ||
DOI • | reference for this paper ※ https://doi.org/10.18429/JACoW-ICALEPCS2019-MOPHA160 | ||
About • | paper received ※ 30 September 2019 paper accepted ※ 10 October 2019 issue date ※ 30 August 2020 | ||
Export • | reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml) | ||
TUCPR02 | Data Exploration and Analysis with Jupyter Notebooks | FEL, experiment, software, detector | 799 |
|
|||
Funding: With support from EU’s H{2}020 grants 823852 (PaNOSC) and #676541 (OpenDreamKit), the Gordon and Betty Moore Foundation GBMF #4856, the EPSRC’s CDT (EP/L015382/1) and program grant (EP/N032128/1). Jupyter notebooks are executable documents that are displayed in a web browser. The notebook elements consist of human-authored contextual elements and computer code, and computer-generated output from executing the computer code. Such outputs can include tables and plots. The notebook elements can be executed interactively, and the whole notebook can be saved, re-loaded and re-executed, or converted to read-only formats such as HTML, LaTeX and PDF. Exploiting these characteristics, Jupyter notebooks can be used to improve the effectiveness of computational and data exploration, documentation, communication, reproducibility and re-usability of scientific research results. They also serve as building blocks of remote data access and analysis as is required for facilities hosting large data sets and initiatives such as the European Open Science Cloud (EOSC). In this contribution we report from our experience of using Jupyter notebooks for data analysis at research facilities, and outline opportunities and future plans. |
|||
![]() |
Slides TUCPR02 [15.943 MB] | ||
DOI • | reference for this paper ※ https://doi.org/10.18429/JACoW-ICALEPCS2019-TUCPR02 | ||
About • | paper received ※ 24 September 2019 paper accepted ※ 20 October 2019 issue date ※ 30 August 2020 | ||
Export • | reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml) | ||
WEPHA057 | Building a Data Analysis as a Service Portal | software, site, photon, feedback | 1228 |
|
|||
Funding: This project has received funding from the European Union’s Horizon 2020 research and innovation program under grant agreement No 730872 As more and more scientific data are stored at photon sources there is a growing need to provide services to access to view, reduce and analyze the data remotely. The Calipsoplus* project, in which all photon sources in Europe are involved in, has recognized this need and created a prototype portal for Data Analysis as a Service. This paper will present the technology choices, the architecture of the blueprint, the prototype services and the objectives of the production version planned in the medium term. The paper will cover the challenges of building a portal from scratch which covers the needs of multiple sites, each with their own data catalogue, local computing infrastructure and different workflows. User authentication and management are essential to creating a useful but sustainable service. *http://www.calipsoplus.eu/ |
|||
DOI • | reference for this paper ※ https://doi.org/10.18429/JACoW-ICALEPCS2019-WEPHA057 | ||
About • | paper received ※ 01 October 2019 paper accepted ※ 09 October 2019 issue date ※ 30 August 2020 | ||
Export • | reference for this paper using ※ BibTeX, ※ LaTeX, ※ Text/Word, ※ RIS, ※ EndNote (xml) | ||