Dr Philipp Jordt | Finding the Best Way for Large Research Facilities to Handle All Their Data
Article written by Matthew Davies, PhD
As technology advances, physical experiments are performed by bigger, more powerful, and more sophisticated machines. Research conducted using these large-scale facilities (LSF) typically involves collaboration between the operating staff, instrument scientists, and external research groups, who propose specific experiments to be performed. With many people involved and such large volumes of data generated by each experiment, it has become challenging to make sure that results are collected, catalogued, stored, and distributed efficiently.
The German consortium DAPHNE4NFDI has been working to integrate Electronic Laboratory Notebooks (ELNs) into the Photon and Neutron (PaN) research community. One of their main goals has been to develop the decision-making process behind how research facilities, universities, and researchers can evaluate the range of ELNs available and decide on which solution to integrate into their operations.
The Age of Big Data
We live in the Age of Data. It’s estimated that over 90% of all the world’s data have been generated just in the last 2 years, and this exponential increase shows no signs of slowing down. Among some of the most impressive data generators in the world are LSFs. PETRA III at DESY, Hamburg, is one of the world’s biggest and brightest storage ring X-ray sources and produces nearly 5 petabytes of data every year. In nearby Schenefeld, at the European X-ray Free Electron Laser, 30 petabytes of scientific data were acquired in 2024, with data rates up to hundreds of gigabits per second for a single large area X-ray detector. These two facilities combined have generated enough data to fill up the hard drives of nearly 20,000 typical home computers —and these are just two examples from large scale facilities. If you include all the other experiments at the many other LSFs across Europe, the amount of data physical science produces each year is almost unfathomable.
With so much data being generated, it’s imperative that research facilities have the appropriate software in place to deal with such volume. Gone are the days where pen and paper could do an adequate job. This is where ELNs come in. In the last few years, the number of ELNs available has considerably increased but many are aimed at individual researchers. This means that LSFs now have a range of solutions to choose from, but it leads to a new question: how does one decide which ELN is the most appropriate for a LSF research community?
Making Data More FAIR
The Data from Photon and Neutron Experiments (DAPHNE4NFDI) consortium led by Dr Bridget Murphy is a member of the German National Research Data Infrastructure (NFDI), whose aim is to develop and improve the research data management tools available to LSFs. The DAPHNE4NFDI team have developed a workflow which facilities can use to decide which of the many ELNs out there would be most suitable to adopt.
The main goal of implementing an ELN is to help record experiment details and results according to ‘FAIR’ principles: these letters stand for Findable, Accessible, Interoperable, and Reusable. Data is Findable if it’s straightforward for a human or machine to locate it; this is especially important when such large quantities of it are generated daily. Once it’s found, that data should be easily Accessible by those who need to access it. The ideal is then for the data to be Interoperable — that it can be straightforwardly integrated with other data or applications. Finally, it’s desirable for data to be as Reusable as possible. It shouldn’t just vanish into the ether after experiments are performed.
Successful Deployment in X-Ray Reflectivity
The DAPHNE4NFDI team have already successfully engineered the implementation of ELNs – in the field of X-ray reflectivity. This is the area in which Kiel University researchers at DAPHNE4NFDI work. In these experiments, X-rays are used to study the properties of surfaces (or, more technically, interfaces) of solid materials and liquids, such as magnetic materials for sensors or lipid membranes as potential biological switches. Understanding the character of the interface on the nanoscale is critical to understanding how the material or device is going to behave. When X-rays hit the interface, if the angle of incidence is smaller than a certain critical angle and the surface is flat, the X-rays are almost totally reflected. By changing the angle of incidence, one can vary the depth to which the X-rays can penetrate into the sample and analyse their thickness, density, and roughness on the nanoscale. To extract this information from the measured data some advanced analysis tools have to be used, eg. neural networks, as developed within DAPHNE4NFDI by Prof. Frank Schreibers group at Tübingen university.
One of the key advantages of using an ELN is the potential for supervisors, collaborators, and research students to collaborate during experiments, even when the team is at diverse locations. With the enhanced productivity ELNs have shown for X-ray reflectivity experiments, Dr Jordt and the DAPHNE4NFDI team are now looking to extend the use of ELNs into other communities in collaboration with German LSFs, such as DESY in Hamburg, and MLZ in Munich.
How to Choose an ELN Using Specifications
One of the key questions the DAPHNE4NFDI team wanted to address was how to choose what ELN to use out of the currently available options. Initially, they wanted to assess the ‘suitability’ and ‘maturity’ of the available solutions, including two different ELN approaches such as the science driven, bottom-up, snip logbook by Dr. Markus Osterhoff from Göttingen University, and the facility provisioned MLZ-ELN tested by Dr. Yuliia Tymoshenko from the neutron scattering group at KIT. An ELN is suitable if it’s a good fit for the workflows and procedures at the facility, and it’s mature if it’s a well-developed, technically proficient software package. In general, many ELNs will be mature in that they operate effectively, but not all of them may be suitable for use in a LSF due to the demanding environment. Out of the 12 ELNs identified, 82% were considered to be mature and 65% were suitable. Overall, this led to 53% of ELNs that were both suitable and mature, meaning they would be satisfactory for potential deployment at PaN facilities.
But the DAPHNE4NFDI team were interested in performing a more detailed analysis, one that took more carefully into account the individual features characteristic of a good ELN. They drew up a list of 31 different specifications; these are individual features that are necessary for an ELN to improve productivity at the facility. Some of the most important specifications included: intuitive user interfaces, auto-saving, parallel editing, and the ability to manage access rights.
With these specifications in place, Dr Jordt and his colleagues asked some of the senior scientists to provide a ‘satisfaction’ rating. They then devised a formula to calculate an overall rating for each ELN –the Figure of Merit (FOM). The FOM would be a number between 0 and 1, with a FOM of 0 indicates that one or more essential features are missing or unsatisfactory from a given ELN. A cutoff value of 0.6 was pre-selected to determine ELN suitability and only 2 out of 12 ELNs passed the cutoff – that’s a pass rate of 17% compared with the 53% from the previous overall survey due to the lack of critical components.
Future Integration into the Photon and Neutron Community
One of the most intriguing results from the DAPHNE4NFDI team’s research is that ELNs are much more likely to be deemed suitable when evaluated wholesale than when evaluated by rating each specification one at a time. This suggests that researchers providing evaluations may not have all the important necessary features in mind when they’re asked to rank just the overall suitability of the ELN. As a result, the DAPHNE4NFDI team suggest that rating each individual specification and computing a FOM score may provide a less biased, more accurate way to determine whether an ELN is suitable for use in a large research facility.
But the FOM shouldn’t be the one and only arbiter of an ELN’s suitability. The DAPHNE4NFDI team suggest that, while the calculation of an FOM is a good guide and an essential step in the decision-making process, once a selection of ELNs have passed, a series of detailed discussions should follow, considering critical functionality and requirements before deciding on which ELN to finally adopt. For the PaN community, the DAPHNE4NFDI team’s work has helped to narrow down the options and get closer to making a final decision. But beyond the PaN field, it also offers a model workflow for the adoption of ELNs.
SHARE
DOWNLOAD E-BOOK
REFERENCE
https://doi.org/10.33548/SCIENTIA1314
MEET THE RESEARCHER
Doctor Philipp Jordt
Interface Physics Group, Institute of Experimental and Applied Physics, Kiel University, Kiel, Germany
Dr Philipp Jordt is a researcher at Kiel University in Germany. He completed his Masters Degree in 2015 and his PhD in 2021, both at Kiel. During his doctoral studies he used X-ray scattering techniques to study properties of magentoelectric sensor components, e.g. piezotronic semiconductor materials and was involved with multiple research campaigns at a range of large-scale Synchrotron Light Source facilities. These included DESY in Germany, the Diamond Light Source in the UK, ESRF in France and APS in the US. Since 2021, Dr Jordt has been a member of the Data from Photon and Neutron Experiments (DAPHNE4NFDI) consortium based in Germany. This is an initiative hoping to enable the collection of Findable, Accessible, Interoperable, and Reusable (FAIR) data within large institutes, such as those Dr Jordt has worked with throughout his career. In particular, Dr Jordt is interested in the evaluation of Electronic Laboratory Notebooks (ELNs), digital upgrades of traditional paper lab-books, and the development of AI tools for research data management.
CONTACT
E: jordt@physik.uni-kiel.de
LI: https://www.linkedin.com/in/philipp-jordt-498312245/
FUNDING
NFDI e.V., German National Research Data Infrastructure
DFG, German Research Foundation
FURTHER READING
P Jordt, M Osterhoff, Y Tymoshenko, et al., Specifications for Electronic Laboratory Notebooks (ELN) in the Photon and Neutron Community, Synchrotron Radiation News, 2024, 37 (6), 3-8. DOI: 10.1080/08940886.2024.2432265
REPUBLISH OUR ARTICLES
We encourage all formats of sharing and republishing of our articles. Whether you want to host on your website, publication or blog, we welcome this. Find out more
Creative Commons Licence (CC BY 4.0)
This work is licensed under a Creative Commons Attribution 4.0 International License.
What does this mean?
Share: You can copy and redistribute the material in any medium or format
Adapt: You can change, and build upon the material for any purpose, even commercially.
Credit: You must give appropriate credit, provide a link to the license, and indicate if changes were made.
SUBSCRIBE NOW
Follow Us
MORE ARTICLES YOU MAY LIKE
Putting AI in your Ears with 3D Neural Networks
It’s difficult to communicate with someone when neither of you speak the same language; apps and online tools like Google Translate can help to bridge the gap, but they hardly make for a natural conversation. However, the rise of artificial intelligence (AI) has opened the door for the development of speech-to-speech technology, where spoken language is translated instantly in real time. To bring this idea closer to reality, a collaboration of European researchers led by Professor Cristell Maneux at the University of Bordeaux have proposed concepts for a 3D artificial neural network accelerator. This ultra-compact, highly efficient design could enable the construction of standalone earpieces capable of translating spoken language instantly, with no need for internet access.
Ongoing research by Professor Han Lamers (University of Oslo) and Professor Bettina Reitz-Joosse (University of Groningen) reveals how Fascist Italy weaponized ancient Rome’s language to legitimise its power and connect Mussolini’s regime to Italy’s imperial past. Their projects involve collaboration with an international team of mostly junior researchers based in Norway, the Netherlands, Austria, and Italy.
Gopal Ranganathan | The Use of Artificial Intelligence in Personalised Airline Services
In today’s competitive airline industry, providing personalised services to passengers is becoming increasingly important for customer satisfaction and business success. Gopal Ranganathan from Quad Optima Analytics has developed an innovative artificial intelligence system to help airline executives implement and govern personalisation programmes. This cutting-edge technology aims to increase profits by tailoring services to individual customers while maintaining sound revenue management principles.
Professor Nancy Burnham | Imaging on the Nanoscale: Improving Techniques in Atomic Force Microscopy
Atomic force microscopy (AFM) provides the means to image surfaces with nanometre resolution, allowing scientists to look at the individual building blocks and forces that make up the world around us. Professor Nancy Burnham of Worcester Polytechnic Institute and her colleagues Lei Lyu and Lily Poulikakos at the Swiss Federal Laboratories for Materials Science and Technology (Empa) have worked on how we can reduce artefacts in these images and ensure they are accurately interpreted. By considering and applying these techniques, high-quality AFM research can be produced.
Dr Sebastian Rabien | Making Membrane Mirrors for Future Space Telescopes
Mirrors play a key role in space telescopes, but to keep increasing the scale of this technology, mirrors need to be light and compact, so they can be transported in spacecraft, but also able to be adaptively corrected and controlled to ensure their accuracy. Dr Rabien and his colleagues from the Max Planck Institute for Extraterrestrial Physics, in Germany, have developed a technique to make extremely thin and lightweight mirrors, which can then be controlled with adaptive optics, making them a potential solution for larger space telescopes.