I arrived at SOLEIL in 2004 and my mission was to define a way of recording the data:
what format and with which tools. I spent my first few months searching the “ideal” format and I then came across NeXus. Neutron sources and synchrotron radiation allow research which is relatively similar, so the specifications I found with the NeXus format were applicable without modification. Today, more and more of the measurements recorded on the beamlines represent large volumes of information that will always grow in size. Thus, the classic text formats that people used previously have proved unsuitable. On the other hand, one of the constraints imposed on the site was to use the same format on all beamlines. What was needed was a generic format to meet these diverse needs. There has also been an effort
at the European level to try to harmonize data formats and NeXus is a good candidate. But it is not alone.
Now, at SOLEIL, in collaboration with an Australian Institute (ANSTO), we are trying to go beyond NeXus and free ourselves completely from the data format. Having standardized the format of data files produced by the beamlines allows SOLEIL to have a “library” of its experimental data, a library the content of which can be explored computationally using, for example, the Web application TWIST. SOLEIL thus possesses the scientific memory of experiments that have taken place on its beamlines.
Managing on a large-scale basis, our experimental data is new in the synchrotron world and puts us in a very favorable position for different European projects (PANDATA, PNI...) related to this issue. Today, we are taking advantage of this position trying to convince IT personnel of partner institutes (ESRF, DESY, etc.) to join us in the “CommonDataModel” approach, an ambitious project that aims to unify access to scientific data for the data analysis software. So stay tuned….