Skip to content

Fill shipment from CSV

StephMonaco edited this page Nov 27, 2018 · 25 revisions

Import shipment from CSV data file

EXI allows to fill in a shipment uploading a CSV file. Such file should contain information about the sample, container, parcel and diffraction plan. Some parameters are mandatory and others will be optional.

What is a CSV file?

A CSV file is a plain text format mainly used to store tabular data. CSV means Comma-Separated Values then each value is separated with a comma. Example:

Dewar1,CA288,SPINEpuck,1,ACRO,xtal101,HX562B6A,P121,87,55.8,112.6,90,90.4,90,MXpressE,1.8, 2.2,50,2,4,98,1, P222,Cn1cnc2n(C)c(=O)n(C)c(=O)c12,2.5, Best looking sample

This line describes a sample located in a parcel called 'Dewar1' that contains one container called 'CA288' which type is SPINEpuck.

If a column has no value then it is left empty. Example:

Dewar1,CA288,SPINEpuck,1,ACRO,xtal101,,,,,,,,,,,,,,,,,

Why a CSV file and not a Excel file?

CSV is a stardard format capable to be read for all different operating systems and it does not depend on any program.

Supported format

Value Description Example
parcel name mandatory identifier of the parcel Dewar1
container name mandatory identifier of the container CA288
Container type mandatory Controlled list: Unipuck, SPINEpuck SPINEpuck
Container position mandatory sample position within the container 1
protein acronym mandatory Acronym of the protein. It should exist in ISPyB (coming from the sample sheet declared in the user portal). It is case sensitive. ACRO
sample acronym mandatory Name of the sample that should be unique within the shipment xtal101
Pin barcode optional Barcode identifier HX562B6A
SPG & cell optional If present, used for strategy calculation and autoprocessing. If a SPG is suspected but no unit cell associated is known, the Forced SPG parameter is more adequate to use P121,87,55.8,112.6,90,90.4,90
Experiment Type optional Controlled list: Default (no workflow), MXPressE, MXPressO, MXpressE_SAD, MXpressI, MXpressP..
Aimed resolution optional Resolution that will be used for the experiment - A default value of 2Å is taken if this parameter is not filled 1.8Å
Required resolution optional If observed resolution is worse than this value some automatic workflows will not launch the data collection following the crystal analysis (e.g. MxpressE) 2.2Å
Beam diameter optional Several workflows adapt the beam size to the volume of the sample found during the Xtal characterisation but it is possible to force it in some cases (e.g. MxpressO) - value in micronmeters. depending on the beamline aperture list, the closest aperture to this value will be set 50
Number of positions optional if you expect several samples (N) on the same support you can ask to collect automatically a full dataset on each of the N positions requested and found in the initial 2D mesh scan of some workflows 3
Aimed multiplicity optional 4
Aimed completeness optional 98
Forced SPG optional If present, used for strategy calculation and autoprocessing and will overwrite the SPG described in the SPG & cell columns of the sample P222
Radiation sensitivity optional BEST input in the case of highly radiation-sensitive crystals (0.5–2.0 - low to high sensitivity)
SMILES optional Cn1cnc2n(C)c(=O)n(C)c(=O)c12
Total Rotation Angle optional
Min. Osc. Angle optional Used by the EDNA characterisation in the MXpress workflows. The calculated diffraction plan will not propose an oscillation angle smaller than this value
Observed resolution optional For user purpose only (not used in the pipelines)
Comments optional For user purpose

Rules

  1. Parcel name can not be empty and has to be unique for this shipment
  2. Container name can not be empty and has to be unique for this shipment
  3. Container type is a controlled list of values (case sensitive): Unipuck or Spinepuck
  4. Sample position has to be an integer and depends on container type. If container type is Unipuck then accepted values are between 1 and 16. In case of Spinepuck values should be in the range of 1 and 10.
  5. Protein has to be in the list of proteins of your proposal (case sensitive)
  6. Sample name and the protein attached has to be unique for the whole proposal. That means that it will not be possible to have the same sample described twice in the same proposal (even from various shipments).