Workspace methods

Workspace methods are the core interaction a user has with ARTS calculations. They are called from the Workspace object in python but are implemented in C++ under-the-hood.

The workspace methods definition are mainly located in the workspace_methods.cpp and workspace_meta_methods.cpp files. Some workspace methods are automatically generated elsewhere.

Workspace methods are exposed to other C++ files via the <workspace.h> header file. Only the CMake target artsworkspace and targets that depend on it may include <workspace.h>.

How to add a workspace method?

The easiest way to add a workspace method is to copy an existing method. Go to the workspace_methods.cpp or workspace_meta_methods.cpp file, depending on if it is a basic or meta-method you wish to add. Copy an existing method and modify it to suit your needs.

Meta-methods are automatically generated and do not require any additional code. Basic methods require new code to be compiled as part of the artsworkspace target. Please add this code to an existing m_-file or create a new one. Please ensure that the new file is listed in the CMake target artsworkspace.

Defining a workspace method

You may define either a basic method or a meta-method that depends on other methods. The easiest way to do so is to copy an existing method and modify it to suit your needs. Please design your method input and output to allow meta-methods to be defined, as they simplify composability significantly.

Basic methods

Basic methods completely define their input and output and do not explicitly depend on other methods. The method must be manually implemented.

Basic methods are defined entirely in workspace_methods.cpp. They are part of a map object called wsm_data. The name of the method is the key and the value is a struct with the following fields:

  • desc - a description of the method as a string.

  • author - the author(s) of the method as a list of strings.

  • out - the Workspace variables output of the method as a list of strings.

  • gout - the generic output of the method as a list of strings. These must not be Workspace variables.

  • gout_type - the type of the generic output as a list of strings. These must be Workspace groups.

  • gout_desc - the description of the generic output as a list of strings.

  • in - the Workspace variables input of the method as a list of strings.

  • gin - the generic input of the method as a list of strings. These must not be Workspace variables.

  • gin_type - the type of the generic input as a list of strings. These must be Workspace groups.

  • gin_value - the default value of the generic input as an optional initialized Workspace groups.

  • gin_desc - the description of the generic input as a list of strings.

  • pass_workspace - a boolean indicating if a Workspace instance should be passed to the method. If true, the first argument to the method is a const Workspace&.

The expected signature of the method will depend on these fields. A linker error will likely occur if the actual signature does not match the expected signature.

The in and out may contain the same Workspace variables. If they do, the variable must be initialized before the method is called because it is treated as if the method is intended to simply modify the existing value. Please indicate strongly in the documentation if you sometimes overwrite the input variable.

On the other hand, if the Workspace variables is only in out and not in in, it is treated as if the workspace variable is created by the method. Note that since the type system does not account for this, it is important that you clear the current state of the Workspace variables in a method that is intended to create a new workspace variable.

The fields gin, gin_type, gin_value, and gin_desc must be the same size. The same is true for gout, gout_type, and gout_desc. These are user-generated inputs and outputs, and are often used to pass information pertinent to the method itself but not to the workspace as a whole.

Please check other workspace methods for examples by comparing their actual signature to the expected signature to figure out how the fields should be filled in. Also check that the documentation is generated as intended by building the pyarts_docs target.

Tip

All fields but desc and author are optional. If a field is not needed, it is convenient to leave it out.

Meta-methods

Meta-methods do not define all their input and output, but instead define a call order into other methods. From this call order, the inputs of the user-facing workspace method is inferred. This method should not be implemented manually.

These methods are defined in workspace_meta_methods.cpp. They are defined as part of a list called wsm_meta. A single meta-method data contains:

  • name - the name of the method as a string.

  • desc - a description of the method as a string.

  • author - the author(s) of the method as a list of strings.

  • methods - the methods that the meta-method depends on as a list of strings.

  • out - the output of the method as a list of strings. These must be workspace variables.

  • preset_gin - The preset gin values for the method as a list of workspace values.

  • preset_gin_value - The preset gin_value values for the method as a list of workspace values.

Tip

A meta-method may depend on another meta-method. If it does, it is important that the meta-method it depends on is defined before it in the list.

Automatic methods

All methods that execute a workspace agenda are automatically generated. These will be named as agenda_nameExecute and may otherwise be treated as normal workspace method. You need to do nothing to define these methods. But please refrain from defining them manually as that may cause undefined naming conflicts.

The expected signature of the method propagation_matrix_agendaAuto() is also generated automatically near the end of workspace_methods.cpp. It takes its input and output from a list of other methods. Feel free to add to this list but make sure that any naming conflicts regarding gin are resolved before doing so. Adding a method to this list may also require changing the actual signature (which is why the method is generated, so that a change in the required actual signature is immediately made apparent).

The methods that begin with RetrievalAdd... are partly generated. These methods all require a corresponding jacobian_targetsAdd... method that fills in the jacobian_targets workspace variable. To keep that part of the signature consistent, the additional RetrievalAdd... information is simply appended to the in, out, and gin-lists of the corresponding jacobian_targetsAdd... method using the local jac2ret lambda.

Generated files

The workspace method interface generates a lot of files during the build process. These generated files are located in the build directory and are named as auto_wsm_N.cc, where N is a number, as auto_wsm.cpp, as auto_wsm.h, and as auto_wsmmeta.cpp for the C++ interfacing code. The python-binding code is also generated as py_auto_wsm_N.cpp, where N is still a number.

Workspace method naming convention

Names carry meaning. Please follow the naming convention below, and please do not hesitate to fix any naming inconsistencies you find.

Method naming

Workspace method names should be descriptive and follow the naming convention that the main workspace variable output of the method in snake_case is followed by a short but descriptive name of what the method does with the output in PascalCase. A general rule of thumb is to use verbs for methods that modify the workspace variable and nouns for methods that create a new workspace variable.

For example, propagation_matrixAddLines() has a main output of propagation_matrix and adds line absorption to it. It needs to be preceded by a call to propagation_matrixInit() which sets up the propagation matrix to an initial state.

Of course, every use-case is different, but please try to follow this convention.

File naming

The file that a workspace method is implemented in should be named m_<concept>.cc. The concept should be a short but descriptive name of what the methods therein do. Multiple methods per file is allowed and encouraged, but keep them conceptually similar. To ensure compatibility with various file systems, please avoid using spaces and capital letters in the filename.

Lastly, please ensure that the file is listed in the CMake target artsworkspace, or it will not be compiled.

Workspace method documentation

Workspace documentation that contains *text* is automatically turned into links to the relevant ARTS-related variable or method. Please use this feature to link between workspace methods and variables.

If a method require extra information beyond what you can fit in the desc field, there’s a workspace_method_extra_doc.cpp file that you can add to. This file has access to the full workspace as part of the artsworkspace target and the python documentation adds a separate subsection for the information in this file (documentation level -------).

Examples of defined workspace methods

The following examples are taken from the ARTS source code. Please check the source code for the full context of the examples.

Method creating a workspace variable

The following is a basic method that creates or set a workspace variable.

This is the extration of the text in the workspace_methods.cpp file:

wsm_data["ray_pathGeometricUplooking"] = {
    .desc =
        R"--(Wraps *ray_pathGeometric* for straight uplooking paths from the surface altitude at the position
)--",
    .author = {"Richard Larsson"},
    .out    = {"ray_path"},
    .in     = {"atmospheric_field", "surface_field", "latitude", "longitude"},
    .gin    = {"max_step"},
    .gin_type  = {"Numeric"},
    .gin_value = {Numeric{1e3}},
    .gin_desc  = {"The maximum step length"},
};

The signature of the method is:

void ray_pathGeometricUplooking(ArrayOfPropagationPathPoint& ray_path,
                                const AtmField& atmospheric_field,
                                const SurfaceField& surface_field,
                                const Numeric& latitude,
                                const Numeric& longitude,
                                const Numeric& max_step);

The signature of the method returns void. This is the same for all ARTS methods.

The first argument of the method is a reference to ray_path. Since ray_path is in out but not in in, it is expected that the method overwrite any existing value of ray_path.

The arguments atmospheric_field, surface_field, latitude, and longitude are defined in in and are passed to the method as immutable references to the respective workspace variables.

Lastly, the argument max_step is defined in gin and is passed as an immutable reference as well. The type of the argument is Numeric and the default value is 1e3. The default value is passed to the method if the user does not provide a value for max_step.

All other fields are there to provide context and to generate the documentation. See ray_pathGeometricUplooking() for the full documentation.

Method modifying a workspace variable

The following is a basic workspace method that modifies existing workspace variables.

This is the extraction of the text in the workspace_methods.cpp file:

wsm_data["propagation_matrixAddLines"] = {
    .desc      = R"--(Line-by-line calculations.
)--",
    .author    = {"Richard Larsson"},
    .out       = {"propagation_matrix",
                  "propagation_matrix_source_vector_nonlte",
                  "propagation_matrix_jacobian",
                  "propagation_matrix_source_vector_nonlte_jacobian"},
    .in        = {"propagation_matrix",
                  "propagation_matrix_source_vector_nonlte",
                  "propagation_matrix_jacobian",
                  "propagation_matrix_source_vector_nonlte_jacobian",
                  "frequency_grid",
                  "jacobian_targets",
                  "select_species",
                  "absorption_bands",
                  "ecs_data",
                  "atmospheric_point",
                  "ray_path_point"},
    .gin       = {"no_negative_absorption"},
    .gin_type  = {"Index"},
    .gin_value = {Index{1}},
    .gin_desc =
        {"Turn off to allow individual absorbers to have negative absorption"},
};

The signature of the method is:

void propagation_matrixAddLines(PropmatVector& propagation_matrix,
                                StokvecVector& propagation_matrix_source_vector_nonlte,
                                PropmatMatrix& propagation_matrix_jacobian,
                                StokvecMatrix& propagation_matrix_source_vector_nonlte_jacobian,
                                const AscendingGrid& frequency_grid,
                                const JacobianTargets& jacobian_targets,
                                const SpeciesEnum& select_species,
                                const AbsorptionBands& absorption_bands,
                                const LinemixingEcsData& ecs_data,
                                const AtmPoint& atmospheric_point,
                                const PropagationPathPoint& ray_path_point,
                                const Index& no_negative_absorption);

The signature of the method returns void. This is the same for all ARTS methods.

The first four arguments of the method are references to propagation_matrix. propagation_matrix_source_vector_nonlte, propagation_matrix_jacobian, and propagation_matrix_source_vector_nonlte_jacobian are both output (out) and input (in). The method is expected to modify the existing values of these workspace variables instead of creating new ones.

The arguments frequency_grid, jacobian_targets, select_species, absorption_bands, ecs_data, atmospheric_point, and ray_path_point are just defined in in and are passed to the method as immutable references to the respective workspace variables.

Lastly, the argument no_negative_absorption is defined in gin and is passed as an immutable reference as well. The type of the argument is Index and the default value is 1. The default value is passed to the method if the user does not provide a value for no_negative_absorption. The no_negative_absorption argument is used to turn off the check for negative absorption, which is useful for debugging purposes.

The other fields are there to provide context and to generate the documentation. See propagation_matrixAddLines() for the full documentation.

Method that uses a workspace agenda

The following is a basic workspace method that creates workspace variables.

This is the extraction of the text in the workspace_methods.cpp file:

wsm_data["measurement_vectorFromSensor"] = {
      .desc =
          R"--(Sets measurement vector by looping over all sensor elements

The core calculations happens inside the *spectral_radiance_observer_agenda*.

User choices of *spectral_radiance_unit* does not adversely affect this method
unless the *measurement_vector* or *measurement_jacobian* are further modified
before consumption by, e.g., *OEM*
)--",
      .author         = {"Richard Larsson"},
      .out            = {"measurement_vector", "measurement_jacobian"},
      .in             = {"measurement_sensor",
                        "jacobian_targets",
                        "atmospheric_field",
                        "surface_field",
                        "spectral_radiance_unit",
                        "spectral_radiance_observer_agenda"},
      .pass_workspace = true,
  };

The signature of the method is:

void measurement_vectorFromSensor(const Workspace& ws,
                                  Vector& measurement_vector,
                                  Matrix& measurement_jacobian,
                                  const ArrayOfSensorObsel& measurement_sensor,
                                  const JacobianTargets& jacobian_targets,
                                  const AtmField& atmospheric_field,
                                  const SurfaceField& surface_field,
                                  const SpectralRadianceUnitType& spectral_radiance_unit,
                                  const Agenda& spectral_radiance_observer_agenda);

The signature of the method returns void. This is the same for all ARTS methods.

The first argument of the method is a reference to the workspace object itself. This is passed as a const Workspace& reference to the method. It is passed to the method because pass_workspace is set to true in the method definition. Note that the workspace object is passed as a const reference, so it cannot be modified.

The coming two arguments of the method are references to measurement_vector and measurement_jacobian. Since measurement_vector and measurement_jacobian are in out but not in in, it is expected that the method overwrite any existing values they might hold.

The arguments measurement_sensor, jacobian_targets, atmospheric_field, surface_field, spectral_radiance_unit, and spectral_radiance_observer_agenda are defined in in and are passed to the method as immutable references to the respective workspace variables.

The other fields are there to provide context and to generate the documentation. See measurement_vectorFromSensor() for the full documentation.

Meta-method output with workspace variables

The following is a meta-method that creates workspace variables.

This is the extraction of the text in the workspace_meta_methods.cpp file:

wsm_meta.push_back(WorkspaceMethodInternalMetaRecord{
    .name             = "atmospheric_fieldRead",
    .desc             = "Reads absorption file from a directory",
    .author           = {"Richard Larsson"},
    .methods          = {"atmospheric_fieldInit",
                         "atmospheric_fieldAppendBaseData",
                         "atmospheric_fieldAppendAbsorptionData"},
    .out              = {"atmospheric_field"},
    .preset_gin       = {"replace_existing"},
    .preset_gin_value = {Index{0}},
});

The signature of the generated meta-method is not important because it is generated automatically.

Calling the above method is effectively the same as calling the listed methods one after the other and then deleting all method output that is not in out. In other words, even if a sub-method has an output that is not in out, it will not be passed to the user.

The call order and documentation of See atmospheric_fieldRead() makes it possible to follow the call order.