Workspace methods
Workspace methods are the core interaction a user has with ARTS calculations.
They are called from the Workspace
object in python but are
implemented in C++ under-the-hood.
The workspace methods definition are mainly located in the workspace_methods.cpp
and workspace_meta_methods.cpp
files. Some workspace methods are
automatically generated elsewhere.
Workspace methods are exposed to other C++ files via the <workspace.h>
header file.
Only the CMake target artsworkspace
and targets that depend on it may include
<workspace.h>
.
How to add a workspace method?
The easiest way to add a workspace method is to copy an existing method.
Go to the workspace_methods.cpp
or workspace_meta_methods.cpp
file, depending on
if it is a basic or meta-method you wish to add.
Copy an existing method and modify it to suit your needs.
Meta-methods are automatically generated and do not require any additional code.
Basic methods require new code to be compiled as part of the artsworkspace
target.
Please add this code to an existing m_
-file or create a new one.
Please ensure that the new file is listed in the CMake target artsworkspace
.
Defining a workspace method
You may define either a basic method or a meta-method that depends on other methods. The easiest way to do so is to copy an existing method and modify it to suit your needs. Please design your method input and output to allow meta-methods to be defined, as they simplify composability significantly.
Basic methods
Basic methods completely define their input and output and do not explicitly depend on other methods. The method must be manually implemented.
Basic methods are defined entirely in workspace_methods.cpp
.
They are part of a map object called wsm_data
. The name of the
method is the key and the value is a struct with the following fields:
desc
- a description of the method as a string.author
- the author(s) of the method as a list of strings.out
- the Workspace variables output of the method as a list of strings.gout
- the generic output of the method as a list of strings. These must not be Workspace variables.gout_type
- the type of the generic output as a list of strings. These must be Workspace groups.gout_desc
- the description of the generic output as a list of strings.in
- the Workspace variables input of the method as a list of strings.gin
- the generic input of the method as a list of strings. These must not be Workspace variables.gin_type
- the type of the generic input as a list of strings. These must be Workspace groups.gin_value
- the default value of the generic input as an optional initialized Workspace groups.gin_desc
- the description of the generic input as a list of strings.pass_workspace
- a boolean indicating if aWorkspace
instance should be passed to the method. If true, the first argument to the method is aconst Workspace&
.
The expected signature of the method will depend on these fields. A linker error will likely occur if the actual signature does not match the expected signature.
The in
and out
may contain the same Workspace variables. If they do, the variable must be
initialized before the method is called because it is treated as if the method is intended to
simply modify the existing value. Please indicate strongly in the documentation if you sometimes overwrite the input variable.
On the other hand, if the Workspace variables is only in out
and not in in
,
it is treated as if the workspace variable is created by the method. Note that since the type system
does not account for this, it is important that you clear the current state of the Workspace variables
in a method that is intended to create a new workspace variable.
The fields gin
, gin_type
, gin_value
, and gin_desc
must be the same size.
The same is true for gout
, gout_type
, and gout_desc
. These are user-generated
inputs and outputs, and are often used to pass information pertinent to the method itself
but not to the workspace as a whole.
Please check other workspace methods for examples by comparing their actual signature
to the expected signature to figure out how the fields should be filled in. Also check
that the documentation is generated as intended by building the pyarts_docs
target.
Tip
All fields but desc
and author
are optional. If a field is not needed, it
is convenient to leave it out.
Meta-methods
Meta-methods do not define all their input and output, but instead define a call order into other methods. From this call order, the inputs of the user-facing workspace method is inferred. This method should not be implemented manually.
These methods are defined in workspace_meta_methods.cpp
. They are defined
as part of a list called wsm_meta
.
A single meta-method data contains:
name
- the name of the method as a string.desc
- a description of the method as a string.author
- the author(s) of the method as a list of strings.methods
- the methods that the meta-method depends on as a list of strings.out
- the output of the method as a list of strings. These must be workspace variables.preset_gin
- The presetgin
values for the method as a list of workspace values.preset_gin_value
- The presetgin_value
values for the method as a list of workspace values.
Tip
A meta-method may depend on another meta-method. If it does, it is important that the meta-method it depends on is defined before it in the list.
Automatic methods
All methods that execute a workspace agenda are automatically generated.
These will be named as agenda_nameExecute
and may otherwise be
treated as normal workspace method.
You need to do nothing to define these methods. But please refrain from defining
them manually as that may cause undefined naming conflicts.
The expected signature of the method propagation_matrix_agendaAuto()
is also
generated automatically near the end of workspace_methods.cpp
. It takes
its input and output from a list of other methods. Feel free to add to this
list but make sure that any naming conflicts regarding gin
are resolved
before doing so. Adding a method to this list may also require changing the
actual signature (which is why the method is generated, so that a change in
the required actual signature is immediately made apparent).
The methods that begin with RetrievalAdd...
are partly generated.
These methods all require a corresponding jacobian_targetsAdd...
method
that fills in the jacobian_targets
workspace variable. To keep that
part of the signature consistent, the additional RetrievalAdd...
information
is simply appended to the in
, out
, and gin
-lists of the
corresponding jacobian_targetsAdd...
method using the local jac2ret
lambda.
Generated files
The workspace method interface generates a lot of files during the build process.
These generated files are located in the build directory and are named
as auto_wsm_N.cc
, where N is a number, as auto_wsm.cpp
, as auto_wsm.h
,
and as auto_wsmmeta.cpp
for the C++ interfacing code. The python-binding
code is also generated as py_auto_wsm_N.cpp
, where N is still a number.
Workspace method naming convention
Names carry meaning. Please follow the naming convention below, and please do not hesitate to fix any naming inconsistencies you find.
Method naming
Workspace method names should be descriptive and follow the naming convention
that the main workspace variable output of the method in snake_case
is followed by a short but descriptive name of what the method does with the output
in PascalCase
.
A general rule of thumb is to use verbs for methods that modify the workspace
variable and nouns for methods that create a new workspace variable.
For example, propagation_matrixAddLines()
has a main output of propagation_matrix
and
adds line absorption to it. It needs to be preceded by a call to
propagation_matrixInit()
which sets up the
propagation matrix to an initial state.
Of course, every use-case is different, but please try to follow this convention.
File naming
The file that a workspace method is implemented in should be named m_<concept>.cc
.
The concept should be a short but descriptive name of what the methods therein do.
Multiple methods per file is allowed and encouraged, but keep them conceptually similar.
To ensure compatibility with various file systems, please avoid using spaces
and capital letters in the filename.
Lastly, please ensure that the file is listed in the CMake target artsworkspace
,
or it will not be compiled.
Workspace method documentation
Workspace documentation that contains *text*
is automatically turned into links
to the relevant ARTS-related variable or method. Please use this feature to link
between workspace methods and variables.
If a method require extra information beyond what you can fit in the desc
field,
there’s a workspace_method_extra_doc.cpp
file that you can add to. This file
has access to the full workspace as part of the artsworkspace
target and the
python documentation adds a separate subsection for the information in this file (documentation level -------
).
Examples of defined workspace methods
The following examples are taken from the ARTS source code. Please check the source code for the full context of the examples.
Method creating a workspace variable
The following is a basic method that creates or set a workspace variable.
This is the extration of the text in the workspace_methods.cpp
file:
wsm_data["ray_pathGeometricUplooking"] = {
.desc =
R"--(Wraps *ray_pathGeometric* for straight uplooking paths from the surface altitude at the position
)--",
.author = {"Richard Larsson"},
.out = {"ray_path"},
.in = {"atmospheric_field", "surface_field", "latitude", "longitude"},
.gin = {"max_step"},
.gin_type = {"Numeric"},
.gin_value = {Numeric{1e3}},
.gin_desc = {"The maximum step length"},
};
The signature of the method is:
void ray_pathGeometricUplooking(ArrayOfPropagationPathPoint& ray_path,
const AtmField& atmospheric_field,
const SurfaceField& surface_field,
const Numeric& latitude,
const Numeric& longitude,
const Numeric& max_step);
The signature of the method returns void
. This is the same for all ARTS methods.
The first argument of the method is a reference to ray_path
.
Since ray_path
is in out
but not in in
,
it is expected that the method overwrite any existing value of ray_path
.
The arguments atmospheric_field
, surface_field
,
latitude
, and longitude
are defined in in
and are passed to the method as immutable references to the respective
workspace variables.
Lastly, the argument max_step
is defined in gin
and is passed
as an immutable reference as well. The type of the argument is Numeric
and the default value is 1e3
. The default value is passed to the method
if the user does not provide a value for max_step
.
All other fields are there to provide context and to generate the documentation.
See ray_pathGeometricUplooking()
for the full documentation.
Method modifying a workspace variable
The following is a basic workspace method that modifies existing workspace variables.
This is the extraction of the text in the workspace_methods.cpp
file:
wsm_data["propagation_matrixAddLines"] = {
.desc = R"--(Line-by-line calculations.
)--",
.author = {"Richard Larsson"},
.out = {"propagation_matrix",
"propagation_matrix_source_vector_nonlte",
"propagation_matrix_jacobian",
"propagation_matrix_source_vector_nonlte_jacobian"},
.in = {"propagation_matrix",
"propagation_matrix_source_vector_nonlte",
"propagation_matrix_jacobian",
"propagation_matrix_source_vector_nonlte_jacobian",
"frequency_grid",
"jacobian_targets",
"select_species",
"absorption_bands",
"ecs_data",
"atmospheric_point",
"ray_path_point"},
.gin = {"no_negative_absorption"},
.gin_type = {"Index"},
.gin_value = {Index{1}},
.gin_desc =
{"Turn off to allow individual absorbers to have negative absorption"},
};
The signature of the method is:
void propagation_matrixAddLines(PropmatVector& propagation_matrix,
StokvecVector& propagation_matrix_source_vector_nonlte,
PropmatMatrix& propagation_matrix_jacobian,
StokvecMatrix& propagation_matrix_source_vector_nonlte_jacobian,
const AscendingGrid& frequency_grid,
const JacobianTargets& jacobian_targets,
const SpeciesEnum& select_species,
const AbsorptionBands& absorption_bands,
const LinemixingEcsData& ecs_data,
const AtmPoint& atmospheric_point,
const PropagationPathPoint& ray_path_point,
const Index& no_negative_absorption);
The signature of the method returns void
. This is the same for all ARTS methods.
The first four arguments of the method are references to
propagation_matrix
.
propagation_matrix_source_vector_nonlte
,
propagation_matrix_jacobian
, and
propagation_matrix_source_vector_nonlte_jacobian
are both output (out
) and input (in
). The method is expected to modify the existing values
of these workspace variables instead of creating new ones.
The arguments
frequency_grid
,
jacobian_targets
,
select_species
,
absorption_bands
,
ecs_data
,
atmospheric_point
, and
ray_path_point
are just defined in in
and are passed to the method
as immutable references to the respective workspace variables.
Lastly, the argument no_negative_absorption
is defined in gin
and is passed
as an immutable reference as well. The type of the argument is Index
and the default value is 1
. The default value is passed to the method
if the user does not provide a value for no_negative_absorption
.
The no_negative_absorption
argument is used to turn off the check for negative absorption,
which is useful for debugging purposes.
The other fields are there to provide context and to generate the documentation.
See propagation_matrixAddLines()
for the full documentation.
Method that uses a workspace agenda
The following is a basic workspace method that creates workspace variables.
This is the extraction of the text in the workspace_methods.cpp
file:
wsm_data["measurement_vectorFromSensor"] = {
.desc =
R"--(Sets measurement vector by looping over all sensor elements
The core calculations happens inside the *spectral_radiance_observer_agenda*.
User choices of *spectral_radiance_unit* does not adversely affect this method
unless the *measurement_vector* or *measurement_jacobian* are further modified
before consumption by, e.g., *OEM*
)--",
.author = {"Richard Larsson"},
.out = {"measurement_vector", "measurement_jacobian"},
.in = {"measurement_sensor",
"jacobian_targets",
"atmospheric_field",
"surface_field",
"spectral_radiance_unit",
"spectral_radiance_observer_agenda"},
.pass_workspace = true,
};
The signature of the method is:
void measurement_vectorFromSensor(const Workspace& ws,
Vector& measurement_vector,
Matrix& measurement_jacobian,
const ArrayOfSensorObsel& measurement_sensor,
const JacobianTargets& jacobian_targets,
const AtmField& atmospheric_field,
const SurfaceField& surface_field,
const SpectralRadianceUnitType& spectral_radiance_unit,
const Agenda& spectral_radiance_observer_agenda);
The signature of the method returns void
. This is the same for all ARTS methods.
The first argument of the method is a reference to the workspace object itself.
This is passed as a const Workspace&
reference to the method. It is passed
to the method because pass_workspace
is set to true
in the method definition.
Note that the workspace object is passed as a const
reference, so it cannot be modified.
The coming two arguments of the method are references to
measurement_vector
and
measurement_jacobian
.
Since measurement_vector
and
measurement_jacobian
are in out
but not in in
,
it is expected that the method overwrite any existing values they might hold.
The arguments measurement_sensor
,
jacobian_targets
,
atmospheric_field
,
surface_field
,
spectral_radiance_unit
, and
spectral_radiance_observer_agenda
are defined in in
and are passed to the method
as immutable references to the respective workspace variables.
The other fields are there to provide context and to generate the documentation.
See measurement_vectorFromSensor()
for the full documentation.
Meta-method output with workspace variables
The following is a meta-method that creates workspace variables.
This is the extraction of the text in the workspace_meta_methods.cpp
file:
wsm_meta.push_back(WorkspaceMethodInternalMetaRecord{
.name = "atmospheric_fieldRead",
.desc = "Reads absorption file from a directory",
.author = {"Richard Larsson"},
.methods = {"atmospheric_fieldInit",
"atmospheric_fieldAppendBaseData",
"atmospheric_fieldAppendAbsorptionData"},
.out = {"atmospheric_field"},
.preset_gin = {"replace_existing"},
.preset_gin_value = {Index{0}},
});
The signature of the generated meta-method is not important because it is generated automatically.
Calling the above method is effectively the same as calling
the listed methods one after the other and then deleting all method output
that is not in out
.
In other words, even if a sub-method has an output that is not in out
,
it will not be passed to the user.
The call order and documentation of
See atmospheric_fieldRead()
makes it possible to follow the call order.