Designing a clean architecture for a software product is a difficult task and developers can easily overlook it under pressure or when they are eager to deliver exciting new features. However, choosing the quick solution instead of a clean one will likely lead to a large amount of additional work in the future (technical debt).
To keep technical debt at acceptable levels, design and implementation of new features should follow best practices:
- Make sure your code is always covered by automatic testing to ensure its correctness. Where unit tests are impractical, use integration tests.
- Adhere to the SOLID principles of software design.
- Do not reinvent the wheel: if someone solved your problem within the project or in a third party library, consider using their solution or extending it before writing a new component for the same purpose.
- You ain't gonna need it (YAGNI): do not design solutions for problems that might come up in the future, as chances are that you will never need that code. Focus on current problems and prepare for future requirements by writing clean code.
- Do not repeat yourself (DRY): if you are writing the same code snippet in several places, extract it into a function.
- Use meaningful names: the purpose of an object should be clear from its name. Usually, class names are nouns and function names are verbs.
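As a minimal sketch of the DRY and naming guidelines above (all names here are hypothetical and not part of the project), the normalization logic lives in one verb-named function instead of being repeated at each call site:

```python
def normalize_name(raw: str) -> str:
    """Strip surrounding whitespace and lowercase a user-provided name."""
    return raw.strip().lower()


def register_user(name: str, registry: set[str]) -> None:
    # Reuses the shared helper instead of repeating `.strip().lower()`.
    registry.add(normalize_name(name))


def is_registered(name: str, registry: set[str]) -> bool:
    # Same helper, so both functions always agree on the normalization.
    return normalize_name(name) in registry
```

If the normalization rule changes later, only `normalize_name` needs to be edited.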
Remember that important design decisions should be properly documented for future reference and to share the knowledge with other developers. We decided to use lightweight Architecture Decision Records (ADRs) for this purpose. The full list of ADRs and documentation for writing new ones can be found in docs/functional/architecture/Index.md.
We follow the Google Python Style Guide with a few minor changes (mentioned below). Since the best way to remember something is to understand the reasons behind it, make sure you go through the style guide at least once, paying special attention to the discussions in the Pros, Cons, and Decision subsections.
We deviate from the Google Python Style Guide only in the following points:
- We use `ruff-linter` instead of `pylint`.
- We use `ruff-formatter` for source code and imports formatting, which may work differently than indicated by the guidelines in section 3. Python Style Rules. For example, maximum line length is set to 100 instead of 79 (although docstring lines should still be limited to 79).
- According to subsection 2.19 Power Features, direct use of power features (e.g. custom metaclasses, import hacks, reflection) should be avoided, but standard library classes that internally use these power features are accepted. Following the same spirit, we allow the use of power features in infrastructure code with similar functionality and scope as the Python standard library.
- According to subsection 3.19.12 Imports For Typing, symbols from the `typing` and `collections.abc` modules used in type annotations "can be imported directly to keep common annotations concise and match standard typing practices". Following the same spirit, we allow symbols to be imported directly from third-party or internal modules when they only contain a collection of frequently used typing definitions.
- `pass` vs `...` (`Ellipsis`)

  `pass` is the no-op statement in Python and `...` is a literal value (called Ellipsis) introduced for slicing collections of an unknown number of dimensions. Although they are very different in nature, both of them are used in places where a statement is required purely for syntactic reasons, and there is not yet a clear standard practice in the community about when to use one or the other. We decided to align with the common pattern of using `...` in the body of empty function definitions working as placeholders for actual implementations defined somewhere else (e.g. type stubs, abstract methods and methods appearing in `Protocol` classes) and `pass` in any other place where its usage is mixed with actual statements.

      # Correct use of `...` as the empty body of an abstract method
      class AbstractFoo:
          @abstractmethod
          def bar(self) -> Bar: ...

      # Correct use of `pass` when mixed with other statements
      try:
          resource.load(id=42)
      except ResourceException:
          pass
- Where to import from `gt4py.next` or use `import gt4py.next as gtx`?
  - Client code (like tests, doctests and examples) should use the above style for the public FieldView API.
  - Library code should always import the defining module and use qualified names.
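To illustrate the `pass` vs `...` convention from this list for `Protocol` classes, here is a minimal sketch. `Greeter`, `EnglishGreeter` and `welcome` are hypothetical names chosen for the example, not project code:

```python
from typing import Protocol


class Greeter(Protocol):
    # `...` marks a placeholder body: the actual implementation is
    # provided by any class that structurally matches this protocol.
    def greet(self, name: str) -> str: ...


class EnglishGreeter:
    def greet(self, name: str) -> str:
        return f"Hello, {name}."


def welcome(greeter: Greeter, name: str) -> str:
    return greeter.greet(name)
```

`EnglishGreeter` does not inherit from `Greeter`; it satisfies the protocol simply by defining a compatible `greet` method.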
Error messages should be written as sentences, starting with a capital letter and ending with a period (avoid exclamation marks). Try to be informative without being verbose. Code objects such as 'ClassNames' and 'function_names' should be enclosed in single quotes, and so should string values used for message interpolation.
Examples:
raise ValueError(f"Invalid argument 'dimension': should be of type 'Dimension', got '{dimension.type}'.")
Interpolated integer values do not need quotes if they indicate an amount. Example:
raise ValueError(f"Invalid number of arguments: expected 3 arguments, got {len(args)}.")
The quotes can also be dropped when presenting a sequence of values. In this case, the message should be rephrased so the sequence is separated from the text by a colon ':'.
raise ValueError(f"Unexpected keyword arguments: {', '.join(set(kwarg_names) - set(expected_kwarg_names))}.")
The message should be kept to one sentence if reasonably possible. Ideally, the sentence should be short and avoid unnecessary words. Examples:
# too many sentences
raise ValueError(f"Received an unexpected number of arguments. Should receive 5 arguments, but got {len(args)}. Please provide the correct number of arguments.")
# better
raise ValueError(f"Wrong number of arguments: expected 5, got {len(args)}.")
# less extreme
raise TypeError(f"Wrong argument type. Can only accept 'int's, got '{type(arg)}' instead.")
# but can still be improved
raise TypeError(f"Wrong argument type: 'int' expected, got '{type(arg)}'.")
The terseness vs. helpfulness tradeoff should lean towards terseness for internal error messages and towards helpfulness for `DSLError` and its subclasses, where additional sentences are encouraged if they point out likely hidden sources of the problem or common fixes.
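The conventions above can be sketched with two small message builders. This is only an illustration under assumed names (`wrong_count_message`, `wrong_type_message` and `check_argument` are hypothetical helpers, not part of GT4Py):

```python
def wrong_count_message(expected: int, received: int) -> str:
    # One sentence, capitalized, ending with a period; amounts are unquoted.
    return f"Invalid number of arguments: expected {expected}, got {received}."


def wrong_type_message(name: str, value: object, expected_type: type) -> str:
    # Code objects and interpolated string values go in single quotes.
    return (
        f"Invalid argument '{name}': should be of type "
        f"'{expected_type.__name__}', got '{type(value).__name__}'."
    )


def check_argument(name: str, value: object, expected_type: type) -> None:
    """Raise a 'TypeError' following the message conventions."""
    if not isinstance(value, expected_type):
        raise TypeError(wrong_type_message(name, value, expected_type))
```

Keeping the message construction in small functions also makes the exact wording easy to check in unit tests.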
We generate the API documentation automatically from the docstrings using Sphinx and some extensions such as Sphinx-autodoc and Sphinx-napoleon. These follow the Google Python Style Guide docstring conventions to automatically format the generated documentation. A complete overview can be found here: Example Google Style Python Docstrings.
Sphinx supports the reStructuredText (reST) markup language for defining additional formatting options in the generated documentation. However, section 3.8 Comments and Docstrings of the Google Python Style Guide does not specify how to use markup in docstrings. As a result, we decided to forbid reST markup in docstrings, except for the following cases:
- Cross-referencing other objects using Sphinx text roles for the Python domain (as explained here).
- Very basic formatting markup to improve readability of the generated documentation without obscuring the source docstring (e.g. ``literal`` strings, bulleted lists).
We highly encourage the doctest format for code examples in docstrings. In fact, doctest runs code examples and makes sure they are in sync with the codebase.
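A minimal sketch of a Google-style docstring with doctest examples follows; the function `clamp` is a hypothetical example, not project code:

```python
def clamp(value: int, lower: int, upper: int) -> int:
    """Limit ``value`` to the closed interval [lower, upper].

    Args:
        value: Number to be limited.
        lower: Smallest allowed result.
        upper: Largest allowed result.

    Returns:
        The value itself if it lies within the interval, otherwise the
        nearest bound.

    Examples:
        >>> clamp(5, 0, 10)
        5
        >>> clamp(-3, 0, 10)
        0
        >>> clamp(42, 0, 10)
        10
    """
    return max(lower, min(value, upper))


if __name__ == "__main__":
    import doctest

    # Runs every `>>>` example in this module and reports mismatches.
    doctest.testmod()
```

If the implementation changes and an example no longer matches its recorded output, the doctest run reports the difference.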
In general, you should structure new Python modules in the following way:
- Shebang line: `#! /usr/bin/env python3` (only for executable scripts!).
- License header (see `LICENSE_HEADER.txt`).
- Module docstring.
- Imports, alphabetically ordered within each block (fixed automatically by `ruff-formatter`):
  - Block of imports from the standard library.
  - Block of imports from general third party libraries using standard shortcuts when customary (e.g. `numpy as np`).
  - Block of imports from specific modules of the project.
- Definition of exported symbols (optional, mainly for re-exporting symbols from other modules):

      __all__ = ["func_a", "CONST_B"]

- Public constants and typing definitions.
- Module contents organized in a convenient way for understanding how the pieces of code fit together, usually defining functions before classes.
Try to keep sections and items logically ordered, and add section separator comments to make section boundaries explicit when needed. If there is no single evident logical order, pick the order you consider best or use alphabetical order.
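The following skeleton sketches how such a module might look. The module content, the names and the separator comment style are assumptions made for illustration; the shebang line is omitted because this is not an executable script.

```python
# (License header would go here; see LICENSE_HEADER.txt.)
"""Hypothetical module with basic statistics helpers (illustration only)."""

# Block 1: standard library imports, alphabetically ordered.
import math
from collections.abc import Sequence

# Block 2 would hold third party imports (e.g. `import numpy as np`).
# Block 3 would hold imports from specific modules of the project.

__all__ = ["DEFAULT_EMPTY_VALUE", "RunningMean", "mean"]

# -- Public constants and typing definitions --
DEFAULT_EMPTY_VALUE = 0.0
Samples = Sequence[float]


# -- Functions --
def mean(samples: Samples) -> float:
    """Return the arithmetic mean, or ``DEFAULT_EMPTY_VALUE`` if empty."""
    if not samples:
        return DEFAULT_EMPTY_VALUE
    return math.fsum(samples) / len(samples)


# -- Classes --
class RunningMean:
    """Accumulate samples and report their mean."""

    def __init__(self) -> None:
        self._samples: list[float] = []

    def add(self, sample: float) -> None:
        self._samples.append(sample)

    @property
    def value(self) -> float:
        return mean(self._samples)
```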
Consider configuration files as another type of source code and apply the same criteria, using comments when possible for better readability.
You may occasionally need to disable checks from quality assurance (QA) tools (e.g. linters, type checkers, etc.) on specific lines, as some tools might not be able to fully understand why a certain piece of code is needed. This is usually done with special comments, e.g. `# noqa: F401`, `# type: ignore`. However, you should only ignore QA errors when you fully understand their source and rewriting your code to pass QA checks would make it less readable. Additionally, you should add a short descriptive code if possible (check ruff rules and mypy error codes for reference):
f = lambda: 'empty' # noqa: E731 [lambda-assignment]
and, if needed, a brief comment for future reference:
...
return undeclared_symbol # noqa: F821 [undefined-name] on purpose to trigger black-magic
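The same approach applies to type checker suppressions. The sketch below is deliberately contrived and uses a hypothetical function `parse_port`; it only shows how a `# type: ignore` comment can carry a specific mypy error code (`arg-type`) and a short reason:

```python
def parse_port(raw: str) -> int:
    """Convert a textual port number to an integer."""
    return int(raw)


# The annotation asks for `str`, so mypy reports an argument type error for
# the integer literal. The error code limits the suppression to that check.
port = parse_port(8080)  # type: ignore[arg-type]  # on purpose: exercises runtime conversion
```

With the error code in place, any other kind of type error introduced on that line later is still reported.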
Testing components is a critical part of a software development project. We follow standard practices in software development and write unit, integration, and regression tests. Note that even though doctests are great for documentation purposes, they lack many features and are difficult to debug. Hence, they should not be used as replacement for proper unit tests except in trivial cases.
There are separate test suites (each living in a subfolder) for the separate subpackages of GT4Py. This is so that not all subpackages have to be tested on CI all the time (see [`ci-docs`][the ci docs] for details).
The `tests` folder should not be a package, but the contained test suites are Python packages.
Each test suite should have the following structure:

    <subpackage>_tests/
        __init__.py  # each subpackage test should be a python package
        integration_tests/
            __init__.py
            feature_test/
                __init__.py
                <starting_end>_tests/
                    __init__.py
                    test_<feature>.py  # test <feature> in bigger context, starting from <starting_end>
            multi_feature_tests/
                __init__.py
                <starting_end>_tests/
                    test_<feature_combination>.py  # test <feature_combination>, which can be user code snippets etc
                    <feature_group>_tests/  # grouped feature combination tests with common data
                        __init__.py
                        <test_data>.py
                        test_<feature_combination>.py
                        test_<other_feature_combination>.py
        unit_tests/
            __init__.py
            ...  # mirror subpackage folder structure
            <subsubpackage>_tests/
                test_<module>.py  # unit tests for gt4py.<subpackage>.<subsubpackage>.<module>
        regression_tests/
            __init__.py
            test_<bug_description>.py
Splitting the unit tests for a module into multiple `test_<module>_<feature>.py` test modules may be allowed temporarily. However, this should be taken as a reason to think about splitting the source module.
Temporarily, tests for testing utilities can be placed next to the module containing them, with the name `test_util_<module>.py`. This should be taken as a hint that the tested utilities should be moved into the library.
Integration tests come with their own utilities, found in `cases.py`, for better test automation and simplification:
- Predefined field type annotations, e.g. `IJKField = Field[[IDim, JDim, KDim], np.int64]`.
- Test fixtures: `cartesian_case` for structured grids, e.g. `IDim` x `JDim`; `unstructured_case` for unstructured ones, e.g. `EdgeDim`.
- Parameter allocations for objects, derived directly from the decorator's function definition: `input_param = cases.allocate(case_fixture, decorator_name, "input_label")()`
- Data initialization/modification classes, such as `ConstInitializer()`, to be used as `allocate(...).strategy()` parameters.
- Decorator verification functions:
  - `cases.verify()`: used when input and output field generation cannot be automated, for example when the `extend()` method needs to be used for an offset or when sparse fields are part of the input dataset.
  - `cases.verify_with_default_data()`: used when input and output field generation can be automated.
- Backends are set automatically with default switched off. If a backend specification is needed, it can be extracted as an attribute of the case.
In case new features are needed, check if an existing utility can be extended before implementing a whole new one. The name of new utilities should reflect their purpose and follow the naming conventions of existing tools. If the new utility is a completely stand-alone component, create a new file in the bottom-most directory enclosing all usages of the utility; otherwise, place it in the appropriate file containing utilities with similar functionalities.
Note: The name `cases` for the new test module was chosen based on the idea that details like backend, grid size, etc. should be summarized in a parametrizable "test case", of which there are two types: cartesian and unstructured.
TODO: add missing test conventions.