Part 2: tzresult and Lwt-tzresult¶
This is Part 2 of 4 of the The Error Monad tutorial.
error, trace, and tzresult¶
The error type is an extensible variant
type defined in
the Error_monad module.
Each part of the Mavkit source code can extend it to include some errors
relevant to that part. E.g., the p2p layer adds
type error += Connection_refused whilst the store layer adds
type error += Snapshot_file_not_found of string. More information on
errors is presented later.
The 'error trace type is a data structure dedicated to holding
errors. It is exported by the Error_monad module. It is only
ever instantiated as error trace. It can accumulate multiple errors,
which helps present information about the context in which an error
occurs. More information on traces is presented later.
The 'a tzresult type is an alias for ('a, error trace) result.
It is used in Mavkit to represent the possibility of failure in a generic
way. Using tzresult is a trade-off. You should only use it in
situations where the pros outweigh the cons.
Pros:
- A generic error that is the same everywhere means that the binding operator - let*can be used everywhere without having to convert errors to a common type.
- Traces allow you to easily add context to an existing error (see how later). 
Cons:
- A generic wrapper around generic errors that cannot be meaningfully matched against (although the pattern exists in legacy code). 
- Type information about errors is coarse-grained to the point of being meaningless (e.g., there is no exhaustiveness check when matching). 
- Registration is syntactically heavy and requires care (see below). 
In general, the type tzresult is mostly useful at a high-enough
level of abstraction and in the outermost interface of a module or even
package (i.e., when exposing errors to callers that do not have access
to internal implementation details).
Generic failing¶
The easiest way to return a tzresult is via functions provided by
the Error_monad. These functions are located at the top-level of the
module, as such they are available, unqualified, everywhere in the code.
They do not even require the syntax modules to be open.
- error_withis for formatting a string and wrapping it inside an- errorinside a- traceinside an- Error. E.g.,- let retry original_limit (f : unit -> unit tzresult) = let rec retry limit f if limit < 0 then error_with "retried %d times" original_limit else match f () with | Error _ -> retry (limit - 1) f | Ok () -> Ok () in retry original_limit f- You can use all the formatting percent-escapes from the Format module. However, you should generally keep the message on a single line so that it can be printed nicely in logs. - Formally, - error_withhas type- ('a, Format.formatter, unit, 'b tzresult) format4 -> 'a. This is somewhat inscrutable. Suffice to say: it is used with a format string and returns an- Error.
- error_with_exn : exn -> 'a tzresultis for wrapping an exception inside an- errorinside a- traceinside an- Error.- let media_type_kind s = match s with | "json" | "bson" -> Ok `Json | "octet-stream" -> Ok `Binary | _ -> error_with_exn Not_found 
- failwithis for formatting a string and wrapping it inside an- errorinside a- traceinside an- Errorinside a promise. E.g.,- failwith "Cannot read file %s (ot %s)" file_name (Unix.error_message error) - failwithis similar to- error_withbut for the combined Lwt-- tzresultmonad. Again the type (- ('a, Format.formatter, unit, 'b tzresult Lwt.t) format4 -> 'a) is inscrutable, but again usage is as simple as a formatting.
- fail_with_exnis for wrapping an exception inside an- errorinside a- traceinside an- Errorinside a promise. E.g.,- fail_with_exn Not_found - fail_with_exnis similar to- error_with_exnbut for the combined Lwt-- tzresultmonad.
These functions are useful as a way to fail with generic errors that carry a simple payload. The next section is about using custom errors with more content.
Exercises¶
- Write a function - tzresult_of_option : 'a option -> 'a tzresultwhich replaces- Nonewith a generic failure of your choosing.
- Write a function - catch : (unit -> 'a) -> 'a tzresultwhich wraps any exception raised during the evaluation of the function call.
Declaring and registering errors¶
On many occasions, error_with and friends (see above) are not
sufficient: you may want your error to carry more information than can
be conveyed in a simple string or a simple exception. When you do, you
need a custom error. You must first declare and then register an
error.
To declare a new error you simply extend the error type. Remember
that the error type is defined and exported to the rest of the code
by the Error_monad module. You extend it with the += syntax:
type error +=
  Invalid_configuration of { expected: string; got: string; line: int }
The registration function register_error_kind is also part of the
Error_monad module. You register a new error by calling this
function.
let () =
  register_error_kind
    `Temporary
    ~id:"invalid_configuration"
    ~title:"Invalid configuration"
    ~description:"The configuration is invalid."
    ~pp:(fun f (expected, got, line) ->
      Format.fprintf f
        "When parsing configuration expected %s but got %s (at line %d)"
        expected got line
    )
    Data_encoding.(
      obj3
        (req "expected" string)
        (req "got" string)
        (req "line" int31))
    (function
      | Invalid_configuration {expected; got; line} ->
        Some (expected, got, line)
      | _ -> None)
    (fun (expected, got, line) -> Invalid_configuration {expected; got; line})
Note that you MUST register the errors you declare. Failure to do so can lead to serious issues.
The arguments for the register_error_kind function are as follows: -
category (`Temporary): the category argument is meaningless
in the shell, just use `Temporary.
- id: a short string containing only characters that do not need escaping (- [a-zA-Z0-9._-]), must be unique across the whole program.
- title: a short human readable string.
- description: a longer human readable string.
- pp: a pretty-printing function carrying enough information for a full error message for the user. Note that the function does not receive the error, instead it receives the projected payload of the error (here a 3-tuple- (expected, got, line).
- encoding: an encoding for the projected payload of the error. 
- projection: a partial function that matches the specific error (out of all of them) and return its projected payload. This function always has the form - function | <the error you are returning> -> Some <projected payload> | _ -> None.
- injection: a function that takes the projected payload and constructs the error out of it. 
For errors that do not carry information (e.g.,
type error += Size_limit_exceeded), the projected payload of the
error is unit.
It is customary to either register the error immediately after the error is declared or to register multiple errors immediately after declaring them all. In some cases, the registration happens in a separate module. Either way, registration of declared error is compulsory.
Exercises¶
- Register the following error - (** [Size_limit_exceeded {limit; current_size; attempted_insertion}] is used when an insertion into the global table of known blocks would cause the size of the table to exceed the limit. The field [limit] holds the maximum allowed size, the field [current_size] holds the current size of the table and [attempted_insertion] holds the size of the element that was passed to the insertion function. *) type error += Size_limit_exceeded { limit: int; current_size: int; attempted_insertion: int }
The Result_syntax’s mv extensions¶
Remember that 'a tzresult is a special case of ('a, 'e) result.
Specifically, a special case where 'e is error trace.
Consequently, you can handle tzresult values using the
Result_syntax module.
The module Result_syntax exports a few functions dedicated to handling
tzresult. These functions were omitted from Part 1.
- tzfail: 'e -> ('a, 'e trace) result: the expression- tzfail ewraps- ein a- traceinside an- Error. When- eis of type- erroras is the case throughout Mavkit,- tzfail eis of type- 'a tzresult.
- and*: a binding operator alias for- tzboth(see below). You can use it with- let*the same way you use- andwith- let.- let apply_triple f (x, y, z) = let open Result_syntax in let* u = f x and* v = f y and* w = f z in return (u, v, w) - When you use - and*, the bound results (- f x,- f y, and- f z) are all evaluated fully, regardless of the success/failure of the others. The expression which follows the- in(- return ..) is evaluated if all the bound results are successful.
- tzboth : ('a, 'e trace) result -> ('b, 'e trace) result -> ('a * 'b, 'e trace) result: the expression- both a bis- Okif both- aand- bare- Okand- Errorotherwise`.- Note that unlike - both, the type of errors (- error trace) is the same on both the argument and return side of this function: the traces are combined automatically. This remark applies to the- tzalland- tzjoin(see below) as well.- The stability of the return type is what allows this syntax module to include an - and*binding operator.
- tzall : ('a, 'e trace) result list -> ('a list, 'e trace) result: the function- tzallis a generalisation of- tzbothfrom tuples to lists.
- tzjoin : (unit, 'e trace) result list -> (unit, 'e trace) result: the function- tzjoinis a specialisation of- tzallfor list of unit-typed expressions (typically, for side-effects).
- and+is a binding operator similar to- and*but for use with- let+rather than- let*.
Exercises¶
- What is the difference between the two following functions? - let twice f = let open Result_syntax in let* () = f () in let* () = f () in return_unit - let twice f = let open Result_syntax in let* () = f () and* () = f () in return_unit 
The Lwt_result_syntax’s mv extensions¶
In the same way result can be combined with Lwt, tzresult can
also be combined with Lwt. And in the same way that Result_syntax exports a
few mv-specific extensions, Lwt_result_syntax exports a few Lwt+``mv``
specific extensions.
There are possibly too many parallels to keep track of, so the diagram below might help.
'a  -----------> ('a, 'e) result ------------> 'a tzresult
 |                        |                         |
 |                        |                         |
 V                        V                         V
'a Lwt.t ------> ('a, 'e) result Lwt.t ------> 'a tzresult Lwt.t
Anyway, the Lwt_result_syntax module exports a few functions dedicated to
handling Lwt+``tzresult``. These functions were omitted from Part 1.
- tzfail: 'e -> ('a, 'e trace) result Lwt.t: the expression- tzfail ewraps- ein a- traceinside an- Errorinside a promise. When- eis of type- erroras is the case throughout Mavkit,- tzfail eis of type- 'a tzresult Lwt.t.
- and*: a binding operator alias for- tzboth. You can use it with- let*the same way you use- andwith- let.- let apply_triple f (x, y, z) = let open Lwt_result_syntax in let* u = f x and* v = f y and* w = f z in return (u, v, w) - When you use - and*, the bound promises (- f x,- f y, and- f z) are evaluated concurrently, and the expression which follows the- in(- return ..) is evaluated once all the bound promises have all resolved but only if all of them resolve successfully.- Note how this - and*binding operator inherits the properties of both- Lwt_syntax.( and* )and- Result_syntax.( and* ). Specifically, the promises are evaluated concurrently and the expression which follows the- inis evaluated only if all the bound promises have successfully resolved. These two orthogonal properties are combined. This remark also applies to- tzboth,- tzall,- tzjoinand- and+below.
- tzboth : ('a, 'e trace) result Lwt.t -> ('b, 'e trace) result Lwt.t -> ('a * 'b, 'e trace) result Lwt.t: the expression- tzboth p qis a promise that resolves once both- pand- qhave resolved. It resolves to- Okif both- pand- qdo, and to- Errorotherwise`.- Note that unlike - Lwt_result_syntax.both, the type of errors (- error trace) is the same on both the argument and return side of this function: the trace are combined automatically. This remark applies to the- tzalland- tzjoin(see below) as well.- The stability of the return type is what allows this syntax module to include an - and*binding operator.
- tzall : ('a, 'e trace) result Lwt.t list -> ('a list, 'e trace) result Lwt.t: the function- tzallis a generalisation of- tzbothfrom tuples to lists.
- join : (unit, 'e trace) result Lwt.t list -> (unit, 'e trace) result Lwt.t: the function- tzjoinis a specialisation of- tzallfor lists of unit-typed expressions (typically, for side-effects).
- and+is a binding operator similar to- and*but for use with- let+rather than- let*.
Exercises¶
- Rewrite this function to use the - Lwt_result_syntaxmodule and no other syntax module.- let apply_tuple (f, g) (x, y) = let open Lwt_syntax in let* u = f x and* v = g y in let r = Result_syntax.tzboth u v in return r 
- Write the implementation for - (** [map f [x1; x2; ..]] is [[y1; y2; ..]] where [y1] is the successful result of [f x1], [y2] is the successful result of [f x2], etc. If [f] fails on any of the inputs, returns an [Error] instead. Either way, all the calls to [f] on all the inputs are evaluated concurrently and all the calls to [f] have resolved before the whole promise resolves. *) val map : ('a -> 'b tzresult Lwt.t) -> 'a list -> 'b list tzresult
Lifting¶
When you are working with promises of tzresult (i.e., within
Lwt-tzresult), you may occasionally need to call functions that
return a simple promise (i.e., within Lwt-only) or a simple tzresult
(i.e., within tzresult-only).
Because tzresult is a special case of result, you can use the same
operators let*! and let*? as presented in Part 1.
let*! x = plain_lwt_function foo bar in
let*? x = plain_result_function foo bar in
..
Tracing¶
Remember that a trace is a data structure specifically designed for errors.
Traces have two roles:
- As a programmer you benefit from the traces’ ability to combine automatically. Indeed, this feature of traces makes the - and*binding operators possible which can simplify some tasks such as concurrent evaluation of multiple- tzresultpromises.
- For the user, traces combine multiple errors, allowing for high-level errors (e.g., - Cannot_bootstrap_node) to be paired with low-level errors (e.g.,- Unix_error EADDRINUSE). When used correctly, this can help create more informative error messages which, in turn, can help debugging.
Tracing primitives are declared in the Error_monad module. As such,
they are available almost everywhere in the code. They should be used
whenever an error passes from one component (say the p2p layer, or the
storage layer) into another (say the shell).
- record_trace err rleaves- runtouched if it evaluates to- Ok. Otherwise, it adds the error- errto the trace carried in- Error.- let check_hashes head block operation = let open Result_syntax in let* () = record_trace (Invalid_hash { kind: "head"; hash: head}) @@ check_hash chain in let* () = record_trace (Invalid_hash { kind: "block"; hash: block}) @@ check_hash block in let* () = record_trace (Invalid_hash { kind: "operation"; hash: operation}) @@ check_hash operation in return_unit- In this example a failure from any of the calls to - check_hashwill be given context that helps with understanding the source of the error.
- record_trace_evalis a lazy version of- record_tracein that the error added to the trace is only evaluated if needed. More formally- record_trace_eval make_err rleaves- runtouched if it evaluates to- Ok. Otherwise it calls- make_errand adds the returned error onto the trace.- You should use the strict - record_traceversion when the error you are adding to the trace is an immediate value (e.g.,- Overflow) or a constructor with immediate values (e.g.,- Invalid_file name). You should use the lazy- record_trace_evalversion when the error you are adding to the trace requires computation to generate (e.g., if it requires formatting or querying).
- traceis the Lwt-aware variant of- record_trace. More formally,- trace err pleaves- puntouched if it resolves successfully, otherwise it adds the error- errto the trace carried by the unsuccessful resolve.- let get_data_and_gossip_it () = let open Lwt_result_syntax in let* data = trace Cannot_get_random_data_from_storage @@ Storage.get_random_data () in let* number_of_peers = trace Cannot_gossip_data @@ P2p.gossip data in return (data, number_of_peers)- In this example, low-level storage errors are given more context by the - Cannot_get_random_data_from_storageerror. Similarly, low-level p2p errors are given more context by the- Cannot_gossip_dataerror. This is important because both the storage and the p2p layer may suffer from similar system issues (such as file-descriptor exhaustion).
- trace_evalis the lazy version of- trace, or, equivalently, the Lwt-aware version of- record_trace_eval.- You should use the strict - traceversion when the error you are adding is immediate or a constructor with immediate values. You should use the lazy- trace_evalversion when the error you are adding requires computation to generate.
Do not hesitate to use the tracing primitives. Too much context is
better than too little context. Think of trace (and variants) as a
way to document Mavkit. Specifically, as making the error messages of
Mavkit more informative.
META COMMENTARY¶
The previous sections had a practical focus: how to handle errors? how to mix different syntaxes? how?! By contrast, the following sections are in-depth discussions of advanced or tangential topics which you should feel free to skim or even to skip.