package ppx_expect

  1. Overview
  2. Docs
Cram like framework for OCaml

Install

Dune Dependency

Authors

Maintainers

Sources

ppx_expect-v0.16.0.tar.gz
sha256=e0795a0ae2d576758aaaa685440951b28fe75d072d88f5c6bf415fb1a44e423c

Description

Part of the Jane Street's PPX rewriters collection.

Published: 02 Jun 2023

README

README.org

#+TITLE: expect-test - a cram like framework for OCaml

** Introduction

Expect-test is a framework for writing tests in OCaml, similar to [[https://bitheap.org/cram/][Cram]].
Expect-tests mimic the existing inline tests framework with the =let%expect_test= construct.
The body of an expect-test can contain output-generating code, interleaved with =%expect= extension
expressions to denote the expected output.

When run, these tests will pass iff the output matches what was expected. If a test fails, a
corrected file with the suffix ".corrected" will be produced with the actual output, and the
=inline_tests_runner= will output a diff.

Here is an example Expect-test program, say in =foo.ml=

#+begin_src ocaml
open Core

let%expect_test "addition" =
  printf "%d" (1 + 2);
  [%expect {| 4 |}]
#+end_src

When the test is run (as part of =inline_tests_runner=), =foo.ml.corrected= will be produced with the
contents:

#+begin_src ocaml
open Core

let%expect_test "addition" =
  printf "%d" (1 + 2);
  [%expect {| 3 |}]
#+end_src

=inline_tests_runner= will also output the diff:

#+begin_src
---foo.ml
+++foo.ml.corrected
File "foo.ml", line 5, characters 0-1:
  open Core

  let%expect_test "addition" =
    printf "%d" (1 + 2);
-|  [%expect {| 4 |}]
+|  [%expect {| 3 |}]
#+end_src

Diffs will be shown in color if the =-use-color= flag is passed to the test runner executable.

** Expects reached from multiple places

A [%expect] can exist in a way that it is encountered multiple times, e.g. in a
functor or a function:

#+begin_src ocaml
let%expect_test _ =
  let f output =
    print_string output;
    [%expect {| hello world |}]
  in
  f "hello world";
  f "hello world";
;;
#+end_src

The =[%expect]= should capture the exact same output (i.e. up to string equality) at every
invocation. In particular, this does **not** work:

#+begin_src ocaml
let%expect_test _ =
  let f output =
    print_string output;
    [%expect {| \(foo\|bar\) (regexp) |}]
  in
  f "foo";
  f "bar";
;;
#+end_src

** Output matching

Matching is done on a line-by-line basis. If any output line fails to
match its expected output, the expected line is replaced with the
actual line in the final output.

*** Whitespace

Inside =%expect= nodes, whitespace around patterns are ignored, and
the user is free to put any amount for formatting purposes. The same
goes for the actual output.

Ignoring surrounding whitespace allows to write nicely formatted
expectation and focus only on matching the bits that matter.

To do this, ppx_expect strips patterns and outputs by taking the
smallest rectangle of text that contains the non-whitespace
material. All end of line whitespace are ignored as well. So for
instance all these lines are equivalent:

#+begin_src ocaml
  print blah;
  [%expect {|
abc
defg
  hij|}]

  print blah;
  [%expect {|
                abc
                defg
                  hij
  |}]

  print blah;
  [%expect {|
    abc
    defg
      hij
  |}]
#+end_src

However, the last one is nicer to read.

For the rare cases where one does care about what the exact output is,
ppx_expect provides the =%expect_exact= extension point, which only
succeed when the untouched output is exactly equal to the untouched
pattern.

When producing a correction, ppx_expect tries to respect as much as
possible the formatting of the pattern.

** Output capture

The extension point =[%expect.output]= returns a =string= with the output that
would have been matched had an =[%expect]= node been there instead.

An idiom for testing non-deterministic output is to capture the output using
=[%expect.output]= and either post-process it or inspect it manually, e.g.,

#+BEGIN_SRC ocaml
show_process ();
let pid_and_exit_status = [%expect.output] in
let exit_status = discard_pid pid_and_exit_status in
print_endline exit_status;
[%expect {| 1 |}]
#+END_SRC

This is preferred over output patterns (see below).

** Integration with Async, Lwt or other cooperative libraries

If you are writing expect tests for a system using Async, Lwt or any
other libraries for cooperative threading, you need some preparation
so that everything works well. For instance, you probably need to
flush some =stdout= channel. The expect test runtime takes care of
flushing =Stdlib.stdout= but it doesn't know about
=Async.Writer.stdout=, =Lwt_io.stdout= or anything else.

To deal with this, expect\_test provides some hooks in the form of a
configuration module =Expect_test_config=. The default module in scope
define no-op hooks that the user can override. =Async= redefines
this module so when =Async= is opened you can write async-aware
expect test.

In addition to =Async.Expect_test_config=, there is an
alternative, =Async.Expect_test_config_with_unit_expect=.  That is
easier to use than =Async.Expect_test_config= because =[%expect]= has
type =unit= rather than =unit Deferred.t=.  So one can write:

#+begin_src ocaml
[%expect foo];
#+end_src

rather than:

#+begin_src ocaml
let%bind () = [%expect foo] in
#+end_src

=Expect_test_config_with_unit_expect= arrived in 2019-06.  We hope to
transition from =Expect_test_config= to
=Expect_test_config_with_unit_expect=, eventually renaming the latter
as the former.

*** LWT

This is what you would need to write expect tests with Lwt:

#+begin_src ocaml
module Lwt_io_run = struct
  type 'a t = 'a Lwt.t
end

module Lwt_io_flush = struct
  type 'a t = 'a Lwt.t
  let return x = Lwt.return x
  let bind x ~f = Lwt.bind x f
  let to_run x = x
end

module Expect_test_config :
  Expect_test_config_types.S
    with module IO_run = Lwt_io_run
     and module IO_flush = Lwt_io_flush = struct
  module IO_run = Lwt_io_run
  module IO_flush = Lwt_io_flush
  let run x = Lwt_main.run (x ())
  let upon_unreleasable_issue = `CR
end
#+end_src

** Comparing Expect-test and unit testing (e.g. =let%test_unit=)

The simple example above can be easily represented as a unit test:

#+begin_src ocaml
let%test_unit "addition" = [%test_result: int] (1 + 2) ~expect:4
#+end_src

So, why would one use Expect-test rather than a unit test?  There are
several differences between the two approaches.

With a unit test, one must write code that explicitly checks that the
actual behavior agrees with the expected behavior.  =%test_result= is
often a convenient way of doing that, but even using that requires:

- creating a value to compare
- writing the type of that value
- having a comparison function on the value
- writing down the expected value

With Expect-test, we can simply add print statements whose output gives
insight into the behavior of the program, and blank =%expect=
attributes to collect the output.  We then run the program to see if
the output is acceptable, and if so, *replace* the original program
with its output.  E.g we might first write our program like this:

#+begin_src ocaml
let%expect_test _ =
  printf "%d" (1 + 2);
  [%expect {||}]
#+end_src

The corrected file would contain:

#+begin_src ocaml
let%expect_test _ =
  printf "%d" (1 + 2);
  [%expect {| 3 |}]
#+end_src

With Expect-test, we only have to write code that prints things that we
care about.  We don't have to construct expected values or write code
to compare them.  We get comparison for free by using diff on the
output.  And a good diff (e.g. patdiff) can make understanding
differences between large outputs substantially easier, much easier
than typical unit-testing code that simply states that two values
aren't equal.

Once an Expect-test program produces the desired expected output and we
have replaced the original program with its output, we now
automatically have a regression test going forward.  Any undesired
change to the output will lead to a mismatch between the source
program and its output.

With Expect-test, the source program and its output are interleaved.  This
makes debugging easier, because we do not have to jump between source
and its output and try to line them up.  Furthermore, when there is a
mismatch, we can simply add print statements to the source program and
run it again.  This gives us interleaved source and output with the
debug messages interleaved in the right place.  We might even insert
additional empty =%%expect= attributes to collect debug messages.

** Implementation

Every =%expect= node in an Expect-test program becomes a point at which
the program output is captured. Once the program terminates, the
captured outputs are matched against the expected outputs, and interleaved with
the original source code to produce the corrected file. Trailing output is appended in a
new =%expect= node.

** Build system integration

Follow the same rules as for [[https://github.com/janestreet/ppx_inline_test][ppx_inline_test]]. Just make sure to
include =ppx_expect.evaluator= as a dependency of the test runner. The
[[https://github.com/janestreet/jane-street-tests][Jane Street tests]] contains a few working examples using oasis.

Dependencies (8)

  1. re >= "1.8.0"
  2. ppxlib >= "0.28.0"
  3. dune >= "2.0.0"
  4. stdio >= "v0.16" & < "v0.17"
  5. ppx_inline_test >= "v0.16" & < "v0.17"
  6. ppx_here >= "v0.16" & < "v0.17"
  7. base >= "v0.16" & < "v0.17"
  8. ocaml >= "4.14.0"

Dev Dependencies

None

  1. ansi-parse >= "0.4.0"
  2. autofonce
  3. autofonce_config
  4. autofonce_core
  5. autofonce_lib
  6. autofonce_m4
  7. autofonce_misc
  8. autofonce_patch
  9. autofonce_share
  10. bio_io >= "0.2.1"
  11. bitpack_serializer
  12. bitwuzla
  13. bitwuzla-c
  14. bitwuzla-cxx
  15. camelot >= "1.3.0"
  16. charInfo_width
  17. combinaml
  18. combinat < "3.0"
  19. ctypes_stubs_js
  20. dap
  21. data-encoding >= "0.6"
  22. dkml-install-runner
  23. dream
  24. dream-pure
  25. drom
  26. drom_lib
  27. drom_toml
  28. dune-action-plugin
  29. electrod >= "0.1.6" & < "0.2.1"
  30. ez_cmdliner >= "0.2.0"
  31. ez_config >= "0.2.0"
  32. ez_file >= "0.2.0"
  33. ez_hash < "0.5.3"
  34. ez_opam_file
  35. ez_search
  36. ez_subst
  37. feather >= "0.2.0"
  38. fiat-p256 < "0.2.0"
  39. fiber >= "3.7.0"
  40. fiber-lwt
  41. GT >= "0.4.0" & < "0.5.0"
  42. gccjit
  43. graphv
  44. graphv_core
  45. graphv_core_lib
  46. graphv_font
  47. graphv_font_js
  48. graphv_font_stb_truetype
  49. graphv_gles2
  50. graphv_gles2_native
  51. graphv_gles2_native_impl
  52. graphv_webgl
  53. graphv_webgl_impl
  54. header-check
  55. hl_yaml
  56. http
  57. http-cookie >= "4.0.0"
  58. http-multipart-formdata >= "2.0.0"
  59. hyper
  60. influxdb >= "0.2.0"
  61. js_of_ocaml >= "3.10.0"
  62. js_of_ocaml-compiler >= "3.4.0"
  63. js_of_ocaml-lwt >= "3.10.0"
  64. js_of_ocaml-ocamlbuild >= "3.10.0" & < "5.0"
  65. js_of_ocaml-ppx >= "3.10.0"
  66. js_of_ocaml-ppx_deriving_json >= "3.10.0"
  67. js_of_ocaml-toplevel >= "3.10.0"
  68. js_of_ocaml-tyxml >= "3.10.0"
  69. kdl
  70. knights_tour
  71. kqueue >= "0.2.0"
  72. learn-ocaml >= "0.16.0"
  73. learn-ocaml-client >= "0.16.0"
  74. little_logger
  75. loga >= "0.0.5"
  76. lsp < "1.8.0" | >= "1.11.3"
  77. m_tree
  78. merge-fmt >= "0.3"
  79. mlt_parser >= "v0.16.0"
  80. module-graph
  81. nloge
  82. nsq >= "0.4.0" & < "0.5.2"
  83. OCanren-ppx >= "0.3.0~alpha1"
  84. ocaml-lsp-server >= "1.15.0-4.14"
  85. ocaml-protoc-plugin >= "1.0.0"
  86. ocluster >= "0.2"
  87. ocp-search
  88. ocplib_stuff >= "0.3.0"
  89. octez-libs
  90. octez-protocol-009-PsFLoren-libs
  91. octez-protocol-010-PtGRANAD-libs
  92. octez-protocol-011-PtHangz2-libs
  93. octez-protocol-012-Psithaca-libs
  94. octez-protocol-013-PtJakart-libs
  95. octez-protocol-014-PtKathma-libs
  96. octez-protocol-015-PtLimaPt-libs
  97. octez-protocol-016-PtMumbai-libs
  98. octez-protocol-017-PtNairob-libs
  99. octez-protocol-018-Proxford-libs
  100. octez-protocol-alpha-libs
  101. octez-shell-libs
  102. odate >= "0.6"
  103. odoc >= "2.0.0"
  104. odoc-parser
  105. omd >= "2.0.0~alpha3"
  106. opam-bin >= "0.9.5"
  107. opam-check-npm-deps
  108. opam_bin_lib >= "0.9.5"
  109. owork
  110. poll
  111. pp
  112. ppx_jane >= "v0.16.0"
  113. ppx_minidebug
  114. ppx_protocol_conv_json >= "5.0.0"
  115. ppx_relit >= "0.2.0"
  116. ppx_ts
  117. psmt2-frontend >= "0.3.0"
  118. pvec
  119. pyml_bindgen
  120. pythonlib >= "v0.16.0"
  121. res_tailwindcss
  122. routes >= "2.0.0"
  123. sarif >= "0.2.1"
  124. sedlex >= "3.1"
  125. seqes < "0.2"
  126. solidity-alcotest
  127. solidity-common
  128. solidity-parser
  129. solidity-test
  130. solidity-typechecker
  131. spawn < "v0.9.0" | >= "v0.14.0"
  132. tdigest >= "2.2.0"
  133. tezos-benchmark
  134. tezos-client-009-PsFLoren >= "14.0"
  135. tezos-client-010-PtGRANAD >= "14.0"
  136. tezos-client-011-PtHangz2 >= "14.0"
  137. tezos-client-012-Psithaca >= "14.0"
  138. tezos-client-013-PtJakart >= "14.0"
  139. tezos-client-014-PtKathma
  140. tezos-client-015-PtLimaPt
  141. tezos-client-016-PtMumbai
  142. tezos-client-017-PtNairob
  143. tezos-client-alpha >= "14.0"
  144. tezos-injector-013-PtJakart
  145. tezos-injector-014-PtKathma
  146. tezos-injector-015-PtLimaPt
  147. tezos-injector-016-PtMumbai
  148. tezos-injector-alpha
  149. tezos-layer2-utils-016-PtMumbai
  150. tezos-layer2-utils-017-PtNairob
  151. tezos-micheline >= "14.0"
  152. tezos-shell >= "15.0"
  153. tezos-smart-rollup-016-PtMumbai
  154. tezos-smart-rollup-017-PtNairob
  155. tezos-smart-rollup-alpha
  156. tezos-smart-rollup-layer2-016-PtMumbai
  157. tezos-smart-rollup-layer2-017-PtNairob
  158. tezos-stdlib >= "14.0"
  159. tezos-tx-rollup-013-PtJakart
  160. tezos-tx-rollup-014-PtKathma
  161. tezos-tx-rollup-015-PtLimaPt
  162. tezos-tx-rollup-alpha
  163. toplevel_expect_test >= "v0.16.0"
  164. torch
  165. travesty >= "0.3.0" & < "0.6.0" | >= "0.7.0"
  166. um-abt
  167. wtr >= "2.0.0"
  168. wtr-ppx
  169. zanuda

Conflicts

None