Seq: Difference between revisions

From HaskellWiki
No edit summary
mNo edit summary
 
(29 intermediate revisions by 3 users not shown)
Line 1: Line 1:
The <tt>seq</tt> function is the most basic method of introducing strictness to a Haskll program. <tt>seq :: a -> b -> b</tt> takes two arguments of any type, and returns the second. However, it also has the important property that it is magically strict in its first argument. In essence, <tt>seq</tt> is defined by the following two equations:
<span>{{DISPLAYTITLE:seq}}</span>
 
While its name might suggest otherwise, the <code>seq</code> function's purpose is to introduce ''strictness'' to a Haskell program. As indicated by its type signature:
 
<haskell>
seq :: a -> b -> b
</haskell>
 
it takes two arguments of any type, and returns the second. However, it also has the important property that it is always strict in its first argument. In essence, <code>seq</code> is defined by the following two equations:
 
<haskell>
⊥ `seq` b        = ⊥
a `seq` b | a ≠ ⊥ = b
</haskell>
 
(See [[Bottom]] for an explanation of the <code>⊥</code> symbol.)
__NOTOC__
 
== History ==
 
The need to specify an order of evaluation in otherwise-nonstrict programming languages has appeared before:
 
<blockquote>
We can use <code>VAL</code> to define a "sequential evaluation" operator which evaluates
its first argument and then returns its second:
 
<pre>
a ; b = (λx. b) val a
</pre>
 
and we can define other functions which control evaluation order [...]
 
:<small>[https://www.cs.ox.ac.uk/files/3309/PRG40.pdf The Design and Implementation of Programming Languages] (page 88 of 159)</small>.
</blockquote>
 
<blockquote>
<b>10.1. Forcing Sequential Execution</b><br>
One of the primary complications of side-effects is that one must be able to <i>order their
execution</i>. In conventional languages the flow of control explicit in the semantics accomplishes
this ordering. In ALFL, however, there is no explicit flow of control (a feature!), and so it is
necessary to introduce a special operator to do this if we are to admit any form of side-effects.
 
:<small>[https://www.cs.yale.edu/publications/techreports/tr322.pdf ALFL Reference Manual and Programmer's Guide] (page 17 of 29).</small>
</blockquote>
 
<blockquote>
[...] we want some control over the evaluation
order in order to make the program behave correctly in time. In this case there
is a straightforward solution. What is needed is a built-in function <code>seq</code>, with
the behavior
 
<pre>
seq ⊥ y = ⊥
seq x y = y
</pre>
 
Pragmatically,
 
:<code>seq</code> evaluates its first argument, discards it,
:and then returns its second argument.
 
:<small>[https://simon.peytonjones.org/assets/pdfs/slpj-book-1987-searchable.pdf The Implementation of Functional Programming Languages], Simon Peyton Jones (page 413 of 460).</small>
</blockquote>
 
<blockquote>
`seq' applied to two values, returns the second but checks that the first value is not completely undefined. Sometimes needed, e.g. to ensure correct synchronisation in interactive programs.
<pre>
> seq :: *->**->** ||defined internally
</pre>
 
:<small>[https://web.archive.org/web/20080602040708/https://www.cs.kent.ac.uk/people/staff/dat/miranda/stdenv.html The Miranda Standard Environment © Research Software Limited 1989]</small>
</blockquote>
 
<blockquote>
<pre>
/*
**      seq:            sequentailly evaluate two values and return the second.
**                      The first value is evaluated to WHNF.
*/
</pre>
 
:<small>src/lib/seq.m - [https://web.archive.org/web/20240920070532/https://ftp.gwdg.de/pub/languages/funet.fi/ml/lml/ LML 0.99]</small>
</blockquote>
 
and in Haskell since at least 1996:
 
<blockquote>
The <code>seq</code> combinator implements sequential composition. When the expression <code>e1 ‘seq‘ e2</code> is evaluated, <code>e1</code>
is evaluated to weak head normal form first, and then the value of <code>e2</code> is returned. In the following parallel <code>nfib</code>
function, <code>seq</code> is used to force the evaluation of <code>n2</code> before the addition takes place. This is because Haskell does not
specify which operand is evaluated first, and if <code>n1</code> was evaluated before <code>n2</code>, there would be no parallelism.
 
<haskell>
nfib :: Int -> Int
nfib n | n <= 1    = 1
      | otherwise = n1 ‘par‘ (n2 ‘seq‘ n1 + n2 + 1)
                    where
                      n1 = nfib (n-1)
                      n2 = nfib (n-2)
</haskell>
 
:<small>[https://web.archive.org/web/20040228202402/http://www.dcs.gla.ac.uk/fp/workshops/fpw96/Trinder.pdf Accidents always Come in Threes: A Case Study of Data-intensive Programs in Parallel Haskell] (page 2 of 14).</small>
</blockquote>
 
...the same year <code>seq</code> was introduced in Haskell 1.3 as a method of the (now-abandoned) <code>Eval</code> type class:
 
<haskell>
class Eval a where
  strict :: (a -> b) -> a -> b
  seq    :: a -> b -> b
 
  strict f x = x ‘seq‘ f x
</haskell>
 
However, despite that need by the time Haskell 98 was released <code>seq</code> had been reduced to a primitive strictness definition. But in 2009, all doubts about the need for a primitive sequencing definition were vanquished:
 
<blockquote>
<b>2.1 The need for</b> <code>pseq</code>
 
The <code>pseq</code> combinator is used for sequencing; informally, it
evaluates its first argument to weak-head normal form, and then evaluates
its second argument, returning the value of its second argument.
Consider this definition of <code>parMap</code>:
 
<haskell>
parMap f []    = []
parMap f (x:xs) = y ‘par‘ (ys ‘pseq‘ y:ys)
  where y  = f x
        ys = parMap f xs
</haskell>
 
The intention here is to spark the evaluation of <code>f x</code>, and then
evaluate <code>parMap f xs</code>, before returning the new list <code>y:ys</code>. The
programmer is hoping to express an <i>ordering</i> of the evaluation: <i>first</i>
spark <code>y</code>, <i>then</i> evaluate <code>ys</code>.
 
:<small>[https://www.cs.tufts.edu/~nr/cs257/archive/simon-peyton-jones/multicore-ghc.pdf Runtime Support for Multicore Haskell] (page 2 of 12).</small>
</blockquote>
 
Alas, this confirmation failed to influence Haskell 2010 - to this day, <code>seq</code> remains just a primitive strictness definition. So for enhanced confusion the only major Haskell implementation still in widespread use now provides both <code>seq</code> and <code>pseq</code>.
 
== Demystifying <code>seq</code> ==
 
A common misconception regarding <code>seq</code> is that <code>seq x</code> "evaluates" <code>x</code>. Well, sort of. <code>seq</code> doesn't evaluate anything just by virtue of existing in the source file, all it does is introduce an artificial data dependency of one value on another: when the result of <code>seq</code> is evaluated, the first argument must also (sort of; see below) be evaluated. As an example, suppose <code>x :: Integer</code>, then <code>seq x b</code> behaves essentially like <code>if x == 0 then b else b</code> – unconditionally equal to <tt>b</tt>, but forcing <code>x</code> along the way. In particular, the expression <code>x `seq` x</code> is completely redundant, and always has exactly the same effect as just writing <code>x</code>.
 
Strictly speaking, the two equations of <code>seq</code> are all it must satisfy, and if the compiler can statically prove that the first argument is not ⊥, or that its second argument ''is'', it doesn't have to evaluate anything to meet its obligations. In practice, this almost never happens, and would probably be considered highly counterintuitive behaviour on the part of GHC (or whatever else you use to run your code). So for example, in <code>seq a b</code> it is perfectly legitimate for <code>seq</code> to:
 
:1. evaluate <code>b</code> - its ''second'' argument,
 
:2. before evaluating <code>a</code> - its first argument,
 
:3. then returning <code>b</code>.
 
In this larger example:


<haskell>
<haskell>
⊥ `seq` b =
let x = ... in
a `seq` b = b
let y = sum [0..47] in
x `seq` 3 + y + y^2
</haskell>
</haskell>


A common misconception regarding <tt>seq</tt> is that <tt>seq x</tt> "evaluates" <tt>x</tt>. Well, sort of. <tt>seq</tt> doesn't evaluate anything just by virtue of existing in the source file, all it does is introduce an artificial data dependency of one value on another: when the result of <tt>seq</tt> is evaluated, the first argument must also be evaluated. As an example, suppose <tt>x :: Integer</tt>, then <tt>seq x b</tt> behaves essentially like <tt>if x == 0 then b else b</tt> – unconditionally equal to <tt>b</tt>, but forcing <tt>x</tt> along the way. In particular, the expression <tt>x `seq` x</tt> is completely redundant, and always has exactly the same effect as just writing <tt>x</tt>.
<code>seq</code> immediately evaluating its second argument (<code>3 + y + y^2</code>) avoids having to allocate space to store <code>y</code>:


Strictly speaking, the two equations of <tt>seq</tt> are all it must satisfy, and if the compiler can statically prove that the first argument is not ⊥, it doesn't have to evaluate it to meet its obligations. In practice, this almost never happens, and would probably be considered highly counterintuitive behaviour on the part of GHC (or whatever else you use to run your code). However, it ''is'' the case that evaluating <tt>b</tt> and ''then'' <tt>a</tt>, then returning <tt>b</tt> is a perfectly legitimate thing to do; it is to prevent this ambiguity that <tt>pseq</tt> was invented, but that's another story.
<haskell>
let x = ... in
case sum [0..47] of
  y -> x `seq` 3 + y + y^2
</haskell>


=== Common uses of <tt>seq</tt> ===
However, sometimes this ambiguity is undesirable, hence the need for <code>pseq</code>.


<tt>seq</tt> is typically used in the semantic interpretation of other strictness techniques, like strictness annotations in data types, or GHC's <tt>BangPatterns</tt> extension. For example, the meaning of this:
== Common uses of <code>seq</code> ==
 
<code>seq</code> is typically used in the semantic interpretation of other strictness techniques, like strictness annotations in data types, or GHC's <tt>BangPatterns</tt> extension. For example, the meaning of this:


<haskell>
<haskell>
Line 22: Line 182:
<haskell>
<haskell>
f x y | x `seq` y `seq` False = undefined
f x y | x `seq` y `seq` False = undefined
  | otherwise = z
      | otherwise = z
</haskell>
</haskell>


although that literal translation may not actually take place.
although that literal translation may not actually take place.


<tt>seq</tt> is frequently used with accumulating parameters to ensure that they don't become huge thunks, which will be forced at the end anyway. For example, strict foldl:
<code>seq</code> is frequently used with accumulating parameters to ensure that they don't become huge thunks, which will be forced at the end anyway. For example, strict <code>foldl</code>:


<haskell>
<haskell>
Line 44: Line 204:
which is useful for some of the same reasons.
which is useful for some of the same reasons.


=== Controversy! ===
== Controversy? ==


Note that <tt>seq</tt> is the ''only'' way to force evaluation of a value with a function type. It is the only reason why Haskell programs are able to distinguish between the following two values:
The presence of <code>seq</code> in Haskell does have some disadvantages:
:{|
|- style="vertical-align:top;"
|1.
|It is the only reason why Haskell programs are able to distinguish between the following two values:


<haskell>
<haskell>
undefined :: a -> b
undefined       :: a -> b
const undefined :: a -> b
const undefined :: a -> b
</haskell>
</haskell>


This violates the principle from lambda calculus of extensionality of functions, or eta-conversion, because <tt>f</tt> and <tt>\x -> f x</tt> are distinct functions, even though they return the same output for ''every'' input. For this reason, <tt>seq</tt>, and this distinction, is sometimes ignored e.g. when assessing the correctness of [[Correctness of short cut fusion|optimisation techniques]] or type class instances.
This violates the principle of extensionality of functions, or eta-conversion from the lambda calculus, because <code>f</code> and <code>\x -> f x</code> are distinct functions, even though they return the same output for every input.
|- style="vertical-align:top;"
|2.
|It can invalidate [[Correctness of short cut fusion|optimisation techniques]] which would normally be safe, causing the following two expressions to differ:
<haskell>
foldr ⊥ 0 (build seq) = foldr ⊥ 0 (seq (:) []) = foldr ⊥ 0 [] = 0
seq ⊥ 0                                                      = ⊥
</haskell>
 
This weakens the ability to use parametricity, which implies <code>foldr k z (build g) == g k z</code> for suitable values of <code>g</code>, <code>k</code> and <code>z</code>.
|- style="vertical-align:top;"
|3.
|It can invalidate laws which would otherwise hold, also causing expressions to have differing results:
 
<haskell>
seq (⊥ >>= return :: State s a) True = True
seq (⊥ :: State s a) True            = ⊥
</haskell>
 
This violates the first [[Monad laws|monad law]], that <code>m >>= return == m</code>.
|}
 
But <code>seq</code> (sequential or otherwise) isn't alone in causing such difficulties:
:{|
|- style="vertical-align:top;"
|1.
|When combined with call-by-need semantics, the use of weak-head normal form [https://www.pauldownen.com/publications/first-class-call-stacks.pdf is also detrimental to extensionality].
|- style="vertical-align:top;"
|2.
|When combined with GADTs, the associated map functions which uphold the functor laws [https://arxiv.org/pdf/2105.03389v3 is also problematic for parametricity].
|- style="vertical-align:top;"
|3.
|The ability to define the fixed-point combinator in Haskell using recursion:
 
<haskell>
yet :: (a -> a) -> a
yet f = f (yet f)
</haskell>
 
means [https://dl.acm.org/doi/pdf/10.1145/99370.99404 parametricity is further restricted].
|- style="vertical-align:top;"
|4.
|Strictness must be considered when [https://ris.utwente.nl/ws/portalfiles/portal/284115323/thesis_M_Fokkinga.pdf deriving laws for various recursive algorithms] (see chapter 6).
|- style="vertical-align:top;"
|5.
|Similar to <code>seq</code> and <code>⊥</code>, the use of division and zero present their own challenges!
|}
 
Therefore all such claims against <code>seq</code> (or calls for its outright removal) should be examined <i>in this context</i>.
 
=== Amelioration ===
 
;Parametricity
 
In 2004,
[https://web.archive.org/web/20130429222325/https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.71.1777&rep=rep1&type=pdf Johann and Voigtländer]
provided a solution to the loss of parametricity caused by <code>seq</code>:
 
<blockquote>
<small>(first page)</small>
 
[...] parametricity results <i>can</i> be recovered
in the presence of <code>seq</code> by restricting attention to left-closed, total, and admissible relations instead.
</blockquote>
 
;Extensionality
 
In 2016,
[https://www.pauldownen.com/publications/first-class-call-stacks.pdf Johnson-Freyd, Downen and Ariola]
suggested the use of head evaluation instead of weak-head evaluation to
preserve (functional) extensionality, and not just in the call-by-name
lambda calculus:
 
<blockquote>
<small>(page 16 of 18)</small>
 
[...] it may be that effective approaches to head evaluation such as
those in this paper are of interest even in call-by-value languages or in settings (such as Haskell with
<code>seq</code>) that lack the <i>η</i> axiom.
</blockquote>
 
One complication is the fact that <i>compiled</i> Haskell functions are
usually implemented directly as machine code, which would have to be
<q>run</q> somehow, without arguments, to obtain the necessary result.
Fortunately, using head evaluation everywhere in Haskell isn't necessary.
Haskell lacks the <i>η</i> axiom because of the <code>seq</code> function
(and <i>its</i> use of weak-head evaluation). Thus restoring extensionality
in Haskell only requires head evaluation to be used by <code>seq</code>.
 
For discovering what changes <code>seq</code> requires:
 
{|
|- style="vertical-align:top;"
|&nbsp;&nbsp;<code>(\ x -> ⊥ x) False</code><br>
= <code>⊥ False</code><br>
= <code>⊥</code>
|&nbsp;&nbsp;
|&nbsp;&nbsp;<code>(\ x -> not x) False</code><br>
= <code>not False</code><br>
= <code>True</code>
|&nbsp;&nbsp;
|&nbsp;&nbsp;<code>(\ x -> (⊥ . not) x) False</code><br>
= <code>(⊥ . not) False</code><br>
= <code>⊥ (not False)</code><br>
= <code>⊥</code>
|&nbsp;&nbsp;
|&nbsp;&nbsp;<code>(\ x -> (seq True . not) x) False</code><br>
= <code>(seq True . not) False</code><br>
= <code>seq True (not False)</code><br>
= <code>not False</code><br>
= <code>True</code>
|-
|Hence:
|
|
|
|
|
|
|-
|&nbsp;&nbsp;<code>seq (\ x -> ⊥ x) y</code><br>
= <code>⊥</code>
|
|&nbsp;&nbsp;<code>seq (\ x -> not x) y</code><br>
= <code>y</code>
|
|&nbsp;&nbsp;<code>seq (\ x -> (⊥ . not) x) y</code><br>
= <code>⊥</code>
|
|&nbsp;&nbsp;<code>seq (\ x -> (seq True . not) x) y</code><br>
= <code>y</code>
|-
|Therefore:
|
|
|
|
|
|
|- style="vertical-align:top;"
|&nbsp;&nbsp;<code>(\ x -> ⊥ x) UKN</code><br>
= <code>⊥ UKN</code><br>
= <code>⊥</code>
|
|&nbsp;&nbsp;<code>(\ x -> not x) UKN</code><br>
= <code>not UKN</code><br>
= <code>UKN</code>
|
|&nbsp;&nbsp;<code>(\ x -> (⊥ . not) x) UKN</code><br>
= <code>(⊥ . not) UKN</code><br>
= <code>⊥ (not UKN)</code><br>
= <code>⊥</code>
|
|&nbsp;&nbsp;<code>(\ x -> (seq UKN . not) x) UKN</code><br>
= <code>(seq UKN . not) UKN</code><br>
= <code>seq UKN (not UKN)</code><br>
= <code>not UKN</code><br>
= <code>UKN</code>
|}
 
with the ordinary Haskell values <code>False</code> and <code>True</code>
being replaced by the generic <code>UKN</code> ≠ <code>⊥</code>.
 
For <code>(seq UKN . not) UKN</code>, the intermediate steps:
 
* <code>seq UKN (not UKN)</code> replaced by <code>not UKN</code>
 
* <code>not UKN</code> replaced by <code>UKN</code>
 
suggests an exception-driven implementation, with <code>seq</code> catching
<code>UKN</code> and the implementation intervening directly if an attempt
to evaluate <code>UKN</code> occurs (by replacing the original application
with <code>UKN</code>).
 
Using a context more suitable for exceptions:
 
<pre>
data Ukn = Ukn
 
instance Show Ukn where show Ukn = "unknown value"
instance Exception Ukn
 
seqM x y = catch (evalM x >> return y) (\ (_ :: Ukn) -> return y)
seq x y  = runM (seqM x y)
 
ukn      = throw Ukn
ukn'    = throw Ukn
</pre>
 
to verify that example, because it does use the <code>Ukn</code> exception
twice:
 
&nbsp;&nbsp;<code>(\ x -> (seq ukn' . not) x) ukn</code><br>
= <code>(seq ukn' . not) ukn</code><br>
= <code>seq ukn' (not ukn)</code><br>
= <code>not ukn</code><br>
= <code>ukn</code>
 
then <code>evalM</code> can also be <q>defined</q>:
 
<pre>
evalM x =
do a <- evalWHNF x
    e <- tryFunction a
    case e of
      Left a          -> return a
      Right (h, arity) -> do r <- applyM h (replicate arity (throw Ukn))
                            evalM r
</pre>
 
though the need for <code>tryFunction</code> to inspect its argument so
intensively to return a variadic value <code>h</code> is further evidence
that it would be more appropriate to define <code>seq</code> directly
within the implementation.
 
So for the price of a few extra definitions in the implementation (or reusing
existing ones with sufficient ingenuity), <code>seq</code> and the
extensionality of the <i>η</i> axiom can both exist in Haskell.
 
== See also ==
 
* [https://mail.haskell.org/pipermail/glasgow-haskell-users/2006-October/011430.html Parallelism in 6.6 and seq vs. pseq], Haskell mail archives. <!-- 2006 -->
 
* [https://mail.haskell.org/pipermail/haskell-cafe/2011-April/091043.html Questioning seq], Haskell mail archives. <!-- 2011 -->
 
* [https://gitlab.haskell.org/ghc/ghc/-/wikis/commentary/compiler/seq-magic seq &#91;bedlam&#93;], the GHC wiki. <!-- 2011 -->
 
[[Category:Glossary]]

Latest revision as of 00:00, 20 March 2025

While its name might suggest otherwise, the seq function's purpose is to introduce strictness to a Haskell program. As indicated by its type signature:

seq :: a -> b -> b

it takes two arguments of any type, and returns the second. However, it also has the important property that it is always strict in its first argument. In essence, seq is defined by the following two equations:

⊥ `seq` b         = ⊥
a `seq` b | a ≠ ⊥ = b

(See Bottom for an explanation of the symbol.)


History

The need to specify an order of evaluation in otherwise-nonstrict programming languages has appeared before:

We can use VAL to define a "sequential evaluation" operator which evaluates its first argument and then returns its second:

a ; b = (λx. b) val a

and we can define other functions which control evaluation order [...]

The Design and Implementation of Programming Languages (page 88 of 159).

10.1. Forcing Sequential Execution
One of the primary complications of side-effects is that one must be able to order their execution. In conventional languages the flow of control explicit in the semantics accomplishes this ordering. In ALFL, however, there is no explicit flow of control (a feature!), and so it is necessary to introduce a special operator to do this if we are to admit any form of side-effects.

ALFL Reference Manual and Programmer's Guide (page 17 of 29).

[...] we want some control over the evaluation order in order to make the program behave correctly in time. In this case there is a straightforward solution. What is needed is a built-in function seq, with the behavior

 seq ⊥ y = ⊥
 seq x y = y

Pragmatically,

seq evaluates its first argument, discards it,
and then returns its second argument.
The Implementation of Functional Programming Languages, Simon Peyton Jones (page 413 of 460).

`seq' applied to two values, returns the second but checks that the first value is not completely undefined. Sometimes needed, e.g. to ensure correct synchronisation in interactive programs.

> seq :: *->**->** ||defined internally
The Miranda Standard Environment © Research Software Limited 1989
/*
**      seq:            sequentailly evaluate two values and return the second.
**                      The first value is evaluated to WHNF.
*/
src/lib/seq.m - LML 0.99

and in Haskell since at least 1996:

The seq combinator implements sequential composition. When the expression e1 ‘seq‘ e2 is evaluated, e1 is evaluated to weak head normal form first, and then the value of e2 is returned. In the following parallel nfib function, seq is used to force the evaluation of n2 before the addition takes place. This is because Haskell does not specify which operand is evaluated first, and if n1 was evaluated before n2, there would be no parallelism.

nfib :: Int -> Int
nfib n | n <= 1    = 1
       | otherwise = n1 ‘par‘ (n2 ‘seq‘ n1 + n2 + 1)
                     where
                       n1 = nfib (n-1)
                       n2 = nfib (n-2)
Accidents always Come in Threes: A Case Study of Data-intensive Programs in Parallel Haskell (page 2 of 14).

...the same year seq was introduced in Haskell 1.3 as a method of the (now-abandoned) Eval type class:

class Eval a where
   strict :: (a -> b) -> a -> b
   seq    :: a -> b -> b

   strict f x = x ‘seq‘ f x

However, despite that need by the time Haskell 98 was released seq had been reduced to a primitive strictness definition. But in 2009, all doubts about the need for a primitive sequencing definition were vanquished:

2.1 The need for pseq

The pseq combinator is used for sequencing; informally, it evaluates its first argument to weak-head normal form, and then evaluates its second argument, returning the value of its second argument. Consider this definition of parMap:

parMap f []     = []
parMap f (x:xs) = y ‘par‘ (ys ‘pseq‘ y:ys)
   where y  = f x
         ys = parMap f xs

The intention here is to spark the evaluation of f x, and then evaluate parMap f xs, before returning the new list y:ys. The programmer is hoping to express an ordering of the evaluation: first spark y, then evaluate ys.

Runtime Support for Multicore Haskell (page 2 of 12).

Alas, this confirmation failed to influence Haskell 2010 - to this day, seq remains just a primitive strictness definition. So for enhanced confusion the only major Haskell implementation still in widespread use now provides both seq and pseq.

Demystifying seq

A common misconception regarding seq is that seq x "evaluates" x. Well, sort of. seq doesn't evaluate anything just by virtue of existing in the source file, all it does is introduce an artificial data dependency of one value on another: when the result of seq is evaluated, the first argument must also (sort of; see below) be evaluated. As an example, suppose x :: Integer, then seq x b behaves essentially like if x == 0 then b else b – unconditionally equal to b, but forcing x along the way. In particular, the expression x `seq` x is completely redundant, and always has exactly the same effect as just writing x.

Strictly speaking, the two equations of seq are all it must satisfy, and if the compiler can statically prove that the first argument is not ⊥, or that its second argument is, it doesn't have to evaluate anything to meet its obligations. In practice, this almost never happens, and would probably be considered highly counterintuitive behaviour on the part of GHC (or whatever else you use to run your code). So for example, in seq a b it is perfectly legitimate for seq to:

1. evaluate b - its second argument,
2. before evaluating a - its first argument,
3. then returning b.

In this larger example:

let x = ... in
let y = sum [0..47] in
x `seq` 3 + y + y^2

seq immediately evaluating its second argument (3 + y + y^2) avoids having to allocate space to store y:

let x = ... in
case sum [0..47] of
  y -> x `seq` 3 + y + y^2

However, sometimes this ambiguity is undesirable, hence the need for pseq.

Common uses of seq

seq is typically used in the semantic interpretation of other strictness techniques, like strictness annotations in data types, or GHC's BangPatterns extension. For example, the meaning of this:

f !x !y = z

is this:

f x y | x `seq` y `seq` False = undefined
      | otherwise = z

although that literal translation may not actually take place.

seq is frequently used with accumulating parameters to ensure that they don't become huge thunks, which will be forced at the end anyway. For example, strict foldl:

foldl' :: (a -> b -> a) -> a -> [b] -> a
foldl' _ z [] = z
foldl' f z (x:xs) = let z' = f z x in z' `seq` foldl' f z' xs

It's also used to define strict application:

($!) :: (a -> b) -> a -> b
f $! x = x `seq` f x

which is useful for some of the same reasons.

Controversy?

The presence of seq in Haskell does have some disadvantages:

1. It is the only reason why Haskell programs are able to distinguish between the following two values:
undefined       :: a -> b
const undefined :: a -> b

This violates the principle of extensionality of functions, or eta-conversion from the lambda calculus, because f and \x -> f x are distinct functions, even though they return the same output for every input.

2. It can invalidate optimisation techniques which would normally be safe, causing the following two expressions to differ:
foldr ⊥ 0 (build seq) = foldr ⊥ 0 (seq (:) []) = foldr ⊥ 0 [] = 0
seq ⊥ 0                                                       = ⊥

This weakens the ability to use parametricity, which implies foldr k z (build g) == g k z for suitable values of g, k and z.

3. It can invalidate laws which would otherwise hold, also causing expressions to have differing results:
seq (⊥ >>= return :: State s a) True = True
seq (⊥ :: State s a) True            = ⊥

This violates the first monad law, that m >>= return == m.

But seq (sequential or otherwise) isn't alone in causing such difficulties:

1. When combined with call-by-need semantics, the use of weak-head normal form is also detrimental to extensionality.
2. When combined with GADTs, the associated map functions which uphold the functor laws is also problematic for parametricity.
3. The ability to define the fixed-point combinator in Haskell using recursion:
yet :: (a -> a) -> a
yet f = f (yet f)

means parametricity is further restricted.

4. Strictness must be considered when deriving laws for various recursive algorithms (see chapter 6).
5. Similar to seq and , the use of division and zero present their own challenges!

Therefore all such claims against seq (or calls for its outright removal) should be examined in this context.

Amelioration

Parametricity

In 2004, Johann and Voigtländer provided a solution to the loss of parametricity caused by seq:

(first page)

[...] parametricity results can be recovered in the presence of seq by restricting attention to left-closed, total, and admissible relations instead.

Extensionality

In 2016, Johnson-Freyd, Downen and Ariola suggested the use of head evaluation instead of weak-head evaluation to preserve (functional) extensionality, and not just in the call-by-name lambda calculus:

(page 16 of 18)

[...] it may be that effective approaches to head evaluation such as those in this paper are of interest even in call-by-value languages or in settings (such as Haskell with seq) that lack the η axiom.

One complication is the fact that compiled Haskell functions are usually implemented directly as machine code, which would have to be run somehow, without arguments, to obtain the necessary result. Fortunately, using head evaluation everywhere in Haskell isn't necessary. Haskell lacks the η axiom because of the seq function (and its use of weak-head evaluation). Thus restoring extensionality in Haskell only requires head evaluation to be used by seq.

For discovering what changes seq requires:

  (\ x -> ⊥ x) False

= ⊥ False
=

     (\ x -> not x) False

= not False
= True

     (\ x -> (⊥ . not) x) False

= (⊥ . not) False
= ⊥ (not False)
=

     (\ x -> (seq True . not) x) False

= (seq True . not) False
= seq True (not False)
= not False
= True

Hence:
  seq (\ x -> ⊥ x) y

=

  seq (\ x -> not x) y

= y

  seq (\ x -> (⊥ . not) x) y

=

  seq (\ x -> (seq True . not) x) y

= y

Therefore:
  (\ x -> ⊥ x) UKN

= ⊥ UKN
=

  (\ x -> not x) UKN

= not UKN
= UKN

  (\ x -> (⊥ . not) x) UKN

= (⊥ . not) UKN
= ⊥ (not UKN)
=

  (\ x -> (seq UKN . not) x) UKN

= (seq UKN . not) UKN
= seq UKN (not UKN)
= not UKN
= UKN

with the ordinary Haskell values False and True being replaced by the generic UKN.

For (seq UKN . not) UKN, the intermediate steps:

  • seq UKN (not UKN) replaced by not UKN
  • not UKN replaced by UKN

suggests an exception-driven implementation, with seq catching UKN and the implementation intervening directly if an attempt to evaluate UKN occurs (by replacing the original application with UKN).

Using a context more suitable for exceptions:

data Ukn = Ukn

instance Show Ukn where show Ukn = "unknown value"
instance Exception Ukn

seqM x y = catch (evalM x >> return y) (\ (_ :: Ukn) -> return y)
seq x y  = runM (seqM x y)

ukn      = throw Ukn
ukn'     = throw Ukn

to verify that example, because it does use the Ukn exception twice:

  (\ x -> (seq ukn' . not) x) ukn
= (seq ukn' . not) ukn
= seq ukn' (not ukn)
= not ukn
= ukn

then evalM can also be defined:

evalM x =
 do a <- evalWHNF x
    e <- tryFunction a
    case e of
      Left a           -> return a
      Right (h, arity) -> do r <- applyM h (replicate arity (throw Ukn))
                             evalM r

though the need for tryFunction to inspect its argument so intensively to return a variadic value h is further evidence that it would be more appropriate to define seq directly within the implementation.

So for the price of a few extra definitions in the implementation (or reusing existing ones with sufficient ingenuity), seq and the extensionality of the η axiom can both exist in Haskell.

See also