<div>[[Category:Composition]]<br />
[[Category:Applicative Functor]]<br />
[[Category:Libraries]]<br />
[[Category:Packages]]<br />
[[Category:Type-level programming]]<br />
<br />
== Abstract ==<br />
<br />
'''TypeCompose''' provides some classes & instances for forms of type composition, as well as some modules that haven't found another home.<br />
<br />
Besides this wiki page, here are more ways to find out about TypeCompose:<br />
* Visit the [http://hackage.haskell.org/cgi-bin/hackage-scripts/package/TypeCompose Hackage page] for library documentation and to download & install.<br />
* Or install with <tt>cabal install TypeCompose</tt>.<br />
* Get the code repository: <tt>darcs get http://code.haskell.org/~conal/code/TypeCompose</tt>.<br />
<!--* See the [[TypeCompose/Versions| version history]].--><br />
<br />
== Type composition ==<br />
<br />
The <hask>Control.Compose</hask> module includes<br />
* Various type compositions (unary/unary, binary/unary, etc.). Most are from [http://www.soi.city.ac.uk/~ross/papers/Applicative.html Applicative Programming with Effects]. In particular, <hask>g `O` f</hask> composes functors into functors and applicative functors (AFs) into AFs. (In contrast, monads do not in general compose.) Composition makes AF-based programming simple and elegant, partly because we don't need an AF counterpart to monad transformers.<br />
* Cofunctors (contravariant functors). Great for "consumer" types, just as functors suit "producer" (container) types. There are several composition options.<br />
* Type argument flip. Handy for cofunctors: use <hask>Flip (->) o</hask>, for <hask>(-> o)</hask>.<br />
* Constructor in pairs: <hask>(f a, g a)</hask>.<br />
* Constructor in arrows/functions: <hask>f a ~> g a</hask>.<br />
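<br />
As a taste of the first point, here is a minimal sketch of why functor and AF composition works. The names <hask>O</hask> and <hask>unO</hask> echo the library's, but treat this as an illustrative reconstruction rather than TypeCompose's exact code:<br />
<br />
<haskell><br />
newtype O g f a = O { unO :: g (f a) }<br />
<br />
-- composing two functors yields a functor<br />
instance (Functor g, Functor f) => Functor (O g f) where<br />
  fmap h (O gfa) = O (fmap (fmap h) gfa)<br />
<br />
-- composing two applicative functors yields an applicative functor<br />
instance (Applicative g, Applicative f) => Applicative (O g f) where<br />
  pure = O . pure . pure<br />
  O h <*> O x = O (fmap (<*>) h <*> x)<br />
</haskell><br />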
<br />
== Other features ==<br />
<br />
=== Composable bijections ===<br />
<br />
Given all the type constructors and compositions of them, I found myself writing some pretty awkward code to wrap & unwrap through multiple layers. Composable bijections help a lot.<br />
<br />
The <hask>Data.Bijection</hask> module is inspired by [http://citeseer.ist.psu.edu/alimarine05there.html There and Back Again: Arrows for Invertible Programming], though done here in a less general setting.<br />
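<br />
For illustration, here is a sketch specialized to ordinary functions (the module itself is parameterized over an arrow-like type; these names are illustrative, not the module's actual API):<br />
<br />
<haskell><br />
-- a pair of mutually inverse functions<br />
data Bijection a b = Bi { to :: a -> b, from :: b -> a }<br />
<br />
inverse :: Bijection a b -> Bijection b a<br />
inverse (Bi t f) = Bi f t<br />
<br />
-- bijections compose, inverting in the opposite order<br />
compose :: Bijection b c -> Bijection a b -> Bijection a c<br />
compose (Bi t2 f2) (Bi t1 f1) = Bi (t2 . t1) (f1 . f2)<br />
</haskell><br />
<br />
Wrapping and unwrapping through several newtype layers then becomes a single composed bijection instead of hand-written plumbing in both directions.<br />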
<br />
=== Pair- & function-like types ===<br />
<br />
The <hask>Data.Zip</hask> and <hask>Data.Lambda</hask> patterns emerged while working on [[DeepArrow]] and [[Eros]]. <hask>Data.Zip</hask> generalizes <hask>zip</hask> and <hask>unzip</hask> from <hask>[]</hask> to other functors, providing variants of type <hask>f a -> f b -> f (a,b)</hask> and <hask>f (a,b) -> (f a, f b)</hask>. <hask>Data.Lambda</hask> is similar, with classes for lambda-like constructions.<br />
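<br />
A sketch of what such a generalization can look like (hypothetical class and method names, not the module's actual API):<br />
<br />
<haskell><br />
class Functor f => Zip f where<br />
  zipF :: f a -> f b -> f (a, b)<br />
  unzipF :: f (a, b) -> (f a, f b)<br />
  unzipF ps = (fmap fst ps, fmap snd ps)  -- unzip comes for free<br />
<br />
instance Zip [] where<br />
  zipF = zip<br />
<br />
instance Zip Maybe where<br />
  zipF (Just a) (Just b) = Just (a, b)<br />
  zipF _ _ = Nothing<br />
</haskell><br />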
<br />
For example uses of <hask>Pair</hask> and <hask>Lambda</hask>, see [[TV]] and [[Eros]].<br />
<br />
=== References ===<br />
<br />
Monads with references. Direct rip-off from [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.23.145 Global Variables in Haskell].<br />
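<br />
A sketch of the idea, with hypothetical names, shown with an <hask>IO</hask>/<hask>IORef</hask> instance:<br />
<br />
<haskell><br />
{-# LANGUAGE MultiParamTypeClasses, FunctionalDependencies #-}<br />
<br />
import Data.IORef<br />
<br />
-- a monad m with an associated reference type r<br />
class Monad m => RefMonad m r | m -> r where<br />
  newRef   :: a -> m (r a)<br />
  readRef  :: r a -> m a<br />
  writeRef :: r a -> a -> m ()<br />
<br />
instance RefMonad IO IORef where<br />
  newRef   = newIORef<br />
  readRef  = readIORef<br />
  writeRef = writeIORef<br />
</haskell><br />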
<br />
=== Titling ===<br />
<br />
For giving titles to things. I know it sounds kind of random. More useful than I first thought. Used in [[Phooey]], [[TV]], and [[Eros]].<br />
<br />
=== Partial values ===<br />
<br />
A monoid of partial values. See the [http://conal.net/blog/posts/a-type-for-partial-values/ teaser] and [http://conal.net/blog/posts/implementing-a-type-for-partial-values/ solution] blog<br />
posts.<br />
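<br />
One simple way to represent such partial values (an illustration, not necessarily the construction from the posts): a partial value is an update function, and the monoid composes updates, with later information winning:<br />
<br />
<haskell><br />
-- apply a partial value to a default to get a total value<br />
newtype Partial a = Partial { withDefault :: a -> a }<br />
<br />
instance Semigroup (Partial a) where<br />
  Partial f <> Partial g = Partial (g . f)  -- right side overrides<br />
<br />
instance Monoid (Partial a) where<br />
  mempty = Partial id  -- "know nothing": keep the default<br />
</haskell><br />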
<br />
=== Context-dependent monoids ===<br />
<br />
Bit of an oddball also. <hask>Data.CxMonoid</hask> defines a sort of meta-monoid that can be supplied dynamically with choices of <hask>mempty</hask> and <hask>mappend</hask>. Used in [[Phooey]] (starting with version 1.3) so that layout could be a monoid but still vary in style.</div>
<hr />
<div>''By [[User:Byorgey|Brent Yorgey]], byorgey@cis.upenn.edu''<br />
<br />
''Originally published 12 March 2009 in [http://www.haskell.org/wikiupload/8/85/TMR-Issue13.pdf issue 13] of [http://themonadreader.wordpress.com/ the Monad.Reader]. Ported to the Haskell wiki in November 2011 by [[User:Geheimdienst|Geheimdienst]].''<br />
<br />
''This is now the official version of the Typeclassopedia and supersedes the version published in the Monad.Reader. Please help update and extend it by editing it yourself or by leaving comments, suggestions, and questions on the [[Talk:Typeclassopedia|talk page]].''<br />
<br />
=Abstract=<br />
<br />
The standard Haskell libraries feature a number of type classes with algebraic or category-theoretic underpinnings. Becoming a fluent Haskell hacker requires intimate familiarity with them all, yet acquiring this familiarity often involves combing through a mountain of tutorials, blog posts, mailing list archives, and IRC logs.<br />
<br />
The goal of this document is to serve as a starting point for the student of Haskell wishing to gain a firm grasp of its standard type classes. The essentials of each type class are introduced, with examples, commentary, and extensive references for further reading.<br />
<br />
=Introduction=<br />
<br />
Have you ever had any of the following thoughts?<br />
* What the heck is a monoid, and how is it different from a mon<u>a</u>d?<br />
<br />
* I finally figured out how to use [[Parsec]] with do-notation, and someone told me I should use something called <code>Applicative</code> instead. Um, what?<br />
<br />
* Someone in the [[IRC channel|#haskell]] IRC channel used <code>(***)</code>, and when I asked Lambdabot to tell me its type, it printed out scary gobbledygook that didn’t even fit on one line! Then someone used <code>fmap fmap fmap</code> and my brain exploded.<br />
<br />
* When I asked how to do something I thought was really complicated, people started typing things like <code>zip.ap fmap.(id &&& wtf)</code> and the scary thing is that they worked! Anyway, I think those people must actually be robots because there’s no way anyone could come up with that in two seconds off the top of their head.<br />
<br />
If you have, look no further! You, too, can write and understand concise, elegant, idiomatic Haskell code with the best of them.<br />
<br />
There are two keys to an expert Haskell hacker’s wisdom:<br />
# Understand the types.<br />
# Gain a deep intuition for each type class and its relationship to other type classes, backed up by familiarity with many examples.<br />
<br />
It’s impossible to overstate the importance of the first; the patient student of type signatures will uncover many profound secrets. Conversely, anyone ignorant of the types in their code is doomed to eternal uncertainty. “Hmm, it doesn’t compile ... maybe I’ll stick in an<br />
<code>fmap</code> here ... nope, let’s see ... maybe I need another <code>(.)</code> somewhere? ... um ...”<br />
<br />
The second key—gaining deep intuition, backed by examples—is also important, but much more difficult to attain. A primary goal of this document is to set you on the road to gaining such intuition. However—<br />
<br />
:''There is no royal road to Haskell. {{h:title|Well, he probably would have said it if he knew Haskell.|—Euclid}}''<br />
<br />
This document can only be a starting point, since good intuition comes from hard work, [http://byorgey.wordpress.com/2009/01/12/abstraction-intuition-and-the-monad-tutorial-fallacy/ not from learning the right metaphor]. Anyone who reads and understands all of it will still have an arduous journey ahead—but sometimes a good starting point makes a big difference.<br />
<br />
It should be noted that this is not a Haskell tutorial; it is assumed that the reader is already familiar with the basics of Haskell, including the standard <code>[http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html Prelude]</code>, the type system, data types, and type classes.<br />
<br />
The type classes we will be discussing and their interrelationships:<br />
<br />
[[Image:Typeclassopedia-diagram.png]]<br />
<br />
{{note|<code>Semigroup</code> can be found in the [http://hackage.haskell.org/package/semigroups <code>semigroups</code> package], <code>Apply</code> in the [http://hackage.haskell.org/package/semigroupoids <code>semigroupoids</code> package], and <code>Comonad</code> in the [http://hackage.haskell.org/package/comonad <code>comonad</code> package].}}<br />
<br />
* <span style="border-bottom: 2px solid black">Solid arrows</span> point from the general to the specific; that is, if there is an arrow from <code>Foo</code> to <code>Bar</code> it means that every <code>Bar</code> is (or should be, or can be made into) a <code>Foo</code>.<br />
* <span style="border-bottom: 2px dotted black">Dotted arrows</span> indicate some other sort of relationship.<br />
* <code>Monad</code> and <code>ArrowApply</code> are equivalent.<br />
* <code>Semigroup</code>, <code>Apply</code> and <code>Comonad</code> are greyed out since they are not actually (yet?) in the standard Haskell libraries {{noteref}}.<br />
<br />
One more note before we begin. The original spelling of “type class” is with two words, as evidenced by, for example, the [http://www.haskell.org/onlinereport/haskell2010/ Haskell 2010 Language Report], early papers on type classes like [http://citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.103.5639 Type classes in Haskell] and [http://research.microsoft.com/en-us/um/people/simonpj/papers/type-class-design-space/ Type classes: exploring the design space], and [http://citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.168.4008 Hudak et al.’s history of Haskell]. However, as often happens with two-word phrases that see a lot of use, it has started to show up as one word (“typeclass”) or, rarely, hyphenated (“type-class”). When wearing my prescriptivist hat, I prefer “type class”, but realize (after changing into my descriptivist hat) that there's probably not much I can do about it.<br />
<br />
We now begin with the simplest type class of all: <code>Functor</code>.<br />
<br />
=Functor=<br />
<br />
The <code>Functor</code> class ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html#t:Functor haddock]) is the most basic and ubiquitous type class in the Haskell libraries. A simple intuition is that a <code>Functor</code> represents a “container” of some sort, along with the ability to apply a function uniformly to every element in the container. For example, a list is a container of elements, and we can apply a function to every element of a list, using <code>map</code>. As another example, a binary tree is also a container of elements, and it’s not hard to come up with a way to recursively apply a function to every element in a tree.<br />
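<br />
For instance, here is one such binary tree together with its element-wise mapping function:<br />
<br />
<haskell><br />
data Tree a = Leaf | Node (Tree a) a (Tree a)<br />
  deriving (Eq, Show)<br />
<br />
-- apply g to every element, keeping the tree's shape<br />
treeMap :: (a -> b) -> Tree a -> Tree b<br />
treeMap _ Leaf         = Leaf<br />
treeMap g (Node l x r) = Node (treeMap g l) (g x) (treeMap g r)<br />
</haskell><br />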
<br />
Another intuition is that a <code>Functor</code> represents some sort of “computational context”. This intuition is generally more useful, but is more difficult to explain, precisely because it is so general. Some examples later should help to clarify the <code>Functor</code>-as-context point of view.<br />
<br />
In the end, however, a <code>Functor</code> is simply what it is defined to be; doubtless there are many examples of <code>Functor</code> instances that don’t exactly fit either of the above intuitions. The wise student will focus their attention on definitions and examples, without leaning too heavily on any particular metaphor. Intuition will come, in time, on its own.<br />
<br />
==Definition==<br />
<br />
Here is the type class declaration for <code>Functor</code>:<br />
<br />
<haskell><br />
class Functor f where<br />
fmap :: (a -> b) -> f a -> f b<br />
</haskell><br />
<br />
<code>Functor</code> is exported by the <code>Prelude</code>, so no special imports are needed to use it.<br />
<br />
First, the <code>f a</code> and <code>f b</code> in the type signature for <code>fmap</code> tell us that <code>f</code> isn’t just a type; it is a ''type constructor'' which takes another type as a parameter. (A more precise way to say this is that the ''kind'' of <code>f</code> must be <code>* -> *</code>.) For example, <code>Maybe</code> is such a type constructor: <code>Maybe</code> is not a type in and of itself, but requires another type as a parameter, like <code>Maybe Integer</code>. So it would not make sense to say <code>instance Functor Integer</code>, but it could make sense to say <code>instance Functor Maybe</code>.<br />
<br />
Now look at the type of <code>fmap</code>: it takes any function from <code>a</code> to <code>b</code>, and a value of type <code>f a</code>, and outputs a value of type <code>f b</code>. From the container point of view, the intention is that <code>fmap</code> applies a function to each element of a container, without altering the structure of the container. From the context point of view, the intention is that <code>fmap</code> applies a function to a value without altering its context. Let’s look at a few specific examples.<br />
<br />
==Instances==<br />
<br />
{{note|Recall that <code>[]</code> has two meanings in Haskell: it can either stand for the empty list, or, as here, it can represent the list type constructor (pronounced “list-of”). In other words, the type <code>[a]</code> (list-of-<code>a</code>) can also be written <code>[] a</code>.}}<br />
<br />
{{note|You might ask why we need a separate <code>map</code> function. Why not just do away with the current list-only <code>map</code> function, and rename <code>fmap</code> to <code>map</code> instead? Well, that’s a good question. The usual argument is that someone just learning Haskell, when using <code>map</code> incorrectly, would much rather see an error about lists than about <code>Functor</code>s.}}<br />
<br />
As noted before, the list constructor <code>[]</code> is a functor {{noteref}}; we can use the standard list function <code>map</code> to apply a function to each element of a list {{noteref}}. The <code>Maybe</code> type constructor is also a functor, representing a container which might hold a single element. The function <code>fmap g</code> has no effect on <code>Nothing</code> (there are no elements to which <code>g</code> can be applied), and simply applies <code>g</code> to the single element inside a <code>Just</code>. Alternatively, under the context interpretation, the list functor represents a context of nondeterministic choice; that is, a list can be thought of as representing a single value which is nondeterministically chosen from among several possibilities (the elements of the list). Likewise, the <code>Maybe</code> functor represents a context with possible failure. These instances are:<br />
<br />
<haskell><br />
instance Functor [] where<br />
fmap _ [] = []<br />
fmap g (x:xs) = g x : fmap g xs<br />
-- or we could just say fmap = map<br />
<br />
instance Functor Maybe where<br />
fmap _ Nothing = Nothing<br />
fmap g (Just a) = Just (g a)<br />
</haskell><br />
<br />
As an aside, in idiomatic Haskell code you will often see the letter <code>f</code> used to stand for both an arbitrary <code>Functor</code> and an arbitrary function. In this document, <code>f</code> represents only <code>Functor</code>s, and <code>g</code> or <code>h</code> always represent functions, but you should be aware of the potential confusion. In practice, what <code>f</code> stands for should always be clear from the context, by noting whether it is part of a type or part of the code.<br />
<br />
There are other <code>Functor</code> instances in the standard libraries; below are a few. Note that some of these instances are not exported by the <code>Prelude</code>; to access them, you can import <code>Control.Monad.Instances</code>.<br />
<br />
* <code>Either e</code> is an instance of <code>Functor</code>; <code>Either e a</code> represents a container which can contain either a value of type <code>a</code>, or a value of type <code>e</code> (often representing some sort of error condition). It is similar to <code>Maybe</code> in that it represents possible failure, but it can carry some extra information about the failure as well.<br />
<br />
* <code>((,) e)</code> represents a container which holds an “annotation” of type <code>e</code> along with the actual value it holds. It might be clearer to write it as <code>(e,)</code>, by analogy with an operator section like <code>(1+)</code>, but that syntax is not allowed in types (although it is allowed in expressions with the <code>TupleSections</code> extension enabled). However, you can certainly ''think'' of it as <code>(e,)</code>.<br />
<br />
* <code>((->) e)</code> (which can be thought of as <code>(e ->)</code>; see above), the type of functions which take a value of type <code>e</code> as a parameter, is a <code>Functor</code>. As a container, <code>(e -> a)</code> represents a (possibly infinite) set of values of <code>a</code>, indexed by values of <code>e</code>. Alternatively, and more usefully, <code>((->) e)</code> can be thought of as a context in which a value of type <code>e</code> is available to be consulted in a read-only fashion. This is also why <code>((->) e)</code> is sometimes referred to as the ''reader monad''; more on this later.<br />
<br />
* <code>IO</code> is a <code>Functor</code>; a value of type <code>IO a</code> represents a computation producing a value of type <code>a</code> which may have I/O effects. If <code>m</code> computes the value <code>x</code> while producing some I/O effects, then <code>fmap g m</code> will compute the value <code>g x</code> while producing the same I/O effects.<br />
<br />
* Many standard types from the [http://hackage.haskell.org/package/containers/ containers library] (such as <code>Tree</code>, <code>Map</code>, and <code>Sequence</code>) are instances of <code>Functor</code>. A notable exception is <code>Set</code>, which cannot be made a <code>Functor</code> in Haskell (although it is certainly a mathematical functor) since it requires an <code>Ord</code> constraint on its elements; <code>fmap</code> must be applicable to ''any'' types <code>a</code> and <code>b</code>. However, <code>Set</code> (and other similarly restricted data types) can be made an instance of a suitable generalization of <code>Functor</code>, either by [http://article.gmane.org/gmane.comp.lang.haskell.cafe/78052/ making <code>a</code> and <code>b</code> arguments to the <code>Functor</code> type class themselves], or by adding an [http://blog.omega-prime.co.uk/?p=127 associated constraint].<br />
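<br />
A few of these instances in action (with current versions of GHC, all of them are in scope from the <code>Prelude</code> alone):<br />
<br />
<haskell><br />
fmap (+1) (Right 3 :: Either String Int)    -- Right 4<br />
fmap (+1) (Left "err" :: Either String Int) -- Left "err"<br />
fmap (*2) (1, 10)                           -- (1,20): the annotation is untouched<br />
fmap (+1) (subtract 1) 41                   -- 41: fmap is composition for ((->) e)<br />
</haskell><br />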
<br />
{{Exercises|<br />
<ol><br />
<li>Implement <code>Functor</code> instances for <code>Either e</code> and <code>((->) e)</code>.</li><br />
<li>Implement <code>Functor</code> instances for <code>((,) e)</code> and for <code>Pair</code>, defined as <br />
<br />
<haskell>data Pair a = Pair a a</haskell><br />
<br />
Explain their similarities and differences.<br />
</li><br />
<li>Implement a <code>Functor</code> instance for the type <code>ITree</code>, defined as<br />
<br />
<haskell><br />
data ITree a = Leaf (Int -> a) <br />
| Node [ITree a]<br />
</haskell><br />
</li><br />
<li>Give an example of a type of kind <code>* -> *</code> which cannot be made an instance of <code>Functor</code> (without using <code>undefined</code>).<br />
</li><br />
<li>Is this statement true or false? <br />
<br />
:''The composition of two <code>Functor</code>s is also a <code>Functor</code>.''<br />
<br />
If false, give a counterexample; if true, prove it by exhibiting some appropriate Haskell code.<br />
</li><br />
</ol><br />
}}<br />
<br />
==Laws==<br />
<br />
As far as the Haskell language itself is concerned, the only requirement to be a <code>Functor</code> is an implementation of <code>fmap</code> with the proper type. Any sensible <code>Functor</code> instance, however, will also satisfy the ''functor laws'', which are part of the definition of a mathematical functor. There are two:<br />
<br />
<haskell><br />
fmap id = id<br />
fmap (g . h) = (fmap g) . (fmap h)<br />
</haskell><br />
<br />
{{note|Technically, these laws make <code>f</code> and <code>fmap</code> together an endofunctor on ''Hask'', the category of Haskell types (ignoring [[Bottom|&perp;]], which is a party pooper). See [http://en.wikibooks.org/wiki/Haskell/Category_theory Wikibook: Category theory].}}<br />
<br />
Together, these laws ensure that <code>fmap g</code> does not change the ''structure'' of a container, only the elements. Equivalently, and more simply, they ensure that <code>fmap g</code> changes a value without altering its context {{noteref}}.<br />
<br />
The first law says that mapping the identity function over every item in a container has no effect. The second says that mapping a composition of two functions over every item in a container is the same as first mapping one function, and then mapping the other.<br />
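<br />
The laws can be spot-checked at particular instances and values (examples, of course, not a proof):<br />
<br />
<haskell><br />
fmap id (Just 3) == id (Just 3)                               -- True<br />
fmap ((+1) . (*2)) [1,2,3] == (fmap (+1) . fmap (*2)) [1,2,3] -- True<br />
</haskell><br />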
<br />
As an example, the following code is a “valid” instance of <code>Functor</code> (it typechecks), but it violates the functor laws. Do you see why?<br />
<br />
<haskell><br />
-- Evil Functor instance<br />
instance Functor [] where<br />
fmap _ [] = []<br />
fmap g (x:xs) = g x : g x : fmap g xs<br />
</haskell><br />
<br />
Any Haskeller worth their salt would reject this code as a gruesome abomination.<br />
<br />
Unlike some other type classes we will encounter, a given type has at most one valid instance of <code>Functor</code>. This [http://article.gmane.org/gmane.comp.lang.haskell.libraries/15384 can be proven] via the [http://homepages.inf.ed.ac.uk/wadler/topics/parametricity.html#free ''free theorem''] for the type of <code>fmap</code>. In fact, [http://byorgey.wordpress.com/2010/03/03/deriving-pleasure-from-ghc-6-12-1/ GHC can automatically derive] <code>Functor</code> instances for many data types.<br />
<br />
{{note|Actually, if <code>seq</code>/<code>undefined</code> are considered, it [http://stackoverflow.com/a/8323243/305559 is possible] to have an implementation which satisfies the first law but not the second. The rest of the comments in this section should be considered in a context where <code>seq</code> and <code>undefined</code> are excluded.}}<br />
<br />
A [https://github.com/quchen/articles/blob/master/second_functor_law.md similar argument also shows] that any <code>Functor</code> instance satisfying the first law (<code>fmap id = id</code>) will automatically satisfy the second law as well. Practically, this means that only the first law needs to be checked (usually by a very straightforward induction) to ensure that a <code>Functor</code> instance is valid.{{noteref}}<br />
<br />
{{Exercises|<br />
# Although it is not possible for a <code>Functor</code> instance to satisfy the first <code>Functor</code> law but not the second (excluding <code>undefined</code>), the reverse is possible. Give an example of a (bogus) <code>Functor</code> instance which satisfies the second law but not the first.<br />
# Which laws are violated by the evil <code>Functor</code> instance for list shown above: both laws, or the first law alone? Give specific counterexamples.<br />
}}<br />
<br />
==Intuition==<br />
<br />
There are two fundamental ways to think about <code>fmap</code>. The first has already been mentioned: it takes two parameters, a function and a container, and applies the function “inside” the container, producing a new container. Alternately, we can think of <code>fmap</code> as applying a function to a value in a context (without altering the context).<br />
<br />
Just like all other Haskell functions of “more than one parameter”, however, <code>fmap</code> is actually ''curried'': it does not really take two parameters, but takes a single parameter and returns a function. For emphasis, we can write <code>fmap</code>’s type with extra parentheses: <code>fmap :: (a -> b) -> (f a -> f b)</code>. Written in this form, it is apparent that <code>fmap</code> transforms a “normal” function (<code>g :: a -> b</code>) into one which operates over containers/contexts (<code>fmap g :: f a -> f b</code>). This transformation is often referred to as a ''lift''; <code>fmap</code> “lifts” a function from the “normal world” into the “<code>f</code> world”.<br />
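<br />
For example, lifting a function over <code>Int</code>s gives one that works inside any <code>Functor</code>:<br />
<br />
<haskell><br />
inc :: Int -> Int<br />
inc = (+ 1)<br />
<br />
-- fmap lifts inc from Int -> Int to f Int -> f Int<br />
liftedInc :: Functor f => f Int -> f Int<br />
liftedInc = fmap inc<br />
</haskell><br />
<br />
so that <hask>liftedInc (Just 1)</hask> yields <hask>Just 2</hask>, and <hask>liftedInc [1,2,3]</hask> yields <hask>[2,3,4]</hask>.<br />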
<br />
==Further reading==<br />
<br />
A good starting point for reading about the category theory behind the concept of a functor is the excellent [http://en.wikibooks.org/wiki/Haskell/Category_theory Haskell wikibook page on category theory].<br />
<br />
=Applicative=<br />
<br />
A somewhat newer addition to the pantheon of standard Haskell type classes, ''applicative functors'' represent an abstraction lying in between <code>Functor</code> and <code>Monad</code> in expressivity, first described by McBride and Paterson. The title of their classic paper, [http://www.soi.city.ac.uk/~ross/papers/Applicative.html Applicative Programming with Effects], gives a hint at the intended intuition behind the [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Applicative.html <code>Applicative</code>] type class. It encapsulates certain sorts of “effectful” computations in a functionally pure way, and encourages an “applicative” programming style. Exactly what these things mean will be seen later.<br />
<br />
==Definition==<br />
<br />
Recall that <code>Functor</code> allows us to lift a “normal” function to a function on computational contexts. But <code>fmap</code> doesn’t allow us to apply a function which is itself in a context to a value in a context. <code>Applicative</code> gives us just such a tool, <code>(<*>)</code>. It also provides a method, <code>pure</code>, for embedding values in a default, “effect free” context. Here is the type class declaration for <code>Applicative</code>, as defined in <code>Control.Applicative</code>:<br />
<br />
<haskell><br />
class Functor f => Applicative f where<br />
pure :: a -> f a<br />
(<*>) :: f (a -> b) -> f a -> f b<br />
</haskell><br />
<br />
Note that every <code>Applicative</code> must also be a <code>Functor</code>. In fact, as we will see, <code>fmap</code> can be implemented using the <code>Applicative</code> methods, so every <code>Applicative</code> is a functor whether we like it or not; the <code>Functor</code> constraint forces us to be honest.<br />
<br />
{{note|Recall that <code>($)</code> is just function application: <code>f $ x {{=}} f x</code>.}}<br />
<br />
As always, it’s crucial to understand the type signatures. First, consider <code>(<*>)</code>: the best way of thinking about it comes from noting that the type of <code>(<*>)</code> is similar to the type of <code>($)</code> {{noteref}}, but with everything enclosed in an <code>f</code>. In other words, <code>(<*>)</code> is just function application within a computational context. The type of <code>(<*>)</code> is also very similar to the type of <code>fmap</code>; the only difference is that the first parameter is <code>f (a -> b)</code>, a function in a context, instead of a “normal” function <code>(a -> b)</code>.<br />
<br />
<code>pure</code> takes a value of any type <code>a</code>, and returns a context/container of type <code>f a</code>. The intention is that <code>pure</code> creates some sort of “default” container or “effect free” context. In fact, the behavior of <code>pure</code> is quite constrained by the laws it should satisfy in conjunction with <code>(<*>)</code>. Usually, for a given implementation of <code>(<*>)</code> there is only one possible implementation of <code>pure</code>.<br />
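<br />
For instance, with the standard <code>Maybe</code> instance of <code>Applicative</code>:<br />
<br />
<haskell><br />
Just (+3) <*> Just 4           -- Just 7<br />
Just (+3) <*> Nothing          -- Nothing<br />
pure (+) <*> Just 2 <*> Just 3 -- Just 5<br />
</haskell><br />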
<br />
(Note that previous versions of the Typeclassopedia explained <code>pure</code> in terms of a type class <code>Pointed</code>, which can still be found in the [http://hackage.haskell.org/package/pointed <code>pointed</code> package]. However, the current consensus is that <code>Pointed</code> is not very useful after all. For a more detailed explanation, see [[Why not Pointed?]])<br />
<br />
==Laws==<br />
<br />
{{note|See<br />
[http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Applicative.html haddock for Applicative] and [http://www.soi.city.ac.uk/~ross/papers/Applicative.html Applicative programming with effects]}}<br />
<br />
Traditionally, there are four laws that <code>Applicative</code> instances should satisfy {{noteref}}. In some sense, they are all concerned with making sure that <code>pure</code> deserves its name:<br />
<br />
* The identity law:<br /><haskell>pure id <*> v = v</haskell><br />
* Homomorphism:<br /><haskell>pure f <*> pure x = pure (f x)</haskell>Intuitively, applying a non-effectful function to a non-effectful argument in an effectful context is the same as just applying the function to the argument and then injecting the result into the context with <code>pure</code>.<br />
* Interchange:<br /><haskell>u <*> pure y = pure ($ y) <*> u</haskell>Intuitively, this says that when evaluating the application of an effectful function to a pure argument, the order in which we evaluate the function and its argument doesn't matter.<br />
* Composition:<br /><haskell>u <*> (v <*> w) = pure (.) <*> u <*> v <*> w </haskell>This one is the trickiest law to gain intuition for. In some sense it is expressing a sort of associativity property of <code>(<*>)</code>. The reader may wish to simply convince themselves that this law is type-correct.<br />
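<br />
Like the functor laws, these can be spot-checked at particular instances and values; for example, with <code>Maybe</code>:<br />
<br />
<haskell><br />
(pure (+1) <*> pure 3) == (pure ((+1) 3) :: Maybe Int) -- True (homomorphism)<br />
(Just (+1) <*> pure 3) == (pure ($ 3) <*> Just (+1))   -- True (interchange)<br />
</haskell><br />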
<br />
Considered as left-to-right rewrite rules, the homomorphism, interchange, and composition laws actually constitute an algorithm for transforming any expression using <code>pure</code> and <code>(<*>)</code> into a canonical form with only a single use of <code>pure</code> at the very beginning and only left-nested occurrences of <code>(<*>)</code>. Composition allows reassociating <code>(<*>)</code>; interchange allows moving occurrences of <code>pure</code> leftwards; and homomorphism allows collapsing multiple adjacent occurrences of <code>pure</code> into one.<br />
<br />
There is also a law specifying how <code>Applicative</code> should relate to <code>Functor</code>:<br />
<br />
<haskell><br />
fmap g x = pure g <*> x<br />
</haskell><br />
<br />
It says that mapping a pure function <code>g</code> over a context <code>x</code> is the same as first injecting <code>g</code> into a context with <code>pure</code>, and then applying it to <code>x</code> with <code>(<*>)</code>. In other words, we can decompose <code>fmap</code> into two more atomic operations: injection into a context, and application within a context. The <code>Control.Applicative</code> module also defines <code>(<$>)</code> as a synonym for <code>fmap</code>, so the above law can also be expressed as:<br />
<br />
<code>g <$> x = pure g <*> x</code>.<br />
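<br />
For example, at the <code>Maybe</code> instance:<br />
<br />
<haskell><br />
((+1) <$> Just 2) == (pure (+1) <*> Just 2)   -- True<br />
</haskell><br />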
<br />
{{Exercises|<br />
# (Tricky) One might imagine a variant of the interchange law that says something about applying a pure function to an effectful argument. Using the above laws, prove that<haskell>pure f <*> x = pure (flip ($)) <*> x <*> pure f</haskell><br />
}}<br />
<br />
==Instances==<br />
<br />
Most of the standard types which are instances of <code>Functor</code> are also instances of <code>Applicative</code>.<br />
<br />
<code>Maybe</code> can easily be made an instance of <code>Applicative</code>; writing such an instance is left as an exercise for the reader.<br />
<br />
The list type constructor <code>[]</code> can actually be made an instance of <code>Applicative</code> in two ways; essentially, it comes down to whether we want to think of lists as ordered collections of elements, or as contexts representing multiple results of a nondeterministic computation (see Wadler’s [http://www.springerlink.com/content/y7450255v2670167/ How to replace failure by a list of successes]).<br />
<br />
Let’s first consider the collection point of view. Since there can only be one instance of a given type class for any particular type, one or both of the list instances of <code>Applicative</code> need to be defined for a <code>newtype</code> wrapper; as it happens, the nondeterministic computation instance is the default, and the collection instance is defined in terms of a <code>newtype</code> called <code>ZipList</code>. This instance is:<br />
<br />
<haskell><br />
newtype ZipList a = ZipList { getZipList :: [a] }<br />
<br />
instance Applicative ZipList where<br />
pure = undefined -- exercise<br />
(ZipList gs) <*> (ZipList xs) = ZipList (zipWith ($) gs xs)<br />
</haskell><br />
<br />
To apply a list of functions to a list of inputs with <code>(<*>)</code>, we just match up the functions and inputs elementwise, and produce a list of the resulting outputs. In other words, we “zip” the lists together with function application, <code>($)</code>; hence the name <code>ZipList</code>. <br />
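<br />
Using the <code>ZipList</code> type exported by <code>Control.Applicative</code> (whose <code>pure</code> is the subject of the exercise below), the elementwise behavior looks like this:<br />

```haskell
import Control.Applicative (ZipList (..))

-- Elementwise application: functions and arguments are matched up by position.
zipped :: [Int]
zipped = getZipList (ZipList [(+ 1), (* 2), subtract 3] <*> ZipList [10, 20, 30])
```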
<br />
The other <code>Applicative</code> instance for lists, based on the nondeterministic computation point of view, is:<br />
<br />
<haskell><br />
instance Applicative [] where<br />
pure x = [x]<br />
gs <*> xs = [ g x | g <- gs, x <- xs ]<br />
</haskell><br />
<br />
Instead of applying functions to inputs pairwise, we apply each function to all the inputs in turn, and collect all the results in a list.<br />
<br />
Now we can write nondeterministic computations in a natural style. To add the numbers <code>3</code> and <code>4</code> deterministically, we can of course write <code>(+) 3 4</code>. But suppose instead of <code>3</code> we have a nondeterministic computation that might result in <code>2</code>, <code>3</code>, or <code>4</code>; then we can write<br />
<br />
<haskell><br />
pure (+) <*> [2,3,4] <*> pure 4<br />
</haskell><br />
<br />
or, more idiomatically,<br />
<br />
<haskell><br />
(+) <$> [2,3,4] <*> pure 4<br />
</haskell><br />
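<br />
Evaluating these expressions with the list instance yields one result per choice of the nondeterministic argument; with two nondeterministic arguments we would get all combinations:<br />

```haskell
-- One result per choice of the nondeterministic argument:
sums :: [Int]
sums = (+) <$> [2, 3, 4] <*> pure 4

-- Two nondeterministic arguments give all combinations:
allSums :: [Int]
allSums = (+) <$> [2, 3] <*> [10, 20]
```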
<br />
There are several other <code>Applicative</code> instances as well:<br />
<br />
* <code>IO</code> is an instance of <code>Applicative</code>, and behaves exactly as you would think: to execute <code>m1 <*> m2</code>, first <code>m1</code> is executed, resulting in a function <code>f</code>, then <code>m2</code> is executed, resulting in a value <code>x</code>, and finally the value <code>f x</code> is returned as the result of executing <code>m1 <*> m2</code>.<br />
<br />
* <code>((,) a)</code> is an <code>Applicative</code>, as long as <code>a</code> is an instance of <code>Monoid</code> ([[#Monoid|section Monoid]]). The <code>a</code> values are accumulated in parallel with the computation.<br />
<br />
* The <code>Control.Applicative</code> module defines the <code>Const</code> type constructor; a value of type <code>Const a b</code> simply contains an <code>a</code>. This is an instance of <code>Applicative</code> for any <code>Monoid a</code>; this instance becomes especially useful in conjunction with things like <code>Foldable</code> ([[#Foldable|section Foldable]]).<br />
<br />
* The <code>WrappedMonad</code> and <code>WrappedArrow</code> newtypes make any instances of <code>Monad</code> ([[#Monad|section Monad]]) or <code>Arrow</code> ([[#Arrow|section Arrow]]) respectively into instances of <code>Applicative</code>; as we will see when we study those type classes, both are strictly more expressive than <code>Applicative</code>, in the sense that the <code>Applicative</code> methods can be implemented in terms of their methods.<br />
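<br />
To illustrate the pair and <code>Const</code> instances above, a small sketch (with arbitrarily chosen <code>String</code> annotations):<br />

```haskell
import Data.Functor.Const (Const (..))

-- ((,) String): the annotations are combined with the Monoid, alongside the result.
pairExample :: (String, Int)
pairExample = ("log1 ", (+ 1)) <*> ("log2 ", 5)

-- Const: the "function" and "argument" are ignored; only the annotations remain.
constExample :: Const String Int
constExample = Const "foo" <*> Const "bar"
```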
<br />
{{Exercises|<br />
# Implement an instance of <code>Applicative</code> for <code>Maybe</code>.<br />
# Determine the correct definition of <code>pure</code> for the <code>ZipList</code> instance of <code>Applicative</code>—there is only one implementation that satisfies the law relating <code>pure</code> and <code>(<*>)</code>.<br />
}}<br />
<br />
==Intuition==<br />
<br />
McBride and Paterson’s paper introduces the notation <math>[[g \; x_1 \; x_2 \; \cdots \; x_n]]\ </math> to denote function application in a computational context. If each <math>x_i\ </math> has type <math>f \; t_i\ </math> for some applicative functor <math>f\ </math>, and <math>g\ </math> has type <math>t_1 \to t_2 \to \dots \to t_n \to t\ </math>, then the entire expression <math>[[g \; x_1 \; \cdots \; x_n]]\ </math> has type <math>f \; t\ </math>. You can think of this as applying a function to multiple “effectful” arguments. In this sense, the double bracket notation is a generalization of <code>fmap</code>, which allows us to apply a function to a single argument in a context.<br />
<br />
Why do we need <code>Applicative</code> to implement this generalization of <code>fmap</code>? Suppose we use <code>fmap</code> to apply <code>g</code> to the first parameter <code>x1</code>. Then we get something of type <code>f (t2 -> ... -> t)</code>, but now we are stuck: we can’t apply this function-in-a-context to the next argument with <code>fmap</code>. However, this is precisely what <code>(<*>)</code> allows us to do.<br />
<br />
This suggests the proper translation of the idealized notation <math>[[g \; x_1 \; x_2 \; \cdots \; x_n]]\ </math> into Haskell, namely<br />
<haskell><br />
g <$> x1 <*> x2 <*> ... <*> xn<br />
</haskell><br />
<br />
recalling that <code>Control.Applicative</code> defines <code>(<$>)</code> as convenient infix shorthand for <code>fmap</code>. This is what is meant by an “applicative style”—effectful computations can still be described in terms of function application; the only difference is that we have to use the special operator <code>(<*>)</code> for application instead of simple juxtaposition.<br />
<br />
Note that <code>pure</code> allows embedding “non-effectful” arguments in the middle of an idiomatic application, like<br />
<haskell><br />
g <$> x1 <*> pure x2 <*> x3<br />
</haskell><br />
which has type <code>f d</code>, given<br />
<haskell><br />
g :: a -> b -> c -> d<br />
x1 :: f a<br />
x2 :: b<br />
x3 :: f c<br />
</haskell><br />
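<br />
Instantiating this pattern at concrete types, with <code>f = Maybe</code> and arbitrarily chosen values:<br />

```haskell
-- A concrete instantiation of the schema above, with f = Maybe:
g :: Int -> String -> Bool -> (Int, String, Bool)
g a b c = (a, b, c)

result :: Maybe (Int, String, Bool)
result = g <$> Just 1 <*> pure "two" <*> Just True
```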
<br />
The double brackets are commonly known as “idiom brackets”, because they allow writing “idiomatic” function application, that is, function application that looks normal but has some special, non-standard meaning (determined by the particular instance of <code>Applicative</code> being used). Idiom brackets are not supported by GHC, but they are supported by the [http://personal.cis.strath.ac.uk/~conor/pub/she/ Strathclyde Haskell Enhancement], a preprocessor which (among many other things) translates idiom brackets into standard uses of <code>(<$>)</code> and <code>(<*>)</code>. This can result in much more readable code when making heavy use of <code>Applicative</code>.<br />
<br />
==Alternative formulation==<br />
<br />
An alternative, equivalent formulation of <code>Applicative</code> is given by<br />
<br />
<haskell><br />
class Functor f => Monoidal f where<br />
unit :: f ()<br />
(**) :: f a -> f b -> f (a,b)<br />
</haskell><br />
<br />
{{note|In category-theory speak, we say <code>f</code> is a ''lax'' monoidal functor because there aren't necessarily functions in the other direction, like <code>f (a, b) -> (f a, f b)</code>.}}<br />
Intuitively, this states that a <i>monoidal</i> functor{{noteref}} is one which has some sort of "default shape" and which supports some sort of "combining" operation. <code>pure</code> and <code>(<*>)</code> are equivalent in power to <code>unit</code> and <code>(**)</code> (see the Exercises below). More technically, the idea is that <code>f</code> preserves the "monoidal structure" given by the pairing constructor <code>(,)</code> and unit type <code>()</code>. This can be seen even more clearly if we rewrite the types of <code>unit</code> and <code>(**)</code> as<br />
<haskell><br />
unit' :: () -> f ()<br />
(**') :: (f a, f b) -> f (a, b)<br />
</haskell><br />
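<br />
As a sketch of what an instance might look like, here is <code>Monoidal</code> for <code>Maybe</code>; note that a real module must hide the Prelude's numeric <code>(**)</code> to reuse the name:<br />

```haskell
import Prelude hiding ((**))

class Functor f => Monoidal f where
  unit :: f ()
  (**) :: f a -> f b -> f (a, b)

-- A sketch of an instance for Maybe: pair up the results when both are present.
instance Monoidal Maybe where
  unit = Just ()
  Just a ** Just b = Just (a, b)
  _      ** _      = Nothing
```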
<br />
Furthermore, to deserve the name "monoidal" (see the [[#Monoid|section on Monoids]]), instances of <code>Monoidal</code> ought to satisfy the following laws, which seem much more straightforward than the traditional <code>Applicative</code> laws:<br />
<br />
{{note|In this and the following laws, <code>≅</code> refers to <i>isomorphism</i> rather than equality. In particular we consider <code>(x,()) ≅ x ≅ ((),x)</code> and <code>((x,y),z) ≅ (x,(y,z))</code>.}}<br />
* Left identity{{noteref}}: <haskell>unit ** v ≅ v</haskell><br />
* Right identity: <haskell>u ** unit ≅ u</haskell><br />
* Associativity: <haskell>u ** (v ** w) ≅ (u ** v) ** w</haskell><br />
<br />
These turn out to be equivalent to the usual <code>Applicative</code> laws. In a category theory setting, one would also require a naturality law:<br />
<br />
{{note|Here <code>g *** h {{=}} \(x,y) -> (g x, h y)</code>. See [[#Arrow|Arrows]].}}<br />
* Naturality: <haskell>fmap (g *** h) (u ** v) = fmap g u ** fmap h v</haskell><br />
<br />
but in the context of Haskell, this is a free theorem.<br />
<br />
Much of this section was taken from [http://blog.ezyang.com/2012/08/applicative-functors/ a blog post by Edward Z. Yang]; see his actual post for a bit more information.<br />
<br />
{{Exercises|<br />
# Implement <code>pure</code> and <code>(<*>)</code> in terms of <code>unit</code> and <code>(**)</code>, and vice versa.<br />
# Are there any <code>Applicative</code> instances for which there are also functions <code>f () -> ()</code> and <code>f (a,b) -> (f a, f b)</code>, satisfying some "reasonable" laws?<br />
# (Tricky) Prove that given your implementations from the previous exercise, the usual <code>Applicative</code> laws and the <code>Monoidal</code> laws stated above are equivalent.<br />
}}<br />
<br />
==Further reading==<br />
<br />
There are many other useful combinators in the standard libraries implemented in terms of <code>pure</code> and <code>(<*>)</code>: for example, <code>(*>)</code>, <code>(<*)</code>, <code>(<**>)</code>, <code>(<$)</code>, and so on (see [http://hackage.haskell.org/package/base/docs/Control-Applicative.html haddock for Applicative]). Judicious use of such secondary combinators can often make code using <code>Applicative</code>s much easier to read.<br />
<br />
[http://www.soi.city.ac.uk/~ross/papers/Applicative.html McBride and Paterson’s original paper] is a treasure-trove of information and examples, as well as some perspectives on the connection between <code>Applicative</code> and category theory. Beginners will find it difficult to make it through the entire paper, but it is extremely well-motivated—even beginners will be able to glean something from reading as far as they are able.<br />
<br />
{{note|Introduced by [http://conal.net/papers/simply-reactive/ an earlier paper] that was since superseded by [http://conal.net/papers/push-pull-frp/ Push-pull functional reactive programming].}}<br />
<br />
Conal Elliott has been one of the biggest proponents of <code>Applicative</code>. For example, the [http://conal.net/papers/functional-images/ Pan library for functional images] and the reactive library for functional reactive programming (FRP) {{noteref}} make key use of it; his blog also contains [http://conal.net/blog/tag/applicative-functor many examples of <code>Applicative</code> in action]. Building on the work of McBride and Paterson, Elliott also built the [[TypeCompose]] library, which embodies the observation (among others) that <code>Applicative</code> types are closed under composition; therefore, <code>Applicative</code> instances can often be automatically derived for complex types built out of simpler ones.<br />
<br />
Although the [http://hackage.haskell.org/package/parsec Parsec parsing library] ([http://legacy.cs.uu.nl/daan/download/papers/parsec-paper.pdf paper]) was originally designed for use as a monad, in its most common use cases an <code>Applicative</code> instance can be used to great effect; [http://www.serpentine.com/blog/2008/02/06/the-basics-of-applicative-functors-put-to-practical-work/ Bryan O’Sullivan’s blog post] is a good starting point. If the extra power provided by <code>Monad</code> isn’t needed, it’s usually a good idea to use <code>Applicative</code> instead.<br />
<br />
A couple other nice examples of <code>Applicative</code> in action include the [http://web.archive.org/web/20090416111947/chrisdone.com/blog/html/2009-02-10-applicative-configfile-hsql.html ConfigFile and HSQL libraries] and the [http://groups.inf.ed.ac.uk/links/formlets/ formlets library].<br />
<br />
Gershom Bazerman's [http://comonad.com/reader/2012/abstracting-with-applicatives/ post] contains many insights into applicatives.<br />
<br />
=Monad=<br />
<br />
It’s a safe bet that if you’re reading this, you’ve heard of monads—although it’s quite possible you’ve never heard of <code>Applicative</code> before, or <code>Arrow</code>, or even <code>Monoid</code>. Why are monads such a big deal in Haskell? There are several reasons.<br />
<br />
* Haskell does, in fact, single out monads for special attention by making them the framework in which to construct I/O operations.<br />
* Haskell also singles out monads for special attention by providing a special syntactic sugar for monadic expressions: the <code>do</code>-notation.<br />
* <code>Monad</code> has been around longer than other abstract models of computation such as <code>Applicative</code> or <code>Arrow</code>.<br />
* The more monad tutorials there are, the harder people think monads must be, and the more new monad tutorials are written by people who think they finally “get” monads (the [http://byorgey.wordpress.com/2009/01/12/abstraction-intuition-and-the-monad-tutorial-fallacy/ monad tutorial fallacy]).<br />
<br />
I will let you judge for yourself whether these are good reasons.<br />
<br />
In the end, despite all the hoopla, <code>Monad</code> is just another type class. Let’s take a look at its definition.<br />
<br />
==Definition==<br />
<br />
The type class declaration for [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html#t:Monad <code>Monad</code>] is:<br />
<br />
<haskell><br />
class Monad m where<br />
return :: a -> m a<br />
(>>=) :: m a -> (a -> m b) -> m b<br />
(>>) :: m a -> m b -> m b<br />
m >> n = m >>= \_ -> n<br />
<br />
fail :: String -> m a<br />
</haskell><br />
<br />
The <code>Monad</code> type class is exported by the <code>Prelude</code>, along with a few standard instances. However, many utility functions are found in [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad.html <code>Control.Monad</code>], and there are also several instances (such as <code>((->) e)</code>) defined in [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad-Instances.html <code>Control.Monad.Instances</code>].<br />
<br />
{{note|However, as of GHC 7.10 this will be fixed!}}<br />
Let’s examine the methods in the <code>Monad</code> class one by one. The type of <code>return</code> should look familiar; it’s the same as <code>pure</code>. Indeed, <code>return</code> ''is'' <code>pure</code>, but with an unfortunate name. (Unfortunate, since someone coming from an imperative programming background might think that <code>return</code> is like the C or Java keyword of the same name, when in fact the similarities are minimal.) From a mathematical point of view, every monad is an applicative functor, but for historical reasons, the <code>Monad</code> type class declaration unfortunately does not require this.{{noteref}}<br />
<br />
We can see that <code>(>>)</code> is a specialized version of <code>(>>=)</code>, with a default implementation given. It is only included in the type class declaration so that specific instances of <code>Monad</code> can override the default implementation of <code>(>>)</code> with a more efficient one, if desired. Also, note that although <code>_ >> n = n</code> would be a type-correct implementation of <code>(>>)</code>, it would not correspond to the intended semantics: the intention is that <code>m >> n</code> ignores the ''result'' of <code>m</code>, but not its ''effects''.<br />
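<br />
The distinction is visible already in the <code>Maybe</code> monad, where the failure effect of the first computation propagates even though its result is discarded:<br />

```haskell
-- With Maybe: (>>) discards the first result but keeps the first effect.
kept :: Maybe Int
kept = Just 1 >> Just 2                        -- the result 1 is thrown away

propagated :: Maybe Int
propagated = (Nothing :: Maybe ()) >> Just 2   -- the failure is NOT thrown away

-- The type-correct but wrong definition _ >> n = n would yield Just 2 here instead.
```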
<br />
The <code>fail</code> function is an awful hack that has no place in the <code>Monad</code> class; more on this later.<br />
<br />
The only really interesting thing to look at—and what makes <code>Monad</code> strictly more powerful than <code>Applicative</code>—is <code>(>>=)</code>, which is often called ''bind''. An alternative definition of <code>Monad</code> could look like:<br />
<br />
<haskell><br />
class Applicative m => Monad' m where<br />
(>>=) :: m a -> (a -> m b) -> m b<br />
</haskell><br />
<br />
We could spend a while talking about the intuition behind <code>(>>=)</code>—and we will. But first, let’s look at some examples.<br />
<br />
==Instances==<br />
<br />
Even if you don’t understand the intuition behind the <code>Monad</code> class, you can still create instances of it by just seeing where the types lead you. You may be surprised to find that this actually gets you a long way towards understanding the intuition; at the very least, it will give you some concrete examples to play with as you read more about the <code>Monad</code> class in general. The first few examples are from the standard <code>Prelude</code>; the remaining examples are from the [http://hackage.haskell.org/package/transformers <code>transformers</code> package].<br />
<br />
<ul><br />
<li>The simplest possible instance of <code>Monad</code> is [http://hackage.haskell.org/packages/archive/mtl/1.1.0.2/doc/html/Control-Monad-Identity.html <code>Identity</code>], which is described in Dan Piponi’s highly recommended blog post on [http://blog.sigfpe.com/2007/04/trivial-monad.html The Trivial Monad]. Despite being “trivial”, it is a great introduction to the <code>Monad</code> type class, and contains some good exercises to get your brain working.<br />
</li><br />
<li>The next simplest instance of <code>Monad</code> is <code>Maybe</code>. We already know how to write <code>return</code>/<code>pure</code> for <code>Maybe</code>. So how do we write <code>(>>=)</code>? Well, let’s think about its type. Specializing for <code>Maybe</code>, we have<br />
<br />
<haskell><br />
(>>=) :: Maybe a -> (a -> Maybe b) -> Maybe b<br />
</haskell><br />
<br />
If the first argument to <code>(>>=)</code> is <code>Just x</code>, then we have something of type <code>a</code> (namely, <code>x</code>), to which we can apply the second argument—resulting in a <code>Maybe b</code>, which is exactly what we wanted. What if the first argument to <code>(>>=)</code> is <code>Nothing</code>? In that case, we don’t have anything to which we can apply the <code>a -> Maybe b</code> function, so there’s only one thing we can do: yield <code>Nothing</code>. This instance is:<br />
<br />
<haskell><br />
instance Monad Maybe where<br />
return = Just<br />
(Just x) >>= g = g x<br />
Nothing >>= _ = Nothing<br />
</haskell><br />
<br />
We can already get a bit of intuition as to what is going on here: if we build up a computation by chaining together a bunch of functions with <code>(>>=)</code>, as soon as any one of them fails, the entire computation will fail (because <code>Nothing >>= f</code> is <code>Nothing</code>, no matter what <code>f</code> is). The entire computation succeeds only if all the constituent functions individually succeed. So the <code>Maybe</code> monad models computations which may fail.<br />
</li><br />
<br />
<li>The <code>Monad</code> instance for the list constructor <code>[]</code> is similar to its <code>Applicative</code> instance; see the exercise below.<br />
</li><br />
<br />
<li>Of course, the <code>IO</code> constructor is famously a <code>Monad</code>, but its implementation is somewhat magical, and may in fact differ from compiler to compiler. It is worth emphasizing that the <code>IO</code> monad is the ''only'' monad which is magical. It allows us to build up, in an entirely pure way, values representing possibly effectful computations. The special value <code>main</code>, of type <code>IO ()</code>, is taken by the runtime and actually executed, producing actual effects. Every other monad is functionally pure, and requires no special compiler support. We often speak of monadic values as “effectful computations”, but this is because some monads allow us to write code ''as if'' it has side effects, when in fact the monad is hiding the plumbing which allows these apparent side effects to be implemented in a functionally pure way.<br />
</li><br />
<br />
<li>As mentioned earlier, <code>((->) e)</code> is known as the ''reader monad'', since it describes computations in which a value of type <code>e</code> is available as a read-only environment.<br />
<br />
The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-Reader.html <code>Control.Monad.Reader</code>] module provides the <code>Reader e a</code> type, which is just a convenient <code>newtype</code> wrapper around <code>(e -> a)</code>, along with an appropriate <code>Monad</code> instance and some <code>Reader</code>-specific utility functions such as <code>ask</code> (retrieve the environment), <code>asks</code> (retrieve a function of the environment), and <code>local</code> (run a subcomputation under a different environment).<br />
</li><br />
<br />
<li>The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-Writer-Lazy.html <code>Control.Monad.Writer</code>] module provides the <code>Writer</code> monad, which allows information to be collected as a computation progresses. <code>Writer w a</code> is isomorphic to <code>(a,w)</code>, where the output value <code>a</code> is carried along with an annotation or “log” of type <code>w</code>, which must be an instance of <code>Monoid</code> (see [[#Monoid|section Monoid]]); the special function <code>tell</code> performs logging.<br />
</li><br />
<br />
<li>The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-State-Lazy.html <code>Control.Monad.State</code>] module provides the <code>State s a</code> type, a <code>newtype</code> wrapper around <code>s -> (a,s)</code>. Something of type <code>State s a</code> represents a stateful computation which produces an <code>a</code> but can access and modify the state of type <code>s</code> along the way. The module also provides <code>State</code>-specific utility functions such as <code>get</code> (read the current state), <code>gets</code> (read a function of the current state), <code>put</code> (overwrite the state), and <code>modify</code> (apply a function to the state).<br />
</li><br />
<br />
<li>The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-Cont.html <code>Control.Monad.Cont</code>] module provides the <code>Cont</code> monad, which represents computations in continuation-passing style. It can be used to suspend and resume computations, and to implement non-local transfers of control, co-routines, and other complex control structures—all in a functionally pure way. <code>Cont</code> has been called the [http://blog.sigfpe.com/2008/12/mother-of-all-monads.html “mother of all monads”] because of its universal properties.<br />
</li><br />
</ul><br />
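<br />
To make the <code>Maybe</code> intuition concrete, here is a sketch chaining computations with <code>(>>=)</code>; <code>divBy</code> is a hypothetical helper, not a standard function:<br />

```haskell
-- A hypothetical helper (not a standard function): divide by d, failing on zero.
divBy :: Int -> Int -> Maybe Int
divBy d x = if d == 0 then Nothing else Just (x `div` d)

-- Every step succeeds, so the whole chain succeeds:
succeeds :: Maybe Int
succeeds = Just 100 >>= divBy 5 >>= divBy 2

-- The middle step fails, so the whole chain fails:
fails :: Maybe Int
fails = Just 100 >>= divBy 0 >>= divBy 2
```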
<br />
{{Exercises|<br />
<ol><br />
<li>Implement a <code>Monad</code> instance for the list constructor, <code>[]</code>. Follow the types!</li><br />
<li>Implement a <code>Monad</code> instance for <code>((->) e)</code>.</li><br />
<li>Implement <code>Functor</code> and <code>Monad</code> instances for <code>Free f</code>, defined as<br />
<haskell><br />
data Free f a = Var a<br />
| Node (f (Free f a))<br />
</haskell><br />
You may assume that <code>f</code> has a <code>Functor</code> instance. This is known as the ''free monad'' built from the functor <code>f</code>.<br />
</li><br />
</ol><br />
}}<br />
<br />
==Intuition==<br />
<br />
Let’s look more closely at the type of <code>(>>=)</code>. The basic intuition is that it combines two computations into one larger computation. The first argument, <code>m a</code>, is the first computation. However, it would be boring if the second argument were just an <code>m b</code>; then there would be no way for the computations to interact with one another (actually, this is exactly the situation with <code>Applicative</code>). So, the second argument to <code>(>>=)</code> has type <code>a -> m b</code>: a function of this type, given a ''result'' of the first computation, can produce a second computation to be run. In other words, <code>x >>= k</code> is a computation which runs <code>x</code>, and then uses the result(s) of <code>x</code> to ''decide'' what computation to run second, using the output of the second computation as the result of the entire computation.<br />
<br />
{{note|Actually, because Haskell allows general recursion, this is a lie: using a Haskell parsing library one can recursively construct ''infinite'' grammars, and hence <code>Applicative</code> (together with <code>Alternative</code>) is enough to parse any context-sensitive language with a finite alphabet. See [http://byorgey.wordpress.com/2012/01/05/parsing-context-sensitive-languages-with-applicative/ Parsing context-sensitive languages with Applicative].}}<br />
Intuitively, it is this ability to use the output from previous computations to decide what computations to run next that makes <code>Monad</code> more powerful than <code>Applicative</code>. The structure of an <code>Applicative</code> computation is fixed, whereas the structure of a <code>Monad</code> computation can change based on intermediate results. This also means that parsers built using an <code>Applicative</code> interface can only parse context-free languages; in order to parse context-sensitive languages a <code>Monad</code> interface is needed.{{noteref}}<br />
<br />
To see the increased power of <code>Monad</code> from a different point of view, let’s see what happens if we try to implement <code>(>>=)</code> in terms of <code>fmap</code>, <code>pure</code>, and <code>(<*>)</code>. We are given a value <code>x</code> of type <code>m a</code>, and a function <code>k</code> of type <code>a -> m b</code>, so the only thing we can do is apply <code>k</code> to <code>x</code>. We can’t apply it directly, of course; we have to use <code>fmap</code> to lift it over the <code>m</code>. But what is the type of <code>fmap k</code>? Well, it’s <code>m a -> m (m b)</code>. So after we apply it to <code>x</code>, we are left with something of type <code>m (m b)</code>—but now we are stuck; what we really want is an <code>m b</code>, but there’s no way to get there from here. We can ''add'' <code>m</code>’s using <code>pure</code>, but we have no way to ''collapse'' multiple <code>m</code>’s into one.<br />
<br />
{{note|1=You might hear some people claim that the definition in terms of <code>return</code>, <code>fmap</code>, and <code>join</code> is the “math definition” and the definition in terms of <code>return</code> and <code>(>>=)</code> is something specific to Haskell. In fact, both definitions were known in the mathematics community long before Haskell picked up monads.}}<br />
<br />
This ability to collapse multiple <code>m</code>’s is exactly the ability provided by the function <code>join :: m (m a) -> m a</code>, and it should come as no surprise that an alternative definition of <code>Monad</code> can be given in terms of <code>join</code>:<br />
<br />
<haskell><br />
class Applicative m => Monad'' m where<br />
join :: m (m a) -> m a<br />
</haskell><br />
<br />
In fact, the canonical definition of monads in category theory is in terms of <code>return</code>, <code>fmap</code>, and <code>join</code> (often called <math>\eta</math>, <math>T</math>, and <math>\mu</math> in the mathematical literature). Haskell uses an alternative formulation with <code>(>>=)</code> instead of <code>join</code> since it is more convenient to use {{noteref}}. However, sometimes it can be easier to think about <code>Monad</code> instances in terms of <code>join</code>, since it is a more “atomic” operation. (For example, <code>join</code> for the list monad is just <code>concat</code>.)<br />
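<br />
A couple of concrete uses of <code>join</code> (from <code>Control.Monad</code>): for lists it is <code>concat</code>, and for <code>Maybe</code> it collapses one layer of <code>Just</code>:<br />

```haskell
import Control.Monad (join)

joinedList :: [Int]
joinedList = join [[1, 2], [3], []]    -- the same as concat

joinedMaybe :: Maybe Int
joinedMaybe = join (Just (Just 3))     -- collapses one layer of Just

collapsed :: Maybe Int
collapsed = join (Just Nothing)        -- an inner failure survives the collapse
```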
<br />
{{Exercises|<br />
# Implement <code>(>>{{=}})</code> in terms of <code>fmap</code> (or <code>liftM</code>) and <code>join</code>.<br />
# Now implement <code>join</code> and <code>fmap</code> (<code>liftM</code>) in terms of <code>(>>{{=}})</code> and <code>return</code>.<br />
}}<br />
<br />
==Utility functions==<br />
<br />
The [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad.html <code>Control.Monad</code>] module provides a large number of convenient utility functions, all of which can be implemented in terms of the basic <code>Monad</code> operations (<code>return</code> and <code>(>>=)</code> in particular). We have already seen one of them, namely, <code>join</code>. We also mention some other noteworthy ones here; implementing these utility functions oneself is a good exercise. For a more detailed guide to these functions, with commentary and example code, see Henk-Jan van Tuyl’s [http://members.chello.nl/hjgtuyl/tourdemonad.html tour].<br />
<br />
{{note|This will most likely change in Haskell 2014 with the implementation of the [[Functor-Applicative-Monad_Proposal|Haskell 2014 Applicative => Monad proposal]].}}<br />
<br />
* <code>liftM :: Monad m => (a -> b) -> m a -> m b</code>. This should be familiar; of course, it is just <code>fmap</code>. The fact that we have both <code>fmap</code> and <code>liftM</code> is an unfortunate consequence of the fact that the <code>Monad</code> type class does not require a <code>Functor</code> instance, even though mathematically speaking, every monad is a functor. However, <code>fmap</code> and <code>liftM</code> are essentially interchangeable, since it is a bug (in a social rather than technical sense) for any type to be an instance of <code>Monad</code> without also being an instance of <code>Functor</code> {{noteref}}.<br />
<br />
* <code>ap :: Monad m => m (a -> b) -> m a -> m b</code> should also be familiar: it is equivalent to <code>(<*>)</code>, justifying the claim that the <code>Monad</code> interface is strictly more powerful than <code>Applicative</code>. We can make any <code>Monad</code> into an instance of <code>Applicative</code> by setting <code>pure = return</code> and <code>(<*>) = ap</code>.<br />
<br />
* <code>sequence :: Monad m => [m a] -> m [a]</code> takes a list of computations and combines them into one computation which collects a list of their results. It is again something of a historical accident that <code>sequence</code> has a <code>Monad</code> constraint, since it can actually be implemented using only the <code>Applicative</code> interface. There is an additional generalization of <code>sequence</code> to structures other than lists, which will be discussed in the [[#Traversable|section on <code>Traversable</code>]].<br />
<br />
* <code>replicateM :: Monad m => Int -> m a -> m [a]</code> is simply a combination of [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html#v:replicate <code>replicate</code>] and <code>sequence</code>.<br />
<br />
* <code>when :: Monad m => Bool -> m () -> m ()</code> conditionally executes a computation, evaluating to its second argument if the test is <code>True</code>, and to <code>return ()</code> if the test is <code>False</code>. A collection of other sorts of monadic conditionals can be found in the [http://hackage.haskell.org/package/IfElse <code>IfElse</code> package].<br />
<br />
* <code>mapM :: Monad m => (a -> m b) -> [a] -> m [b]</code> maps its first argument over the second, and <code>sequence</code>s the results. The <code>forM</code> function is just <code>mapM</code> with its arguments reversed; it is called <code>forM</code> since it models generalized <code>for</code> loops: the list <code>[a]</code> provides the loop indices, and the function <code>a -> m b</code> specifies the “body” of the loop for each index.<br />
<br />
* <code>(=<<) :: Monad m => (a -> m b) -> m a -> m b</code> is just <code>(>>=)</code> with its arguments reversed; sometimes this direction is more convenient since it corresponds more closely to function application.<br />
<br />
* <code>(>=>) :: Monad m => (a -> m b) -> (b -> m c) -> a -> m c</code> is sort of like function composition, but with an extra <code>m</code> on the result type of each function, and the arguments swapped. We’ll have more to say about this operation later. There is also a flipped variant, <code>(<=<)</code>.<br />
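A small example of such "Kleisli composition" (the names <code>halve</code> and <code>quarter</code> are ours):

```haskell
import Control.Monad ((>=>))

-- An effectful function: halving fails on odd numbers.
halve :: Int -> Maybe Int
halve n = if even n then Just (n `div` 2) else Nothing

-- Composing it with itself; the whole pipeline fails if either step does.
quarter :: Int -> Maybe Int
quarter = halve >=> halve
```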
<br />
* The <code>guard</code> function is for use with instances of <code>MonadPlus</code>, which is discussed at the end of the [[#Monoid|<code>Monoid</code> section]].<br />
<br />
Many of these functions also have “underscored” variants, such as <code>sequence_</code> and <code>mapM_</code>; these variants throw away the results of the computations passed to them as arguments, using them only for their side effects.<br />
<br />
Other monadic functions which are occasionally useful include <code>filterM</code>, <code>zipWithM</code>, <code>foldM</code>, and <code>forever</code>.<br />
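As a taste of what these can do, here is a classic (and at first sight surprising) use of <code>filterM</code> in the list monad:

```haskell
import Control.Monad (filterM)

-- With a "nondeterministic predicate" that both keeps and discards
-- each element, filterM computes the powerset of a list.
powerset :: [a] -> [[a]]
powerset = filterM (const [True, False])
```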
<br />
==Laws==<br />
<br />
There are several laws that instances of <code>Monad</code> should satisfy (see also the [[Monad laws]] wiki page). The standard presentation is:<br />
<br />
<haskell><br />
return a >>= k = k a<br />
m >>= return = m<br />
m >>= (\x -> k x >>= h) = (m >>= k) >>= h<br />
<br />
fmap f xs = xs >>= return . f = liftM f xs<br />
</haskell><br />
<br />
The first and second laws express the fact that <code>return</code> behaves nicely: if we inject a value <code>a</code> into a monadic context with <code>return</code>, and then bind to <code>k</code>, it is the same as just applying <code>k</code> to <code>a</code> in the first place; if we bind a computation <code>m</code> to <code>return</code>, nothing changes. The third law essentially says that <code>(>>=)</code> is associative, sort of. The last law ensures that <code>fmap</code> and <code>liftM</code> are the same for types which are instances of both <code>Functor</code> and <code>Monad</code>—which, as already noted, should be every instance of <code>Monad</code>.<br />
<br />
{{note|I like to pronounce this operator “fish”.}}<br />
<br />
However, the presentation of the above laws, especially the third, is marred by the asymmetry of <code>(>>=)</code>. It’s hard to look at the laws and see what they’re really saying. I prefer a much more elegant version of the laws, which is formulated in terms of <code>(>=>)</code> {{noteref}}. Recall that <code>(>=>)</code> “composes” two functions of type <code>a -> m b</code> and <code>b -> m c</code>. You can think of something of type <code>a -> m b</code> (roughly) as a function from <code>a</code> to <code>b</code> which may also have some sort of effect in the context corresponding to <code>m</code>. <code>(>=>)</code> lets us compose these “effectful functions”, and we would like to know what properties <code>(>=>)</code> has. The monad laws reformulated in terms of <code>(>=>)</code> are:<br />
<br />
<haskell><br />
return >=> g = g<br />
g >=> return = g<br />
(g >=> h) >=> k = g >=> (h >=> k)<br />
</haskell><br />
<br />
{{note|As fans of category theory will note, these laws say precisely that functions of type <code>a -> m b</code> are the arrows of a category with <code>(>{{=}}>)</code> as composition! Indeed, this is known as the ''Kleisli category'' of the monad <code>m</code>. It will come up again when we discuss <code>Arrow</code>s.}}<br />
<br />
Ah, much better! The laws simply state that <code>return</code> is the identity of <code>(>=>)</code>, and that <code>(>=>)</code> is associative {{noteref}}.<br />
<br />
There is also a formulation of the monad laws in terms of <code>fmap</code>, <code>return</code>, and <code>join</code>; for a discussion of this formulation, see the Haskell [http://en.wikibooks.org/wiki/Haskell/Category_theory wikibook page on category theory].<br />
<br />
{{Exercises|<br />
# Given the definition <code>g >{{=}}> h {{=}} \x -> g x >>{{=}} h</code>, prove the equivalence of the above laws and the usual monad laws.<br />
}}<br />
<br />
==<code>do</code> notation==<br />
<br />
Haskell’s special <code>do</code> notation supports an “imperative style” of programming by providing syntactic sugar for chains of monadic expressions. The genesis of the notation lies in realizing that something like <code>a >>= \x -> b >> c >>= \y -> d </code> can be more readably written by putting successive computations on separate lines:<br />
<br />
<haskell><br />
a >>= \x -><br />
b >><br />
c >>= \y -><br />
d<br />
</haskell><br />
<br />
This emphasizes that the overall computation consists of four computations <code>a</code>, <code>b</code>, <code>c</code>, and <code>d</code>, and that <code>x</code> is bound to the result of <code>a</code>, and <code>y</code> is bound to the result of <code>c</code> (<code>b</code>, <code>c</code>, and <code>d</code> are allowed to refer to <code>x</code>, and <code>d</code> is allowed to refer to <code>y</code> as well). From here it is not hard to imagine a nicer notation:<br />
<br />
<haskell><br />
do { x <- a<br />
; b<br />
; y <- c<br />
; d<br />
}<br />
</haskell><br />
<br />
(The curly braces and semicolons may optionally be omitted; the Haskell parser uses layout to determine where they should be inserted.) This discussion should make clear that <code>do</code> notation is just syntactic sugar. In fact, <code>do</code> blocks are recursively translated into monad operations (almost) like this:<br />
<br />
<pre><br />
do e → e<br />
do { e; stmts } → e >> do { stmts }<br />
do { v <- e; stmts } → e >>= \v -> do { stmts }<br />
do { let decls; stmts} → let decls in do { stmts }<br />
</pre><br />
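Applying these rules to a concrete <code>do</code> block (our example, in the <code>Maybe</code> monad) shows the sugar and its translation computing the same thing:

```haskell
sugared :: Maybe Int
sugared = do
  x <- Just 3
  y <- Just 4
  return (x + y)

-- The same computation after desugaring via the rules above.
desugared :: Maybe Int
desugared = Just 3 >>= \x -> (Just 4 >>= \y -> return (x + y))
```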
<br />
This is not quite the whole story, since <code>v</code> might be a pattern instead of a variable. For example, one can write<br />
<br />
<haskell><br />
do (x:xs) <- foo<br />
bar x<br />
</haskell><br />
<br />
but what happens if <code>foo</code> produces an empty list? Well, remember that ugly <code>fail</code> function in the <code>Monad</code> type class declaration? That’s what happens. See [http://www.haskell.org/onlinereport/exps.html#sect3.14 section 3.14 of the Haskell Report] for the full details. See also the discussion of <code>MonadPlus</code> and <code>MonadZero</code> in the [[#Other monoidal classes: Alternative, MonadPlus, ArrowPlus|section on other monoidal classes]].<br />
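Concretely (our helper), a failed pattern match in a <code>Maybe</code> <code>do</code> block calls <code>fail</code>, which for <code>Maybe</code> produces <code>Nothing</code> rather than a runtime crash:

```haskell
-- The pattern (x:_) fails on the empty list, invoking fail = Nothing.
firstElem :: [a] -> Maybe a
firstElem xs = do
  (x:_) <- Just xs
  return x
```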
<br />
A final note on intuition: <code>do</code> notation plays very strongly to the “computational context” point of view rather than the “container” point of view, since the binding notation <code>x <- m</code> is suggestive of “extracting” a single <code>x</code> from <code>m</code> and doing something with it. But <code>m</code> may represent some sort of a container, such as a list or a tree; the meaning of <code>x <- m</code> is entirely dependent on the implementation of <code>(>>=)</code>. For example, if <code>m</code> is a list, <code>x <- m</code> actually means that <code>x</code> will take on each value from the list in turn.<br />
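To illustrate with the list monad (our example):

```haskell
-- x and c each "take on" every value in turn, so the do block
-- computes all combinations, like nested loops.
pairs :: [(Int, Char)]
pairs = do
  x <- [1, 2]
  c <- "ab"
  return (x, c)
```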
<br />
==Further reading==<br />
<br />
Philip Wadler was the first to propose using monads to structure functional programs. [http://homepages.inf.ed.ac.uk/wadler/topics/monads.html His paper] is still a readable introduction to the subject.<br />
<br />
{{note|1=<br />
[[All About Monads]],<br />
[http://www.haskell.org/haskellwiki/Monads_as_Containers Monads as containers],<br />
[http://en.wikibooks.org/w/index.php?title=Haskell/Understanding_monads Understanding monads],<br />
[[The Monadic Way]],<br />
[http://blog.sigfpe.com/2006/08/you-could-have-invented-monads-and.html You Could Have Invented Monads! (And Maybe You Already Have.)],<br />
[http://www.haskell.org/pipermail/haskell-cafe/2006-November/019190.html there’s a monster in my Haskell!],<br />
[http://kawagner.blogspot.com/2007/02/understanding-monads-for-real.html Understanding Monads. For real.],<br />
[http://www.randomhacks.net/articles/2007/03/12/monads-in-15-minutes Monads in 15 minutes: Backtracking and Maybe],<br />
[http://www.haskell.org/haskellwiki/Monads_as_computation Monads as computation],<br />
[http://metafoo.co.uk/practical-monads.txt Practical Monads]}}<br />
<br />
There are, of course, numerous monad tutorials of varying quality {{noteref}}.<br />
<br />
A few of the best include Cale Gibbard’s [http://www.haskell.org/haskellwiki/Monads_as_Containers Monads as containers] and [http://www.haskell.org/haskellwiki/Monads_as_computation Monads as computation]; Jeff Newbern’s [[All About Monads]], a comprehensive guide with lots of examples; and Dan Piponi’s [http://blog.sigfpe.com/2006/08/you-could-have-invented-monads-and.html You Could Have Invented Monads!], which features great exercises. If you just want to know how to use <code>IO</code>, you could consult the [[Introduction to IO]]. Even this is just a sampling; the [[monad tutorials timeline]] is a more complete list. (All these monad tutorials have prompted parodies like [http://koweycode.blogspot.com/2007/01/think-of-monad.html think of a monad ...] as well as other kinds of backlash like [http://ahamsandwich.wordpress.com/2007/07/26/monads-and-why-monad-tutorials-are-all-awful/ Monads! (and Why Monad Tutorials Are All Awful)] or [http://byorgey.wordpress.com/2009/01/12/abstraction-intuition-and-the-monad-tutorial-fallacy/ Abstraction, intuition, and the “monad tutorial fallacy”].)<br />
<br />
Other good monad references which are not necessarily tutorials include [http://members.chello.nl/hjgtuyl/tourdemonad.html Henk-Jan van Tuyl’s tour] of the functions in <code>Control.Monad</code>, Dan Piponi’s [http://blog.sigfpe.com/2006/10/monads-field-guide.html field guide], Tim Newsham’s [http://www.thenewsh.com/~newsham/haskell/monad.html What’s a Monad?], and Chris Smith's excellent article [http://cdsmith.wordpress.com/2012/04/18/why-do-monads-matter/ Why Do Monads Matter?]. There are also many blog posts which have been written on various aspects of monads; a collection of links can be found under [[Blog articles/Monads]].<br />
<br />
For help constructing monads from scratch, and for obtaining a "deep embedding" of monad operations suitable for use in, say, compiling a domain-specific language, see [http://projects.haskell.org/operational Apfelmus's operational package].<br />
<br />
One of the quirks of the <code>Monad</code> class and the Haskell type system is that it is not possible to straightforwardly declare <code>Monad</code> instances for types which require a class constraint on their data, even if they are monads from a mathematical point of view. For example, <code>Data.Set</code> requires an <code>Ord</code> constraint on its data, so it cannot be easily made an instance of <code>Monad</code>. A solution to this problem was [http://www.randomhacks.net/articles/2007/03/15/data-set-monad-haskell-macros first described by Eric Kidd], and later made into a [http://hackage.haskell.org/cgi-bin/hackage-scripts/package/rmonad library named rmonad] by Ganesh Sittampalam and Peter Gavin.<br />
<br />
There are many good reasons for eschewing <code>do</code> notation; some have gone so far as to [[Do_notation_considered_harmful|consider it harmful]].<br />
<br />
Monads can be generalized in various ways; for an exposition of one possibility, see Robert Atkey’s paper on [http://homepages.inf.ed.ac.uk/ratkey/paramnotions-jfp.pdf parameterized monads], or Dan Piponi’s [http://blog.sigfpe.com/2009/02/beyond-monads.html Beyond Monads].<br />
<br />
For the categorically inclined, monads can be viewed as monoids ([http://blog.sigfpe.com/2008/11/from-monoids-to-monads.html From Monoids to Monads]) and also as closure operators ([http://blog.plover.com/math/monad-closure.html Triples and Closure]). Derek Elkins’ article in [http://www.haskell.org/wikiupload/8/85/TMR-Issue13.pdf issue 13 of the Monad.Reader] contains an exposition of the category-theoretic underpinnings of some of the standard <code>Monad</code> instances, such as <code>State</code> and <code>Cont</code>. Jonathan Hill and Keith Clarke have [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.53.6497 an early paper explaining the connection between monads as they arise in category theory and as used in functional programming]. There is also a [http://okmij.org/ftp/Computation/IO-monad-history.html web page by Oleg Kiselyov] explaining the history of the IO monad.<br />
<br />
Links to many more research papers related to monads can be found under [[Research papers/Monads and arrows]].<br />
<br />
=Monad transformers=<br />
<br />
One would often like to be able to combine two monads into one: for example, to have stateful, nondeterministic computations (<code>State</code> + <code>[]</code>), or computations which may fail and can consult a read-only environment (<code>Maybe</code> + <code>Reader</code>), and so on. Unfortunately, monads do not compose as nicely as applicative functors (yet another reason to use <code>Applicative</code> if you don’t need the full power that <code>Monad</code> provides), but some monads can be combined in certain ways.<br />
<br />
==Standard monad transformers==<br />
<br />
The [http://hackage.haskell.org/package/transformers transformers] library provides a number of standard ''monad transformers''. Each monad transformer adds a particular capability/feature/effect to any existing monad.<br />
<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Identity.html <code>IdentityT</code>] is the identity transformer, which maps a monad to (something isomorphic to) itself. This may seem useless at first glance, but it is useful for the same reason that the <code>id</code> function is useful -- it can be passed as an argument to things which are parameterized over an arbitrary monad transformer, when you do not actually want any extra capabilities.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-State.html <code>StateT</code>] adds a read-write state.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Reader.html <code>ReaderT</code>] adds a read-only environment.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Writer.html <code>WriterT</code>] adds a write-only log.<br />
* [http://hackage.haskell.org/packages/archive/transformers/0.2.2.0/doc/html/Control-Monad-Trans-RWS.html <code>RWST</code>] conveniently combines <code>ReaderT</code>, <code>WriterT</code>, and <code>StateT</code> into one.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Maybe.html <code>MaybeT</code>] adds the possibility of failure.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Error.html <code>ErrorT</code>] adds the possibility of failure with an arbitrary type to represent errors.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-List.html <code>ListT</code>] adds non-determinism (however, see the discussion of <code>ListT</code> below).<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Cont.html <code>ContT</code>] adds continuation handling.<br />
<br />
For example, <code>StateT s Maybe</code> is an instance of <code>Monad</code>; computations of type <code>StateT s Maybe a</code> may fail, and have access to a mutable state of type <code>s</code>. Monad transformers can be multiply stacked. One thing to keep in mind while using monad transformers is that the order of composition matters. For example, when a <code>StateT s Maybe a</code> computation fails, the state ceases being updated (indeed, it simply disappears); on the other hand, the state of a <code>MaybeT (State s) a</code> computation may continue to be modified even after the computation has "failed". This may seem backwards, but it is correct. Monad transformers build composite monads “inside out”; <code>MaybeT (State s) a</code> is isomorphic to <code>s -> (Maybe a, s)</code>. (Lambdabot has an indispensable <code>@unmtl</code> command which you can use to “unpack” a monad transformer stack in this way.)<br />
Intuitively, the monads become "more fundamental" the further inside the stack you get, and the effects of inner monads "have precedence" over the effects of outer ones. Of course, this is just handwaving, and if you are unsure of the proper order for some monads you wish to combine, there is no substitute for using <code>@unmtl</code> or simply trying out the various options.<br />
<br />
==Definition and laws==<br />
<br />
All monad transformers should implement the <code>MonadTrans</code> type class, defined in <code>Control.Monad.Trans.Class</code>:<br />
<br />
<haskell><br />
class MonadTrans t where<br />
lift :: Monad m => m a -> t m a<br />
</haskell><br />
<br />
It allows arbitrary computations in the base monad <code>m</code> to be “lifted” into computations in the transformed monad <code>t m</code>. (Note that type application associates to the left, just like function application, so <code>t m a = (t m) a</code>.)<br />
<br />
<code>lift</code> must satisfy the laws<br />
<haskell><br />
lift . return = return<br />
lift (m >>= f) = lift m >>= (lift . f)<br />
</haskell><br />
which intuitively state that <code>lift</code> transforms <code>m a</code> computations into <code>t m a</code> computations in a "sensible" way, which sends the <code>return</code> and <code>(>>=)</code> of <code>m</code> to the <code>return</code> and <code>(>>=)</code> of <code>t m</code>.<br />
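To make this concrete, here is a from-scratch sketch of <code>MaybeT</code> and its <code>lift</code> (we call it <code>liftMaybeT</code> to avoid clashing with the class method; the real definition lives in <code>Control.Monad.Trans.Maybe</code> in the <code>transformers</code> package):

```haskell
import Control.Monad (liftM, ap)

-- MaybeT adds the possibility of failure to any base monad m.
newtype MaybeT m a = MaybeT { runMaybeT :: m (Maybe a) }

instance Monad m => Functor (MaybeT m) where
  fmap = liftM

instance Monad m => Applicative (MaybeT m) where
  pure  = MaybeT . return . Just
  (<*>) = ap

instance Monad m => Monad (MaybeT m) where
  return = pure
  MaybeT x >>= f = MaybeT $ do
    mb <- x                      -- run the inner m (Maybe a) computation
    case mb of
      Nothing -> return Nothing  -- failure short-circuits
      Just a  -> runMaybeT (f a)

-- lift for this transformer: tag the base computation's result with Just.
liftMaybeT :: Monad m => m a -> MaybeT m a
liftMaybeT = MaybeT . fmap Just
```

Note how <code>liftMaybeT</code> sends <code>return</code> of <code>m</code> to <code>return</code> of <code>MaybeT m</code>, as the first law requires.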
<br />
{{Exercises|<br />
# What is the kind of <code>t</code> in the declaration of <code>MonadTrans</code>?<br />
}}<br />
<br />
==Transformer type classes and "capability" style==<br />
<br />
{{note|The only problem with this scheme is the quadratic number of instances required as the number of standard monad transformers grows—but as the current set of standard monad transformers seems adequate for most common use cases, this may not be that big of a deal.}}<br />
<br />
There are also type classes (provided by the [http://hackage.haskell.org/package/mtl <code>mtl</code> package]) for the operations of each transformer. For example, the <code>MonadState</code> type class provides the state-specific methods <code>get</code> and <code>put</code>, allowing you to conveniently use these methods not only with <code>State</code>, but with any monad which is an instance of <code>MonadState</code>—including <code>MaybeT (State s)</code>, <code>StateT s (ReaderT r IO)</code>, and so on. Similar type classes exist for <code>Reader</code>, <code>Writer</code>, <code>Cont</code>, <code>IO</code>, and others {{noteref}}.<br />
<br />
These type classes serve two purposes. First, they get rid of (most of) the need for explicitly using <code>lift</code>, giving a type-directed way to automatically determine the right number of calls to <code>lift</code>. Simply writing <code>put</code> will be automatically translated into <code>lift . put</code>, <code>lift . lift . put</code>, or something similar depending on what concrete monad stack you are using.<br />
<br />
Second, they give you more flexibility to switch between different concrete monad stacks. For example, if you are writing a state-based algorithm, don't write<br />
<haskell><br />
foo :: State Int Char<br />
foo = modify (*2) >> return 'x'<br />
</haskell><br />
but rather<br />
<haskell><br />
foo :: MonadState Int m => m Char<br />
foo = modify (*2) >> return 'x'<br />
</haskell><br />
Now, if somewhere down the line you realize you need to introduce the possibility of failure, you might switch from <code>State Int</code> to <code>MaybeT (State Int)</code>. The type of the first version of <code>foo</code> would need to be modified to reflect this change, but the second version of <code>foo</code> can still be used as-is.<br />
<br />
However, this sort of "capability-based" style (<i>e.g.</i> specifying that <code>foo</code> works for any monad with the "state capability") quickly runs into problems when you try to naively scale it up: for example, what if you need to maintain two independent states? A framework for solving this and related problems is described by Schrijvers and Oliveira ([http://users.ugent.be/~tschrijv/Research/papers/icfp2011.pdf Monads, zippers and views: virtualizing the monad stack, ICFP 2011]) and is implemented in the [http://hackage.haskell.org/package/Monatron <code>Monatron</code> package].<br />
<br />
==Composing monads==<br />
<br />
Is the composition of two monads always a monad? As hinted previously, the answer is no.<br />
<br />
Since <code>Applicative</code> functors are closed under composition, the problem must lie with <code>join</code>. Indeed, suppose <code>m</code> and <code>n</code> are arbitrary monads; to make a monad out of their composition we would need to be able to implement<br />
<haskell><br />
join :: m (n (m (n a))) -> m (n a)<br />
</haskell><br />
but it is not clear how this could be done in general. The <code>join</code> method for <code>m</code> is no help, because the two occurrences of <code>m</code> are not next to each other (and likewise for <code>n</code>).<br />
<br />
However, one situation in which it can be done is if <code>n</code> ''distributes'' over <code>m</code>, that is, if there is a function<br />
<haskell><br />
distrib :: n (m a) -> m (n a)<br />
</haskell><br />
satisfying certain laws. See Jones and Duponcheel ([http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.42.2605 Composing Monads]); see also the [[#Traversable|section on Traversable]].<br />
<br />
For a much more in-depth discussion and analysis of the failure of monads to be closed under composition, see [http://stackoverflow.com/questions/13034229/concrete-example-showing-that-monads-are-not-closed-under-composition-with-proo?lq=1 this question on StackOverflow].<br />
<br />
{{Exercises|<br />
* Implement <code>join :: M (N (M (N a))) -> M (N a)</code>, given <code>distrib :: N (M a) -> M (N a)</code> and assuming <code>M</code> and <code>N</code> are instances of <code>Monad</code>.<br />
}}<br />
<br />
==Further reading==<br />
<br />
Much of the monad transformer library (originally [http://hackage.haskell.org/package/mtl <code>mtl</code>], now split between <code>mtl</code> and [http://hackage.haskell.org/package/transformers <code>transformers</code>]), including the <code>Reader</code>, <code>Writer</code>, <code>State</code>, and other monads, as well as the monad transformer framework itself, was inspired by Mark Jones’ classic paper [http://web.cecs.pdx.edu/~mpj/pubs/springschool.html Functional Programming with Overloading and Higher-Order Polymorphism]. It’s still very much worth a read—and highly readable—after almost fifteen years.<br />
<br />
See [http://article.gmane.org/gmane.comp.lang.haskell.libraries/17139 Edward Kmett's mailing list message] for a description of the history and relationships among monad transformer packages (<code>mtl</code>, <code>transformers</code>, <code>monads-fd</code>, <code>monads-tf</code>).<br />
<br />
There are two excellent references on monad transformers. Martin Grabmüller’s [http://www.grabmueller.de/martin/www/pub/Transformers.en.html Monad Transformers Step by Step] is a thorough description, with running examples, of how to use monad transformers to elegantly build up computations with various effects. [http://cale.yi.org/index.php/How_To_Use_Monad_Transformers Cale Gibbard’s article] on how to use monad transformers is more practical, describing how to structure code using monad transformers to make writing it as painless as possible. Another good starting place for learning about monad transformers is a [http://blog.sigfpe.com/2006/05/grok-haskell-monad-transformers.html blog post by Dan Piponi].<br />
<br />
The <code>ListT</code> transformer from the <code>transformers</code> package comes with the caveat that <code>ListT m</code> is only a monad when <code>m</code> is ''commutative'', that is, when <code>ma >>= \a -> mb >>= \b -> foo</code> is equivalent to <code>mb >>= \b -> ma >>= \a -> foo</code> (i.e. the order of <code>m</code>'s effects does not matter). For one explanation why, see Dan Piponi's blog post [http://blog.sigfpe.com/2006/11/why-isnt-listt-monad.html "Why isn't <code><nowiki>ListT []</nowiki></code> a monad"]. For more examples, as well as a design for a version of <code>ListT</code> which does not have this problem, see [http://www.haskell.org/haskellwiki/ListT_done_right <code>ListT</code> done right].<br />
<br />
There is an alternative way to compose monads, using coproducts, as described by [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.8.3581 Lüth and Ghani]. This method is interesting but has not (yet?) seen widespread use. For a more recent alternative, see Kiselyov et al's [http://okmij.org/ftp/Haskell/extensible/exteff.pdf Extensible Effects: An Alternative to Monad Transformers].<br />
<br />
=MonadFix=<br />
<br />
''Note: <code>MonadFix</code> is included here for completeness (and because it is interesting) but seems not to be used much. Skipping this section on a first read-through is perfectly OK (and perhaps even recommended).''<br />
<br />
==<code>mdo</code>/<code>do rec</code> notation==<br />
<br />
{{note|In GHC 7.6, the flag has been changed to <code>-XRecursiveDo</code>.}}<br />
The <code>MonadFix</code> class describes monads which support the special fixpoint operation <code>mfix :: (a -> m a) -> m a</code>, which allows the output of monadic computations to be defined via (effectful) recursion. This is [http://www.haskell.org/ghc/docs/latest/html/users_guide/syntax-extns.html#recursive-do-notation supported in GHC] by a special “recursive do” notation, enabled by the <code>-XDoRec</code> flag{{noteref}}. Within a <code>do</code> block, one may have a nested <code>rec</code> block, like so:<br />
<haskell><br />
do { x <- foo<br />
; rec { y <- baz<br />
; z <- bar<br />
; bob<br />
}<br />
; w <- frob<br />
}<br />
</haskell><br />
Normally (if we had <code>do</code> in place of <code>rec</code> in the above example), <code>y</code> would be in scope in <code>bar</code> and <code>bob</code> but not in <code>baz</code>, and <code>z</code> would be in scope only in <code>bob</code>. With the <code>rec</code>, however, <code>y</code> and <code>z</code> are both in scope in all three of <code>baz</code>, <code>bar</code>, and <code>bob</code>. A <code>rec</code> block is analogous to a <code>let</code> block such as<br />
<haskell><br />
let { y = baz<br />
; z = bar<br />
}<br />
in bob<br />
</haskell><br />
because, in Haskell, every variable bound in a <code>let</code>-block is in scope throughout the entire block. (From this point of view, Haskell's normal <code>do</code> blocks are analogous to Scheme's <code>let*</code> construct.)<br />
<br />
What could such a feature be used for? One of the motivating examples given in the original paper describing <code>MonadFix</code> (see below) is encoding circuit descriptions. A line in a <code>do</code>-block such as <br />
<haskell><br />
x <- gate y z<br />
</haskell><br />
describes a gate whose input wires are labeled <code>y</code> and <code>z</code> and whose output wire is labeled <code>x</code>. Many (most?) useful circuits, however, involve some sort of feedback loop, making them impossible to write in a normal <code>do</code>-block (since some wire would have to be mentioned as an input ''before'' being listed as an output). Using a <code>rec</code> block solves this problem.<br />
<br />
==Examples and intuition==<br />
<br />
Of course, not every monad supports such recursive binding. However, as mentioned above, it suffices to have an implementation of <code>mfix :: (a -> m a) -> m a</code>, satisfying a few laws. Let's try implementing <code>mfix</code> for the <code>Maybe</code> monad. That is, we want to implement a function<br />
<haskell><br />
maybeFix :: (a -> Maybe a) -> Maybe a<br />
</haskell><br />
{{note|Actually, <code>fix</code> is implemented slightly differently for efficiency reasons; but the given definition is equivalent and simpler for the present purpose.}}<br />
Let's think for a moment about the implementation {{noteref}} of the non-monadic <code>fix :: (a -> a) -> a</code>:<br />
<haskell><br />
fix f = f (fix f)<br />
</haskell><br />
Inspired by <code>fix</code>, our first attempt at implementing <code>maybeFix</code> might be something like<br />
<haskell><br />
maybeFix :: (a -> Maybe a) -> Maybe a<br />
maybeFix f = maybeFix f >>= f<br />
</haskell><br />
This has the right type. However, something seems wrong: there is nothing in particular here about <code>Maybe</code>; <code>maybeFix</code> actually has the more general type <code>Monad m => (a -> m a) -> m a</code>. But didn't we just say that not all monads support <code>mfix</code>?<br />
<br />
The answer is that although this implementation of <code>maybeFix</code> has the right type, it does ''not'' have the intended semantics. If we think about how <code>(>>=)</code> works for the <code>Maybe</code> monad (by pattern-matching on its first argument to see whether it is <code>Nothing</code> or <code>Just</code>) we can see that this definition of <code>maybeFix</code> is completely useless: it will just recurse infinitely, trying to decide whether it is going to return <code>Nothing</code> or <code>Just</code>, without ever even so much as a glance in the direction of <code>f</code>.<br />
<br />
The trick is to simply ''assume'' that <code>maybeFix</code> will return <code>Just</code>, and get on with life!<br />
<haskell><br />
maybeFix :: (a -> Maybe a) -> Maybe a<br />
maybeFix f = ma<br />
where ma = f (fromJust ma)<br />
</haskell><br />
This says that the result of <code>maybeFix</code> is <code>ma</code>, and assuming that <code>ma = Just x</code>, it is defined (recursively) to be equal to <code>f x</code>.<br />
<br />
Why is this OK? Isn't <code>fromJust</code> almost as bad as <code>unsafePerformIO</code>? Well, usually, yes. This is just about the only situation in which it is justified! The interesting thing to note is that <code>maybeFix</code> ''will never crash'' -- although it may, of course, fail to terminate. The only way we could get a crash is if we try to evaluate <code>fromJust ma</code> when we know that <code>ma = Nothing</code>. But how could we know <code>ma = Nothing</code>? Since <code>ma</code> is defined as <code>f (fromJust ma)</code>, it must be that this expression has already been evaluated to <code>Nothing</code> -- in which case there is no reason for us to be evaluating <code>fromJust ma</code> in the first place! <br />
<br />
To see this from another point of view, we can consider three possibilities. First, if <code>f</code> outputs <code>Nothing</code> without looking at its argument, then <code>maybeFix f</code> clearly returns <code>Nothing</code>. Second, if <code>f</code> always outputs <code>Just x</code>, where <code>x</code> depends on its argument, then the recursion can proceed usefully: <code>fromJust ma</code> will be able to evaluate to <code>x</code>, thus feeding <code>f</code>'s output back to it as input. Third, if <code>f</code> tries to use its argument to decide whether to output <code>Just</code> or <code>Nothing</code>, then <code>maybeFix f</code> will not terminate: evaluating <code>f</code>'s argument requires evaluating <code>ma</code> to see whether it is <code>Just</code>, which requires evaluating <code>f (fromJust ma)</code>, which requires evaluating <code>ma</code>, ... and so on.<br />
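The first two cases are easy to observe directly (we repeat the <code>maybeFix</code> definition from above so the sketch is self-contained; the third case, of course, cannot be run to completion):

```haskell
import Data.Maybe (fromJust)

maybeFix :: (a -> Maybe a) -> Maybe a
maybeFix f = ma
  where ma = f (fromJust ma)

-- Case 1: f ignores its argument and fails outright.
caseOne :: Maybe [Int]
caseOne = maybeFix (const Nothing)

-- Case 2: f always succeeds; its output is lazily fed back as its
-- input, producing an infinite list of 1s.
caseTwo :: Maybe [Int]
caseTwo = maybeFix (\xs -> Just (1 : xs))
```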
<br />
There are also instances of <code>MonadFix</code> for lists (which works analogously to the instance for <code>Maybe</code>), for <code>ST</code>, and for <code>IO</code>. The [http://hackage.haskell.org/packages/archive/base/latest/doc/html/src/System-IO.html#fixIO instance for <code>IO</code>] is particularly amusing: it creates a new (empty) <code>MVar</code>, immediately reads its contents using <code>unsafeInterleaveIO</code> (which delays the actual reading lazily until the value is needed), uses the contents of the <code>MVar</code> to compute a new value, which it then writes back into the <code>MVar</code>. It almost seems, spookily, that <code>mfix</code> is sending a value back in time to itself through the <code>MVar</code> -- though of course what is really going on is that the reading is delayed just long enough (via <code>unsafeInterleaveIO</code>) to get the process bootstrapped.<br />
<br />
{{Exercises|<br />
* Implement a <code>MonadFix</code> instance for <code>[]</code>.<br />
}}<br />
<br />
==GHC 7.6 changes==<br />
<br />
GHC 7.6 reinstated the old <code>mdo</code> syntax, so the example at the start of this section can be written<br />
<br />
<haskell><br />
mdo { x <- foo<br />
; y <- baz<br />
; z <- bar<br />
; bob<br />
; w <- frob<br />
}<br />
</haskell><br />
<br />
which will be translated into the original example (assuming that, say, <code>bar</code> and <code>bob</code> refer to <code>y</code>). The difference is that <code>mdo</code> will analyze the code in order to find minimal recursive blocks, which will be placed in <code>rec</code> blocks, whereas <code>rec</code> blocks desugar directly into calls to <code>mfix</code> without any further analysis.<br />
<br />
==Further reading==<br />
<br />
For more information (such as the precise desugaring rules for <code>rec</code> blocks), see Levent Erkök and John Launchbury's 2002 Haskell workshop paper, [http://sites.google.com/site/leventerkok/recdo.pdf?attredirects=0 A Recursive do for Haskell], or for full details, Levent Erkök’s thesis, [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.15.1543&rep=rep1&type=pdf Value Recursion in Monadic Computations]. (Note, while reading, that <code>MonadFix</code> used to be called <code>MonadRec</code>.) You can also read the [http://www.haskell.org/ghc/docs/latest/html/users_guide/syntax-extns.html#recursive-do-notation GHC user manual section on recursive do-notation].<br />
<br />
=Semigroup=<br />
<br />
A semigroup is a set <math>S\ </math> together with a binary operation <math>\oplus\ </math> which<br />
combines elements from <math>S\ </math>. The <math>\oplus\ </math> operator is required to be associative<br />
(that is, <math>(a \oplus b) \oplus c = a \oplus (b \oplus c)\ </math>, for any<br />
<math>a,b,c\ </math> which are elements of <math>S\ </math>).<br />
<br />
For example, the natural numbers under addition form a semigroup: the sum of any two natural numbers is a natural number, and <math>(a+b)+c = a+(b+c)\ </math> for any natural numbers <math>a\ </math>, <math>b\ </math>, and <math>c\,\ </math>. The integers under multiplication also form a semigroup, as do the integers (or rationals, or reals) under <math>\max\ </math> or <math>\min\ </math>, Boolean values under conjunction and disjunction, lists under concatenation, functions from a set to itself under composition ... Semigroups show up all over the place, once you know to look for them.<br />
<br />
==Definition==<br />
<br />
When this section was first written, semigroups were not defined in the base package; the {{HackagePackage|id=semigroups}} package provides a standard definition. (As of GHC 8.0, <code>Semigroup</code> is available from <code>Data.Semigroup</code> in base.)<br />
<br />
The definition of the <code>Semigroup</code> type class ([http://hackage.haskell.org/packages/archive/semigroups/latest/doc/html/Data-Semigroup.html haddock]) is as follows:<br />
<br />
<haskell><br />
class Semigroup a where<br />
(<>) :: a -> a -> a<br />
<br />
sconcat :: NonEmpty a -> a<br />
sconcat (a :| as) = go a as where<br />
go b (c:cs) = b <> go c cs<br />
go b [] = b<br />
<br />
times1p :: Whole n => n -> a -> a<br />
times1p = ...<br />
</haskell><br />
<br />
The really important method is <code>(<>)</code>, representing the associative binary operation. The other two methods have default implementations in terms of <code>(<>)</code>, and are included in the type class in case some instances can give more efficient implementations than the default. <code>sconcat</code> reduces a nonempty list using <code>(<>)</code>; <code>times1p n x</code> combines ''n''+1 copies of <code>x</code>, and is equivalent to (but more efficient than) <code>sconcat (x :| replicate n x)</code>. See the [http://hackage.haskell.org/packages/archive/semigroups/latest/doc/html/Data-Semigroup.html haddock documentation] for more information on <code>sconcat</code> and <code>times1p</code>.<br />
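A brief sketch of these methods in action, assuming a compiler where <code>Data.Semigroup</code> is available (GHC 8.0 and later, or via the semigroups package):<br />
<br />
```haskell
import Data.List.NonEmpty (NonEmpty (..))
import Data.Semigroup (sconcat, (<>))

-- Strings (lists) form a semigroup under concatenation.
joined :: String
joined = sconcat ("foo" :| ["bar", "baz"])

-- (<>) used directly:
pair :: String
pair = "ab" <> "cd"
```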
<br />
==Laws==<br />
<br />
The only law is that <code>(<>)</code> must be associative:<br />
<br />
<haskell><br />
(x <> y) <> z = x <> (y <> z)<br />
</haskell><br />
<br />
=Monoid=<br />
<br />
Many semigroups have a special element <math>e</math> for which the binary operation <math>\oplus</math> is the identity, that is, <math>e \oplus x = x \oplus e = x</math> for every element <math>x</math>. Such a semigroup-with-identity-element is called a ''monoid''.<br />
<br />
==Definition==<br />
<br />
The definition of the <code>Monoid</code> type class (defined in<br />
<code>Data.Monoid</code>; [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Monoid.html haddock]) is:<br />
<br />
<haskell><br />
class Monoid a where<br />
mempty :: a<br />
mappend :: a -> a -> a<br />
<br />
mconcat :: [a] -> a<br />
mconcat = foldr mappend mempty<br />
</haskell><br />
<br />
The <code>mempty</code> value specifies the identity element of the monoid, and <code>mappend</code><br />
is the binary operation. The default definition for <code>mconcat</code><br />
“reduces” a list of elements by combining them all with <code>mappend</code>,<br />
using a right fold. It is only in the <code>Monoid</code> class so that specific<br />
instances have the option of providing an alternative, more efficient<br />
implementation; usually, you can safely ignore <code>mconcat</code> when creating<br />
a <code>Monoid</code> instance, since its default definition will work just fine.<br />
<br />
The <code>Monoid</code> methods are rather unfortunately named; they are inspired<br />
by the list instance of <code>Monoid</code>, where indeed <code>mempty = []</code> and <code>mappend = (++)</code>, but this is misleading since many<br />
monoids have little to do with appending (see these [http://thread.gmane.org/gmane.comp.lang.haskell.cafe/50590 Comments from OCaml Hacker Brian Hurt] on the Haskell-cafe mailing list). This was improved in GHC 7.4, where <code>(<>)</code> was added as an alias to <code>mappend</code>.<br />
<br />
==Laws==<br />
<br />
Of course, every <code>Monoid</code> instance should actually be a monoid in the<br />
mathematical sense, which implies these laws:<br />
<br />
<haskell><br />
mempty `mappend` x = x<br />
x `mappend` mempty = x<br />
(x `mappend` y) `mappend` z = x `mappend` (y `mappend` z)<br />
</haskell><br />
<br />
==Instances==<br />
<br />
There are quite a few interesting <code>Monoid</code> instances defined in <code>Data.Monoid</code>.<br />
<br />
<ul><br />
<li><code>[a]</code> is a <code>Monoid</code>, with <code>mempty = []</code> and <code>mappend = (++)</code>. It is not hard to check that <code>(x ++ y) ++ z = x ++ (y ++ z)</code> for any lists <code>x</code>, <code>y</code>, and <code>z</code>, and that the empty list is the identity: <code>[] ++ x = x ++ [] = x</code>.</li><br />
<br />
<li>As noted previously, we can make a monoid out of any numeric type under either addition or multiplication. However, since we can’t have two instances for the same type, <code>Data.Monoid</code> provides two <code>newtype</code> wrappers, <code>Sum</code> and <code>Product</code>, with appropriate <code>Monoid</code> instances.<br />
<br />
<haskell><br />
> getSum (mconcat . map Sum $ [1..5])<br />
15<br />
> getProduct (mconcat . map Product $ [1..5])<br />
120<br />
</haskell><br />
<br />
This example code is silly, of course; we could just write<br />
<code>sum [1..5]</code> and <code>product [1..5]</code>. Nevertheless, these instances are useful in more generalized settings, as we will see in the [[Foldable|section on <code>Foldable</code>]].</li><br />
<br />
<li><code>Any</code> and <code>All</code> are <code>newtype</code> wrappers providing <code>Monoid</code> instances for <code>Bool</code> (under disjunction and conjunction, respectively).</li><br />
<br />
<li> There are three instances for <code>Maybe</code>: a basic instance which lifts a <code>Monoid</code> instance for <code>a</code> to an instance for <code>Maybe a</code>, and two <code>newtype</code> wrappers <code>First</code> and <code>Last</code> for which <code>mappend</code> selects the first (respectively last) non-<code>Nothing</code> item.</li><br />
<br />
<li><code>Endo a</code> is a newtype wrapper for functions <code>a -> a</code>, which form a monoid under composition.</li><br />
<br />
<li>There are several ways to “lift” <code>Monoid</code> instances to instances with additional structure. We have already seen that an instance for <code>a</code> can be lifted to an instance for <code>Maybe a</code>. There are also tuple instances: if <code>a</code> and <code>b</code> are instances of <code>Monoid</code>, then so is <code>(a,b)</code>, using the monoid operations for <code>a</code> and <code>b</code> in the obvious pairwise manner. Finally, if <code>a</code> is a <code>Monoid</code>, then so is the function type <code>e -> a</code> for any <code>e</code>; in particular, <code>g `mappend` h</code> is the function which applies both <code>g</code> and <code>h</code> to its argument and then combines the results using the underlying <code>Monoid</code> instance for <code>a</code>. This can be quite useful and elegant (see [http://thread.gmane.org/gmane.comp.lang.haskell.cafe/52416 example]).</li><br />
<br />
<li>The type <code>Ordering = LT | EQ | GT</code> is a <code>Monoid</code>, defined in such a way that <code>mconcat (zipWith compare xs ys)</code> computes the lexicographic ordering of <code>xs</code> and <code>ys</code> (if <code>xs</code> and <code>ys</code> have the same length). In particular, <code>mempty = EQ</code>, and <code>mappend</code> evaluates to its leftmost non-<code>EQ</code> argument (or <code>EQ</code> if both arguments are <code>EQ</code>). This can be used together with the function instance of <code>Monoid</code> to do some clever things ([http://www.reddit.com/r/programming/comments/7cf4r/monoids_in_my_programming_language/c06adnx example]).</li><br />
<br />
<li>There are also <code>Monoid</code> instances for several standard data structures in the containers library ([http://hackage.haskell.org/packages/archive/containers/0.2.0.0/doc/html/index.html haddock]), including <code>Map</code>, <code>Set</code>, and <code>Sequence</code>.</li><br />
</ul><br />
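A few of the instances above in action (a small sketch, using <code>(<>)</code>, the <code>mappend</code> alias available since GHC 7.4):<br />
<br />
```haskell
import Data.Monoid

-- First and Last select the first/last non-Nothing value.
firstJust :: Maybe Int
firstJust = getFirst (First Nothing <> First (Just 1) <> First (Just 2))

-- Any wraps Bool under disjunction.
hasEven :: Bool
hasEven = getAny (foldMap (Any . even) [1, 3, 4 :: Int])

-- The Ordering monoid computes lexicographic comparisons.
lexCmp :: Ordering
lexCmp = mconcat (zipWith compare "abc" "abd")

-- The function instance: mappend applies both functions to the
-- argument and combines the results with the underlying monoid.
both :: Int -> [Int]
both = (\x -> [x]) <> (\x -> [x * 10])
```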
<br />
<code>Monoid</code> is also used to enable several other type class instances.<br />
As noted previously, we can use <code>Monoid</code> to make <code>((,) e)</code> an instance of <code>Applicative</code>:<br />
<br />
<haskell><br />
instance Monoid e => Applicative ((,) e) where<br />
pure x = (mempty, x)<br />
(u, f) <*> (v, x) = (u `mappend` v, f x)<br />
</haskell><br />
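Since this instance ships with base (for any <code>Monoid e</code>), its writer-like behavior can be observed directly; a quick sketch:<br />
<br />
```haskell
import Control.Applicative ((<*>))

-- The first components accumulate with mappend; the function in the
-- second component is applied to the second value.
logged :: (String, Int)
logged = ("inc; ", (+ 1)) <*> ("val", 41)
```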
<br />
<code>Monoid</code> can be similarly used to make <code>((,) e)</code> an instance of <code>Monad</code> as well; this is known as the ''writer monad''. As we’ve already seen, <code>Writer</code> and <code>WriterT</code> are a newtype wrapper and transformer for this monad, respectively.<br />
<br />
<code>Monoid</code> also plays a key role in the <code>Foldable</code> type class (see section [[#Foldable|Foldable]]).<br />
<br />
==Other monoidal classes: Alternative, MonadPlus, ArrowPlus==<br />
<br />
The <code>Alternative</code> type class ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Applicative.html#g:2 haddock])<br />
is for <code>Applicative</code> functors which also have<br />
a monoid structure:<br />
<br />
<haskell><br />
class Applicative f => Alternative f where<br />
empty :: f a<br />
(<|>) :: f a -> f a -> f a<br />
</haskell><br />
<br />
Of course, instances of <code>Alternative</code> should satisfy the monoid laws<br />
<br />
<haskell><br />
empty <|> x = x<br />
x <|> empty = x<br />
(x <|> y) <|> z = x <|> (y <|> z)<br />
</haskell><br />
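Two standard instances make these laws concrete; a brief sketch:<br />
<br />
```haskell
import Control.Applicative

-- For Maybe, (<|>) keeps the first Just and empty is Nothing.
firstHit :: Maybe Int
firstHit = Nothing <|> Just 3 <|> Just 5

-- For lists, (<|>) is concatenation and empty is [].
choices :: [Int]
choices = [1, 2] <|> empty <|> [3]
```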
<br />
Likewise, <code>MonadPlus</code> ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad.html#t:MonadPlus haddock])<br />
is for <code>Monad</code>s with a monoid structure:<br />
<br />
<haskell><br />
class Monad m => MonadPlus m where<br />
mzero :: m a<br />
mplus :: m a -> m a -> m a<br />
</haskell><br />
<br />
The <code>MonadPlus</code> documentation states that it is intended to model<br />
monads which also support “choice and failure”; in addition to the<br />
monoid laws, instances of <code>MonadPlus</code> are expected to satisfy<br />
<br />
<haskell><br />
mzero >>= f = mzero<br />
v >> mzero = mzero<br />
</haskell><br />
<br />
which explains the sense in which <code>mzero</code> denotes failure. Since<br />
<code>mzero</code> should be the identity for <code>mplus</code>, the computation <code>m1 `mplus` m2</code> succeeds (evaluates to something other than <code>mzero</code>) if<br />
either <code>m1</code> or <code>m2</code> does; so <code>mplus</code> represents choice. The <code>guard</code><br />
function can also be used with instances of <code>MonadPlus</code>; it requires a<br />
condition to be satisfied and fails (using <code>mzero</code>) if it is not. A<br />
simple example of a <code>MonadPlus</code> instance is <code>[]</code>, which is exactly the<br />
same as the <code>Monoid</code> instance for <code>[]</code>: the empty list represents<br />
failure, and list concatenation represents choice. In general,<br />
however, a <code>MonadPlus</code> instance for a type need not be the same as its<br />
<code>Monoid</code> instance; <code>Maybe</code> is an example of such a type. A great<br />
introduction to the <code>MonadPlus</code> type class, with interesting examples<br />
of its use, is Doug Auclair’s ''MonadPlus: What a Super Monad!'' in [http://www.haskell.org/wikiupload/6/6a/TMR-Issue11.pdf the Monad.Reader issue 11].<br />
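A sketch illustrating <code>guard</code> in the list monad, and the point that <code>Maybe</code>'s <code>MonadPlus</code> instance differs from its <code>Monoid</code> instance:<br />
<br />
```haskell
import Control.Monad (guard, mplus)
import Data.Monoid (mappend)

-- guard fails (with mzero) when its condition is False; in the list
-- monad this prunes the failing branches.
evens :: [Int]
evens = do
  x <- [1 .. 10]
  guard (even x)
  return x

-- mplus on Maybe keeps the first success; mappend (which requires a
-- Monoid on the contents) combines them.
viaMonadPlus, viaMonoid :: Maybe [Int]
viaMonadPlus = Just [1] `mplus` Just [2]
viaMonoid    = Just [1] `mappend` Just [2]
```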
<br />
There used to be a type class called <code>MonadZero</code> containing only<br />
<code>mzero</code>, representing monads with failure. The <code>do</code>-notation requires<br />
some notion of failure to deal with failing pattern matches.<br />
Unfortunately, <code>MonadZero</code> was scrapped in favor of adding the <code>fail</code><br />
method to the <code>Monad</code> class. If we are lucky, someday <code>MonadZero</code> will<br />
be restored, and <code>fail</code> will be banished to the bit bucket where it<br />
belongs (see [[MonadPlus reform proposal]]). The idea is that any<br />
<code>do</code>-block which uses pattern matching (and hence may fail) would require<br />
a <code>MonadZero</code> constraint; otherwise, only a <code>Monad</code> constraint would be<br />
required.<br />
<br />
Finally, <code>ArrowZero</code> and <code>ArrowPlus</code> ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Arrow.html#t:ArrowZero haddock])<br />
represent <code>Arrow</code>s ([[#Arrow|see below]]) with a<br />
monoid structure:<br />
<br />
<haskell><br />
class Arrow arr => ArrowZero arr where<br />
zeroArrow :: b `arr` c<br />
<br />
class ArrowZero arr => ArrowPlus arr where<br />
(<+>) :: (b `arr` c) -> (b `arr` c) -> (b `arr` c)<br />
</haskell><br />
<br />
==Further reading==<br />
<br />
Monoids have gotten a fair bit of attention recently, ultimately due<br />
to<br />
[http://enfranchisedmind.com/blog/posts/random-thoughts-on-haskell/ a blog post by Brian Hurt], in which he<br />
complained about the fact that the names of many Haskell type classes<br />
(<code>Monoid</code> in particular) are taken from abstract mathematics. This<br />
resulted in [http://thread.gmane.org/gmane.comp.lang.haskell.cafe/50590 a long Haskell-cafe thread]<br />
arguing the point and discussing monoids in general.<br />
<br />
{{note|May its name live forever.}}<br />
<br />
However, this was quickly followed by several blog posts about<br />
<code>Monoid</code> {{noteref}}. First, Dan Piponi<br />
wrote a great introductory post, [http://blog.sigfpe.com/2009/01/haskell-monoids-and-their-uses.html Haskell Monoids and their Uses]. This was quickly followed by<br />
Heinrich Apfelmus’ [http://apfelmus.nfshost.com/monoid-fingertree.html Monoids and Finger Trees], an accessible exposition of<br />
Hinze and Paterson’s [http://www.soi.city.ac.uk/%7Eross/papers/FingerTree.html classic paper on 2-3 finger trees], which makes very clever<br />
use of <code>Monoid</code> to implement an elegant and generic data structure.<br />
Dan Piponi then wrote two fascinating articles about using <code>Monoids</code><br />
(and finger trees): [http://blog.sigfpe.com/2009/01/fast-incremental-regular-expression.html Fast Incremental Regular Expressions] and [http://blog.sigfpe.com/2009/01/beyond-regular-expressions-more.html Beyond Regular Expressions].<br />
<br />
In a similar vein, David Place’s article on improving <code>Data.Map</code> in<br />
order to compute incremental folds (see [http://www.haskell.org/wikiupload/6/6a/TMR-Issue11.pdf the Monad Reader issue 11])<br />
is also a<br />
good example of using <code>Monoid</code> to generalize a data structure.<br />
<br />
Some other interesting examples of <code>Monoid</code> use include [http://www.reddit.com/r/programming/comments/7cf4r/monoids_in_my_programming_language/c06adnx building elegant list sorting combinators], [http://byorgey.wordpress.com/2008/04/17/collecting-unstructured-information-with-the-monoid-of-partial-knowledge/ collecting unstructured information], [http://izbicki.me/blog/gausian-distributions-are-monoids combining probability distributions], and a brilliant series of posts by Chung-Chieh Shan and Dylan Thurston using <code>Monoid</code>s to [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers1/ elegantly solve a difficult combinatorial puzzle] (followed by [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers2/ part 2], [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers3/ part 3], [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers4/ part 4]).<br />
<br />
As unlikely as it sounds, monads can actually be viewed as a sort of<br />
monoid, with <code>join</code> playing the role of the binary operation and<br />
<code>return</code> the role of the identity; see [http://blog.sigfpe.com/2008/11/from-monoids-to-monads.html Dan Piponi’s blog post].<br />
<br />
=Foldable=<br />
<br />
The <code>Foldable</code> class, defined in the <code>Data.Foldable</code><br />
module ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Foldable.html haddock]), abstracts over containers which can be<br />
“folded” into a summary value. This allows such folding operations<br />
to be written in a container-agnostic way.<br />
<br />
==Definition==<br />
<br />
The definition of the <code>Foldable</code> type class is:<br />
<br />
<haskell><br />
class Foldable t where<br />
fold :: Monoid m => t m -> m<br />
foldMap :: Monoid m => (a -> m) -> t a -> m<br />
<br />
foldr :: (a -> b -> b) -> b -> t a -> b<br />
foldl :: (a -> b -> a) -> a -> t b -> a<br />
foldr1 :: (a -> a -> a) -> t a -> a<br />
foldl1 :: (a -> a -> a) -> t a -> a<br />
</haskell><br />
<br />
This may look complicated, but in fact, to make a <code>Foldable</code> instance<br />
you only need to implement one method: your choice of <code>foldMap</code> or<br />
<code>foldr</code>. All the other methods have default implementations in terms<br />
of these, and are presumably included in the class in case more<br />
efficient implementations can be provided.<br />
<br />
==Instances and examples==<br />
<br />
The type of <code>foldMap</code> should make it clear what it is supposed to do:<br />
given a way to convert the data in a container into a <code>Monoid</code> (a<br />
function <code>a -> m</code>) and a container of <code>a</code>’s (<code>t a</code>), <code>foldMap</code><br />
provides a way to iterate over the entire contents of the container,<br />
converting all the <code>a</code>’s to <code>m</code>’s and combining all the <code>m</code>’s with<br />
<code>mappend</code>. The following code shows two examples: a simple<br />
implementation of <code>foldMap</code> for lists, and a binary tree example<br />
provided by the <code>Foldable</code> documentation.<br />
<br />
<haskell><br />
instance Foldable [] where<br />
foldMap g = mconcat . map g<br />
<br />
data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)<br />
<br />
instance Foldable Tree where<br />
foldMap f Empty = mempty<br />
foldMap f (Leaf x) = f x<br />
foldMap f (Node l k r) = foldMap f l `mappend` f k `mappend` foldMap f r<br />
</haskell><br />
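For instance, summing the elements of a small tree by folding into the <code>Sum</code> monoid (a sketch restating the <code>Tree</code> instance above so it is self-contained):<br />
<br />
```haskell
import Data.Foldable (Foldable (..))
import Data.Monoid (Sum (..))

-- The Tree type and Foldable instance from above, restated.
data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)

instance Foldable Tree where
  foldMap _ Empty        = mempty
  foldMap f (Leaf x)     = f x
  foldMap f (Node l k r) = foldMap f l `mappend` f k `mappend` foldMap f r

tree :: Tree Int
tree = Node (Leaf 1) 2 (Leaf 3)

-- foldMap converts each element to Sum and combines with mappend.
treeSum :: Int
treeSum = getSum (foldMap Sum tree)
```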
<br />
The <code>foldr</code> function has a type similar to the <code>foldr</code> found in the <code>Prelude</code>, but<br />
more general, since the <code>foldr</code> in the <code>Prelude</code> works only on lists.<br />
<br />
The <code>Foldable</code> module also provides instances for <code>Maybe</code> and <code>Array</code>;<br />
additionally, many of the data structures found in the standard [http://hackage.haskell.org/package/containers containers library] (for example, <code>Map</code>, <code>Set</code>, <code>Tree</code>,<br />
and <code>Sequence</code>) provide their own <code>Foldable</code> instances.<br />
<br />
{{Exercises|<br />
# What is the type of <code>foldMap . foldMap</code>? Or <code>foldMap . foldMap . foldMap</code>, etc.? What do they do?<br />
}}<br />
<br />
==Derived folds==<br />
<br />
Given an instance of <code>Foldable</code>, we can write generic,<br />
container-agnostic functions such as:<br />
<br />
<haskell><br />
-- Compute the size of any container.<br />
containerSize :: Foldable f => f a -> Int<br />
containerSize = getSum . foldMap (const (Sum 1))<br />
<br />
-- Compute a list of elements of a container satisfying a predicate.<br />
filterF :: Foldable f => (a -> Bool) -> f a -> [a]<br />
filterF p = foldMap (\a -> if p a then [a] else [])<br />
<br />
-- Get a list of all the Strings in a container which include the<br />
-- letter a.<br />
aStrings :: Foldable f => f String -> [String]<br />
aStrings = filterF (elem 'a')<br />
</haskell><br />
<br />
The <code>Foldable</code> module also provides a large number of predefined<br />
folds, many of which are generalized versions of <code>Prelude</code> functions of the<br />
same name that only work on lists: <code>concat</code>, <code>concatMap</code>, <code>and</code>,<br />
<code>or</code>, <code>any</code>, <code>all</code>, <code>sum</code>, <code>product</code>, <code>maximum</code>(<code>By</code>),<br />
<code>minimum</code>(<code>By</code>), <code>elem</code>, <code>notElem</code>, and <code>find</code>.<br />
<br />
The important function <code>toList</code> is also provided, which turns any <code>Foldable</code> structure into a list of its elements in left-right order; it works by folding with the list monoid.<br />
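For instance, treating <code>Maybe</code> as a zero-or-one element container, the generalized folds behave as one would hope (a quick sketch):<br />
<br />
```haskell
import qualified Data.Foldable as F

-- sum over a Maybe: Nothing acts like an empty container.
total :: Int
total = F.sum (Just 3)

-- toList extracts the elements in order.
chars :: String
chars = F.toList (Just 'x')

-- and over an empty container is True (the identity of All).
allTrue :: Bool
allTrue = F.and (Nothing :: Maybe Bool)
```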
<br />
There are also generic functions that work with <code>Applicative</code> or<br />
<code>Monad</code> instances to generate some sort of computation from each<br />
element in a container, and then perform all the side effects from<br />
those computations, discarding the results: <code>traverse_</code>, <code>sequenceA_</code>,<br />
and others. The results must be discarded because the <code>Foldable</code><br />
class is too weak to specify what to do with them: we cannot, in<br />
general, make an arbitrary <code>Applicative</code> or <code>Monad</code> instance into a <code>Monoid</code>, but we can make <code>m ()</code> into a <code>Monoid</code> for any such <code>m</code>. If we do have an <code>Applicative</code> or <code>Monad</code> with a monoid<br />
structure—that is, an <code>Alternative</code> or a <code>MonadPlus</code>—then we can<br />
use the <code>asum</code> or <code>msum</code> functions, which can combine the results as<br />
well. Consult the [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Foldable.html <code>Foldable</code> documentation] for<br />
more details on any of these functions.<br />
<br />
Note that the <code>Foldable</code> operations always forget the structure of<br />
the container being folded. If we start with a container of type <code>t a</code> for some <code>Foldable t</code>, then <code>t</code> will never appear in the output<br />
type of any operations defined in the <code>Foldable</code> module. Many times<br />
this is exactly what we want, but sometimes we would like to be able<br />
to generically traverse a container while preserving its<br />
structure—and this is exactly what the <code>Traversable</code> class provides,<br />
which will be discussed in the next section.<br />
<br />
{{Exercises|<br />
# Implement <code>toList :: Foldable f {{=}}> f a -> [a]</code>.<br />
# Pick some of the following functions to implement: <code>concat</code>, <code>concatMap</code>, <code>and</code>, <code>or</code>, <code>any</code>, <code>all</code>, <code>sum</code>, <code>product</code>, <code>maximum</code>(<code>By</code>), <code>minimum</code>(<code>By</code>), <code>elem</code>, <code>notElem</code>, and <code>find</code>. Figure out how they generalize to <code>Foldable</code> and come up with elegant implementations using <code>fold</code> or <code>foldMap</code> along with appropriate <code>Monoid</code> instances.<br />
}}<br />
<br />
==Foldable actually isn't==<br />
<br />
The generic term "fold" is often used to refer to the more technical concept of [[Catamorphisms|catamorphism]]. Intuitively, given a way to summarize "one level of structure" (where recursive subterms have already been replaced with their summaries), a catamorphism can summarize an entire recursive structure. It is important to realize that <code>Foldable</code> does <i>not</i> correspond to catamorphisms, but to something weaker. In particular, <code>Foldable</code> allows observing only the left-right order of elements within a structure, not the actual structure itself. Put another way, every use of <code>Foldable</code> can be expressed in terms of <code>toList</code>. For example, <code>fold</code> itself is equivalent to <code>mconcat . toList</code>.<br />
<br />
This is sufficient for many tasks, but not all. For example, consider trying to compute the depth of a <code>Tree</code>: try as we might, there is no way to implement it using <code>Foldable</code>. However, it <i>can</i> be implemented as a catamorphism.<br />
<br />
==Further reading==<br />
<br />
The <code>Foldable</code> class had its genesis in [http://www.soi.city.ac.uk/~ross/papers/Applicative.html McBride and Paterson’s paper]<br />
introducing <code>Applicative</code>, although it has<br />
been fleshed out quite a bit from the form in the paper.<br />
<br />
An interesting use of <code>Foldable</code> (as well as <code>Traversable</code>) can be<br />
found in Janis Voigtländer’s paper [http://doi.acm.org/10.1145/1480881.1480904 Bidirectionalization for free!].<br />
<br />
=Traversable=<br />
<br />
==Definition==<br />
<br />
The <code>Traversable</code> type class, defined in the <code>Data.Traversable</code><br />
module ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Traversable.html haddock]), is:<br />
<br />
<haskell><br />
class (Functor t, Foldable t) => Traversable t where<br />
traverse :: Applicative f => (a -> f b) -> t a -> f (t b)<br />
sequenceA :: Applicative f => t (f a) -> f (t a)<br />
mapM :: Monad m => (a -> m b) -> t a -> m (t b)<br />
sequence :: Monad m => t (m a) -> m (t a)<br />
</haskell><br />
<br />
As you can see, every <code>Traversable</code> is also a foldable functor. Like<br />
<code>Foldable</code>, there is a lot in this type class, but making instances is<br />
actually rather easy: one need only implement <code>traverse</code> or<br />
<code>sequenceA</code>; the other methods all have default implementations in<br />
terms of these functions. A good exercise is to figure out what the default<br />
implementations should be: given either <code>traverse</code> or <code>sequenceA</code>, how<br />
would you define the other three methods? (Hint for <code>mapM</code>:<br />
<code>Control.Applicative</code> exports the <code>WrapMonad</code> newtype, which makes any<br />
<code>Monad</code> into an <code>Applicative</code>. The <code>sequence</code> function can be implemented in terms<br />
of <code>mapM</code>.)<br />
<br />
==Intuition==<br />
<br />
The key method of the <code>Traversable</code> class, and the source of its<br />
unique power, is <code>sequenceA</code>. Consider its type:<br />
<haskell><br />
sequenceA :: Applicative f => t (f a) -> f (t a)<br />
</haskell><br />
This answers the fundamental question: when can we commute two<br />
functors? For example, can we turn a tree of lists into a list of<br />
trees?<br />
<br />
The ability to compose two monads depends crucially on this ability to<br />
commute functors. Intuitively, if we want to build a composed monad<br />
<code>M a = m (n a)</code> out of monads <code>m</code> and <code>n</code>, then to be able to<br />
implement <code>join :: M (M a) -> M a</code>, that is,<br />
<code>join :: m (n (m (n a))) -> m (n a)</code>, we have to be able to commute<br />
the <code>n</code> past the <code>m</code> to get <code>m (m (n (n a)))</code>, and then we can use the<br />
<code>join</code>s for <code>m</code> and <code>n</code> to produce something of type <code>m (n a)</code>. See<br />
[http://web.cecs.pdx.edu/~mpj/pubs/springschool.html Mark Jones’ paper] for more details.<br />
<br />
Alternatively, looking at the type of <code>traverse</code>,<br />
<haskell><br />
traverse :: Applicative f => (a -> f b) -> t a -> f (t b)<br />
</haskell><br />
leads us to view <code>Traversable</code> as a generalization of <code>Functor</code>. <code>traverse</code> is an "effectful <code>fmap</code>": it allows us to map over a structure of type <code>t a</code>, applying a function to every element of type <code>a</code> in order to produce a new structure of type <code>t b</code>; but along the way the function may have some effects (captured by the applicative functor <code>f</code>).<br />
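A small sketch of both views, using the <code>Maybe</code> applicative for the effects:<br />
<br />
```haskell
import Data.Traversable (sequenceA, traverse)

-- traverse with Maybe: the whole traversal succeeds only if the
-- function succeeds on every element.
halve :: Int -> Maybe Int
halve x = if even x then Just (x `div` 2) else Nothing

allHalved, oneFails :: Maybe [Int]
allHalved = traverse halve [2, 4, 6]
oneFails  = traverse halve [2, 3]

-- sequenceA commutes the two functors: [Maybe a] -> Maybe [a].
swapped :: Maybe [Int]
swapped = sequenceA [Just 1, Just 2]
```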
<br />
{{Exercises|<br />
# There are at least two natural ways to turn a tree of lists into a list of trees. What are they, and why?<br />
# Give a natural way to turn a list of trees into a tree of lists.<br />
# What is the type of <code>traverse . traverse</code>? What does it do?<br />
}}<br />
<br />
==Instances and examples==<br />
<br />
What’s an example of a <code>Traversable</code> instance?<br />
The following code shows an example instance for the same<br />
<code>Tree</code> type used as an example in the previous <code>Foldable</code> section. It<br />
is instructive to compare this instance with a <code>Functor</code> instance for<br />
<code>Tree</code>, which is also shown.<br />
<br />
<haskell><br />
data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)<br />
<br />
instance Traversable Tree where<br />
traverse g Empty = pure Empty<br />
traverse g (Leaf x) = Leaf <$> g x<br />
traverse g (Node l x r) = Node <$> traverse g l<br />
<*> g x<br />
<*> traverse g r<br />
<br />
instance Functor Tree where<br />
fmap g Empty = Empty<br />
fmap g (Leaf x) = Leaf $ g x<br />
fmap g (Node l x r) = Node (fmap g l)<br />
(g x)<br />
(fmap g r)<br />
</haskell><br />
<br />
It should be clear that the <code>Traversable</code> and <code>Functor</code> instances for<br />
<code>Tree</code> are almost identical; the only difference is that the <code>Functor</code><br />
instance involves normal function application, whereas the<br />
applications in the <code>Traversable</code> instance take place within an<br />
<code>Applicative</code> context, using <code>(<$>)</code> and <code>(<*>)</code>. In fact, this will<br />
be<br />
true for any type.<br />
<br />
Any <code>Traversable</code> functor is also <code>Foldable</code>, and a <code>Functor</code>. We can see<br />
this not only from the class declaration, but by the fact that we can<br />
implement the methods of both classes given only the <code>Traversable</code><br />
methods.<br />
<br />
The standard libraries provide a number of <code>Traversable</code> instances,<br />
including instances for <code>[]</code>, <code>Maybe</code>, <code>Map</code>, <code>Tree</code>, and <code>Sequence</code>.<br />
Notably, <code>Set</code> is not <code>Traversable</code>, although it is <code>Foldable</code>.<br />
<br />
{{Exercises|<br />
# Implement <code>fmap</code> and <code>foldMap</code> using only the <code>Traversable</code> methods. (Note that the <code>Traversable</code> module provides these implementations as <code>fmapDefault</code> and <code>foldMapDefault</code>.)<br />
}}<br />
<br />
==Laws==<br />
<br />
Any instance of <code>Traversable</code> must satisfy the following two laws, where <code>Identity</code> is the identity functor (as defined in the [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Data-Functor-Identity.html <code>Data.Functor.Identity</code> module] from the <code>transformers</code> package), and <code>Compose</code> wraps the composition of two functors (as defined in [http://hackage.haskell.org/packages/archive/transformers/0.3.0.0/doc/html/Data-Functor-Compose.html <code>Data.Functor.Compose</code>]):<br />
<br />
# <code>traverse Identity = Identity</code><br />
# <code>traverse (Compose . fmap g . f) = Compose . fmap (traverse g) . traverse f</code><br />
<br />
The first law essentially says that traversals cannot make up arbitrary effects. The second law explains how doing two traversals in sequence can be collapsed to a single traversal.<br />
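As a quick sanity check, the first law can be observed directly at the list instance of <code>Traversable</code> (a minimal sketch using <code>Data.Functor.Identity</code> from <code>base</code>; the example list is arbitrary):<br />

```haskell
import Data.Functor.Identity (Identity (..))

-- traverse Identity should rebuild the structure unchanged:
firstLawHolds :: Bool
firstLawHolds = runIdentity (traverse Identity xs) == xs
  where
    xs = [1, 2, 3 :: Int]
```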
<br />
Additionally, suppose <code>eta</code> is an "<code>Applicative</code> morphism", that is, a function<br />
<haskell><br />
eta :: forall a. f a -> g a<br />
</haskell><br />
for some particular <code>Applicative</code> functors <code>f</code> and <code>g</code>, and <code>eta</code> preserves the <code>Applicative</code> operations: <code>eta (pure x) = pure x</code> and <code>eta (x <*> y) = eta x <*> eta y</code>. Then, by parametricity, any instance of <code>Traversable</code> satisfying the above two laws will also satisfy <code>eta . traverse f = traverse (eta . f)</code>.<br />
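For a concrete instance of this result, <code>maybeToList</code> is an <code>Applicative</code> morphism from <code>Maybe</code> to <code>[]</code>: it maps <code>pure</code> to <code>pure</code> and <code>(<*>)</code> to <code>(<*>)</code>. A minimal sketch (the function <code>f</code> below is an arbitrary example):<br />

```haskell
import Data.Maybe (maybeToList)

eta :: Maybe a -> [a]
eta = maybeToList

-- Since eta preserves the Applicative operations, parametricity gives
--   eta . traverse f == traverse (eta . f)
-- for any lawful Traversable (here: lists).
morphismLaw :: [Int] -> Bool
morphismLaw xs = eta (traverse f xs) == traverse (eta . f) xs
  where
    f x = if x > 0 then Just (x * 10) else Nothing
```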
<br />
==Further reading==<br />
<br />
The <code>Traversable</code> class also had its genesis in [http://www.soi.city.ac.uk/~ross/papers/Applicative.html McBride and Paterson’s <code>Applicative</code> paper],<br />
and is described in more detail in Gibbons and Oliveira, [http://www.comlab.ox.ac.uk/jeremy.gibbons/publications/iterator.pdf The Essence of the Iterator Pattern],<br />
which also contains a wealth of references to related work.<br />
<br />
<code>Traversable</code> forms a core component of Edward Kmett's [http://hackage.haskell.org/package/lens lens library]. Watching [https://vimeo.com/56063074 Edward's talk on the subject] is a highly recommended way to gain better insight into <code>Traversable</code>, <code>Foldable</code>, <code>Applicative</code>, and many other things besides.<br />
<br />
For references on the <code>Traversable</code> laws, see Russell O'Connor's [http://article.gmane.org/gmane.comp.lang.haskell.libraries/17778 mailing list post] (and subsequent thread).<br />
<br />
=Category=<br />
<br />
<code>Category</code> is a relatively recent addition to the Haskell standard libraries. It generalizes the notion of function composition to general “morphisms”.<br />
<br />
{{note|GHC 7.6.1 changed its rules regarding types and type variables. Now, any operator at the type level is treated as a type ''constructor'' rather than a type ''variable''; prior to GHC 7.6.1 it was possible to use <code>(~&gt;)</code> instead of <code>`arr`</code>. For more information, see [http://thread.gmane.org/gmane.comp.lang.haskell.glasgow.user/21350 the discussion on the GHC-users mailing list]. For a new approach to nice arrow notation that works with GHC 7.6.1, see [http://article.gmane.org/gmane.comp.lang.haskell.glasgow.user/22615 this message] and also [http://article.gmane.org/gmane.comp.lang.haskell.glasgow.user/22616 this message] from Edward Kmett, though for simplicity I haven't adopted it here.}}<br />
The definition of the <code>Category</code> type class (from<br />
<code>Control.Category</code>; [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Category.html haddock]) is shown below. For ease of reading, note that I have used an infix type variable <code>`arr`</code>, in parallel with the infix function type constructor <code>(->)</code>. {{noteref}} This syntax is not part of Haskell 2010. The second definition shown is the one used in the standard libraries. For the remainder of this document, I will use the infix type variable <code>`arr`</code> for <code>Category</code> as well as <code>Arrow</code>.<br />
<br />
<haskell><br />
class Category arr where<br />
  id  :: a `arr` a<br />
  (.) :: (b `arr` c) -> (a `arr` b) -> (a `arr` c)<br />
<br />
-- The same thing, with a normal (prefix) type constructor<br />
class Category cat where<br />
  id  :: cat a a<br />
  (.) :: cat b c -> cat a b -> cat a c<br />
</haskell><br />
<br />
Note that an instance of <code>Category</code> should be a type constructor which takes two type arguments, that is, something of kind <code>* -> * -> *</code>. It is instructive to imagine the type constructor variable <code>cat</code> replaced by the function constructor <code>(->)</code>: indeed, in this case we recover precisely the familiar identity function <code>id</code> and function composition operator <code>(.)</code> defined in the standard <code>Prelude</code>.<br />
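Concretely, at <code>cat = (->)</code> the class methods coincide with the <code>Prelude</code>'s <code>id</code> and <code>(.)</code> (a small sketch; the composed function is only an illustration):<br />

```haskell
import Prelude hiding (id, (.))
import Control.Category (Category (..))

-- Control.Category's (.) at the (->) instance is ordinary function
-- composition, and id is the ordinary identity function.
plusThenDouble :: Int -> Int
plusThenDouble = (* 2) . (+ 1)
```

Here <code>plusThenDouble 3</code> evaluates to <code>8</code>, and <code>id</code> behaves exactly as <code>Prelude.id</code>.<br />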
<br />
Of course, the <code>Category</code> module provides exactly such an instance of<br />
<code>Category</code> for <code>(->)</code>. But it also provides one other instance, shown below, which should be familiar from the previous discussion of the <code>Monad</code> laws. <code>Kleisli m a b</code>, as defined in the <code>Control.Arrow</code> module, is just a <code>newtype</code> wrapper around <code>a -> m b</code>.<br />
<br />
<haskell><br />
newtype Kleisli m a b = Kleisli { runKleisli :: a -> m b }<br />
<br />
instance Monad m => Category (Kleisli m) where<br />
  id = Kleisli return<br />
  Kleisli g . Kleisli h = Kleisli (h >=> g)<br />
</haskell><br />
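To illustrate, the <code>Kleisli</code> instance lets us chain functions of type <code>a -> m b</code> with <code>Category</code>'s composition operator (a small sketch; <code>halve</code> is a hypothetical helper):<br />

```haskell
import Control.Arrow (Kleisli (..))
import Control.Category ((.))
import Prelude hiding ((.))

halve :: Int -> Maybe Int
halve n = if even n then Just (n `div` 2) else Nothing

-- Composing two effectful functions with Control.Category's (.):
quarter :: Kleisli Maybe Int Int
quarter = Kleisli halve . Kleisli halve
```

<code>runKleisli quarter 12</code> yields <code>Just 3</code>, while <code>runKleisli quarter 6</code> yields <code>Nothing</code>, since the intermediate result <code>3</code> is odd.<br />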
<br />
The only law that <code>Category</code> instances should satisfy is that <code>id</code> and <code>(.)</code> should form a monoid—that is, <code>id</code> should be the identity of <code>(.)</code>, and <code>(.)</code> should be associative.<br />
<br />
Finally, the <code>Category</code> module exports two additional operators:<br />
<code>(<<<)</code>, which is just a synonym for <code>(.)</code>, and <code>(>>>)</code>, which is <code>(.)</code> with its arguments reversed. (In previous versions of the libraries, these operators were defined as part of the <code>Arrow</code> class.)<br />
<br />
==Further reading==<br />
<br />
The name <code>Category</code> is a bit misleading, since the <code>Category</code> class cannot represent arbitrary categories, but only categories whose objects are objects of <code>Hask</code>, the category of Haskell types. For a more general treatment of categories within Haskell, see the [http://hackage.haskell.org/package/category-extras category-extras package]. For more about category theory in general, see the excellent [http://en.wikibooks.org/wiki/Haskell/Category_theory Haskell wikibook page],<br />
[http://books.google.com/books/about/Category_theory.html?id=-MCJ6x2lC7oC Steve Awodey’s new book], Benjamin Pierce’s [http://books.google.com/books/about/Basic_category_theory_for_computer_scien.html?id=ezdeaHfpYPwC Basic category theory for computer scientists], or [http://folli.loria.fr/cds/1999/esslli99/courses/barr-wells.html Barr and Wells category theory lecture notes]. [http://dekudekuplex.wordpress.com/2009/01/19/motivating-learning-category-theory-for-non-mathematicians/ Benjamin Russell’s blog post]<br />
is another good source of motivation and category theory links. You certainly don’t need to know any category theory to be a successful and productive Haskell programmer, but it does lend itself to much deeper appreciation of Haskell’s underlying theory.<br />
<br />
=Arrow=<br />
<br />
The <code>Arrow</code> class represents another abstraction of computation, in a<br />
similar vein to <code>Monad</code> and <code>Applicative</code>. However, unlike <code>Monad</code><br />
and <code>Applicative</code>, whose types only reflect their output, the type of<br />
an <code>Arrow</code> computation reflects both its input and output. Arrows<br />
generalize functions: if <code>arr</code> is an instance of <code>Arrow</code>, a value of<br />
type <code>b `arr` c</code> can be thought of as a computation which takes values of<br />
type <code>b</code> as input, and produces values of type <code>c</code> as output. In the<br />
<code>(->)</code> instance of <code>Arrow</code> this is just a pure function; in general, however,<br />
an arrow may represent some sort of “effectful” computation.<br />
<br />
==Definition==<br />
<br />
The definition of the <code>Arrow</code> type class, from<br />
<code>Control.Arrow</code> ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Arrow.html haddock]), is:<br />
<br />
<haskell><br />
class Category arr => Arrow arr where<br />
  arr    :: (b -> c) -> (b `arr` c)<br />
  first  :: (b `arr` c) -> ((b, d) `arr` (c, d))<br />
  second :: (b `arr` c) -> ((d, b) `arr` (d, c))<br />
  (***)  :: (b `arr` c) -> (b' `arr` c') -> ((b, b') `arr` (c, c'))<br />
  (&&&)  :: (b `arr` c) -> (b `arr` c') -> (b `arr` (c, c'))<br />
</haskell><br />
<br />
{{note|In versions of the <code>base</code><br />
package prior to version 4, there is no <code>Category</code> class, and the<br />
<code>Arrow</code> class includes the arrow composition operator <code>(>>>)</code>. It<br />
also includes <code>pure</code> as a synonym for <code>arr</code>, but this was removed<br />
since it conflicts with the <code>pure</code> from <code>Applicative</code>.}}<br />
<br />
The first thing to note is the <code>Category</code> class constraint, which<br />
means that we get identity arrows and arrow composition for free:<br />
given two arrows <code>g :: b `arr` c</code> and <code>h :: c `arr` d</code>, we can form their<br />
composition <code>g >>> h :: b `arr` d</code> {{noteref}}.<br />
<br />
As should be a familiar pattern by now, the only methods which must be<br />
defined when writing a new instance of <code>Arrow</code> are <code>arr</code> and <code>first</code>;<br />
the other methods have default definitions in terms of these, but are<br />
included in the <code>Arrow</code> class so that they can be overridden with more<br />
efficient implementations if desired.<br />
<br />
==Intuition==<br />
<br />
Let’s look at each of the arrow methods in turn. [http://www.haskell.org/arrows/ Ross Paterson’s web page on arrows] has nice diagrams which can help<br />
build intuition.<br />
<br />
* The <code>arr</code> function takes any function <code>b -> c</code> and turns it into a generalized arrow <code>b `arr` c</code>. The <code>arr</code> method justifies the claim that arrows generalize functions, since it says that we can treat any function as an arrow. It is intended that the arrow <code>arr g</code> is “pure” in the sense that it only computes <code>g</code> and has no “effects” (whatever that might mean for any particular arrow type).<br />
<br />
* The <code>first</code> method turns any arrow from <code>b</code> to <code>c</code> into an arrow from <code>(b,d)</code> to <code>(c,d)</code>. The idea is that <code>first g</code> uses <code>g</code> to process the first element of a tuple, and lets the second element pass through unchanged. For the function instance of <code>Arrow</code>, of course, <code>first g (x,y) = (g x, y)</code>.<br />
<br />
* The <code>second</code> function is similar to <code>first</code>, but with the elements of the tuples swapped. Indeed, it can be defined in terms of <code>first</code> using an auxiliary function <code>swap</code>, defined by <code>swap (x,y) = (y,x)</code>.<br />
<br />
* The <code>(***)</code> operator is “parallel composition” of arrows: it takes two arrows and makes them into one arrow on tuples, which has the behavior of the first arrow on the first element of a tuple, and the behavior of the second arrow on the second element. The mnemonic is that <code>g *** h</code> is the ''product'' (hence <code>*</code>) of <code>g</code> and <code>h</code>. For the function instance of <code>Arrow</code>, we define <code>(g *** h) (x,y) = (g x, h y)</code>. The default implementation of <code>(***)</code> is in terms of <code>first</code>, <code>second</code>, and sequential arrow composition <code>(>>>)</code>. The reader may also wish to think about how to implement <code>first</code> and <code>second</code> in terms of <code>(***)</code>.<br />
<br />
* The <code>(&&&)</code> operator is “fanout composition” of arrows: it takes two arrows <code>g</code> and <code>h</code> and makes them into a new arrow <code>g &&& h</code> which supplies its input as the input to both <code>g</code> and <code>h</code>, returning their results as a tuple. The mnemonic is that <code>g &&& h</code> performs both <code>g</code> ''and'' <code>h</code> (hence <code>&</code>) on its input. For functions, we define <code>(g &&& h) x = (g x, h x)</code>.<br />
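At the <code>(->)</code> instance all four combinators can be seen directly (a small sketch using the library operators; the example functions and values are arbitrary):<br />

```haskell
import Control.Arrow (first, second, (&&&), (***))

examples :: ((Int, String), (String, Int), (Int, String), (Int, String))
examples =
  ( first  (+ 1)        (3, "x")   -- (4, "x")
  , second (+ 1)        ("x", 3)   -- ("x", 4)
  , ((+ 1) *** reverse) (3, "ab")  -- (4, "ba")
  , ((+ 1) &&& show)    3          -- (4, "3")
  )

-- second in terms of first and swap, as described above:
swap :: (a, b) -> (b, a)
swap (x, y) = (y, x)

second' :: (b -> c) -> (d, b) -> (d, c)
second' g = swap . first g . swap
```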
<br />
==Instances==<br />
<br />
The <code>Arrow</code> library itself only provides two <code>Arrow</code> instances, both<br />
of which we have already seen: <code>(->)</code>, the normal function<br />
constructor, and <code>Kleisli m</code>, which makes functions of<br />
type <code>a -> m b</code> into <code>Arrow</code>s for any <code>Monad m</code>. These instances are:<br />
<br />
<haskell><br />
instance Arrow (->) where<br />
  arr g = g<br />
  first g (x, y) = (g x, y)<br />
<br />
newtype Kleisli m a b = Kleisli { runKleisli :: a -> m b }<br />
<br />
instance Monad m => Arrow (Kleisli m) where<br />
  arr f = Kleisli (return . f)<br />
  first (Kleisli f) = Kleisli (\ ~(b, d) -> do c <- f b<br />
                                               return (c, d))<br />
</haskell><br />
<br />
==Laws==<br />
<br />
{{note|See [http://dx.doi.org/10.1016/S0167-6423(99)00023-4 John Hughes: Generalising monads to arrows]; [http://homepages.inf.ed.ac.uk/wadler/papers/arrows/arrows.pdf Sam Lindley, Philip Wadler, Jeremy Yallop: The arrow calculus]; [http://www.soi.city.ac.uk/~ross/papers/fop.html Ross Paterson: Programming with Arrows].}}<br />
<br />
There are quite a few laws that instances of <code>Arrow</code> should<br />
satisfy {{noteref}}:<br />
<br />
<haskell><br />
arr id = id<br />
arr (h . g) = arr g >>> arr h<br />
first (arr g) = arr (g *** id)<br />
first (g >>> h) = first g >>> first h<br />
first g >>> arr (id *** h) = arr (id *** h) >>> first g<br />
first g >>> arr fst = arr fst >>> g<br />
first (first g) >>> arr assoc = arr assoc >>> first g<br />
<br />
assoc ((x,y),z) = (x,(y,z))<br />
</haskell><br />
<br />
Note that this version of the laws is slightly different from the laws given in the<br />
first two references above, since several of the laws have now been<br />
subsumed by the <code>Category</code> laws (in particular, the requirements that<br />
<code>id</code> is the identity arrow and that <code>(>>>)</code> is associative). The laws<br />
shown here follow those in Paterson’s Programming with Arrows, which uses the<br />
<code>Category</code> class.<br />
<br />
{{note|Unless category-theory-induced insomnolence is your cup of tea.}}<br />
<br />
The reader is advised not to lose too much sleep over the <code>Arrow</code><br />
laws {{noteref}}, since it is not essential to understand them in order to<br />
program with arrows. There are also laws that <code>ArrowChoice</code>,<br />
<code>ArrowApply</code>, and <code>ArrowLoop</code> instances should satisfy; the interested<br />
reader should consult [http://www.soi.city.ac.uk/~ross/papers/fop.html Paterson: Programming with Arrows].<br />
<br />
==ArrowChoice==<br />
<br />
Computations built using the <code>Arrow</code> class, like those built using<br />
the <code>Applicative</code> class, are rather inflexible: the structure of the computation<br />
is fixed at the outset, and there is no ability to choose between<br />
alternate execution paths based on intermediate results.<br />
The <code>ArrowChoice</code> class provides exactly such an ability:<br />
<br />
<haskell><br />
class Arrow arr => ArrowChoice arr where<br />
  left  :: (b `arr` c) -> (Either b d `arr` Either c d)<br />
  right :: (b `arr` c) -> (Either d b `arr` Either d c)<br />
  (+++) :: (b `arr` c) -> (b' `arr` c') -> (Either b b' `arr` Either c c')<br />
  (|||) :: (b `arr` d) -> (c `arr` d) -> (Either b c `arr` d)<br />
</haskell><br />
<br />
A comparison of <code>ArrowChoice</code> to <code>Arrow</code> will reveal a striking<br />
parallel between <code>left</code>, <code>right</code>, <code>(+++)</code>, <code>(|||)</code> and <code>first</code>,<br />
<code>second</code>, <code>(***)</code>, <code>(&&&)</code>, respectively. Indeed, they are dual:<br />
<code>first</code>, <code>second</code>, <code>(***)</code>, and <code>(&&&)</code> all operate on product types<br />
(tuples), and <code>left</code>, <code>right</code>, <code>(+++)</code>, and <code>(|||)</code> are the<br />
corresponding operations on sum types. In general, these operations<br />
create arrows whose inputs are tagged with <code>Left</code> or <code>Right</code>, and can<br />
choose how to act based on these tags.<br />
<br />
* If <code>g</code> is an arrow from <code>b</code> to <code>c</code>, then <code>left g</code> is an arrow from <code>Either b d</code> to <code>Either c d</code>. On inputs tagged with <code>Left</code>, the <code>left g</code> arrow has the behavior of <code>g</code>; on inputs tagged with <code>Right</code>, it behaves as the identity.<br />
<br />
* The <code>right</code> function, of course, is the mirror image of <code>left</code>. The arrow <code>right g</code> has the behavior of <code>g</code> on inputs tagged with <code>Right</code>.<br />
<br />
* The <code>(+++)</code> operator performs “multiplexing”: <code>g +++ h</code> behaves as <code>g</code> on inputs tagged with <code>Left</code>, and as <code>h</code> on inputs tagged with <code>Right</code>. The tags are preserved. The <code>(+++)</code> operator is the ''sum'' (hence <code>+</code>) of two arrows, just as <code>(***)</code> is the product.<br />
<br />
* The <code>(|||)</code> operator is “merge” or “fanin”: the arrow <code>g ||| h</code> behaves as <code>g</code> on inputs tagged with <code>Left</code>, and <code>h</code> on inputs tagged with <code>Right</code>, but the tags are discarded (hence, <code>g</code> and <code>h</code> must have the same output type). The mnemonic is that <code>g ||| h</code> performs either <code>g</code> ''or'' <code>h</code> on its input.<br />
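Again the <code>(->)</code> instance (which is an <code>ArrowChoice</code>) makes the behavior concrete; note that <code>(|||)</code> at functions is just <code>either</code>. A small sketch with arbitrary example values:<br />

```haskell
import Control.Arrow (left, (+++), (|||))

e1, e2 :: Either Int String
e1 = left (+ 1) (Left 3)     -- Left 4: the function acts on Left-tagged input
e2 = left (+ 1) (Right "x")  -- Right "x": Right-tagged input passes through

e3 :: Either Int Int
e3 = ((+ 1) +++ length) (Right "abc")  -- Right 3: tags are preserved

e4 :: Int
e4 = ((+ 1) ||| length) (Left 3 :: Either Int String)  -- 4: tags are discarded
```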
<br />
The <code>ArrowChoice</code> class allows computations to choose among a finite number of execution paths, based on intermediate results. The possible<br />
execution paths must be known in advance, and explicitly assembled with <code>(+++)</code> or <code>(|||)</code>. However, sometimes more flexibility is<br />
needed: we would like to be able to ''compute'' an arrow from intermediate results, and use this computed arrow to continue the computation. This is the power given to us by <code>ArrowApply</code>.<br />
<br />
==ArrowApply==<br />
<br />
The <code>ArrowApply</code> type class is:<br />
<br />
<haskell><br />
class Arrow arr => ArrowApply arr where<br />
  app :: (b `arr` c, b) `arr` c<br />
</haskell><br />
<br />
If we have computed an arrow as the output of some previous<br />
computation, then <code>app</code> allows us to apply that arrow to an input,<br />
producing its output as the output of <code>app</code>. As an exercise, the<br />
reader may wish to use <code>app</code> to implement an alternative “curried”<br />
version, <code>app2 :: b `arr` ((b `arr` c) `arr` c)</code>.<br />
<br />
This notion of being able to ''compute'' a new computation<br />
may sound familiar:<br />
this is exactly what the monadic bind operator <code>(>>=)</code> does. It<br />
should not particularly come as a surprise that <code>ArrowApply</code> and<br />
<code>Monad</code> are exactly equivalent in expressive power. In particular,<br />
<code>Kleisli m</code> can be made an instance of <code>ArrowApply</code>, and any instance<br />
of <code>ArrowApply</code> can be made a <code>Monad</code> (via the <code>newtype</code> wrapper<br />
<code>ArrowMonad</code>). As an exercise, the reader may wish to try<br />
implementing these instances:<br />
<br />
<haskell><br />
instance Monad m => ArrowApply (Kleisli m) where<br />
  app = -- exercise<br />
<br />
newtype ArrowApply a => ArrowMonad a b = ArrowMonad (a () b)<br />
<br />
instance ArrowApply a => Monad (ArrowMonad a) where<br />
  return = -- exercise<br />
  (ArrowMonad a) >>= k = -- exercise<br />
</haskell><br />
<br />
==ArrowLoop==<br />
<br />
The <code>ArrowLoop</code> type class is:<br />
<br />
<haskell><br />
class Arrow a => ArrowLoop a where<br />
  loop :: a (b, d) (c, d) -> a b c<br />
<br />
trace :: ((b, d) -> (c, d)) -> b -> c<br />
trace f b = let (c, d) = f (b, d) in c<br />
</haskell><br />
<br />
It describes arrows that can use recursion to compute results, and is<br />
used to desugar the <code>rec</code> construct in arrow notation (described<br />
below).<br />
<br />
Taken by itself, the type of the <code>loop</code> method does not seem to tell<br />
us much. Its intention, however, is a generalization of the <code>trace</code><br />
function which is also shown. The <code>d</code> component of the first arrow’s<br />
output is fed back in as its own input. In other words, the arrow<br />
<code>loop g</code> is obtained by recursively “fixing” the second component of<br />
the input to <code>g</code>.<br />
<br />
It can be a bit difficult to grok what the <code>trace</code> function is doing.<br />
How can <code>d</code> appear on the left and right sides of the <code>let</code>? Well,<br />
this is Haskell’s laziness at work. There is not space here for a<br />
full explanation; the interested reader is encouraged to study the<br />
standard <code>fix</code> function, and to read [http://www.soi.city.ac.uk/~ross/papers/fop.html Paterson’s arrow tutorial].<br />
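The knot-tying can be seen in miniature with <code>trace</code> itself. In the following self-contained sketch, <code>normalize</code> (a hypothetical helper) subtracts the maximum of a list from every element; the maximum <code>m</code> is used as input before it is computed, which works because the second component returned by the function never depends on <code>m</code>:<br />

```haskell
-- trace, as defined above
trace :: ((b, d) -> (c, d)) -> b -> c
trace f b = let (c, d) = f (b, d) in c

-- Subtract the maximum from every element in "one pass":
normalize :: [Int] -> [Int]
normalize = trace (\(ys, m) -> (map (subtract m) ys, maximum ys))
```

<code>normalize [1, 2, 3]</code> evaluates to <code>[-2, -1, 0]</code>: laziness lets the pair's second component be computed first and fed back in.<br />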
<br />
==Arrow notation==<br />
<br />
Programming directly with the arrow combinators can be painful,<br />
especially when writing complex computations which need to retain<br />
simultaneous reference to a number of intermediate results. With<br />
nothing but the arrow combinators, such intermediate results must be<br />
kept in nested tuples, and it is up to the programmer to remember<br />
which intermediate results are in which components, and to swap,<br />
reassociate, and generally mangle tuples as necessary. This problem<br />
is solved by the special arrow notation supported by GHC, similar to<br />
<code>do</code> notation for monads, that allows names to be assigned to<br />
intermediate results while building up arrow computations. An example<br />
arrow implemented using arrow notation, taken from<br />
Paterson, is:<br />
<br />
<haskell><br />
class ArrowLoop arr => ArrowCircuit arr where<br />
  delay :: b -> (b `arr` b)<br />
<br />
counter :: ArrowCircuit arr => Bool `arr` Int<br />
counter = proc reset -> do<br />
    rec output <- idA     -< if reset then 0 else next<br />
        next   <- delay 0 -< output + 1<br />
    idA -< output<br />
</haskell><br />
<br />
This arrow is intended to<br />
represent a recursively defined counter circuit with a reset line.<br />
<br />
There is not space here for a full explanation of arrow notation; the<br />
interested reader should consult<br />
[http://www.soi.city.ac.uk/~ross/papers/notation.html Paterson’s paper introducing the notation], or his later [http://www.soi.city.ac.uk/~ross/papers/fop.html tutorial which presents a simplified version].<br />
<br />
==Further reading==<br />
<br />
An excellent starting place for the student of arrows is the [http://www.haskell.org/arrows/ arrows web page], which contains an<br />
introduction and many references. Some key papers on arrows include<br />
Hughes’ original paper introducing arrows, [http://dx.doi.org/10.1016/S0167-6423(99)00023-4 Generalising monads to arrows], and [http://www.soi.city.ac.uk/~ross/papers/notation.html Paterson’s paper on arrow notation].<br />
<br />
Both Hughes and Paterson later wrote accessible tutorials intended for a broader<br />
audience: [http://www.soi.city.ac.uk/~ross/papers/fop.html Paterson: Programming with Arrows] and [http://www.cse.chalmers.se/~rjmh/afp-arrows.pdf Hughes: Programming with Arrows].<br />
<br />
Although Hughes’ goal in defining the <code>Arrow</code> class was to<br />
generalize <code>Monad</code>s, and it has been said that <code>Arrow</code> lies “between<br />
<code>Applicative</code> and <code>Monad</code>” in power, they are not directly<br />
comparable. The precise relationship remained in some confusion until<br />
[http://homepages.inf.ed.ac.uk/wadler/papers/arrows-and-idioms/arrows-and-idioms.pdf analyzed by Lindley, Wadler, and Yallop], who<br />
also invented a new calculus of arrows, based on the lambda calculus,<br />
which considerably simplifies the presentation of the arrow laws<br />
(see [http://homepages.inf.ed.ac.uk/wadler/papers/arrows/arrows.pdf The arrow calculus]). There is also a precise technical sense in which [http://just-bottom.blogspot.de/2010/04/programming-with-effects-story-so-far.html <code>Arrow</code> can be seen as the intersection of <code>Applicative</code> and <code>Category</code>].<br />
<br />
Some examples of <code>Arrow</code>s include [http://www.haskell.org/yampa/ Yampa], the<br />
[http://www.fh-wedel.de/~si/HXmlToolbox/ Haskell XML Toolkit], and the functional GUI library [[Grapefruit]].<br />
<br />
Some extensions to arrows have been explored; for example, the<br />
<code>BiArrow</code>s of Alimarine et al. ([http://wiki.clean.cs.ru.nl/download/papers/2005/alia2005-biarrowsHaskellWorkshop.pdf "There and Back Again: Arrows for Invertible Programming"]), for two-way instead of one-way<br />
computation.<br />
<br />
The Haskell wiki has [[Research papers/Monads and Arrows|links to many additional research papers relating to <code>Arrow</code>s]].<br />
<br />
=Comonad=<br />
<br />
The final type class we will examine is <code>Comonad</code>. The <code>Comonad</code> class<br />
is the categorical dual of <code>Monad</code>; that is, <code>Comonad</code> is like <code>Monad</code><br />
but with all the function arrows flipped. It is not actually in the<br />
standard Haskell libraries, but it has seen some interesting uses<br />
recently, so we include it here for completeness.<br />
<br />
==Definition==<br />
<br />
The <code>Comonad</code> type class, defined in the <code>Control.Comonad</code> module of<br />
the [http://hackage.haskell.org/package/comonad comonad library], is:<br />
<br />
<haskell><br />
class Functor w => Comonad w where<br />
  extract :: w a -> a<br />
<br />
  duplicate :: w a -> w (w a)<br />
  duplicate = extend id<br />
<br />
  extend :: (w a -> b) -> w a -> w b<br />
  extend f = fmap f . duplicate<br />
</haskell><br />
<br />
As you can see, <code>extract</code> is the dual of <code>return</code>, <code>duplicate</code> is the dual of <code>join</code>, and <code>extend</code> is the dual of <code>(=<<)</code>. The definition of <code>Comonad</code> is a bit redundant: the programmer may implement either <code>extend</code> or <code>duplicate</code>, and the other then has a default implementation in terms of it.<br />
<br />
A prototypical example of a <code>Comonad</code> instance is:<br />
<br />
<haskell><br />
-- Infinite lazy streams<br />
data Stream a = Cons a (Stream a)<br />
<br />
-- 'duplicate' is like the list function 'tails'<br />
-- 'extend' computes a new Stream from an old, where the element<br />
-- at position n is computed as a function of everything from<br />
-- position n onwards in the old Stream<br />
instance Comonad Stream where<br />
  extract (Cons x _)      = x<br />
  duplicate s@(Cons _ xs) = Cons s (duplicate xs)<br />
  extend g s@(Cons _ xs)  = Cons (g s) (extend g xs)<br />
                     -- == fmap g (duplicate s)<br />
</haskell><br />
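A small usage sketch, duplicating the <code>Stream</code> definition above so it runs without the <code>comonad</code> package (<code>extendS</code>, <code>takeS</code>, <code>nats</code>, and <code>windowSum</code> are hypothetical helpers; <code>extendS</code> is <code>extend</code> specialized to <code>Stream</code>):<br />

```haskell
data Stream a = Cons a (Stream a)

-- extend, specialized to Stream (same definition as in the instance)
extendS :: (Stream a -> b) -> Stream a -> Stream b
extendS g s@(Cons _ xs) = Cons (g s) (extendS g xs)

takeS :: Int -> Stream a -> [a]
takeS n (Cons x xs)
  | n <= 0    = []
  | otherwise = x : takeS (n - 1) xs

nats :: Stream Integer
nats = go 0 where go n = Cons n (go (n + 1))

-- At every position, the sum of the three elements starting there:
windowSum :: Stream Integer -> Integer
windowSum = sum . takeS 3
```

<code>takeS 5 (extendS windowSum nats)</code> yields <code>[3, 6, 9, 12, 15]</code>: each output element is computed from the entire "rest of the stream" at that position.<br />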
<br />
==Further reading==<br />
<br />
Dan Piponi explains in a blog post what [http://blog.sigfpe.com/2006/12/evaluating-cellular-automata-is.html cellular automata have to do with comonads]. In another blog post, Conal Elliott has examined [http://conal.net/blog/posts/functional-interactive-behavior/ a comonadic formulation of functional reactive programming]. Sterling Clover’s blog post [http://fmapfixreturn.wordpress.com/2008/07/09/comonads-in-everyday-life/ Comonads in everyday life] explains the relationship between comonads and zippers, and how comonads can be used to design a menu system for a web site.<br />
<br />
Uustalu and Vene have a number of papers exploring ideas related to comonads and functional programming:<br />
* [http://dx.doi.org/10.1016/j.entcs.2008.05.029 Comonadic Notions of Computation]<br />
* [http://www.ioc.ee/~tarmo/papers/sfp01-book.pdf The dual of substitution is redecoration] (Also available as [http://www.cs.ut.ee/~varmo/papers/sfp01-book.ps.gz ps.gz].)<br />
* [http://dx.doi.org/10.1016/j.ic.2005.08.005 Recursive coalgebras from comonads]<br />
* [http://www.fing.edu.uy/~pardo/papers/njc01.ps.gz Recursion schemes from comonads]<br />
* [http://cs.ioc.ee/~tarmo/papers/essence.pdf The Essence of Dataflow Programming].<br />
<br />
Gabriel Gonzalez's [http://www.haskellforall.com/2013/02/you-could-have-invented-comonads.html Comonads are objects] points out similarities between comonads and object-oriented programming.<br />
<br />
The [http://hackage.haskell.org/package/comonad-transformers comonad-transformers] package contains comonad transformers.<br />
<br />
=Acknowledgements=<br />
<br />
A special thanks to all of those who taught me about standard Haskell<br />
type classes and helped me develop good intuition for them,<br />
particularly Jules Bean (quicksilver), Derek Elkins (ddarius), Conal<br />
Elliott (conal), Cale Gibbard (Cale), David House, Dan Piponi<br />
(sigfpe), and Kevin Reid (kpreid).<br />
<br />
I also thank the many people who provided a mountain of helpful<br />
feedback and suggestions on a first draft of the Typeclassopedia: David Amos,<br />
Kevin Ballard, Reid Barton, Doug Beardsley, Joachim Breitner, Andrew<br />
Cave, David Christiansen, Gregory Collins, Mark Jason Dominus, Conal<br />
Elliott, Yitz Gale, George Giorgidze, Steven Grady, Travis Hartwell,<br />
Steve Hicks, Philip Hölzenspies, Edward Kmett, Eric Kow, Serge Le<br />
Huitouze, Felipe Lessa, Stefan Ljungstrand, Eric Macaulay, Rob MacAulay, Simon Meier,<br />
Eric Mertens, Tim Newsham, Russell O’Connor, Conrad Parker, Walt<br />
Rorie-Baety, Colin Ross, Tom Schrijvers, Aditya Siram, C. Smith,<br />
Martijn van Steenbergen, Joe Thornber, Jared Updike, Rob Vollmert,<br />
Andrew Wagner, Louis Wasserman, and Ashley Yakeley, as well as a few<br />
only known to me by their IRC nicks: b_jonas, maltem, tehgeekmeister,<br />
and ziman. I have undoubtedly omitted a few inadvertently, which in<br />
no way diminishes my gratitude.<br />
<br />
Finally, I would like to thank Wouter Swierstra for his fantastic work<br />
editing the Monad.Reader, and my wife Joyia for her patience during<br />
the process of writing the Typeclassopedia.<br />
<br />
=About the author=<br />
<br />
Brent Yorgey ([http://byorgey.wordpress.com/ blog], [http://www.cis.upenn.edu/~byorgey/ homepage]) is (as of November 2011) a fourth-year Ph.D. student in the [http://www.cis.upenn.edu/~plclub/ programming languages group] at the [http://www.upenn.edu University of Pennsylvania]. He enjoys teaching, creating EDSLs, playing Bach fugues, musing upon category theory, and cooking tasty lambda-treats for the denizens of #haskell.<br />
<br />
=Colophon=<br />
<br />
The Typeclassopedia was written by Brent Yorgey and initially published in March 2009. Painstakingly converted to wiki syntax by [[User:Geheimdienst]] in November 2011, after asking Brent’s permission.<br />
<br />
If something like this TeX to wiki syntax conversion ever needs to be done again, here are some vim commands that helped:<br />
<br />
* <nowiki>%s/\\section{\([^}]*\)}/=\1=/gc</nowiki><br />
* <nowiki>%s/\\subsection{\([^}]*\)}/==\1==/gc</nowiki><br />
* <nowiki>%s/^ *\\item /\r* /gc</nowiki><br />
* <nowiki>%s/---/—/gc</nowiki><br />
* <nowiki>%s/\$\([^$]*\)\$/<math>\1\\ <\/math>/gc</nowiki> ''Appending “\ ” forces images to be rendered. Otherwise, MediaWiki would go back and forth between one font for short <nowiki><math></nowiki> tags, and another more TeX-like font for longer tags (containing more than a few characters).''<br />
* <nowiki>%s/|\([^|]*\)|/<code>\1<\/code>/gc</nowiki><br />
* <nowiki>%s/\\dots/.../gc</nowiki><br />
* <nowiki>%s/^\\label{.*$//gc</nowiki><br />
* <nowiki>%s/\\emph{\([^}]*\)}/''\1''/gc</nowiki><br />
* <nowiki>%s/\\term{\([^}]*\)}/''\1''/gc</nowiki><br />
<br />
The biggest issue was taking the academic-paper-style citations and turning them into hyperlinks with an appropriate title and an appropriate target. In most cases there was an obvious thing to do (e.g. online PDFs of the cited papers or CiteSeer entries). Sometimes, however, it was less clear, and you might want to check the<br />
[[Media:Typeclassopedia.pdf|original Typeclassopedia PDF]]<br />
with the<br />
[http://code.haskell.org/~byorgey/TMR/Issue13/typeclassopedia.bib original bibliography file].<br />
<br />
To get all the citations into the main text, I first tried processing the source with TeX or LyX. This didn't work due to missing and unfindable packages, syntax errors, and my general ineptitude with TeX.<br />
<br />
I then went for the next best solution, which seemed to be extracting all instances of “\cite{something}” from the source and ''in that order'' pulling the referenced entries from the .bib file. This way you can go through the source file and sorted-references file in parallel, copying over what you need, without searching back and forth in the .bib file. I used:<br />
<br />
* <nowiki>egrep -o "\cite\{[^\}]*\}" ~/typeclassopedia.lhs | cut -c 6- | tr "," "\n" | tr -d "}" > /tmp/citations</nowiki><br />
* <nowiki>for i in $(cat /tmp/citations); do grep -A99 "$i" ~/typeclassopedia.bib|egrep -B99 '^\}$' -m1 ; done > ~/typeclasso-refs-sorted</nowiki><br />
<br />
[[Category:Applicative Functor]]<br />
[[Category:Arrow]]<br />
[[Category:Functor]]<br />
[[Category:Monad]]<br />
[[Category:Standard classes]]<br />
[[Category:Standard libraries]]<br />
[[Category:Standard packages]]<br />
[[Category:Standard types]]</div>Imzhttps://wiki.haskell.org/index.php?title=Web/Servers&diff=59556Web/Servers2015-03-26T09:57:07Z<p>Imz: /* mighttpd / mighttpd2 */ Its performance is comparable to that of nginx</p>
<hr />
<div>[[Category:Web|*]]<br />
{{Web infobox}}<br />
<br />
== happstack-server ==<br />
<br />
happstack-server contains a low-level HTTP backend and high-level functions for routing requests, examining request data, and generating responses. happstack-server is part of the Happstack framework, but can be used as an independent entity. The low- and high-level portions of the server are not cleanly separated into different packages, so it is not the best choice if you only need a low-level backend.<br />
<br />
{| class="wikitable"<br />
! License<br />
| BSD3<br />
|-<br />
! Author:<br />
| Happstack team, HAppS LLC<br />
|-<br />
! Maintainer:<br />
| Happstack team <happs@googlegroups.com><br />
|-<br />
! Home page:<br />
| http://happstack.com<br />
|-<br />
! Documentation:<br />
| http://happstack.com/docs<br />
|-<br />
! Package & repositories<br />
| [http://hackage.haskell.org/package/happstack Hackage] - [http://patch-tag.com/r/mae/happstack Darcs]<br />
|}<br />
<br />
== Hyena ==<br />
<br />
Hyena is a simple web application container that can be used to run Haskell web applications behind more robust web servers like Apache.<br />
<br />
{| class="wikitable"<br />
! License<br />
| BSD3<br />
|-<br />
! Author<br />
| Johan Tibell <johan.tibell@gmail.com><br />
|-<br />
! Maintainer<br />
| Johan Tibell <johan.tibell@gmail.com><br />
|-<br />
! Announcement<br />
| [http://www.haskell.org/pipermail/haskell-cafe/2009-June/063058.html Haskell Cafe]<br />
|-<br />
! Package & repositories<br />
| [http://hackage.haskell.org/package/hyena Hackage] - [http://github.com/tibbe/hyena Github]<br />
|}<br />
<br />
== Snap Server ==<br />
<br />
Part of the Snap framework, the Snap server is similar to Hyena in that it provides a very fast low-level web server. From the Hackage package:<br />
<br />
This is the first developer prerelease of the Snap framework. Snap is a simple and fast web development framework and server written in Haskell. For more information or to download the latest version, you can visit the Snap project website at http://snapframework.com/.<br />
<br />
The Snap HTTP server is a high performance, epoll-enabled, iteratee-based web server library written in Haskell. Together with the snap-core library upon which it depends, it provides a clean and efficient Haskell programming interface to the HTTP protocol.<br />
<br />
Higher-level facilities for building web applications (like user/session management, component interfaces, data modeling, etc.) are planned but not yet implemented, so this release will mostly be of interest for those who:<br />
<br />
* need a fast and minimal HTTP API at roughly the same level of abstraction as Java servlets, or<br />
<br />
* are interested in contributing to the Snap Framework project.<br />
<br />
{| class="wikitable"<br />
! License<br />
| BSD3<br />
|-<br />
! Author<br />
| James Sanders, Gregory Collins, Doug Beardsley<br />
|-<br />
! Maintainer<br />
| snap@snapframework.com<br />
|-<br />
! Package & repositories<br />
| [http://hackage.haskell.org/package/snap-server Hackage] - [http://github.com/snapframework/snap-server Github]<br />
|}<br />
<br />
== Warp ==<br />
<br />
The fastest Haskell web server, targeting the WAI (see [[Web/Framework_Interfaces]]).<br />
<br />
{| class="wikitable"<br />
! License:<br />
| BSD3<br />
|-<br />
! Author:<br />
| Michael Snoyman <michael@snoyman.com><br />
|-<br />
! Maintainer:<br />
| Michael Snoyman <michael@snoyman.com><br />
|-<br />
! Announcement:<br />
| http://docs.yesodweb.com/blog/announcing-warp<br />
|-<br />
! Package & repositories<br />
| [http://hackage.haskell.org/package/warp Hackage] - [http://github.com/softmechanics/warp Github]<br />
|}<br />
<br />
Example:<br />
<br />
<haskell><br />
{-# LANGUAGE OverloadedStrings #-}<br />
<br />
import Network.Wai<br />
import Network.Wai.Handler.Warp<br />
import Network.HTTP.Types (status200)<br />
import Blaze.ByteString.Builder (copyByteString)<br />
import qualified Data.ByteString.UTF8 as BU<br />
import Data.Monoid<br />
<br />
main = do<br />
let port = 3000<br />
putStrLn $ "Listening on port " ++ show port<br />
run port app<br />
<br />
app req respond = respond $<br />
case pathInfo req of<br />
["yay"] -> yay<br />
x -> index x<br />
<br />
yay = responseBuilder status200 [ ("Content-Type", "text/plain") ] $ mconcat $ map copyByteString<br />
[ "yay" ]<br />
<br />
index x = responseBuilder status200 [("Content-Type", "text/html")] $ mconcat $ map copyByteString<br />
[ "<p>Hello from ", BU.fromString $ show x, "!</p>"<br />
, "<p><a href='/yay'>yay</a></p>\n" ]<br />
<br />
</haskell><br />
<br />
== mighttpd / mighttpd2 ==<br />
<br />
A high-performance web server that handles static files and CGI on top of WAI/warp. Reverse-proxy functionality is also provided, for connecting to web applications running behind it.<br />
<br />
Its performance is comparable to that of nginx, which is written in C (according to the Monad.Reader article linked below).<br />
<br />
(Initial version of mighttpd didn't use WAI/warp.)<br />
<br />
{| class="wikitable"<br />
! License:<br />
| BSD3<br />
|-<br />
! Author:<br />
| Kazu Yamamoto <kazu@iij.ad.jp><br />
|-<br />
! Maintainer:<br />
| Kazu Yamamoto <kazu@iij.ad.jp><br />
|-<br />
! Announcement:<br />
| https://themonadreader.wordpress.com/2011/10/26/issue-19/<br />
|-<br />
! Package & repositories<br />
| [https://hackage.haskell.org/package/mighttpd2 Hackage] - [http://www.mew.org/~kazu/proj/mighttpd/ Homepage]<br />
|}</div>Imzhttps://wiki.haskell.org/index.php?title=Web/Libraries/XML_and_HTML&diff=59555Web/Libraries/XML and HTML2015-03-26T09:02:37Z<p>Imz: /* HTML Templating */ +Lucid; +examples from Chris Done's post for xhtml and blaze-html</p>
<hr />
<div>[[Category:Web|*]]<br />
{{Web infobox}}<br />
{{Formal under construction}}<br />
<br />
'''The libraries on this page need checking and sorting into maintained/not maintained.'''<br />
<br />
== HTML Templating ==<br />
<br />
;[http://hackage.haskell.org/package/xhtml XHtml library]<br />
:This is a version of [http://haskell.org/ghc/docs/latest/html/libraries/base/Text-Html.html Text.Html], modified to produce XHTML 1.0 Transitional.<br />
<br />
Example:<br />
<br />
<haskell><br />
header << thetitle << "Page title"<br />
<br />
thediv noHtml ! [theclass "logo"] << "…"<br />
thediv noHtml ! [identifier "login"]<br />
</haskell><br />
<br />
;[http://hackage.haskell.org/package/blaze-html blaze-html]<br />
Later, for [http://chrisdone.com/posts/lucid some people], blaze-html became the new go-to HTML-writing library. It improved upon the XHTML package by being faster and having a convenient monad instance. It looks like this:<br />
<br />
<haskell><br />
page1 = html $ do<br />
head $ do<br />
title "Introduction page."<br />
link ! rel "stylesheet" ! type_ "text/css" ! href "screen.css"<br />
body $ do<br />
div ! id "header" $ "Syntax"<br />
p "This is an example of BlazeMarkup syntax."<br />
ul $ mapM_ (li . toMarkup . show) [1, 2, 3]<br />
</haskell><br />
<br />
;[http://www.dtek.chalmers.se/~tumm/vieux/ Vieux: A Nevow implementation]<br />
:Vieux is an HTML templating system for Haskell. The basic idea is to define an XHTML template, from which Vieux generates an XHTML document.<br />
<br />
;[http://www.wellquite.org/chunks/ Text.HTML.Chunks]<br />
:Text.HTML.Chunks is a templating system inspired by the Perl HTML::Chunks module. The major change for the Haskell version is that the use of the templates is statically verified.<br />
<br />
;[http://hackage.haskell.org/package/lucid lucid]<br />
Wanting to improve on xhtml and blaze-html, Chris Done [http://chrisdone.com/posts/lucid wrote Lucid] ([http://www.reddit.com/r/haskell/comments/2my5bc/lucid_templating_dsl_for_html/ reddit discussion]); a bit later, he [http://chrisdone.com/posts/lucid2 updated] Lucid to major version 2.0 in a way that removes the need for the <hask>with</hask> combinator.<br />
<br />
Example:<br />
<br />
<haskell><br />
page :: Html ()<br />
page =<br />
html_<br />
(do head_<br />
(do title_ "Introduction page."<br />
link_ [rel_ "stylesheet",type_ "text/css",href_ "screen.css"]<br />
style_ "body{background:red}")<br />
body_<br />
(do div_ [id_ "header",style_ "color:white"] "Syntax"<br />
p_ (span_ (strong_ "This is an example of Lucid syntax."))<br />
hr_ []<br />
ul_ (mapM_ (li_ . toHtml . show)<br />
[1,2,3])<br />
table_ (tr_ (do td_ "Hello!"<br />
td_ [class_ "alt"] "World!"<br />
td_ "Sup?"))))<br />
</haskell><br />
<br />
== HTML Parsing ==<br />
<br />
;[http://community.haskell.org/~ndm/tagsoup/ TagSoup]<br />
: TagSoup is a library for extracting information out of unstructured HTML code, sometimes known as tag-soup. The HTML does not have to be well formed, or render properly within any particular framework. This library is for situations where the author of the HTML is not cooperating with the person trying to extract the information, but is also not trying to hide the information. The library provides a basic data type for a list of unstructured tags, a parser to convert HTML into this tag type, and useful functions and combinators for finding and extracting information.<br />
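As a small illustration of the workflow (a minimal sketch, assuming the tagsoup package; the HTML fragment and the <hask>links</hask> helper are made up for this example):

```haskell
import Text.HTML.TagSoup (fromAttrib, parseTags, (~==))

-- Collect the href attribute of every <a> tag in an HTML fragment,
-- without requiring the input to be well formed.
links :: String -> [String]
links = map (fromAttrib "href") . filter (~== "<a>") . parseTags

main :: IO ()
main = mapM_ putStrLn (links "<p><a href='/yay'>yay</a> <a href=\"/x\">x</p>")
```

Note that the second anchor above is never closed, yet TagSoup still recovers its <hask>href</hask>.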
<br />
== XML ==<br />
<br />
;[http://www.cs.york.ac.uk/fp/HaXml/ HaXml: utilities for using XML with Haskell]<br />
:Includes an XML parser, an HTML parser, a pretty-printer, a combinator library for generic XML transformations, and two Haskell&gt;-&lt;XML converters using type-based translation.<br />
<br />
;[http://www.fh-wedel.de/~si/HXmlToolbox/ HXT: Haskell XML Toolbox]<br />
:The Haskell XML Toolbox (HXT) is based on the ideas of HaXml and HXML, but introduces a more general approach based on arrows for processing XML with Haskell. The Haskell XML Toolbox uses a generic data model for representing XML documents, including the DTD subset and the document subset, in Haskell. It contains an XML parser and an HTML parser; namespaces are supported, and XPath expressions can be used for selecting and transforming parts of a document. Validation can be performed with respect to DTDs and RELAX NG schemas. A [[HXT|Getting started page]] describes the programming model behind HXT and gives some simple examples.<br />
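A minimal sketch of the arrow style (assuming the hxt package; the input fragment and the <hask>hrefs</hask> helper are invented for illustration):

```haskell
import Text.XML.HXT.Core

-- Parse a (possibly messy) HTML fragment and extract every href
-- attribute, using HXT's arrow combinators.
hrefs :: String -> IO [String]
hrefs doc =
  runX $ readString [withParseHTML yes, withWarnings no] doc
         >>> deep (isElem >>> hasName "a")
         >>> getAttrValue "href"

main :: IO ()
main = hrefs "<p><a href='/one'>one</a> and <a href='/two'>two</a></p>"
       >>= mapM_ putStrLn
```

Swapping <hask>withParseHTML yes</hask> for the default gives strict XML parsing instead.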
<br />
;[http://darcs.haskell.org/wraxml WraXML]<br />
:A little wrapper around HaXml and HXT: it provides a more natural data structure for representing XML trees, and converts between HaXml or HXT and its custom tree structure. The operations on the tree need not be of type <hask>(a -> [a])</hask>, so using these functions is a bit more type-safe. It has a custom lazy HTML parser using TagSoup and a custom lazy formatter. The library is currently oriented more toward HTML than XML.<br />
<br />
;[http://wiki.di.uminho.pt/wiki/bin/view/PURe/2LT 2LT: Two-Level Transformation]<br />
:A two-level data transformation consists of a type-level transformation of a data format coupled with value-level transformations of data instances corresponding to that format. Examples of two-level data transformations include XML schema evolution coupled with document migration, and data mappings used for interoperability and persistence. The package provides a library of two-level transformation combinators, which are used to compose transformation systems that, when applied to an input type, produce an output type together with the conversion functions that mediate between the input and output types. Front-ends for XML and SQL are included; they support (i) reading a schema, (ii) applying a two-level transformation system to produce a new schema, and (iii) converting a document/database corresponding to the input schema to a document/database corresponding to the output schema, and vice versa. Referential constraints and primary-key information are propagated through the schema transformation.<br />
<br />
;[http://www.mail-archive.com/haskell@haskell.org/msg18396.html HSXML]<br />
:A direct Haskell embedding of SXML<br />
<br />
;[http://m13s07.vlinux.de/darcs/StaticDTD/v2/ StaticDTD]<br />
:StaticDTD: complete static validity checking against a DTD.</div>Imzhttps://wiki.haskell.org/index.php?title=Web/Servers&diff=59553Web/Servers2015-03-26T08:38:12Z<p>Imz: + mighttpd2</p>
<hr />
<div>[[Category:Web|*]]<br />
{{Web infobox}}<br />
<br />
== happstack-server ==<br />
<br />
happstack-server contains a low-level HTTP backend and high-level functions for routing requests, examining request data, and generating responses. happstack-server is part of the Happstack framework, but can be used as an independent entity. The low- and high-level portions of the server are not cleanly separated into different packages, so it is not the best choice if you only need a low-level backend.<br />
<br />
{| class="wikitable"<br />
! License<br />
| BSD3<br />
|-<br />
! Author:<br />
| Happstack team, HAppS LLC<br />
|-<br />
! Maintainer:<br />
| Happstack team <happs@googlegroups.com><br />
|-<br />
! Home page:<br />
| http://happstack.com<br />
|-<br />
! Documentation:<br />
| http://happstack.com/docs<br />
|-<br />
! Package & repositories<br />
| [http://hackage.haskell.org/package/happstack Hackage] - [http://patch-tag.com/r/mae/happstack Darcs]<br />
|}<br />
<br />
== Hyena ==<br />
<br />
Hyena is a simple web application container that can be used to run Haskell web applications behind more robust web servers like Apache.<br />
<br />
{| class="wikitable"<br />
! License<br />
| BSD3<br />
|-<br />
! Author<br />
| Johan Tibell <johan.tibell@gmail.com><br />
|-<br />
! Maintainer<br />
| Johan Tibell <johan.tibell@gmail.com><br />
|-<br />
! Announcement<br />
| [http://www.haskell.org/pipermail/haskell-cafe/2009-June/063058.html Haskell Cafe]<br />
|-<br />
! Package & repositories<br />
| [http://hackage.haskell.org/package/hyena Hackage] - [http://github.com/tibbe/hyena Github]<br />
|}<br />
<br />
== Snap Server ==<br />
<br />
Part of the Snap framework, the Snap server is similar to Hyena in that it provides a very fast low-level web server. From the Hackage package:<br />
<br />
This is the first developer prerelease of the Snap framework. Snap is a simple and fast web development framework and server written in Haskell. For more information or to download the latest version, you can visit the Snap project website at http://snapframework.com/.<br />
<br />
The Snap HTTP server is a high performance, epoll-enabled, iteratee-based web server library written in Haskell. Together with the snap-core library upon which it depends, it provides a clean and efficient Haskell programming interface to the HTTP protocol.<br />
<br />
Higher-level facilities for building web applications (like user/session management, component interfaces, data modeling, etc.) are planned but not yet implemented, so this release will mostly be of interest for those who:<br />
<br />
* need a fast and minimal HTTP API at roughly the same level of abstraction as Java servlets, or<br />
<br />
* are interested in contributing to the Snap Framework project.<br />
<br />
{| class="wikitable"<br />
! License<br />
| BSD3<br />
|-<br />
! Author<br />
| James Sanders, Gregory Collins, Doug Beardsley<br />
|-<br />
! Maintainer<br />
| snap@snapframework.com<br />
|-<br />
! Package & repositories<br />
| [http://hackage.haskell.org/package/snap-server Hackage] - [http://github.com/snapframework/snap-server Github]<br />
|}<br />
<br />
== Warp ==<br />
<br />
The fastest Haskell web server, targeting the WAI (see [[Web/Framework_Interfaces]]).<br />
<br />
{| class="wikitable"<br />
! License:<br />
| BSD3<br />
|-<br />
! Author:<br />
| Michael Snoyman <michael@snoyman.com><br />
|-<br />
! Maintainer:<br />
| Michael Snoyman <michael@snoyman.com><br />
|-<br />
! Announcement:<br />
| http://docs.yesodweb.com/blog/announcing-warp<br />
|-<br />
! Package & repositories<br />
| [http://hackage.haskell.org/package/warp Hackage] - [http://github.com/softmechanics/warp Github]<br />
|}<br />
<br />
Example:<br />
<br />
<haskell><br />
{-# LANGUAGE OverloadedStrings #-}<br />
<br />
import Network.Wai<br />
import Network.Wai.Handler.Warp<br />
import Network.HTTP.Types (status200)<br />
import Blaze.ByteString.Builder (copyByteString)<br />
import qualified Data.ByteString.UTF8 as BU<br />
import Data.Monoid<br />
<br />
main = do<br />
let port = 3000<br />
putStrLn $ "Listening on port " ++ show port<br />
run port app<br />
<br />
app req respond = respond $<br />
case pathInfo req of<br />
["yay"] -> yay<br />
x -> index x<br />
<br />
yay = responseBuilder status200 [ ("Content-Type", "text/plain") ] $ mconcat $ map copyByteString<br />
[ "yay" ]<br />
<br />
index x = responseBuilder status200 [("Content-Type", "text/html")] $ mconcat $ map copyByteString<br />
[ "<p>Hello from ", BU.fromString $ show x, "!</p>"<br />
, "<p><a href='/yay'>yay</a></p>\n" ]<br />
<br />
</haskell><br />
<br />
== mighttpd / mighttpd2 ==<br />
<br />
A high-performance web server that handles static files and CGI on top of WAI/warp. Reverse-proxy functionality is also provided, for connecting to web applications running behind it.<br />
<br />
(Initial version of mighttpd didn't use WAI/warp.)<br />
<br />
{| class="wikitable"<br />
! License:<br />
| BSD3<br />
|-<br />
! Author:<br />
| Kazu Yamamoto <kazu@iij.ad.jp><br />
|-<br />
! Maintainer:<br />
| Kazu Yamamoto <kazu@iij.ad.jp><br />
|-<br />
! Announcement:<br />
| https://themonadreader.wordpress.com/2011/10/26/issue-19/<br />
|-<br />
! Package & repositories<br />
| [https://hackage.haskell.org/package/mighttpd2 Hackage] - [http://www.mew.org/~kazu/proj/mighttpd/ Homepage]<br />
|}</div>Imzhttps://wiki.haskell.org/index.php?title=Talk:All_About_Monads&diff=59463Talk:All About Monads2015-02-27T14:59:12Z<p>Imz: /* ST monad not covered */ new section</p>
<hr />
<div>== ST monad not covered ==<br />
<br />
I was looking for a Haskell abstraction that would suit my needs (an FFI library), and I suspect that ST would be the one (need to elaborate), but since this article doesn't describe it, I had the impression that although the idea (of ST) is simple, there is no such standard thing in Haskell.--[[User:Imz|Imz]] ([[User talk:Imz|talk]]) 14:59, 27 February 2015 (UTC)</div>Imzhttps://wiki.haskell.org/index.php?title=Applications_and_libraries/Linguistics&diff=59431Applications and libraries/Linguistics2015-02-22T08:38:17Z<p>Imz: /* Other functional or Haskell-related approaches to linguistics */ take a look at a section in "A History of Haskell: Being Lazy With Class"</p>
<hr />
<div>__TOC__<br />
<br />
== Portals and other huge resources ==<br />
<br />
Peter Ljunglöf's many [http://www.cse.chalmers.se/~peb/bibliography.html publications] on natural language processing, parsing, formal semantics. Many of them use Haskell, and there are [http://www.ling.gu.se/~peb/software.html downloadable] Haskell sources too.<br />
<br />
[http://homepages.cwi.nl/~jve/index.html Jan van Eijck's page] contains a huge amount of materials on logic and language:<br />
* computational linguistics<br />
* logics (e.g. dynamic epistemic modelling)<br />
<br />
The [http://projects.haskell.org/nlp Haskell NLP project] provides a mailing list for Haskellers doing NLP work, as well as a community wiki and darcs repository. Come join us!<br />
<br />
[http://nlpwp.org/ Natural Language Processing for The Working Programmer] is a book that provides an introduction to Haskell and NLP.<br />
<br />
There are many Haskell resources, too.<br />
<br />
== Tools and libraries ==<br />
<br />
* [http://www.w3.org/wiki/Cypher Cypher] is one of the first software programs available that generates a metadata representation of natural-language input. Cypher produces RDF graph and SeRQL query representations of sentences, clauses, phrases and questions. The Cypher framework provides a set of robust definition languages, which can be used to extend and create grammars and lexicons. Cypher programming is fun to learn and easy to use, and the specifications are designed to allow a novice to quickly and easily build transcoders for processing highly complex sentences and phrases of any natural language, and to cover any vocabulary.<br />
* [http://trac.loria.fr/~geni GenI] is a surface realiser for Tree Adjoining Grammars. Surface realisation can be seen as the last stage in a natural language generation pipeline. GenI in particular takes an FB-LTAG grammar and an input semantics (a conjunction of first order terms), and produces the set of sentences associated to the input semantics by the grammar. See also [http://www.loria.fr/~kow/ Eric Kow]'s recent publications on it.<br />
* [http://grammaticalframework.org/ Grammatical Framework] (GF) is a compiler and grammatical programming environment written entirely in Haskell, with an interactive interpreter and two GUI interfaces, one written in Fudgets and another written in Java. GF grammars are written in a subset of Haskell and compile into an internal GF format that may be used as embedded parsers in Haskell, parsers in Java (with an embedded Java interpreter gfc2java.jar) and subsequently converted to applets ([http://www.cs.chalmers.se/~markus/gramlets/ Gramlets]). (GF-Haskell to Java translation is performed through an Open Agent Architecture--the original .NET, see [http://www.cs.chalmers.se/~bringert/gf/gf-oaa.html GF OAA].) The GF grammatical formalism handles linguistic entities (morphemes, etc.) using type theory: an approach especially suited to machine translation of controlled natural languages. The [http://www.cs.chalmers.se/~aarne/GF/lib/resource-1.0/doc/index.html Grammar Resource Library], a set of basic grammars for Danish, English, Finnish, French, German, Italian, Norwegian, Russian, Spanish and Swedish, is available as a separate download. GF has been used to translate a fragment of C code to JVM (see [http://www.cs.chalmers.se/~aarne/GF/doc/gfcc.pdf GFCC (PDF document)]).<br />
* [http://www.cs.chalmers.se/~markus/FM/index.html Functional Morphology] - a toolkit for morphology development. It has been used for Swedish, Spanish, Urdu, and more.<br />
* [http://www.umiacs.umd.edu/~hal/HWordNet/ HWordNet] - A Haskell interface to WordNet by Hal Daumé III.<br />
* '''Saxophone''' is a fun translator from German to the Saxon dialect. It is part of the [https://sourceforge.net/projects/parallelweb ParallelWeb] project, which aims at translating Web pages, including all of their links.<br />
<br />
== Natural language processing and combinatory logic ==<br />
<br />
[[Combinatory logic]] has contributed to the development of powerful theories in linguistics.<br />
<br />
=== Applicative universal grammar ===<br />
<br />
Now it has got [[/Applicative universal grammar|its own HaskellWiki page]].<br />
<br />
=== Categorial grammar ===<br />
<br />
A general summary of modern semantic theories developed in the last century<br />
is provided by [http://citeseer.ist.psu.edu/blackburn97logical.html Logical Aspects of Computational Linguistics: an introduction].<br />
<br />
[http://www-unix.oit.umass.edu/~gmhwww/ Gary Hardegree]'s portal-rich page provides a lot of materials on logic and linguistics, among them<br />
* [http://www-unix.oit.umass.edu/~gmhwww/scholar.htm The Axiomatic Theory of Truth], covering concepts such as truth, quotation, paradoxes, and the liar paradox<br />
* [http://www-unix.oit.umass.edu/~gmhwww/scholar.htm Courses] ranging from the introductory level to advanced topics, e.g. [http://www-unix.oit.umass.edu/~gmhwww/511/pdf/a3.pdf Basic Categorial Grammar].<br />
<br />
[http://groups.inf.ed.ac.uk/ccg/ The Combinatory Categorial Grammar Site] contains links, papers (both introductory and advanced), and software ([http://opennlp.sourceforge.net/ OpenNLP], open-source projects related to natural language processing, and [http://openccg.sourceforge.net/ OpenCCG]).<br />
<br />
On natural language in relation to combinatory logic, see also<br />
* Mark Steedman's [http://citeseer.ist.psu.edu/steedman97does.html Does Grammar Make Use of Bound Variables?]<br />
* Mark Hepple: [http://citeseer.ist.psu.edu/hepple90grammar.html The Grammar and Processing of Order and Dependency: a Categorial Approach]<br />
<br />
=== Type-Logical Grammar ===<br />
<br />
Matteo Capelletti's [http://www.let.uu.nl/users/Matteo.Capelletti/personal/Home.html home page] contains a parser based on the Non-associative Lambek calculus. It supports hypothetical reasoning and Montague-style semantics.<br />
<br />
=== Tree Adjoining Grammar ===<br />
<br />
* See [http://trac.loria.fr/~geni GenI], mentioned above.<br />
<br />
== Game theoretic semantics ==<br />
<br />
Game-theoretic semantics presents an interesting concept of ''truth'', different from Tarski's.<br />
Its connections to computer science and computer languages are described in Wikipedia's [http://en.wikipedia.org/wiki/Game_semantics Game semantics] article. Merlijn Sevenster's [http://staff.science.uva.nl/~peter/teaching/merlijns20041129.pdf Game theoretical semantics and -logic] is a good introductory text too.<br />
<br />
Chiaki Ohkura's [http://acl.ldc.upenn.edu/W/W03/W03-1408.pdf The Semantics of Metaphor in the Game Theoretic Semantics with at Least Two Coordination Equilibria] article tries to catch the concept of ''metaphor''.<br />
<br />
=== Relatedness to linear logic ===<br />
<br />
The Wikipedia article also mentions the relatedness of game-theoretic semantics to ''linear logic''.<br />
[http://homepages.inf.ed.ac.uk/wadler/ Philip Wadler]'s page on [http://homepages.inf.ed.ac.uk/wadler/topics/linear-logic.html linear logic] describes the topic and its relatedness to many concepts concerning Haskell. [http://homepages.inf.ed.ac.uk/wadler/topics/linear-logic.html#lineartaste A taste of linear logic] can serve as an introductory article.<br />
<br />
== Parsing natural languages ==<br />
===Parsing Natural Language with X-SAIGA parser===<br />
<br />
The goal of the [http://www.cs.uwindsor.ca/~hafiz/proHome.html X-SAIGA] project is to create algorithms and implementations which enable language processors (recognizers, parsers, interpreters, translators, etc.) to be constructed as modular and efficient embedded executable specifications of grammars. The syntax analysis is done with a set of parser combinators that overcome some long-standing limitations:<br />
# the simple implementations of parser combinators require [[exponential]] time and space when parsing an ambiguous context-free grammar.<br />
# like any top-down recursive-descent parser, conventional parser combinators will not terminate while processing a left-recursive grammar (i.e. <code>s ::= s *> s *> term 'x'|empty</code>).<br />
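The second limitation can be seen directly in a naive list-of-successes recognizer (a toy sketch invented for illustration, not the X-SAIGA implementation): in <hask>s</hask> below, the left-recursive alternative re-invokes <hask>s</hask> on unchanged input, so the recursion never bottoms out.

```haskell
type Parser = String -> [String]   -- each result is the remaining input

term :: Char -> Parser
term c (x:xs) | x == c = [xs]
term _ _               = []

(<+>) :: Parser -> Parser -> Parser   -- alternation
(p <+> q) inp = p inp ++ q inp

(>*>) :: Parser -> Parser -> Parser   -- sequencing
(p >*> q) inp = concatMap q (p inp)

empty' :: Parser
empty' inp = [inp]

-- s ::= s 'x' | empty  -- evaluating  s "x"  loops forever, because
-- s immediately calls itself on the very same input.
s :: Parser
s = (s >*> term 'x') <+> empty'

main :: IO ()
main = print ((term 'a' >*> term 'b') "ab")  -- the combinators themselves are fine
```

The X-SAIGA work cited below curtails exactly this kind of non-terminating descent by memoizing results and imposing depth restrictions.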
<br />
As part of the X-SAIGA project's syntax analysis, a [http://cs.uwindsor.ca/~hafiz/p46-frost.pdf recognition algorithm] that accommodates ambiguous grammars with direct [[left recursion|left-recursive]] rules was described by Frost and Hafiz in 2006. The algorithm curtails the otherwise ever-growing left-recursive parse by imposing depth restrictions. That algorithm was extended to a complete [http://cs.uwindsor.ca/~hafiz/iwpt-07.pdf parsing algorithm] that accommodates indirect as well as direct left-recursion in [[polynomial]] time, and that generates compact polynomial-size representations of the potentially exponential number of parse trees for highly ambiguous grammars, by Frost, Hafiz and Callaghan in 2007. The extended algorithm accommodates indirect left-recursion by comparing its 'computed context' with the 'current context'. The same authors also [http://cs.uwindsor.ca/~hafiz/PADL_PAPER_FINAL.pdf described their implementation of a set of parser combinators] written in the [[Haskell]] programming language based on the same algorithm at [http://www.ist.unomaha.edu/padl2008/ PADL 08]. The [http://www.cs.uwindsor.ca/~hafiz/proHome.html X-SAIGA] site has more about the algorithms, implementation details, and experimental results.<br />
<br />
===Monadic Compositional Parsing===<br />
Gordon J. Pace: [http://www.cs.um.edu.mt/~csaw/CSAW04/Proceedings/08.pdf Monadic Compositional Parsing with Context Using Maltese as a Case Study], see its [http://www.cs.um.edu.mt/~csaw/CSAW04/ context] too.<br />
<br />
== Other functional or Haskell-related approaches to linguistics ==<br />
<br />
* [http://cs.uwindsor.ca/~richard/PUBLICATIONS/NLI_LFP_SURVEY_DRAFT.pdf A Survey on the Use of Haskell in Natural-Language Processing] (Report by Richard A. Frost). It is also a part of Haskell Communities and Activities Report, [http://www.haskell.org/communities/11-2006/html/report.html Eleventh edition – November 30, 2006].<br />
* [http://research.microsoft.com/en-us/um/people/simonpj/papers/history-of-haskell/history.pdf A History of Haskell: Being Lazy With Class] (2007) has a section (11.5, on page 40) with material contributed by Paul Callaghan on applications of Haskell to natural-language processing. There may be projects mentioned there that are not (yet) listed on this page, so take a look.<br />
* From [http://www.cs.chalmers.se/~aarne/ Aarne Ranta's homepage]<br />
** [http://www.cs.chalmers.se/~aarne/course-langtech/ Natural Language Technology], with (among others) [http://www.cs.chalmers.se/~aarne/course-langtech/lectures/lectures.html online course slides]. They give deep insights; for example, see the slide that discusses [[Dependent type#Type theory|the concept of dependent type and the Curry-Howard isomorphism]] in a linguistic context.<br />
* The [http://sanskrit.inria.fr/ZEN/ Zen Computational Linguistics Toolkit] has tools for efficiently processing linguistic data structures, like trees and automata. It's written in literate OCaml, though a Haskell port shouldn't be very hard to do.<br />
* The [http://nlpers.blogspot.com/ natural language processing blog] written by [http://www.isi.edu/~hdaume/ Hal Daume III].<br />
<br />
== Other linguistics-related resources ==<br />
<br />
Dr. [http://www.haskell.org/haskellwiki/Libraries_and_tools/Linguistics Günter Neumann]'s homepage.<br />
<br />
== Specific topics ==<br />
<br />
=== Lojban ===<br />
<br />
Lojban, an artificial language ([[Lojban|see the separate HaskellWiki page on it, with references]]). “Lojban was not designed primarily to be an international language, however, but rather as a linguistic tool for studying and understanding language. Its linguistic and computer applications make Lojban unique among international languages...” (nicholas2003wl, page 15, par. 1)<br />
<br />
=== Continuations in natural languages ===<br />
<br />
Some phenomena in natural languages can be grasped with the notion of [[continuation]]. For details, see<br />
Chris Barker's paper [http://www.cs.bham.ac.uk/~hxt/cw04/barker.pdf Continuations in Natural Language]. It is quite accessible to non-linguists.<br />
<br />
== References ==<br />
<br />
;barker2004cnl<br />
:Barker, Chris: [http://www.cs.bham.ac.uk/~hxt/cw04/barker.pdf Continuations in Natural Language] (pdf), 2004<br />
;nicholas2003wl<br />
:Nicholas, Nick and Cowan, John (ed.): What is Lojban? [http://www.lojban.org/ Logical Language Group], 2003. Also available [http://www.lojban.org/tiki/tiki-index.php?page=What+Is+Lojban%3F%2C+The+Book&bl online].<br />
;frost2006rnl <br />
:Frost, Richard: [http://cs.uwindsor.ca/~richard/PUBLICATIONS/NLI_LFP_SURVEY_DRAFT.pdf Realization of natural language interfaces using lazy functional programming] (pdf), 2006<br />
;frost2008pal<br />
:Frost, Richard; Hafiz, Rahmatullah and Callaghan, Paul: [http://cs.uwindsor.ca/~hafiz/PADL_PAPER_FINAL.pdf Parser Combinators for Ambiguous Left-Recursive Grammars.] Proceedings of the 10th International Symposium on Practical Aspects of Declarative Languages (PADL), ACM-SIGPLAN. January 2008, San Francisco, USA.<br />
;xsaiga2008exg<br />
:Frost, Richard; Hafiz, Rahmatullah and Callaghan, Paul: [http://cs.uwindsor.ca/~hafiz/proHome.html X-SAIGA] website - eXecutable SpecificAtIons of GrAmmars.<br />
<br />
[[Category:Theoretical foundations]]</div>Imzhttps://wiki.haskell.org/index.php?title=Typeclassopedia&diff=59129Typeclassopedia2014-11-27T17:50:59Z<p>Imz: /* Further reading */ fixed wikisyntax for external links</p>
<hr />
<div>''By [[User:Byorgey|Brent Yorgey]], byorgey@cis.upenn.edu''<br />
<br />
''Originally published 12 March 2009 in [http://www.haskell.org/wikiupload/8/85/TMR-Issue13.pdf issue 13] of [http://themonadreader.wordpress.com/ the Monad.Reader]. Ported to the Haskell wiki in November 2011 by [[User:Geheimdienst|Geheimdienst]].''<br />
<br />
''This is now the official version of the Typeclassopedia and supersedes the version published in the Monad.Reader. Please help update and extend it by editing it yourself or by leaving comments, suggestions, and questions on the [[Talk:Typeclassopedia|talk page]].''<br />
<br />
=Abstract=<br />
<br />
The standard Haskell libraries feature a number of type classes with algebraic or category-theoretic underpinnings. Becoming a fluent Haskell hacker requires intimate familiarity with them all, yet acquiring this familiarity often involves combing through a mountain of tutorials, blog posts, mailing list archives, and IRC logs.<br />
<br />
The goal of this document is to serve as a starting point for the student of Haskell wishing to gain a firm grasp of its standard type classes. The essentials of each type class are introduced, with examples, commentary, and extensive references for further reading.<br />
<br />
=Introduction=<br />
<br />
Have you ever had any of the following thoughts?<br />
* What the heck is a monoid, and how is it different from a mon<u>a</u>d?<br />
<br />
* I finally figured out how to use [[Parsec]] with do-notation, and someone told me I should use something called <code>Applicative</code> instead. Um, what?<br />
<br />
* Someone in the [[IRC channel|#haskell]] IRC channel used <code>(***)</code>, and when I asked Lambdabot to tell me its type, it printed out scary gobbledygook that didn’t even fit on one line! Then someone used <code>fmap fmap fmap</code> and my brain exploded.<br />
<br />
* When I asked how to do something I thought was really complicated, people started typing things like <code>zip.ap fmap.(id &&& wtf)</code> and the scary thing is that they worked! Anyway, I think those people must actually be robots because there’s no way anyone could come up with that in two seconds off the top of their head.<br />
<br />
If you have, look no further! You, too, can write and understand concise, elegant, idiomatic Haskell code with the best of them.<br />
<br />
There are two keys to an expert Haskell hacker’s wisdom:<br />
# Understand the types.<br />
# Gain a deep intuition for each type class and its relationship to other type classes, backed up by familiarity with many examples.<br />
<br />
It’s impossible to overstate the importance of the first; the patient student of type signatures will uncover many profound secrets. Conversely, anyone ignorant of the types in their code is doomed to eternal uncertainty. “Hmm, it doesn’t compile ... maybe I’ll stick in an<br />
<code>fmap</code> here ... nope, let’s see ... maybe I need another <code>(.)</code> somewhere? ... um ...”<br />
<br />
The second key—gaining deep intuition, backed by examples—is also important, but much more difficult to attain. A primary goal of this document is to set you on the road to gaining such intuition. However—<br />
<br />
:''There is no royal road to Haskell. {{h:title|Well, he probably would have said it if he knew Haskell.|—Euclid}}''<br />
<br />
This document can only be a starting point, since good intuition comes from hard work, [http://byorgey.wordpress.com/2009/01/12/abstraction-intuition-and-the-monad-tutorial-fallacy/ not from learning the right metaphor]. Anyone who reads and understands all of it will still have an arduous journey ahead—but sometimes a good starting point makes a big difference.<br />
<br />
It should be noted that this is not a Haskell tutorial; it is assumed that the reader is already familiar with the basics of Haskell, including the standard <code>[http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html Prelude]</code>, the type system, data types, and type classes.<br />
<br />
The type classes we will be discussing and their interrelationships:<br />
<br />
[[Image:Typeclassopedia-diagram.png]]<br />
<br />
{{note|<code>Semigroup</code> can be found in the [http://hackage.haskell.org/package/semigroups <code>semigroups</code> package], <code>Apply</code> in the [http://hackage.haskell.org/package/semigroupoids <code>semigroupoids</code> package], and <code>Comonad</code> in the [http://hackage.haskell.org/package/comonad <code>comonad</code> package].}}<br />
<br />
* <span style="border-bottom: 2px solid black">Solid arrows</span> point from the general to the specific; that is, if there is an arrow from <code>Foo</code> to <code>Bar</code> it means that every <code>Bar</code> is (or should be, or can be made into) a <code>Foo</code>.<br />
* <span style="border-bottom: 2px dotted black">Dotted arrows</span> indicate some other sort of relationship.<br />
* <code>Monad</code> and <code>ArrowApply</code> are equivalent.<br />
* <code>Semigroup</code>, <code>Apply</code> and <code>Comonad</code> are greyed out since they are not actually (yet?) in the standard Haskell libraries {{noteref}}.<br />
<br />
One more note before we begin. The original spelling of “type class” is with two words, as evidenced by, for example, the [http://www.haskell.org/onlinereport/haskell2010/ Haskell 2010 Language Report], early papers on type classes like [http://citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.103.5639 Type classes in Haskell] and [http://research.microsoft.com/en-us/um/people/simonpj/papers/type-class-design-space/ Type classes: exploring the design space], and [http://citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.168.4008 Hudak et al.’s history of Haskell]. However, as often happens with two-word phrases that see a lot of use, it has started to show up as one word (“typeclass”) or, rarely, hyphenated (“type-class”). When wearing my prescriptivist hat, I prefer “type class”, but realize (after changing into my descriptivist hat) that there's probably not much I can do about it.<br />
<br />
We now begin with the simplest type class of all: <code>Functor</code>.<br />
<br />
=Functor=<br />
<br />
The <code>Functor</code> class ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html#t:Functor haddock]) is the most basic and ubiquitous type class in the Haskell libraries. A simple intuition is that a <code>Functor</code> represents a “container” of some sort, along with the ability to apply a function uniformly to every element in the container. For example, a list is a container of elements, and we can apply a function to every element of a list, using <code>map</code>. As another example, a binary tree is also a container of elements, and it’s not hard to come up with a way to recursively apply a function to every element in a tree.<br />
<br />
Another intuition is that a <code>Functor</code> represents some sort of “computational context”. This intuition is generally more useful, but is more difficult to explain, precisely because it is so general. Some examples later should help to clarify the <code>Functor</code>-as-context point of view.<br />
<br />
In the end, however, a <code>Functor</code> is simply what it is defined to be; doubtless there are many examples of <code>Functor</code> instances that don’t exactly fit either of the above intuitions. The wise student will focus their attention on definitions and examples, without leaning too heavily on any particular metaphor. Intuition will come, in time, on its own.<br />
<br />
==Definition==<br />
<br />
Here is the type class declaration for <code>Functor</code>:<br />
<br />
<haskell><br />
class Functor f where<br />
fmap :: (a -> b) -> f a -> f b<br />
</haskell><br />
<br />
<code>Functor</code> is exported by the <code>Prelude</code>, so no special imports are needed to use it.<br />
<br />
First, the <code>f a</code> and <code>f b</code> in the type signature for <code>fmap</code> tell us that <code>f</code> isn’t just a type; it is a ''type constructor'' which takes another type as a parameter. (A more precise way to say this is that the ''kind'' of <code>f</code> must be <code>* -> *</code>.) For example, <code>Maybe</code> is such a type constructor: <code>Maybe</code> is not a type in and of itself, but requires another type as a parameter, like <code>Maybe Integer</code>. So it would not make sense to say <code>instance Functor Integer</code>, but it could make sense to say <code>instance Functor Maybe</code>.<br />
<br />
Now look at the type of <code>fmap</code>: it takes any function from <code>a</code> to <code>b</code>, and a value of type <code>f a</code>, and outputs a value of type <code>f b</code>. From the container point of view, the intention is that <code>fmap</code> applies a function to each element of a container, without altering the structure of the container. From the context point of view, the intention is that <code>fmap</code> applies a function to a value without altering its context. Let’s look at a few specific examples.<br />
<br />
==Instances==<br />
<br />
{{note|Recall that <code>[]</code> has two meanings in Haskell: it can either stand for the empty list, or, as here, it can represent the list type constructor (pronounced “list-of”). In other words, the type <code>[a]</code> (list-of-<code>a</code>) can also be written <code>[] a</code>.}}<br />
<br />
{{note|You might ask why we need a separate <code>map</code> function. Why not just do away with the current list-only <code>map</code> function, and rename <code>fmap</code> to <code>map</code> instead? Well, that’s a good question. The usual argument is that someone just learning Haskell, when using <code>map</code> incorrectly, would much rather see an error about lists than about <code>Functor</code>s.}}<br />
<br />
As noted before, the list constructor <code>[]</code> is a functor {{noteref}}; we can use the standard list function <code>map</code> to apply a function to each element of a list {{noteref}}. The <code>Maybe</code> type constructor is also a functor, representing a container which might hold a single element. The function <code>fmap g</code> has no effect on <code>Nothing</code> (there are no elements to which <code>g</code> can be applied), and simply applies <code>g</code> to the single element inside a <code>Just</code>. Alternatively, under the context interpretation, the list functor represents a context of nondeterministic choice; that is, a list can be thought of as representing a single value which is nondeterministically chosen from among several possibilities (the elements of the list). Likewise, the <code>Maybe</code> functor represents a context with possible failure. These instances are:<br />
<br />
<haskell><br />
instance Functor [] where<br />
fmap _ [] = []<br />
fmap g (x:xs) = g x : fmap g xs<br />
-- or we could just say fmap = map<br />
<br />
instance Functor Maybe where<br />
fmap _ Nothing = Nothing<br />
fmap g (Just a) = Just (g a)<br />
</haskell><br />
<br />
As an aside, in idiomatic Haskell code you will often see the letter <code>f</code> used to stand for both an arbitrary <code>Functor</code> and an arbitrary function. In this document, <code>f</code> represents only <code>Functor</code>s, and <code>g</code> or <code>h</code> always represent functions, but you should be aware of the potential confusion. In practice, what <code>f</code> stands for should always be clear from the context, by noting whether it is part of a type or part of the code.<br />
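<br />
As a quick sanity check, here is how the two instances above behave on concrete values (an illustrative addition, not part of the instance definitions themselves):<br />

```haskell
-- Evaluating fmap with the list and Maybe instances shown above
incremented :: [Int]
incremented = fmap (+1) [1,2,3]               -- [2,3,4]

shown :: Maybe String
shown = fmap show (Just (3 :: Int))           -- Just "3"

untouched :: Maybe String
untouched = fmap show (Nothing :: Maybe Int)  -- Nothing: there is no element to show
```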
<br />
There are other <code>Functor</code> instances in the standard libraries; below are a few. Note that some of these instances are not exported by the <code>Prelude</code>; to access them, you can import <code>Control.Monad.Instances</code>.<br />
<br />
* <code>Either e</code> is an instance of <code>Functor</code>; <code>Either e a</code> represents a container which can contain either a value of type <code>a</code>, or a value of type <code>e</code> (often representing some sort of error condition). It is similar to <code>Maybe</code> in that it represents possible failure, but it can carry some extra information about the failure as well.<br />
<br />
* <code>((,) e)</code> represents a container which holds an “annotation” of type <code>e</code> along with the actual value it holds. It might be clearer to write it as <code>(e,)</code>, by analogy with an operator section like <code>(1+)</code>, but that syntax is not allowed in types (although it is allowed in expressions with the <code>TupleSections</code> extension enabled). However, you can certainly ''think'' of it as <code>(e,)</code>.<br />
<br />
* <code>((->) e)</code> (which can be thought of as <code>(e ->)</code>; see above), the type of functions which take a value of type <code>e</code> as a parameter, is a <code>Functor</code>. As a container, <code>(e -> a)</code> represents a (possibly infinite) set of values of <code>a</code>, indexed by values of <code>e</code>. Alternatively, and more usefully, <code>((->) e)</code> can be thought of as a context in which a value of type <code>e</code> is available to be consulted in a read-only fashion. This is also why <code>((->) e)</code> is sometimes referred to as the ''reader monad''; more on this later.<br />
<br />
* <code>IO</code> is a <code>Functor</code>; a value of type <code>IO a</code> represents a computation producing a value of type <code>a</code> which may have I/O effects. If <code>m</code> computes the value <code>x</code> while producing some I/O effects, then <code>fmap g m</code> will compute the value <code>g x</code> while producing the same I/O effects.<br />
<br />
* Many standard types from the [http://hackage.haskell.org/package/containers/ containers library] (such as <code>Tree</code>, <code>Map</code>, and <code>Sequence</code>) are instances of <code>Functor</code>. A notable exception is <code>Set</code>, which cannot be made a <code>Functor</code> in Haskell (although it is certainly a mathematical functor) since it requires an <code>Ord</code> constraint on its elements; <code>fmap</code> must be applicable to ''any'' types <code>a</code> and <code>b</code>. However, <code>Set</code> (and other similarly restricted data types) can be made an instance of a suitable generalization of <code>Functor</code>, either by [http://article.gmane.org/gmane.comp.lang.haskell.cafe/78052/ making <code>a</code> and <code>b</code> arguments to the <code>Functor</code> type class themselves], or by adding an [http://blog.omega-prime.co.uk/?p=127 associated constraint].<br />
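<br />
A few concrete evaluations illustrate the instances listed above (with a modern GHC these instances are exported by the <code>Prelude</code>, so no extra import should be needed; this is an added sketch, not part of the original text):<br />

```haskell
-- fmap with the Either e instance: only a Right value is transformed
eitherOk, eitherErr :: Either String Int
eitherOk  = fmap (+1) (Right 3)       -- Right 4
eitherErr = fmap (+1) (Left "oops")   -- Left "oops": the error is untouched

-- fmap with the ((,) e) instance: the annotation is left alone
annotated :: (String, Int)
annotated = fmap (+1) ("tag", 3)      -- ("tag", 4)

-- fmap with the ((->) e) instance is just function composition
composed :: Int
composed = fmap (+1) (*2) 5           -- 11, i.e. ((+1) . (*2)) 5
```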
<br />
{{Exercises|<br />
<ol><br />
<li>Implement <code>Functor</code> instances for <code>Either e</code> and <code>((->) e)</code>.</li><br />
<li>Implement <code>Functor</code> instances for <code>((,) e)</code> and for <code>Pair</code>, defined as <br />
<br />
<haskell>data Pair a = Pair a a</haskell><br />
<br />
Explain their similarities and differences.<br />
</li><br />
<li>Implement a <code>Functor</code> instance for the type <code>ITree</code>, defined as<br />
<br />
<haskell><br />
data ITree a = Leaf (Int -> a) <br />
| Node [ITree a]<br />
</haskell><br />
</li><br />
<li>Give an example of a type of kind <code>* -> *</code> which cannot be made an instance of <code>Functor</code> (without using <code>undefined</code>).<br />
</li><br />
<li>Is this statement true or false? <br />
<br />
:''The composition of two <code>Functor</code>s is also a <code>Functor</code>.''<br />
<br />
If false, give a counterexample; if true, prove it by exhibiting some appropriate Haskell code.<br />
</li><br />
</ol><br />
}}<br />
<br />
==Laws==<br />
<br />
As far as the Haskell language itself is concerned, the only requirement to be a <code>Functor</code> is an implementation of <code>fmap</code> with the proper type. Any sensible <code>Functor</code> instance, however, will also satisfy the ''functor laws'', which are part of the definition of a mathematical functor. There are two:<br />
<br />
<haskell><br />
fmap id = id<br />
fmap (g . h) = (fmap g) . (fmap h)<br />
</haskell><br />
<br />
{{note|Technically, these laws make <code>f</code> and <code>fmap</code> together an endofunctor on ''Hask'', the category of Haskell types (ignoring [[Bottom|&perp;]], which is a party pooper). See [http://en.wikibooks.org/wiki/Haskell/Category_theory Wikibook: Category theory].}}<br />
<br />
Together, these laws ensure that <code>fmap g</code> does not change the ''structure'' of a container, only the elements. Equivalently, and more simply, they ensure that <code>fmap g</code> changes a value without altering its context {{noteref}}.<br />
<br />
The first law says that mapping the identity function over every item in a container has no effect. The second says that mapping a composition of two functions over every item in a container is the same as first mapping one function, and then mapping the other.<br />
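<br />
The laws can be spot-checked on particular values; for instance, with the list functor (checking single samples like this is of course not a proof, just an added illustration):<br />

```haskell
-- First law on a sample list: fmap id = id
lawId :: Bool
lawId = fmap id [1,2,3 :: Int] == id [1,2,3]          -- True

-- Second law on the same sample, with g = (+1) and h = (*2)
lawCompose :: Bool
lawCompose =
  fmap ((+1) . (*2)) [1,2,3 :: Int]
    == (fmap (+1) . fmap (*2)) [1,2,3]                -- True: both give [3,5,7]
```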
<br />
As an example, the following code is a “valid” instance of <code>Functor</code> (it typechecks), but it violates the functor laws. Do you see why?<br />
<br />
<haskell><br />
-- Evil Functor instance<br />
instance Functor [] where<br />
fmap _ [] = []<br />
fmap g (x:xs) = g x : g x : fmap g xs<br />
</haskell><br />
<br />
Any Haskeller worth their salt would reject this code as a gruesome abomination.<br />
<br />
Unlike some other type classes we will encounter, a given type has at most one valid instance of <code>Functor</code>. This [http://article.gmane.org/gmane.comp.lang.haskell.libraries/15384 can be proven] via the [http://homepages.inf.ed.ac.uk/wadler/topics/parametricity.html#free ''free theorem''] for the type of <code>fmap</code>. In fact, [http://byorgey.wordpress.com/2010/03/03/deriving-pleasure-from-ghc-6-12-1/ GHC can automatically derive] <code>Functor</code> instances for many data types.<br />
<br />
{{note|Actually, if <code>seq</code>/<code>undefined</code> are considered, it [http://stackoverflow.com/a/8323243/305559 is possible] to have an implementation which satisfies the first law but not the second. The rest of the comments in this section should be considered in a context where <code>seq</code> and <code>undefined</code> are excluded.}}<br />
<br />
A [https://github.com/quchen/articles/blob/master/second_functor_law.md similar argument also shows] that any <code>Functor</code> instance satisfying the first law (<code>fmap id = id</code>) will automatically satisfy the second law as well. Practically, this means that only the first law needs to be checked (usually by a very straightforward induction) to ensure that a <code>Functor</code> instance is valid.{{noteref}}<br />
<br />
{{Exercises|<br />
# Although it is not possible for a <code>Functor</code> instance to satisfy the first <code>Functor</code> law but not the second (excluding <code>undefined</code>), the reverse is possible. Give an example of a (bogus) <code>Functor</code> instance which satisfies the second law but not the first.<br />
# Which laws are violated by the evil <code>Functor</code> instance for list shown above: both laws, or the first law alone? Give specific counterexamples.<br />
}}<br />
<br />
==Intuition==<br />
<br />
There are two fundamental ways to think about <code>fmap</code>. The first has already been mentioned: it takes two parameters, a function and a container, and applies the function “inside” the container, producing a new container. Alternately, we can think of <code>fmap</code> as applying a function to a value in a context (without altering the context).<br />
<br />
Just like all other Haskell functions of “more than one parameter”, however, <code>fmap</code> is actually ''curried'': it does not really take two parameters, but takes a single parameter and returns a function. For emphasis, we can write <code>fmap</code>’s type with extra parentheses: <code>fmap :: (a -> b) -> (f a -> f b)</code>. Written in this form, it is apparent that <code>fmap</code> transforms a “normal” function (<code>g :: a -> b</code>) into one which operates over containers/contexts (<code>fmap g :: f a -> f b</code>). This transformation is often referred to as a ''lift''; <code>fmap</code> “lifts” a function from the “normal world” into the “<code>f</code> world”.<br />
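<br />
For example, the same function can be lifted into two different functors (an added illustration):<br />

```haskell
-- fmap (+1) is (+1) "lifted" into a functor; the same lift works for
-- any Functor f, here specialized to Maybe and to lists
liftedMaybe :: Maybe Int -> Maybe Int
liftedMaybe = fmap (+1)

liftedList :: [Int] -> [Int]
liftedList = fmap (+1)
-- liftedMaybe (Just 3) evaluates to Just 4
-- liftedList [1,2,3]  evaluates to [2,3,4]
```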
<br />
==Further reading==<br />
<br />
A good starting point for reading about the category theory behind the concept of a functor is the excellent [http://en.wikibooks.org/wiki/Haskell/Category_theory Haskell wikibook page on category theory].<br />
<br />
=Applicative=<br />
<br />
A somewhat newer addition to the pantheon of standard Haskell type classes, ''applicative functors'' represent an abstraction lying in between <code>Functor</code> and <code>Monad</code> in expressivity, first described by McBride and Paterson. The title of their classic paper, [http://www.soi.city.ac.uk/~ross/papers/Applicative.html Applicative Programming with Effects], gives a hint at the intended intuition behind the [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Applicative.html <code>Applicative</code>] type class. It encapsulates certain sorts of “effectful” computations in a functionally pure way, and encourages an “applicative” programming style. Exactly what these things mean will be seen later.<br />
<br />
==Definition==<br />
<br />
Recall that <code>Functor</code> allows us to lift a “normal” function to a function on computational contexts. But <code>fmap</code> doesn’t allow us to apply a function which is itself in a context to a value in a context. <code>Applicative</code> gives us just such a tool, <code>(<*>)</code>. It also provides a method, <code>pure</code>, for embedding values in a default, “effect free” context. Here is the type class declaration for <code>Applicative</code>, as defined in <code>Control.Applicative</code>:<br />
<br />
<haskell><br />
class Functor f => Applicative f where<br />
pure :: a -> f a<br />
(<*>) :: f (a -> b) -> f a -> f b<br />
</haskell><br />
<br />
Note that every <code>Applicative</code> must also be a <code>Functor</code>. In fact, as we will see, <code>fmap</code> can be implemented using the <code>Applicative</code> methods, so every <code>Applicative</code> is a functor whether we like it or not; the <code>Functor</code> constraint forces us to be honest.<br />
<br />
{{note|Recall that <code>($)</code> is just function application: <code>f $ x {{=}} f x</code>.}}<br />
<br />
As always, it’s crucial to understand the type signatures. First, consider <code>(<*>)</code>: the best way of thinking about it comes from noting that the type of <code>(<*>)</code> is similar to the type of <code>($)</code> {{noteref}}, but with everything enclosed in an <code>f</code>. In other words, <code>(<*>)</code> is just function application within a computational context. The type of <code>(<*>)</code> is also very similar to the type of <code>fmap</code>; the only difference is that the first parameter is <code>f (a -> b)</code>, a function in a context, instead of a “normal” function <code>(a -> b)</code>.<br />
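<br />
The analogy with <code>($)</code> can be seen by evaluating both side by side, here using the <code>Maybe</code> instance from the standard libraries (an added illustration):<br />

```haskell
import Control.Applicative

-- Ordinary function application ...
plain :: Int
plain = (+3) $ 4                    -- 7

-- ... versus function application within the Maybe context
inContext :: Maybe Int
inContext = Just (+3) <*> Just 4    -- Just 7
```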
<br />
<code>pure</code> takes a value of any type <code>a</code>, and returns a context/container of type <code>f a</code>. The intention is that <code>pure</code> creates some sort of “default” container or “effect free” context. In fact, the behavior of <code>pure</code> is quite constrained by the laws it should satisfy in conjunction with <code>(<*>)</code>. Usually, for a given implementation of <code>(<*>)</code> there is only one possible implementation of <code>pure</code>.<br />
<br />
(Note that previous versions of the Typeclassopedia explained <code>pure</code> in terms of a type class <code>Pointed</code>, which can still be found in the [http://hackage.haskell.org/package/pointed <code>pointed</code> package]. However, the current consensus is that <code>Pointed</code> is not very useful after all. For a more detailed explanation, see [[Why not Pointed?]])<br />
<br />
==Laws==<br />
<br />
{{note|See<br />
[http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Applicative.html haddock for Applicative] and [http://www.soi.city.ac.uk/~ross/papers/Applicative.html Applicative programming with effects]}}<br />
<br />
Traditionally, there are four laws that <code>Applicative</code> instances should satisfy {{noteref}}. In some sense, they are all concerned with making sure that <code>pure</code> deserves its name:<br />
<br />
* The identity law:<br /><haskell>pure id <*> v = v</haskell><br />
* Homomorphism:<br /><haskell>pure f <*> pure x = pure (f x)</haskell>Intuitively, applying a non-effectful function to a non-effectful argument in an effectful context is the same as just applying the function to the argument and then injecting the result into the context with <code>pure</code>.<br />
* Interchange:<br /><haskell>u <*> pure y = pure ($ y) <*> u</haskell>Intuitively, this says that when evaluating the application of an effectful function to a pure argument, the order in which we evaluate the function and its argument doesn't matter.<br />
* Composition:<br /><haskell>u <*> (v <*> w) = pure (.) <*> u <*> v <*> w </haskell>This one is the trickiest law to gain intuition for. In some sense it is expressing a sort of associativity property of <code>(<*>)</code>. The reader may wish to simply convince themselves that this law is type-correct.<br />
<br />
Considered as left-to-right rewrite rules, the homomorphism, interchange, and composition laws actually constitute an algorithm for transforming any expression using <code>pure</code> and <code>(<*>)</code> into a canonical form with only a single use of <code>pure</code> at the very beginning and only left-nested occurrences of <code>(<*>)</code>. Composition allows reassociating <code>(<*>)</code>; interchange allows moving occurrences of <code>pure</code> leftwards; and homomorphism allows collapsing multiple adjacent occurrences of <code>pure</code> into one.<br />
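<br />
Both sides of the composition law can be evaluated with, say, the <code>Maybe</code> instance to see the reassociation at work (an added spot check, not a proof):<br />

```haskell
import Control.Applicative

-- u <*> (v <*> w), right-nested ...
rightNested :: Maybe Int
rightNested = Just (+1) <*> (Just (*2) <*> Just 5)             -- Just 11

-- ... equals the left-nested form pure (.) <*> u <*> v <*> w
leftNested :: Maybe Int
leftNested = pure (.) <*> Just (+1) <*> Just (*2) <*> Just 5   -- Just 11
```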
<br />
There is also a law specifying how <code>Applicative</code> should relate to <code>Functor</code>:<br />
<br />
<haskell><br />
fmap g x = pure g <*> x<br />
</haskell><br />
<br />
It says that mapping a pure function <code>g</code> over a context <code>x</code> is the same as first injecting <code>g</code> into a context with <code>pure</code>, and then applying it to <code>x</code> with <code>(<*>)</code>. In other words, we can decompose <code>fmap</code> into two more atomic operations: injection into a context, and application within a context. The <code>Control.Applicative</code> module also defines <code>(<$>)</code> as a synonym for <code>fmap</code>, so the above law can also be expressed as:<br />
<br />
<code>g <$> x = pure g <*> x</code>.<br />
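<br />
For example, with the <code>Maybe</code> instance both spellings give the same result (an added illustration):<br />

```haskell
import Control.Applicative

viaFmap, viaPure :: Maybe Int
viaFmap = (+1) <$> Just 3         -- Just 4
viaPure = pure (+1) <*> Just 3    -- Just 4
```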
<br />
{{Exercises|<br />
# (Tricky) One might imagine a variant of the interchange law that says something about applying a pure function to an effectful argument. Using the above laws, prove that<haskell>pure f <*> x = pure (flip ($)) <*> x <*> pure f</haskell><br />
}}<br />
<br />
==Instances==<br />
<br />
Most of the standard types which are instances of <code>Functor</code> are also instances of <code>Applicative</code>.<br />
<br />
<code>Maybe</code> can easily be made an instance of <code>Applicative</code>; writing such an instance is left as an exercise for the reader.<br />
<br />
The list type constructor <code>[]</code> can actually be made an instance of <code>Applicative</code> in two ways; essentially, it comes down to whether we want to think of lists as ordered collections of elements, or as contexts representing multiple results of a nondeterministic computation (see Wadler’s [http://www.springerlink.com/content/y7450255v2670167/ How to replace failure by a list of successes]).<br />
<br />
Let’s first consider the collection point of view. Since there can only be one instance of a given type class for any particular type, one or both of the list instances of <code>Applicative</code> need to be defined for a <code>newtype</code> wrapper; as it happens, the nondeterministic computation instance is the default, and the collection instance is defined in terms of a <code>newtype</code> called <code>ZipList</code>. This instance is:<br />
<br />
<haskell><br />
newtype ZipList a = ZipList { getZipList :: [a] }<br />
<br />
instance Applicative ZipList where<br />
pure = undefined -- exercise<br />
(ZipList gs) <*> (ZipList xs) = ZipList (zipWith ($) gs xs)<br />
</haskell><br />
<br />
To apply a list of functions to a list of inputs with <code>(<*>)</code>, we just match up the functions and inputs elementwise, and produce a list of the resulting outputs. In other words, we “zip” the lists together with function application, <code>($)</code>; hence the name <code>ZipList</code>. <br />
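<br />
For example, using the <code>ZipList</code> exported by <code>Control.Applicative</code>, which matches the instance above (an added illustration; note that <code><*></code> alone suffices here, so the <code>pure</code> left as an exercise is not needed):<br />

```haskell
import Control.Applicative (ZipList(..))

-- Each function is matched with the input in the same position
zipped :: [Int]
zipped = getZipList
  (ZipList [(+1), (*2), subtract 3] <*> ZipList [10, 20, 30])
-- [11, 40, 27]
```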
<br />
The other <code>Applicative</code> instance for lists, based on the nondeterministic computation point of view, is:<br />
<br />
<haskell><br />
instance Applicative [] where<br />
pure x = [x]<br />
gs <*> xs = [ g x | g <- gs, x <- xs ]<br />
</haskell><br />
<br />
Instead of applying functions to inputs pairwise, we apply each function to all the inputs in turn, and collect all the results in a list.<br />
<br />
Now we can write nondeterministic computations in a natural style. To add the numbers <code>3</code> and <code>4</code> deterministically, we can of course write <code>(+) 3 4</code>. But suppose instead of <code>3</code> we have a nondeterministic computation that might result in <code>2</code>, <code>3</code>, or <code>4</code>; then we can write<br />
<br />
<haskell><br />
pure (+) <*> [2,3,4] <*> pure 4<br />
</haskell><br />
<br />
or, more idiomatically,<br />
<br />
<haskell><br />
(+) <$> [2,3,4] <*> pure 4.<br />
</haskell><br />
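Both behaviours can be checked directly with GHC's actual instances (the names <code>allSums</code> and <code>pairwiseSums</code> are purely illustrative):

```haskell
import Control.Applicative (ZipList(..))

-- nondeterministic instance: every function meets every argument
allSums :: [Int]
allSums = (+) <$> [2,3,4] <*> pure 4

-- ZipList instance: functions and arguments are matched up pairwise
pairwiseSums :: [Int]
pairwiseSums = getZipList ((+) <$> ZipList [2,3,4] <*> ZipList [10,20,30])
```

Evaluating these gives <code>[6,7,8]</code> for the nondeterministic version and <code>[12,23,34]</code> for the zipping version.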
<br />
There are several other <code>Applicative</code> instances as well:<br />
<br />
* <code>IO</code> is an instance of <code>Applicative</code>, and behaves exactly as you would think: to execute <code>m1 <*> m2</code>, first <code>m1</code> is executed, resulting in a function <code>f</code>, then <code>m2</code> is executed, resulting in a value <code>x</code>, and finally the value <code>f x</code> is returned as the result of executing <code>m1 <*> m2</code>.<br />
<br />
* <code>((,) a)</code> is an <code>Applicative</code>, as long as <code>a</code> is an instance of <code>Monoid</code> ([[#Monoid|section Monoid]]). The <code>a</code> values are accumulated in parallel with the computation.<br />
<br />
* The <code>Applicative</code> module defines the <code>Const</code> type constructor; a value of type <code>Const a b</code> simply contains an <code>a</code>. This is an instance of <code>Applicative</code> for any <code>Monoid a</code>; this instance becomes especially useful in conjunction with things like <code>Foldable</code> ([[#Foldable|section Foldable]]).<br />
<br />
* The <code>WrappedMonad</code> and <code>WrappedArrow</code> newtypes make any instances of <code>Monad</code> ([[#Monad|section Monad]]) or <code>Arrow</code> ([[#Arrow|section Arrow]]) respectively into instances of <code>Applicative</code>; as we will see when we study those type classes, both are strictly more expressive than <code>Applicative</code>, in the sense that the <code>Applicative</code> methods can be implemented in terms of their methods.<br />
<br />
{{Exercises|<br />
# Implement an instance of <code>Applicative</code> for <code>Maybe</code>.<br />
# Determine the correct definition of <code>pure</code> for the <code>ZipList</code> instance of <code>Applicative</code>—there is only one implementation that satisfies the law relating <code>pure</code> and <code>(<*>)</code>.<br />
}}<br />
<br />
==Intuition==<br />
<br />
McBride and Paterson’s paper introduces the notation <math>[[g \; x_1 \; x_2 \; \cdots \; x_n]]\ </math> to denote function application in a computational context. If each <math>x_i\ </math> has type <math>f \; t_i\ </math> for some applicative functor <math>f\ </math>, and <math>g\ </math> has type <math>t_1 \to t_2 \to \dots \to t_n \to t\ </math>, then the entire expression <math>[[g \; x_1 \; \cdots \; x_n]]\ </math> has type <math>f \; t\ </math>. You can think of this as applying a function to multiple “effectful” arguments. In this sense, the double bracket notation is a generalization of <code>fmap</code>, which allows us to apply a function to a single argument in a context.<br />
<br />
Why do we need <code>Applicative</code> to implement this generalization of <code>fmap</code>? Suppose we use <code>fmap</code> to apply <code>g</code> to the first parameter <code>x1</code>. Then we get something of type <code>f (t2 -> ... -> t)</code>, but now we are stuck: we can’t apply this function-in-a-context to the next argument with <code>fmap</code>. However, this is precisely what <code>(<*>)</code> allows us to do.<br />
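The stuck-then-unstuck situation can be seen concretely with <code>Maybe</code> (the names <code>stuck</code> and <code>unstuck</code> are just illustrative):

```haskell
-- after one fmap we have a function trapped inside the context...
stuck :: Maybe (Int -> Int)
stuck = fmap (+) (Just 3)

-- ...and (<*>) is exactly what applies it to the next effectful argument
unstuck :: Maybe Int
unstuck = stuck <*> Just 4   -- Just 7
```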
<br />
This suggests the proper translation of the idealized notation <math>[[g \; x_1 \; x_2 \; \cdots \; x_n]]\ </math> into Haskell, namely<br />
<haskell><br />
g <$> x1 <*> x2 <*> ... <*> xn,<br />
</haskell><br />
<br />
recalling that <code>Control.Applicative</code> defines <code>(<$>)</code> as convenient infix shorthand for <code>fmap</code>. This is what is meant by an “applicative style”—effectful computations can still be described in terms of function application; the only difference is that we have to use the special operator <code>(<*>)</code> for application instead of simple juxtaposition.<br />
<br />
Note that <code>pure</code> allows embedding “non-effectful” arguments in the middle of an idiomatic application, like<br />
<haskell><br />
g <$> x1 <*> pure x2 <*> x3<br />
</haskell><br />
which has type <code>f d</code>, given<br />
<haskell><br />
g :: a -> b -> c -> d<br />
x1 :: f a<br />
x2 :: b<br />
x3 :: f c<br />
</haskell><br />
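A concrete instantiation of this pattern, taking <code>f</code> to be <code>Maybe</code> (the function <code>g</code> here is an arbitrary example, not anything from a library):

```haskell
g :: Int -> Int -> Int -> Int
g a b c = a * 100 + b * 10 + c

-- the middle argument is a plain value, embedded with pure
embedded :: Maybe Int
embedded = g <$> Just 1 <*> pure 2 <*> Just 3   -- Just 123
```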
<br />
The double brackets are commonly known as “idiom brackets”, because they allow writing “idiomatic” function application, that is, function application that looks normal but has some special, non-standard meaning (determined by the particular instance of <code>Applicative</code> being used). Idiom brackets are not supported by GHC, but they are supported by the [http://personal.cis.strath.ac.uk/~conor/pub/she/ Strathclyde Haskell Enhancement], a preprocessor which (among many other things) translates idiom brackets into standard uses of <code>(<$>)</code> and <code>(<*>)</code>. This can result in much more readable code when making heavy use of <code>Applicative</code>.<br />
<br />
==Alternative formulation==<br />
<br />
An alternative, equivalent formulation of <code>Applicative</code> is given by<br />
<br />
<haskell><br />
class Functor f => Monoidal f where<br />
  unit :: f ()<br />
  (**) :: f a -> f b -> f (a,b)<br />
</haskell><br />
<br />
{{note|In category-theory speak, we say <code>f</code> is a ''lax'' monoidal functor because there aren't necessarily functions in the other direction, like <code>f (a, b) -> (f a, f b)</code>.}}<br />
Intuitively, this states that a <i>monoidal</i> functor{{noteref}} is one which has some sort of "default shape" and which supports some sort of "combining" operation. <code>pure</code> and <code>(<*>)</code> are equivalent in power to <code>unit</code> and <code>(**)</code> (see the Exercises below). More technically, the idea is that <code>f</code> preserves the "monoidal structure" given by the pairing constructor <code>(,)</code> and unit type <code>()</code>. This can be seen even more clearly if we rewrite the types of <code>unit</code> and <code>(**)</code> as<br />
<haskell><br />
unit' :: () -> f ()<br />
(**') :: (f a, f b) -> f (a, b)<br />
</haskell><br />
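As a minimal sketch of what an instance looks like, here is <code>Monoidal</code> for <code>Maybe</code> (the class is repeated so the fragment stands alone, and Prelude's numeric <code>(**)</code> is hidden so we can reuse the name; none of this is in the standard libraries):

```haskell
import Prelude hiding ((**))

class Functor f => Monoidal f where
  unit :: f ()
  (**) :: f a -> f b -> f (a,b)

instance Monoidal Maybe where
  unit = Just ()
  Just a ** Just b = Just (a, b)   -- combine the two "shapes"
  _      ** _      = Nothing       -- either failure poisons the pair
```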
<br />
Furthermore, to deserve the name "monoidal" (see the [[#Monoid|section on Monoids]]), instances of <code>Monoidal</code> ought to satisfy the following laws, which seem much more straightforward than the traditional <code>Applicative</code> laws:<br />
<br />
{{note|In this and the following laws, <code>≅</code> refers to <i>isomorphism</i> rather than equality. In particular we consider <code>(x,()) ≅ x ≅ ((),x)</code> and <code>((x,y),z) ≅ (x,(y,z))</code>.}}<br />
* Left identity{{noteref}}: <haskell>unit ** v ≅ v</haskell><br />
* Right identity: <haskell>u ** unit ≅ u</haskell><br />
* Associativity: <haskell>u ** (v ** w) ≅ (u ** v) ** w</haskell><br />
<br />
These turn out to be equivalent to the usual <code>Applicative</code> laws. In a category theory setting, one would also require a naturality law:<br />
<br />
{{note|Here <code>g *** h {{=}} \(x,y) -> (g x, h y)</code>. See [[#Arrow|Arrows]].}}<br />
* Naturality: <haskell>fmap (g *** h) (u ** v) = fmap g u ** fmap h v</haskell><br />
<br />
but in the context of Haskell, this is a free theorem.<br />
<br />
Much of this section was taken from [http://blog.ezyang.com/2012/08/applicative-functors/ a blog post by Edward Z. Yang]; see his actual post for a bit more information.<br />
<br />
{{Exercises|<br />
# Implement <code>pure</code> and <code>(<*>)</code> in terms of <code>unit</code> and <code>(**)</code>, and vice versa.<br />
# Are there any <code>Applicative</code> instances for which there are also functions <code>f () -> ()</code> and <code>f (a,b) -> (f a, f b)</code>, satisfying some "reasonable" laws?<br />
# (Tricky) Prove that given your implementations from the previous exercise, the usual <code>Applicative</code> laws and the <code>Monoidal</code> laws stated above are equivalent.<br />
}}<br />
<br />
==Further reading==<br />
<br />
There are many other useful combinators in the standard libraries implemented in terms of <code>pure</code> and <code>(<*>)</code>: for example, <code>(*>)</code>, <code>(<*)</code>, <code>(<**>)</code>, <code>(<$)</code>, and so on (see [http://www.haskell.org/ghc/docs/latest/html/libraries/base-4.7.0.0/Control-Applicative.html haddock for Applicative]). Judicious use of such secondary combinators can often make code using <code>Applicative</code>s much easier to read.<br />
<br />
[http://www.soi.city.ac.uk/~ross/papers/Applicative.html McBride and Paterson’s original paper] is a treasure-trove of information and examples, as well as some perspectives on the connection between <code>Applicative</code> and category theory. Beginners will find it difficult to make it through the entire paper, but it is extremely well-motivated—even beginners will be able to glean something from reading as far as they are able.<br />
<br />
{{note|Introduced by [http://conal.net/papers/simply-reactive/ an earlier paper] that was since superseded by [http://conal.net/papers/push-pull-frp/ Push-pull functional reactive programming].}}<br />
<br />
Conal Elliott has been one of the biggest proponents of <code>Applicative</code>. For example, the [http://conal.net/papers/functional-images/ Pan library for functional images] and the reactive library for functional reactive programming (FRP) {{noteref}} make key use of it; his blog also contains [http://conal.net/blog/tag/applicative-functor many examples of <code>Applicative</code> in action]. Building on the work of McBride and Paterson, Elliott also built the [[TypeCompose]] library, which embodies the observation (among others) that <code>Applicative</code> types are closed under composition; therefore, <code>Applicative</code> instances can often be automatically derived for complex types built out of simpler ones.<br />
<br />
Although the [http://hackage.haskell.org/package/parsec Parsec parsing library] ([http://legacy.cs.uu.nl/daan/download/papers/parsec-paper.pdf paper]) was originally designed for use as a monad, in its most common use cases an <code>Applicative</code> instance can be used to great effect; [http://www.serpentine.com/blog/2008/02/06/the-basics-of-applicative-functors-put-to-practical-work/ Bryan O’Sullivan’s blog post] is a good starting point. If the extra power provided by <code>Monad</code> isn’t needed, it’s usually a good idea to use <code>Applicative</code> instead.<br />
<br />
A couple other nice examples of <code>Applicative</code> in action include the [http://web.archive.org/web/20090416111947/chrisdone.com/blog/html/2009-02-10-applicative-configfile-hsql.html ConfigFile and HSQL libraries] and the [http://groups.inf.ed.ac.uk/links/formlets/ formlets library].<br />
<br />
Gershom Bazerman's [http://comonad.com/reader/2012/abstracting-with-applicatives/ post] contains many insights into applicatives.<br />
<br />
=Monad=<br />
<br />
It’s a safe bet that if you’re reading this, you’ve heard of monads—although it’s quite possible you’ve never heard of <code>Applicative</code> before, or <code>Arrow</code>, or even <code>Monoid</code>. Why are monads such a big deal in Haskell? There are several reasons.<br />
<br />
* Haskell does, in fact, single out monads for special attention by making them the framework in which to construct I/O operations.<br />
* Haskell also singles out monads for special attention by providing a special syntactic sugar for monadic expressions: the <code>do</code>-notation.<br />
* <code>Monad</code> has been around longer than other abstract models of computation such as <code>Applicative</code> or <code>Arrow</code>.<br />
* The more monad tutorials there are, the harder people think monads must be, and the more new monad tutorials are written by people who think they finally “get” monads (the [http://byorgey.wordpress.com/2009/01/12/abstraction-intuition-and-the-monad-tutorial-fallacy/ monad tutorial fallacy]).<br />
<br />
I will let you judge for yourself whether these are good reasons.<br />
<br />
In the end, despite all the hoopla, <code>Monad</code> is just another type class. Let’s take a look at its definition.<br />
<br />
==Definition==<br />
<br />
The type class declaration for [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html#t:Monad <code>Monad</code>] is:<br />
<br />
<haskell><br />
class Monad m where<br />
  return :: a -> m a<br />
  (>>=)  :: m a -> (a -> m b) -> m b<br />
  (>>)   :: m a -> m b -> m b<br />
  m >> n = m >>= \_ -> n<br />
<br />
  fail   :: String -> m a<br />
</haskell><br />
<br />
The <code>Monad</code> type class is exported by the <code>Prelude</code>, along with a few standard instances. However, many utility functions are found in [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad.html <code>Control.Monad</code>], and there are also several instances (such as <code>((->) e)</code>) defined in [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad-Instances.html <code>Control.Monad.Instances</code>].<br />
<br />
{{note|However, as of GHC 7.10 this will be fixed!}}<br />
Let’s examine the methods in the <code>Monad</code> class one by one. The type of <code>return</code> should look familiar; it’s the same as <code>pure</code>. Indeed, <code>return</code> ''is'' <code>pure</code>, but with an unfortunate name. (Unfortunate, since someone coming from an imperative programming background might think that <code>return</code> is like the C or Java keyword of the same name, when in fact the similarities are minimal.) From a mathematical point of view, every monad is an applicative functor, but for historical reasons, the <code>Monad</code> type class declaration unfortunately does not require this.{{noteref}}<br />
<br />
We can see that <code>(>>)</code> is a specialized version of <code>(>>=)</code>, with a default implementation given. It is only included in the type class declaration so that specific instances of <code>Monad</code> can override the default implementation of <code>(>>)</code> with a more efficient one, if desired. Also, note that although <code>_ >> n = n</code> would be a type-correct implementation of <code>(>>)</code>, it would not correspond to the intended semantics: the intention is that <code>m >> n</code> ignores the ''result'' of <code>m</code>, but not its ''effects''.<br />
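To see the difference concretely, compare what the real <code>(>>)</code> gives with what the discarded <code>_ >> n = n</code> definition would give (the names here are only illustrative):

```haskell
-- with the list monad, the "effect" is multiplicity of results;
-- the real (>>) keeps it, while _ >> n = n would discard it
listEffect :: [Int]
listEffect = [1,2,3] >> [10,20]   -- [10,20,10,20,10,20], not [10,20]

-- with Maybe, the "effect" is possible failure, which is likewise kept
maybeEffect :: Maybe Int
maybeEffect = Nothing >> Just 5   -- Nothing, not Just 5
```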
<br />
The <code>fail</code> function is an awful hack that has no place in the <code>Monad</code> class; more on this later.<br />
<br />
The only really interesting thing to look at—and what makes <code>Monad</code> strictly more powerful than <code>Applicative</code>—is <code>(>>=)</code>, which is often called ''bind''. An alternative definition of <code>Monad</code> could look like:<br />
<br />
<haskell><br />
class Applicative m => Monad' m where<br />
  (>>=) :: m a -> (a -> m b) -> m b<br />
</haskell><br />
<br />
We could spend a while talking about the intuition behind <code>(>>=)</code>—and we will. But first, let’s look at some examples.<br />
<br />
==Instances==<br />
<br />
Even if you don’t understand the intuition behind the <code>Monad</code> class, you can still create instances of it by just seeing where the types lead you. You may be surprised to find that this actually gets you a long way towards understanding the intuition; at the very least, it will give you some concrete examples to play with as you read more about the <code>Monad</code> class in general. The first few examples are from the standard <code>Prelude</code>; the remaining examples are from the [http://hackage.haskell.org/package/transformers <code>transformers</code> package].<br />
<br />
<ul><br />
<li>The simplest possible instance of <code>Monad</code> is [http://hackage.haskell.org/packages/archive/mtl/1.1.0.2/doc/html/Control-Monad-Identity.html <code>Identity</code>], which is described in Dan Piponi’s highly recommended blog post on [http://blog.sigfpe.com/2007/04/trivial-monad.html The Trivial Monad]. Despite being “trivial”, it is a great introduction to the <code>Monad</code> type class, and contains some good exercises to get your brain working.<br />
</li><br />
<li>The next simplest instance of <code>Monad</code> is <code>Maybe</code>. We already know how to write <code>return</code>/<code>pure</code> for <code>Maybe</code>. So how do we write <code>(>>=)</code>? Well, let’s think about its type. Specializing for <code>Maybe</code>, we have<br />
<br />
<haskell><br />
(>>=) :: Maybe a -> (a -> Maybe b) -> Maybe b.<br />
</haskell><br />
<br />
If the first argument to <code>(>>=)</code> is <code>Just x</code>, then we have something of type <code>a</code> (namely, <code>x</code>), to which we can apply the second argument—resulting in a <code>Maybe b</code>, which is exactly what we wanted. What if the first argument to <code>(>>=)</code> is <code>Nothing</code>? In that case, we don’t have anything to which we can apply the <code>a -> Maybe b</code> function, so there’s only one thing we can do: yield <code>Nothing</code>. This instance is:<br />
<br />
<haskell><br />
instance Monad Maybe where<br />
  return = Just<br />
  (Just x) >>= g = g x<br />
  Nothing  >>= _ = Nothing<br />
</haskell><br />
<br />
We can already get a bit of intuition as to what is going on here: if we build up a computation by chaining together a bunch of functions with <code>(>>=)</code>, as soon as any one of them fails, the entire computation will fail (because <code>Nothing >>= f</code> is <code>Nothing</code>, no matter what <code>f</code> is). The entire computation succeeds only if all the constituent functions individually succeed. So the <code>Maybe</code> monad models computations which may fail.<br />
</li><br />
<br />
<li>The <code>Monad</code> instance for the list constructor <code>[]</code> is similar to its <code>Applicative</code> instance; see the exercise below.<br />
</li><br />
<br />
<li>Of course, the <code>IO</code> constructor is famously a <code>Monad</code>, but its implementation is somewhat magical, and may in fact differ from compiler to compiler. It is worth emphasizing that the <code>IO</code> monad is the ''only'' monad which is magical. It allows us to build up, in an entirely pure way, values representing possibly effectful computations. The special value <code>main</code>, of type <code>IO ()</code>, is taken by the runtime and actually executed, producing actual effects. Every other monad is functionally pure, and requires no special compiler support. We often speak of monadic values as “effectful computations”, but this is because some monads allow us to write code ''as if'' it has side effects, when in fact the monad is hiding the plumbing which allows these apparent side effects to be implemented in a functionally pure way.<br />
</li><br />
<br />
<li>As mentioned earlier, <code>((->) e)</code> is known as the ''reader monad'', since it describes computations in which a value of type <code>e</code> is available as a read-only environment.<br />
<br />
The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-Reader.html <code>Control.Monad.Reader</code>] module provides the <code>Reader e a</code> type, which is just a convenient <code>newtype</code> wrapper around <code>(e -> a)</code>, along with an appropriate <code>Monad</code> instance and some <code>Reader</code>-specific utility functions such as <code>ask</code> (retrieve the environment), <code>asks</code> (retrieve a function of the environment), and <code>local</code> (run a subcomputation under a different environment).<br />
</li><br />
<br />
<li>The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-Writer-Lazy.html <code>Control.Monad.Writer</code>] module provides the <code>Writer</code> monad, which allows information to be collected as a computation progresses. <code>Writer w a</code> is isomorphic to <code>(a,w)</code>, where the output value <code>a</code> is carried along with an annotation or “log” of type <code>w</code>, which must be an instance of <code>Monoid</code> (see [[#Monoid|section Monoid]]); the special function <code>tell</code> performs logging.<br />
</li><br />
<br />
<li>The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-State-Lazy.html <code>Control.Monad.State</code>] module provides the <code>State s a</code> type, a <code>newtype</code> wrapper around <code>s -> (a,s)</code>. Something of type <code>State s a</code> represents a stateful computation which produces an <code>a</code> but can access and modify the state of type <code>s</code> along the way. The module also provides <code>State</code>-specific utility functions such as <code>get</code> (read the current state), <code>gets</code> (read a function of the current state), <code>put</code> (overwrite the state), and <code>modify</code> (apply a function to the state).<br />
</li><br />
<br />
<li>The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-Cont.html <code>Control.Monad.Cont</code>] module provides the <code>Cont</code> monad, which represents computations in continuation-passing style. It can be used to suspend and resume computations, and to implement non-local transfers of control, co-routines, and other complex control structures—all in a functionally pure way. <code>Cont</code> has been called the [http://blog.sigfpe.com/2008/12/mother-of-all-monads.html “mother of all monads”] because of its universal properties.<br />
</li><br />
</ul><br />
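The failure-propagation behaviour of the <code>Maybe</code> instance described above can be seen in a short example (<code>safeDiv</code> is a hypothetical helper, not a library function):

```haskell
-- division that fails instead of crashing on a zero divisor
safeDiv :: Int -> Int -> Maybe Int
safeDiv _ 0 = Nothing
safeDiv x y = Just (x `div` y)

ok, bad :: Maybe Int
ok  = Just 100 >>= safeDiv 1000 >>= safeDiv 50   -- every step succeeds: Just 5
bad = Just 0   >>= safeDiv 1000 >>= safeDiv 50   -- first step fails; the rest short-circuit
```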
<br />
{{Exercises|<br />
<ol><br />
<li>Implement a <code>Monad</code> instance for the list constructor, <code>[]</code>. Follow the types!</li><br />
<li>Implement a <code>Monad</code> instance for <code>((->) e)</code>.</li><br />
<li>Implement <code>Functor</code> and <code>Monad</code> instances for <code>Free f</code>, defined as<br />
<haskell><br />
data Free f a = Var  a<br />
              | Node (f (Free f a))<br />
</haskell><br />
You may assume that <code>f</code> has a <code>Functor</code> instance. This is known as the ''free monad'' built from the functor <code>f</code>.<br />
</li><br />
</ol><br />
}}<br />
<br />
==Intuition==<br />
<br />
Let’s look more closely at the type of <code>(>>=)</code>. The basic intuition is that it combines two computations into one larger computation. The first argument, <code>m a</code>, is the first computation. However, it would be boring if the second argument were just an <code>m b</code>; then there would be no way for the computations to interact with one another (actually, this is exactly the situation with <code>Applicative</code>). So, the second argument to <code>(>>=)</code> has type <code>a -> m b</code>: a function of this type, given a ''result'' of the first computation, can produce a second computation to be run. In other words, <code>x >>= k</code> is a computation which runs <code>x</code>, and then uses the result(s) of <code>x</code> to ''decide'' what computation to run second, using the output of the second computation as the result of the entire computation.<br />
<br />
{{note|Actually, because Haskell allows general recursion, this is a lie: using a Haskell parsing library one can recursively construct ''infinite'' grammars, and hence <code>Applicative</code> (together with <code>Alternative</code>) is enough to parse any context-sensitive language with a finite alphabet. See [http://byorgey.wordpress.com/2012/01/05/parsing-context-sensitive-languages-with-applicative/ Parsing context-sensitive languages with Applicative].}}<br />
Intuitively, it is this ability to use the output from previous computations to decide what computations to run next that makes <code>Monad</code> more powerful than <code>Applicative</code>. The structure of an <code>Applicative</code> computation is fixed, whereas the structure of a <code>Monad</code> computation can change based on intermediate results. This also means that parsers built using an <code>Applicative</code> interface can only parse context-free languages; in order to parse context-sensitive languages a <code>Monad</code> interface is needed.{{noteref}}<br />
<br />
To see the increased power of <code>Monad</code> from a different point of view, let’s see what happens if we try to implement <code>(>>=)</code> in terms of <code>fmap</code>, <code>pure</code>, and <code>(<*>)</code>. We are given a value <code>x</code> of type <code>m a</code>, and a function <code>k</code> of type <code>a -> m b</code>, so the only thing we can do is apply <code>k</code> to <code>x</code>. We can’t apply it directly, of course; we have to use <code>fmap</code> to lift it over the <code>m</code>. But what is the type of <code>fmap k</code>? Well, it’s <code>m a -> m (m b)</code>. So after we apply it to <code>x</code>, we are left with something of type <code>m (m b)</code>—but now we are stuck; what we really want is an <code>m b</code>, but there’s no way to get there from here. We can ''add'' <code>m</code>’s using <code>pure</code>, but we have no way to ''collapse'' multiple <code>m</code>’s into one.<br />
<br />
{{note|1=You might hear some people claim that the definition in terms of <code>return</code>, <code>fmap</code>, and <code>join</code> is the “math definition” and the definition in terms of <code>return</code> and <code>(>>=)</code> is something specific to Haskell. In fact, both definitions were known in the mathematics community long before Haskell picked up monads.}}<br />
<br />
This ability to collapse multiple <code>m</code>’s is exactly the ability provided by the function <code>join :: m (m a) -> m a</code>, and it should come as no surprise that an alternative definition of <code>Monad</code> can be given in terms of <code>join</code>:<br />
<br />
<haskell><br />
class Applicative m => Monad'' m where<br />
  join :: m (m a) -> m a<br />
</haskell><br />
<br />
In fact, the canonical definition of monads in category theory is in terms of <code>return</code>, <code>fmap</code>, and <code>join</code> (often called <math>\eta</math>, <math>T</math>, and <math>\mu</math> in the mathematical literature). Haskell uses an alternative formulation with <code>(>>=)</code> instead of <code>join</code> since it is more convenient to use {{noteref}}. However, sometimes it can be easier to think about <code>Monad</code> instances in terms of <code>join</code>, since it is a more “atomic” operation. (For example, <code>join</code> for the list monad is just <code>concat</code>.)<br />
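A few evaluations of the standard <code>join</code> from <code>Control.Monad</code> make the “collapsing” intuition concrete (the names are illustrative only):

```haskell
import Control.Monad (join)

flattenedList :: [Int]
flattenedList = join [[1,2],[3],[]]      -- join for lists is concat: [1,2,3]

flattenedMaybe :: Maybe Int
flattenedMaybe = join (Just (Just 5))    -- Just 5

collapsedFailure :: Maybe Int
collapsedFailure = join (Just Nothing)   -- the inner failure wins: Nothing
```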
<br />
{{Exercises|<br />
# Implement <code>(>>{{=}})</code> in terms of <code>fmap</code> (or <code>liftM</code>) and <code>join</code>.<br />
# Now implement <code>join</code> and <code>fmap</code> (<code>liftM</code>) in terms of <code>(>>{{=}})</code> and <code>return</code>.<br />
}}<br />
<br />
==Utility functions==<br />
<br />
The [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad.html <code>Control.Monad</code>] module provides a large number of convenient utility functions, all of which can be implemented in terms of the basic <code>Monad</code> operations (<code>return</code> and <code>(>>=)</code> in particular). We have already seen one of them, namely, <code>join</code>. We also mention some other noteworthy ones here; implementing these utility functions oneself is a good exercise. For a more detailed guide to these functions, with commentary and example code, see Henk-Jan van Tuyl’s [http://members.chello.nl/hjgtuyl/tourdemonad.html tour].<br />
<br />
{{note|This will most likely change in Haskell 2014 with the implementation of the [[Functor-Applicative-Monad_Proposal|Haskell 2014 Applicative => Monad proposal]].}}<br />
<br />
* <code>liftM :: Monad m => (a -> b) -> m a -> m b</code>. This should be familiar; of course, it is just <code>fmap</code>. The fact that we have both <code>fmap</code> and <code>liftM</code> is an unfortunate consequence of the fact that the <code>Monad</code> type class does not require a <code>Functor</code> instance, even though mathematically speaking, every monad is a functor. However, <code>fmap</code> and <code>liftM</code> are essentially interchangeable, since it is a bug (in a social rather than technical sense) for any type to be an instance of <code>Monad</code> without also being an instance of <code>Functor</code> {{noteref}}.<br />
<br />
* <code>ap :: Monad m => m (a -> b) -> m a -> m b</code> should also be familiar: it is equivalent to <code>(<*>)</code>, justifying the claim that the <code>Monad</code> interface is strictly more powerful than <code>Applicative</code>. We can make any <code>Monad</code> into an instance of <code>Applicative</code> by setting <code>pure = return</code> and <code>(<*>) = ap</code>.<br />
<br />
* <code>sequence :: Monad m => [m a] -> m [a]</code> takes a list of computations and combines them into one computation which collects a list of their results. It is again something of a historical accident that <code>sequence</code> has a <code>Monad</code> constraint, since it can actually be implemented using only the <code>Applicative</code> interface. There is an additional generalization of <code>sequence</code> to structures other than lists, which will be discussed in the [[#Traversable|section on <code>Traversable</code>]].<br />
<br />
* <code>replicateM :: Monad m => Int -> m a -> m [a]</code> is simply a combination of [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html#v:replicate <code>replicate</code>] and <code>sequence</code>.<br />
<br />
* <code>when :: Monad m => Bool -> m () -> m ()</code> conditionally executes a computation, evaluating to its second argument if the test is <code>True</code>, and to <code>return ()</code> if the test is <code>False</code>. A collection of other sorts of monadic conditionals can be found in the [http://hackage.haskell.org/package/IfElse <code>IfElse</code> package].<br />
<br />
* <code>mapM :: Monad m => (a -> m b) -> [a] -> m [b]</code> maps its first argument over the second, and <code>sequence</code>s the results. The <code>forM</code> function is just <code>mapM</code> with its arguments reversed; it is called <code>forM</code> since it models generalized <code>for</code> loops: the list <code>[a]</code> provides the loop indices, and the function <code>a -> m b</code> specifies the “body” of the loop for each index.<br />
<br />
* <code>(=<<) :: Monad m => (a -> m b) -> m a -> m b</code> is just <code>(>>=)</code> with its arguments reversed; sometimes this direction is more convenient since it corresponds more closely to function application.<br />
<br />
* <code>(>=>) :: Monad m => (a -> m b) -> (b -> m c) -> a -> m c</code> is sort of like function composition, but with an extra <code>m</code> on the result type of each function, and the arguments swapped. We’ll have more to say about this operation later. There is also a flipped variant, <code>(<=<)</code>.<br />
<br />
* The <code>guard</code> function is for use with instances of <code>MonadPlus</code>, which is discussed at the end of the [[#Monoid|<code>Monoid</code> section]].<br />
<br />
Many of these functions also have “underscored” variants, such as <code>sequence_</code> and <code>mapM_</code>; these variants throw away the results of the computations passed to them as arguments, using them only for their side effects.<br />
<br />
Other monadic functions which are occasionally useful include <code>filterM</code>, <code>zipWithM</code>, <code>foldM</code>, and <code>forever</code>.<br />
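Because these combinators work for ''any'' monad, they often do surprising double duty; a few examples with the standard list and <code>Maybe</code> instances (the helper names are only illustrative):

```haskell
import Control.Monad (replicateM, filterM)

-- replicateM in the list monad enumerates all length-2 strings over "ab"
twoLetterWords :: [String]
twoLetterWords = replicateM 2 "ab"   -- ["aa","ab","ba","bb"]

-- mapM in the Maybe monad succeeds only if every element does
allPositive :: [Int] -> Maybe [Int]
allPositive = mapM (\x -> if x > 0 then Just x else Nothing)

-- filterM in the list monad chooses keep/drop for each element,
-- yielding every subsequence
subsequences' :: [a] -> [[a]]
subsequences' = filterM (const [True, False])
```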
<br />
==Laws==<br />
<br />
There are several laws that instances of <code>Monad</code> should satisfy (see also the [[Monad laws]] wiki page). The standard presentation is:<br />
<br />
<haskell><br />
return a >>= k = k a<br />
m >>= return = m<br />
m >>= (\x -> k x >>= h) = (m >>= k) >>= h<br />
<br />
fmap f xs = xs >>= return . f = liftM f xs<br />
</haskell><br />
<br />
The first and second laws express the fact that <code>return</code> behaves nicely: if we inject a value <code>a</code> into a monadic context with <code>return</code>, and then bind to <code>k</code>, it is the same as just applying <code>k</code> to <code>a</code> in the first place; if we bind a computation <code>m</code> to <code>return</code>, nothing changes. The third law essentially says that <code>(>>=)</code> is associative, sort of. The last law ensures that <code>fmap</code> and <code>liftM</code> are the same for types which are instances of both <code>Functor</code> and <code>Monad</code>—which, as already noted, should be every instance of <code>Monad</code>.<br />
<br />
{{note|I like to pronounce this operator “fish”.}}<br />
<br />
However, the presentation of the above laws, especially the third, is marred by the asymmetry of <code>(>>=)</code>. It’s hard to look at the laws and see what they’re really saying. I prefer a much more elegant version of the laws, which is formulated in terms of <code>(>=>)</code> {{noteref}}. Recall that <code>(>=>)</code> “composes” two functions of type <code>a -> m b</code> and <code>b -> m c</code>. You can think of something of type <code>a -> m b</code> (roughly) as a function from <code>a</code> to <code>b</code> which may also have some sort of effect in the context corresponding to <code>m</code>. <code>(>=>)</code> lets us compose these “effectful functions”, and we would like to know what properties <code>(>=>)</code> has. The monad laws reformulated in terms of <code>(>=>)</code> are:<br />
<br />
<haskell><br />
return >=> g = g<br />
g >=> return = g<br />
(g >=> h) >=> k = g >=> (h >=> k)<br />
</haskell><br />
<br />
{{note|As fans of category theory will note, these laws say precisely that functions of type <code>a -> m b</code> are the arrows of a category with <code>(>{{=}}>)</code> as composition! Indeed, this is known as the ''Kleisli category'' of the monad <code>m</code>. It will come up again when we discuss <code>Arrow</code>s.}}<br />
<br />
Ah, much better! The laws simply state that <code>return</code> is the identity of <code>(>=>)</code>, and that <code>(>=>)</code> is associative {{noteref}}.<br />
<br />
There is also a formulation of the monad laws in terms of <code>fmap</code>, <code>return</code>, and <code>join</code>; for a discussion of this formulation, see the Haskell [http://en.wikibooks.org/wiki/Haskell/Category_theory wikibook page on category theory].<br />
<br />
{{Exercises|<br />
# Given the definition <code>g >{{=}}> h {{=}} \x -> g x >>{{=}} h</code>, prove the equivalence of the above laws and the usual monad laws.<br />
}}<br />
<br />
==<code>do</code> notation==<br />
<br />
Haskell’s special <code>do</code> notation supports an “imperative style” of programming by providing syntactic sugar for chains of monadic expressions. The genesis of the notation lies in realizing that something like <code>a >>= \x -> b >> c >>= \y -> d </code> can be more readably written by putting successive computations on separate lines:<br />
<br />
<haskell><br />
a >>= \x -><br />
b >><br />
c >>= \y -><br />
d<br />
</haskell><br />
<br />
This emphasizes that the overall computation consists of four computations <code>a</code>, <code>b</code>, <code>c</code>, and <code>d</code>, and that <code>x</code> is bound to the result of <code>a</code>, and <code>y</code> is bound to the result of <code>c</code> (<code>b</code>, <code>c</code>, and <code>d</code> are allowed to refer to <code>x</code>, and <code>d</code> is allowed to refer to <code>y</code> as well). From here it is not hard to imagine a nicer notation:<br />
<br />
<haskell><br />
do { x <- a<br />
; b<br />
; y <- c<br />
; d<br />
}<br />
</haskell><br />
<br />
(The curly braces and semicolons may optionally be omitted; the Haskell parser uses layout to determine where they should be inserted.) This discussion should make clear that <code>do</code> notation is just syntactic sugar. In fact, <code>do</code> blocks are recursively translated into monad operations (almost) like this:<br />
<br />
<pre><br />
do e → e<br />
do { e; stmts } → e >> do { stmts }<br />
do { v <- e; stmts } → e >>= \v -> do { stmts }<br />
do { let decls; stmts} → let decls in do { stmts }<br />
</pre><br />
<br />
This is not quite the whole story, since <code>v</code> might be a pattern instead of a variable. For example, one can write<br />
<br />
<haskell><br />
do (x:xs) <- foo<br />
bar x<br />
</haskell><br />
<br />
but what happens if <code>foo</code> produces an empty list? Well, remember that ugly <code>fail</code> function in the <code>Monad</code> type class declaration? That’s what happens. See [http://www.haskell.org/onlinereport/exps.html#sect3.14 section 3.14 of the Haskell Report] for the full details. See also the discussion of <code>MonadPlus</code> and <code>MonadZero</code> in the [[#Other monoidal classes: Alternative, MonadPlus, ArrowPlus|section on other monoidal classes]].<br />
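In the list monad, for instance, <code>fail</code> produces the empty list, so elements that do not match the pattern are silently skipped (the name <code>heads</code> is made up for illustration):<br />

```haskell
-- The pattern (x:_) fails on an empty inner list; in the list monad,
-- fail _ = [], so empty inner lists are simply skipped.
heads :: [[a]] -> [a]
heads xss = do
  (x : _) <- xss
  return x
```

For example, <code>heads [[1,2],[],[3]]</code> is <code>[1,3]</code>: the empty list triggers <code>fail</code>, which contributes nothing to the result rather than crashing.<br />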
<br />
A final note on intuition: <code>do</code> notation plays very strongly to the “computational context” point of view rather than the “container” point of view, since the binding notation <code>x <- m</code> is suggestive of “extracting” a single <code>x</code> from <code>m</code> and doing something with it. But <code>m</code> may represent some sort of a container, such as a list or a tree; the meaning of <code>x <- m</code> is entirely dependent on the implementation of <code>(>>=)</code>. For example, if <code>m</code> is a list, <code>x <- m</code> actually means that <code>x</code> will take on each value from the list in turn.<br />
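For instance, the following two definitions are equivalent (the second is what the first desugars to, following the translation rules above), and in the list monad both produce all pairs:<br />

```haskell
-- Sugared: x and y "take on each value" from xs and ys in turn.
pairs :: [Int] -> [Int] -> [(Int, Int)]
pairs xs ys = do
  x <- xs
  y <- ys
  return (x, y)

-- Desugared into explicit binds:
pairs' :: [Int] -> [Int] -> [(Int, Int)]
pairs' xs ys = xs >>= \x -> ys >>= \y -> return (x, y)
```

Here <code>pairs [1,2] [3,4]</code> is <code>[(1,3),(1,4),(2,3),(2,4)]</code>, the same as <code>pairs' [1,2] [3,4]</code>.<br />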
<br />
==Further reading==<br />
<br />
Philip Wadler was the first to propose using monads to structure functional programs. [http://homepages.inf.ed.ac.uk/wadler/topics/monads.html His paper] is still a readable introduction to the subject.<br />
<br />
{{note|1=<br />
[[All About Monads]],<br />
[http://www.haskell.org/haskellwiki/Monads_as_Containers Monads as containers],<br />
[http://en.wikibooks.org/w/index.php?title=Haskell/Understanding_monads Understanding monads],<br />
[[The Monadic Way]],<br />
[http://blog.sigfpe.com/2006/08/you-could-have-invented-monads-and.html You Could Have Invented Monads! (And Maybe You Already Have.)],<br />
[http://www.haskell.org/pipermail/haskell-cafe/2006-November/019190.html there’s a monster in my Haskell!],<br />
[http://kawagner.blogspot.com/2007/02/understanding-monads-for-real.html Understanding Monads. For real.],<br />
[http://www.randomhacks.net/articles/2007/03/12/monads-in-15-minutes Monads in 15 minutes: Backtracking and Maybe],<br />
[http://www.haskell.org/haskellwiki/Monads_as_computation Monads as computation],<br />
[http://metafoo.co.uk/practical-monads.txt Practical Monads]}}<br />
<br />
There are, of course, numerous monad tutorials of varying quality {{noteref}}.<br />
<br />
A few of the best include Cale Gibbard’s [http://www.haskell.org/haskellwiki/Monads_as_Containers Monads as containers] and [http://www.haskell.org/haskellwiki/Monads_as_computation Monads as computation]; Jeff Newbern’s [[All About Monads]], a comprehensive guide with lots of examples; and Dan Piponi’s [http://blog.sigfpe.com/2006/08/you-could-have-invented-monads-and.html You Could Have Invented Monads!], which features great exercises. If you just want to know how to use <code>IO</code>, you could consult the [[Introduction to IO]]. Even this is just a sampling; the [[monad tutorials timeline]] is a more complete list. (All these monad tutorials have prompted parodies like [http://koweycode.blogspot.com/2007/01/think-of-monad.html think of a monad ...] as well as other kinds of backlash like [http://ahamsandwich.wordpress.com/2007/07/26/monads-and-why-monad-tutorials-are-all-awful/ Monads! (and Why Monad Tutorials Are All Awful)] or [http://byorgey.wordpress.com/2009/01/12/abstraction-intuition-and-the-monad-tutorial-fallacy/ Abstraction, intuition, and the “monad tutorial fallacy”].)<br />
<br />
Other good monad references which are not necessarily tutorials include [http://members.chello.nl/hjgtuyl/tourdemonad.html Henk-Jan van Tuyl’s tour] of the functions in <code>Control.Monad</code>, Dan Piponi’s [http://blog.sigfpe.com/2006/10/monads-field-guide.html field guide], Tim Newsham’s [http://www.thenewsh.com/~newsham/haskell/monad.html What’s a Monad?], and Chris Smith's excellent article [http://cdsmith.wordpress.com/2012/04/18/why-do-monads-matter/ Why Do Monads Matter?]. There are also many blog posts which have been written on various aspects of monads; a collection of links can be found under [[Blog articles/Monads]].<br />
<br />
For help constructing monads from scratch, and for obtaining a "deep embedding" of monad operations suitable for use in, say, compiling a domain-specific language, see [http://projects.haskell.org/operational Apfelmus's operational package].<br />
<br />
One of the quirks of the <code>Monad</code> class and the Haskell type system is that it is not possible to straightforwardly declare <code>Monad</code> instances for types which require a class constraint on their data, even if they are monads from a mathematical point of view. For example, <code>Data.Set</code> requires an <code>Ord</code> constraint on its data, so it cannot be easily made an instance of <code>Monad</code>. A solution to this problem was [http://www.randomhacks.net/articles/2007/03/15/data-set-monad-haskell-macros first described by Eric Kidd], and later made into a [http://hackage.haskell.org/cgi-bin/hackage-scripts/package/rmonad library named rmonad] by Ganesh Sittampalam and Peter Gavin.<br />
<br />
There are many good reasons for eschewing <code>do</code> notation; some have gone so far as to [[Do_notation_considered_harmful|consider it harmful]].<br />
<br />
Monads can be generalized in various ways; for an exposition of one possibility, see Robert Atkey’s paper on [http://homepages.inf.ed.ac.uk/ratkey/paramnotions-jfp.pdf parameterized monads], or Dan Piponi’s [http://blog.sigfpe.com/2009/02/beyond-monads.html Beyond Monads].<br />
<br />
For the categorically inclined, monads can be viewed as monoids ([http://blog.sigfpe.com/2008/11/from-monoids-to-monads.html From Monoids to Monads]) and also as closure operators [http://blog.plover.com/math/monad-closure.html Triples and Closure]. Derek Elkins’s article in [http://www.haskell.org/wikiupload/8/85/TMR-Issue13.pdf issue 13 of the Monad.Reader] contains an exposition of the category-theoretic underpinnings of some of the standard <code>Monad</code> instances, such as <code>State</code> and <code>Cont</code>. Jonathan Hill and Keith Clarke have [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.53.6497 an early paper explaining the connection between monads as they arise in category theory and as used in functional programming]. There is also a [http://okmij.org/ftp/Computation/IO-monad-history.html web page by Oleg Kiselyov] explaining the history of the IO monad.<br />
<br />
Links to many more research papers related to monads can be found under [[Research papers/Monads and arrows]].<br />
<br />
=Monad transformers=<br />
<br />
One would often like to be able to combine two monads into one: for example, to have stateful, nondeterministic computations (<code>State</code> + <code>[]</code>), or computations which may fail and can consult a read-only environment (<code>Maybe</code> + <code>Reader</code>), and so on. Unfortunately, monads do not compose as nicely as applicative functors (yet another reason to use <code>Applicative</code> if you don’t need the full power that <code>Monad</code> provides), but some monads can be combined in certain ways.<br />
<br />
==Standard monad transformers==<br />
<br />
The [http://hackage.haskell.org/package/transformers transformers] library provides a number of standard ''monad transformers''. Each monad transformer adds a particular capability/feature/effect to any existing monad.<br />
<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Identity.html <code>IdentityT</code>] is the identity transformer, which maps a monad to (something isomorphic to) itself. This may seem useless at first glance, but it is useful for the same reason that the <code>id</code> function is useful -- it can be passed as an argument to things which are parameterized over an arbitrary monad transformer, when you do not actually want any extra capabilities.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-State.html <code>StateT</code>] adds a read-write state.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Reader.html <code>ReaderT</code>] adds a read-only environment.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Writer.html <code>WriterT</code>] adds a write-only log.<br />
* [http://hackage.haskell.org/packages/archive/transformers/0.2.2.0/doc/html/Control-Monad-Trans-RWS.html <code>RWST</code>] conveniently combines <code>ReaderT</code>, <code>WriterT</code>, and <code>StateT</code> into one.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Maybe.html <code>MaybeT</code>] adds the possibility of failure.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Error.html <code>ErrorT</code>] adds the possibility of failure with an arbitrary type to represent errors.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-List.html <code>ListT</code>] adds non-determinism (however, see the discussion of <code>ListT</code> below).<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Cont.html <code>ContT</code>] adds continuation handling.<br />
<br />
For example, <code>StateT s Maybe</code> is an instance of <code>Monad</code>; computations of type <code>StateT s Maybe a</code> may fail, and have access to a mutable state of type <code>s</code>. Monad transformers can be multiply stacked. One thing to keep in mind while using monad transformers is that the order of composition matters. For example, when a <code>StateT s Maybe a</code> computation fails, the state ceases being updated (indeed, it simply disappears); on the other hand, the state of a <code>MaybeT (State s) a</code> computation may continue to be modified even after the computation has "failed". This may seem backwards, but it is correct. Monad transformers build composite monads “inside out”; <code>MaybeT (State s) a</code> is isomorphic to <code>s -> (Maybe a, s)</code>. (Lambdabot has an indispensable <code>@unmtl</code> command which you can use to “unpack” a monad transformer stack in this way.)<br />
Intuitively, the monads become "more fundamental" the further inside the stack you get, and the effects of inner monads "have precedence" over the effects of outer ones. Of course, this is just handwaving, and if you are unsure of the proper order for some monads you wish to combine, there is no substitute for using <code>@unmtl</code> or simply trying out the various options.<br />
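As a small sketch of the <code>StateT s Maybe</code> behavior described above (using the <code>transformers</code> package; <code>pop</code> is a made-up name):<br />

```haskell
import Control.Monad.Trans.Class (lift)
import Control.Monad.Trans.State

-- Pop an element from a stack kept in the state; the whole computation
-- fails (via the inner Maybe) if the stack is empty.
pop :: StateT [Int] Maybe Int
pop = do
  xs <- get
  case xs of
    []       -> lift Nothing
    (y : ys) -> put ys >> return y
```

Here <code>runStateT (pop >> pop) [1,2,3]</code> gives <code>Just (2,[3])</code>, while <code>runStateT (pop >> pop) [1]</code> gives <code>Nothing</code> -- when the computation fails, the state disappears along with it.<br />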
<br />
==Definition and laws==<br />
<br />
All monad transformers should implement the <code>MonadTrans</code> type class, defined in <code>Control.Monad.Trans.Class</code>:<br />
<br />
<haskell><br />
class MonadTrans t where<br />
lift :: Monad m => m a -> t m a<br />
</haskell><br />
<br />
It allows arbitrary computations in the base monad <code>m</code> to be “lifted” into computations in the transformed monad <code>t m</code>. (Note that type application associates to the left, just like function application, so <code>t m a = (t m) a</code>.)<br />
<br />
<code>lift</code> must satisfy the laws<br />
<haskell><br />
lift . return = return<br />
lift (m >>= f) = lift m >>= (lift . f)<br />
</haskell><br />
which intuitively state that <code>lift</code> transforms <code>m a</code> computations into <code>t m a</code> computations in a "sensible" way, which sends the <code>return</code> and <code>(>>=)</code> of <code>m</code> to the <code>return</code> and <code>(>>=)</code> of <code>t m</code>.<br />
<br />
{{Exercises|<br />
# What is the kind of <code>t</code> in the declaration of <code>MonadTrans</code>?<br />
}}<br />
<br />
==Transformer type classes and "capability" style==<br />
<br />
{{note|The only problem with this scheme is the quadratic number of instances required as the number of standard monad transformers grows—but as the current set of standard monad transformers seems adequate for most common use cases, this may not be that big of a deal.}}<br />
<br />
There are also type classes (provided by the [http://hackage.haskell.org/package/mtl <code>mtl</code> package]) for the operations of each transformer. For example, the <code>MonadState</code> type class provides the state-specific methods <code>get</code> and <code>put</code>, allowing you to conveniently use these methods not only with <code>State</code>, but with any monad which is an instance of <code>MonadState</code>—including <code>MaybeT (State s)</code>, <code>StateT s (ReaderT r IO)</code>, and so on. Similar type classes exist for <code>Reader</code>, <code>Writer</code>, <code>Cont</code>, <code>IO</code>, and others {{noteref}}.<br />
<br />
These type classes serve two purposes. First, they get rid of (most of) the need for explicitly using <code>lift</code>, giving a type-directed way to automatically determine the right number of calls to <code>lift</code>. Simply writing <code>put</code> will be automatically translated into <code>lift . put</code>, <code>lift . lift . put</code>, or something similar depending on what concrete monad stack you are using.<br />
<br />
Second, they give you more flexibility to switch between different concrete monad stacks. For example, if you are writing a state-based algorithm, don't write<br />
<haskell><br />
foo :: State Int Char<br />
foo = modify (*2) >> return 'x'<br />
</haskell><br />
but rather<br />
<haskell><br />
foo :: MonadState Int m => m Char<br />
foo = modify (*2) >> return 'x'<br />
</haskell><br />
Now, if somewhere down the line you realize you need to introduce the possibility of failure, you might switch from <code>State Int</code> to <code>MaybeT (State Int)</code>. The type of the first version of <code>foo</code> would need to be modified to reflect this change, but the second version of <code>foo</code> can still be used as-is.<br />
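To see this flexibility in action, here is the capability-style <code>foo</code> run at two different concrete stacks without modification (a sketch using the <code>mtl</code> package, which provides the necessary <code>MonadState</code> instances):<br />

```haskell
import Control.Monad.State
import Control.Monad.Trans.Maybe (MaybeT, runMaybeT)

foo :: MonadState Int m => m Char
foo = modify (*2) >> return 'x'

-- foo instantiated at the plain State monad:
runPlain :: (Char, Int)
runPlain = runState foo 5

-- The same foo instantiated at MaybeT (State Int), no changes needed:
runWithFailure :: (Maybe Char, Int)
runWithFailure = runState (runMaybeT foo) 5
```

Both runs double the state: <code>runPlain</code> is <code>('x',10)</code> and <code>runWithFailure</code> is <code>(Just 'x',10)</code>.<br />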
<br />
However, this sort of "capability-based" style (<i>e.g.</i> specifying that <code>foo</code> works for any monad with the "state capability") quickly runs into problems when you try to naively scale it up: for example, what if you need to maintain two independent states? A framework for solving this and related problems is described by Schrijvers and Olivera ([http://users.ugent.be/~tschrijv/Research/papers/icfp2011.pdf Monads, zippers and views: virtualizing the monad stack, ICFP 2011]) and is implemented in the [http://hackage.haskell.org/package/Monatron <code>Monatron</code> package].<br />
<br />
==Composing monads==<br />
<br />
Is the composition of two monads always a monad? As hinted previously, the answer is no.<br />
<br />
Since <code>Applicative</code> functors are closed under composition, the problem must lie with <code>join</code>. Indeed, suppose <code>m</code> and <code>n</code> are arbitrary monads; to make a monad out of their composition we would need to be able to implement<br />
<haskell><br />
join :: m (n (m (n a))) -> m (n a)<br />
</haskell><br />
but it is not clear how this could be done in general. The <code>join</code> method for <code>m</code> is no help, because the two occurrences of <code>m</code> are not next to each other (and likewise for <code>n</code>).<br />
<br />
However, one situation in which it can be done is if <code>n</code> ''distributes'' over <code>m</code>, that is, if there is a function<br />
<haskell><br />
distrib :: n (m a) -> m (n a)<br />
</haskell><br />
satisfying certain laws. See Jones and Duponcheel ([http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.42.2605 Composing Monads]); see also the [[#Traversable|section on Traversable]].<br />
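For a concrete instance of such a distributive law, take <code>n = Maybe</code>: <code>Maybe</code> distributes over ''any'' monad <code>m</code>, which is precisely what makes <code>MaybeT m</code> a monad for arbitrary <code>m</code>. (The name <code>distribMaybe</code> is made up; this is just <code>sequence</code> restricted to <code>Maybe</code>.)<br />

```haskell
-- Maybe distributes over any monad m: run the inner computation if
-- there is one, and rewrap its result.
distribMaybe :: Monad m => Maybe (m a) -> m (Maybe a)
distribMaybe Nothing  = return Nothing
distribMaybe (Just m) = fmap Just m
```

For example, with <code>m = []</code>, <code>distribMaybe (Just [1,2])</code> is <code>[Just 1, Just 2]</code> and <code>distribMaybe Nothing</code> is <code>[Nothing]</code>.<br />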
<br />
For a much more in-depth discussion and analysis of the failure of monads to be closed under composition, see [http://stackoverflow.com/questions/13034229/concrete-example-showing-that-monads-are-not-closed-under-composition-with-proo?lq=1 this question on StackOverflow].<br />
<br />
{{Exercises|<br />
* Implement <code>join :: M (N (M (N a))) -> M (N a)</code>, given <code>distrib :: N (M a) -> M (N a)</code> and assuming <code>M</code> and <code>N</code> are instances of <code>Monad</code>.<br />
}}<br />
<br />
==Further reading==<br />
<br />
Much of the monad transformer library (originally [http://hackage.haskell.org/package/mtl <code>mtl</code>], now split between <code>mtl</code> and [http://hackage.haskell.org/package/transformers <code>transformers</code>]), including the <code>Reader</code>, <code>Writer</code>, <code>State</code>, and other monads, as well as the monad transformer framework itself, was inspired by Mark Jones’s classic paper [http://web.cecs.pdx.edu/~mpj/pubs/springschool.html Functional Programming with Overloading and Higher-Order Polymorphism]. It’s still very much worth a read—and highly readable—after almost fifteen years.<br />
<br />
See [http://article.gmane.org/gmane.comp.lang.haskell.libraries/17139 Edward Kmett's mailing list message] for a description of the history and relationships among monad transformer packages (<code>mtl</code>, <code>transformers</code>, <code>monads-fd</code>, <code>monads-tf</code>).<br />
<br />
There are two excellent references on monad transformers. Martin Grabmüller’s [http://www.grabmueller.de/martin/www/pub/Transformers.en.html Monad Transformers Step by Step] is a thorough description, with running examples, of how to use monad transformers to elegantly build up computations with various effects. [http://cale.yi.org/index.php/How_To_Use_Monad_Transformers Cale Gibbard’s article] on how to use monad transformers is more practical, describing how to structure code using monad transformers to make writing it as painless as possible. Another good starting place for learning about monad transformers is a [http://blog.sigfpe.com/2006/05/grok-haskell-monad-transformers.html blog post by Dan Piponi].<br />
<br />
The <code>ListT</code> transformer from the <code>transformers</code> package comes with the caveat that <code>ListT m</code> is only a monad when <code>m</code> is ''commutative'', that is, when <code>ma >>= \a -> mb >>= \b -> foo</code> is equivalent to <code>mb >>= \b -> ma >>= \a -> foo</code> (i.e. the order of <code>m</code>'s effects does not matter). For one explanation why, see Dan Piponi's blog post [http://blog.sigfpe.com/2006/11/why-isnt-listt-monad.html "Why isn't <code><nowiki>ListT []</nowiki></code> a monad"]. For more examples, as well as a design for a version of <code>ListT</code> which does not have this problem, see [http://www.haskell.org/haskellwiki/ListT_done_right <code>ListT</code> done right].<br />
<br />
There is an alternative way to compose monads, using coproducts, as described by [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.8.3581 Lüth and Ghani]. This method is interesting but has not (yet?) seen widespread use. For a more recent alternative, see Kiselyov et al's [http://okmij.org/ftp/Haskell/extensible/exteff.pdf Extensible Effects: An Alternative to Monad Transformers].<br />
<br />
=MonadFix=<br />
<br />
''Note: <code>MonadFix</code> is included here for completeness (and because it is interesting) but seems not to be used much. Skipping this section on a first read-through is perfectly OK (and perhaps even recommended).''<br />
<br />
==<code>mdo</code>/<code>do rec</code> notation==<br />
<br />
{{note|In GHC 7.6, the flag has been changed to <code>-XRecursiveDo</code>.}}<br />
The <code>MonadFix</code> class describes monads which support the special fixpoint operation <code>mfix :: (a -> m a) -> m a</code>, which allows the output of monadic computations to be defined via (effectful) recursion. This is [http://www.haskell.org/ghc/docs/latest/html/users_guide/syntax-extns.html#recursive-do-notation supported in GHC] by a special “recursive do” notation, enabled by the <code>-XDoRec</code> flag{{noteref}}. Within a <code>do</code> block, one may have a nested <code>rec</code> block, like so:<br />
<haskell><br />
do { x <- foo<br />
; rec { y <- baz<br />
; z <- bar<br />
; bob<br />
}<br />
; w <- frob<br />
}<br />
</haskell><br />
Normally (if we had <code>do</code> in place of <code>rec</code> in the above example), <code>y</code> would be in scope in <code>bar</code> and <code>bob</code> but not in <code>baz</code>, and <code>z</code> would be in scope only in <code>bob</code>. With the <code>rec</code>, however, <code>y</code> and <code>z</code> are both in scope in all three of <code>baz</code>, <code>bar</code>, and <code>bob</code>. A <code>rec</code> block is analogous to a <code>let</code> block such as<br />
<haskell><br />
let { y = baz<br />
; z = bar<br />
}<br />
in bob<br />
</haskell><br />
because, in Haskell, every variable bound in a <code>let</code>-block is in scope throughout the entire block. (From this point of view, Haskell's normal <code>do</code> blocks are analogous to Scheme's <code>let*</code> construct.)<br />
<br />
What could such a feature be used for? One of the motivating examples given in the original paper describing <code>MonadFix</code> (see below) is encoding circuit descriptions. A line in a <code>do</code>-block such as <br />
<haskell><br />
x <- gate y z<br />
</haskell><br />
describes a gate whose input wires are labeled <code>y</code> and <code>z</code> and whose output wire is labeled <code>x</code>. Many (most?) useful circuits, however, involve some sort of feedback loop, making them impossible to write in a normal <code>do</code>-block (since some wire would have to be mentioned as an input ''before'' being listed as an output). Using a <code>rec</code> block solves this problem.<br />
<br />
==Examples and intuition==<br />
<br />
Of course, not every monad supports such recursive binding. However, as mentioned above, it suffices to have an implementation of <code>mfix :: (a -> m a) -> m a</code>, satisfying a few laws. Let's try implementing <code>mfix</code> for the <code>Maybe</code> monad. That is, we want to implement a function<br />
<haskell><br />
maybeFix :: (a -> Maybe a) -> Maybe a<br />
</haskell><br />
{{note|Actually, <code>fix</code> is implemented slightly differently for efficiency reasons; but the given definition is equivalent and simpler for the present purpose.}}<br />
Let's think for a moment about the implementation {{noteref}} of the non-monadic <code>fix :: (a -> a) -> a</code>:<br />
<haskell><br />
fix f = f (fix f)<br />
</haskell><br />
Inspired by <code>fix</code>, our first attempt at implementing <code>maybeFix</code> might be something like<br />
<haskell><br />
maybeFix :: (a -> Maybe a) -> Maybe a<br />
maybeFix f = maybeFix f >>= f<br />
</haskell><br />
This has the right type. However, something seems wrong: there is nothing in particular here about <code>Maybe</code>; <code>maybeFix</code> actually has the more general type <code>Monad m => (a -> m a) -> m a</code>. But didn't we just say that not all monads support <code>mfix</code>?<br />
<br />
The answer is that although this implementation of <code>maybeFix</code> has the right type, it does ''not'' have the intended semantics. If we think about how <code>(>>=)</code> works for the <code>Maybe</code> monad (by pattern-matching on its first argument to see whether it is <code>Nothing</code> or <code>Just</code>) we can see that this definition of <code>maybeFix</code> is completely useless: it will just recurse infinitely, trying to decide whether it is going to return <code>Nothing</code> or <code>Just</code>, without ever even so much as a glance in the direction of <code>f</code>.<br />
<br />
The trick is to simply ''assume'' that <code>maybeFix</code> will return <code>Just</code>, and get on with life!<br />
<haskell><br />
maybeFix :: (a -> Maybe a) -> Maybe a<br />
maybeFix f = ma<br />
where ma = f (fromJust ma)<br />
</haskell><br />
This says that the result of <code>maybeFix</code> is <code>ma</code>, and assuming that <code>ma = Just x</code>, it is defined (recursively) to be equal to <code>f x</code>.<br />
<br />
Why is this OK? Isn't <code>fromJust</code> almost as bad as <code>unsafePerformIO</code>? Well, usually, yes. This is just about the only situation in which it is justified! The interesting thing to note is that <code>maybeFix</code> ''will never crash'' -- although it may, of course, fail to terminate. The only way we could get a crash is if we try to evaluate <code>fromJust ma</code> when we know that <code>ma = Nothing</code>. But how could we know <code>ma = Nothing</code>? Since <code>ma</code> is defined as <code>f (fromJust ma)</code>, it must be that this expression has already been evaluated to <code>Nothing</code> -- in which case there is no reason for us to be evaluating <code>fromJust ma</code> in the first place! <br />
<br />
To see this from another point of view, we can consider three possibilities. First, if <code>f</code> outputs <code>Nothing</code> without looking at its argument, then <code>maybeFix f</code> clearly returns <code>Nothing</code>. Second, if <code>f</code> always outputs <code>Just x</code>, where <code>x</code> depends on its argument, then the recursion can proceed usefully: <code>fromJust ma</code> will be able to evaluate to <code>x</code>, thus feeding <code>f</code>'s output back to it as input. Third, if <code>f</code> tries to use its argument to decide whether to output <code>Just</code> or <code>Nothing</code>, then <code>maybeFix f</code> will not terminate: evaluating <code>f</code>'s argument requires evaluating <code>ma</code> to see whether it is <code>Just</code>, which requires evaluating <code>f (fromJust ma)</code>, which requires evaluating <code>ma</code>, ... and so on.<br />
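The first two cases are easy to observe directly (repeating the definition of <code>maybeFix</code> from above; <code>noFix</code> and <code>onesFix</code> are made-up names):<br />

```haskell
import Data.Maybe (fromJust)

maybeFix :: (a -> Maybe a) -> Maybe a
maybeFix f = ma
  where ma = f (fromJust ma)

-- Case 1: f ignores its argument and returns Nothing.
noFix :: Maybe Int
noFix = maybeFix (const Nothing)

-- Case 2: f always returns Just, consuming its argument lazily;
-- the result is Just an infinite list of ones.
onesFix :: Maybe [Int]
onesFix = maybeFix (\xs -> Just (1 : xs))
```

Here <code>noFix</code> is <code>Nothing</code> and <code>fmap (take 3) onesFix</code> is <code>Just [1,1,1]</code>; a third-case function such as <code>\x -> if x > 0 then Just x else Nothing</code>, which inspects its argument to decide between <code>Just</code> and <code>Nothing</code>, makes <code>maybeFix</code> loop forever.<br />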
<br />
There are also instances of <code>MonadFix</code> for lists (which works analogously to the instance for <code>Maybe</code>), for <code>ST</code>, and for <code>IO</code>. The [http://hackage.haskell.org/packages/archive/base/latest/doc/html/src/System-IO.html#fixIO instance for <code>IO</code>] is particularly amusing: it creates a new (empty) <code>MVar</code>, immediately reads its contents using <code>unsafeInterleaveIO</code> (which delays the actual reading lazily until the value is needed), uses the contents of the <code>MVar</code> to compute a new value, which it then writes back into the <code>MVar</code>. It almost seems, spookily, that <code>mfix</code> is sending a value back in time to itself through the <code>MVar</code> -- though of course what is really going on is that the reading is delayed just long enough (via <code>unsafeInterleaveIO</code>) to get the process bootstrapped.<br />
<br />
{{Exercises|<br />
* Implement a <code>MonadFix</code> instance for <code>[]</code>.<br />
}}<br />
<br />
==GHC 7.6 changes==<br />
<br />
GHC 7.6 reinstated the old <code>mdo</code> syntax, so the example at the start of this section can be written<br />
<br />
<haskell><br />
mdo { x <- foo<br />
; y <- baz<br />
; z <- bar<br />
; bob<br />
; w <- frob<br />
}<br />
</haskell><br />
<br />
which will be translated into the original example (assuming that, say, <code>bar</code> and <code>bob</code> refer to <code>y</code>). The difference is that <code>mdo</code> will analyze the code in order to find minimal recursive blocks, which will be placed in <code>rec</code> blocks, whereas <code>rec</code> blocks desugar directly into calls to <code>mfix</code> without any further analysis.<br />
==Further reading==<br />
<br />
For more information (such as the precise desugaring rules for <code>rec</code> blocks), see Levent Erkök and John Launchbury's 2002 Haskell workshop paper, [http://sites.google.com/site/leventerkok/recdo.pdf?attredirects=0 A Recursive do for Haskell], or for full details, Levent Erkök’s thesis, [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.15.1543&rep=rep1&type=pdf Value Recursion in Monadic Computations]. (Note, while reading, that <code>MonadFix</code> used to be called <code>MonadRec</code>.) You can also read the [http://www.haskell.org/ghc/docs/latest/html/users_guide/syntax-extns.html#recursive-do-notation GHC user manual section on recursive do-notation].<br />
<br />
=Semigroup=<br />
<br />
A semigroup is a set <math>S\ </math> together with a binary operation <math>\oplus\ </math> which<br />
combines elements from <math>S\ </math>. The <math>\oplus\ </math> operator is required to be associative<br />
(that is, <math>(a \oplus b) \oplus c = a \oplus (b \oplus c)\ </math>, for any<br />
<math>a,b,c\ </math> which are elements of <math>S\ </math>).<br />
<br />
For example, the natural numbers under addition form a semigroup: the sum of any two natural numbers is a natural number, and <math>(a+b)+c = a+(b+c)\ </math> for any natural numbers <math>a\ </math>, <math>b\ </math>, and <math>c\,\ </math>. The integers under multiplication also form a semigroup, as do the integers (or rationals, or reals) under <math>\max\ </math> or <math>\min\ </math>, Boolean values under conjunction and disjunction, lists under concatenation, functions from a set to itself under composition ... Semigroups show up all over the place, once you know to look for them.<br />
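<br />
A few of these can be spot-checked directly (a small sketch; the <code>assoc*</code> names are just for the example):<br />
<br />
<haskell><br />
-- Associativity holds for each of these sample operations.
assocMax, assocConcat :: Bool
assocMax    = ((1 `max` 5) `max` 3) == (1 `max` (5 `max` 3))
assocConcat = (([1] ++ [2]) ++ [3]) == ([1] ++ ([2] ++ [3]))

-- Functions from a set to itself form a semigroup under composition.
assocCompose :: Bool
assocCompose = (((+1) . (*2)) . (+3)) 10 == ((+1) . ((*2) . (+3))) 10
</haskell><br />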
<br />
==Definition==<br />
<br />
Semigroups are not (yet?) defined in the base package, but the {{HackagePackage|id=semigroups}} package provides a standard definition.<br />
<br />
The definition of the <code>Semigroup</code> type class ([http://hackage.haskell.org/packages/archive/semigroups/latest/doc/html/Data-Semigroup.html haddock]) is as follows:<br />
<br />
<haskell><br />
class Semigroup a where<br />
  (<>) :: a -> a -> a<br />
<br />
  sconcat :: NonEmpty a -> a<br />
  sconcat (a :| as) = go a as where<br />
    go b (c:cs) = b <> go c cs<br />
    go b []     = b<br />
<br />
  times1p :: Whole n => n -> a -> a<br />
  times1p = ...<br />
</haskell><br />
<br />
The really important method is <code>(<>)</code>, representing the associative binary operation. The other two methods have default implementations in terms of <code>(<>)</code>, and are included in the type class in case some instances can give more efficient implementations than the default. <code>sconcat</code> reduces a nonempty list using <code>(<>)</code>; <code>times1p n x</code> combines <code>1 + n</code> copies of <code>x</code> with <code>(<>)</code>, and can often do so more efficiently than a naive fold. See the [http://hackage.haskell.org/packages/archive/semigroups/latest/doc/html/Data-Semigroup.html haddock documentation] for more information on <code>sconcat</code> and <code>times1p</code>.<br />
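<br />
A quick check of the class in action (this sketch assumes a recent GHC, where <code>Data.Semigroup</code> and <code>Data.List.NonEmpty</code> ship with base; with older compilers the same names come from the <code>semigroups</code> package):<br />
<br />
<haskell><br />
import Data.Semigroup (Max(..), sconcat, (<>))
import Data.List.NonEmpty (NonEmpty(..))

-- sconcat reduces a nonempty list with (<>); for Max that picks the maximum.
biggest :: Max Int
biggest = sconcat (Max 1 :| [Max 5, Max 3])

-- Lists form a semigroup under concatenation.
combined :: [Int]
combined = [1, 2] <> [3]
</haskell><br />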
<br />
==Laws==<br />
<br />
The only law is that <code>(<>)</code> must be associative:<br />
<br />
<haskell><br />
(x <> y) <> z = x <> (y <> z)<br />
</haskell><br />
<br />
=Monoid=<br />
<br />
Many semigroups have a special element <math>e</math> for which the binary operation <math>\oplus</math> is the identity, that is, <math>e \oplus x = x \oplus e = x</math> for every element <math>x</math>. Such a semigroup-with-identity-element is called a ''monoid''.<br />
<br />
==Definition==<br />
<br />
The definition of the <code>Monoid</code> type class (defined in<br />
<code>Data.Monoid</code>; [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Monoid.html haddock]) is:<br />
<br />
<haskell><br />
class Monoid a where<br />
  mempty  :: a<br />
  mappend :: a -> a -> a<br />
<br />
  mconcat :: [a] -> a<br />
  mconcat = foldr mappend mempty<br />
</haskell><br />
<br />
The <code>mempty</code> value specifies the identity element of the monoid, and <code>mappend</code><br />
is the binary operation. The default definition for <code>mconcat</code><br />
“reduces” a list of elements by combining them all with <code>mappend</code>,<br />
using a right fold. It is only in the <code>Monoid</code> class so that specific<br />
instances have the option of providing an alternative, more efficient<br />
implementation; usually, you can safely ignore <code>mconcat</code> when creating<br />
a <code>Monoid</code> instance, since its default definition will work just fine.<br />
<br />
The <code>Monoid</code> methods are rather unfortunately named; they are inspired<br />
by the list instance of <code>Monoid</code>, where indeed <code>mempty = []</code> and <code>mappend = (++)</code>, but this is misleading since many<br />
monoids have little to do with appending (see these [http://thread.gmane.org/gmane.comp.lang.haskell.cafe/50590 Comments from OCaml Hacker Brian Hurt] on the Haskell-cafe mailing list). This was improved in GHC 7.4, where <code>(<>)</code> was added as an alias to <code>mappend</code>.<br />
<br />
==Laws==<br />
<br />
Of course, every <code>Monoid</code> instance should actually be a monoid in the<br />
mathematical sense, which implies these laws:<br />
<br />
<haskell><br />
mempty `mappend` x = x<br />
x `mappend` mempty = x<br />
(x `mappend` y) `mappend` z = x `mappend` (y `mappend` z)<br />
</haskell><br />
<br />
==Instances==<br />
<br />
There are quite a few interesting <code>Monoid</code> instances defined in <code>Data.Monoid</code>.<br />
<br />
<ul><br />
<li><code>[a]</code> is a <code>Monoid</code>, with <code>mempty = []</code> and <code>mappend = (++)</code>. It is not hard to check that <code>(x ++ y) ++ z = x ++ (y ++ z)</code> for any lists <code>x</code>, <code>y</code>, and <code>z</code>, and that the empty list is the identity: <code>[] ++ x = x ++ [] = x</code>.</li><br />
<br />
<li>As noted previously, we can make a monoid out of any numeric type under either addition or multiplication. However, since we can’t have two instances for the same type, <code>Data.Monoid</code> provides two <code>newtype</code> wrappers, <code>Sum</code> and <code>Product</code>, with appropriate <code>Monoid</code> instances.<br />
<br />
<haskell><br />
> getSum (mconcat . map Sum $ [1..5])<br />
15<br />
> getProduct (mconcat . map Product $ [1..5])<br />
120<br />
</haskell><br />
<br />
This example code is silly, of course; we could just write<br />
<code>sum [1..5]</code> and <code>product [1..5]</code>. Nevertheless, these instances are useful in more generalized settings, as we will see in the [[Foldable|section on <code>Foldable</code>]].</li><br />
<br />
<li><code>Any</code> and <code>All</code> are <code>newtype</code> wrappers providing <code>Monoid</code> instances for <code>Bool</code> (under disjunction and conjunction, respectively).</li><br />
<br />
<li> There are three instances for <code>Maybe</code>: a basic instance which lifts a <code>Monoid</code> instance for <code>a</code> to an instance for <code>Maybe a</code>, and two <code>newtype</code> wrappers <code>First</code> and <code>Last</code> for which <code>mappend</code> selects the first (respectively last) non-<code>Nothing</code> item.</li><br />
<br />
<li><code>Endo a</code> is a newtype wrapper for functions <code>a -> a</code>, which form a monoid under composition.</li><br />
<br />
<li>There are several ways to “lift” <code>Monoid</code> instances to instances with additional structure. We have already seen that an instance for <code>a</code> can be lifted to an instance for <code>Maybe a</code>. There are also tuple instances: if <code>a</code> and <code>b</code> are instances of <code>Monoid</code>, then so is <code>(a,b)</code>, using the monoid operations for <code>a</code> and <code>b</code> in the obvious pairwise manner. Finally, if <code>a</code> is a <code>Monoid</code>, then so is the function type <code>e -> a</code> for any <code>e</code>; in particular, <code>g `mappend` h</code> is the function which applies both <code>g</code> and <code>h</code> to its argument and then combines the results using the underlying <code>Monoid</code> instance for <code>a</code>. This can be quite useful and elegant (see [http://thread.gmane.org/gmane.comp.lang.haskell.cafe/52416 example]).</li><br />
<br />
<li>The type <code>Ordering = LT | EQ | GT</code> is a <code>Monoid</code>, defined in such a way that <code>mconcat (zipWith compare xs ys)</code> computes the lexicographic ordering of <code>xs</code> and <code>ys</code> (if <code>xs</code> and <code>ys</code> have the same length). In particular, <code>mempty = EQ</code>, and <code>mappend</code> evaluates to its leftmost non-<code>EQ</code> argument (or <code>EQ</code> if both arguments are <code>EQ</code>). This can be used together with the function instance of <code>Monoid</code> to do some clever things ([http://www.reddit.com/r/programming/comments/7cf4r/monoids_in_my_programming_language/c06adnx example]).</li><br />
<br />
<li>There are also <code>Monoid</code> instances for several standard data structures in the containers library ([http://hackage.haskell.org/packages/archive/containers/0.2.0.0/doc/html/index.html haddock]), including <code>Map</code>, <code>Set</code>, and <code>Sequence</code>.</li><br />
</ul><br />
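<br />
These lifted instances combine nicely; for example (a small sketch with made-up names), lifting <code>compare</code>-style functions pointwise gives tie-breaking comparators via the function and <code>Ordering</code> instances together:<br />
<br />
<haskell><br />
import Data.List (sortBy)
import Data.Ord (comparing)

-- The function-type instance combines comparators pointwise, and the
-- Ordering instance keeps the leftmost non-EQ verdict, so ties on the
-- first component are broken by the second.
byBoth :: (Int, Char) -> (Int, Char) -> Ordering
byBoth = comparing fst `mappend` comparing snd

sorted :: [(Int, Char)]
sorted = sortBy byBoth [(2,'b'), (1,'z'), (2,'a'), (1,'a')]
</haskell><br />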
<br />
<code>Monoid</code> is also used to enable several other type class instances.<br />
As noted previously, we can use <code>Monoid</code> to make <code>((,) e)</code> an instance of <code>Applicative</code>:<br />
<br />
<haskell><br />
instance Monoid e => Applicative ((,) e) where<br />
  pure x = (mempty, x)<br />
  (u, f) <*> (v, x) = (u `mappend` v, f x)<br />
</haskell><br />
<br />
<code>Monoid</code> can be similarly used to make <code>((,) e)</code> an instance of <code>Monad</code> as well; this is known as the ''writer monad''. As we’ve already seen, <code>Writer</code> and <code>WriterT</code> are a newtype wrapper and transformer for this monad, respectively.<br />
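<br />
A quick illustration (assuming a base version that provides this instance, as recent ones do; the name <code>logged</code> is just for the example): the <code>String</code> monoid accumulates on the left while the function is applied on the right.<br />
<br />
<haskell><br />
-- ("inc;" `mappend` "answer", (+ 1) 41)
logged :: (String, Int)
logged = ("inc;", (+ 1)) <*> ("answer", 41)
</haskell><br />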
<br />
<code>Monoid</code> also plays a key role in the <code>Foldable</code> type class (see section [[#Foldable|Foldable]]).<br />
<br />
==Other monoidal classes: Alternative, MonadPlus, ArrowPlus==<br />
<br />
The <code>Alternative</code> type class ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Applicative.html#g:2 haddock])<br />
is for <code>Applicative</code> functors which also have<br />
a monoid structure:<br />
<br />
<haskell><br />
class Applicative f => Alternative f where<br />
  empty :: f a<br />
  (<|>) :: f a -> f a -> f a<br />
</haskell><br />
<br />
Of course, instances of <code>Alternative</code> should satisfy the monoid laws<br />
<br />
<haskell><br />
empty <|> x = x<br />
x <|> empty = x<br />
(x <|> y) <|> z = x <|> (y <|> z)<br />
</haskell><br />
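<br />
Two standard instances make this concrete: <code>Maybe</code> keeps the first success, while lists collect every result. A quick sketch:<br />
<br />
<haskell><br />
import Control.Applicative ((<|>))

-- Nothing is the identity, so the first Just wins.
firstJust :: Maybe Int
firstJust = Nothing <|> Just 3 <|> Just 4

-- For lists, (<|>) is concatenation: all alternatives are kept.
allResults :: [Int]
allResults = [1, 2] <|> [3]
</haskell><br />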
<br />
Likewise, <code>MonadPlus</code> ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad.html#t:MonadPlus haddock])<br />
is for <code>Monad</code>s with a monoid structure:<br />
<br />
<haskell><br />
class Monad m => MonadPlus m where<br />
  mzero :: m a<br />
  mplus :: m a -> m a -> m a<br />
</haskell><br />
<br />
The <code>MonadPlus</code> documentation states that it is intended to model<br />
monads which also support “choice and failure”; in addition to the<br />
monoid laws, instances of <code>MonadPlus</code> are expected to satisfy<br />
<br />
<haskell><br />
mzero >>= f = mzero<br />
v >> mzero = mzero<br />
</haskell><br />
<br />
which explains the sense in which <code>mzero</code> denotes failure. Since<br />
<code>mzero</code> should be the identity for <code>mplus</code>, the computation <code>m1 `mplus` m2</code> succeeds (evaluates to something other than <code>mzero</code>) if<br />
either <code>m1</code> or <code>m2</code> does; so <code>mplus</code> represents choice. The <code>guard</code><br />
function can also be used with instances of <code>MonadPlus</code>; it requires a<br />
condition to be satisfied and fails (using <code>mzero</code>) if it is not. A<br />
simple example of a <code>MonadPlus</code> instance is <code>[]</code>, which is exactly the<br />
same as the <code>Monoid</code> instance for <code>[]</code>: the empty list represents<br />
failure, and list concatenation represents choice. In general,<br />
however, a <code>MonadPlus</code> instance for a type need not be the same as its<br />
<code>Monoid</code> instance; <code>Maybe</code> is an example of such a type. A great<br />
introduction to the <code>MonadPlus</code> type class, with interesting examples<br />
of its use, is Doug Auclair’s ''MonadPlus: What a Super Monad!'' in [http://www.haskell.org/wikiupload/6/6a/TMR-Issue11.pdf the Monad.Reader issue 11].<br />
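<br />
A classic sketch of choice and failure in the list monad (the name <code>pythag</code> is illustrative):<br />
<br />
<haskell><br />
import Control.Monad (guard)

-- guard returns [()] on success and [] (mzero) on failure,
-- pruning the branches where the condition does not hold.
pythag :: [(Int, Int, Int)]
pythag = do
  c <- [1 .. 10]
  b <- [1 .. c]
  a <- [1 .. b]
  guard (a * a + b * b == c * c)
  return (a, b, c)
</haskell><br />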
<br />
There used to be a type class called <code>MonadZero</code> containing only<br />
<code>mzero</code>, representing monads with failure. The <code>do</code>-notation requires<br />
some notion of failure to deal with failing pattern matches.<br />
Unfortunately, <code>MonadZero</code> was scrapped in favor of adding the <code>fail</code><br />
method to the <code>Monad</code> class. If we are lucky, someday <code>MonadZero</code> will<br />
be restored, and <code>fail</code> will be banished to the bit bucket where it<br />
belongs (see [[MonadPlus reform proposal]]). The idea is that any<br />
<code>do</code>-block which uses pattern matching (and hence may fail) would require<br />
a <code>MonadZero</code> constraint; otherwise, only a <code>Monad</code> constraint would be<br />
required.<br />
<br />
Finally, <code>ArrowZero</code> and <code>ArrowPlus</code> ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Arrow.html#t:ArrowZero haddock])<br />
represent <code>Arrow</code>s ([[#Arrow|see below]]) with a<br />
monoid structure:<br />
<br />
<haskell><br />
class Arrow arr => ArrowZero arr where<br />
  zeroArrow :: b `arr` c<br />
<br />
class ArrowZero arr => ArrowPlus arr where<br />
  (<+>) :: (b `arr` c) -> (b `arr` c) -> (b `arr` c)<br />
</haskell><br />
<br />
==Further reading==<br />
<br />
Monoids have gotten a fair bit of attention recently, ultimately due<br />
to<br />
[http://enfranchisedmind.com/blog/posts/random-thoughts-on-haskell/ a blog post by Brian Hurt], in which he<br />
complained about the fact that the names of many Haskell type classes<br />
(<code>Monoid</code> in particular) are taken from abstract mathematics. This<br />
resulted in [http://thread.gmane.org/gmane.comp.lang.haskell.cafe/50590 a long Haskell-cafe thread]<br />
arguing the point and discussing monoids in general.<br />
<br />
{{note|May its name live forever.}}<br />
<br />
However, this was quickly followed by several blog posts about<br />
<code>Monoid</code> {{noteref}}. First, Dan Piponi<br />
wrote a great introductory post, [http://blog.sigfpe.com/2009/01/haskell-monoids-and-their-uses.html Haskell Monoids and their Uses]. This was quickly followed by<br />
Heinrich Apfelmus’s [http://apfelmus.nfshost.com/monoid-fingertree.html Monoids and Finger Trees], an accessible exposition of<br />
Hinze and Paterson’s [http://www.soi.city.ac.uk/%7Eross/papers/FingerTree.html classic paper on 2-3 finger trees], which makes very clever<br />
use of <code>Monoid</code> to implement an elegant and generic data structure.<br />
Dan Piponi then wrote two fascinating articles about using <code>Monoid</code>s<br />
(and finger trees): [http://blog.sigfpe.com/2009/01/fast-incremental-regular-expression.html Fast Incremental Regular Expressions] and [http://blog.sigfpe.com/2009/01/beyond-regular-expressions-more.html Beyond Regular Expressions].<br />
<br />
In a similar vein, David Place’s article on improving <code>Data.Map</code> in<br />
order to compute incremental folds (see [http://www.haskell.org/wikiupload/6/6a/TMR-Issue11.pdf the Monad Reader issue 11])<br />
is also a<br />
good example of using <code>Monoid</code> to generalize a data structure.<br />
<br />
Some other interesting examples of <code>Monoid</code> use include [http://www.reddit.com/r/programming/comments/7cf4r/monoids_in_my_programming_language/c06adnx building elegant list sorting combinators], [http://byorgey.wordpress.com/2008/04/17/collecting-unstructured-information-with-the-monoid-of-partial-knowledge/ collecting unstructured information], [http://izbicki.me/blog/gausian-distributions-are-monoids combining probability distributions], and a brilliant series of posts by Chung-Chieh Shan and Dylan Thurston using <code>Monoid</code>s to [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers1/ elegantly solve a difficult combinatorial puzzle] (followed by [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers2/ part 2], [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers3/ part 3], [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers4/ part 4]).<br />
<br />
As unlikely as it sounds, monads can actually be viewed as a sort of<br />
monoid, with <code>join</code> playing the role of the binary operation and<br />
<code>return</code> the role of the identity; see [http://blog.sigfpe.com/2008/11/from-monoids-to-monads.html Dan Piponi’s blog post].<br />
<br />
=Foldable=<br />
<br />
The <code>Foldable</code> class, defined in the <code>Data.Foldable</code><br />
module ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Foldable.html haddock]), abstracts over containers which can be<br />
“folded” into a summary value. This allows such folding operations<br />
to be written in a container-agnostic way.<br />
<br />
==Definition==<br />
<br />
The definition of the <code>Foldable</code> type class is:<br />
<br />
<haskell><br />
class Foldable t where<br />
  fold    :: Monoid m => t m -> m<br />
  foldMap :: Monoid m => (a -> m) -> t a -> m<br />
<br />
  foldr  :: (a -> b -> b) -> b -> t a -> b<br />
  foldl  :: (a -> b -> a) -> a -> t b -> a<br />
  foldr1 :: (a -> a -> a) -> t a -> a<br />
  foldl1 :: (a -> a -> a) -> t a -> a<br />
</haskell><br />
<br />
This may look complicated, but in fact, to make a <code>Foldable</code> instance<br />
you only need to implement one method: your choice of <code>foldMap</code> or<br />
<code>foldr</code>. All the other methods have default implementations in terms<br />
of these, and are presumably included in the class in case more<br />
efficient implementations can be provided.<br />
<br />
==Instances and examples==<br />
<br />
The type of <code>foldMap</code> should make it clear what it is supposed to do:<br />
given a way to convert the data in a container into a <code>Monoid</code> (a<br />
function <code>a -> m</code>) and a container of <code>a</code>’s (<code>t a</code>), <code>foldMap</code><br />
provides a way to iterate over the entire contents of the container,<br />
converting all the <code>a</code>’s to <code>m</code>’s and combining all the <code>m</code>’s with<br />
<code>mappend</code>. The following code shows two examples: a simple<br />
implementation of <code>foldMap</code> for lists, and a binary tree example<br />
provided by the <code>Foldable</code> documentation.<br />
<br />
<haskell><br />
instance Foldable [] where<br />
  foldMap g = mconcat . map g<br />
<br />
data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)<br />
<br />
instance Foldable Tree where<br />
  foldMap f Empty        = mempty<br />
  foldMap f (Leaf x)     = f x<br />
  foldMap f (Node l k r) = foldMap f l `mappend` f k `mappend` foldMap f r<br />
</haskell><br />
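<br />
For instance (a standalone sketch repeating the <code>Tree</code> declarations above so it runs on its own), folding with the <code>Sum</code> monoid totals the elements:<br />
<br />
<haskell><br />
import Data.Monoid (Sum(..))

data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)

instance Foldable Tree where
  foldMap _ Empty        = mempty
  foldMap f (Leaf x)     = f x
  foldMap f (Node l k r) = foldMap f l `mappend` f k `mappend` foldMap f r

tree :: Tree Int
tree = Node (Leaf 1) 2 (Leaf 3)

-- Convert each element to Sum, combine with mappend, unwrap.
total :: Int
total = getSum (foldMap Sum tree)
</haskell><br />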
<br />
The <code>foldr</code> function has a type similar to the <code>foldr</code> found in the <code>Prelude</code>, but<br />
more general, since the <code>foldr</code> in the <code>Prelude</code> works only on lists.<br />
<br />
The <code>Foldable</code> module also provides instances for <code>Maybe</code> and <code>Array</code>;<br />
additionally, many of the data structures found in the standard [http://hackage.haskell.org/package/containers containers library] (for example, <code>Map</code>, <code>Set</code>, <code>Tree</code>,<br />
and <code>Sequence</code>) provide their own <code>Foldable</code> instances.<br />
<br />
{{Exercises|<br />
# What is the type of <code>foldMap . foldMap</code>? Or <code>foldMap . foldMap . foldMap</code>, etc.? What do they do?<br />
}}<br />
<br />
==Derived folds==<br />
<br />
Given an instance of <code>Foldable</code>, we can write generic,<br />
container-agnostic functions such as:<br />
<br />
<haskell><br />
-- Compute the size of any container.<br />
containerSize :: Foldable f => f a -> Int<br />
containerSize = getSum . foldMap (const (Sum 1))<br />
<br />
-- Compute a list of elements of a container satisfying a predicate.<br />
filterF :: Foldable f => (a -> Bool) -> f a -> [a]<br />
filterF p = foldMap (\a -> if p a then [a] else [])<br />
<br />
-- Get a list of all the Strings in a container which include the<br />
-- letter a.<br />
aStrings :: Foldable f => f String -> [String]<br />
aStrings = filterF (elem 'a')<br />
</haskell><br />
<br />
The <code>Foldable</code> module also provides a large number of predefined<br />
folds, many of which are generalized versions of <code>Prelude</code> functions of the<br />
same name that only work on lists: <code>concat</code>, <code>concatMap</code>, <code>and</code>,<br />
<code>or</code>, <code>any</code>, <code>all</code>, <code>sum</code>, <code>product</code>, <code>maximum</code>(<code>By</code>),<br />
<code>minimum</code>(<code>By</code>), <code>elem</code>, <code>notElem</code>, and <code>find</code>.<br />
<br />
The important function <code>toList</code> is also provided, which turns any <code>Foldable</code> structure into a list of its elements in left-right order; it works by folding with the list monoid.<br />
<br />
There are also generic functions that work with <code>Applicative</code> or<br />
<code>Monad</code> instances to generate some sort of computation from each<br />
element in a container, and then perform all the side effects from<br />
those computations, discarding the results: <code>traverse_</code>, <code>sequenceA_</code>,<br />
and others. The results must be discarded because the <code>Foldable</code><br />
class is too weak to specify what to do with them: we cannot, in<br />
general, make an arbitrary <code>Applicative</code> or <code>Monad</code> instance into a <code>Monoid</code>, but we can make <code>m ()</code> into a <code>Monoid</code> for any such <code>m</code>. If we do have an <code>Applicative</code> or <code>Monad</code> with a monoid<br />
structure—that is, an <code>Alternative</code> or a <code>MonadPlus</code>—then we can<br />
use the <code>asum</code> or <code>msum</code> functions, which can combine the results as<br />
well. Consult the [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Foldable.html <code>Foldable</code> documentation] for<br />
more details on any of these functions.<br />
<br />
Note that the <code>Foldable</code> operations always forget the structure of<br />
the container being folded. If we start with a container of type <code>t a</code> for some <code>Foldable t</code>, then <code>t</code> will never appear in the output<br />
type of any operations defined in the <code>Foldable</code> module. Many times<br />
this is exactly what we want, but sometimes we would like to be able<br />
to generically traverse a container while preserving its<br />
structure—and this is exactly what the <code>Traversable</code> class provides,<br />
which will be discussed in the next section.<br />
<br />
{{Exercises|<br />
# Implement <code>toList :: Foldable f {{=}}> f a -> [a]</code>.<br />
# Pick some of the following functions to implement: <code>concat</code>, <code>concatMap</code>, <code>and</code>, <code>or</code>, <code>any</code>, <code>all</code>, <code>sum</code>, <code>product</code>, <code>maximum</code>(<code>By</code>), <code>minimum</code>(<code>By</code>), <code>elem</code>, <code>notElem</code>, and <code>find</code>. Figure out how they generalize to <code>Foldable</code> and come up with elegant implementations using <code>fold</code> or <code>foldMap</code> along with appropriate <code>Monoid</code> instances.<br />
}}<br />
<br />
==Foldable actually isn't==<br />
<br />
The generic term "fold" is often used to refer to the more technical concept of [[Catamorphisms|catamorphism]]. Intuitively, given a way to summarize "one level of structure" (where recursive subterms have already been replaced with their summaries), a catamorphism can summarize an entire recursive structure. It is important to realize that <code>Foldable</code> does <i>not</i> correspond to catamorphisms, but to something weaker. In particular, <code>Foldable</code> allows observing only the left-right order of elements within a structure, not the actual structure itself. Put another way, every use of <code>Foldable</code> can be expressed in terms of <code>toList</code>. For example, <code>fold</code> itself is equivalent to <code>mconcat . toList</code>.<br />
<br />
This is sufficient for many tasks, but not all. For example, consider trying to compute the depth of a <code>Tree</code>: try as we might, there is no way to implement it using <code>Foldable</code>. However, it <i>can</i> be implemented as a catamorphism.<br />
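<br />
For the <code>Tree</code> type used earlier, <code>depth</code> must be written as a direct recursion (a catamorphism); a minimal sketch:<br />
<br />
<haskell><br />
data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)

-- depth needs the actual shape of the tree, not just the left-right
-- order of its elements, so Foldable cannot express it.
depth :: Tree a -> Int
depth Empty        = 0
depth (Leaf _)     = 1
depth (Node l _ r) = 1 + max (depth l) (depth r)
</haskell><br />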
<br />
==Further reading==<br />
<br />
The <code>Foldable</code> class had its genesis in [http://www.soi.city.ac.uk/~ross/papers/Applicative.html McBride and Paterson’s paper]<br />
introducing <code>Applicative</code>, although it has<br />
been fleshed out quite a bit from the form in the paper.<br />
<br />
An interesting use of <code>Foldable</code> (as well as <code>Traversable</code>) can be<br />
found in Janis Voigtländer’s paper [http://doi.acm.org/10.1145/1480881.1480904 Bidirectionalization for free!].<br />
<br />
=Traversable=<br />
<br />
==Definition==<br />
<br />
The <code>Traversable</code> type class, defined in the <code>Data.Traversable</code><br />
module ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Traversable.html haddock]), is:<br />
<br />
<haskell><br />
class (Functor t, Foldable t) => Traversable t where<br />
  traverse  :: Applicative f => (a -> f b) -> t a -> f (t b)<br />
  sequenceA :: Applicative f => t (f a) -> f (t a)<br />
  mapM      :: Monad m => (a -> m b) -> t a -> m (t b)<br />
  sequence  :: Monad m => t (m a) -> m (t a)<br />
</haskell><br />
<br />
As you can see, every <code>Traversable</code> is also a foldable functor. Like<br />
<code>Foldable</code>, there is a lot in this type class, but making instances is<br />
actually rather easy: one need only implement <code>traverse</code> or<br />
<code>sequenceA</code>; the other methods all have default implementations in<br />
terms of these functions. A good exercise is to figure out what the default<br />
implementations should be: given either <code>traverse</code> or <code>sequenceA</code>, how<br />
would you define the other three methods? (Hint for <code>mapM</code>:<br />
<code>Control.Applicative</code> exports the <code>WrapMonad</code> newtype, which makes any<br />
<code>Monad</code> into an <code>Applicative</code>. The <code>sequence</code> function can be implemented in terms<br />
of <code>mapM</code>.)<br />
<br />
==Intuition==<br />
<br />
The key method of the <code>Traversable</code> class, and the source of its<br />
unique power, is <code>sequenceA</code>. Consider its type:<br />
<haskell><br />
sequenceA :: Applicative f => t (f a) -> f (t a)<br />
</haskell><br />
This answers the fundamental question: when can we commute two<br />
functors? For example, can we turn a tree of lists into a list of<br />
trees?<br />
<br />
The ability to compose two monads depends crucially on this ability to<br />
commute functors. Intuitively, if we want to build a composed monad<br />
<code>M a = m (n a)</code> out of monads <code>m</code> and <code>n</code>, then to be able to<br />
implement <code>join :: M (M a) -> M a</code>, that is,<br />
<code>join :: m (n (m (n a))) -> m (n a)</code>, we have to be able to commute<br />
the <code>n</code> past the <code>m</code> to get <code>m (m (n (n a)))</code>, and then we can use the<br />
<code>join</code>s for <code>m</code> and <code>n</code> to produce something of type <code>m (n a)</code>. See<br />
[http://web.cecs.pdx.edu/~mpj/pubs/springschool.html Mark Jones’s paper] for more details.<br />
<br />
Alternatively, looking at the type of <code>traverse</code>,<br />
<haskell><br />
traverse :: Applicative f => (a -> f b) -> t a -> f (t b)<br />
</haskell><br />
leads us to view <code>Traversable</code> as a generalization of <code>Functor</code>. <code>traverse</code> is an "effectful <code>fmap</code>": it allows us to map over a structure of type <code>t a</code>, applying a function to every element of type <code>a</code> in order to produce a new structure of type <code>t b</code>; but along the way the function may have some effects (captured by the applicative functor <code>f</code>).<br />
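<br />
For example (a small sketch using the list instance; <code>halve</code> is a made-up helper), traversing with a <code>Maybe</code>-producing function fails the whole traversal if any single element fails:<br />
<br />
<haskell><br />
halve :: Int -> Maybe Int
halve x = if even x then Just (x `div` 2) else Nothing

-- Every element can be halved, so the traversal succeeds.
allHalved :: Maybe [Int]
allHalved = traverse halve [2, 4, 6]

-- One element fails, so the whole result is Nothing.
oneFails :: Maybe [Int]
oneFails = traverse halve [2, 3]
</haskell><br />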
<br />
{{Exercises|<br />
# There are at least two natural ways to turn a tree of lists into a list of trees. What are they, and why?<br />
# Give a natural way to turn a list of trees into a tree of lists.<br />
# What is the type of <code>traverse . traverse</code>? What does it do?<br />
}}<br />
<br />
==Instances and examples==<br />
<br />
What’s an example of a <code>Traversable</code> instance?<br />
The following code shows an example instance for the same<br />
<code>Tree</code> type used as an example in the previous <code>Foldable</code> section. It<br />
is instructive to compare this instance with a <code>Functor</code> instance for<br />
<code>Tree</code>, which is also shown.<br />
<br />
<haskell><br />
data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)<br />
<br />
instance Traversable Tree where<br />
  traverse g Empty        = pure Empty<br />
  traverse g (Leaf x)     = Leaf <$> g x<br />
  traverse g (Node l x r) = Node <$> traverse g l<br />
                                 <*> g x<br />
                                 <*> traverse g r<br />
<br />
instance Functor Tree where<br />
  fmap g Empty        = Empty<br />
  fmap g (Leaf x)     = Leaf $ g x<br />
  fmap g (Node l x r) = Node (fmap g l)<br />
                             (g x)<br />
                             (fmap g r)<br />
</haskell><br />
<br />
It should be clear that the <code>Traversable</code> and <code>Functor</code> instances for<br />
<code>Tree</code> are almost identical; the only difference is that the <code>Functor</code><br />
instance involves normal function application, whereas the<br />
applications in the <code>Traversable</code> instance take place within an<br />
<code>Applicative</code> context, using <code>(<$>)</code> and <code>(<*>)</code>. In fact, this will be<br />
true for any type.<br />
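<br />
Here is the <code>Tree</code> example again as a complete, standalone sketch (assuming a recent GHC where <code>Foldable</code> methods and <code>(<>)</code> are in the Prelude), run with <code>Maybe</code> as the underlying <code>Applicative</code>:<br />
<br />
<haskell><br />
data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)
  deriving (Eq, Show)

instance Functor Tree where
  fmap _ Empty        = Empty
  fmap g (Leaf x)     = Leaf (g x)
  fmap g (Node l x r) = Node (fmap g l) (g x) (fmap g r)

instance Foldable Tree where
  foldMap _ Empty        = mempty
  foldMap f (Leaf x)     = f x
  foldMap f (Node l k r) = foldMap f l <> f k <> foldMap f r

instance Traversable Tree where
  traverse _ Empty        = pure Empty
  traverse g (Leaf x)     = Leaf <$> g x
  traverse g (Node l x r) = Node <$> traverse g l <*> g x <*> traverse g r

-- With Maybe as the Applicative, the traversal succeeds only if
-- every element passes the check.
positive :: Int -> Maybe Int
positive x = if x > 0 then Just x else Nothing

ok, bad :: Maybe (Tree Int)
ok  = traverse positive (Node (Leaf 1) 2 (Leaf 3))
bad = traverse positive (Node (Leaf 1) 0 (Leaf 3))
</haskell><br />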
<br />
Any <code>Traversable</code> functor is also <code>Foldable</code>, and a <code>Functor</code>. We can see<br />
this not only from the class declaration, but by the fact that we can<br />
implement the methods of both classes given only the <code>Traversable</code><br />
methods.<br />
<br />
The standard libraries provide a number of <code>Traversable</code> instances,<br />
including instances for <code>[]</code>, <code>Maybe</code>, <code>Map</code>, <code>Tree</code>, and <code>Sequence</code>.<br />
Notably, <code>Set</code> is not <code>Traversable</code>, although it is <code>Foldable</code>.<br />
<br />
{{Exercises|<br />
# Implement <code>fmap</code> and <code>foldMap</code> using only the <code>Traversable</code> methods. (Note that the <code>Traversable</code> module provides these implementations as <code>fmapDefault</code> and <code>foldMapDefault</code>.)<br />
}}<br />
<br />
==Laws==<br />
<br />
Any instance of <code>Traversable</code> must satisfy the following two laws, where <code>Identity</code> is the identity functor (as defined in the [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Data-Functor-Identity.html <code>Data.Functor.Identity</code> module] from the <code>transformers</code> package), and <code>Compose</code> wraps the composition of two functors (as defined in [http://hackage.haskell.org/packages/archive/transformers/0.3.0.0/doc/html/Data-Functor-Compose.html <code>Data.Functor.Compose</code>]):<br />
<br />
# <code>traverse Identity = Identity</code><br />
# <code>traverse (Compose . fmap g . f) = Compose . fmap (traverse g) . traverse f</code><br />
<br />
The first law essentially says that traversals cannot make up arbitrary effects. The second law explains how doing two traversals in sequence can be collapsed to a single traversal.<br />
<br />
Additionally, suppose <code>eta</code> is an "<code>Applicative</code> morphism" between two particular <code>Applicative</code> functors <code>f</code> and <code>g</code>, that is, a natural transformation<br />
<haskell><br />
eta :: forall a. f a -> g a<br />
</haskell><br />
which preserves the <code>Applicative</code> operations: <code>eta (pure x) = pure x</code> and <code>eta (x <*> y) = eta x <*> eta y</code>. Then, by parametricity, any instance of <code>Traversable</code> satisfying the above two laws will also satisfy <code>eta . traverse f = traverse (eta . f)</code>.<br />
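For a concrete instance of this last law, <code>maybeToList</code> is an <code>Applicative</code> morphism from <code>Maybe</code> to <code>[]</code>. The names <code>half</code>, <code>lhs</code>, and <code>rhs</code> below are illustrative only:

```haskell
import Data.Maybe (maybeToList)

-- maybeToList preserves pure and (<*>), so by the law above,
--   maybeToList . traverse f = traverse (maybeToList . f)
half :: Int -> Maybe Int
half n = if even n then Just (n `div` 2) else Nothing

lhs, rhs :: [[Int]]
lhs = maybeToList (traverse half [2, 4, 6])
rhs = traverse (maybeToList . half) [2, 4, 6]
```

Both sides evaluate to <code>[[1,2,3]]</code>; replacing an element with an odd number makes both sides the empty list.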
<br />
==Further reading==<br />
<br />
The <code>Traversable</code> class also had its genesis in [http://www.soi.city.ac.uk/~ross/papers/Applicative.html McBride and Paterson’s <code>Applicative</code> paper],<br />
and is described in more detail in Gibbons and Oliveira, [http://www.comlab.ox.ac.uk/jeremy.gibbons/publications/iterator.pdf The Essence of the Iterator Pattern],<br />
which also contains a wealth of references to related work.<br />
<br />
<code>Traversable</code> forms a core component of Edward Kmett's [http://hackage.haskell.org/package/lens lens library]. Watching [https://vimeo.com/56063074 Edward's talk on the subject] is a highly recommended way to gain better insight into <code>Traversable</code>, <code>Foldable</code>, <code>Applicative</code>, and many other things besides.<br />
<br />
For references on the <code>Traversable</code> laws, see Russell O'Connor's [http://article.gmane.org/gmane.comp.lang.haskell.libraries/17778 mailing list post] (and subsequent thread).<br />
<br />
=Category=<br />
<br />
<code>Category</code> is a relatively recent addition to the Haskell standard libraries. It generalizes the notion of function composition to general “morphisms”.<br />
<br />
{{note|GHC 7.6.1 changed its rules regarding types and type variables. Now, any operator at the type level is treated as a type ''constructor'' rather than a type ''variable''; prior to GHC 7.6.1 it was possible to use <code>(~&gt;)</code> instead of <code>`arr`</code>. For more information, see [http://thread.gmane.org/gmane.comp.lang.haskell.glasgow.user/21350 the discussion on the GHC-users mailing list]. For a new approach to nice arrow notation that works with GHC 7.6.1, see [http://article.gmane.org/gmane.comp.lang.haskell.glasgow.user/22615 this message] and also [http://article.gmane.org/gmane.comp.lang.haskell.glasgow.user/22616 this message] from Edward Kmett, though for simplicity I haven't adopted it here.}}<br />
The definition of the <code>Category</code> type class (from<br />
<code>Control.Category</code>; [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Category.html haddock]) is shown below. For ease of reading, note that I have used an infix type variable <code>`arr`</code>, in parallel with the infix function type constructor <code>(->)</code>. {{noteref}} This syntax is not part of Haskell 2010. The second definition shown is the one used in the standard libraries. For the remainder of this document, I will use the infix type constructor <code>`arr`</code> for <code>Category</code> as well as <code>Arrow</code>.<br />
<br />
<haskell><br />
class Category arr where<br />
  id  :: a `arr` a<br />
  (.) :: (b `arr` c) -> (a `arr` b) -> (a `arr` c)<br />
<br />
-- The same thing, with a normal (prefix) type constructor<br />
class Category cat where<br />
  id  :: cat a a<br />
  (.) :: cat b c -> cat a b -> cat a c<br />
</haskell><br />
<br />
Note that an instance of <code>Category</code> should be a type constructor which takes two type arguments, that is, something of kind <code>* -> * -> *</code>. It is instructive to imagine the type constructor variable <code>cat</code> replaced by the function constructor <code>(->)</code>: indeed, in this case we recover precisely the familiar identity function <code>id</code> and function composition operator <code>(.)</code> defined in the standard <code>Prelude</code>.<br />
<br />
Of course, the <code>Category</code> module provides exactly such an instance of<br />
<code>Category</code> for <code>(->)</code>. But it also provides one other instance, shown below, which should be familiar from the previous discussion of the <code>Monad</code> laws. <code>Kleisli m a b</code>, as defined in the <code>Control.Arrow</code> module, is just a <code>newtype</code> wrapper around <code>a -> m b</code>.<br />
<br />
<haskell><br />
newtype Kleisli m a b = Kleisli { runKleisli :: a -> m b }<br />
<br />
instance Monad m => Category (Kleisli m) where<br />
  id = Kleisli return<br />
  Kleisli g . Kleisli h = Kleisli (h >=> g)<br />
</haskell><br />
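To make this concrete, composing two <code>Kleisli</code> arrows over <code>Maybe</code> threads the monadic effect through both steps. The helpers <code>halve</code> and <code>quarter</code> are invented names for this sketch:

```haskell
import Control.Arrow (Kleisli(..))
import Control.Category ((>>>))

halve :: Int -> Maybe Int
halve n = if even n then Just (n `div` 2) else Nothing

-- Kleisli composition: halve twice, failing if either step fails
quarter :: Kleisli Maybe Int Int
quarter = Kleisli halve >>> Kleisli halve
```

For example, <code>runKleisli quarter 12</code> is <code>Just 3</code>, while <code>runKleisli quarter 6</code> is <code>Nothing</code> (the second halving fails on 3).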
<br />
The only law that <code>Category</code> instances should satisfy is that <code>id</code> and <code>(.)</code> should form a monoid—that is, <code>id</code> should be the identity of <code>(.)</code>, and <code>(.)</code> should be associative.<br />
<br />
Finally, the <code>Category</code> module exports two additional operators:<br />
<code>(<<<)</code>, which is just a synonym for <code>(.)</code>, and <code>(>>>)</code>, which is <code>(.)</code> with its arguments reversed. (In previous versions of the libraries, these operators were defined as part of the <code>Arrow</code> class.)<br />
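For the function instance these operators just spell composition in both reading directions; a small sketch (<code>f</code> and <code>g</code> are invented names):

```haskell
import Control.Category ((>>>), (<<<))

f, g :: Int -> String
f = (+ 1) >>> show       -- left-to-right: add one, then show
g = show <<< (+ 1)       -- right-to-left: the same function
```

Both <code>f 41</code> and <code>g 41</code> evaluate to <code>"42"</code>.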
<br />
==Further reading==<br />
<br />
The name <code>Category</code> is a bit misleading, since the <code>Category</code> class cannot represent arbitrary categories, but only categories whose objects are objects of <code>Hask</code>, the category of Haskell types. For a more general treatment of categories within Haskell, see the [http://hackage.haskell.org/package/category-extras category-extras package]. For more about category theory in general, see the excellent [http://en.wikibooks.org/wiki/Haskell/Category_theory Haskell wikibook page],<br />
[http://books.google.com/books/about/Category_theory.html?id=-MCJ6x2lC7oC Steve Awodey’s new book], Benjamin Pierce’s [http://books.google.com/books/about/Basic_category_theory_for_computer_scien.html?id=ezdeaHfpYPwC Basic category theory for computer scientists], or [http://folli.loria.fr/cds/1999/esslli99/courses/barr-wells.html Barr and Wells’s category theory lecture notes]. [http://dekudekuplex.wordpress.com/2009/01/19/motivating-learning-category-theory-for-non-mathematicians/ Benjamin Russell’s blog post]<br />
is another good source of motivation and category theory links. You certainly don’t need to know any category theory to be a successful and productive Haskell programmer, but it does lend itself to much deeper appreciation of Haskell’s underlying theory.<br />
<br />
=Arrow=<br />
<br />
The <code>Arrow</code> class represents another abstraction of computation, in a<br />
similar vein to <code>Monad</code> and <code>Applicative</code>. However, unlike <code>Monad</code><br />
and <code>Applicative</code>, whose types only reflect their output, the type of<br />
an <code>Arrow</code> computation reflects both its input and output. Arrows<br />
generalize functions: if <code>arr</code> is an instance of <code>Arrow</code>, a value of<br />
type <code>b `arr` c</code> can be thought of as a computation which takes values of<br />
type <code>b</code> as input, and produces values of type <code>c</code> as output. In the<br />
<code>(->)</code> instance of <code>Arrow</code> this is just a pure function; in general, however,<br />
an arrow may represent some sort of “effectful” computation.<br />
<br />
==Definition==<br />
<br />
The definition of the <code>Arrow</code> type class, from<br />
<code>Control.Arrow</code> ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Arrow.html haddock]), is:<br />
<br />
<haskell><br />
class Category arr => Arrow arr where<br />
  arr    :: (b -> c) -> (b `arr` c)<br />
  first  :: (b `arr` c) -> ((b, d) `arr` (c, d))<br />
  second :: (b `arr` c) -> ((d, b) `arr` (d, c))<br />
  (***)  :: (b `arr` c) -> (b' `arr` c') -> ((b, b') `arr` (c, c'))<br />
  (&&&)  :: (b `arr` c) -> (b `arr` c') -> (b `arr` (c, c'))<br />
</haskell><br />
<br />
{{note|In versions of the <code>base</code><br />
package prior to version 4, there is no <code>Category</code> class, and the<br />
<code>Arrow</code> class includes the arrow composition operator <code>(>>>)</code>. It<br />
also includes <code>pure</code> as a synonym for <code>arr</code>, but this was removed<br />
since it conflicts with the <code>pure</code> from <code>Applicative</code>.}}<br />
<br />
The first thing to note is the <code>Category</code> class constraint, which<br />
means that we get identity arrows and arrow composition for free:<br />
given two arrows <code>g :: b `arr` c</code> and <code>h :: c `arr` d</code>, we can form their<br />
composition <code>g >>> h :: b `arr` d</code> {{noteref}}.<br />
<br />
As should be a familiar pattern by now, the only methods which must be<br />
defined when writing a new instance of <code>Arrow</code> are <code>arr</code> and <code>first</code>;<br />
the other methods have default definitions in terms of these, but are<br />
included in the <code>Arrow</code> class so that they can be overridden with more<br />
efficient implementations if desired.<br />
<br />
==Intuition==<br />
<br />
Let’s look at each of the arrow methods in turn. [http://www.haskell.org/arrows/ Ross Paterson’s web page on arrows] has nice diagrams which can help<br />
build intuition.<br />
<br />
* The <code>arr</code> function takes any function <code>b -> c</code> and turns it into a generalized arrow <code>b `arr` c</code>. The <code>arr</code> method justifies the claim that arrows generalize functions, since it says that we can treat any function as an arrow. It is intended that the arrow <code>arr g</code> is “pure” in the sense that it only computes <code>g</code> and has no “effects” (whatever that might mean for any particular arrow type).<br />
<br />
* The <code>first</code> method turns any arrow from <code>b</code> to <code>c</code> into an arrow from <code>(b,d)</code> to <code>(c,d)</code>. The idea is that <code>first g</code> uses <code>g</code> to process the first element of a tuple, and lets the second element pass through unchanged. For the function instance of <code>Arrow</code>, of course, <code>first g (x,y) = (g x, y)</code>.<br />
<br />
* The <code>second</code> function is similar to <code>first</code>, but with the elements of the tuples swapped. Indeed, it can be defined in terms of <code>first</code> using an auxiliary function <code>swap</code>, defined by <code>swap (x,y) = (y,x)</code>.<br />
<br />
* The <code>(***)</code> operator is “parallel composition” of arrows: it takes two arrows and makes them into one arrow on tuples, which has the behavior of the first arrow on the first element of a tuple, and the behavior of the second arrow on the second element. The mnemonic is that <code>g *** h</code> is the ''product'' (hence <code>*</code>) of <code>g</code> and <code>h</code>. For the function instance of <code>Arrow</code>, we define <code>(g *** h) (x,y) = (g x, h y)</code>. The default implementation of <code>(***)</code> is in terms of <code>first</code>, <code>second</code>, and sequential arrow composition <code>(>>>)</code>. The reader may also wish to think about how to implement <code>first</code> and <code>second</code> in terms of <code>(***)</code>.<br />
<br />
* The <code>(&&&)</code> operator is “fanout composition” of arrows: it takes two arrows <code>g</code> and <code>h</code> and makes them into a new arrow <code>g &&& h</code> which supplies its input as the input to both <code>g</code> and <code>h</code>, returning their results as a tuple. The mnemonic is that <code>g &&& h</code> performs both <code>g</code> ''and'' <code>h</code> (hence <code>&</code>) on its input. For functions, we define <code>(g &&& h) x = (g x, h x)</code>.<br />
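The default definitions alluded to above can be sketched as follows. The names <code>second'</code>, <code>prod</code>, and <code>fanout</code> are chosen here only to avoid clashing with the <code>Control.Arrow</code> exports; the library's actual defaults are equivalent:

```haskell
import Control.Arrow (Arrow, arr, first, (>>>))

-- second, (***), and (&&&) expressed using only arr, first, and (>>>)
second' :: Arrow arr => arr b c -> arr (d, b) (d, c)
second' g = arr swap >>> first g >>> arr swap
  where swap (x, y) = (y, x)

prod :: Arrow arr => arr b c -> arr b' c' -> arr (b, b') (c, c')
prod g h = first g >>> second' h

fanout :: Arrow arr => arr b c -> arr b c' -> arr b (c, c')
fanout g h = arr (\x -> (x, x)) >>> prod g h
```

Instantiated at functions, <code>prod (+ 1) show (1, 2)</code> gives <code>(2, "2")</code> and <code>fanout (+ 1) negate 5</code> gives <code>(6, -5)</code>.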
<br />
==Instances==<br />
<br />
The <code>Arrow</code> library itself only provides two <code>Arrow</code> instances, both<br />
of which we have already seen: <code>(->)</code>, the normal function<br />
constructor, and <code>Kleisli m</code>, which makes functions of<br />
type <code>a -> m b</code> into <code>Arrow</code>s for any <code>Monad m</code>. These instances are:<br />
<br />
<haskell><br />
instance Arrow (->) where<br />
  arr g = g<br />
  first g (x,y) = (g x, y)<br />
<br />
newtype Kleisli m a b = Kleisli { runKleisli :: a -> m b }<br />
<br />
instance Monad m => Arrow (Kleisli m) where<br />
  arr f = Kleisli (return . f)<br />
  first (Kleisli f) = Kleisli (\ ~(b,d) -> do c <- f b<br />
                                              return (c,d) )<br />
</haskell><br />
<br />
==Laws==<br />
<br />
{{note|See [http://dx.doi.org/10.1016/S0167-6423(99)00023-4 John Hughes: Generalising monads to arrows]; [http://homepages.inf.ed.ac.uk/wadler/papers/arrows/arrows.pdf Sam Lindley, Philip Wadler, Jeremy Yallop: The arrow calculus]; [http://www.soi.city.ac.uk/~ross/papers/fop.html Ross Paterson: Programming with Arrows].}}<br />
<br />
There are quite a few laws that instances of <code>Arrow</code> should<br />
satisfy {{noteref}}:<br />
<br />
<haskell><br />
arr id = id<br />
arr (h . g) = arr g >>> arr h<br />
first (arr g) = arr (g *** id)<br />
first (g >>> h) = first g >>> first h<br />
first g >>> arr (id *** h) = arr (id *** h) >>> first g<br />
first g >>> arr fst = arr fst >>> g<br />
first (first g) >>> arr assoc = arr assoc >>> first g<br />
<br />
assoc ((x,y),z) = (x,(y,z))<br />
</haskell><br />
<br />
Note that this version of the laws is slightly different than the laws given in the<br />
first two above references, since several of the laws have now been<br />
subsumed by the <code>Category</code> laws (in particular, the requirements that<br />
<code>id</code> is the identity arrow and that <code>(>>>)</code> is associative). The laws<br />
shown here follow those in Paterson’s Programming with Arrows, which uses the<br />
<code>Category</code> class.<br />
<br />
{{note|Unless category-theory-induced insomnolence is your cup of tea.}}<br />
<br />
The reader is advised not to lose too much sleep over the <code>Arrow</code><br />
laws {{noteref}}, since it is not essential to understand them in order to<br />
program with arrows. There are also laws that <code>ArrowChoice</code>,<br />
<code>ArrowApply</code>, and <code>ArrowLoop</code> instances should satisfy; the interested<br />
reader should consult [http://www.soi.city.ac.uk/~ross/papers/fop.html Paterson: Programming with Arrows].<br />
<br />
==ArrowChoice==<br />
<br />
Computations built using the <code>Arrow</code> class, like those built using<br />
the <code>Applicative</code> class, are rather inflexible: the structure of the computation<br />
is fixed at the outset, and there is no ability to choose between<br />
alternate execution paths based on intermediate results.<br />
The <code>ArrowChoice</code> class provides exactly such an ability:<br />
<br />
<haskell><br />
class Arrow arr => ArrowChoice arr where<br />
  left  :: (b `arr` c) -> (Either b d `arr` Either c d)<br />
  right :: (b `arr` c) -> (Either d b `arr` Either d c)<br />
  (+++) :: (b `arr` c) -> (b' `arr` c') -> (Either b b' `arr` Either c c')<br />
  (|||) :: (b `arr` d) -> (c `arr` d) -> (Either b c `arr` d)<br />
</haskell><br />
<br />
A comparison of <code>ArrowChoice</code> to <code>Arrow</code> will reveal a striking<br />
parallel between <code>left</code>, <code>right</code>, <code>(+++)</code>, <code>(|||)</code> and <code>first</code>,<br />
<code>second</code>, <code>(***)</code>, <code>(&&&)</code>, respectively. Indeed, they are dual:<br />
<code>first</code>, <code>second</code>, <code>(***)</code>, and <code>(&&&)</code> all operate on product types<br />
(tuples), and <code>left</code>, <code>right</code>, <code>(+++)</code>, and <code>(|||)</code> are the<br />
corresponding operations on sum types. In general, these operations<br />
create arrows whose inputs are tagged with <code>Left</code> or <code>Right</code>, and can<br />
choose how to act based on these tags.<br />
<br />
* If <code>g</code> is an arrow from <code>b</code> to <code>c</code>, then <code>left g</code> is an arrow from <code>Either b d</code> to <code>Either c d</code>. On inputs tagged with <code>Left</code>, the <code>left g</code> arrow has the behavior of <code>g</code>; on inputs tagged with <code>Right</code>, it behaves as the identity.<br />
<br />
* The <code>right</code> function, of course, is the mirror image of <code>left</code>. The arrow <code>right g</code> has the behavior of <code>g</code> on inputs tagged with <code>Right</code>.<br />
<br />
* The <code>(+++)</code> operator performs “multiplexing”: <code>g +++ h</code> behaves as <code>g</code> on inputs tagged with <code>Left</code>, and as <code>h</code> on inputs tagged with <code>Right</code>. The tags are preserved. The <code>(+++)</code> operator is the ''sum'' (hence <code>+</code>) of two arrows, just as <code>(***)</code> is the product.<br />
<br />
* The <code>(|||)</code> operator is “merge” or “fanin”: the arrow <code>g ||| h</code> behaves as <code>g</code> on inputs tagged with <code>Left</code>, and <code>h</code> on inputs tagged with <code>Right</code>, but the tags are discarded (hence, <code>g</code> and <code>h</code> must have the same output type). The mnemonic is that <code>g ||| h</code> performs either <code>g</code> ''or'' <code>h</code> on its input.<br />
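For the function instance, <code>(|||)</code> amounts to a case analysis on <code>Either</code>. The helpers <code>classify</code> and <code>describe</code> are invented examples:

```haskell
import Control.Arrow ((>>>), (|||))

classify :: Int -> Either Int Int
classify n = if even n then Left n else Right n

-- Merge the two cases back into a single String result
describe :: Int -> String
describe = classify >>> ((\n -> "even: " ++ show n) ||| (\n -> "odd: " ++ show n))
```

Here <code>describe 4</code> is <code>"even: 4"</code> and <code>describe 7</code> is <code>"odd: 7"</code>.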
<br />
The <code>ArrowChoice</code> class allows computations to choose among a finite number of execution paths, based on intermediate results. The possible<br />
execution paths must be known in advance, and explicitly assembled with <code>(+++)</code> or <code>(|||)</code>. However, sometimes more flexibility is<br />
needed: we would like to be able to ''compute'' an arrow from intermediate results, and use this computed arrow to continue the computation. This is the power given to us by <code>ArrowApply</code>.<br />
<br />
==ArrowApply==<br />
<br />
The <code>ArrowApply</code> type class is:<br />
<br />
<haskell><br />
class Arrow arr => ArrowApply arr where<br />
  app :: (b `arr` c, b) `arr` c<br />
</haskell><br />
<br />
If we have computed an arrow as the output of some previous<br />
computation, then <code>app</code> allows us to apply that arrow to an input,<br />
producing its output as the output of <code>app</code>. As an exercise, the<br />
reader may wish to use <code>app</code> to implement an alternative “curried”<br />
version, <code>app2 :: b `arr` ((b `arr` c) `arr` c)</code>.<br />
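For the ordinary function instance, <code>app</code> is simply uncurried function application; a quick sketch (<code>applied</code> is an invented name):

```haskell
import Control.Arrow (ArrowApply(..))

-- For (->), app (f, x) = f x
applied :: Int
applied = app ((+ 1), 41)
```

Here <code>applied</code> evaluates to <code>42</code>.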
<br />
This notion of being able to ''compute'' a new computation<br />
may sound familiar:<br />
this is exactly what the monadic bind operator <code>(>>=)</code> does. It<br />
should not particularly come as a surprise that <code>ArrowApply</code> and<br />
<code>Monad</code> are exactly equivalent in expressive power. In particular,<br />
<code>Kleisli m</code> can be made an instance of <code>ArrowApply</code>, and any instance<br />
of <code>ArrowApply</code> can be made a <code>Monad</code> (via the <code>newtype</code> wrapper<br />
<code>ArrowMonad</code>). As an exercise, the reader may wish to try<br />
implementing these instances:<br />
<br />
<haskell><br />
instance Monad m => ArrowApply (Kleisli m) where<br />
  app = -- exercise<br />
<br />
newtype ArrowMonad a b = ArrowMonad (a () b)<br />
<br />
instance ArrowApply a => Monad (ArrowMonad a) where<br />
  return = -- exercise<br />
  (ArrowMonad a) >>= k = -- exercise<br />
</haskell><br />
<br />
==ArrowLoop==<br />
<br />
The <code>ArrowLoop</code> type class is:<br />
<br />
<haskell><br />
class Arrow a => ArrowLoop a where<br />
  loop :: a (b, d) (c, d) -> a b c<br />
<br />
trace :: ((b,d) -> (c,d)) -> b -> c<br />
trace f b = let (c,d) = f (b,d) in c<br />
</haskell><br />
<br />
It describes arrows that can use recursion to compute results, and is<br />
used to desugar the <code>rec</code> construct in arrow notation (described<br />
below).<br />
<br />
Taken by itself, the type of the <code>loop</code> method does not seem to tell<br />
us much. Its intention, however, is a generalization of the <code>trace</code><br />
function which is also shown. The <code>d</code> component of the first arrow’s<br />
output is fed back in as its own input. In other words, the arrow<br />
<code>loop g</code> is obtained by recursively “fixing” the second component of<br />
the input to <code>g</code>.<br />
<br />
It can be a bit difficult to grok what the <code>trace</code> function is doing.<br />
How can <code>d</code> appear on the left and right sides of the <code>let</code>? Well,<br />
this is Haskell’s laziness at work. There is not space here for a<br />
full explanation; the interested reader is encouraged to study the<br />
standard <code>fix</code> function, and to read [http://www.soi.city.ac.uk/~ross/papers/fop.html Paterson’s arrow tutorial].<br />
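The connection with <code>fix</code> can be made explicit: feeding <code>trace</code>'s output straight back in computes a fixed point. In the sketch below, <code>fixViaTrace</code> and <code>factorial</code> are illustrative names, not library functions:

```haskell
-- trace, as defined above
trace :: ((b, d) -> (c, d)) -> b -> c
trace f b = let (c, d) = f (b, d) in c

-- Feeding the result back in as the loop value recovers fix:
-- d = f d, and the output c is just d.
fixViaTrace :: (a -> a) -> a
fixViaTrace f = trace (\(_, d) -> (d, f d)) ()

-- A factorial whose recursion is supplied entirely by fixViaTrace
factorial :: Integer -> Integer
factorial = fixViaTrace (\rec n -> if n <= 0 then 1 else n * rec (n - 1))
```

For example, <code>factorial 5</code> evaluates to <code>120</code>.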
<br />
==Arrow notation==<br />
<br />
Programming directly with the arrow combinators can be painful,<br />
especially when writing complex computations which need to retain<br />
simultaneous reference to a number of intermediate results. With<br />
nothing but the arrow combinators, such intermediate results must be<br />
kept in nested tuples, and it is up to the programmer to remember<br />
which intermediate results are in which components, and to swap,<br />
reassociate, and generally mangle tuples as necessary. This problem<br />
is solved by the special arrow notation supported by GHC, similar to<br />
<code>do</code> notation for monads, that allows names to be assigned to<br />
intermediate results while building up arrow computations. An example<br />
arrow implemented using arrow notation, taken from<br />
Paterson, is:<br />
<br />
<haskell><br />
class ArrowLoop arr => ArrowCircuit arr where<br />
  delay :: b -> (b `arr` b)<br />
<br />
counter :: ArrowCircuit arr => Bool `arr` Int<br />
counter = proc reset -> do<br />
    rec output <- returnA -< if reset then 0 else next<br />
        next   <- delay 0 -< output + 1<br />
    returnA -< output<br />
</haskell><br />
<br />
This arrow is intended to<br />
represent a recursively defined counter circuit with a reset line.<br />
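To actually run such a circuit we need a concrete <code>ArrowCircuit</code> instance. The sketch below uses Hughes's stream-function arrow <code>SF</code> (from his ''Programming with Arrows'' tutorial), and <code>counter'</code> is an invented, combinator-only equivalent of the <code>proc</code>-notation counter above:

```haskell
import Prelude hiding (id, (.))
import Control.Category
import Control.Arrow

-- Hughes's stream-function arrow: a circuit is a function on
-- streams of inputs, modelled here as (lazy) lists.
newtype SF a b = SF { runSF :: [a] -> [b] }

instance Category SF where
  id = SF (\xs -> xs)
  SF g . SF f = SF (\xs -> g (f xs))

instance Arrow SF where
  arr f = SF (map f)
  first (SF f) = SF (\xs -> zip (f (map fst xs)) (map snd xs))

instance ArrowLoop SF where
  loop (SF f) = SF (\bs ->
    let (cs, ds) = unzip (f (zip bs (stream ds)))
        stream ~(x:xs') = x : stream xs'   -- keep the feedback lazy
    in cs)

-- delay prepends the initial value, shifting the stream one step
delay :: b -> SF b b
delay x = SF (x :)

-- The counter from above, written with plain combinators
counter' :: SF Bool Int
counter' = loop (arr sel >>> (returnA &&& (arr (+ 1) >>> delay 0)))
  where sel (reset, next) = if reset then 0 else next
```

Running <code>runSF counter' [False, False, False, True, False]</code> yields <code>[0, 1, 2, 0, 1]</code>: the count climbs until the reset, then restarts.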
<br />
There is not space here for a full explanation of arrow notation; the<br />
interested reader should consult<br />
[http://www.soi.city.ac.uk/~ross/papers/notation.html Paterson’s paper introducing the notation], or his later [http://www.soi.city.ac.uk/~ross/papers/fop.html tutorial which presents a simplified version].<br />
<br />
==Further reading==<br />
<br />
An excellent starting place for the student of arrows is the [http://www.haskell.org/arrows/ arrows web page], which contains an<br />
introduction and many references. Some key papers on arrows include<br />
Hughes’s original paper introducing arrows, [http://dx.doi.org/10.1016/S0167-6423(99)00023-4 Generalising monads to arrows], and [http://www.soi.city.ac.uk/~ross/papers/notation.html Paterson’s paper on arrow notation].<br />
<br />
Both Hughes and Paterson later wrote accessible tutorials intended for a broader<br />
audience: [http://www.soi.city.ac.uk/~ross/papers/fop.html Paterson: Programming with Arrows] and [http://www.cse.chalmers.se/~rjmh/afp-arrows.pdf Hughes: Programming with Arrows].<br />
<br />
Although Hughes’s goal in defining the <code>Arrow</code> class was to<br />
generalize <code>Monad</code>s, and it has been said that <code>Arrow</code> lies “between<br />
<code>Applicative</code> and <code>Monad</code>” in power, they are not directly<br />
comparable. The precise relationship remained in some confusion until<br />
[http://homepages.inf.ed.ac.uk/wadler/papers/arrows-and-idioms/arrows-and-idioms.pdf analyzed by Lindley, Wadler, and Yallop], who<br />
also invented a new calculus of arrows, based on the lambda calculus,<br />
which considerably simplifies the presentation of the arrow laws<br />
(see [http://homepages.inf.ed.ac.uk/wadler/papers/arrows/arrows.pdf The arrow calculus]). There is also a precise technical sense in which [http://just-bottom.blogspot.de/2010/04/programming-with-effects-story-so-far.html <code>Arrow</code> can be seen as the intersection of <code>Applicative</code> and <code>Category</code>].<br />
<br />
Some examples of <code>Arrow</code>s include [http://www.haskell.org/yampa/ Yampa], the<br />
[http://www.fh-wedel.de/~si/HXmlToolbox/ Haskell XML Toolkit], and the functional GUI library [[Grapefruit]].<br />
<br />
Some extensions to arrows have been explored; for example, the<br />
<code>BiArrow</code>s of Alimarine et al. ([http://wiki.clean.cs.ru.nl/download/papers/2005/alia2005-biarrowsHaskellWorkshop.pdf "There and Back Again: Arrows for Invertible Programming"]), for two-way instead of one-way<br />
computation.<br />
<br />
The Haskell wiki has [[Research papers/Monads and Arrows|links to many additional research papers relating to <code>Arrow</code>s]].<br />
<br />
=Comonad=<br />
<br />
The final type class we will examine is <code>Comonad</code>. The <code>Comonad</code> class<br />
is the categorical dual of <code>Monad</code>; that is, <code>Comonad</code> is like <code>Monad</code><br />
but with all the function arrows flipped. It is not actually in the<br />
standard Haskell libraries, but it has seen some interesting uses<br />
recently, so we include it here for completeness.<br />
<br />
==Definition==<br />
<br />
The <code>Comonad</code> type class, defined in the <code>Control.Comonad</code> module of<br />
the [http://hackage.haskell.org/package/comonad comonad library], is:<br />
<br />
<haskell><br />
class Functor w => Comonad w where<br />
  extract :: w a -> a<br />
<br />
  duplicate :: w a -> w (w a)<br />
  duplicate = extend id<br />
<br />
  extend :: (w a -> b) -> w a -> w b<br />
  extend f = fmap f . duplicate<br />
</haskell><br />
<br />
As you can see, <code>extract</code> is the dual of <code>return</code>, <code>duplicate</code> is the dual of <code>join</code>, and <code>extend</code> is the dual of <code>(=<<)</code>. The definition of <code>Comonad</code> is a bit redundant: the programmer may implement either <code>extend</code> or <code>duplicate</code>, and the other then has a default implementation in terms of it.<br />
<br />
A prototypical example of a <code>Comonad</code> instance is:<br />
<br />
<haskell><br />
-- Infinite lazy streams<br />
data Stream a = Cons a (Stream a)<br />
<br />
-- 'duplicate' is like the list function 'tails'<br />
-- 'extend' computes a new Stream from an old, where the element<br />
-- at position n is computed as a function of everything from<br />
-- position n onwards in the old Stream<br />
instance Comonad Stream where<br />
  extract (Cons x _) = x<br />
  duplicate s@(Cons x xs) = Cons s (duplicate xs)<br />
  extend g s@(Cons x xs)  = Cons (g s) (extend g xs)<br />
                       -- = fmap g (duplicate s)<br />
</haskell><br />
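As a small usage sketch, <code>extend</code> applies a whole-stream function at every position. Since <code>Comonad</code> lives in the <code>comonad</code> package rather than <code>base</code>, the class is re-declared locally below, and <code>nats</code>, <code>takeS</code>, and <code>windowSum</code> are invented helper names:

```haskell
-- A local stand-in for the Comonad class, specialised to what the
-- example needs (the comonad package provides the real one).
class Functor w => Comonad w where
  extract :: w a -> a
  extend  :: (w a -> b) -> w a -> w b

data Stream a = Cons a (Stream a)

instance Functor Stream where
  fmap f (Cons x xs) = Cons (f x) (fmap f xs)

instance Comonad Stream where
  extract (Cons x _) = x
  extend g s@(Cons _ xs) = Cons (g s) (extend g xs)

-- The stream 0, 1, 2, ...
nats :: Stream Integer
nats = go 0 where go n = Cons n (go (n + 1))

-- Take a finite prefix of a stream as a list
takeS :: Int -> Stream a -> [a]
takeS n (Cons x xs)
  | n <= 0    = []
  | otherwise = x : takeS (n - 1) xs

-- Sum of the three elements starting at the current position
windowSum :: Num a => Stream a -> a
windowSum (Cons x (Cons y (Cons z _))) = x + y + z
```

Here <code>takeS 5 (extend windowSum nats)</code> gives <code>[3, 6, 9, 12, 15]</code>: each output element is the windowed sum computed from that position onward.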
<br />
==Further reading==<br />
<br />
Dan Piponi explains in a blog post what [http://blog.sigfpe.com/2006/12/evaluating-cellular-automata-is.html cellular automata have to do with comonads]. In another blog post, Conal Elliott has examined [http://conal.net/blog/posts/functional-interactive-behavior/ a comonadic formulation of functional reactive programming]. Sterling Clover’s blog post [http://fmapfixreturn.wordpress.com/2008/07/09/comonads-in-everyday-life/ Comonads in everyday life] explains the relationship between comonads and zippers, and how comonads can be used to design a menu system for a web site.<br />
<br />
Uustalu and Vene have a number of papers exploring ideas related to comonads and functional programming:<br />
* [http://dx.doi.org/10.1016/j.entcs.2008.05.029 Comonadic Notions of Computation]<br />
* [http://www.ioc.ee/~tarmo/papers/sfp01-book.pdf The dual of substitution is redecoration] (Also available as [http://www.cs.ut.ee/~varmo/papers/sfp01-book.ps.gz ps.gz].)<br />
* [http://dx.doi.org/10.1016/j.ic.2005.08.005 Recursive coalgebras from comonads]<br />
* [http://www.fing.edu.uy/~pardo/papers/njc01.ps.gz Recursion schemes from comonads]<br />
* [http://cs.ioc.ee/~tarmo/papers/essence.pdf The Essence of Dataflow Programming].<br />
<br />
Gabriel Gonzalez's [http://www.haskellforall.com/2013/02/you-could-have-invented-comonads.html Comonads are objects] points out similarities between comonads and object-oriented programming.<br />
<br />
The [http://hackage.haskell.org/package/comonad-transformers comonad-transformers] package contains comonad transformers.<br />
<br />
=Acknowledgements=<br />
<br />
A special thanks to all of those who taught me about standard Haskell<br />
type classes and helped me develop good intuition for them,<br />
particularly Jules Bean (quicksilver), Derek Elkins (ddarius), Conal<br />
Elliott (conal), Cale Gibbard (Cale), David House, Dan Piponi<br />
(sigfpe), and Kevin Reid (kpreid).<br />
<br />
I also thank the many people who provided a mountain of helpful<br />
feedback and suggestions on a first draft of the Typeclassopedia: David Amos,<br />
Kevin Ballard, Reid Barton, Doug Beardsley, Joachim Breitner, Andrew<br />
Cave, David Christiansen, Gregory Collins, Mark Jason Dominus, Conal<br />
Elliott, Yitz Gale, George Giorgidze, Steven Grady, Travis Hartwell,<br />
Steve Hicks, Philip Hölzenspies, Edward Kmett, Eric Kow, Serge Le<br />
Huitouze, Felipe Lessa, Stefan Ljungstrand, Eric Macaulay, Rob MacAulay, Simon Meier,<br />
Eric Mertens, Tim Newsham, Russell O’Connor, Conrad Parker, Walt<br />
Rorie-Baety, Colin Ross, Tom Schrijvers, Aditya Siram, C. Smith,<br />
Martijn van Steenbergen, Joe Thornber, Jared Updike, Rob Vollmert,<br />
Andrew Wagner, Louis Wasserman, and Ashley Yakeley, as well as a few<br />
only known to me by their IRC nicks: b_jonas, maltem, tehgeekmeister,<br />
and ziman. I have undoubtedly omitted a few inadvertently, which in<br />
no way diminishes my gratitude.<br />
<br />
Finally, I would like to thank Wouter Swierstra for his fantastic work<br />
editing the Monad.Reader, and my wife Joyia for her patience during<br />
the process of writing the Typeclassopedia.<br />
<br />
=About the author=<br />
<br />
Brent Yorgey ([http://byorgey.wordpress.com/ blog], [http://www.cis.upenn.edu/~byorgey/ homepage]) is (as of November 2011) a fourth-year Ph.D. student in the [http://www.cis.upenn.edu/~plclub/ programming languages group] at the [http://www.upenn.edu University of Pennsylvania]. He enjoys teaching, creating EDSLs, playing Bach fugues, musing upon category theory, and cooking tasty lambda-treats for the denizens of #haskell.<br />
<br />
=Colophon=<br />
<br />
The Typeclassopedia was written by Brent Yorgey and initially published in March 2009. Painstakingly converted to wiki syntax by [[User:Geheimdienst]] in November 2011, after asking Brent’s permission.<br />
<br />
If something like this TeX to wiki syntax conversion ever needs to be done again, here are some vim commands that helped:<br />
<br />
* <nowiki>%s/\\section{\([^}]*\)}/=\1=/gc</nowiki><br />
* <nowiki>%s/\\subsection{\([^}]*\)}/==\1==/gc</nowiki><br />
* <nowiki>%s/^ *\\item /\r* /gc</nowiki><br />
* <nowiki>%s/---/—/gc</nowiki><br />
* <nowiki>%s/\$\([^$]*\)\$/<math>\1\\ <\/math>/gc</nowiki> ''Appending “\ ” forces images to be rendered. Otherwise, Mediawiki would go back and forth between one font for short <nowiki><math></nowiki> tags, and another more Tex-like font for longer tags (containing more than a few characters).''<br />
* <nowiki>%s/|\([^|]*\)|/<code>\1<\/code>/gc</nowiki><br />
* <nowiki>%s/\\dots/.../gc</nowiki><br />
* <nowiki>%s/^\\label{.*$//gc</nowiki><br />
* <nowiki>%s/\\emph{\([^}]*\)}/''\1''/gc</nowiki><br />
* <nowiki>%s/\\term{\([^}]*\)}/''\1''/gc</nowiki><br />
<br />
The biggest issue was taking the academic-paper-style citations and turning them into hyperlinks with an appropriate title and an appropriate target. In most cases there was an obvious thing to do (e.g. online PDFs of the cited papers or CiteSeer entries). Sometimes, however, it’s less clear and you might want to check the<br />
[[Media:Typeclassopedia.pdf|original Typeclassopedia PDF]]<br />
with the<br />
[http://code.haskell.org/~byorgey/TMR/Issue13/typeclassopedia.bib original bibliography file].<br />
<br />
To get all the citations into the main text, I first tried processing the source with TeX or LyX. This didn’t work, due to missing packages that I couldn’t track down, syntax errors, and my general ineptitude with TeX.<br />
<br />
I then went for the next best solution, which seemed to be extracting all instances of “\cite{something}” from the source and ''in that order'' pulling the referenced entries from the .bib file. This way you can go through the source file and sorted-references file in parallel, copying over what you need, without searching back and forth in the .bib file. I used:<br />
<br />
* <nowiki>egrep -o "\cite\{[^\}]*\}" ~/typeclassopedia.lhs | cut -c 6- | tr "," "\n" | tr -d "}" > /tmp/citations</nowiki><br />
* <nowiki>for i in $(cat /tmp/citations); do grep -A99 "$i" ~/typeclassopedia.bib|egrep -B99 '^\}$' -m1 ; done > ~/typeclasso-refs-sorted</nowiki><br />
<br />
[[Category:Applicative Functor]]<br />
[[Category:Arrow]]<br />
[[Category:Functor]]<br />
[[Category:Monad]]<br />
[[Category:Standard classes]]<br />
[[Category:Standard libraries]]<br />
[[Category:Standard packages]]<br />
[[Category:Standard types]]</div>
<hr />
<div>''By [[User:Byorgey|Brent Yorgey]], byorgey@cis.upenn.edu''<br />
<br />
''Originally published 12 March 2009 in [http://www.haskell.org/wikiupload/8/85/TMR-Issue13.pdf issue 13] of [http://themonadreader.wordpress.com/ the Monad.Reader]. Ported to the Haskell wiki in November 2011 by [[User:Geheimdienst|Geheimdienst]].''<br />
<br />
''This is now the official version of the Typeclassopedia and supersedes the version published in the Monad.Reader. Please help update and extend it by editing it yourself or by leaving comments, suggestions, and questions on the [[Talk:Typeclassopedia|talk page]].''<br />
<br />
=Abstract=<br />
<br />
The standard Haskell libraries feature a number of type classes with algebraic or category-theoretic underpinnings. Becoming a fluent Haskell hacker requires intimate familiarity with them all, yet acquiring this familiarity often involves combing through a mountain of tutorials, blog posts, mailing list archives, and IRC logs.<br />
<br />
The goal of this document is to serve as a starting point for the student of Haskell wishing to gain a firm grasp of its standard type classes. The essentials of each type class are introduced, with examples, commentary, and extensive references for further reading.<br />
<br />
=Introduction=<br />
<br />
Have you ever had any of the following thoughts?<br />
* What the heck is a monoid, and how is it different from a mon<u>a</u>d?<br />
<br />
* I finally figured out how to use [[Parsec]] with do-notation, and someone told me I should use something called <code>Applicative</code> instead. Um, what?<br />
<br />
* Someone in the [[IRC channel|#haskell]] IRC channel used <code>(***)</code>, and when I asked Lambdabot to tell me its type, it printed out scary gobbledygook that didn’t even fit on one line! Then someone used <code>fmap fmap fmap</code> and my brain exploded.<br />
<br />
* When I asked how to do something I thought was really complicated, people started typing things like <code>zip.ap fmap.(id &&& wtf)</code> and the scary thing is that they worked! Anyway, I think those people must actually be robots because there’s no way anyone could come up with that in two seconds off the top of their head.<br />
<br />
If you have, look no further! You, too, can write and understand concise, elegant, idiomatic Haskell code with the best of them.<br />
<br />
There are two keys to an expert Haskell hacker’s wisdom:<br />
# Understand the types.<br />
# Gain a deep intuition for each type class and its relationship to other type classes, backed up by familiarity with many examples.<br />
<br />
It’s impossible to overstate the importance of the first; the patient student of type signatures will uncover many profound secrets. Conversely, anyone ignorant of the types in their code is doomed to eternal uncertainty. “Hmm, it doesn’t compile ... maybe I’ll stick in an<br />
<code>fmap</code> here ... nope, let’s see ... maybe I need another <code>(.)</code> somewhere? ... um ...”<br />
<br />
The second key—gaining deep intuition, backed by examples—is also important, but much more difficult to attain. A primary goal of this document is to set you on the road to gaining such intuition. However—<br />
<br />
:''There is no royal road to Haskell. {{h:title|Well, he probably would have said it if he knew Haskell.|—Euclid}}''<br />
<br />
This document can only be a starting point, since good intuition comes from hard work, [http://byorgey.wordpress.com/2009/01/12/abstraction-intuition-and-the-monad-tutorial-fallacy/ not from learning the right metaphor]. Anyone who reads and understands all of it will still have an arduous journey ahead—but sometimes a good starting point makes a big difference.<br />
<br />
It should be noted that this is not a Haskell tutorial; it is assumed that the reader is already familiar with the basics of Haskell, including the standard <code>[http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html Prelude]</code>, the type system, data types, and type classes.<br />
<br />
The type classes we will be discussing and their interrelationships:<br />
<br />
[[Image:Typeclassopedia-diagram.png]]<br />
<br />
{{note|<code>Semigroup</code> can be found in the [http://hackage.haskell.org/package/semigroups <code>semigroups</code> package], <code>Apply</code> in the [http://hackage.haskell.org/package/semigroupoids <code>semigroupoids</code> package], and <code>Comonad</code> in the [http://hackage.haskell.org/package/comonad <code>comonad</code> package].}}<br />
<br />
* <span style="border-bottom: 2px solid black">Solid arrows</span> point from the general to the specific; that is, if there is an arrow from <code>Foo</code> to <code>Bar</code> it means that every <code>Bar</code> is (or should be, or can be made into) a <code>Foo</code>.<br />
* <span style="border-bottom: 2px dotted black">Dotted arrows</span> indicate some other sort of relationship.<br />
* <code>Monad</code> and <code>ArrowApply</code> are equivalent.<br />
* <code>Semigroup</code>, <code>Apply</code> and <code>Comonad</code> are greyed out since they are not actually (yet?) in the standard Haskell libraries {{noteref}}.<br />
<br />
One more note before we begin. The original spelling of “type class” is with two words, as evidenced by, for example, the [http://www.haskell.org/onlinereport/haskell2010/ Haskell 2010 Language Report], early papers on type classes like [http://citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.103.5639 Type classes in Haskell] and [http://research.microsoft.com/en-us/um/people/simonpj/papers/type-class-design-space/ Type classes: exploring the design space], and [http://citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.168.4008 Hudak et al.’s history of Haskell]. However, as often happens with two-word phrases that see a lot of use, it has started to show up as one word (“typeclass”) or, rarely, hyphenated (“type-class”). When wearing my prescriptivist hat, I prefer “type class”, but realize (after changing into my descriptivist hat) that there's probably not much I can do about it.<br />
<br />
We now begin with the simplest type class of all: <code>Functor</code>.<br />
<br />
=Functor=<br />
<br />
The <code>Functor</code> class ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html#t:Functor haddock]) is the most basic and ubiquitous type class in the Haskell libraries. A simple intuition is that a <code>Functor</code> represents a “container” of some sort, along with the ability to apply a function uniformly to every element in the container. For example, a list is a container of elements, and we can apply a function to every element of a list, using <code>map</code>. As another example, a binary tree is also a container of elements, and it’s not hard to come up with a way to recursively apply a function to every element in a tree.<br />
<br />
Another intuition is that a <code>Functor</code> represents some sort of “computational context”. This intuition is generally more useful, but is more difficult to explain, precisely because it is so general. Some examples later should help to clarify the <code>Functor</code>-as-context point of view.<br />
<br />
In the end, however, a <code>Functor</code> is simply what it is defined to be; doubtless there are many examples of <code>Functor</code> instances that don’t exactly fit either of the above intuitions. The wise student will focus their attention on definitions and examples, without leaning too heavily on any particular metaphor. Intuition will come, in time, on its own.<br />
<br />
==Definition==<br />
<br />
Here is the type class declaration for <code>Functor</code>:<br />
<br />
<haskell><br />
class Functor f where<br />
fmap :: (a -> b) -> f a -> f b<br />
</haskell><br />
<br />
<code>Functor</code> is exported by the <code>Prelude</code>, so no special imports are needed to use it.<br />
<br />
First, the <code>f a</code> and <code>f b</code> in the type signature for <code>fmap</code> tell us that <code>f</code> isn’t just a type; it is a ''type constructor'' which takes another type as a parameter. (A more precise way to say this is that the ''kind'' of <code>f</code> must be <code>* -> *</code>.) For example, <code>Maybe</code> is such a type constructor: <code>Maybe</code> is not a type in and of itself, but requires another type as a parameter, like <code>Maybe Integer</code>. So it would not make sense to say <code>instance Functor Integer</code>, but it could make sense to say <code>instance Functor Maybe</code>.<br />
<br />
Now look at the type of <code>fmap</code>: it takes any function from <code>a</code> to <code>b</code>, and a value of type <code>f a</code>, and outputs a value of type <code>f b</code>. From the container point of view, the intention is that <code>fmap</code> applies a function to each element of a container, without altering the structure of the container. From the context point of view, the intention is that <code>fmap</code> applies a function to a value without altering its context. Let’s look at a few specific examples.<br />
<br />
==Instances==<br />
<br />
{{note|Recall that <code>[]</code> has two meanings in Haskell: it can either stand for the empty list, or, as here, it can represent the list type constructor (pronounced “list-of”). In other words, the type <code>[a]</code> (list-of-<code>a</code>) can also be written <code>[] a</code>.}}<br />
<br />
{{note|You might ask why we need a separate <code>map</code> function. Why not just do away with the current list-only <code>map</code> function, and rename <code>fmap</code> to <code>map</code> instead? Well, that’s a good question. The usual argument is that someone just learning Haskell, when using <code>map</code> incorrectly, would much rather see an error about lists than about <code>Functor</code>s.}}<br />
<br />
As noted before, the list constructor <code>[]</code> is a functor {{noteref}}; we can use the standard list function <code>map</code> to apply a function to each element of a list {{noteref}}. The <code>Maybe</code> type constructor is also a functor, representing a container which might hold a single element. The function <code>fmap g</code> has no effect on <code>Nothing</code> (there are no elements to which <code>g</code> can be applied), and simply applies <code>g</code> to the single element inside a <code>Just</code>. Alternatively, under the context interpretation, the list functor represents a context of nondeterministic choice; that is, a list can be thought of as representing a single value which is nondeterministically chosen from among several possibilities (the elements of the list). Likewise, the <code>Maybe</code> functor represents a context with possible failure. These instances are:<br />
<br />
<haskell><br />
instance Functor [] where<br />
fmap _ [] = []<br />
fmap g (x:xs) = g x : fmap g xs<br />
-- or we could just say fmap = map<br />
<br />
instance Functor Maybe where<br />
fmap _ Nothing = Nothing<br />
fmap g (Just a) = Just (g a)<br />
</haskell><br />
<br />
As an aside, in idiomatic Haskell code you will often see the letter <code>f</code> used to stand for both an arbitrary <code>Functor</code> and an arbitrary function. In this document, <code>f</code> represents only <code>Functor</code>s, and <code>g</code> or <code>h</code> always represent functions, but you should be aware of the potential confusion. In practice, what <code>f</code> stands for should always be clear from the context, by noting whether it is part of a type or part of the code.<br />
<br />
There are other <code>Functor</code> instances in the standard libraries; below are a few. Note that some of these instances are not exported by the <code>Prelude</code>; to access them, you can import <code>Control.Monad.Instances</code>.<br />
<br />
* <code>Either e</code> is an instance of <code>Functor</code>; <code>Either e a</code> represents a container which can contain either a value of type <code>a</code>, or a value of type <code>e</code> (often representing some sort of error condition). It is similar to <code>Maybe</code> in that it represents possible failure, but it can carry some extra information about the failure as well.<br />
<br />
* <code>((,) e)</code> represents a container which holds an “annotation” of type <code>e</code> along with the actual value it holds. It might be clearer to write it as <code>(e,)</code>, by analogy with an operator section like <code>(1+)</code>, but that syntax is not allowed in types (although it is allowed in expressions with the <code>TupleSections</code> extension enabled). However, you can certainly ''think'' of it as <code>(e,)</code>.<br />
<br />
* <code>((->) e)</code> (which can be thought of as <code>(e ->)</code>; see above), the type of functions which take a value of type <code>e</code> as a parameter, is a <code>Functor</code>. As a container, <code>(e -> a)</code> represents a (possibly infinite) set of values of <code>a</code>, indexed by values of <code>e</code>. Alternatively, and more usefully, <code>((->) e)</code> can be thought of as a context in which a value of type <code>e</code> is available to be consulted in a read-only fashion. This is also why <code>((->) e)</code> is sometimes referred to as the ''reader monad''; more on this later.<br />
<br />
* <code>IO</code> is a <code>Functor</code>; a value of type <code>IO a</code> represents a computation producing a value of type <code>a</code> which may have I/O effects. If <code>m</code> computes the value <code>x</code> while producing some I/O effects, then <code>fmap g m</code> will compute the value <code>g x</code> while producing the same I/O effects.<br />
<br />
* Many standard types from the [http://hackage.haskell.org/package/containers/ containers library] (such as <code>Tree</code>, <code>Map</code>, and <code>Sequence</code>) are instances of <code>Functor</code>. A notable exception is <code>Set</code>, which cannot be made a <code>Functor</code> in Haskell (although it is certainly a mathematical functor) since it requires an <code>Ord</code> constraint on its elements; <code>fmap</code> must be applicable to ''any'' types <code>a</code> and <code>b</code>. However, <code>Set</code> (and other similarly restricted data types) can be made an instance of a suitable generalization of <code>Functor</code>, either by [http://article.gmane.org/gmane.comp.lang.haskell.cafe/78052/ making <code>a</code> and <code>b</code> arguments to the <code>Functor</code> type class themselves], or by adding an [http://blog.omega-prime.co.uk/?p=127 associated constraint].<br />
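<br />
A few of these instances can be tried out directly. The following minimal sketch (with sample values chosen arbitrarily for illustration) shows <code>fmap</code> acting uniformly across several of them:<br />
<br />
```haskell
-- fmap at work on several of the instances described above.
main :: IO ()
main = do
  -- Either e: the function is applied only inside Right.
  print (fmap (+ 1) (Right 3 :: Either String Int))     -- Right 4
  print (fmap (+ 1) (Left "oops" :: Either String Int)) -- Left "oops"
  -- ((,) e): the annotation is left untouched.
  print (fmap (* 2) ("annotation", 10 :: Int))          -- ("annotation",20)
  -- ((->) e): fmap is function composition.
  print (fmap (+ 1) (* 10) (5 :: Int))                  -- 51
```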
<br />
{{Exercises|<br />
<ol><br />
<li>Implement <code>Functor</code> instances for <code>Either e</code> and <code>((->) e)</code>.</li><br />
<li>Implement <code>Functor</code> instances for <code>((,) e)</code> and for <code>Pair</code>, defined as <br />
<br />
<haskell>data Pair a = Pair a a</haskell><br />
<br />
Explain their similarities and differences.<br />
</li><br />
<li>Implement a <code>Functor</code> instance for the type <code>ITree</code>, defined as<br />
<br />
<haskell><br />
data ITree a = Leaf (Int -> a) <br />
| Node [ITree a]<br />
</haskell><br />
</li><br />
<li>Give an example of a type of kind <code>* -> *</code> which cannot be made an instance of <code>Functor</code> (without using <code>undefined</code>).<br />
</li><br />
<li>Is this statement true or false? <br />
<br />
:''The composition of two <code>Functor</code>s is also a <code>Functor</code>.''<br />
<br />
If false, give a counterexample; if true, prove it by exhibiting some appropriate Haskell code.<br />
</li><br />
</ol><br />
}}<br />
<br />
==Laws==<br />
<br />
As far as the Haskell language itself is concerned, the only requirement to be a <code>Functor</code> is an implementation of <code>fmap</code> with the proper type. Any sensible <code>Functor</code> instance, however, will also satisfy the ''functor laws'', which are part of the definition of a mathematical functor. There are two:<br />
<br />
<haskell><br />
fmap id = id<br />
fmap (g . h) = (fmap g) . (fmap h)<br />
</haskell><br />
<br />
{{note|Technically, these laws make <code>f</code> and <code>fmap</code> together an endofunctor on ''Hask'', the category of Haskell types (ignoring [[Bottom|&perp;]], which is a party pooper). See [http://en.wikibooks.org/wiki/Haskell/Category_theory Wikibook: Category theory].}}<br />
<br />
Together, these laws ensure that <code>fmap g</code> does not change the ''structure'' of a container, only the elements. Equivalently, and more simply, they ensure that <code>fmap g</code> changes a value without altering its context {{noteref}}.<br />
<br />
The first law says that mapping the identity function over every item in a container has no effect. The second says that mapping a composition of two functions over every item in a container is the same as first mapping one function, and then mapping the other.<br />
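<br />
The laws can be spot-checked at sample inputs; a handful of test cases is evidence, not a proof, but it is a useful sanity check. A minimal sketch for lists (sample values chosen arbitrarily):<br />
<br />
```haskell
-- Spot-checking the functor laws for lists at one sample input.
main :: IO ()
main = do
  let xs = [1, 2, 3] :: [Int]
      g  = (* 2)
      h  = (+ 10)
  print (fmap id xs == id xs)                      -- first law:  True
  print (fmap (g . h) xs == (fmap g . fmap h) xs)  -- second law: True
```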
<br />
As an example, the following code is a “valid” instance of <code>Functor</code> (it typechecks), but it violates the functor laws. Do you see why?<br />
<br />
<haskell><br />
-- Evil Functor instance<br />
instance Functor [] where<br />
fmap _ [] = []<br />
fmap g (x:xs) = g x : g x : fmap g xs<br />
</haskell><br />
<br />
Any Haskeller worth their salt would reject this code as a gruesome abomination.<br />
<br />
Unlike some other type classes we will encounter, a given type has at most one valid instance of <code>Functor</code>. This [http://article.gmane.org/gmane.comp.lang.haskell.libraries/15384 can be proven] via the [http://homepages.inf.ed.ac.uk/wadler/topics/parametricity.html#free ''free theorem''] for the type of <code>fmap</code>. In fact, [http://byorgey.wordpress.com/2010/03/03/deriving-pleasure-from-ghc-6-12-1/ GHC can automatically derive] <code>Functor</code> instances for many data types.<br />
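<br />
For example, with the <code>DeriveFunctor</code> extension GHC will write the (unique) lawful instance mechanically; the <code>Tree</code> type here is just an illustrative example:<br />
<br />
```haskell
{-# LANGUAGE DeriveFunctor #-}

-- GHC derives the unique lawful Functor instance for us.
data Tree a = Leaf a | Branch (Tree a) (Tree a)
  deriving (Show, Eq, Functor)

main :: IO ()
main = print (fmap (+ 1) (Branch (Leaf 1) (Leaf 2) :: Tree Int))
-- Branch (Leaf 2) (Leaf 3)
```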
<br />
{{note|Actually, if <code>seq</code>/<code>undefined</code> are considered, it [http://stackoverflow.com/a/8323243/305559 is possible] to have an implementation which satisfies the first law but not the second. The rest of the comments in this section should be considered in a context where <code>seq</code> and <code>undefined</code> are excluded.}}<br />
<br />
A [https://github.com/quchen/articles/blob/master/second_functor_law.md similar argument also shows] that any <code>Functor</code> instance satisfying the first law (<code>fmap id = id</code>) will automatically satisfy the second law as well. Practically, this means that only the first law needs to be checked (usually by a very straightforward induction) to ensure that a <code>Functor</code> instance is valid.{{noteref}}<br />
<br />
{{Exercises|<br />
# Although it is not possible for a <code>Functor</code> instance to satisfy the first <code>Functor</code> law but not the second (excluding <code>undefined</code>), the reverse is possible. Give an example of a (bogus) <code>Functor</code> instance which satisfies the second law but not the first.<br />
# Which laws are violated by the evil <code>Functor</code> instance for list shown above: both laws, or the first law alone? Give specific counterexamples.<br />
}}<br />
<br />
==Intuition==<br />
<br />
There are two fundamental ways to think about <code>fmap</code>. The first has already been mentioned: it takes two parameters, a function and a container, and applies the function “inside” the container, producing a new container. Alternately, we can think of <code>fmap</code> as applying a function to a value in a context (without altering the context).<br />
<br />
Just like all other Haskell functions of “more than one parameter”, however, <code>fmap</code> is actually ''curried'': it does not really take two parameters, but takes a single parameter and returns a function. For emphasis, we can write <code>fmap</code>’s type with extra parentheses: <code>fmap :: (a -> b) -> (f a -> f b)</code>. Written in this form, it is apparent that <code>fmap</code> transforms a “normal” function (<code>g :: a -> b</code>) into one which operates over containers/contexts (<code>fmap g :: f a -> f b</code>). This transformation is often referred to as a ''lift''; <code>fmap</code> “lifts” a function from the “normal world” into the “<code>f</code> world”.<br />
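<br />
To see the lifting point of view concretely, here is a sketch in which one function is lifted once and then reused at several different functors (the type signature on <code>lifted</code> keeps it polymorphic):<br />
<br />
```haskell
-- One function, lifted once, used in several contexts.
main :: IO ()
main = do
  let lifted :: Functor f => f Int -> f Int
      lifted = fmap (+ 1)
  print (lifted (Just 5))                        -- Just 6
  print (lifted [1, 2, 3])                       -- [2,3,4]
  print (lifted (Right 0 :: Either String Int))  -- Right 1
```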
<br />
==Further reading==<br />
<br />
A good starting point for reading about the category theory behind the concept of a functor is the excellent [http://en.wikibooks.org/wiki/Haskell/Category_theory Haskell wikibook page on category theory].<br />
<br />
=Applicative=<br />
<br />
A somewhat newer addition to the pantheon of standard Haskell type classes, ''applicative functors'' represent an abstraction lying in between <code>Functor</code> and <code>Monad</code> in expressivity, first described by McBride and Paterson. The title of their classic paper, [http://www.soi.city.ac.uk/~ross/papers/Applicative.html Applicative Programming with Effects], gives a hint at the intended intuition behind the [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Applicative.html <code>Applicative</code>] type class. It encapsulates certain sorts of “effectful” computations in a functionally pure way, and encourages an “applicative” programming style. Exactly what these things mean will be seen later.<br />
<br />
==Definition==<br />
<br />
Recall that <code>Functor</code> allows us to lift a “normal” function to a function on computational contexts. But <code>fmap</code> doesn’t allow us to apply a function which is itself in a context to a value in a context. <code>Applicative</code> gives us just such a tool, <code>(<*>)</code>. It also provides a method, <code>pure</code>, for embedding values in a default, “effect free” context. Here is the type class declaration for <code>Applicative</code>, as defined in <code>Control.Applicative</code>:<br />
<br />
<haskell><br />
class Functor f => Applicative f where<br />
pure :: a -> f a<br />
(<*>) :: f (a -> b) -> f a -> f b<br />
</haskell><br />
<br />
Note that every <code>Applicative</code> must also be a <code>Functor</code>. In fact, as we will see, <code>fmap</code> can be implemented using the <code>Applicative</code> methods, so every <code>Applicative</code> is a functor whether we like it or not; the <code>Functor</code> constraint forces us to be honest.<br />
<br />
{{note|Recall that <code>($)</code> is just function application: <code>f $ x {{=}} f x</code>.}}<br />
<br />
As always, it’s crucial to understand the type signatures. First, consider <code>(<*>)</code>: the best way of thinking about it comes from noting that the type of <code>(<*>)</code> is similar to the type of <code>($)</code> {{noteref}}, but with everything enclosed in an <code>f</code>. In other words, <code>(<*>)</code> is just function application within a computational context. The type of <code>(<*>)</code> is also very similar to the type of <code>fmap</code>; the only difference is that the first parameter is <code>f (a -> b)</code>, a function in a context, instead of a “normal” function <code>(a -> b)</code>.<br />
<br />
<code>pure</code> takes a value of any type <code>a</code>, and returns a context/container of type <code>f a</code>. The intention is that <code>pure</code> creates some sort of “default” container or “effect free” context. In fact, the behavior of <code>pure</code> is quite constrained by the laws it should satisfy in conjunction with <code>(<*>)</code>. Usually, for a given implementation of <code>(<*>)</code> there is only one possible implementation of <code>pure</code>.<br />
<br />
(Note that previous versions of the Typeclassopedia explained <code>pure</code> in terms of a type class <code>Pointed</code>, which can still be found in the [http://hackage.haskell.org/package/pointed <code>pointed</code> package]. However, the current consensus is that <code>Pointed</code> is not very useful after all. For a more detailed explanation, see [[Why not Pointed?]])<br />
<br />
==Laws==<br />
<br />
{{note|See<br />
[http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Applicative.html haddock for Applicative] and [http://www.soi.city.ac.uk/~ross/papers/Applicative.html Applicative programming with effects]}}<br />
<br />
Traditionally, there are four laws that <code>Applicative</code> instances should satisfy {{noteref}}. In some sense, they are all concerned with making sure that <code>pure</code> deserves its name:<br />
<br />
* The identity law:<br /><haskell>pure id <*> v = v</haskell><br />
* Homomorphism:<br /><haskell>pure f <*> pure x = pure (f x)</haskell>Intuitively, applying a non-effectful function to a non-effectful argument in an effectful context is the same as just applying the function to the argument and then injecting the result into the context with <code>pure</code>.<br />
* Interchange:<br /><haskell>u <*> pure y = pure ($ y) <*> u</haskell>Intuitively, this says that when evaluating the application of an effectful function to a pure argument, the order in which we evaluate the function and its argument doesn't matter.<br />
* Composition:<br /><haskell>u <*> (v <*> w) = pure (.) <*> u <*> v <*> w </haskell>This one is the trickiest law to gain intuition for. In some sense it is expressing a sort of associativity property of <code>(<*>)</code>. The reader may wish to simply convince themselves that this law is type-correct.<br />
<br />
Considered as left-to-right rewrite rules, the homomorphism, interchange, and composition laws actually constitute an algorithm for transforming any expression using <code>pure</code> and <code>(<*>)</code> into a canonical form with only a single use of <code>pure</code> at the very beginning and only left-nested occurrences of <code>(<*>)</code>. Composition allows reassociating <code>(<*>)</code>; interchange allows moving occurrences of <code>pure</code> leftwards; and homomorphism allows collapsing multiple adjacent occurrences of <code>pure</code> into one.<br />
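<br />
The four laws can be spot-checked at the <code>Maybe</code> instance; as before, checking a few arbitrarily chosen sample values is a sanity check, not a proof:<br />
<br />
```haskell
-- Spot-checking the Applicative laws for Maybe.
main :: IO ()
main = do
  let u = Just (+ 2)
      v = Just (* 3)
      w = Just (5 :: Int)
  print (pure id <*> w == w)                                 -- identity
  print ((pure (+ 1) <*> pure 4) == (pure 5 :: Maybe Int))   -- homomorphism
  print ((u <*> pure 7) == (pure ($ 7) <*> u))               -- interchange
  print ((u <*> (v <*> w)) == (pure (.) <*> u <*> v <*> w))  -- composition
```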
<br />
There is also a law specifying how <code>Applicative</code> should relate to <code>Functor</code>:<br />
<br />
<haskell><br />
fmap g x = pure g <*> x<br />
</haskell><br />
<br />
It says that mapping a pure function <code>g</code> over a context <code>x</code> is the same as first injecting <code>g</code> into a context with <code>pure</code>, and then applying it to <code>x</code> with <code>(<*>)</code>. In other words, we can decompose <code>fmap</code> into two more atomic operations: injection into a context, and application within a context. The <code>Control.Applicative</code> module also defines <code>(<$>)</code> as a synonym for <code>fmap</code>, so the above law can also be expressed as:<br />
<br />
<code>g <$> x = pure g <*> x</code>.<br />
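<br />
This law, too, can be spot-checked at sample values (arbitrarily chosen for illustration):<br />
<br />
```haskell
-- The law relating Functor and Applicative, at two sample instances.
main :: IO ()
main = do
  print (fmap (+ 1) (Just 5) == (pure (+ 1) <*> Just (5 :: Int)))    -- True
  print (((+ 1) <$> [1, 2, 3]) == (pure (+ 1) <*> [1, 2, 3 :: Int])) -- True
```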
<br />
{{Exercises|<br />
# (Tricky) One might imagine a variant of the interchange law that says something about applying a pure function to an effectful argument. Using the above laws, prove that<haskell>pure f <*> x = pure (flip ($)) <*> x <*> pure f</haskell><br />
}}<br />
<br />
==Instances==<br />
<br />
Most of the standard types which are instances of <code>Functor</code> are also instances of <code>Applicative</code>.<br />
<br />
<code>Maybe</code> can easily be made an instance of <code>Applicative</code>; writing such an instance is left as an exercise for the reader.<br />
<br />
The list type constructor <code>[]</code> can actually be made an instance of <code>Applicative</code> in two ways; essentially, it comes down to whether we want to think of lists as ordered collections of elements, or as contexts representing multiple results of a nondeterministic computation (see Wadler’s [http://www.springerlink.com/content/y7450255v2670167/ How to replace failure by a list of successes]).<br />
<br />
Let’s first consider the collection point of view. Since there can only be one instance of a given type class for any particular type, one or both of the list instances of <code>Applicative</code> need to be defined for a <code>newtype</code> wrapper; as it happens, the nondeterministic computation instance is the default, and the collection instance is defined in terms of a <code>newtype</code> called <code>ZipList</code>. This instance is:<br />
<br />
<haskell><br />
newtype ZipList a = ZipList { getZipList :: [a] }<br />
<br />
instance Applicative ZipList where<br />
pure = undefined -- exercise<br />
(ZipList gs) <*> (ZipList xs) = ZipList (zipWith ($) gs xs)<br />
</haskell><br />
<br />
To apply a list of functions to a list of inputs with <code>(<*>)</code>, we just match up the functions and inputs elementwise, and produce a list of the resulting outputs. In other words, we “zip” the lists together with function application, <code>($)</code>; hence the name <code>ZipList</code>. <br />
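<br />
The <code>ZipList</code> wrapper exported from <code>Control.Applicative</code> behaves like this (sample values chosen arbitrarily):<br />
<br />
```haskell
import Control.Applicative (ZipList (..))

-- Functions and arguments are matched up elementwise.
main :: IO ()
main = print (getZipList
               (ZipList [(+ 1), (* 2), subtract 3] <*> ZipList [10, 20, 30 :: Int]))
-- [11,40,27]
```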
<br />
The other <code>Applicative</code> instance for lists, based on the nondeterministic computation point of view, is:<br />
<br />
<haskell><br />
instance Applicative [] where<br />
pure x = [x]<br />
gs <*> xs = [ g x | g <- gs, x <- xs ]<br />
</haskell><br />
<br />
Instead of applying functions to inputs pairwise, we apply each function to all the inputs in turn, and collect all the results in a list.<br />
<br />
Now we can write nondeterministic computations in a natural style. To add the numbers <code>3</code> and <code>4</code> deterministically, we can of course write <code>(+) 3 4</code>. But suppose instead of <code>3</code> we have a nondeterministic computation that might result in <code>2</code>, <code>3</code>, or <code>4</code>; then we can write<br />
<br />
<haskell><br />
pure (+) <*> [2,3,4] <*> pure 4<br />
</haskell><br />
<br />
or, more idiomatically,<br />
<br />
<haskell><br />
(+) <$> [2,3,4] <*> pure 4<br />
</haskell><br />
<br />
There are several other <code>Applicative</code> instances as well:<br />
<br />
* <code>IO</code> is an instance of <code>Applicative</code>, and behaves exactly as you would think: to execute <code>m1 <*> m2</code>, first <code>m1</code> is executed, resulting in a function <code>f</code>, then <code>m2</code> is executed, resulting in a value <code>x</code>, and finally the value <code>f x</code> is returned as the result of executing <code>m1 <*> m2</code>.<br />
<br />
* <code>((,) a)</code> is an <code>Applicative</code>, as long as <code>a</code> is an instance of <code>Monoid</code> ([[#Monoid|section Monoid]]). The <code>a</code> values are accumulated in parallel with the computation.<br />
<br />
* The <code>Applicative</code> module defines the <code>Const</code> type constructor; a value of type <code>Const a b</code> simply contains an <code>a</code>. This is an instance of <code>Applicative</code> for any <code>Monoid a</code>; this instance becomes especially useful in conjunction with things like <code>Foldable</code> ([[#Foldable|section Foldable]]).<br />
<br />
* The <code>WrappedMonad</code> and <code>WrappedArrow</code> newtypes make any instances of <code>Monad</code> ([[#Monad|section Monad]]) or <code>Arrow</code> ([[#Arrow|section Arrow]]) respectively into instances of <code>Applicative</code>; as we will see when we study those type classes, both are strictly more expressive than <code>Applicative</code>, in the sense that the <code>Applicative</code> methods can be implemented in terms of their methods.<br />
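<br />
The <code>((,) a)</code> instance can be made concrete with a small sketch of our own (the name <code>pairExample</code> is not from any library): the monoidal first components are accumulated with <code>mappend</code>, while the function in the second component is applied.<br />
<br />
<haskell><br />
pairExample :: (String, Int)<br />
pairExample = ("Hello, ", (+1)) <*> ("world", 3)<br />
-- ("Hello, world", 4): the strings are mappended, (+1) is applied to 3<br />
</haskell><br />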
<br />
{{Exercises|<br />
# Implement an instance of <code>Applicative</code> for <code>Maybe</code>.<br />
# Determine the correct definition of <code>pure</code> for the <code>ZipList</code> instance of <code>Applicative</code>—there is only one implementation that satisfies the law relating <code>pure</code> and <code>(<*>)</code>.<br />
}}<br />
<br />
==Intuition==<br />
<br />
McBride and Paterson’s paper introduces the notation <math>[[g \; x_1 \; x_2 \; \cdots \; x_n]]\ </math> to denote function application in a computational context. If each <math>x_i\ </math> has type <math>f \; t_i\ </math> for some applicative functor <math>f\ </math>, and <math>g\ </math> has type <math>t_1 \to t_2 \to \dots \to t_n \to t\ </math>, then the entire expression <math>[[g \; x_1 \; \cdots \; x_n]]\ </math> has type <math>f \; t\ </math>. You can think of this as applying a function to multiple “effectful” arguments. In this sense, the double bracket notation is a generalization of <code>fmap</code>, which allows us to apply a function to a single argument in a context.<br />
<br />
Why do we need <code>Applicative</code> to implement this generalization of <code>fmap</code>? Suppose we use <code>fmap</code> to apply <code>g</code> to the first parameter <code>x1</code>. Then we get something of type <code>f (t2 -> ... -> t)</code>, but now we are stuck: we can’t apply this function-in-a-context to the next argument with <code>fmap</code>. However, this is precisely what <code>(<*>)</code> allows us to do.<br />
<br />
This suggests the proper translation of the idealized notation <math>[[g \; x_1 \; x_2 \; \cdots \; x_n]]\ </math> into Haskell, namely<br />
<haskell><br />
g <$> x1 <*> x2 <*> ... <*> xn,<br />
</haskell><br />
<br />
recalling that <code>Control.Applicative</code> defines <code>(<$>)</code> as convenient infix shorthand for <code>fmap</code>. This is what is meant by an “applicative style”—effectful computations can still be described in terms of function application; the only difference is that we have to use the special operator <code>(<*>)</code> for application instead of simple juxtaposition.<br />
<br />
Note that <code>pure</code> allows embedding “non-effectful” arguments in the middle of an idiomatic application, like<br />
<haskell><br />
g <$> x1 <*> pure x2 <*> x3<br />
</haskell><br />
which has type <code>f d</code>, given<br />
<haskell><br />
g :: a -> b -> c -> d<br />
x1 :: f a<br />
x2 :: b<br />
x3 :: f c<br />
</haskell><br />
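<br />
Instantiating this pattern at <code>f = Maybe</code> with a small example function of our own:<br />
<br />
<haskell><br />
g :: Int -> Int -> Int -> Int<br />
g a b c = a + b * c<br />
<br />
-- x2 = 2 is a plain, "non-effectful" argument, embedded with pure:<br />
result :: Maybe Int<br />
result = g <$> Just 1 <*> pure 2 <*> Just 3   -- Just 7<br />
</haskell><br />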
<br />
The double brackets are commonly known as “idiom brackets”, because they allow writing “idiomatic” function application, that is, function application that looks normal but has some special, non-standard meaning (determined by the particular instance of <code>Applicative</code> being used). Idiom brackets are not supported by GHC, but they are supported by the [http://personal.cis.strath.ac.uk/~conor/pub/she/ Strathclyde Haskell Enhancement], a preprocessor which (among many other things) translates idiom brackets into standard uses of <code>(<$>)</code> and <code>(<*>)</code>. This can result in much more readable code when making heavy use of <code>Applicative</code>.<br />
<br />
==Alternative formulation==<br />
<br />
An alternative, equivalent formulation of <code>Applicative</code> is given by<br />
<br />
<haskell><br />
class Functor f => Monoidal f where<br />
  unit :: f ()<br />
  (**) :: f a -> f b -> f (a,b)<br />
</haskell><br />
<br />
{{note|In category-theory speak, we say <code>f</code> is a ''lax'' monoidal functor because there aren't necessarily functions in the other direction, like <code>f (a, b) -> (f a, f b)</code>.}}<br />
Intuitively, this states that a <i>monoidal</i> functor{{noteref}} is one which has some sort of "default shape" and which supports some sort of "combining" operation. <code>pure</code> and <code>(<*>)</code> are equivalent in power to <code>unit</code> and <code>(**)</code> (see the Exercises below). More technically, the idea is that <code>f</code> preserves the "monoidal structure" given by the pairing constructor <code>(,)</code> and unit type <code>()</code>. This can be seen even more clearly if we rewrite the types of <code>unit</code> and <code>(**)</code> as<br />
<haskell><br />
unit' :: () -> f ()<br />
(**') :: (f a, f b) -> f (a, b)<br />
</haskell><br />
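<br />
To make the "combining" intuition concrete, here is a sketch of a <code>Monoidal</code> instance for <code>Maybe</code> (we hide the Prelude's exponentiation operator, which the class method <code>(**)</code> shadows; the class is repeated so the fragment is self-contained):<br />
<br />
<haskell><br />
import Prelude hiding ((**))<br />
<br />
class Functor f => Monoidal f where<br />
  unit :: f ()<br />
  (**) :: f a -> f b -> f (a,b)<br />
<br />
instance Monoidal Maybe where<br />
  unit             = Just ()<br />
  Just x ** Just y = Just (x, y)   -- combine two successes pairwise<br />
  _      ** _      = Nothing       -- any failure poisons the result<br />
</haskell><br />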
<br />
Furthermore, to deserve the name "monoidal" (see the [[#Monoid|section on Monoids]]), instances of <code>Monoidal</code> ought to satisfy the following laws, which seem much more straightforward than the traditional <code>Applicative</code> laws:<br />
<br />
{{note|In this and the following laws, <code>≅</code> refers to <i>isomorphism</i> rather than equality. In particular we consider <code>(x,()) ≅ x ≅ ((),x)</code> and <code>((x,y),z) ≅ (x,(y,z))</code>.}}<br />
* Left identity{{noteref}}: <haskell>unit ** v ≅ v</haskell><br />
* Right identity: <haskell>u ** unit ≅ u</haskell><br />
* Associativity: <haskell>u ** (v ** w) ≅ (u ** v) ** w</haskell><br />
<br />
These turn out to be equivalent to the usual <code>Applicative</code> laws. In a category theory setting, one would also require a naturality law:<br />
<br />
{{note|Here <code>g *** h {{=}} \(x,y) -> (g x, h y)</code>. See [[#Arrow|Arrows]].}}<br />
* Naturality: <haskell>fmap (g *** h) (u ** v) = fmap g u ** fmap h v</haskell><br />
<br />
but in the context of Haskell, this is a free theorem.<br />
<br />
Much of this section was taken from [http://blog.ezyang.com/2012/08/applicative-functors/ a blog post by Edward Z. Yang]; see his actual post for a bit more information.<br />
<br />
{{Exercises|<br />
# Implement <code>pure</code> and <code>(<*>)</code> in terms of <code>unit</code> and <code>(**)</code>, and vice versa.<br />
# Are there any <code>Applicative</code> instances for which there are also functions <code>f () -> ()</code> and <code>f (a,b) -> (f a, f b)</code>, satisfying some "reasonable" laws?<br />
# (Tricky) Prove that given your implementations from the previous exercise, the usual <code>Applicative</code> laws and the <code>Monoidal</code> laws stated above are equivalent.<br />
}}<br />
<br />
==Further reading==<br />
<br />
There are many other useful combinators in the standard libraries implemented in terms of <code>pure</code> and <code>(<*>)</code>: for example, <code>(*>)</code>, <code>(<*)</code>, <code>(<**>)</code>, <code>(<$)</code>, and so on (see [http://www.haskell.org/ghc/docs/latest/html/libraries/base-4.7.0.0/Control-Applicative.html haddock for Applicative]). Judicious use of such secondary combinators can often make code using <code>Applicative</code>s much easier to read.<br />
<br />
[http://www.soi.city.ac.uk/~ross/papers/Applicative.html McBride and Paterson’s original paper] is a treasure-trove of information and examples, as well as some perspectives on the connection between <code>Applicative</code> and category theory. Beginners will find it difficult to make it through the entire paper, but it is extremely well-motivated—even beginners will be able to glean something from reading as far as they are able.<br />
<br />
{{note|Introduced by [http://conal.net/papers/simply-reactive/ an earlier paper] that was since superseded by [http://conal.net/papers/push-pull-frp/ Push-pull functional reactive programming].}}<br />
<br />
Conal Elliott has been one of the biggest proponents of <code>Applicative</code>. For example, the [http://conal.net/papers/functional-images/ Pan library for functional images] and the reactive library for functional reactive programming (FRP) {{noteref}} make key use of it; his blog also contains [http://conal.net/blog/tag/applicative-functor many examples of <code>Applicative</code> in action]. Building on the work of McBride and Paterson, Elliott also built the [[TypeCompose]] library, which embodies the observation (among others) that <code>Applicative</code> types are closed under composition; therefore, <code>Applicative</code> instances can often be automatically derived for complex types built out of simpler ones.<br />
<br />
Although the [http://hackage.haskell.org/package/parsec Parsec parsing library] ([http://legacy.cs.uu.nl/daan/download/papers/parsec-paper.pdf paper]) was originally designed for use as a monad, in its most common use cases an <code>Applicative</code> instance can be used to great effect; [http://www.serpentine.com/blog/2008/02/06/the-basics-of-applicative-functors-put-to-practical-work/ Bryan O’Sullivan’s blog post] is a good starting point. If the extra power provided by <code>Monad</code> isn’t needed, it’s usually a good idea to use <code>Applicative</code> instead.<br />
<br />
A couple other nice examples of <code>Applicative</code> in action include the [http://web.archive.org/web/20090416111947/chrisdone.com/blog/html/2009-02-10-applicative-configfile-hsql.html ConfigFile and HSQL libraries] and the [http://groups.inf.ed.ac.uk/links/formlets/ formlets library].<br />
<br />
Gershom Bazerman's [http://comonad.com/reader/2012/abstracting-with-applicatives/ post] contains many insights into applicatives.<br />
<br />
=Monad=<br />
<br />
It’s a safe bet that if you’re reading this, you’ve heard of monads—although it’s quite possible you’ve never heard of <code>Applicative</code> before, or <code>Arrow</code>, or even <code>Monoid</code>. Why are monads such a big deal in Haskell? There are several reasons.<br />
<br />
* Haskell does, in fact, single out monads for special attention by making them the framework in which to construct I/O operations.<br />
* Haskell also singles out monads for special attention by providing a special syntactic sugar for monadic expressions: the <code>do</code>-notation.<br />
* <code>Monad</code> has been around longer than other abstract models of computation such as <code>Applicative</code> or <code>Arrow</code>.<br />
* The more monad tutorials there are, the harder people think monads must be, and the more new monad tutorials are written by people who think they finally “get” monads (the [http://byorgey.wordpress.com/2009/01/12/abstraction-intuition-and-the-monad-tutorial-fallacy/ monad tutorial fallacy]).<br />
<br />
I will let you judge for yourself whether these are good reasons.<br />
<br />
In the end, despite all the hoopla, <code>Monad</code> is just another type class. Let’s take a look at its definition.<br />
<br />
==Definition==<br />
<br />
The type class declaration for [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html#t:Monad <code>Monad</code>] is:<br />
<br />
<haskell><br />
class Monad m where<br />
  return :: a -> m a<br />
  (>>=)  :: m a -> (a -> m b) -> m b<br />
  (>>)   :: m a -> m b -> m b<br />
  m >> n = m >>= \_ -> n<br />
<br />
  fail   :: String -> m a<br />
</haskell><br />
<br />
The <code>Monad</code> type class is exported by the <code>Prelude</code>, along with a few standard instances. However, many utility functions are found in [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad.html <code>Control.Monad</code>], and there are also several instances (such as <code>((->) e)</code>) defined in [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad-Instances.html <code>Control.Monad.Instances</code>].<br />
<br />
{{note|However, as of GHC 7.10 this will be fixed!}}<br />
Let’s examine the methods in the <code>Monad</code> class one by one. The type of <code>return</code> should look familiar; it’s the same as <code>pure</code>. Indeed, <code>return</code> ''is'' <code>pure</code>, but with an unfortunate name. (Unfortunate, since someone coming from an imperative programming background might think that <code>return</code> is like the C or Java keyword of the same name, when in fact the similarities are minimal.) From a mathematical point of view, every monad is an applicative functor, but for historical reasons, the <code>Monad</code> type class declaration unfortunately does not require this.{{noteref}}<br />
<br />
We can see that <code>(>>)</code> is a specialized version of <code>(>>=)</code>, with a default implementation given. It is only included in the type class declaration so that specific instances of <code>Monad</code> can override the default implementation of <code>(>>)</code> with a more efficient one, if desired. Also, note that although <code>_ >> n = n</code> would be a type-correct implementation of <code>(>>)</code>, it would not correspond to the intended semantics: the intention is that <code>m >> n</code> ignores the ''result'' of <code>m</code>, but not its ''effects''.<br />
<br />
The <code>fail</code> function is an awful hack that has no place in the <code>Monad</code> class; more on this later.<br />
<br />
The only really interesting thing to look at—and what makes <code>Monad</code> strictly more powerful than <code>Applicative</code>—is <code>(>>=)</code>, which is often called ''bind''. An alternative definition of <code>Monad</code> could look like:<br />
<br />
<haskell><br />
class Applicative m => Monad' m where<br />
  (>>=) :: m a -> (a -> m b) -> m b<br />
</haskell><br />
<br />
We could spend a while talking about the intuition behind <code>(>>=)</code>—and we will. But first, let’s look at some examples.<br />
<br />
==Instances==<br />
<br />
Even if you don’t understand the intuition behind the <code>Monad</code> class, you can still create instances of it by just seeing where the types lead you. You may be surprised to find that this actually gets you a long way towards understanding the intuition; at the very least, it will give you some concrete examples to play with as you read more about the <code>Monad</code> class in general. The first few examples are from the standard <code>Prelude</code>; the remaining examples are from the [http://hackage.haskell.org/package/transformers <code>transformers</code> package].<br />
<br />
<ul><br />
<li>The simplest possible instance of <code>Monad</code> is [http://hackage.haskell.org/packages/archive/mtl/1.1.0.2/doc/html/Control-Monad-Identity.html <code>Identity</code>], which is described in Dan Piponi’s highly recommended blog post on [http://blog.sigfpe.com/2007/04/trivial-monad.html The Trivial Monad]. Despite being “trivial”, it is a great introduction to the <code>Monad</code> type class, and contains some good exercises to get your brain working.<br />
</li><br />
<li>The next simplest instance of <code>Monad</code> is <code>Maybe</code>. We already know how to write <code>return</code>/<code>pure</code> for <code>Maybe</code>. So how do we write <code>(>>=)</code>? Well, let’s think about its type. Specializing for <code>Maybe</code>, we have<br />
<br />
<haskell><br />
(>>=) :: Maybe a -> (a -> Maybe b) -> Maybe b<br />
</haskell><br />
<br />
If the first argument to <code>(>>=)</code> is <code>Just x</code>, then we have something of type <code>a</code> (namely, <code>x</code>), to which we can apply the second argument—resulting in a <code>Maybe b</code>, which is exactly what we wanted. What if the first argument to <code>(>>=)</code> is <code>Nothing</code>? In that case, we don’t have anything to which we can apply the <code>a -> Maybe b</code> function, so there’s only one thing we can do: yield <code>Nothing</code>. This instance is:<br />
<br />
<haskell><br />
instance Monad Maybe where<br />
  return = Just<br />
  (Just x) >>= g = g x<br />
  Nothing  >>= _ = Nothing<br />
</haskell><br />
<br />
We can already get a bit of intuition as to what is going on here: if we build up a computation by chaining together a bunch of functions with <code>(>>=)</code>, as soon as any one of them fails, the entire computation will fail (because <code>Nothing >>= f</code> is <code>Nothing</code>, no matter what <code>f</code> is). The entire computation succeeds only if all the constituent functions individually succeed. So the <code>Maybe</code> monad models computations which may fail.<br />
</li><br />
<br />
<li>The <code>Monad</code> instance for the list constructor <code>[]</code> is similar to its <code>Applicative</code> instance; see the exercise below.<br />
</li><br />
<br />
<li>Of course, the <code>IO</code> constructor is famously a <code>Monad</code>, but its implementation is somewhat magical, and may in fact differ from compiler to compiler. It is worth emphasizing that the <code>IO</code> monad is the ''only'' monad which is magical. It allows us to build up, in an entirely pure way, values representing possibly effectful computations. The special value <code>main</code>, of type <code>IO ()</code>, is taken by the runtime and actually executed, producing actual effects. Every other monad is functionally pure, and requires no special compiler support. We often speak of monadic values as “effectful computations”, but this is because some monads allow us to write code ''as if'' it has side effects, when in fact the monad is hiding the plumbing which allows these apparent side effects to be implemented in a functionally pure way.<br />
</li><br />
<br />
<li>As mentioned earlier, <code>((->) e)</code> is known as the ''reader monad'', since it describes computations in which a value of type <code>e</code> is available as a read-only environment.<br />
<br />
The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-Reader.html <code>Control.Monad.Reader</code>] module provides the <code>Reader e a</code> type, which is just a convenient <code>newtype</code> wrapper around <code>(e -> a)</code>, along with an appropriate <code>Monad</code> instance and some <code>Reader</code>-specific utility functions such as <code>ask</code> (retrieve the environment), <code>asks</code> (retrieve a function of the environment), and <code>local</code> (run a subcomputation under a different environment).<br />
</li><br />
<br />
<li>The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-Writer-Lazy.html <code>Control.Monad.Writer</code>] module provides the <code>Writer</code> monad, which allows information to be collected as a computation progresses. <code>Writer w a</code> is isomorphic to <code>(a,w)</code>, where the output value <code>a</code> is carried along with an annotation or “log” of type <code>w</code>, which must be an instance of <code>Monoid</code> (see [[#Monoid|section Monoid]]); the special function <code>tell</code> performs logging.<br />
</li><br />
<br />
<li>The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-State-Lazy.html <code>Control.Monad.State</code>] module provides the <code>State s a</code> type, a <code>newtype</code> wrapper around <code>s -> (a,s)</code>. Something of type <code>State s a</code> represents a stateful computation which produces an <code>a</code> but can access and modify the state of type <code>s</code> along the way. The module also provides <code>State</code>-specific utility functions such as <code>get</code> (read the current state), <code>gets</code> (read a function of the current state), <code>put</code> (overwrite the state), and <code>modify</code> (apply a function to the state).<br />
</li><br />
<br />
<li>The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-Cont.html <code>Control.Monad.Cont</code>] module provides the <code>Cont</code> monad, which represents computations in continuation-passing style. It can be used to suspend and resume computations, and to implement non-local transfers of control, co-routines, and other complex control structures—all in a functionally pure way. <code>Cont</code> has been called the [http://blog.sigfpe.com/2008/12/mother-of-all-monads.html “mother of all monads”] because of its universal properties.<br />
</li><br />
</ul><br />
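<br />
The failure-propagation behaviour of the <code>Maybe</code> instance is easy to observe with a small sketch (<code>safeDiv</code> is a hypothetical helper, not a standard function):<br />
<br />
<haskell><br />
safeDiv :: Int -> Int -> Maybe Int<br />
safeDiv _ 0 = Nothing<br />
safeDiv x y = Just (x `div` y)<br />
<br />
ok, bad :: Maybe Int<br />
ok  = Just 12 >>= safeDiv 100 >>= safeDiv 60   -- Just 7<br />
bad = Just 0  >>= safeDiv 100 >>= safeDiv 60   -- Nothing: the first division fails<br />
</haskell><br />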
<br />
{{Exercises|<br />
<ol><br />
<li>Implement a <code>Monad</code> instance for the list constructor, <code>[]</code>. Follow the types!</li><br />
<li>Implement a <code>Monad</code> instance for <code>((->) e)</code>.</li><br />
<li>Implement <code>Functor</code> and <code>Monad</code> instances for <code>Free f</code>, defined as<br />
<haskell><br />
data Free f a = Var a<br />
              | Node (f (Free f a))<br />
</haskell><br />
You may assume that <code>f</code> has a <code>Functor</code> instance. This is known as the ''free monad'' built from the functor <code>f</code>.<br />
</li><br />
</ol><br />
}}<br />
<br />
==Intuition==<br />
<br />
Let’s look more closely at the type of <code>(>>=)</code>. The basic intuition is that it combines two computations into one larger computation. The first argument, <code>m a</code>, is the first computation. However, it would be boring if the second argument were just an <code>m b</code>; then there would be no way for the computations to interact with one another (actually, this is exactly the situation with <code>Applicative</code>). So, the second argument to <code>(>>=)</code> has type <code>a -> m b</code>: a function of this type, given a ''result'' of the first computation, can produce a second computation to be run. In other words, <code>x >>= k</code> is a computation which runs <code>x</code>, and then uses the result(s) of <code>x</code> to ''decide'' what computation to run second, using the output of the second computation as the result of the entire computation.<br />
<br />
{{note|Actually, because Haskell allows general recursion, this is a lie: using a Haskell parsing library one can recursively construct ''infinite'' grammars, and hence <code>Applicative</code> (together with <code>Alternative</code>) is enough to parse any context-sensitive language with a finite alphabet. See [http://byorgey.wordpress.com/2012/01/05/parsing-context-sensitive-languages-with-applicative/ Parsing context-sensitive languages with Applicative].}}<br />
Intuitively, it is this ability to use the output from previous computations to decide what computations to run next that makes <code>Monad</code> more powerful than <code>Applicative</code>. The structure of an <code>Applicative</code> computation is fixed, whereas the structure of a <code>Monad</code> computation can change based on intermediate results. This also means that parsers built using an <code>Applicative</code> interface can only parse context-free languages; in order to parse context-sensitive languages a <code>Monad</code> interface is needed.{{noteref}}<br />
<br />
To see the increased power of <code>Monad</code> from a different point of view, let’s see what happens if we try to implement <code>(>>=)</code> in terms of <code>fmap</code>, <code>pure</code>, and <code>(<*>)</code>. We are given a value <code>x</code> of type <code>m a</code>, and a function <code>k</code> of type <code>a -> m b</code>, so the only thing we can do is apply <code>k</code> to <code>x</code>. We can’t apply it directly, of course; we have to use <code>fmap</code> to lift it over the <code>m</code>. But what is the type of <code>fmap k</code>? Well, it’s <code>m a -> m (m b)</code>. So after we apply it to <code>x</code>, we are left with something of type <code>m (m b)</code>—but now we are stuck; what we really want is an <code>m b</code>, but there’s no way to get there from here. We can ''add'' <code>m</code>’s using <code>pure</code>, but we have no way to ''collapse'' multiple <code>m</code>’s into one.<br />
<br />
{{note|1=You might hear some people claim that the definition in terms of <code>return</code>, <code>fmap</code>, and <code>join</code> is the “math definition” and the definition in terms of <code>return</code> and <code>(>>=)</code> is something specific to Haskell. In fact, both definitions were known in the mathematics community long before Haskell picked up monads.}}<br />
<br />
This ability to collapse multiple <code>m</code>’s is exactly the ability provided by the function <code>join :: m (m a) -> m a</code>, and it should come as no surprise that an alternative definition of <code>Monad</code> can be given in terms of <code>join</code>:<br />
<br />
<haskell><br />
class Applicative m => Monad'' m where<br />
  join :: m (m a) -> m a<br />
</haskell><br />
<br />
In fact, the canonical definition of monads in category theory is in terms of <code>return</code>, <code>fmap</code>, and <code>join</code> (often called <math>\eta</math>, <math>T</math>, and <math>\mu</math> in the mathematical literature). Haskell uses an alternative formulation with <code>(>>=)</code> instead of <code>join</code> since it is more convenient to use {{noteref}}. However, sometimes it can be easier to think about <code>Monad</code> instances in terms of <code>join</code>, since it is a more “atomic” operation. (For example, <code>join</code> for the list monad is just <code>concat</code>.)<br />
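<br />
A couple of concrete uses of <code>join</code> (a sketch; the names are ours):<br />
<br />
<haskell><br />
import Control.Monad (join)<br />
<br />
flat :: [Int]<br />
flat = join [[1,2],[3]]            -- [1,2,3], the same as concat<br />
<br />
collapsed :: Maybe Int<br />
collapsed = join (Just (Just 3))   -- Just 3<br />
</haskell><br />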
<br />
{{Exercises|<br />
# Implement <code>(>>{{=}})</code> in terms of <code>fmap</code> (or <code>liftM</code>) and <code>join</code>.<br />
# Now implement <code>join</code> and <code>fmap</code> (<code>liftM</code>) in terms of <code>(>>{{=}})</code> and <code>return</code>.<br />
}}<br />
<br />
==Utility functions==<br />
<br />
The [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad.html <code>Control.Monad</code>] module provides a large number of convenient utility functions, all of which can be implemented in terms of the basic <code>Monad</code> operations (<code>return</code> and <code>(>>=)</code> in particular). We have already seen one of them, namely, <code>join</code>. We also mention some other noteworthy ones here; implementing these utility functions oneself is a good exercise. For a more detailed guide to these functions, with commentary and example code, see Henk-Jan van Tuyl’s [http://members.chello.nl/hjgtuyl/tourdemonad.html tour].<br />
<br />
{{note|This will most likely change in Haskell 2014 with the implementation of the [[Functor-Applicative-Monad_Proposal|Haskell 2014 Applicative => Monad proposal]].}}<br />
<br />
* <code>liftM :: Monad m => (a -> b) -> m a -> m b</code>. This should be familiar; of course, it is just <code>fmap</code>. The fact that we have both <code>fmap</code> and <code>liftM</code> is an unfortunate consequence of the fact that the <code>Monad</code> type class does not require a <code>Functor</code> instance, even though mathematically speaking, every monad is a functor. However, <code>fmap</code> and <code>liftM</code> are essentially interchangeable, since it is a bug (in a social rather than technical sense) for any type to be an instance of <code>Monad</code> without also being an instance of <code>Functor</code> {{noteref}}.<br />
<br />
* <code>ap :: Monad m => m (a -> b) -> m a -> m b</code> should also be familiar: it is equivalent to <code>(<*>)</code>, justifying the claim that the <code>Monad</code> interface is strictly more powerful than <code>Applicative</code>. We can make any <code>Monad</code> into an instance of <code>Applicative</code> by setting <code>pure = return</code> and <code>(<*>) = ap</code>.<br />
<br />
* <code>sequence :: Monad m => [m a] -> m [a]</code> takes a list of computations and combines them into one computation which collects a list of their results. It is again something of a historical accident that <code>sequence</code> has a <code>Monad</code> constraint, since it can actually be implemented only in terms of <code>Applicative</code>. There is an additional generalization of <code>sequence</code> to structures other than lists, which will be discussed in the [[#Traversable|section on <code>Traversable</code>]].<br />
<br />
* <code>replicateM :: Monad m => Int -> m a -> m [a]</code> is simply a combination of [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html#v:replicate <code>replicate</code>] and <code>sequence</code>.<br />
<br />
* <code>when :: Monad m => Bool -> m () -> m ()</code> conditionally executes a computation, evaluating to its second argument if the test is <code>True</code>, and to <code>return ()</code> if the test is <code>False</code>. A collection of other sorts of monadic conditionals can be found in the [http://hackage.haskell.org/package/IfElse <code>IfElse</code> package].<br />
<br />
* <code>mapM :: Monad m => (a -> m b) -> [a] -> m [b]</code> maps its first argument over the second, and <code>sequence</code>s the results. The <code>forM</code> function is just <code>mapM</code> with its arguments reversed; it is called <code>forM</code> since it models generalized <code>for</code> loops: the list <code>[a]</code> provides the loop indices, and the function <code>a -> m b</code> specifies the “body” of the loop for each index.<br />
<br />
* <code>(=<<) :: Monad m => (a -> m b) -> m a -> m b</code> is just <code>(>>=)</code> with its arguments reversed; sometimes this direction is more convenient since it corresponds more closely to function application.<br />
<br />
* <code>(>=>) :: Monad m => (a -> m b) -> (b -> m c) -> a -> m c</code> is sort of like function composition, but with an extra <code>m</code> on the result type of each function, and the arguments swapped. We’ll have more to say about this operation later. There is also a flipped variant, <code>(<=<)</code>.<br />
<br />
* The <code>guard</code> function is for use with instances of <code>MonadPlus</code>, which is discussed at the end of the [[#Monoid|<code>Monoid</code> section]].<br />
<br />
Many of these functions also have “underscored” variants, such as <code>sequence_</code> and <code>mapM_</code>; these variants throw away the results of the computations passed to them as arguments, using them only for their side effects.<br />
<br />
Other monadic functions which are occasionally useful include <code>filterM</code>, <code>zipWithM</code>, <code>foldM</code>, and <code>forever</code>.<br />
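<br />
A few of these combinators in action (a sketch; <code>halve</code> and <code>quarter</code> are hypothetical helpers of our own):<br />
<br />
<haskell><br />
import Control.Monad (replicateM, (>=>))<br />
<br />
-- replicateM in the list monad enumerates all sequences of a given length:<br />
bits :: [[Int]]<br />
bits = replicateM 2 [0,1]   -- [[0,0],[0,1],[1,0],[1,1]]<br />
<br />
-- (>=>) composes effectful functions left to right:<br />
halve :: Int -> Maybe Int<br />
halve n = if even n then Just (n `div` 2) else Nothing<br />
<br />
quarter :: Int -> Maybe Int<br />
quarter = halve >=> halve   -- quarter 12 == Just 3<br />
</haskell><br />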
<br />
==Laws==<br />
<br />
There are several laws that instances of <code>Monad</code> should satisfy (see also the [[Monad laws]] wiki page). The standard presentation is:<br />
<br />
<haskell><br />
return a >>= k = k a<br />
m >>= return = m<br />
m >>= (\x -> k x >>= h) = (m >>= k) >>= h<br />
<br />
fmap f xs = xs >>= return . f = liftM f xs<br />
</haskell><br />
<br />
The first and second laws express the fact that <code>return</code> behaves nicely: if we inject a value <code>a</code> into a monadic context with <code>return</code>, and then bind to <code>k</code>, it is the same as just applying <code>k</code> to <code>a</code> in the first place; if we bind a computation <code>m</code> to <code>return</code>, nothing changes. The third law essentially says that <code>(>>=)</code> is associative, sort of. The last law ensures that <code>fmap</code> and <code>liftM</code> are the same for types which are instances of both <code>Functor</code> and <code>Monad</code>—which, as already noted, should be every instance of <code>Monad</code>.<br />
<br />
{{note|I like to pronounce this operator “fish”.}}<br />
<br />
However, the presentation of the above laws, especially the third, is marred by the asymmetry of <code>(>>=)</code>. It’s hard to look at the laws and see what they’re really saying. I prefer a much more elegant version of the laws, which is formulated in terms of <code>(>=>)</code> {{noteref}}. Recall that <code>(>=>)</code> “composes” two functions of type <code>a -> m b</code> and <code>b -> m c</code>. You can think of something of type <code>a -> m b</code> (roughly) as a function from <code>a</code> to <code>b</code> which may also have some sort of effect in the context corresponding to <code>m</code>. <code>(>=>)</code> lets us compose these “effectful functions”, and we would like to know what properties <code>(>=>)</code> has. The monad laws reformulated in terms of <code>(>=>)</code> are:<br />
<br />
<haskell><br />
return >=> g = g<br />
g >=> return = g<br />
(g >=> h) >=> k = g >=> (h >=> k)<br />
</haskell><br />
<br />
{{note|As fans of category theory will note, these laws say precisely that functions of type <code>a -> m b</code> are the arrows of a category with <code>(>{{=}}>)</code> as composition! Indeed, this is known as the ''Kleisli category'' of the monad <code>m</code>. It will come up again when we discuss <code>Arrow</code>s.}}<br />
<br />
Ah, much better! The laws simply state that <code>return</code> is the identity of <code>(>=>)</code>, and that <code>(>=>)</code> is associative {{noteref}}.<br />
<br />
There is also a formulation of the monad laws in terms of <code>fmap</code>, <code>return</code>, and <code>join</code>; for a discussion of this formulation, see the Haskell [http://en.wikibooks.org/wiki/Haskell/Category_theory wikibook page on category theory].<br />
<br />
{{Exercises|<br />
# Given the definition <code>g >{{=}}> h {{=}} \x -> g x >>{{=}} h</code>, prove the equivalence of the above laws and the usual monad laws.<br />
}}<br />
<br />
==<code>do</code> notation==<br />
<br />
Haskell’s special <code>do</code> notation supports an “imperative style” of programming by providing syntactic sugar for chains of monadic expressions. The genesis of the notation lies in realizing that something like <code>a >>= \x -> b >> c >>= \y -> d </code> can be more readably written by putting successive computations on separate lines:<br />
<br />
<haskell><br />
a >>= \x -><br />
b >><br />
c >>= \y -><br />
d<br />
</haskell><br />
<br />
This emphasizes that the overall computation consists of four computations <code>a</code>, <code>b</code>, <code>c</code>, and <code>d</code>, and that <code>x</code> is bound to the result of <code>a</code>, and <code>y</code> is bound to the result of <code>c</code> (<code>b</code>, <code>c</code>, and <code>d</code> are allowed to refer to <code>x</code>, and <code>d</code> is allowed to refer to <code>y</code> as well). From here it is not hard to imagine a nicer notation:<br />
<br />
<haskell><br />
do { x <- a<br />
; b<br />
; y <- c<br />
; d<br />
}<br />
</haskell><br />
<br />
(The curly braces and semicolons may optionally be omitted; the Haskell parser uses layout to determine where they should be inserted.) This discussion should make clear that <code>do</code> notation is just syntactic sugar. In fact, <code>do</code> blocks are recursively translated into monad operations (almost) like this:<br />
<br />
<pre><br />
do e → e<br />
do { e; stmts } → e >> do { stmts }<br />
do { v <- e; stmts } → e >>= \v -> do { stmts }<br />
do { let decls; stmts} → let decls in do { stmts }<br />
</pre><br />
<br />
This is not quite the whole story, since <code>v</code> might be a pattern instead of a variable. For example, one can write<br />
<br />
<haskell><br />
do (x:xs) <- foo<br />
bar x<br />
</haskell><br />
<br />
but what happens if <code>foo</code> produces an empty list? Well, remember that ugly <code>fail</code> function in the <code>Monad</code> type class declaration? That’s what happens. See [http://www.haskell.org/onlinereport/exps.html#sect3.14 section 3.14 of the Haskell Report] for the full details. See also the discussion of <code>MonadPlus</code> and <code>MonadZero</code> in the [[#Other monoidal classes: Alternative, MonadPlus, ArrowPlus|section on other monoidal classes]].<br />
<br />
A final note on intuition: <code>do</code> notation plays very strongly to the “computational context” point of view rather than the “container” point of view, since the binding notation <code>x <- m</code> is suggestive of “extracting” a single <code>x</code> from <code>m</code> and doing something with it. But <code>m</code> may represent some sort of a container, such as a list or a tree; the meaning of <code>x <- m</code> is entirely dependent on the implementation of <code>(>>=)</code>. For example, if <code>m</code> is a list, <code>x <- m</code> actually means that <code>x</code> will take on each value from the list in turn.<br />
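For instance (a small example not in the original text), in the list monad a <code>do</code> block reads as nested iteration:<br />
<br />
```haskell
-- Each (x <- m) draws every element of the list m in turn,
-- so this enumerates all pairs:
pairs :: [(Int, Char)]
pairs = do
  x <- [1, 2]
  y <- "ab"
  return (x, y)
```
<br />
Evaluating <code>pairs</code> gives <code>[(1,'a'),(1,'b'),(2,'a'),(2,'b')]</code>.<br />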
<br />
==Further reading==<br />
<br />
Philip Wadler was the first to propose using monads to structure functional programs. [http://homepages.inf.ed.ac.uk/wadler/topics/monads.html His paper] is still a readable introduction to the subject.<br />
<br />
{{note|1=<br />
[[All About Monads]],<br />
[http://www.haskell.org/haskellwiki/Monads_as_Containers Monads as containers],<br />
[http://en.wikibooks.org/w/index.php?title=Haskell/Understanding_monads Understanding monads],<br />
[[The Monadic Way]],<br />
[http://blog.sigfpe.com/2006/08/you-could-have-invented-monads-and.html You Could Have Invented Monads! (And Maybe You Already Have.)],<br />
[http://www.haskell.org/pipermail/haskell-cafe/2006-November/019190.html there’s a monster in my Haskell!],<br />
[http://kawagner.blogspot.com/2007/02/understanding-monads-for-real.html Understanding Monads. For real.],<br />
[http://www.randomhacks.net/articles/2007/03/12/monads-in-15-minutes Monads in 15 minutes: Backtracking and Maybe],<br />
[http://www.haskell.org/haskellwiki/Monads_as_computation Monads as computation],<br />
[http://metafoo.co.uk/practical-monads.txt Practical Monads]}}<br />
<br />
There are, of course, numerous monad tutorials of varying quality {{noteref}}.<br />
<br />
A few of the best include Cale Gibbard’s [http://www.haskell.org/haskellwiki/Monads_as_Containers Monads as containers] and [http://www.haskell.org/haskellwiki/Monads_as_computation Monads as computation]; Jeff Newbern’s [[All About Monads]], a comprehensive guide with lots of examples; and Dan Piponi’s [http://blog.sigfpe.com/2006/08/you-could-have-invented-monads-and.html You Could Have Invented Monads!], which features great exercises. If you just want to know how to use <code>IO</code>, you could consult the [[Introduction to IO]]. Even this is just a sampling; the [[monad tutorials timeline]] is a more complete list. (All these monad tutorials have prompted parodies like [http://koweycode.blogspot.com/2007/01/think-of-monad.html think of a monad ...] as well as other kinds of backlash like [http://ahamsandwich.wordpress.com/2007/07/26/monads-and-why-monad-tutorials-are-all-awful/ Monads! (and Why Monad Tutorials Are All Awful)] or [http://byorgey.wordpress.com/2009/01/12/abstraction-intuition-and-the-monad-tutorial-fallacy/ Abstraction, intuition, and the “monad tutorial fallacy”].)<br />
<br />
Other good monad references which are not necessarily tutorials include [http://members.chello.nl/hjgtuyl/tourdemonad.html Henk-Jan van Tuyl’s tour] of the functions in <code>Control.Monad</code>, Dan Piponi’s [http://blog.sigfpe.com/2006/10/monads-field-guide.html field guide], Tim Newsham’s [http://www.thenewsh.com/~newsham/haskell/monad.html What’s a Monad?], and Chris Smith's excellent article [http://cdsmith.wordpress.com/2012/04/18/why-do-monads-matter/ Why Do Monads Matter?]. There are also many blog posts which have been written on various aspects of monads; a collection of links can be found under [[Blog articles/Monads]].<br />
<br />
For help constructing monads from scratch, and for obtaining a "deep embedding" of monad operations suitable for use in, say, compiling a domain-specific language, see [http://projects.haskell.org/operational Apfelmus's operational package].<br />
<br />
One of the quirks of the <code>Monad</code> class and the Haskell type system is that it is not possible to straightforwardly declare <code>Monad</code> instances for types which require a class constraint on their data, even if they are monads from a mathematical point of view. For example, <code>Data.Set</code> requires an <code>Ord</code> constraint on its data, so it cannot be easily made an instance of <code>Monad</code>. A solution to this problem was [http://www.randomhacks.net/articles/2007/03/15/data-set-monad-haskell-macros first described by Eric Kidd], and later made into a [http://hackage.haskell.org/cgi-bin/hackage-scripts/package/rmonad library named rmonad] by Ganesh Sittampalam and Peter Gavin.<br />
<br />
There are many good reasons for eschewing <code>do</code> notation; some have gone so far as to [[Do_notation_considered_harmful|consider it harmful]].<br />
<br />
Monads can be generalized in various ways; for an exposition of one possibility, see Robert Atkey’s paper on [http://homepages.inf.ed.ac.uk/ratkey/paramnotions-jfp.pdf parameterized monads], or Dan Piponi’s [http://blog.sigfpe.com/2009/02/beyond-monads.html Beyond Monads].<br />
<br />
For the categorically inclined, monads can be viewed as monoids ([http://blog.sigfpe.com/2008/11/from-monoids-to-monads.html From Monoids to Monads]) and also as closure operators [http://blog.plover.com/math/monad-closure.html Triples and Closure]. Derek Elkins’s article in [http://www.haskell.org/wikiupload/8/85/TMR-Issue13.pdf issue 13 of the Monad.Reader] contains an exposition of the category-theoretic underpinnings of some of the standard <code>Monad</code> instances, such as <code>State</code> and <code>Cont</code>. Jonathan Hill and Keith Clarke have [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.53.6497 an early paper explaining the connection between monads as they arise in category theory and as used in functional programming]. There is also a [http://okmij.org/ftp/Computation/IO-monad-history.html web page by Oleg Kiselyov] explaining the history of the IO monad.<br />
<br />
Links to many more research papers related to monads can be found under [[Research papers/Monads and arrows]].<br />
<br />
=Monad transformers=<br />
<br />
One would often like to be able to combine two monads into one: for example, to have stateful, nondeterministic computations (<code>State</code> + <code>[]</code>), or computations which may fail and can consult a read-only environment (<code>Maybe</code> + <code>Reader</code>), and so on. Unfortunately, monads do not compose as nicely as applicative functors (yet another reason to use <code>Applicative</code> if you don’t need the full power that <code>Monad</code> provides), but some monads can be combined in certain ways.<br />
<br />
==Standard monad transformers==<br />
<br />
The [http://hackage.haskell.org/package/transformers transformers] library provides a number of standard ''monad transformers''. Each monad transformer adds a particular capability/feature/effect to any existing monad.<br />
<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Identity.html <code>IdentityT</code>] is the identity transformer, which maps a monad to (something isomorphic to) itself. This may seem useless at first glance, but it is useful for the same reason that the <code>id</code> function is useful -- it can be passed as an argument to things which are parameterized over an arbitrary monad transformer, when you do not actually want any extra capabilities.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-State.html <code>StateT</code>] adds a read-write state.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Reader.html <code>ReaderT</code>] adds a read-only environment.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Writer.html <code>WriterT</code>] adds a write-only log.<br />
* [http://hackage.haskell.org/packages/archive/transformers/0.2.2.0/doc/html/Control-Monad-Trans-RWS.html <code>RWST</code>] conveniently combines <code>ReaderT</code>, <code>WriterT</code>, and <code>StateT</code> into one.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Maybe.html <code>MaybeT</code>] adds the possibility of failure.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Error.html <code>ErrorT</code>] adds the possibility of failure with an arbitrary type to represent errors.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-List.html <code>ListT</code>] adds non-determinism (however, see the discussion of <code>ListT</code> below).<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Cont.html <code>ContT</code>] adds continuation handling.<br />
<br />
For example, <code>StateT s Maybe</code> is an instance of <code>Monad</code>; computations of type <code>StateT s Maybe a</code> may fail, and have access to a mutable state of type <code>s</code>. Monad transformers can be multiply stacked. One thing to keep in mind while using monad transformers is that the order of composition matters. For example, when a <code>StateT s Maybe a</code> computation fails, the state ceases being updated (indeed, it simply disappears); on the other hand, the state of a <code>MaybeT (State s) a</code> computation may continue to be modified even after the computation has "failed". This may seem backwards, but it is correct. Monad transformers build composite monads “inside out”; <code>MaybeT (State s) a</code> is isomorphic to <code>s -> (Maybe a, s)</code>. (Lambdabot has an indispensable <code>@unmtl</code> command which you can use to “unpack” a monad transformer stack in this way.)<br />
Intuitively, the monads become "more fundamental" the further inside the stack you get, and the effects of inner monads "have precedence" over the effects of outer ones. Of course, this is just handwaving, and if you are unsure of the proper order for some monads you wish to combine, there is no substitute for using <code>@unmtl</code> or simply trying out the various options.<br />
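A minimal sketch of this difference in ordering, using the <code>transformers</code> package (the names <code>outer</code> and <code>inner</code> are illustrative):<br />
<br />
```haskell
import Control.Monad.Trans.Class (lift)
import Control.Monad.Trans.Maybe (MaybeT (..), runMaybeT)
import Control.Monad.Trans.State (State, StateT, modify, runState, runStateT)

-- StateT Int Maybe: a failure discards the state entirely.
outer :: StateT Int Maybe ()
outer = modify (+1) >> lift Nothing

-- MaybeT (State Int): the state update survives the "failure".
inner :: MaybeT (State Int) ()
inner = lift (modify (+1)) >> MaybeT (return Nothing)
```
<br />
Running these shows the asymmetry: <code>runStateT outer 0</code> is <code>Nothing</code> (the incremented state is lost), whereas <code>runState (runMaybeT inner) 0</code> is <code>(Nothing, 1)</code>.<br />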
<br />
==Definition and laws==<br />
<br />
All monad transformers should implement the <code>MonadTrans</code> type class, defined in <code>Control.Monad.Trans.Class</code>:<br />
<br />
<haskell><br />
class MonadTrans t where<br />
  lift :: Monad m => m a -> t m a<br />
</haskell><br />
<br />
It allows arbitrary computations in the base monad <code>m</code> to be “lifted” into computations in the transformed monad <code>t m</code>. (Note that type application associates to the left, just like function application, so <code>t m a = (t m) a</code>.)<br />
<br />
<code>lift</code> must satisfy the laws<br />
<haskell><br />
lift . return = return<br />
lift (m >>= f) = lift m >>= (lift . f)<br />
</haskell><br />
which intuitively state that <code>lift</code> transforms <code>m a</code> computations into <code>t m a</code> computations in a "sensible" way, which sends the <code>return</code> and <code>(>>=)</code> of <code>m</code> to the <code>return</code> and <code>(>>=)</code> of <code>t m</code>.<br />
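As a small illustration (a sketch assuming the <code>transformers</code> package), <code>lift</code> embeds a base-monad action, here a <code>Maybe</code> guard, into the transformed monad:<br />
<br />
```haskell
import Control.Monad.Trans.Class (lift)
import Control.Monad.Trans.State (StateT, execStateT, get, modify)

-- Increment the counter, but only while it is below 10; the guard
-- runs in the base monad Maybe and is lifted into StateT Int Maybe.
step :: StateT Int Maybe ()
step = do
  s <- get
  lift (if s < 10 then Just () else Nothing)
  modify (+1)
```
<br />
Thus <code>execStateT step 5</code> gives <code>Just 6</code>, while <code>execStateT step 20</code> gives <code>Nothing</code>.<br />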
<br />
{{Exercises|<br />
# What is the kind of <code>t</code> in the declaration of <code>MonadTrans</code>?<br />
}}<br />
<br />
==Transformer type classes and "capability" style==<br />
<br />
{{note|The only problem with this scheme is the quadratic number of instances required as the number of standard monad transformers grows—but as the current set of standard monad transformers seems adequate for most common use cases, this may not be that big of a deal.}}<br />
<br />
There are also type classes (provided by the [http://hackage.haskell.org/package/mtl <code>mtl</code> package]) for the operations of each transformer. For example, the <code>MonadState</code> type class provides the state-specific methods <code>get</code> and <code>put</code>, allowing you to conveniently use these methods not only with <code>State</code>, but with any monad which is an instance of <code>MonadState</code>—including <code>MaybeT (State s)</code>, <code>StateT s (ReaderT r IO)</code>, and so on. Similar type classes exist for <code>Reader</code>, <code>Writer</code>, <code>Cont</code>, <code>IO</code>, and others {{noteref}}.<br />
<br />
These type classes serve two purposes. First, they get rid of (most of) the need for explicitly using <code>lift</code>, giving a type-directed way to automatically determine the right number of calls to <code>lift</code>. Simply writing <code>put</code> will be automatically translated into <code>lift . put</code>, <code>lift . lift . put</code>, or something similar depending on what concrete monad stack you are using.<br />
<br />
Second, they give you more flexibility to switch between different concrete monad stacks. For example, if you are writing a state-based algorithm, don't write<br />
<haskell><br />
foo :: State Int Char<br />
foo = modify (*2) >> return 'x'<br />
</haskell><br />
but rather<br />
<haskell><br />
foo :: MonadState Int m => m Char<br />
foo = modify (*2) >> return 'x'<br />
</haskell><br />
Now, if somewhere down the line you realize you need to introduce the possibility of failure, you might switch from <code>State Int</code> to <code>MaybeT (State Int)</code>. The type of the first version of <code>foo</code> would need to be modified to reflect this change, but the second version of <code>foo</code> can still be used as-is.<br />
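To see this flexibility concretely (a sketch relying on <code>mtl</code>'s instances; the names <code>plain</code> and <code>withFailure</code> are made up), the polymorphic <code>foo</code> runs unchanged at either stack:<br />
<br />
```haskell
import Control.Monad.State (MonadState, State, modify, runState)
import Control.Monad.Trans.Maybe (MaybeT, runMaybeT)

foo :: MonadState Int m => m Char
foo = modify (*2) >> return 'x'

-- At the plain State stack:
plain :: (Char, Int)
plain = runState foo 5

-- At MaybeT (State Int), with no change to foo:
withFailure :: (Maybe Char, Int)
withFailure = runState (runMaybeT foo) 5
```
<br />
Here <code>plain</code> is <code>('x', 10)</code> and <code>withFailure</code> is <code>(Just 'x', 10)</code>; the <code>MonadState</code> constraint, not a concrete stack, determines where <code>foo</code> can run.<br />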
<br />
However, this sort of "capability-based" style (<i>e.g.</i> specifying that <code>foo</code> works for any monad with the "state capability") quickly runs into problems when you try to naively scale it up: for example, what if you need to maintain two independent states? A framework for solving this and related problems is described by Schrijvers and Olivera ([http://users.ugent.be/~tschrijv/Research/papers/icfp2011.pdf Monads, zippers and views: virtualizing the monad stack, ICFP 2011]) and is implemented in the [http://hackage.haskell.org/package/Monatron <code>Monatron</code> package].<br />
<br />
==Composing monads==<br />
<br />
Is the composition of two monads always a monad? As hinted previously, the answer is no.<br />
<br />
Since <code>Applicative</code> functors are closed under composition, the problem must lie with <code>join</code>. Indeed, suppose <code>m</code> and <code>n</code> are arbitrary monads; to make a monad out of their composition we would need to be able to implement<br />
<haskell><br />
join :: m (n (m (n a))) -> m (n a)<br />
</haskell><br />
but it is not clear how this could be done in general. The <code>join</code> method for <code>m</code> is no help, because the two occurrences of <code>m</code> are not next to each other (and likewise for <code>n</code>).<br />
<br />
However, one situation in which it can be done is if <code>n</code> ''distributes'' over <code>m</code>, that is, if there is a function<br />
<haskell><br />
distrib :: n (m a) -> m (n a)<br />
</haskell><br />
satisfying certain laws. See Jones and Duponcheel ([http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.42.2605 Composing Monads]); see also the [[#Traversable|section on Traversable]].<br />
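As a concrete distributive law (a sketch; the name <code>distribMaybe</code> is made up), <code>Maybe</code> distributes over any monad, which is essentially why <code>MaybeT m</code> is a monad:<br />
<br />
```haskell
-- Pull a Maybe out through any monad m; this is sequenceA
-- specialized to Maybe as the outer functor.
distribMaybe :: Monad m => Maybe (m a) -> m (Maybe a)
distribMaybe Nothing   = return Nothing
distribMaybe (Just ma) = fmap Just ma
```
<br />
For example, at the list monad, <code>distribMaybe (Just [1,2])</code> is <code>[Just 1, Just 2]</code> and <code>distribMaybe Nothing</code> is <code>[Nothing]</code>.<br />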
<br />
For a much more in-depth discussion and analysis of the failure of monads to be closed under composition, see [http://stackoverflow.com/questions/13034229/concrete-example-showing-that-monads-are-not-closed-under-composition-with-proo?lq=1 this question on StackOverflow].<br />
<br />
{{Exercises|<br />
* Implement <code>join :: M (N (M (N a))) -> M (N a)</code>, given <code>distrib :: N (M a) -> M (N a)</code> and assuming <code>M</code> and <code>N</code> are instances of <code>Monad</code>.<br />
}}<br />
<br />
==Further reading==<br />
<br />
Much of the monad transformer library (originally [http://hackage.haskell.org/package/mtl <code>mtl</code>], now split between <code>mtl</code> and [http://hackage.haskell.org/package/transformers <code>transformers</code>]), including the <code>Reader</code>, <code>Writer</code>, <code>State</code>, and other monads, as well as the monad transformer framework itself, was inspired by Mark Jones’s classic paper [http://web.cecs.pdx.edu/~mpj/pubs/springschool.html Functional Programming with Overloading and Higher-Order Polymorphism]. It’s still very much worth a read—and highly readable—after almost fifteen years.<br />
<br />
See [http://article.gmane.org/gmane.comp.lang.haskell.libraries/17139 Edward Kmett's mailing list message] for a description of the history and relationships among monad transformer packages (<code>mtl</code>, <code>transformers</code>, <code>monads-fd</code>, <code>monads-tf</code>).<br />
<br />
There are two excellent references on monad transformers. Martin Grabmüller’s [http://www.grabmueller.de/martin/www/pub/Transformers.en.html Monad Transformers Step by Step] is a thorough description, with running examples, of how to use monad transformers to elegantly build up computations with various effects. [http://cale.yi.org/index.php/How_To_Use_Monad_Transformers Cale Gibbard’s article] on how to use monad transformers is more practical, describing how to structure code using monad transformers to make writing it as painless as possible. Another good starting place for learning about monad transformers is a [http://blog.sigfpe.com/2006/05/grok-haskell-monad-transformers.html blog post by Dan Piponi].<br />
<br />
The <code>ListT</code> transformer from the <code>transformers</code> package comes with the caveat that <code>ListT m</code> is only a monad when <code>m</code> is ''commutative'', that is, when <code>ma >>= \a -> mb >>= \b -> foo</code> is equivalent to <code>mb >>= \b -> ma >>= \a -> foo</code> (i.e. the order of <code>m</code>'s effects does not matter). For one explanation why, see Dan Piponi's blog post [http://blog.sigfpe.com/2006/11/why-isnt-listt-monad.html "Why isn't <code><nowiki>ListT []</nowiki></code> a monad"]. For more examples, as well as a design for a version of <code>ListT</code> which does not have this problem, see [http://www.haskell.org/haskellwiki/ListT_done_right <code>ListT</code> done right].<br />
<br />
There is an alternative way to compose monads, using coproducts, as described by [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.8.3581 Lüth and Ghani]. This method is interesting but has not (yet?) seen widespread use. For a more recent alternative, see Kiselyov et al's [http://okmij.org/ftp/Haskell/extensible/exteff.pdf Extensible Effects: An Alternative to Monad Transformers].<br />
<br />
=MonadFix=<br />
<br />
''Note: <code>MonadFix</code> is included here for completeness (and because it is interesting) but seems not to be used much. Skipping this section on a first read-through is perfectly OK (and perhaps even recommended).''<br />
<br />
==<code>mdo</code>/<code>do rec</code> notation==<br />
<br />
{{note|In GHC 7.6, the flag has been changed to <code>-XRecursiveDo</code>.}}<br />
The <code>MonadFix</code> class describes monads which support the special fixpoint operation <code>mfix :: (a -> m a) -> m a</code>, which allows the output of monadic computations to be defined via (effectful) recursion. This is [http://www.haskell.org/ghc/docs/latest/html/users_guide/syntax-extns.html#recursive-do-notation supported in GHC] by a special “recursive do” notation, enabled by the <code>-XDoRec</code> flag{{noteref}}. Within a <code>do</code> block, one may have a nested <code>rec</code> block, like so:<br />
<haskell><br />
do { x <- foo<br />
; rec { y <- baz<br />
; z <- bar<br />
; bob<br />
}<br />
; w <- frob<br />
}<br />
</haskell><br />
Normally (if we had <code>do</code> in place of <code>rec</code> in the above example), <code>y</code> would be in scope in <code>bar</code> and <code>bob</code> but not in <code>baz</code>, and <code>z</code> would be in scope only in <code>bob</code>. With the <code>rec</code>, however, <code>y</code> and <code>z</code> are both in scope in all three of <code>baz</code>, <code>bar</code>, and <code>bob</code>. A <code>rec</code> block is analogous to a <code>let</code> block such as<br />
<haskell><br />
let { y = baz<br />
; z = bar<br />
}<br />
in bob<br />
</haskell><br />
because, in Haskell, every variable bound in a <code>let</code>-block is in scope throughout the entire block. (From this point of view, Haskell's normal <code>do</code> blocks are analogous to Scheme's <code>let*</code> construct.)<br />
<br />
What could such a feature be used for? One of the motivating examples given in the original paper describing <code>MonadFix</code> (see below) is encoding circuit descriptions. A line in a <code>do</code>-block such as <br />
<haskell><br />
x <- gate y z<br />
</haskell><br />
describes a gate whose input wires are labeled <code>y</code> and <code>z</code> and whose output wire is labeled <code>x</code>. Many (most?) useful circuits, however, involve some sort of feedback loop, making them impossible to write in a normal <code>do</code>-block (since some wire would have to be mentioned as an input ''before'' being listed as an output). Using a <code>rec</code> block solves this problem.<br />
<br />
==Examples and intuition==<br />
<br />
Of course, not every monad supports such recursive binding. However, as mentioned above, it suffices to have an implementation of <code>mfix :: (a -> m a) -> m a</code>, satisfying a few laws. Let's try implementing <code>mfix</code> for the <code>Maybe</code> monad. That is, we want to implement a function<br />
<haskell><br />
maybeFix :: (a -> Maybe a) -> Maybe a<br />
</haskell><br />
{{note|Actually, <code>fix</code> is implemented slightly differently for efficiency reasons; but the given definition is equivalent and simpler for the present purpose.}}<br />
Let's think for a moment about the implementation {{noteref}} of the non-monadic <code>fix :: (a -> a) -> a</code>:<br />
<haskell><br />
fix f = f (fix f)<br />
</haskell><br />
Inspired by <code>fix</code>, our first attempt at implementing <code>maybeFix</code> might be something like<br />
<haskell><br />
maybeFix :: (a -> Maybe a) -> Maybe a<br />
maybeFix f = maybeFix f >>= f<br />
</haskell><br />
This has the right type. However, something seems wrong: there is nothing in particular here about <code>Maybe</code>; <code>maybeFix</code> actually has the more general type <code>Monad m => (a -> m a) -> m a</code>. But didn't we just say that not all monads support <code>mfix</code>?<br />
<br />
The answer is that although this implementation of <code>maybeFix</code> has the right type, it does ''not'' have the intended semantics. If we think about how <code>(>>=)</code> works for the <code>Maybe</code> monad (by pattern-matching on its first argument to see whether it is <code>Nothing</code> or <code>Just</code>) we can see that this definition of <code>maybeFix</code> is completely useless: it will just recurse infinitely, trying to decide whether it is going to return <code>Nothing</code> or <code>Just</code>, without ever even so much as a glance in the direction of <code>f</code>.<br />
<br />
The trick is to simply ''assume'' that <code>maybeFix</code> will return <code>Just</code>, and get on with life!<br />
<haskell><br />
maybeFix :: (a -> Maybe a) -> Maybe a<br />
maybeFix f = ma<br />
  where ma = f (fromJust ma)<br />
</haskell><br />
This says that the result of <code>maybeFix</code> is <code>ma</code>, and assuming that <code>ma = Just x</code>, it is defined (recursively) to be equal to <code>f x</code>.<br />
<br />
Why is this OK? Isn't <code>fromJust</code> almost as bad as <code>unsafePerformIO</code>? Well, usually, yes. This is just about the only situation in which it is justified! The interesting thing to note is that <code>maybeFix</code> ''will never crash'' -- although it may, of course, fail to terminate. The only way we could get a crash is if we try to evaluate <code>fromJust ma</code> when we know that <code>ma = Nothing</code>. But how could we know <code>ma = Nothing</code>? Since <code>ma</code> is defined as <code>f (fromJust ma)</code>, it must be that this expression has already been evaluated to <code>Nothing</code> -- in which case there is no reason for us to be evaluating <code>fromJust ma</code> in the first place! <br />
<br />
To see this from another point of view, we can consider three possibilities. First, if <code>f</code> outputs <code>Nothing</code> without looking at its argument, then <code>maybeFix f</code> clearly returns <code>Nothing</code>. Second, if <code>f</code> always outputs <code>Just x</code>, where <code>x</code> depends on its argument, then the recursion can proceed usefully: <code>fromJust ma</code> will be able to evaluate to <code>x</code>, thus feeding <code>f</code>'s output back to it as input. Third, if <code>f</code> tries to use its argument to decide whether to output <code>Just</code> or <code>Nothing</code>, then <code>maybeFix f</code> will not terminate: evaluating <code>f</code>'s argument requires evaluating <code>ma</code> to see whether it is <code>Just</code>, which requires evaluating <code>f (fromJust ma)</code>, which requires evaluating <code>ma</code>, ... and so on.<br />
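To see the second case in action (a small demo, with <code>maybeFix</code> repeated from above so the block is self-contained):<br />
<br />
```haskell
import Data.Maybe (fromJust)

maybeFix :: (a -> Maybe a) -> Maybe a
maybeFix f = ma
  where ma = f (fromJust ma)

-- f always outputs Just, so the recursion proceeds usefully,
-- producing an infinite list of ones inside Just:
ones :: Maybe [Int]
ones = maybeFix (\xs -> Just (1 : xs))
```
<br />
Here <code>fmap (take 3) ones</code> yields <code>Just [1,1,1]</code>; a function that inspected its argument before choosing <code>Just</code> or <code>Nothing</code> (the third case) would instead diverge.<br />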
<br />
There are also instances of <code>MonadFix</code> for lists (which works analogously to the instance for <code>Maybe</code>), for <code>ST</code>, and for <code>IO</code>. The [http://hackage.haskell.org/packages/archive/base/latest/doc/html/src/System-IO.html#fixIO instance for <code>IO</code>] is particularly amusing: it creates a new (empty) <code>MVar</code>, immediately reads its contents using <code>unsafeInterleaveIO</code> (which delays the actual reading lazily until the value is needed), uses the contents of the <code>MVar</code> to compute a new value, which it then writes back into the <code>MVar</code>. It almost seems, spookily, that <code>mfix</code> is sending a value back in time to itself through the <code>MVar</code> -- though of course what is really going on is that the reading is delayed just long enough (via <code>unsafeInterleaveIO</code>) to get the process bootstrapped.<br />
<br />
{{Exercises|<br />
* Implement a <code>MonadFix</code> instance for <code>[]</code>.<br />
}}<br />
<br />
==GHC 7.6 changes==<br />
<br />
GHC 7.6 reinstated the old <code>mdo</code> syntax, so the example at the start of this section can be written<br />
<br />
<haskell><br />
mdo { x <- foo<br />
; y <- baz<br />
; z <- bar<br />
; bob<br />
; w <- frob<br />
}<br />
</haskell><br />
<br />
which will be translated into the original example (assuming that, say, <code>bar</code> and <code>bob</code> refer to <code>y</code>). The difference is that <code>mdo</code> will analyze the code in order to find minimal recursive blocks, which will be placed in <code>rec</code> blocks, whereas <code>rec</code> blocks desugar directly into calls to <code>mfix</code> without any further analysis.<br />
<br />
==Further reading==<br />
<br />
For more information (such as the precise desugaring rules for <code>rec</code> blocks), see Levent Erkök and John Launchbury's 2002 Haskell workshop paper, [http://sites.google.com/site/leventerkok/recdo.pdf?attredirects=0 A Recursive do for Haskell], or for full details, Levent Erkök’s thesis, [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.15.1543&rep=rep1&type=pdf Value Recursion in Monadic Computations]. (Note, while reading, that <code>MonadFix</code> used to be called <code>MonadRec</code>.) You can also read the [http://www.haskell.org/ghc/docs/latest/html/users_guide/syntax-extns.html#recursive-do-notation GHC user manual section on recursive do-notation].<br />
<br />
=Semigroup=<br />
<br />
A semigroup is a set <math>S\ </math> together with a binary operation <math>\oplus\ </math> which<br />
combines elements from <math>S\ </math>. The <math>\oplus\ </math> operator is required to be associative<br />
(that is, <math>(a \oplus b) \oplus c = a \oplus (b \oplus c)\ </math>, for any<br />
<math>a,b,c\ </math> which are elements of <math>S\ </math>).<br />
<br />
For example, the natural numbers under addition form a semigroup: the sum of any two natural numbers is a natural number, and <math>(a+b)+c = a+(b+c)\ </math> for any natural numbers <math>a\ </math>, <math>b\ </math>, and <math>c\,\ </math>. The integers under multiplication also form a semigroup, as do the integers (or rationals, or reals) under <math>\max\ </math> or <math>\min\ </math>, Boolean values under conjunction and disjunction, lists under concatenation, functions from a set to itself under composition ... Semigroups show up all over the place, once you know to look for them.<br />
<br />
==Definition==<br />
<br />
Semigroups are not (yet?) defined in the base package, but the {{HackagePackage|id=semigroups}} package provides a standard definition.<br />
<br />
The definition of the <code>Semigroup</code> type class ([http://hackage.haskell.org/packages/archive/semigroups/latest/doc/html/Data-Semigroup.html haddock]) is as follows:<br />
<br />
<haskell><br />
class Semigroup a where<br />
(<>) :: a -> a -> a<br />
<br />
sconcat :: NonEmpty a -> a<br />
  sconcat (a :| as) = go a as where<br />
go b (c:cs) = b <> go c cs<br />
go b [] = b<br />
<br />
times1p :: Whole n => n -> a -> a<br />
times1p = ...<br />
</haskell><br />
<br />
The really important method is <code>(<>)</code>, representing the associative binary operation. The other two methods have default implementations in terms of <code>(<>)</code>, and are included in the type class in case some instances can give more efficient implementations than the default. <code>sconcat</code> reduces a nonempty list using <code>(<>)</code>; <code>times1p n</code> is equivalent to (but more efficient than) <code>sconcat . replicate n</code>. See the [http://hackage.haskell.org/packages/archive/semigroups/latest/doc/html/Data-Semigroup.html haddock documentation] for more information on <code>sconcat</code> and <code>times1p</code>.<br />
<br />
==Laws==<br />
<br />
The only law is that <code>(<>)</code> must be associative:<br />
<br />
<haskell><br />
(x <> y) <> z = x <> (y <> z)<br />
</haskell><br />
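As an illustrative sketch (the newtype name <code>MinInt</code> is ours, chosen for the example), here is an instance built on <code>min</code>; since <code>min</code> is associative, the law holds:<br />
<br />
<haskell>
import Data.Semigroup (Semigroup(..))

newtype MinInt = MinInt Int deriving (Eq, Show)

instance Semigroup MinInt where
  MinInt a <> MinInt b = MinInt (min a b)

-- (x <> y) <> z == x <> (y <> z), because min is associative
main :: IO ()
main = print (MinInt 3 <> MinInt 1 <> MinInt 2)  -- MinInt 1
</haskell>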
<br />
=Monoid=<br />
<br />
Many semigroups have a special element <math>e</math> for which the binary operation <math>\oplus</math> is the identity, that is, <math>e \oplus x = x \oplus e = x</math> for every element <math>x</math>. Such a semigroup-with-identity-element is called a ''monoid''.<br />
<br />
==Definition==<br />
<br />
The definition of the <code>Monoid</code> type class (defined in<br />
<code>Data.Monoid</code>; [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Monoid.html haddock]) is:<br />
<br />
<haskell><br />
class Monoid a where<br />
mempty :: a<br />
mappend :: a -> a -> a<br />
<br />
mconcat :: [a] -> a<br />
mconcat = foldr mappend mempty<br />
</haskell><br />
<br />
The <code>mempty</code> value specifies the identity element of the monoid, and <code>mappend</code><br />
is the binary operation. The default definition for <code>mconcat</code><br />
“reduces” a list of elements by combining them all with <code>mappend</code>,<br />
using a right fold. It is only in the <code>Monoid</code> class so that specific<br />
instances have the option of providing an alternative, more efficient<br />
implementation; usually, you can safely ignore <code>mconcat</code> when creating<br />
a <code>Monoid</code> instance, since its default definition will work just fine.<br />
<br />
The <code>Monoid</code> methods are rather unfortunately named; they are inspired<br />
by the list instance of <code>Monoid</code>, where indeed <code>mempty = []</code> and <code>mappend = (++)</code>, but this is misleading since many<br />
monoids have little to do with appending (see these [http://thread.gmane.org/gmane.comp.lang.haskell.cafe/50590 Comments from OCaml Hacker Brian Hurt] on the Haskell-cafe mailing list). This was improved in GHC 7.4, where <code>(<>)</code> was added as an alias to <code>mappend</code>.<br />
<br />
==Laws==<br />
<br />
Of course, every <code>Monoid</code> instance should actually be a monoid in the<br />
mathematical sense, which implies these laws:<br />
<br />
<haskell><br />
mempty `mappend` x = x<br />
x `mappend` mempty = x<br />
(x `mappend` y) `mappend` z = x `mappend` (y `mappend` z)<br />
</haskell><br />
<br />
==Instances==<br />
<br />
There are quite a few interesting <code>Monoid</code> instances defined in <code>Data.Monoid</code>.<br />
<br />
<ul><br />
<li><code>[a]</code> is a <code>Monoid</code>, with <code>mempty = []</code> and <code>mappend = (++)</code>. It is not hard to check that <code>(x ++ y) ++ z = x ++ (y ++ z)</code> for any lists <code>x</code>, <code>y</code>, and <code>z</code>, and that the empty list is the identity: <code>[] ++ x = x ++ [] = x</code>.</li><br />
<br />
<li>As noted previously, we can make a monoid out of any numeric type under either addition or multiplication. However, since we can’t have two instances for the same type, <code>Data.Monoid</code> provides two <code>newtype</code> wrappers, <code>Sum</code> and <code>Product</code>, with appropriate <code>Monoid</code> instances.<br />
<br />
<haskell><br />
> getSum (mconcat . map Sum $ [1..5])<br />
15<br />
> getProduct (mconcat . map Product $ [1..5])<br />
120<br />
</haskell><br />
<br />
This example code is silly, of course; we could just write<br />
<code>sum [1..5]</code> and <code>product [1..5]</code>. Nevertheless, these instances are useful in more generalized settings, as we will see in the [[Foldable|section on <code>Foldable</code>]].</li><br />
<br />
<li><code>Any</code> and <code>All</code> are <code>newtype</code> wrappers providing <code>Monoid</code> instances for <code>Bool</code> (under disjunction and conjunction, respectively).</li><br />
<br />
<li> There are three instances for <code>Maybe</code>: a basic instance which lifts a <code>Monoid</code> instance for <code>a</code> to an instance for <code>Maybe a</code>, and two <code>newtype</code> wrappers <code>First</code> and <code>Last</code> for which <code>mappend</code> selects the first (respectively last) non-<code>Nothing</code> item.</li><br />
<br />
<li><code>Endo a</code> is a newtype wrapper for functions <code>a -> a</code>, which form a monoid under composition.</li><br />
<br />
<li>There are several ways to “lift” <code>Monoid</code> instances to instances with additional structure. We have already seen that an instance for <code>a</code> can be lifted to an instance for <code>Maybe a</code>. There are also tuple instances: if <code>a</code> and <code>b</code> are instances of <code>Monoid</code>, then so is <code>(a,b)</code>, using the monoid operations for <code>a</code> and <code>b</code> in the obvious pairwise manner. Finally, if <code>a</code> is a <code>Monoid</code>, then so is the function type <code>e -> a</code> for any <code>e</code>; in particular, <code>g `mappend` h</code> is the function which applies both <code>g</code> and <code>h</code> to its argument and then combines the results using the underlying <code>Monoid</code> instance for <code>a</code>. This can be quite useful and elegant (see [http://thread.gmane.org/gmane.comp.lang.haskell.cafe/52416 example]).</li><br />
<br />
<li>The type <code>Ordering = LT | EQ | GT</code> is a <code>Monoid</code>, defined in such a way that <code>mconcat (zipWith compare xs ys)</code> computes the lexicographic ordering of <code>xs</code> and <code>ys</code> (if <code>xs</code> and <code>ys</code> have the same length). In particular, <code>mempty = EQ</code>, and <code>mappend</code> evaluates to its leftmost non-<code>EQ</code> argument (or <code>EQ</code> if both arguments are <code>EQ</code>). This can be used together with the function instance of <code>Monoid</code> to do some clever things ([http://www.reddit.com/r/programming/comments/7cf4r/monoids_in_my_programming_language/c06adnx example]).</li><br />
<br />
<li>There are also <code>Monoid</code> instances for several standard data structures in the containers library ([http://hackage.haskell.org/packages/archive/containers/0.2.0.0/doc/html/index.html haddock]), including <code>Map</code>, <code>Set</code>, and <code>Sequence</code>.</li><br />
</ul><br />
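Several of these instances can be exercised in one short, standalone sketch using only <code>Data.Monoid</code>:<br />
<br />
<haskell>
import Data.Monoid

main :: IO ()
main = do
  -- numbers under addition, via the Sum wrapper
  print (getSum (foldMap Sum [1..5 :: Int]))                   -- 15
  -- Bool under disjunction, via Any
  print (getAny (foldMap (Any . even) [1,3,5 :: Int]))         -- False
  -- First keeps the leftmost non-Nothing value
  print (getFirst (First (Just 'a') <> First Nothing
                                    <> First (Just 'b')))      -- Just 'a'
  -- Endo composes functions: (+1) . (*2) applied to 5
  print (appEndo (Endo (+1) <> Endo (*2)) (5 :: Int))          -- 11
</haskell>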
<br />
<code>Monoid</code> is also used to enable several other type class instances.<br />
As noted previously, we can use <code>Monoid</code> to make <code>((,) e)</code> an instance of <code>Applicative</code>:<br />
<br />
<haskell><br />
instance Monoid e => Applicative ((,) e) where<br />
pure x = (mempty, x)<br />
(u, f) <*> (v, x) = (u `mappend` v, f x)<br />
</haskell><br />
<br />
<code>Monoid</code> can be similarly used to make <code>((,) e)</code> an instance of <code>Monad</code> as well; this is known as the ''writer monad''. As we’ve already seen, <code>Writer</code> and <code>WriterT</code> are a newtype wrapper and transformer for this monad, respectively.<br />
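For instance, a minimal sketch of the pair monad in action (the <code>Monad</code> instance for <code>((,) e)</code> ships with base 4.9 and later; on older GHCs, <code>Writer</code> provides the same behavior):<br />
<br />
<haskell>
-- the first component accumulates via the String monoid,
-- the second carries the computation's value
logged :: (String, Int)
logged = do
  x <- ("got 3; ", 3)
  y <- ("got 4; ", 4)
  return (x + y)

main :: IO ()
main = print logged  -- ("got 3; got 4; ",7)
</haskell>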
<br />
<code>Monoid</code> also plays a key role in the <code>Foldable</code> type class (see section [[#Foldable|Foldable]]).<br />
<br />
==Other monoidal classes: Alternative, MonadPlus, ArrowPlus==<br />
<br />
The <code>Alternative</code> type class ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Applicative.html#g:2 haddock])<br />
is for <code>Applicative</code> functors which also have<br />
a monoid structure:<br />
<br />
<haskell><br />
class Applicative f => Alternative f where<br />
empty :: f a<br />
(<|>) :: f a -> f a -> f a<br />
</haskell><br />
<br />
Of course, instances of <code>Alternative</code> should satisfy the monoid laws<br />
<br />
<haskell><br />
empty <|> x = x<br />
x <|> empty = x<br />
(x <|> y) <|> z = x <|> (y <|> z)<br />
</haskell><br />
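For example, with the <code>Maybe</code> and list instances of <code>Alternative</code>:<br />
<br />
<haskell>
import Control.Applicative ((<|>))

main :: IO ()
main = do
  -- Maybe: (<|>) keeps the first success
  print (Nothing <|> Just 3 <|> Just 4 :: Maybe Int)  -- Just 3
  -- lists: (<|>) is concatenation
  print ([1,2] <|> [3] :: [Int])                      -- [1,2,3]
</haskell>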
<br />
Likewise, <code>MonadPlus</code> ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad.html#t:MonadPlus haddock])<br />
is for <code>Monad</code>s with a monoid structure:<br />
<br />
<haskell><br />
class Monad m => MonadPlus m where<br />
mzero :: m a<br />
mplus :: m a -> m a -> m a<br />
</haskell><br />
<br />
The <code>MonadPlus</code> documentation states that it is intended to model<br />
monads which also support “choice and failure”; in addition to the<br />
monoid laws, instances of <code>MonadPlus</code> are expected to satisfy<br />
<br />
<haskell><br />
mzero >>= f = mzero<br />
v >> mzero = mzero<br />
</haskell><br />
<br />
which explains the sense in which <code>mzero</code> denotes failure. Since<br />
<code>mzero</code> should be the identity for <code>mplus</code>, the computation <code>m1 `mplus` m2</code> succeeds (evaluates to something other than <code>mzero</code>) if<br />
either <code>m1</code> or <code>m2</code> does; so <code>mplus</code> represents choice. The <code>guard</code><br />
function can also be used with instances of <code>MonadPlus</code>; it requires a<br />
condition to be satisfied and fails (using <code>mzero</code>) if it is not. A<br />
simple example of a <code>MonadPlus</code> instance is <code>[]</code>, which is exactly the<br />
same as the <code>Monoid</code> instance for <code>[]</code>: the empty list represents<br />
failure, and list concatenation represents choice. In general,<br />
however, a <code>MonadPlus</code> instance for a type need not be the same as its<br />
<code>Monoid</code> instance; <code>Maybe</code> is an example of such a type. A great<br />
introduction to the <code>MonadPlus</code> type class, with interesting examples<br />
of its use, is Doug Auclair’s ''MonadPlus: What a Super Monad!'' in [http://www.haskell.org/wikiupload/6/6a/TMR-Issue11.pdf the Monad.Reader issue 11].<br />
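As a small illustration of <code>guard</code> with the list instance of <code>MonadPlus</code>, here is the classic Pythagorean-triple search (a sketch; the name <code>triples</code> is ours). Branches where the guard fails evaluate to <code>mzero</code>, i.e. the empty list, and simply vanish from the result:<br />
<br />
<haskell>
import Control.Monad (guard)

triples :: [(Int, Int, Int)]
triples = do
  c <- [1..20]
  b <- [1..c]
  a <- [1..b]
  guard (a*a + b*b == c*c)  -- fail (mzero) unless a^2 + b^2 = c^2
  return (a, b, c)

main :: IO ()
main = print (take 2 triples)  -- [(3,4,5),(6,8,10)]
</haskell>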
<br />
There used to be a type class called <code>MonadZero</code> containing only<br />
<code>mzero</code>, representing monads with failure. The <code>do</code>-notation requires<br />
some notion of failure to deal with failing pattern matches.<br />
Unfortunately, <code>MonadZero</code> was scrapped in favor of adding the <code>fail</code><br />
method to the <code>Monad</code> class. If we are lucky, someday <code>MonadZero</code> will<br />
be restored, and <code>fail</code> will be banished to the bit bucket where it<br />
belongs (see [[MonadPlus reform proposal]]). The idea is that any<br />
<code>do</code>-block which uses pattern matching (and hence may fail) would require<br />
a <code>MonadZero</code> constraint; otherwise, only a <code>Monad</code> constraint would be<br />
required.<br />
<br />
Finally, <code>ArrowZero</code> and <code>ArrowPlus</code> ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Arrow.html#t:ArrowZero haddock])<br />
represent <code>Arrow</code>s ([[#Arrow|see below]]) with a<br />
monoid structure:<br />
<br />
<haskell><br />
class Arrow arr => ArrowZero arr where<br />
zeroArrow :: b `arr` c<br />
<br />
class ArrowZero arr => ArrowPlus arr where<br />
(<+>) :: (b `arr` c) -> (b `arr` c) -> (b `arr` c)<br />
</haskell><br />
<br />
==Further reading==<br />
<br />
Monoids have gotten a fair bit of attention recently, ultimately due<br />
to<br />
[http://enfranchisedmind.com/blog/posts/random-thoughts-on-haskell/ a blog post by Brian Hurt], in which he<br />
complained about the fact that the names of many Haskell type classes<br />
(<code>Monoid</code> in particular) are taken from abstract mathematics. This<br />
resulted in [http://thread.gmane.org/gmane.comp.lang.haskell.cafe/50590 a long Haskell-cafe thread]<br />
arguing the point and discussing monoids in general.<br />
<br />
{{note|May its name live forever.}}<br />
<br />
However, this was quickly followed by several blog posts about<br />
<code>Monoid</code> {{noteref}}. First, Dan Piponi<br />
wrote a great introductory post, [http://blog.sigfpe.com/2009/01/haskell-monoids-and-their-uses.html Haskell Monoids and their Uses]. This was quickly followed by<br />
Heinrich Apfelmus’s [http://apfelmus.nfshost.com/monoid-fingertree.html Monoids and Finger Trees], an accessible exposition of<br />
Hinze and Paterson’s [http://www.soi.city.ac.uk/%7Eross/papers/FingerTree.html classic paper on 2-3 finger trees], which makes very clever<br />
use of <code>Monoid</code> to implement an elegant and generic data structure.<br />
Dan Piponi then wrote two fascinating articles about using <code>Monoids</code><br />
(and finger trees): [http://blog.sigfpe.com/2009/01/fast-incremental-regular-expression.html Fast Incremental Regular Expressions] and [http://blog.sigfpe.com/2009/01/beyond-regular-expressions-more.html Beyond Regular Expressions].<br />
<br />
In a similar vein, David Place’s article on improving <code>Data.Map</code> in<br />
order to compute incremental folds (see [http://www.haskell.org/wikiupload/6/6a/TMR-Issue11.pdf the Monad Reader issue 11])<br />
is also a<br />
good example of using <code>Monoid</code> to generalize a data structure.<br />
<br />
Some other interesting examples of <code>Monoid</code> use include [http://www.reddit.com/r/programming/comments/7cf4r/monoids_in_my_programming_language/c06adnx building elegant list sorting combinators], [http://byorgey.wordpress.com/2008/04/17/collecting-unstructured-information-with-the-monoid-of-partial-knowledge/ collecting unstructured information], [http://izbicki.me/blog/gausian-distributions-are-monoids combining probability distributions], and a brilliant series of posts by Chung-Chieh Shan and Dylan Thurston using <code>Monoid</code>s to [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers1/ elegantly solve a difficult combinatorial puzzle] (followed by [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers2/ part 2], [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers3/ part 3], [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers4/ part 4]).<br />
<br />
As unlikely as it sounds, monads can actually be viewed as a sort of<br />
monoid, with <code>join</code> playing the role of the binary operation and<br />
<code>return</code> the role of the identity; see [http://blog.sigfpe.com/2008/11/from-monoids-to-monads.html Dan Piponi’s blog post].<br />
<br />
=Foldable=<br />
<br />
The <code>Foldable</code> class, defined in the <code>Data.Foldable</code><br />
module ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Foldable.html haddock]), abstracts over containers which can be<br />
“folded” into a summary value. This allows such folding operations<br />
to be written in a container-agnostic way.<br />
<br />
==Definition==<br />
<br />
The definition of the <code>Foldable</code> type class is:<br />
<br />
<haskell><br />
class Foldable t where<br />
fold :: Monoid m => t m -> m<br />
foldMap :: Monoid m => (a -> m) -> t a -> m<br />
<br />
foldr :: (a -> b -> b) -> b -> t a -> b<br />
foldl :: (a -> b -> a) -> a -> t b -> a<br />
foldr1 :: (a -> a -> a) -> t a -> a<br />
foldl1 :: (a -> a -> a) -> t a -> a<br />
</haskell><br />
<br />
This may look complicated, but in fact, to make a <code>Foldable</code> instance<br />
you only need to implement one method: your choice of <code>foldMap</code> or<br />
<code>foldr</code>. All the other methods have default implementations in terms<br />
of these, and are presumably included in the class in case more<br />
efficient implementations can be provided.<br />
<br />
==Instances and examples==<br />
<br />
The type of <code>foldMap</code> should make it clear what it is supposed to do:<br />
given a way to convert the data in a container into a <code>Monoid</code> (a<br />
function <code>a -> m</code>) and a container of <code>a</code>’s (<code>t a</code>), <code>foldMap</code><br />
provides a way to iterate over the entire contents of the container,<br />
converting all the <code>a</code>’s to <code>m</code>’s and combining all the <code>m</code>’s with<br />
<code>mappend</code>. The following code shows two examples: a simple<br />
implementation of <code>foldMap</code> for lists, and a binary tree example<br />
provided by the <code>Foldable</code> documentation.<br />
<br />
<haskell><br />
instance Foldable [] where<br />
foldMap g = mconcat . map g<br />
<br />
data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)<br />
<br />
instance Foldable Tree where<br />
foldMap f Empty = mempty<br />
foldMap f (Leaf x) = f x<br />
foldMap f (Node l k r) = foldMap f l `mappend` f k `mappend` foldMap f r<br />
</haskell><br />
<br />
The <code>foldr</code> function has a type similar to the <code>foldr</code> found in the <code>Prelude</code>, but<br />
more general, since the <code>foldr</code> in the <code>Prelude</code> works only on lists.<br />
<br />
The <code>Foldable</code> module also provides instances for <code>Maybe</code> and <code>Array</code>;<br />
additionally, many of the data structures found in the standard [http://hackage.haskell.org/package/containers containers library] (for example, <code>Map</code>, <code>Set</code>, <code>Tree</code>,<br />
and <code>Sequence</code>) provide their own <code>Foldable</code> instances.<br />
<br />
{{Exercises|<br />
# What is the type of <code>foldMap . foldMap</code>? Or <code>foldMap . foldMap . foldMap</code>, etc.? What do they do?<br />
}}<br />
<br />
==Derived folds==<br />
<br />
Given an instance of <code>Foldable</code>, we can write generic,<br />
container-agnostic functions such as:<br />
<br />
<haskell><br />
-- Compute the size of any container.<br />
containerSize :: Foldable f => f a -> Int<br />
containerSize = getSum . foldMap (const (Sum 1))<br />
<br />
-- Compute a list of elements of a container satisfying a predicate.<br />
filterF :: Foldable f => (a -> Bool) -> f a -> [a]<br />
filterF p = foldMap (\a -> if p a then [a] else [])<br />
<br />
-- Get a list of all the Strings in a container which include the<br />
-- letter a.<br />
aStrings :: Foldable f => f String -> [String]<br />
aStrings = filterF (elem 'a')<br />
</haskell><br />
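To see these derived folds at work, here is a standalone usage sketch (the definitions are repeated so the snippet runs on its own; <code>Foldable</code> is in the Prelude from GHC 7.10 on):<br />
<br />
<haskell>
import Data.Monoid (Sum(..))

containerSize :: Foldable f => f a -> Int
containerSize = getSum . foldMap (const (Sum 1))

filterF :: Foldable f => (a -> Bool) -> f a -> [a]
filterF p = foldMap (\a -> if p a then [a] else [])

main :: IO ()
main = do
  print (containerSize (Just 'x'))      -- 1
  print (containerSize [1,2,3 :: Int])  -- 3
  print (filterF even [1..10 :: Int])   -- [2,4,6,8,10]
</haskell>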
<br />
The <code>Foldable</code> module also provides a large number of predefined<br />
folds, many of which are generalized versions of <code>Prelude</code> functions of the<br />
same name that only work on lists: <code>concat</code>, <code>concatMap</code>, <code>and</code>,<br />
<code>or</code>, <code>any</code>, <code>all</code>, <code>sum</code>, <code>product</code>, <code>maximum</code>(<code>By</code>),<br />
<code>minimum</code>(<code>By</code>), <code>elem</code>, <code>notElem</code>, and <code>find</code>.<br />
<br />
The important function <code>toList</code> is also provided, which turns any <code>Foldable</code> structure into a list of its elements in left-right order; it works by folding with the list monoid.<br />
<br />
There are also generic functions that work with <code>Applicative</code> or<br />
<code>Monad</code> instances to generate some sort of computation from each<br />
element in a container, and then perform all the side effects from<br />
those computations, discarding the results: <code>traverse_</code>, <code>sequenceA_</code>,<br />
and others. The results must be discarded because the <code>Foldable</code><br />
class is too weak to specify what to do with them: we cannot, in<br />
general, make an arbitrary <code>Applicative</code> or <code>Monad</code> instance into a <code>Monoid</code>, but we can make <code>m ()</code> into a <code>Monoid</code> for any such <code>m</code>. If we do have an <code>Applicative</code> or <code>Monad</code> with a monoid<br />
structure—that is, an <code>Alternative</code> or a <code>MonadPlus</code>—then we can<br />
use the <code>asum</code> or <code>msum</code> functions, which can combine the results as<br />
well. Consult the [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Foldable.html <code>Foldable</code> documentation] for<br />
more details on any of these functions.<br />
<br />
Note that the <code>Foldable</code> operations always forget the structure of<br />
the container being folded. If we start with a container of type <code>t a</code> for some <code>Foldable t</code>, then <code>t</code> will never appear in the output<br />
type of any operations defined in the <code>Foldable</code> module. Many times<br />
this is exactly what we want, but sometimes we would like to be able<br />
to generically traverse a container while preserving its<br />
structure—and this is exactly what the <code>Traversable</code> class provides,<br />
which will be discussed in the next section.<br />
<br />
{{Exercises|<br />
# Implement <code>toList :: Foldable f {{=}}> f a -> [a]</code>.<br />
# Pick some of the following functions to implement: <code>concat</code>, <code>concatMap</code>, <code>and</code>, <code>or</code>, <code>any</code>, <code>all</code>, <code>sum</code>, <code>product</code>, <code>maximum</code>(<code>By</code>), <code>minimum</code>(<code>By</code>), <code>elem</code>, <code>notElem</code>, and <code>find</code>. Figure out how they generalize to <code>Foldable</code> and come up with elegant implementations using <code>fold</code> or <code>foldMap</code> along with appropriate <code>Monoid</code> instances.<br />
}}<br />
<br />
==Foldable actually isn't==<br />
<br />
The generic term "fold" is often used to refer to the more technical concept of [[Catamorphisms|catamorphism]]. Intuitively, given a way to summarize "one level of structure" (where recursive subterms have already been replaced with their summaries), a catamorphism can summarize an entire recursive structure. It is important to realize that <code>Foldable</code> does <i>not</i> correspond to catamorphisms, but to something weaker. In particular, <code>Foldable</code> allows observing only the left-right order of elements within a structure, not the actual structure itself. Put another way, every use of <code>Foldable</code> can be expressed in terms of <code>toList</code>. For example, <code>fold</code> itself is equivalent to <code>mconcat . toList</code>.<br />
<br />
This is sufficient for many tasks, but not all. For example, consider trying to compute the depth of a <code>Tree</code>: try as we might, there is no way to implement it using <code>Foldable</code>. However, it <i>can</i> be implemented as a catamorphism.<br />
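For concreteness, here is a sketch of <code>depth</code> written directly as a structural recursion (a catamorphism) on the <code>Tree</code> type from the earlier example. It needs the shape of the tree, not just the left-to-right order of its elements, which is exactly what <code>Foldable</code> cannot provide:<br />
<br />
<haskell>
data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)

depth :: Tree a -> Int
depth Empty        = 0
depth (Leaf _)     = 1
depth (Node l _ r) = 1 + max (depth l) (depth r)

main :: IO ()
main = print (depth (Node (Node (Leaf 'a') 'b' Empty) 'c' (Leaf 'd')))  -- 3
</haskell>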
<br />
==Further reading==<br />
<br />
The <code>Foldable</code> class had its genesis in [http://www.soi.city.ac.uk/~ross/papers/Applicative.html McBride and Paterson’s paper]<br />
introducing <code>Applicative</code>, although it has<br />
been fleshed out quite a bit from the form in the paper.<br />
<br />
An interesting use of <code>Foldable</code> (as well as <code>Traversable</code>) can be<br />
found in Janis Voigtländer’s paper [http://doi.acm.org/10.1145/1480881.1480904 Bidirectionalization for free!].<br />
<br />
=Traversable=<br />
<br />
==Definition==<br />
<br />
The <code>Traversable</code> type class, defined in the <code>Data.Traversable</code><br />
module ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Traversable.html haddock]), is:<br />
<br />
<haskell><br />
class (Functor t, Foldable t) => Traversable t where<br />
traverse :: Applicative f => (a -> f b) -> t a -> f (t b)<br />
sequenceA :: Applicative f => t (f a) -> f (t a)<br />
mapM :: Monad m => (a -> m b) -> t a -> m (t b)<br />
sequence :: Monad m => t (m a) -> m (t a)<br />
</haskell><br />
<br />
As you can see, every <code>Traversable</code> is also a foldable functor. Like<br />
<code>Foldable</code>, there is a lot in this type class, but making instances is<br />
actually rather easy: one need only implement <code>traverse</code> or<br />
<code>sequenceA</code>; the other methods all have default implementations in<br />
terms of these functions. A good exercise is to figure out what the default<br />
implementations should be: given either <code>traverse</code> or <code>sequenceA</code>, how<br />
would you define the other three methods? (Hint for <code>mapM</code>:<br />
<code>Control.Applicative</code> exports the <code>WrapMonad</code> newtype, which makes any<br />
<code>Monad</code> into an <code>Applicative</code>. The <code>sequence</code> function can be implemented in terms<br />
of <code>mapM</code>.)<br />
<br />
==Intuition==<br />
<br />
The key method of the <code>Traversable</code> class, and the source of its<br />
unique power, is <code>sequenceA</code>. Consider its type:<br />
<haskell><br />
sequenceA :: Applicative f => t (f a) -> f (t a)<br />
</haskell><br />
This answers the fundamental question: when can we commute two<br />
functors? For example, can we turn a tree of lists into a list of<br />
trees?<br />
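A few small experiments show <code>sequenceA</code> commuting functors (assuming GHC 7.10 or later, where <code>sequenceA</code> is in the Prelude; older GHCs can import it from <code>Data.Traversable</code>):<br />
<br />
<haskell>
main :: IO ()
main = do
  -- a list of Maybes becomes a Maybe list: all-or-nothing
  print (sequenceA [Just 1, Just 2 :: Maybe Int])   -- Just [1,2]
  print (sequenceA [Just 1, Nothing :: Maybe Int])  -- Nothing
  -- a list of lists becomes the list of all selections
  print (sequenceA [[1,2],[3,4 :: Int]])            -- [[1,3],[1,4],[2,3],[2,4]]
</haskell>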
<br />
The ability to compose two monads depends crucially on this ability to<br />
commute functors. Intuitively, if we want to build a composed monad<br />
<code>M a = m (n a)</code> out of monads <code>m</code> and <code>n</code>, then to be able to<br />
implement <code>join :: M (M a) -> M a</code>, that is,<br />
<code>join :: m (n (m (n a))) -> m (n a)</code>, we have to be able to commute<br />
the <code>n</code> past the <code>m</code> to get <code>m (m (n (n a)))</code>, and then we can use the<br />
<code>join</code>s for <code>m</code> and <code>n</code> to produce something of type <code>m (n a)</code>. See<br />
[http://web.cecs.pdx.edu/~mpj/pubs/springschool.html Mark Jones’s paper] for more details.<br />
<br />
Alternatively, looking at the type of <code>traverse</code>,<br />
<haskell><br />
traverse :: Applicative f => (a -> f b) -> t a -> f (t b)<br />
</haskell><br />
leads us to view <code>Traversable</code> as a generalization of <code>Functor</code>. <code>traverse</code> is an "effectful <code>fmap</code>": it allows us to map over a structure of type <code>t a</code>, applying a function to every element of type <code>a</code> in order to produce a new structure of type <code>t b</code>; but along the way the function may have some effects (captured by the applicative functor <code>f</code>).<br />
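As a small sketch of this "effectful <code>fmap</code>" view, the following uses the <code>Maybe</code> applicative, so the whole traversal fails if any single element fails (the helper name <code>parseAll</code> is ours; <code>readMaybe</code> comes from <code>Text.Read</code> in base):<br />
<br />
<haskell>
import Text.Read (readMaybe)

parseAll :: [String] -> Maybe [Int]
parseAll = traverse readMaybe

main :: IO ()
main = do
  print (parseAll ["1","2","3"])  -- Just [1,2,3]
  print (parseAll ["1","two"])    -- Nothing
</haskell>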
<br />
{{Exercises|<br />
# There are at least two natural ways to turn a tree of lists into a list of trees. What are they, and why?<br />
# Give a natural way to turn a list of trees into a tree of lists.<br />
# What is the type of <code>traverse . traverse</code>? What does it do?<br />
}}<br />
<br />
==Instances and examples==<br />
<br />
What’s an example of a <code>Traversable</code> instance?<br />
The following code shows an example instance for the same<br />
<code>Tree</code> type used as an example in the previous <code>Foldable</code> section. It<br />
is instructive to compare this instance with a <code>Functor</code> instance for<br />
<code>Tree</code>, which is also shown.<br />
<br />
<haskell><br />
data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)<br />
<br />
instance Traversable Tree where<br />
traverse g Empty = pure Empty<br />
traverse g (Leaf x) = Leaf <$> g x<br />
traverse g (Node l x r) = Node <$> traverse g l<br />
<*> g x<br />
<*> traverse g r<br />
<br />
instance Functor Tree where<br />
fmap g Empty = Empty<br />
fmap g (Leaf x) = Leaf $ g x<br />
fmap g (Node l x r) = Node (fmap g l)<br />
(g x)<br />
(fmap g r)<br />
</haskell><br />
<br />
It should be clear that the <code>Traversable</code> and <code>Functor</code> instances for<br />
<code>Tree</code> are almost identical; the only difference is that the <code>Functor</code><br />
instance involves normal function application, whereas the<br />
applications in the <code>Traversable</code> instance take place within an<br />
<code>Applicative</code> context, using <code>(<$>)</code> and <code>(<*>)</code>. In fact, this will<br />
be<br />
true for any type.<br />
<br />
Any <code>Traversable</code> functor is also <code>Foldable</code>, and a <code>Functor</code>. We can see<br />
this not only from the class declaration, but by the fact that we can<br />
implement the methods of both classes given only the <code>Traversable</code><br />
methods.<br />
<br />
The standard libraries provide a number of <code>Traversable</code> instances,<br />
including instances for <code>[]</code>, <code>Maybe</code>, <code>Map</code>, <code>Tree</code>, and <code>Sequence</code>.<br />
Notably, <code>Set</code> is not <code>Traversable</code>, although it is <code>Foldable</code>.<br />
<br />
{{Exercises|<br />
# Implement <code>fmap</code> and <code>foldMap</code> using only the <code>Traversable</code> methods. (Note that the <code>Traversable</code> module provides these implementations as <code>fmapDefault</code> and <code>foldMapDefault</code>.)<br />
}}<br />
<br />
==Laws==<br />
<br />
Any instance of <code>Traversable</code> must satisfy the following two laws, where <code>Identity</code> is the identity functor (as defined in the [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Data-Functor-Identity.html <code>Data.Functor.Identity</code> module] from the <code>transformers</code> package), and <code>Compose</code> wraps the composition of two functors (as defined in [http://hackage.haskell.org/packages/archive/transformers/0.3.0.0/doc/html/Data-Functor-Compose.html <code>Data.Functor.Compose</code>]):<br />
<br />
# <code>traverse Identity = Identity</code><br />
# <code>traverse (Compose . fmap g . f) = Compose . fmap (traverse g) . traverse f</code><br />
<br />
The first law essentially says that traversals cannot make up arbitrary effects. The second law explains how doing two traversals in sequence can be collapsed to a single traversal.<br />
<br />
Additionally, suppose <code>eta</code> is an "<code>Applicative</code> morphism", that is,<br />
<haskell><br />
eta :: forall a f g. (Applicative f, Applicative g) => f a -> g a<br />
</haskell><br />
and <code>eta</code> preserves the <code>Applicative</code> operations: <code>eta (pure x) = pure x</code> and <code>eta (x <*> y) = eta x <*> eta y</code>. Then, by parametricity, any instance of <code>Traversable</code> satisfying the above two laws will also satisfy <code>eta . traverse f = traverse (eta . f)</code>.<br />
<br />
==Further reading==<br />
<br />
The <code>Traversable</code> class also had its genesis in [http://www.soi.city.ac.uk/~ross/papers/Applicative.html McBride and Paterson’s <code>Applicative</code> paper],<br />
and is described in more detail in Gibbons and Oliveira, [http://www.comlab.ox.ac.uk/jeremy.gibbons/publications/iterator.pdf The Essence of the Iterator Pattern],<br />
which also contains a wealth of references to related work.<br />
<br />
<code>Traversable</code> forms a core component of Edward Kmett's [http://hackage.haskell.org/package/lens lens library]. Watching [https://vimeo.com/56063074 Edward's talk on the subject] is a highly recommended way to gain better insight into <code>Traversable</code>, <code>Foldable</code>, <code>Applicative</code>, and many other things besides.<br />
<br />
For references on the <code>Traversable</code> laws, see Russell O'Connor's [http://article.gmane.org/gmane.comp.lang.haskell.libraries/17778 mailing list post] (and subsequent thread).<br />
<br />
=Category=<br />
<br />
<code>Category</code> is a relatively recent addition to the Haskell standard libraries. It generalizes the notion of function composition to general “morphisms”.<br />
<br />
{{note|GHC 7.6.1 changed its rules regarding types and type variables. Now, any operator at the type level is treated as a type ''constructor'' rather than a type ''variable''; prior to GHC 7.6.1 it was possible to use <code>(~&gt;)</code> instead of <code>`arr`</code>. For more information, see [http://thread.gmane.org/gmane.comp.lang.haskell.glasgow.user/21350 the discussion on the GHC-users mailing list]. For a new approach to nice arrow notation that works with GHC 7.6.1, see [http://article.gmane.org/gmane.comp.lang.haskell.glasgow.user/22615 this message] and also [http://article.gmane.org/gmane.comp.lang.haskell.glasgow.user/22616 this message] from Edward Kmett, though for simplicity I haven't adopted it here.}}<br />
The definition of the <code>Category</code> type class (from<br />
<code>Control.Category</code>; [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Category.html haddock]) is shown below. For ease of reading, note that I have used an infix type variable <code>`arr`</code>, in parallel with the infix function type constructor <code>(->)</code>. {{noteref}} This syntax is not part of Haskell 2010. The second definition shown is the one used in the standard libraries. For the remainder of this document, I will use the infix type variable <code>`arr`</code> for <code>Category</code> as well as <code>Arrow</code>.<br />
<br />
<haskell><br />
class Category arr where<br />
id :: a `arr` a<br />
(.) :: (b `arr` c) -> (a `arr` b) -> (a `arr` c)<br />
<br />
-- The same thing, with a normal (prefix) type constructor<br />
class Category cat where<br />
id :: cat a a<br />
(.) :: cat b c -> cat a b -> cat a c<br />
</haskell><br />
<br />
Note that an instance of <code>Category</code> should be a type constructor which takes two type arguments, that is, something of kind <code>* -> * -> *</code>. It is instructive to imagine the type constructor variable <code>cat</code> replaced by the function constructor <code>(->)</code>: indeed, in this case we recover precisely the familiar identity function <code>id</code> and function composition operator <code>(.)</code> defined in the standard <code>Prelude</code>.<br />
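<br />
Spelling out that replacement gives the following (a sketch equivalent to the instance provided by <code>Control.Category</code>):<br />
<br />
<haskell><br />
instance Category (->) where<br />
  id x      = x<br />
  (g . h) x = g (h x)<br />
</haskell><br />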
<br />
Of course, the <code>Category</code> module provides exactly such an instance of<br />
<code>Category</code> for <code>(->)</code>. But it also provides one other instance, shown below, which should be familiar from the previous discussion of the <code>Monad</code> laws. <code>Kleisli m a b</code>, as defined in the <code>Control.Arrow</code> module, is just a <code>newtype</code> wrapper around <code>a -> m b</code>.<br />
<br />
<haskell><br />
newtype Kleisli m a b = Kleisli { runKleisli :: a -> m b }<br />
<br />
instance Monad m => Category (Kleisli m) where<br />
id = Kleisli return<br />
Kleisli g . Kleisli h = Kleisli (h >=> g)<br />
</haskell><br />
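<br />
As a small usage sketch (the helper names here are invented for illustration):<br />
<br />
<haskell><br />
import Prelude hiding (id, (.))<br />
import Control.Category<br />
import Control.Arrow (Kleisli (..))<br />
<br />
-- Halve a number, but only if it is even:<br />
halve :: Int -> Maybe Int<br />
halve n = if even n then Just (n `div` 2) else Nothing<br />
<br />
-- Kleisli composition threads the Maybe effect through both steps:<br />
quarter :: Kleisli Maybe Int Int<br />
quarter = Kleisli halve . Kleisli halve<br />
<br />
-- runKleisli quarter 12 == Just 3<br />
-- runKleisli quarter 6  == Nothing<br />
</haskell><br />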
<br />
The only law that <code>Category</code> instances should satisfy is that <code>id</code> and <code>(.)</code> should form a monoid—that is, <code>id</code> should be the identity of <code>(.)</code>, and <code>(.)</code> should be associative.<br />
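<br />
Written out as equations, these laws are:<br />
<br />
<haskell><br />
id . g  =  g<br />
g . id  =  g<br />
(g . h) . k  =  g . (h . k)<br />
</haskell><br />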
<br />
Finally, the <code>Category</code> module exports two additional operators:<br />
<code>(<<<)</code>, which is just a synonym for <code>(.)</code>, and <code>(>>>)</code>, which is <code>(.)</code> with its arguments reversed. (In previous versions of the libraries, these operators were defined as part of the <code>Arrow</code> class.)<br />
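<br />
For ordinary functions, the two operators differ only in the order of composition:<br />
<br />
<haskell><br />
import Control.Category ((>>>), (<<<))<br />
<br />
addThenDouble, doubleThenAdd :: Int -> Int<br />
addThenDouble = (+ 1) >>> (* 2)   -- \x -> (x + 1) * 2<br />
doubleThenAdd = (+ 1) <<< (* 2)   -- \x -> (x * 2) + 1<br />
<br />
-- addThenDouble 3 == 8<br />
-- doubleThenAdd 3 == 7<br />
</haskell><br />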
<br />
==Further reading==<br />
<br />
The name <code>Category</code> is a bit misleading, since the <code>Category</code> class cannot represent arbitrary categories, but only categories whose objects are objects of <code>Hask</code>, the category of Haskell types. For a more general treatment of categories within Haskell, see the [http://hackage.haskell.org/package/category-extras category-extras package]. For more about category theory in general, see the excellent [http://en.wikibooks.org/wiki/Haskell/Category_theory Haskell wikibook page],<br />
[http://books.google.com/books/about/Category_theory.html?id=-MCJ6x2lC7oC Steve Awodey’s new book], Benjamin Pierce’s [http://books.google.com/books/about/Basic_category_theory_for_computer_scien.html?id=ezdeaHfpYPwC Basic category theory for computer scientists], or [http://folli.loria.fr/cds/1999/esslli99/courses/barr-wells.html Barr and Wells’s category theory lecture notes]. [http://dekudekuplex.wordpress.com/2009/01/19/motivating-learning-category-theory-for-non-mathematicians/ Benjamin Russell’s blog post]<br />
is another good source of motivation and category theory links. You certainly don’t need to know any category theory to be a successful and productive Haskell programmer, but it does lend itself to much deeper appreciation of Haskell’s underlying theory.<br />
<br />
=Arrow=<br />
<br />
The <code>Arrow</code> class represents another abstraction of computation, in a<br />
similar vein to <code>Monad</code> and <code>Applicative</code>. However, unlike <code>Monad</code><br />
and <code>Applicative</code>, whose types only reflect their output, the type of<br />
an <code>Arrow</code> computation reflects both its input and output. Arrows<br />
generalize functions: if <code>arr</code> is an instance of <code>Arrow</code>, a value of<br />
type <code>b `arr` c</code> can be thought of as a computation which takes values of<br />
type <code>b</code> as input, and produces values of type <code>c</code> as output. In the<br />
<code>(->)</code> instance of <code>Arrow</code> this is just a pure function; in general, however,<br />
an arrow may represent some sort of “effectful” computation.<br />
<br />
==Definition==<br />
<br />
The definition of the <code>Arrow</code> type class, from<br />
<code>Control.Arrow</code> ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Arrow.html haddock]), is:<br />
<br />
<haskell><br />
class Category arr => Arrow arr where<br />
arr :: (b -> c) -> (b `arr` c)<br />
first :: (b `arr` c) -> ((b, d) `arr` (c, d))<br />
second :: (b `arr` c) -> ((d, b) `arr` (d, c))<br />
(***) :: (b `arr` c) -> (b' `arr` c') -> ((b, b') `arr` (c, c'))<br />
(&&&) :: (b `arr` c) -> (b `arr` c') -> (b `arr` (c, c'))<br />
</haskell><br />
<br />
{{note|In versions of the <code>base</code><br />
package prior to version 4, there is no <code>Category</code> class, and the<br />
<code>Arrow</code> class includes the arrow composition operator <code>(>>>)</code>. It<br />
also includes <code>pure</code> as a synonym for <code>arr</code>, but this was removed<br />
since it conflicts with the <code>pure</code> from <code>Applicative</code>.}}<br />
<br />
The first thing to note is the <code>Category</code> class constraint, which<br />
means that we get identity arrows and arrow composition for free:<br />
given two arrows <code>g :: b `arr` c</code> and <code>h :: c `arr` d</code>, we can form their<br />
composition <code>g >>> h :: b `arr` d</code> {{noteref}}.<br />
<br />
As should be a familiar pattern by now, the only methods which must be<br />
defined when writing a new instance of <code>Arrow</code> are <code>arr</code> and <code>first</code>;<br />
the other methods have default definitions in terms of these, but are<br />
included in the <code>Arrow</code> class so that they can be overridden with more<br />
efficient implementations if desired.<br />
<br />
==Intuition==<br />
<br />
Let’s look at each of the arrow methods in turn. [http://www.haskell.org/arrows/ Ross Paterson’s web page on arrows] has nice diagrams which can help<br />
build intuition.<br />
<br />
* The <code>arr</code> function takes any function <code>b -> c</code> and turns it into a generalized arrow <code>b `arr` c</code>. The <code>arr</code> method justifies the claim that arrows generalize functions, since it says that we can treat any function as an arrow. It is intended that the arrow <code>arr g</code> is “pure” in the sense that it only computes <code>g</code> and has no “effects” (whatever that might mean for any particular arrow type).<br />
<br />
* The <code>first</code> method turns any arrow from <code>b</code> to <code>c</code> into an arrow from <code>(b,d)</code> to <code>(c,d)</code>. The idea is that <code>first g</code> uses <code>g</code> to process the first element of a tuple, and lets the second element pass through unchanged. For the function instance of <code>Arrow</code>, of course, <code>first g (x,y) = (g x, y)</code>.<br />
<br />
* The <code>second</code> function is similar to <code>first</code>, but with the elements of the tuples swapped. Indeed, it can be defined in terms of <code>first</code> using an auxiliary function <code>swap</code>, defined by <code>swap (x,y) = (y,x)</code>.<br />
<br />
* The <code>(***)</code> operator is “parallel composition” of arrows: it takes two arrows and makes them into one arrow on tuples, which has the behavior of the first arrow on the first element of a tuple, and the behavior of the second arrow on the second element. The mnemonic is that <code>g *** h</code> is the ''product'' (hence <code>*</code>) of <code>g</code> and <code>h</code>. For the function instance of <code>Arrow</code>, we define <code>(g *** h) (x,y) = (g x, h y)</code>. The default implementation of <code>(***)</code> is in terms of <code>first</code>, <code>second</code>, and sequential arrow composition <code>(>>>)</code>. The reader may also wish to think about how to implement <code>first</code> and <code>second</code> in terms of <code>(***)</code>.<br />
<br />
* The <code>(&&&)</code> operator is “fanout composition” of arrows: it takes two arrows <code>g</code> and <code>h</code> and makes them into a new arrow <code>g &&& h</code> which supplies its input as the input to both <code>g</code> and <code>h</code>, returning their results as a tuple. The mnemonic is that <code>g &&& h</code> performs both <code>g</code> ''and'' <code>h</code> (hence <code>&</code>) on its input. For functions, we define <code>(g &&& h) x = (g x, h x)</code>.<br />
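<br />
The default definitions hinted at above can be sketched as follows (primed names are used here to avoid clashing with the real methods in <code>Control.Arrow</code>):<br />
<br />
<haskell><br />
import Control.Arrow<br />
<br />
swap :: (a, b) -> (b, a)<br />
swap (x, y) = (y, x)<br />
<br />
second' :: Arrow arr => arr b c -> arr (d, b) (d, c)<br />
second' g = arr swap >>> first g >>> arr swap<br />
<br />
prod' :: Arrow arr => arr b c -> arr b' c' -> arr (b, b') (c, c')<br />
prod' g h = first g >>> second' h<br />
<br />
fanout' :: Arrow arr => arr b c -> arr b c' -> arr b (c, c')<br />
fanout' g h = arr (\x -> (x, x)) >>> prod' g h<br />
<br />
-- For functions: fanout' (+ 1) (* 2) 3 == (4, 6)<br />
</haskell><br />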
<br />
==Instances==<br />
<br />
The <code>Arrow</code> library itself only provides two <code>Arrow</code> instances, both<br />
of which we have already seen: <code>(->)</code>, the normal function<br />
constructor, and <code>Kleisli m</code>, which makes functions of<br />
type <code>a -> m b</code> into <code>Arrow</code>s for any <code>Monad m</code>. These instances are:<br />
<br />
<haskell><br />
instance Arrow (->) where<br />
arr g = g<br />
first g (x,y) = (g x, y)<br />
<br />
newtype Kleisli m a b = Kleisli { runKleisli :: a -> m b }<br />
<br />
instance Monad m => Arrow (Kleisli m) where<br />
arr f = Kleisli (return . f)<br />
first (Kleisli f) = Kleisli (\ ~(b,d) -> do c <- f b<br />
return (c,d) )<br />
</haskell><br />
<br />
==Laws==<br />
<br />
{{note|See [http://dx.doi.org/10.1016/S0167-6423(99)00023-4 John Hughes: Generalising monads to arrows]; [http://homepages.inf.ed.ac.uk/wadler/papers/arrows/arrows.pdf Sam Lindley, Philip Wadler, Jeremy Yallop: The arrow calculus]; [http://www.soi.city.ac.uk/~ross/papers/fop.html Ross Paterson: Programming with Arrows].}}<br />
<br />
There are quite a few laws that instances of <code>Arrow</code> should<br />
satisfy {{noteref}}:<br />
<br />
<haskell><br />
arr id = id<br />
arr (h . g) = arr g >>> arr h<br />
first (arr g) = arr (g *** id)<br />
first (g >>> h) = first g >>> first h<br />
first g >>> arr (id *** h) = arr (id *** h) >>> first g<br />
first g >>> arr fst = arr fst >>> g<br />
first (first g) >>> arr assoc = arr assoc >>> first g<br />
<br />
assoc ((x,y),z) = (x,(y,z))<br />
</haskell><br />
<br />
Note that this version of the laws is slightly different than the laws given in the<br />
first two above references, since several of the laws have now been<br />
subsumed by the <code>Category</code> laws (in particular, the requirements that<br />
<code>id</code> is the identity arrow and that <code>(>>>)</code> is associative). The laws<br />
shown here follow those in Paterson’s Programming with Arrows, which uses the<br />
<code>Category</code> class.<br />
<br />
{{note|Unless category-theory-induced insomnolence is your cup of tea.}}<br />
<br />
The reader is advised not to lose too much sleep over the <code>Arrow</code><br />
laws {{noteref}}, since it is not essential to understand them in order to<br />
program with arrows. There are also laws that <code>ArrowChoice</code>,<br />
<code>ArrowApply</code>, and <code>ArrowLoop</code> instances should satisfy; the interested<br />
reader should consult [http://www.soi.city.ac.uk/~ross/papers/fop.html Paterson: Programming with Arrows].<br />
<br />
==ArrowChoice==<br />
<br />
Computations built using the <code>Arrow</code> class, like those built using<br />
the <code>Applicative</code> class, are rather inflexible: the structure of the computation<br />
is fixed at the outset, and there is no ability to choose between<br />
alternate execution paths based on intermediate results.<br />
The <code>ArrowChoice</code> class provides exactly such an ability:<br />
<br />
<haskell><br />
class Arrow arr => ArrowChoice arr where<br />
left :: (b `arr` c) -> (Either b d `arr` Either c d)<br />
right :: (b `arr` c) -> (Either d b `arr` Either d c)<br />
(+++) :: (b `arr` c) -> (b' `arr` c') -> (Either b b' `arr` Either c c')<br />
(|||) :: (b `arr` d) -> (c `arr` d) -> (Either b c `arr` d)<br />
</haskell><br />
<br />
A comparison of <code>ArrowChoice</code> to <code>Arrow</code> will reveal a striking<br />
parallel between <code>left</code>, <code>right</code>, <code>(+++)</code>, <code>(|||)</code> and <code>first</code>,<br />
<code>second</code>, <code>(***)</code>, <code>(&&&)</code>, respectively. Indeed, they are dual:<br />
<code>first</code>, <code>second</code>, <code>(***)</code>, and <code>(&&&)</code> all operate on product types<br />
(tuples), and <code>left</code>, <code>right</code>, <code>(+++)</code>, and <code>(|||)</code> are the<br />
corresponding operations on sum types. In general, these operations<br />
create arrows whose inputs are tagged with <code>Left</code> or <code>Right</code>, and can<br />
choose how to act based on these tags.<br />
<br />
* If <code>g</code> is an arrow from <code>b</code> to <code>c</code>, then <code>left g</code> is an arrow from <code>Either b d</code> to <code>Either c d</code>. On inputs tagged with <code>Left</code>, the <code>left g</code> arrow has the behavior of <code>g</code>; on inputs tagged with <code>Right</code>, it behaves as the identity.<br />
<br />
* The <code>right</code> function, of course, is the mirror image of <code>left</code>. The arrow <code>right g</code> has the behavior of <code>g</code> on inputs tagged with <code>Right</code>.<br />
<br />
* The <code>(+++)</code> operator performs “multiplexing”: <code>g +++ h</code> behaves as <code>g</code> on inputs tagged with <code>Left</code>, and as <code>h</code> on inputs tagged with <code>Right</code>. The tags are preserved. The <code>(+++)</code> operator is the ''sum'' (hence <code>+</code>) of two arrows, just as <code>(***)</code> is the product.<br />
<br />
* The <code>(|||)</code> operator is “merge” or “fanin”: the arrow <code>g ||| h</code> behaves as <code>g</code> on inputs tagged with <code>Left</code>, and <code>h</code> on inputs tagged with <code>Right</code>, but the tags are discarded (hence, <code>g</code> and <code>h</code> must have the same output type). The mnemonic is that <code>g ||| h</code> performs either <code>g</code> ''or'' <code>h</code> on its input.<br />
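<br />
For the function instance, these operations amount to ordinary pattern matching on <code>Either</code>; in particular, <code>g ||| h</code> behaves exactly like <code>either g h</code> from the <code>Prelude</code>. A quick sketch:<br />
<br />
<haskell><br />
import Control.Arrow (left, (|||))<br />
<br />
exampleLeft :: Either Int String<br />
exampleLeft = left (+ 1) (Left 3)               -- Left 4<br />
<br />
exampleMerge :: Int<br />
exampleMerge = ((+ 1) ||| length) (Right "abc") -- 3<br />
</haskell><br />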
<br />
The <code>ArrowChoice</code> class allows computations to choose among a finite number of execution paths, based on intermediate results. The possible<br />
execution paths must be known in advance, and explicitly assembled with <code>(+++)</code> or <code>(|||)</code>. However, sometimes more flexibility is<br />
needed: we would like to be able to ''compute'' an arrow from intermediate results, and use this computed arrow to continue the computation. This is the power given to us by <code>ArrowApply</code>.<br />
<br />
==ArrowApply==<br />
<br />
The <code>ArrowApply</code> type class is:<br />
<br />
<haskell><br />
class Arrow arr => ArrowApply arr where<br />
app :: (b `arr` c, b) `arr` c<br />
</haskell><br />
<br />
If we have computed an arrow as the output of some previous<br />
computation, then <code>app</code> allows us to apply that arrow to an input,<br />
producing its output as the output of <code>app</code>. As an exercise, the<br />
reader may wish to use <code>app</code> to implement an alternative “curried”<br />
version, <code>app2 :: b `arr` ((b `arr` c) `arr` c)</code>.<br />
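<br />
For ordinary functions, <code>app</code> is just function application; this is the instance provided by <code>Control.Arrow</code>:<br />
<br />
<haskell><br />
instance ArrowApply (->) where<br />
  app (f, x) = f x<br />
</haskell><br />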
<br />
This notion of being able to ''compute'' a new computation<br />
may sound familiar:<br />
this is exactly what the monadic bind operator <code>(>>=)</code> does. It<br />
should not particularly come as a surprise that <code>ArrowApply</code> and<br />
<code>Monad</code> are exactly equivalent in expressive power. In particular,<br />
<code>Kleisli m</code> can be made an instance of <code>ArrowApply</code>, and any instance<br />
of <code>ArrowApply</code> can be made a <code>Monad</code> (via the <code>newtype</code> wrapper<br />
<code>ArrowMonad</code>). As an exercise, the reader may wish to try<br />
implementing these instances:<br />
<br />
<haskell><br />
instance Monad m => ArrowApply (Kleisli m) where<br />
app = -- exercise<br />
<br />
newtype ArrowMonad a b = ArrowMonad (a () b)<br />
<br />
instance ArrowApply a => Monad (ArrowMonad a) where<br />
return = -- exercise<br />
(ArrowMonad a) >>= k = -- exercise<br />
</haskell><br />
<br />
==ArrowLoop==<br />
<br />
The <code>ArrowLoop</code> type class is:<br />
<br />
<haskell><br />
class Arrow a => ArrowLoop a where<br />
loop :: a (b, d) (c, d) -> a b c<br />
<br />
trace :: ((b,d) -> (c,d)) -> b -> c<br />
trace f b = let (c,d) = f (b,d) in c<br />
</haskell><br />
<br />
It describes arrows that can use recursion to compute results, and is<br />
used to desugar the <code>rec</code> construct in arrow notation (described<br />
below).<br />
<br />
Taken by itself, the type of the <code>loop</code> method does not seem to tell<br />
us much. Its intention, however, is a generalization of the <code>trace</code><br />
function which is also shown. The <code>d</code> component of the first arrow’s<br />
output is fed back in as its own input. In other words, the arrow<br />
<code>loop g</code> is obtained by recursively “fixing” the second component of<br />
the input to <code>g</code>.<br />
<br />
It can be a bit difficult to grok what the <code>trace</code> function is doing.<br />
How can <code>d</code> appear on the left and right sides of the <code>let</code>? Well,<br />
this is Haskell’s laziness at work. There is not space here for a<br />
full explanation; the interested reader is encouraged to study the<br />
standard <code>fix</code> function, and to read [http://www.soi.city.ac.uk/~ross/papers/fop.html Paterson’s arrow tutorial].<br />
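<br />
A tiny example may help (an illustrative sketch, repeating the definition of <code>trace</code> so it stands alone): feeding the output back as its own input component ties a recursive knot, which only works because of laziness.<br />
<br />
<haskell><br />
trace :: ((b, d) -> (c, d)) -> b -> c<br />
trace f b = let (c, d) = f (b, d) in c<br />
<br />
-- Here d = x : d, i.e. an infinite stream of x's:<br />
repeatViaTrace :: a -> [a]<br />
repeatViaTrace = trace (\(x, d) -> (d, x : d))<br />
<br />
-- take 3 (repeatViaTrace 'a') == "aaa"<br />
</haskell><br />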
<br />
==Arrow notation==<br />
<br />
Programming directly with the arrow combinators can be painful,<br />
especially when writing complex computations which need to retain<br />
simultaneous reference to a number of intermediate results. With<br />
nothing but the arrow combinators, such intermediate results must be<br />
kept in nested tuples, and it is up to the programmer to remember<br />
which intermediate results are in which components, and to swap,<br />
reassociate, and generally mangle tuples as necessary. This problem<br />
is solved by the special arrow notation supported by GHC, similar to<br />
<code>do</code> notation for monads, that allows names to be assigned to<br />
intermediate results while building up arrow computations. An example<br />
arrow implemented using arrow notation, taken from<br />
Paterson, is:<br />
<br />
<haskell><br />
class ArrowLoop arr => ArrowCircuit arr where<br />
delay :: b -> (b `arr` b)<br />
<br />
counter :: ArrowCircuit arr => Bool `arr` Int<br />
counter = proc reset -> do<br />
    rec output <- returnA -< if reset then 0 else next<br />
        next <- delay 0 -< output + 1<br />
    returnA -< output<br />
</haskell><br />
<br />
This arrow is intended to<br />
represent a recursively defined counter circuit with a reset line.<br />
<br />
There is not space here for a full explanation of arrow notation; the<br />
interested reader should consult<br />
[http://www.soi.city.ac.uk/~ross/papers/notation.html Paterson’s paper introducing the notation], or his later [http://www.soi.city.ac.uk/~ross/papers/fop.html tutorial which presents a simplified version].<br />
<br />
==Further reading==<br />
<br />
An excellent starting place for the student of arrows is the [http://www.haskell.org/arrows/ arrows web page], which contains an<br />
introduction and many references. Some key papers on arrows include<br />
Hughes’s original paper introducing arrows, [http://dx.doi.org/10.1016/S0167-6423(99)00023-4 Generalising monads to arrows], and [http://www.soi.city.ac.uk/~ross/papers/notation.html Paterson’s paper on arrow notation].<br />
<br />
Both Hughes and Paterson later wrote accessible tutorials intended for a broader<br />
audience: [http://www.soi.city.ac.uk/~ross/papers/fop.html Paterson: Programming with Arrows] and [http://www.cse.chalmers.se/~rjmh/afp-arrows.pdf Hughes: Programming with Arrows].<br />
<br />
Although Hughes’s goal in defining the <code>Arrow</code> class was to<br />
generalize <code>Monad</code>s, and it has been said that <code>Arrow</code> lies “between<br />
<code>Applicative</code> and <code>Monad</code>” in power, they are not directly<br />
comparable. The precise relationship remained in some confusion until<br />
[http://homepages.inf.ed.ac.uk/wadler/papers/arrows-and-idioms/arrows-and-idioms.pdf analyzed by Lindley, Wadler, and Yallop], who<br />
also invented a new calculus of arrows, based on the lambda calculus,<br />
which considerably simplifies the presentation of the arrow laws<br />
(see [http://homepages.inf.ed.ac.uk/wadler/papers/arrows/arrows.pdf The arrow calculus]). There is also a precise technical sense in which [http://just-bottom.blogspot.de/2010/04/programming-with-effects-story-so-far.html <code>Arrow</code> can be seen as the intersection of <code>Applicative</code> and <code>Category</code>].<br />
<br />
Some examples of <code>Arrow</code>s include [http://www.haskell.org/yampa/ Yampa], the<br />
[http://www.fh-wedel.de/~si/HXmlToolbox/ Haskell XML Toolkit], and the functional GUI library [[Grapefruit]].<br />
<br />
Some extensions to arrows have been explored; for example, the<br />
<code>BiArrow</code>s of Alimarine et al. ([http://wiki.clean.cs.ru.nl/download/papers/2005/alia2005-biarrowsHaskellWorkshop.pdf There and Back Again: Arrows for Invertible Programming]), for two-way instead of one-way<br />
computation.<br />
<br />
The Haskell wiki has [[Research papers/Monads and Arrows|links to many additional research papers relating to <code>Arrow</code>s]].<br />
<br />
=Comonad=<br />
<br />
The final type class we will examine is <code>Comonad</code>. The <code>Comonad</code> class<br />
is the categorical dual of <code>Monad</code>; that is, <code>Comonad</code> is like <code>Monad</code><br />
but with all the function arrows flipped. It is not actually in the<br />
standard Haskell libraries, but it has seen some interesting uses<br />
recently, so we include it here for completeness.<br />
<br />
==Definition==<br />
<br />
The <code>Comonad</code> type class, defined in the <code>Control.Comonad</code> module of<br />
the [http://hackage.haskell.org/package/comonad comonad library], is:<br />
<br />
<haskell><br />
class Functor w => Comonad w where<br />
extract :: w a -> a<br />
<br />
duplicate :: w a -> w (w a)<br />
duplicate = extend id<br />
<br />
extend :: (w a -> b) -> w a -> w b<br />
extend f = fmap f . duplicate<br />
</haskell><br />
<br />
As you can see, <code>extract</code> is the dual of <code>return</code>, <code>duplicate</code> is the dual of <code>join</code>, and <code>extend</code> is the dual of <code>(=<<)</code>. The definition of <code>Comonad</code> is a bit redundant: the programmer may implement either <code>extend</code> or <code>duplicate</code>, and the other then has a default implementation.<br />
<br />
A prototypical example of a <code>Comonad</code> instance is:<br />
<br />
<haskell><br />
-- Infinite lazy streams<br />
data Stream a = Cons a (Stream a)<br />
<br />
-- 'duplicate' is like the list function 'tails'<br />
-- 'extend' computes a new Stream from an old, where the element<br />
-- at position n is computed as a function of everything from<br />
-- position n onwards in the old Stream<br />
instance Comonad Stream where<br />
extract (Cons x _) = x<br />
duplicate s@(Cons x xs) = Cons s (duplicate xs)<br />
extend g s@(Cons x xs) = Cons (g s) (extend g xs)<br />
-- = fmap g (duplicate s)<br />
</haskell><br />
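<br />
As a small usage sketch, building on the <code>Stream</code> definitions above (the helper names are invented here):<br />
<br />
<haskell><br />
-- The natural numbers as a Stream: 0, 1, 2, ...<br />
nats :: Stream Integer<br />
nats = go 0 where go n = Cons n (go (n + 1))<br />
<br />
takeS :: Int -> Stream a -> [a]<br />
takeS n (Cons x xs)<br />
  | n <= 0    = []<br />
  | otherwise = x : takeS (n - 1) xs<br />
<br />
-- Each output element depends on the whole rest of the stream:<br />
sumNextTwo :: Num a => Stream a -> a<br />
sumNextTwo (Cons x (Cons y _)) = x + y<br />
<br />
-- takeS 4 (extend sumNextTwo nats) == [1, 3, 5, 7]<br />
</haskell><br />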
<br />
==Further reading==<br />
<br />
Dan Piponi explains in a blog post what [http://blog.sigfpe.com/2006/12/evaluating-cellular-automata-is.html cellular automata have to do with comonads]. In another blog post, Conal Elliott has examined [http://conal.net/blog/posts/functional-interactive-behavior/ a comonadic formulation of functional reactive programming]. Sterling Clover’s blog post [http://fmapfixreturn.wordpress.com/2008/07/09/comonads-in-everyday-life/ Comonads in everyday life] explains the relationship between comonads and zippers, and how comonads can be used to design a menu system for a web site.<br />
<br />
Uustalu and Vene have a number of papers exploring ideas related to comonads and functional programming:<br />
* [http://dx.doi.org/10.1016/j.entcs.2008.05.029 Comonadic Notions of Computation]<br />
* [http://www.ioc.ee/~tarmo/papers/sfp01-book.pdf The dual of substitution is redecoration] (Also available as [http://www.cs.ut.ee/~varmo/papers/sfp01-book.ps.gz ps.gz].)<br />
* [http://dx.doi.org/10.1016/j.ic.2005.08.005 Recursive coalgebras from comonads]<br />
* [http://www.fing.edu.uy/~pardo/papers/njc01.ps.gz Recursion schemes from comonads]<br />
* [http://cs.ioc.ee/~tarmo/papers/essence.pdf The Essence of Dataflow Programming].<br />
<br />
Gabriel Gonzalez's [http://www.haskellforall.com/2013/02/you-could-have-invented-comonads.html Comonads are objects] points out similarities between comonads and object-oriented programming.<br />
<br />
The [http://hackage.haskell.org/package/comonad-transformers comonad-transformers] package contains comonad transformers.<br />
<br />
=Acknowledgements=<br />
<br />
A special thanks to all of those who taught me about standard Haskell<br />
type classes and helped me develop good intuition for them,<br />
particularly Jules Bean (quicksilver), Derek Elkins (ddarius), Conal<br />
Elliott (conal), Cale Gibbard (Cale), David House, Dan Piponi<br />
(sigfpe), and Kevin Reid (kpreid).<br />
<br />
I also thank the many people who provided a mountain of helpful<br />
feedback and suggestions on a first draft of the Typeclassopedia: David Amos,<br />
Kevin Ballard, Reid Barton, Doug Beardsley, Joachim Breitner, Andrew<br />
Cave, David Christiansen, Gregory Collins, Mark Jason Dominus, Conal<br />
Elliott, Yitz Gale, George Giorgidze, Steven Grady, Travis Hartwell,<br />
Steve Hicks, Philip Hölzenspies, Edward Kmett, Eric Kow, Serge Le<br />
Huitouze, Felipe Lessa, Stefan Ljungstrand, Eric Macaulay, Rob MacAulay, Simon Meier,<br />
Eric Mertens, Tim Newsham, Russell O’Connor, Conrad Parker, Walt<br />
Rorie-Baety, Colin Ross, Tom Schrijvers, Aditya Siram, C. Smith,<br />
Martijn van Steenbergen, Joe Thornber, Jared Updike, Rob Vollmert,<br />
Andrew Wagner, Louis Wasserman, and Ashley Yakeley, as well as a few<br />
only known to me by their IRC nicks: b_jonas, maltem, tehgeekmeister,<br />
and ziman. I have undoubtedly omitted a few inadvertently, which in<br />
no way diminishes my gratitude.<br />
<br />
Finally, I would like to thank Wouter Swierstra for his fantastic work<br />
editing the Monad.Reader, and my wife Joyia for her patience during<br />
the process of writing the Typeclassopedia.<br />
<br />
=About the author=<br />
<br />
Brent Yorgey ([http://byorgey.wordpress.com/ blog], [http://www.cis.upenn.edu/~byorgey/ homepage]) is (as of November 2011) a fourth-year Ph.D. student in the [http://www.cis.upenn.edu/~plclub/ programming languages group] at the [http://www.upenn.edu University of Pennsylvania]. He enjoys teaching, creating EDSLs, playing Bach fugues, musing upon category theory, and cooking tasty lambda-treats for the denizens of #haskell.<br />
<br />
=Colophon=<br />
<br />
The Typeclassopedia was written by Brent Yorgey and initially published in March 2009. Painstakingly converted to wiki syntax by [[User:Geheimdienst]] in November 2011, after asking Brent’s permission.<br />
<br />
If something like this TeX to wiki syntax conversion ever needs to be done again, here are some vim commands that helped:<br />
<br />
* <nowiki>%s/\\section{\([^}]*\)}/=\1=/gc</nowiki><br />
* <nowiki>%s/\\subsection{\([^}]*\)}/==\1==/gc</nowiki><br />
* <nowiki>%s/^ *\\item /\r* /gc</nowiki><br />
* <nowiki>%s/---/—/gc</nowiki><br />
* <nowiki>%s/\$\([^$]*\)\$/<math>\1\\ <\/math>/gc</nowiki> ''Appending “\ ” forces images to be rendered. Otherwise, Mediawiki would go back and forth between one font for short <nowiki><math></nowiki> tags, and another more TeX-like font for longer tags (containing more than a few characters)''<br />
* <nowiki>%s/|\([^|]*\)|/<code>\1<\/code>/gc</nowiki><br />
* <nowiki>%s/\\dots/.../gc</nowiki><br />
* <nowiki>%s/^\\label{.*$//gc</nowiki><br />
* <nowiki>%s/\\emph{\([^}]*\)}/''\1''/gc</nowiki><br />
* <nowiki>%s/\\term{\([^}]*\)}/''\1''/gc</nowiki><br />
<br />
The biggest issue was taking the academic-paper-style citations and turning them into hyperlinks with an appropriate title and an appropriate target. In most cases there was an obvious thing to do (e.g. online PDFs of the cited papers or CiteSeer entries). Sometimes, however, it’s less clear and you might want to check the<br />
[[Media:Typeclassopedia.pdf|original Typeclassopedia PDF]]<br />
with the<br />
[http://code.haskell.org/~byorgey/TMR/Issue13/typeclassopedia.bib original bibliography file].<br />
<br />
To get all the citations into the main text, I first tried processing the source with TeX or LyX. This didn’t work due to missing unfindable packages, syntax errors, and my general ineptitude with TeX.<br />
<br />
I then went for the next best solution, which seemed to be extracting all instances of “\cite{something}” from the source and ''in that order'' pulling the referenced entries from the .bib file. This way you can go through the source file and sorted-references file in parallel, copying over what you need, without searching back and forth in the .bib file. I used:<br />
<br />
* <nowiki>egrep -o "\cite\{[^\}]*\}" ~/typeclassopedia.lhs | cut -c 6- | tr "," "\n" | tr -d "}" > /tmp/citations</nowiki><br />
* <nowiki>for i in $(cat /tmp/citations); do grep -A99 "$i" ~/typeclassopedia.bib|egrep -B99 '^\}$' -m1 ; done > ~/typeclasso-refs-sorted</nowiki><br />
<br />
[[Category:Applicative Functor]]<br />
[[Category:Arrow]]<br />
[[Category:Functor]]<br />
[[Category:Monad]]<br />
[[Category:Standard classes]]<br />
[[Category:Standard libraries]]<br />
[[Category:Standard packages]]<br />
[[Category:Standard types]]</div>
<hr />
<div>''By [[User:Byorgey|Brent Yorgey]], byorgey@cis.upenn.edu''<br />
<br />
''Originally published 12 March 2009 in [http://www.haskell.org/wikiupload/8/85/TMR-Issue13.pdf issue 13] of [http://themonadreader.wordpress.com/ the Monad.Reader]. Ported to the Haskell wiki in November 2011 by [[User:Geheimdienst|Geheimdienst]].''<br />
<br />
''This is now the official version of the Typeclassopedia and supersedes the version published in the Monad.Reader. Please help update and extend it by editing it yourself or by leaving comments, suggestions, and questions on the [[Talk:Typeclassopedia|talk page]].''<br />
<br />
=Abstract=<br />
<br />
The standard Haskell libraries feature a number of type classes with algebraic or category-theoretic underpinnings. Becoming a fluent Haskell hacker requires intimate familiarity with them all, yet acquiring this familiarity often involves combing through a mountain of tutorials, blog posts, mailing list archives, and IRC logs.<br />
<br />
The goal of this document is to serve as a starting point for the student of Haskell wishing to gain a firm grasp of its standard type classes. The essentials of each type class are introduced, with examples, commentary, and extensive references for further reading.<br />
<br />
=Introduction=<br />
<br />
Have you ever had any of the following thoughts?<br />
* What the heck is a monoid, and how is it different from a mon<u>a</u>d?<br />
<br />
* I finally figured out how to use [[Parsec]] with do-notation, and someone told me I should use something called <code>Applicative</code> instead. Um, what?<br />
<br />
* Someone in the [[IRC channel|#haskell]] IRC channel used <code>(***)</code>, and when I asked Lambdabot to tell me its type, it printed out scary gobbledygook that didn’t even fit on one line! Then someone used <code>fmap fmap fmap</code> and my brain exploded.<br />
<br />
* When I asked how to do something I thought was really complicated, people started typing things like <code>zip.ap fmap.(id &&& wtf)</code> and the scary thing is that they worked! Anyway, I think those people must actually be robots because there’s no way anyone could come up with that in two seconds off the top of their head.<br />
<br />
If you have, look no further! You, too, can write and understand concise, elegant, idiomatic Haskell code with the best of them.<br />
<br />
There are two keys to an expert Haskell hacker’s wisdom:<br />
# Understand the types.<br />
# Gain a deep intuition for each type class and its relationship to other type classes, backed up by familiarity with many examples.<br />
<br />
It’s impossible to overstate the importance of the first; the patient student of type signatures will uncover many profound secrets. Conversely, anyone ignorant of the types in their code is doomed to eternal uncertainty. “Hmm, it doesn’t compile ... maybe I’ll stick in an<br />
<code>fmap</code> here ... nope, let’s see ... maybe I need another <code>(.)</code> somewhere? ... um ...”<br />
<br />
The second key—gaining deep intuition, backed by examples—is also important, but much more difficult to attain. A primary goal of this document is to set you on the road to gaining such intuition. However—<br />
<br />
:''There is no royal road to Haskell. {{h:title|Well, he probably would have said it if he knew Haskell.|—Euclid}}''<br />
<br />
This document can only be a starting point, since good intuition comes from hard work, [http://byorgey.wordpress.com/2009/01/12/abstraction-intuition-and-the-monad-tutorial-fallacy/ not from learning the right metaphor]. Anyone who reads and understands all of it will still have an arduous journey ahead—but sometimes a good starting point makes a big difference.<br />
<br />
It should be noted that this is not a Haskell tutorial; it is assumed that the reader is already familiar with the basics of Haskell, including the standard <code>[http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html Prelude]</code>, the type system, data types, and type classes.<br />
<br />
The type classes we will be discussing and their interrelationships:<br />
<br />
[[Image:Typeclassopedia-diagram.png]]<br />
<br />
{{note|<code>Semigroup</code> can be found in the [http://hackage.haskell.org/package/semigroups <code>semigroups</code> package], <code>Apply</code> in the [http://hackage.haskell.org/package/semigroupoids <code>semigroupoids</code> package], and <code>Comonad</code> in the [http://hackage.haskell.org/package/comonad <code>comonad</code> package].}}<br />
<br />
* <span style="border-bottom: 2px solid black">Solid arrows</span> point from the general to the specific; that is, if there is an arrow from <code>Foo</code> to <code>Bar</code> it means that every <code>Bar</code> is (or should be, or can be made into) a <code>Foo</code>.<br />
* <span style="border-bottom: 2px dotted black">Dotted arrows</span> indicate some other sort of relationship.<br />
* <code>Monad</code> and <code>ArrowApply</code> are equivalent.<br />
* <code>Semigroup</code>, <code>Apply</code> and <code>Comonad</code> are greyed out since they are not actually (yet?) in the standard Haskell libraries {{noteref}}.<br />
<br />
One more note before we begin. The original spelling of “type class” is with two words, as evidenced by, for example, the [http://www.haskell.org/onlinereport/haskell2010/ Haskell 2010 Language Report], early papers on type classes like [http://citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.103.5639 Type classes in Haskell] and [http://research.microsoft.com/en-us/um/people/simonpj/papers/type-class-design-space/ Type classes: exploring the design space], and [http://citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.168.4008 Hudak et al.’s history of Haskell]. However, as often happens with two-word phrases that see a lot of use, it has started to show up as one word (“typeclass”) or, rarely, hyphenated (“type-class”). When wearing my prescriptivist hat, I prefer “type class”, but realize (after changing into my descriptivist hat) that there's probably not much I can do about it.<br />
<br />
We now begin with the simplest type class of all: <code>Functor</code>.<br />
<br />
=Functor=<br />
<br />
The <code>Functor</code> class ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html#t:Functor haddock]) is the most basic and ubiquitous type class in the Haskell libraries. A simple intuition is that a <code>Functor</code> represents a “container” of some sort, along with the ability to apply a function uniformly to every element in the container. For example, a list is a container of elements, and we can apply a function to every element of a list, using <code>map</code>. As another example, a binary tree is also a container of elements, and it’s not hard to come up with a way to recursively apply a function to every element in a tree.<br />
<br />
Another intuition is that a <code>Functor</code> represents some sort of “computational context”. This intuition is generally more useful, but is more difficult to explain, precisely because it is so general. Some examples later should help to clarify the <code>Functor</code>-as-context point of view.<br />
<br />
In the end, however, a <code>Functor</code> is simply what it is defined to be; doubtless there are many examples of <code>Functor</code> instances that don’t exactly fit either of the above intuitions. The wise student will focus their attention on definitions and examples, without leaning too heavily on any particular metaphor. Intuition will come, in time, on its own.<br />
<br />
==Definition==<br />
<br />
Here is the type class declaration for <code>Functor</code>:<br />
<br />
<haskell><br />
class Functor f where<br />
fmap :: (a -> b) -> f a -> f b<br />
</haskell><br />
<br />
<code>Functor</code> is exported by the <code>Prelude</code>, so no special imports are needed to use it.<br />
<br />
First, the <code>f a</code> and <code>f b</code> in the type signature for <code>fmap</code> tell us that <code>f</code> isn’t just a type; it is a ''type constructor'' which takes another type as a parameter. (A more precise way to say this is that the ''kind'' of <code>f</code> must be <code>* -> *</code>.) For example, <code>Maybe</code> is such a type constructor: <code>Maybe</code> is not a type in and of itself, but requires another type as a parameter, like <code>Maybe Integer</code>. So it would not make sense to say <code>instance Functor Integer</code>, but it could make sense to say <code>instance Functor Maybe</code>.<br />
<br />
Now look at the type of <code>fmap</code>: it takes any function from <code>a</code> to <code>b</code>, and a value of type <code>f a</code>, and outputs a value of type <code>f b</code>. From the container point of view, the intention is that <code>fmap</code> applies a function to each element of a container, without altering the structure of the container. From the context point of view, the intention is that <code>fmap</code> applies a function to a value without altering its context. Let’s look at a few specific examples.<br />
<br />
==Instances==<br />
<br />
{{note|Recall that <code>[]</code> has two meanings in Haskell: it can either stand for the empty list, or, as here, it can represent the list type constructor (pronounced “list-of”). In other words, the type <code>[a]</code> (list-of-<code>a</code>) can also be written <code>[] a</code>.}}<br />
<br />
{{note|You might ask why we need a separate <code>map</code> function. Why not just do away with the current list-only <code>map</code> function, and rename <code>fmap</code> to <code>map</code> instead? Well, that’s a good question. The usual argument is that someone just learning Haskell, when using <code>map</code> incorrectly, would much rather see an error about lists than about <code>Functor</code>s.}}<br />
<br />
As noted before, the list constructor <code>[]</code> is a functor {{noteref}}; we can use the standard list function <code>map</code> to apply a function to each element of a list {{noteref}}. The <code>Maybe</code> type constructor is also a functor, representing a container which might hold a single element. The function <code>fmap g</code> has no effect on <code>Nothing</code> (there are no elements to which <code>g</code> can be applied), and simply applies <code>g</code> to the single element inside a <code>Just</code>. Alternatively, under the context interpretation, the list functor represents a context of nondeterministic choice; that is, a list can be thought of as representing a single value which is nondeterministically chosen from among several possibilities (the elements of the list). Likewise, the <code>Maybe</code> functor represents a context with possible failure. These instances are:<br />
<br />
<haskell><br />
instance Functor [] where<br />
fmap _ [] = []<br />
fmap g (x:xs) = g x : fmap g xs<br />
-- or we could just say fmap = map<br />
<br />
instance Functor Maybe where<br />
fmap _ Nothing = Nothing<br />
fmap g (Just a) = Just (g a)<br />
</haskell><br />
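As a quick sanity check (this example is mine, not part of the original text), these instances behave exactly as the container intuition suggests:

```haskell
-- Mapping over a list transforms every element; mapping over Maybe
-- transforms the single element inside a Just, if there is one.
-- The structure itself is left untouched.
doubled :: [Int]
doubled = fmap (*2) [1,2,3]               -- [2,4,6]

shown :: Maybe String
shown = fmap show (Just (3 :: Int))       -- Just "3"

absent :: Maybe String
absent = fmap show (Nothing :: Maybe Int) -- Nothing
```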
<br />
As an aside, in idiomatic Haskell code you will often see the letter <code>f</code> used to stand for both an arbitrary <code>Functor</code> and an arbitrary function. In this document, <code>f</code> represents only <code>Functor</code>s, and <code>g</code> or <code>h</code> always represent functions, but you should be aware of the potential confusion. In practice, what <code>f</code> stands for should always be clear from the context, by noting whether it is part of a type or part of the code.<br />
<br />
There are other <code>Functor</code> instances in the standard libraries; below are a few. Note that some of these instances are not exported by the <code>Prelude</code>; to access them, you can import <code>Control.Monad.Instances</code>.<br />
<br />
* <code>Either e</code> is an instance of <code>Functor</code>; <code>Either e a</code> represents a container which can contain either a value of type <code>a</code>, or a value of type <code>e</code> (often representing some sort of error condition). It is similar to <code>Maybe</code> in that it represents possible failure, but it can carry some extra information about the failure as well.<br />
<br />
* <code>((,) e)</code> represents a container which holds an “annotation” of type <code>e</code> along with the actual value it holds. It might be clearer to write it as <code>(e,)</code>, by analogy with an operator section like <code>(1+)</code>, but that syntax is not allowed in types (although it is allowed in expressions with the <code>TupleSections</code> extension enabled). However, you can certainly ''think'' of it as <code>(e,)</code>.<br />
<br />
* <code>((->) e)</code> (which can be thought of as <code>(e ->)</code>; see above), the type of functions which take a value of type <code>e</code> as a parameter, is a <code>Functor</code>. As a container, <code>(e -> a)</code> represents a (possibly infinite) set of values of <code>a</code>, indexed by values of <code>e</code>. Alternatively, and more usefully, <code>((->) e)</code> can be thought of as a context in which a value of type <code>e</code> is available to be consulted in a read-only fashion. This is also why <code>((->) e)</code> is sometimes referred to as the ''reader monad''; more on this later.<br />
<br />
* <code>IO</code> is a <code>Functor</code>; a value of type <code>IO a</code> represents a computation producing a value of type <code>a</code> which may have I/O effects. If <code>m</code> computes the value <code>x</code> while producing some I/O effects, then <code>fmap g m</code> will compute the value <code>g x</code> while producing the same I/O effects.<br />
<br />
* Many standard types from the [http://hackage.haskell.org/package/containers/ containers library] (such as <code>Tree</code>, <code>Map</code>, and <code>Sequence</code>) are instances of <code>Functor</code>. A notable exception is <code>Set</code>, which cannot be made a <code>Functor</code> in Haskell (although it is certainly a mathematical functor) since it requires an <code>Ord</code> constraint on its elements; <code>fmap</code> must be applicable to ''any'' types <code>a</code> and <code>b</code>. However, <code>Set</code> (and other similarly restricted data types) can be made an instance of a suitable generalization of <code>Functor</code>, either by [http://article.gmane.org/gmane.comp.lang.haskell.cafe/78052/ making <code>a</code> and <code>b</code> arguments to the <code>Functor</code> type class themselves], or by adding an [http://blog.omega-prime.co.uk/?p=127 associated constraint].<br />
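For instance, here is a small sketch of mine (assuming the containers package is installed) showing that <code>fmap</code> on a <code>Map</code> transforms the values while leaving the keys, and hence the structure, alone:

```haskell
import qualified Data.Map as Map

-- fmap reaches the values of a Map; the keys are part of the
-- "structure" and are not touched.
squared :: Map.Map String Int
squared = fmap (^2) (Map.fromList [("a",2),("b",3)])
-- Map.toList squared == [("a",4),("b",9)]
```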
<br />
{{Exercises|<br />
<ol><br />
<li>Implement <code>Functor</code> instances for <code>Either e</code> and <code>((->) e)</code>.</li><br />
<li>Implement <code>Functor</code> instances for <code>((,) e)</code> and for <code>Pair</code>, defined as <br />
<br />
<haskell>data Pair a = Pair a a</haskell><br />
<br />
Explain their similarities and differences.<br />
</li><br />
<li>Implement a <code>Functor</code> instance for the type <code>ITree</code>, defined as<br />
<br />
<haskell><br />
data ITree a = Leaf (Int -> a) <br />
| Node [ITree a]<br />
</haskell><br />
</li><br />
<li>Give an example of a type of kind <code>* -> *</code> which cannot be made an instance of <code>Functor</code> (without using <code>undefined</code>).<br />
</li><br />
<li>Is this statement true or false? <br />
<br />
:''The composition of two <code>Functor</code>s is also a <code>Functor</code>.''<br />
<br />
If false, give a counterexample; if true, prove it by exhibiting some appropriate Haskell code.<br />
</li><br />
</ol><br />
}}<br />
<br />
==Laws==<br />
<br />
As far as the Haskell language itself is concerned, the only requirement to be a <code>Functor</code> is an implementation of <code>fmap</code> with the proper type. Any sensible <code>Functor</code> instance, however, will also satisfy the ''functor laws'', which are part of the definition of a mathematical functor. There are two:<br />
<br />
<haskell><br />
fmap id = id<br />
fmap (g . h) = (fmap g) . (fmap h)<br />
</haskell><br />
<br />
{{note|Technically, these laws make <code>f</code> and <code>fmap</code> together an endofunctor on ''Hask'', the category of Haskell types (ignoring [[Bottom|&perp;]], which is a party pooper). See [http://en.wikibooks.org/wiki/Haskell/Category_theory Wikibook: Category theory].}}<br />
<br />
Together, these laws ensure that <code>fmap g</code> does not change the ''structure'' of a container, only the elements. Equivalently, and more simply, they ensure that <code>fmap g</code> changes a value without altering its context {{noteref}}.<br />
<br />
The first law says that mapping the identity function over every item in a container has no effect. The second says that mapping a composition of two functions over every item in a container is the same as first mapping one function, and then mapping the other.<br />
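The laws are easy to phrase as executable properties. Here is a minimal QuickCheck sketch of mine (assuming the QuickCheck package is installed; the property names are made up), specialized to the list instance and to one particular pair of functions:

```haskell
import Test.QuickCheck

-- First law: fmap id = id
prop_identity :: [Int] -> Bool
prop_identity xs = fmap id xs == id xs

-- Second law: fmap (g . h) = fmap g . fmap h,
-- checked here for g = (+1) and h = (*2)
prop_composition :: [Int] -> Bool
prop_composition xs =
  fmap ((+1) . (*2)) xs == (fmap (+1) . fmap (*2)) xs

main :: IO ()
main = quickCheck prop_identity >> quickCheck prop_composition
```

Note that passing such tests does not prove the laws hold; it only fails to find a counterexample.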
<br />
As an example, the following code is a “valid” instance of <code>Functor</code> (it typechecks), but it violates the functor laws. Do you see why?<br />
<br />
<haskell><br />
-- Evil Functor instance<br />
instance Functor [] where<br />
fmap _ [] = []<br />
fmap g (x:xs) = g x : g x : fmap g xs<br />
</haskell><br />
<br />
Any Haskeller worth their salt would reject this code as a gruesome abomination.<br />
<br />
Unlike some other type classes we will encounter, a given type has at most one valid instance of <code>Functor</code>. This [http://article.gmane.org/gmane.comp.lang.haskell.libraries/15384 can be proven] via the [http://homepages.inf.ed.ac.uk/wadler/topics/parametricity.html#free ''free theorem''] for the type of <code>fmap</code>. In fact, [http://byorgey.wordpress.com/2010/03/03/deriving-pleasure-from-ghc-6-12-1/ GHC can automatically derive] <code>Functor</code> instances for many data types.<br />
<br />
{{note|Actually, if <code>seq</code>/<code>undefined</code> are considered, it [http://stackoverflow.com/a/8323243/305559 is possible] to have an implementation which satisfies the first law but not the second. The rest of the comments in this section should be considered in a context where <code>seq</code> and <code>undefined</code> are excluded.}}<br />
<br />
A [https://github.com/quchen/articles/blob/master/second_functor_law.md similar argument also shows] that any <code>Functor</code> instance satisfying the first law (<code>fmap id = id</code>) will automatically satisfy the second law as well. Practically, this means that only the first law needs to be checked (usually by a very straightforward induction) to ensure that a <code>Functor</code> instance is valid.{{noteref}}<br />
<br />
{{Exercises|<br />
# Although it is not possible for a <code>Functor</code> instance to satisfy the first <code>Functor</code> law but not the second (excluding <code>undefined</code>), the reverse is possible. Give an example of a (bogus) <code>Functor</code> instance which satisfies the second law but not the first.<br />
# Which laws are violated by the evil <code>Functor</code> instance for list shown above: both laws, or the first law alone? Give specific counterexamples.<br />
}}<br />
<br />
==Intuition==<br />
<br />
There are two fundamental ways to think about <code>fmap</code>. The first has already been mentioned: it takes two parameters, a function and a container, and applies the function “inside” the container, producing a new container. Alternately, we can think of <code>fmap</code> as applying a function to a value in a context (without altering the context).<br />
<br />
Just like all other Haskell functions of “more than one parameter”, however, <code>fmap</code> is actually ''curried'': it does not really take two parameters, but takes a single parameter and returns a function. For emphasis, we can write <code>fmap</code>’s type with extra parentheses: <code>fmap :: (a -> b) -> (f a -> f b)</code>. Written in this form, it is apparent that <code>fmap</code> transforms a “normal” function (<code>g :: a -> b</code>) into one which operates over containers/contexts (<code>fmap g :: f a -> f b</code>). This transformation is often referred to as a ''lift''; <code>fmap</code> “lifts” a function from the “normal world” into the “<code>f</code> world”.<br />
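For example (mine, not from the original text), partially applying <code>fmap</code> to <code>length</code> lifts it from plain strings into any <code>Functor</code>:

```haskell
-- length on a String produces an Int; fmap length "lifts" it to
-- work on strings inside a context, without touching the context.
lenMaybe :: Maybe String -> Maybe Int
lenMaybe = fmap length

lenAll :: [String] -> [Int]
lenAll = fmap length

-- lenMaybe (Just "hello") == Just 5
-- lenAll ["a","bc"]       == [1,2]
```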
<br />
==Further reading==<br />
<br />
A good starting point for reading about the category theory behind the concept of a functor is the excellent [http://en.wikibooks.org/wiki/Haskell/Category_theory Haskell wikibook page on category theory].<br />
<br />
=Applicative=<br />
<br />
A somewhat newer addition to the pantheon of standard Haskell type classes, ''applicative functors'' represent an abstraction lying in between <code>Functor</code> and <code>Monad</code> in expressivity, first described by McBride and Paterson. The title of their classic paper, [http://www.soi.city.ac.uk/~ross/papers/Applicative.html Applicative Programming with Effects], gives a hint at the intended intuition behind the [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Applicative.html <code>Applicative</code>] type class. It encapsulates certain sorts of “effectful” computations in a functionally pure way, and encourages an “applicative” programming style. Exactly what these things mean will be seen later.<br />
<br />
==Definition==<br />
<br />
Recall that <code>Functor</code> allows us to lift a “normal” function to a function on computational contexts. But <code>fmap</code> doesn’t allow us to apply a function which is itself in a context to a value in a context. <code>Applicative</code> gives us just such a tool, <code>(<*>)</code>. It also provides a method, <code>pure</code>, for embedding values in a default, “effect free” context. Here is the type class declaration for <code>Applicative</code>, as defined in <code>Control.Applicative</code>:<br />
<br />
<haskell><br />
class Functor f => Applicative f where<br />
pure :: a -> f a<br />
(<*>) :: f (a -> b) -> f a -> f b<br />
</haskell><br />
<br />
Note that every <code>Applicative</code> must also be a <code>Functor</code>. In fact, as we will see, <code>fmap</code> can be implemented using the <code>Applicative</code> methods, so every <code>Applicative</code> is a functor whether we like it or not; the <code>Functor</code> constraint forces us to be honest.<br />
<br />
{{note|Recall that <code>($)</code> is just function application: <code>f $ x {{=}} f x</code>.}}<br />
<br />
As always, it’s crucial to understand the type signatures. First, consider <code>(<*>)</code>: the best way of thinking about it comes from noting that the type of <code>(<*>)</code> is similar to the type of <code>($)</code> {{noteref}}, but with everything enclosed in an <code>f</code>. In other words, <code>(<*>)</code> is just function application within a computational context. The type of <code>(<*>)</code> is also very similar to the type of <code>fmap</code>; the only difference is that the first parameter is <code>f (a -> b)</code>, a function in a context, instead of a “normal” function <code>(a -> b)</code>.<br />
<br />
<code>pure</code> takes a value of any type <code>a</code>, and returns a context/container of type <code>f a</code>. The intention is that <code>pure</code> creates some sort of “default” container or “effect free” context. In fact, the behavior of <code>pure</code> is quite constrained by the laws it should satisfy in conjunction with <code>(<*>)</code>. Usually, for a given implementation of <code>(<*>)</code> there is only one possible implementation of <code>pure</code>.<br />
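To make the analogy with <code>($)</code> concrete, here is a small example of mine using the standard list instance (shown in full in the Instances section):

```haskell
import Control.Applicative

plain :: Int
plain = (+1) $ 2                    -- ordinary application: 3

inContext :: [Int]
inContext = [(+1),(*10)] <*> [2,3]  -- application within the list context

embedded :: [Int]
embedded = pure 5                   -- an "effect free" list: [5]
```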
<br />
(Note that previous versions of the Typeclassopedia explained <code>pure</code> in terms of a type class <code>Pointed</code>, which can still be found in the [http://hackage.haskell.org/package/pointed <code>pointed</code> package]. However, the current consensus is that <code>Pointed</code> is not very useful after all. For a more detailed explanation, see [[Why not Pointed?]])<br />
<br />
==Laws==<br />
<br />
{{note|See<br />
[http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Applicative.html haddock for Applicative] and [http://www.soi.city.ac.uk/~ross/papers/Applicative.html Applicative programming with effects]}}<br />
<br />
Traditionally, there are four laws that <code>Applicative</code> instances should satisfy {{noteref}}. In some sense, they are all concerned with making sure that <code>pure</code> deserves its name:<br />
<br />
* The identity law:<br /><haskell>pure id <*> v = v</haskell><br />
* Homomorphism:<br /><haskell>pure f <*> pure x = pure (f x)</haskell>Intuitively, applying a non-effectful function to a non-effectful argument in an effectful context is the same as just applying the function to the argument and then injecting the result into the context with <code>pure</code>.<br />
* Interchange:<br /><haskell>u <*> pure y = pure ($ y) <*> u</haskell>Intuitively, this says that when evaluating the application of an effectful function to a pure argument, the order in which we evaluate the function and its argument doesn't matter.<br />
* Composition:<br /><haskell>u <*> (v <*> w) = pure (.) <*> u <*> v <*> w </haskell>This one is the trickiest law to gain intuition for. In some sense it is expressing a sort of associativity property of <code>(<*>)</code>. The reader may wish to simply convince themselves that this law is type-correct.<br />
<br />
Considered as left-to-right rewrite rules, the homomorphism, interchange, and composition laws actually constitute an algorithm for transforming any expression using <code>pure</code> and <code>(<*>)</code> into a canonical form with only a single use of <code>pure</code> at the very beginning and only left-nested occurrences of <code>(<*>)</code>. Composition allows reassociating <code>(<*>)</code>; interchange allows moving occurrences of <code>pure</code> leftwards; and homomorphism allows collapsing multiple adjacent occurrences of <code>pure</code> into one.<br />
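As an illustration (my own, not from the original text), rewriting <code>pure f <*> (pure g <*> x)</code> into canonical form uses composition once and homomorphism twice; we can also confirm that the two ends of the derivation agree for the list instance:

```haskell
import Control.Applicative

--    pure f <*> (pure g <*> x)
--  = pure (.) <*> pure f <*> pure g <*> x   -- composition
--  = pure (f .) <*> pure g <*> x            -- homomorphism
--  = pure (f . g) <*> x                     -- homomorphism

original, canonical :: [Int]
original  = pure (+1) <*> (pure (*2) <*> [3,4])
canonical = pure ((+1) . (*2)) <*> [3,4]
-- both evaluate to [7,9]
```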
<br />
There is also a law specifying how <code>Applicative</code> should relate to <code>Functor</code>:<br />
<br />
<haskell><br />
fmap g x = pure g <*> x<br />
</haskell><br />
<br />
It says that mapping a pure function <code>g</code> over a context <code>x</code> is the same as first injecting <code>g</code> into a context with <code>pure</code>, and then applying it to <code>x</code> with <code>(<*>)</code>. In other words, we can decompose <code>fmap</code> into two more atomic operations: injection into a context, and application within a context. The <code>Control.Applicative</code> module also defines <code>(<$>)</code> as a synonym for <code>fmap</code>, so the above law can also be expressed as:<br />
<br />
<code>g <$> x = pure g <*> x</code>.<br />
<br />
{{Exercises|<br />
# (Tricky) One might imagine a variant of the interchange law that says something about applying a pure function to an effectful argument. Using the above laws, prove that<haskell>pure f <*> x = pure (flip ($)) <*> x <*> pure f</haskell><br />
}}<br />
<br />
==Instances==<br />
<br />
Most of the standard types which are instances of <code>Functor</code> are also instances of <code>Applicative</code>.<br />
<br />
<code>Maybe</code> can easily be made an instance of <code>Applicative</code>; writing such an instance is left as an exercise for the reader.<br />
<br />
The list type constructor <code>[]</code> can actually be made an instance of <code>Applicative</code> in two ways; essentially, it comes down to whether we want to think of lists as ordered collections of elements, or as contexts representing multiple results of a nondeterministic computation (see Wadler’s [http://www.springerlink.com/content/y7450255v2670167/ How to replace failure by a list of successes]).<br />
<br />
Let’s first consider the collection point of view. Since there can only be one instance of a given type class for any particular type, one or both of the list instances of <code>Applicative</code> need to be defined for a <code>newtype</code> wrapper; as it happens, the nondeterministic computation instance is the default, and the collection instance is defined in terms of a <code>newtype</code> called <code>ZipList</code>. This instance is:<br />
<br />
<haskell><br />
newtype ZipList a = ZipList { getZipList :: [a] }<br />
<br />
instance Applicative ZipList where<br />
pure = undefined -- exercise<br />
(ZipList gs) <*> (ZipList xs) = ZipList (zipWith ($) gs xs)<br />
</haskell><br />
<br />
To apply a list of functions to a list of inputs with <code>(<*>)</code>, we just match up the functions and inputs elementwise, and produce a list of the resulting outputs. In other words, we “zip” the lists together with function application, <code>($)</code>; hence the name <code>ZipList</code>. <br />
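A quick illustration of mine, using only <code>(<*>)</code> so as not to give away the <code>pure</code> exercise below:

```haskell
import Control.Applicative (ZipList(..))

-- Each function is matched with exactly one argument, positionally.
zipped :: [Int]
zipped = getZipList (ZipList [(+1),(*2)] <*> ZipList [10,20])
-- zipped == [11,40]
```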
<br />
The other <code>Applicative</code> instance for lists, based on the nondeterministic computation point of view, is:<br />
<br />
<haskell><br />
instance Applicative [] where<br />
pure x = [x]<br />
gs <*> xs = [ g x | g <- gs, x <- xs ]<br />
</haskell><br />
<br />
Instead of applying functions to inputs pairwise, we apply each function to all the inputs in turn, and collect all the results in a list.<br />
<br />
Now we can write nondeterministic computations in a natural style. To add the numbers <code>3</code> and <code>4</code> deterministically, we can of course write <code>(+) 3 4</code>. But suppose instead of <code>3</code> we have a nondeterministic computation that might result in <code>2</code>, <code>3</code>, or <code>4</code>; then we can write<br />
<br />
<haskell><br />
pure (+) <*> [2,3,4] <*> pure 4<br />
</haskell><br />
<br />
or, more idiomatically,<br />
<br />
<haskell><br />
(+) <$> [2,3,4] <*> pure 4<br />
</haskell><br />
<br />
There are several other <code>Applicative</code> instances as well:<br />
<br />
* <code>IO</code> is an instance of <code>Applicative</code>, and behaves exactly as you would think: to execute <code>m1 <*> m2</code>, first <code>m1</code> is executed, resulting in a function <code>f</code>, then <code>m2</code> is executed, resulting in a value <code>x</code>, and finally the value <code>f x</code> is returned as the result of executing <code>m1 <*> m2</code>.<br />
<br />
* <code>((,) a)</code> is an <code>Applicative</code>, as long as <code>a</code> is an instance of <code>Monoid</code> ([[#Monoid|section Monoid]]). The <code>a</code> values are accumulated in parallel with the computation.<br />
<br />
* The <code>Control.Applicative</code> module defines the <code>Const</code> type constructor; a value of type <code>Const a b</code> simply contains an <code>a</code>. This is an instance of <code>Applicative</code> for any <code>Monoid a</code>; this instance becomes especially useful in conjunction with things like <code>Foldable</code> ([[#Foldable|section Foldable]]).<br />
<br />
* The <code>WrappedMonad</code> and <code>WrappedArrow</code> newtypes make any instances of <code>Monad</code> ([[#Monad|section Monad]]) or <code>Arrow</code> ([[#Arrow|section Arrow]]) respectively into instances of <code>Applicative</code>; as we will see when we study those type classes, both are strictly more expressive than <code>Applicative</code>, in the sense that the <code>Applicative</code> methods can be implemented in terms of their methods.<br />
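For instance, here is a sketch of mine of the <code>((,) a)</code> instance in action, with <code>String</code> as the <code>Monoid</code> (note that this instance only ships with relatively recent versions of base):

```haskell
-- The first components are combined with mappend (here: string
-- concatenation); the second components are applied as usual.
logged :: (String, Int)
logged = ("add one; ", (+1)) <*> ("start at 2", 2)
-- logged == ("add one; start at 2", 3)
```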
<br />
{{Exercises|<br />
# Implement an instance of <code>Applicative</code> for <code>Maybe</code>.<br />
# Determine the correct definition of <code>pure</code> for the <code>ZipList</code> instance of <code>Applicative</code>—there is only one implementation that satisfies the law relating <code>pure</code> and <code>(<*>)</code>.<br />
}}<br />
<br />
==Intuition==<br />
<br />
McBride and Paterson’s paper introduces the notation <math>[[g \; x_1 \; x_2 \; \cdots \; x_n]]\ </math> to denote function application in a computational context. If each <math>x_i\ </math> has type <math>f \; t_i\ </math> for some applicative functor <math>f\ </math>, and <math>g\ </math> has type <math>t_1 \to t_2 \to \dots \to t_n \to t\ </math>, then the entire expression <math>[[g \; x_1 \; \cdots \; x_n]]\ </math> has type <math>f \; t\ </math>. You can think of this as applying a function to multiple “effectful” arguments. In this sense, the double bracket notation is a generalization of <code>fmap</code>, which allows us to apply a function to a single argument in a context.<br />
<br />
Why do we need <code>Applicative</code> to implement this generalization of <code>fmap</code>? Suppose we use <code>fmap</code> to apply <code>g</code> to the first parameter <code>x1</code>. Then we get something of type <code>f (t2 -> ... -> t)</code>, but now we are stuck: we can’t apply this function-in-a-context to the next argument with <code>fmap</code>. However, this is precisely what <code>(<*>)</code> allows us to do.<br />
<br />
This suggests the proper translation of the idealized notation <math>[[g \; x_1 \; x_2 \; \cdots \; x_n]]\ </math> into Haskell, namely<br />
<haskell><br />
g <$> x1 <*> x2 <*> ... <*> xn,<br />
</haskell><br />
<br />
recalling that <code>Control.Applicative</code> defines <code>(<$>)</code> as convenient infix shorthand for <code>fmap</code>. This is what is meant by an “applicative style”—effectful computations can still be described in terms of function application; the only difference is that we have to use the special operator <code>(<*>)</code> for application instead of simple juxtaposition.<br />
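<br />
For instance, with the standard <code>Maybe</code> instance, the applicative style looks like this (a small sketch):<br />
<br />
<haskell><br />
import Control.Applicative  -- redundant with recent Preludes, harmless otherwise<br />
<br />
ex1, ex2 :: Maybe Int<br />
ex1 = (+) <$> Just 3 <*> Just 4   -- Just 7<br />
ex2 = (+) <$> Just 3 <*> Nothing  -- Nothing<br />
</haskell><br />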
<br />
Note that <code>pure</code> allows embedding “non-effectful” arguments in the middle of an idiomatic application, like<br />
<haskell><br />
g <$> x1 <*> pure x2 <*> x3<br />
</haskell><br />
which has type <code>f d</code>, given<br />
<haskell><br />
g :: a -> b -> c -> d<br />
x1 :: f a<br />
x2 :: b<br />
x3 :: f c<br />
</haskell><br />
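<br />
Instantiating the schematic example above with concrete values (a sketch; here <code>g</code> is simply tupling, and <code>Maybe</code> plays the role of <code>f</code>):<br />
<br />
<haskell><br />
g :: Int -> Char -> Bool -> (Int, Char, Bool)<br />
g = (,,)<br />
<br />
ex :: Maybe (Int, Char, Bool)<br />
ex = g <$> Just 1 <*> pure 'x' <*> Just True<br />
-- ex == Just (1, 'x', True)<br />
</haskell><br />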
<br />
The double brackets are commonly known as “idiom brackets”, because they allow writing “idiomatic” function application, that is, function application that looks normal but has some special, non-standard meaning (determined by the particular instance of <code>Applicative</code> being used). Idiom brackets are not supported by GHC, but they are supported by the [http://personal.cis.strath.ac.uk/~conor/pub/she/ Strathclyde Haskell Enhancement], a preprocessor which (among many other things) translates idiom brackets into standard uses of <code>(<$>)</code> and <code>(<*>)</code>. This can result in much more readable code when making heavy use of <code>Applicative</code>.<br />
<br />
==Alternative formulation==<br />
<br />
An alternative, equivalent formulation of <code>Applicative</code> is given by<br />
<br />
<haskell><br />
class Functor f => Monoidal f where<br />
unit :: f ()<br />
(**) :: f a -> f b -> f (a,b)<br />
</haskell><br />
<br />
{{note|In category-theory speak, we say <code>f</code> is a ''lax'' monoidal functor because there aren't necessarily functions in the other direction, like <code>f (a, b) -> (f a, f b)</code>.}}<br />
Intuitively, this states that a <i>monoidal</i> functor{{noteref}} is one which has some sort of "default shape" and which supports some sort of "combining" operation. <code>pure</code> and <code>(<*>)</code> are equivalent in power to <code>unit</code> and <code>(**)</code> (see the Exercises below). More technically, the idea is that <code>f</code> preserves the "monoidal structure" given by the pairing constructor <code>(,)</code> and unit type <code>()</code>. This can be seen even more clearly if we rewrite the types of <code>unit</code> and <code>(**)</code> as<br />
<haskell><br />
unit' :: () -> f ()<br />
(**') :: (f a, f b) -> f (a, b)<br />
</haskell><br />
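<br />
For concreteness, here is what a <code>Monoidal</code> instance for <code>Maybe</code> might look like (a sketch; <code>Monoidal</code> is not a standard class, and <code>(**)</code> here shadows the Prelude's exponentiation operator):<br />
<br />
<haskell><br />
import Prelude hiding ((**))<br />
<br />
class Functor f => Monoidal f where<br />
  unit :: f ()<br />
  (**) :: f a -> f b -> f (a,b)<br />
<br />
instance Monoidal Maybe where<br />
  unit             = Just ()<br />
  Just a ** Just b = Just (a, b)  -- combine when both "shapes" succeed<br />
  _      ** _      = Nothing<br />
</haskell><br />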
<br />
Furthermore, to deserve the name "monoidal" (see the [[#Monoid|section on Monoids]]), instances of <code>Monoidal</code> ought to satisfy the following laws, which seem much more straightforward than the traditional <code>Applicative</code> laws:<br />
<br />
{{note|In this and the following laws, <code>≅</code> refers to <i>isomorphism</i> rather than equality. In particular we consider <code>(x,()) ≅ x ≅ ((),x)</code> and <code>((x,y),z) ≅ (x,(y,z))</code>.}}<br />
* Left identity{{noteref}}: <haskell>unit ** v ≅ v</haskell><br />
* Right identity: <haskell>u ** unit ≅ u</haskell><br />
* Associativity: <haskell>u ** (v ** w) ≅ (u ** v) ** w</haskell><br />
<br />
These turn out to be equivalent to the usual <code>Applicative</code> laws. In a category theory setting, one would also require a naturality law:<br />
<br />
{{note|Here <code>g *** h {{=}} \(x,y) -> (g x, h y)</code>. See [[#Arrow|Arrows]].}}<br />
* Naturality: <haskell>fmap (g *** h) (u ** v) = fmap g u ** fmap h v</haskell><br />
<br />
but in the context of Haskell, this is a free theorem.<br />
<br />
Much of this section was taken from [http://blog.ezyang.com/2012/08/applicative-functors/ a blog post by Edward Z. Yang]; see his actual post for a bit more information.<br />
<br />
{{Exercises|<br />
# Implement <code>pure</code> and <code>(<*>)</code> in terms of <code>unit</code> and <code>(**)</code>, and vice versa.<br />
# Are there any <code>Applicative</code> instances for which there are also functions <code>f () -> ()</code> and <code>f (a,b) -> (f a, f b)</code>, satisfying some "reasonable" laws?<br />
# (Tricky) Prove that given your implementations from the previous exercise, the usual <code>Applicative</code> laws and the <code>Monoidal</code> laws stated above are equivalent.<br />
}}<br />
<br />
==Further reading==<br />
<br />
There are many other useful combinators in the standard libraries implemented in terms of <code>pure</code> and <code>(<*>)</code>: for example, <code>(*>)</code>, <code>(<*)</code>, <code>(<**>)</code>, <code>(<$)</code>, and so on (see [http://www.haskell.org/ghc/docs/latest/html/libraries/base-4.7.0.0/Control-Applicative.html haddock for Applicative]). Judicious use of such secondary combinators can often make code using <code>Applicative</code>s much easier to read.<br />
<br />
[http://www.soi.city.ac.uk/~ross/papers/Applicative.html McBride and Paterson’s original paper] is a treasure-trove of information and examples, as well as some perspectives on the connection between <code>Applicative</code> and category theory. Beginners will find it difficult to make it through the entire paper, but it is extremely well-motivated—even beginners will be able to glean something from reading as far as they are able.<br />
<br />
{{note|Introduced by [http://conal.net/papers/simply-reactive/ an earlier paper] that was since superseded by [http://conal.net/papers/push-pull-frp/ Push-pull functional reactive programming].}}<br />
<br />
Conal Elliott has been one of the biggest proponents of <code>Applicative</code>. For example, the [http://conal.net/papers/functional-images/ Pan library for functional images] and the reactive library for functional reactive programming (FRP) {{noteref}} make key use of it; his blog also contains [http://conal.net/blog/tag/applicative-functor many examples of <code>Applicative</code> in action]. Building on the work of McBride and Paterson, Elliott also built the [[TypeCompose]] library, which embodies the observation (among others) that <code>Applicative</code> types are closed under composition; therefore, <code>Applicative</code> instances can often be automatically derived for complex types built out of simpler ones.<br />
<br />
Although the [http://hackage.haskell.org/package/parsec Parsec parsing library] ([http://legacy.cs.uu.nl/daan/download/papers/parsec-paper.pdf paper]) was originally designed for use as a monad, in its most common use cases an <code>Applicative</code> instance can be used to great effect; [http://www.serpentine.com/blog/2008/02/06/the-basics-of-applicative-functors-put-to-practical-work/ Bryan O’Sullivan’s blog post] is a good starting point. If the extra power provided by <code>Monad</code> isn’t needed, it’s usually a good idea to use <code>Applicative</code> instead.<br />
<br />
A couple other nice examples of <code>Applicative</code> in action include the [http://web.archive.org/web/20090416111947/chrisdone.com/blog/html/2009-02-10-applicative-configfile-hsql.html ConfigFile and HSQL libraries] and the [http://groups.inf.ed.ac.uk/links/formlets/ formlets library].<br />
<br />
Gershom Bazerman's [http://comonad.com/reader/2012/abstracting-with-applicatives/ post] contains many insights into applicatives.<br />
<br />
=Monad=<br />
<br />
It’s a safe bet that if you’re reading this, you’ve heard of monads—although it’s quite possible you’ve never heard of <code>Applicative</code> before, or <code>Arrow</code>, or even <code>Monoid</code>. Why are monads such a big deal in Haskell? There are several reasons.<br />
<br />
* Haskell does, in fact, single out monads for special attention by making them the framework in which to construct I/O operations.<br />
* Haskell also singles out monads for special attention by providing a special syntactic sugar for monadic expressions: the <code>do</code>-notation.<br />
* <code>Monad</code> has been around longer than other abstract models of computation such as <code>Applicative</code> or <code>Arrow</code>.<br />
* The more monad tutorials there are, the harder people think monads must be, and the more new monad tutorials are written by people who think they finally “get” monads (the [http://byorgey.wordpress.com/2009/01/12/abstraction-intuition-and-the-monad-tutorial-fallacy/ monad tutorial fallacy]).<br />
<br />
I will let you judge for yourself whether these are good reasons.<br />
<br />
In the end, despite all the hoopla, <code>Monad</code> is just another type class. Let’s take a look at its definition.<br />
<br />
==Definition==<br />
<br />
The type class declaration for [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html#t:Monad <code>Monad</code>] is:<br />
<br />
<haskell><br />
class Monad m where<br />
return :: a -> m a<br />
(>>=) :: m a -> (a -> m b) -> m b<br />
(>>) :: m a -> m b -> m b<br />
m >> n = m >>= \_ -> n<br />
<br />
fail :: String -> m a<br />
</haskell><br />
<br />
The <code>Monad</code> type class is exported by the <code>Prelude</code>, along with a few standard instances. However, many utility functions are found in [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad.html <code>Control.Monad</code>], and there are also several instances (such as <code>((->) e)</code>) defined in [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad-Instances.html <code>Control.Monad.Instances</code>].<br />
<br />
{{note|However, as of GHC 7.10 this will be fixed!}}<br />
Let’s examine the methods in the <code>Monad</code> class one by one. The type of <code>return</code> should look familiar; it’s the same as <code>pure</code>. Indeed, <code>return</code> ''is'' <code>pure</code>, but with an unfortunate name. (Unfortunate, since someone coming from an imperative programming background might think that <code>return</code> is like the C or Java keyword of the same name, when in fact the similarities are minimal.) From a mathematical point of view, every monad is an applicative functor, but for historical reasons, the <code>Monad</code> type class declaration unfortunately does not require this.{{noteref}}<br />
<br />
We can see that <code>(>>)</code> is a specialized version of <code>(>>=)</code>, with a default implementation given. It is only included in the type class declaration so that specific instances of <code>Monad</code> can override the default implementation of <code>(>>)</code> with a more efficient one, if desired. Also, note that although <code>_ >> n = n</code> would be a type-correct implementation of <code>(>>)</code>, it would not correspond to the intended semantics: the intention is that <code>m >> n</code> ignores the ''result'' of <code>m</code>, but not its ''effects''.<br />
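<br />
A couple of concrete cases make this distinction clear (a sketch using the standard <code>Maybe</code> and list instances):<br />
<br />
<haskell><br />
ex1, ex2 :: Maybe Int<br />
ex1 = Nothing >> Just 3   -- Nothing: the failure "effect" of the<br />
                          -- first computation is not discarded<br />
ex2 = Just 1  >> Just 3   -- Just 3: only the *result* 1 is ignored<br />
<br />
ex3 :: [Int]<br />
ex3 = [1,2,3] >> [10,20]  -- [10,20,10,20,10,20]<br />
</haskell><br />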
<br />
The <code>fail</code> function is an awful hack that has no place in the <code>Monad</code> class; more on this later.<br />
<br />
The only really interesting thing to look at—and what makes <code>Monad</code> strictly more powerful than <code>Applicative</code>—is <code>(>>=)</code>, which is often called ''bind''. An alternative definition of <code>Monad</code> could look like:<br />
<br />
<haskell><br />
class Applicative m => Monad' m where<br />
(>>=) :: m a -> (a -> m b) -> m b<br />
</haskell><br />
<br />
We could spend a while talking about the intuition behind <code>(>>=)</code>—and we will. But first, let’s look at some examples.<br />
<br />
==Instances==<br />
<br />
Even if you don’t understand the intuition behind the <code>Monad</code> class, you can still create instances of it by just seeing where the types lead you. You may be surprised to find that this actually gets you a long way towards understanding the intuition; at the very least, it will give you some concrete examples to play with as you read more about the <code>Monad</code> class in general. The first few examples are from the standard <code>Prelude</code>; the remaining examples are from the [http://hackage.haskell.org/package/transformers <code>transformers</code> package].<br />
<br />
<ul><br />
<li>The simplest possible instance of <code>Monad</code> is [http://hackage.haskell.org/packages/archive/mtl/1.1.0.2/doc/html/Control-Monad-Identity.html <code>Identity</code>], which is described in Dan Piponi’s highly recommended blog post on [http://blog.sigfpe.com/2007/04/trivial-monad.html The Trivial Monad]. Despite being “trivial”, it is a great introduction to the <code>Monad</code> type class, and contains some good exercises to get your brain working.<br />
</li><br />
<li>The next simplest instance of <code>Monad</code> is <code>Maybe</code>. We already know how to write <code>return</code>/<code>pure</code> for <code>Maybe</code>. So how do we write <code>(>>=)</code>? Well, let’s think about its type. Specializing for <code>Maybe</code>, we have<br />
<br />
<haskell><br />
(>>=) :: Maybe a -> (a -> Maybe b) -> Maybe b<br />
</haskell><br />
<br />
If the first argument to <code>(>>=)</code> is <code>Just x</code>, then we have something of type <code>a</code> (namely, <code>x</code>), to which we can apply the second argument—resulting in a <code>Maybe b</code>, which is exactly what we wanted. What if the first argument to <code>(>>=)</code> is <code>Nothing</code>? In that case, we don’t have anything to which we can apply the <code>a -> Maybe b</code> function, so there’s only one thing we can do: yield <code>Nothing</code>. This instance is:<br />
<br />
<haskell><br />
instance Monad Maybe where<br />
return = Just<br />
(Just x) >>= g = g x<br />
Nothing >>= _ = Nothing<br />
</haskell><br />
<br />
We can already get a bit of intuition as to what is going on here: if we build up a computation by chaining together a bunch of functions with <code>(>>=)</code>, as soon as any one of them fails, the entire computation will fail (because <code>Nothing >>= f</code> is <code>Nothing</code>, no matter what <code>f</code> is). The entire computation succeeds only if all the constituent functions individually succeed. So the <code>Maybe</code> monad models computations which may fail.<br />
</li><br />
<br />
<li>The <code>Monad</code> instance for the list constructor <code>[]</code> is similar to its <code>Applicative</code> instance; see the exercise below.<br />
</li><br />
<br />
<li>Of course, the <code>IO</code> constructor is famously a <code>Monad</code>, but its implementation is somewhat magical, and may in fact differ from compiler to compiler. It is worth emphasizing that the <code>IO</code> monad is the ''only'' monad which is magical. It allows us to build up, in an entirely pure way, values representing possibly effectful computations. The special value <code>main</code>, of type <code>IO ()</code>, is taken by the runtime and actually executed, producing actual effects. Every other monad is functionally pure, and requires no special compiler support. We often speak of monadic values as “effectful computations”, but this is because some monads allow us to write code ''as if'' it has side effects, when in fact the monad is hiding the plumbing which allows these apparent side effects to be implemented in a functionally pure way.<br />
</li><br />
<br />
<li>As mentioned earlier, <code>((->) e)</code> is known as the ''reader monad'', since it describes computations in which a value of type <code>e</code> is available as a read-only environment.<br />
<br />
The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-Reader.html <code>Control.Monad.Reader</code>] module provides the <code>Reader e a</code> type, which is just a convenient <code>newtype</code> wrapper around <code>(e -> a)</code>, along with an appropriate <code>Monad</code> instance and some <code>Reader</code>-specific utility functions such as <code>ask</code> (retrieve the environment), <code>asks</code> (retrieve a function of the environment), and <code>local</code> (run a subcomputation under a different environment).<br />
</li><br />
<br />
<li>The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-Writer-Lazy.html <code>Control.Monad.Writer</code>] module provides the <code>Writer</code> monad, which allows information to be collected as a computation progresses. <code>Writer w a</code> is isomorphic to <code>(a,w)</code>, where the output value <code>a</code> is carried along with an annotation or “log” of type <code>w</code>, which must be an instance of <code>Monoid</code> (see [[#Monoid|section Monoid]]); the special function <code>tell</code> performs logging.<br />
</li><br />
<br />
<li>The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-State-Lazy.html <code>Control.Monad.State</code>] module provides the <code>State s a</code> type, a <code>newtype</code> wrapper around <code>s -> (a,s)</code>. Something of type <code>State s a</code> represents a stateful computation which produces an <code>a</code> but can access and modify the state of type <code>s</code> along the way. The module also provides <code>State</code>-specific utility functions such as <code>get</code> (read the current state), <code>gets</code> (read a function of the current state), <code>put</code> (overwrite the state), and <code>modify</code> (apply a function to the state).<br />
</li><br />
<br />
<li>The [http://hackage.haskell.org/packages/archive/mtl/latest/doc/html/Control-Monad-Cont.html <code>Control.Monad.Cont</code>] module provides the <code>Cont</code> monad, which represents computations in continuation-passing style. It can be used to suspend and resume computations, and to implement non-local transfers of control, co-routines, and other complex control structures—all in a functionally pure way. <code>Cont</code> has been called the [http://blog.sigfpe.com/2008/12/mother-of-all-monads.html “mother of all monads”] because of its universal properties.<br />
</li><br />
</ul><br />
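<br />
To make the <code>Maybe</code> instance above concrete, here is a small sketch of chained computations, where the whole chain fails as soon as any one step does (<code>safeDiv</code> is a hypothetical helper, not a standard function):<br />
<br />
<haskell><br />
safeDiv :: Int -> Int -> Maybe Int<br />
safeDiv _ 0 = Nothing<br />
safeDiv x y = Just (x `div` y)<br />
<br />
ok, bad :: Maybe Int<br />
ok  = Just 2 >>= safeDiv 100 >>= safeDiv 150  -- Just 3<br />
bad = Just 0 >>= safeDiv 100 >>= safeDiv 150  -- Nothing: the first<br />
                                              -- division fails, so the<br />
                                              -- second never runs<br />
</haskell><br />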
<br />
{{Exercises|<br />
<ol><br />
<li>Implement a <code>Monad</code> instance for the list constructor, <code>[]</code>. Follow the types!</li><br />
<li>Implement a <code>Monad</code> instance for <code>((->) e)</code>.</li><br />
<li>Implement <code>Functor</code> and <code>Monad</code> instances for <code>Free f</code>, defined as<br />
<haskell><br />
data Free f a = Var a<br />
| Node (f (Free f a))<br />
</haskell><br />
You may assume that <code>f</code> has a <code>Functor</code> instance. This is known as the ''free monad'' built from the functor <code>f</code>.<br />
</li><br />
</ol><br />
}}<br />
<br />
==Intuition==<br />
<br />
Let’s look more closely at the type of <code>(>>=)</code>. The basic intuition is that it combines two computations into one larger computation. The first argument, <code>m a</code>, is the first computation. However, it would be boring if the second argument were just an <code>m b</code>; then there would be no way for the computations to interact with one another (actually, this is exactly the situation with <code>Applicative</code>). So, the second argument to <code>(>>=)</code> has type <code>a -> m b</code>: a function of this type, given a ''result'' of the first computation, can produce a second computation to be run. In other words, <code>x >>= k</code> is a computation which runs <code>x</code>, and then uses the result(s) of <code>x</code> to ''decide'' what computation to run second, using the output of the second computation as the result of the entire computation.<br />
<br />
{{note|Actually, because Haskell allows general recursion, this is a lie: using a Haskell parsing library one can recursively construct ''infinite'' grammars, and hence <code>Applicative</code> (together with <code>Alternative</code>) is enough to parse any context-sensitive language with a finite alphabet. See [http://byorgey.wordpress.com/2012/01/05/parsing-context-sensitive-languages-with-applicative/ Parsing context-sensitive languages with Applicative].}}<br />
Intuitively, it is this ability to use the output from previous computations to decide what computations to run next that makes <code>Monad</code> more powerful than <code>Applicative</code>. The structure of an <code>Applicative</code> computation is fixed, whereas the structure of a <code>Monad</code> computation can change based on intermediate results. This also means that parsers built using an <code>Applicative</code> interface can only parse context-free languages; in order to parse context-sensitive languages a <code>Monad</code> interface is needed.{{noteref}}<br />
<br />
To see the increased power of <code>Monad</code> from a different point of view, let’s see what happens if we try to implement <code>(>>=)</code> in terms of <code>fmap</code>, <code>pure</code>, and <code>(<*>)</code>. We are given a value <code>x</code> of type <code>m a</code>, and a function <code>k</code> of type <code>a -> m b</code>, so the only thing we can do is apply <code>k</code> to <code>x</code>. We can’t apply it directly, of course; we have to use <code>fmap</code> to lift it over the <code>m</code>. But what is the type of <code>fmap k</code>? Well, it’s <code>m a -> m (m b)</code>. So after we apply it to <code>x</code>, we are left with something of type <code>m (m b)</code>—but now we are stuck; what we really want is an <code>m b</code>, but there’s no way to get there from here. We can ''add'' <code>m</code>’s using <code>pure</code>, but we have no way to ''collapse'' multiple <code>m</code>’s into one.<br />
<br />
{{note|1=You might hear some people claim that the definition in terms of <code>return</code>, <code>fmap</code>, and <code>join</code> is the “math definition” and the definition in terms of <code>return</code> and <code>(>>=)</code> is something specific to Haskell. In fact, both definitions were known in the mathematics community long before Haskell picked up monads.}}<br />
<br />
This ability to collapse multiple <code>m</code>’s is exactly the ability provided by the function <code>join :: m (m a) -> m a</code>, and it should come as no surprise that an alternative definition of <code>Monad</code> can be given in terms of <code>join</code>:<br />
<br />
<haskell><br />
class Applicative m => Monad'' m where<br />
join :: m (m a) -> m a<br />
</haskell><br />
<br />
In fact, the canonical definition of monads in category theory is in terms of <code>return</code>, <code>fmap</code>, and <code>join</code> (often called <math>\eta</math>, <math>T</math>, and <math>\mu</math> in the mathematical literature). Haskell uses an alternative formulation with <code>(>>=)</code> instead of <code>join</code> since it is more convenient to use {{noteref}}. However, sometimes it can be easier to think about <code>Monad</code> instances in terms of <code>join</code>, since it is a more “atomic” operation. (For example, <code>join</code> for the list monad is just <code>concat</code>.)<br />
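<br />
A few concrete uses of <code>join</code> (a sketch; <code>join</code> lives in <code>Control.Monad</code>):<br />
<br />
<haskell><br />
import Control.Monad (join)<br />
<br />
ex1, ex2 :: Maybe Int<br />
ex1 = join (Just (Just 3))   -- Just 3<br />
ex2 = join (Just Nothing)    -- Nothing<br />
<br />
ex3 :: [Int]<br />
ex3 = join [[1,2],[],[3,4]]  -- [1,2,3,4]: join for lists is concat<br />
</haskell><br />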
<br />
{{Exercises|<br />
# Implement <code>(>>{{=}})</code> in terms of <code>fmap</code> (or <code>liftM</code>) and <code>join</code>.<br />
# Now implement <code>join</code> and <code>fmap</code> (<code>liftM</code>) in terms of <code>(>>{{=}})</code> and <code>return</code>.<br />
}}<br />
<br />
==Utility functions==<br />
<br />
The [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad.html <code>Control.Monad</code>] module provides a large number of convenient utility functions, all of which can be implemented in terms of the basic <code>Monad</code> operations (<code>return</code> and <code>(>>=)</code> in particular). We have already seen one of them, namely, <code>join</code>. We also mention some other noteworthy ones here; implementing these utility functions oneself is a good exercise. For a more detailed guide to these functions, with commentary and example code, see Henk-Jan van Tuyl’s [http://members.chello.nl/hjgtuyl/tourdemonad.html tour].<br />
<br />
{{note|This will most likely change in Haskell 2014 with the implementation of the [[Functor-Applicative-Monad_Proposal|Haskell 2014 Applicative => Monad proposal]].}}<br />
<br />
* <code>liftM :: Monad m => (a -> b) -> m a -> m b</code>. This should be familiar; of course, it is just <code>fmap</code>. The fact that we have both <code>fmap</code> and <code>liftM</code> is an unfortunate consequence of the fact that the <code>Monad</code> type class does not require a <code>Functor</code> instance, even though mathematically speaking, every monad is a functor. However, <code>fmap</code> and <code>liftM</code> are essentially interchangeable, since it is a bug (in a social rather than technical sense) for any type to be an instance of <code>Monad</code> without also being an instance of <code>Functor</code> {{noteref}}.<br />
<br />
* <code>ap :: Monad m => m (a -> b) -> m a -> m b</code> should also be familiar: it is equivalent to <code>(<*>)</code>, justifying the claim that the <code>Monad</code> interface is strictly more powerful than <code>Applicative</code>. We can make any <code>Monad</code> into an instance of <code>Applicative</code> by setting <code>pure = return</code> and <code>(<*>) = ap</code>.<br />
<br />
* <code>sequence :: Monad m => [m a] -> m [a]</code> takes a list of computations and combines them into one computation which collects a list of their results. It is again something of a historical accident that <code>sequence</code> has a <code>Monad</code> constraint, since it can actually be implemented only in terms of <code>Applicative</code>. There is an additional generalization of <code>sequence</code> to structures other than lists, which will be discussed in the [[#Traversable|section on <code>Traversable</code>]].<br />
<br />
* <code>replicateM :: Monad m => Int -> m a -> m [a]</code> is simply a combination of [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Prelude.html#v:replicate <code>replicate</code>] and <code>sequence</code>.<br />
<br />
* <code>when :: Monad m => Bool -> m () -> m ()</code> conditionally executes a computation, evaluating to its second argument if the test is <code>True</code>, and to <code>return ()</code> if the test is <code>False</code>. A collection of other sorts of monadic conditionals can be found in the [http://hackage.haskell.org/package/IfElse <code>IfElse</code> package].<br />
<br />
* <code>mapM :: Monad m => (a -> m b) -> [a] -> m [b]</code> maps its first argument over the second, and <code>sequence</code>s the results. The <code>forM</code> function is just <code>mapM</code> with its arguments reversed; it is called <code>forM</code> since it models generalized <code>for</code> loops: the list <code>[a]</code> provides the loop indices, and the function <code>a -> m b</code> specifies the “body” of the loop for each index.<br />
<br />
* <code>(=<<) :: Monad m => (a -> m b) -> m a -> m b</code> is just <code>(>>=)</code> with its arguments reversed; sometimes this direction is more convenient since it corresponds more closely to function application.<br />
<br />
* <code>(>=>) :: Monad m => (a -> m b) -> (b -> m c) -> a -> m c</code> is sort of like function composition, but with an extra <code>m</code> on the result type of each function, and the arguments swapped. We’ll have more to say about this operation later. There is also a flipped variant, <code>(<=<)</code>.<br />
<br />
* The <code>guard</code> function is for use with instances of <code>MonadPlus</code>, which is discussed at the end of the [[#Monoid|<code>Monoid</code> section]].<br />
<br />
Many of these functions also have “underscored” variants, such as <code>sequence_</code> and <code>mapM_</code>; these variants throw away the results of the computations passed to them as arguments, using them only for their side effects.<br />
<br />
Other monadic functions which are occasionally useful include <code>filterM</code>, <code>zipWithM</code>, <code>foldM</code>, and <code>forever</code>.<br />
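<br />
A few of these utilities in action, using the <code>Maybe</code> and list monads (a sketch):<br />
<br />
<haskell><br />
import Control.Monad<br />
<br />
exSequence, exSequence' :: Maybe [Int]<br />
exSequence  = sequence [Just 1, Just 2, Just 3]   -- Just [1,2,3]<br />
exSequence' = sequence [Just 1, Nothing, Just 3]  -- Nothing<br />
<br />
exReplicate :: [[Int]]<br />
exReplicate = replicateM 2 [0,1]                  -- [[0,0],[0,1],[1,0],[1,1]]<br />
<br />
exMapM :: Maybe [Int]<br />
exMapM = mapM (\x -> if x > 0 then Just x else Nothing) [1,2,3]<br />
                                                  -- Just [1,2,3]<br />
</haskell><br />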
<br />
==Laws==<br />
<br />
There are several laws that instances of <code>Monad</code> should satisfy (see also the [[Monad laws]] wiki page). The standard presentation is:<br />
<br />
<haskell><br />
return a >>= k = k a<br />
m >>= return = m<br />
m >>= (\x -> k x >>= h) = (m >>= k) >>= h<br />
<br />
fmap f xs = xs >>= return . f = liftM f xs<br />
</haskell><br />
<br />
The first and second laws express the fact that <code>return</code> behaves nicely: if we inject a value <code>a</code> into a monadic context with <code>return</code>, and then bind to <code>k</code>, it is the same as just applying <code>k</code> to <code>a</code> in the first place; if we bind a computation <code>m</code> to <code>return</code>, nothing changes. The third law essentially says that <code>(>>=)</code> is associative, sort of. The last law ensures that <code>fmap</code> and <code>liftM</code> are the same for types which are instances of both <code>Functor</code> and <code>Monad</code>—which, as already noted, should be every instance of <code>Monad</code>.<br />
<br />
{{note|I like to pronounce this operator “fish”.}}<br />
<br />
However, the presentation of the above laws, especially the third, is marred by the asymmetry of <code>(>>=)</code>. It’s hard to look at the laws and see what they’re really saying. I prefer a much more elegant version of the laws, which is formulated in terms of <code>(>=>)</code> {{noteref}}. Recall that <code>(>=>)</code> “composes” two functions of type <code>a -> m b</code> and <code>b -> m c</code>. You can think of something of type <code>a -> m b</code> (roughly) as a function from <code>a</code> to <code>b</code> which may also have some sort of effect in the context corresponding to <code>m</code>. <code>(>=>)</code> lets us compose these “effectful functions”, and we would like to know what properties <code>(>=>)</code> has. The monad laws reformulated in terms of <code>(>=>)</code> are:<br />
<br />
<haskell><br />
return >=> g = g<br />
g >=> return = g<br />
(g >=> h) >=> k = g >=> (h >=> k)<br />
</haskell><br />
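<br />
To get a feel for <code>(>=>)</code>, here is a sketch composing two “effectful functions” in the <code>Maybe</code> monad (<code>safeHead</code> is a hypothetical helper, not a standard function):<br />
<br />
<haskell><br />
import Control.Monad ((>=>))<br />
<br />
safeHead :: [a] -> Maybe a<br />
safeHead []    = Nothing<br />
safeHead (x:_) = Just x<br />
<br />
-- the head of the first inner list, if both heads exist<br />
headOfHead :: [[a]] -> Maybe a<br />
headOfHead = safeHead >=> safeHead<br />
<br />
-- headOfHead [[1,2],[3]] == Just 1<br />
-- headOfHead []          == Nothing<br />
</haskell><br />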
<br />
{{note|As fans of category theory will note, these laws say precisely that functions of type <code>a -> m b</code> are the arrows of a category with <code>(>{{=}}>)</code> as composition! Indeed, this is known as the ''Kleisli category'' of the monad <code>m</code>. It will come up again when we discuss <code>Arrow</code>s.}}<br />
<br />
Ah, much better! The laws simply state that <code>return</code> is the identity of <code>(>=>)</code>, and that <code>(>=>)</code> is associative {{noteref}}.<br />
<br />
There is also a formulation of the monad laws in terms of <code>fmap</code>, <code>return</code>, and <code>join</code>; for a discussion of this formulation, see the Haskell [http://en.wikibooks.org/wiki/Haskell/Category_theory wikibook page on category theory].<br />
<br />
{{Exercises|<br />
# Given the definition <code>g >{{=}}> h {{=}} \x -> g x >>{{=}} h</code>, prove the equivalence of the above laws and the usual monad laws.<br />
}}<br />
<br />
==<code>do</code> notation==<br />
<br />
Haskell’s special <code>do</code> notation supports an “imperative style” of programming by providing syntactic sugar for chains of monadic expressions. The genesis of the notation lies in realizing that something like <code>a >>= \x -> b >> c >>= \y -> d </code> can be more readably written by putting successive computations on separate lines:<br />
<br />
<haskell><br />
a >>= \x -><br />
b >><br />
c >>= \y -><br />
d<br />
</haskell><br />
<br />
This emphasizes that the overall computation consists of four computations <code>a</code>, <code>b</code>, <code>c</code>, and <code>d</code>, and that <code>x</code> is bound to the result of <code>a</code>, and <code>y</code> is bound to the result of <code>c</code> (<code>b</code>, <code>c</code>, and <code>d</code> are allowed to refer to <code>x</code>, and <code>d</code> is allowed to refer to <code>y</code> as well). From here it is not hard to imagine a nicer notation:<br />
<br />
<haskell><br />
do { x <- a<br />
; b<br />
; y <- c<br />
; d<br />
}<br />
</haskell><br />
<br />
(The curly braces and semicolons may optionally be omitted; the Haskell parser uses layout to determine where they should be inserted.) This discussion should make clear that <code>do</code> notation is just syntactic sugar. In fact, <code>do</code> blocks are recursively translated into monad operations (almost) like this:<br />
<br />
<pre><br />
do e → e<br />
do { e; stmts } → e >> do { stmts }<br />
do { v <- e; stmts } → e >>= \v -> do { stmts }<br />
do { let decls; stmts} → let decls in do { stmts }<br />
</pre><br />
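<br />
To see these rules in action, here is a small sketch desugaring the four-statement block from above by hand; the particular <code>Maybe</code> computations standing in for <code>a</code>, <code>b</code>, <code>c</code>, and <code>d</code> are invented purely for illustration:<br />

```haskell
-- Hand-desugaring a do block, following the translation rules above.
sugared :: Maybe Int
sugared = do { x <- Just 1
             ; Just ()        -- plays the role of b
             ; y <- Just 2
             ; Just (x + y)   -- plays the role of d
             }

-- The same computation after applying the translation rules:
desugared :: Maybe Int
desugared =
  Just 1  >>= \x ->
  Just () >>
  Just 2  >>= \y ->
  Just (x + y)

main :: IO ()
main = print (sugared == desugared)  -- True
```

Both forms denote the same computation; the translation is purely syntactic and works the same way for any monad.<br />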
<br />
This is not quite the whole story, since <code>v</code> might be a pattern instead of a variable. For example, one can write<br />
<br />
<haskell><br />
do (x:xs) <- foo<br />
bar x<br />
</haskell><br />
<br />
but what happens if <code>foo</code> produces an empty list? Well, remember that ugly <code>fail</code> function in the <code>Monad</code> type class declaration? That’s what happens. See [http://www.haskell.org/onlinereport/exps.html#sect3.14 section 3.14 of the Haskell Report] for the full details. See also the discussion of <code>MonadPlus</code> and <code>MonadZero</code> in the [[#Other monoidal classes: Alternative, MonadPlus, ArrowPlus|section on other monoidal classes]].<br />
<br />
A final note on intuition: <code>do</code> notation plays very strongly to the “computational context” point of view rather than the “container” point of view, since the binding notation <code>x <- m</code> is suggestive of “extracting” a single <code>x</code> from <code>m</code> and doing something with it. But <code>m</code> may represent some sort of a container, such as a list or a tree; the meaning of <code>x <- m</code> is entirely dependent on the implementation of <code>(>>=)</code>. For example, if <code>m</code> is a list, <code>x <- m</code> actually means that <code>x</code> will take on each value from the list in turn.<br />
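<br />
For example (a small sketch), in the list monad each binding ranges over every element of its list, so a <code>do</code> block enumerates all combinations:<br />

```haskell
-- In the list monad, x <- m draws each element of the list in turn,
-- so the block below pairs every x with every c.
pairs :: [(Int, Char)]
pairs = do
  x <- [1, 2]
  c <- "ab"
  return (x, c)

main :: IO ()
main = print pairs  -- [(1,'a'),(1,'b'),(2,'a'),(2,'b')]
```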
<br />
==Further reading==<br />
<br />
Philip Wadler was the first to propose using monads to structure functional programs. [http://homepages.inf.ed.ac.uk/wadler/topics/monads.html His paper] is still a readable introduction to the subject.<br />
<br />
{{note|1=<br />
[[All About Monads]],<br />
[http://www.haskell.org/haskellwiki/Monads_as_Containers Monads as containers],<br />
[http://en.wikibooks.org/w/index.php?title=Haskell/Understanding_monads Understanding monads],<br />
[[The Monadic Way]],<br />
[http://blog.sigfpe.com/2006/08/you-could-have-invented-monads-and.html You Could Have Invented Monads! (And Maybe You Already Have.)],<br />
[http://www.haskell.org/pipermail/haskell-cafe/2006-November/019190.html there’s a monster in my Haskell!],<br />
[http://kawagner.blogspot.com/2007/02/understanding-monads-for-real.html Understanding Monads. For real.],<br />
[http://www.randomhacks.net/articles/2007/03/12/monads-in-15-minutes Monads in 15 minutes: Backtracking and Maybe],<br />
[http://www.haskell.org/haskellwiki/Monads_as_computation Monads as computation],<br />
[http://metafoo.co.uk/practical-monads.txt Practical Monads]}}<br />
<br />
There are, of course, numerous monad tutorials of varying quality {{noteref}}.<br />
<br />
A few of the best include Cale Gibbard’s [http://www.haskell.org/haskellwiki/Monads_as_Containers Monads as containers] and [http://www.haskell.org/haskellwiki/Monads_as_computation Monads as computation]; Jeff Newbern’s [[All About Monads]], a comprehensive guide with lots of examples; and Dan Piponi’s [http://blog.sigfpe.com/2006/08/you-could-have-invented-monads-and.html You Could Have Invented Monads!], which features great exercises. If you just want to know how to use <code>IO</code>, you could consult the [[Introduction to IO]]. Even this is just a sampling; the [[monad tutorials timeline]] is a more complete list. (All these monad tutorials have prompted parodies like [http://koweycode.blogspot.com/2007/01/think-of-monad.html think of a monad ...] as well as other kinds of backlash like [http://ahamsandwich.wordpress.com/2007/07/26/monads-and-why-monad-tutorials-are-all-awful/ Monads! (and Why Monad Tutorials Are All Awful)] or [http://byorgey.wordpress.com/2009/01/12/abstraction-intuition-and-the-monad-tutorial-fallacy/ Abstraction, intuition, and the “monad tutorial fallacy”].)<br />
<br />
Other good monad references which are not necessarily tutorials include [http://members.chello.nl/hjgtuyl/tourdemonad.html Henk-Jan van Tuyl’s tour] of the functions in <code>Control.Monad</code>, Dan Piponi’s [http://blog.sigfpe.com/2006/10/monads-field-guide.html field guide], Tim Newsham’s [http://www.thenewsh.com/~newsham/haskell/monad.html What’s a Monad?], and Chris Smith's excellent article [http://cdsmith.wordpress.com/2012/04/18/why-do-monads-matter/ Why Do Monads Matter?]. There are also many blog posts which have been written on various aspects of monads; a collection of links can be found under [[Blog articles/Monads]].<br />
<br />
For help constructing monads from scratch, and for obtaining a "deep embedding" of monad operations suitable for use in, say, compiling a domain-specific language, see [http://projects.haskell.org/operational Apfelmus's operational package].<br />
<br />
One of the quirks of the <code>Monad</code> class and the Haskell type system is that it is not possible to straightforwardly declare <code>Monad</code> instances for types which require a class constraint on their data, even if they are monads from a mathematical point of view. For example, <code>Data.Set</code> requires an <code>Ord</code> constraint on its data, so it cannot be easily made an instance of <code>Monad</code>. A solution to this problem was [http://www.randomhacks.net/articles/2007/03/15/data-set-monad-haskell-macros first described by Eric Kidd], and later made into a [http://hackage.haskell.org/cgi-bin/hackage-scripts/package/rmonad library named rmonad] by Ganesh Sittampalam and Peter Gavin.<br />
<br />
There are many good reasons for eschewing <code>do</code> notation; some have gone so far as to [[Do_notation_considered_harmful|consider it harmful]].<br />
<br />
Monads can be generalized in various ways; for an exposition of one possibility, see Robert Atkey’s paper on [http://homepages.inf.ed.ac.uk/ratkey/paramnotions-jfp.pdf parameterized monads], or Dan Piponi’s [http://blog.sigfpe.com/2009/02/beyond-monads.html Beyond Monads].<br />
<br />
For the categorically inclined, monads can be viewed as monoids ([http://blog.sigfpe.com/2008/11/from-monoids-to-monads.html From Monoids to Monads]) and also as closure operators [http://blog.plover.com/math/monad-closure.html Triples and Closure]. Derek Elkins’s article in [http://www.haskell.org/wikiupload/8/85/TMR-Issue13.pdf issue 13 of the Monad.Reader] contains an exposition of the category-theoretic underpinnings of some of the standard <code>Monad</code> instances, such as <code>State</code> and <code>Cont</code>. Jonathan Hill and Keith Clarke have [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.53.6497 an early paper explaining the connection between monads as they arise in category theory and as used in functional programming]. There is also a [http://okmij.org/ftp/Computation/IO-monad-history.html web page by Oleg Kiselyov] explaining the history of the IO monad.<br />
<br />
Links to many more research papers related to monads can be found under [[Research papers/Monads and arrows]].<br />
<br />
=Monad transformers=<br />
<br />
One would often like to be able to combine two monads into one: for example, to have stateful, nondeterministic computations (<code>State</code> + <code>[]</code>), or computations which may fail and can consult a read-only environment (<code>Maybe</code> + <code>Reader</code>), and so on. Unfortunately, monads do not compose as nicely as applicative functors (yet another reason to use <code>Applicative</code> if you don’t need the full power that <code>Monad</code> provides), but some monads can be combined in certain ways.<br />
<br />
==Standard monad transformers==<br />
<br />
The [http://hackage.haskell.org/package/transformers transformers] library provides a number of standard ''monad transformers''. Each monad transformer adds a particular capability/feature/effect to any existing monad.<br />
<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Identity.html <code>IdentityT</code>] is the identity transformer, which maps a monad to (something isomorphic to) itself. This may seem useless at first glance, but it is useful for the same reason that the <code>id</code> function is useful -- it can be passed as an argument to things which are parameterized over an arbitrary monad transformer, when you do not actually want any extra capabilities.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-State.html <code>StateT</code>] adds a read-write state.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Reader.html <code>ReaderT</code>] adds a read-only environment.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Writer.html <code>WriterT</code>] adds a write-only log.<br />
* [http://hackage.haskell.org/packages/archive/transformers/0.2.2.0/doc/html/Control-Monad-Trans-RWS.html <code>RWST</code>] conveniently combines <code>ReaderT</code>, <code>WriterT</code>, and <code>StateT</code> into one.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Maybe.html <code>MaybeT</code>] adds the possibility of failure.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Error.html <code>ErrorT</code>] adds the possibility of failure with an arbitrary type to represent errors.<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-List.html <code>ListT</code>] adds non-determinism (however, see the discussion of <code>ListT</code> below).<br />
* [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Control-Monad-Trans-Cont.html <code>ContT</code>] adds continuation handling.<br />
<br />
For example, <code>StateT s Maybe</code> is an instance of <code>Monad</code>; computations of type <code>StateT s Maybe a</code> may fail, and have access to a mutable state of type <code>s</code>. Monad transformers can be multiply stacked. One thing to keep in mind while using monad transformers is that the order of composition matters. For example, when a <code>StateT s Maybe a</code> computation fails, the state ceases being updated (indeed, it simply disappears); on the other hand, the state of a <code>MaybeT (State s) a</code> computation may continue to be modified even after the computation has "failed". This may seem backwards, but it is correct. Monad transformers build composite monads “inside out”; <code>MaybeT (State s) a</code> is isomorphic to <code>s -> (Maybe a, s)</code>. (Lambdabot has an indispensable <code>@unmtl</code> command which you can use to “unpack” a monad transformer stack in this way.)<br />
Intuitively, the monads become "more fundamental" the further inside the stack you get, and the effects of inner monads "have precedence" over the effects of outer ones. Of course, this is just handwaving, and if you are unsure of the proper order for some monads you wish to combine, there is no substitute for using <code>@unmtl</code> or simply trying out the various options.<br />
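<br />
To make the difference concrete, here is a small sketch using the <code>transformers</code> package (the names <code>failOuter</code> and <code>failInner</code> are invented for illustration):<br />

```haskell
import Control.Monad.Trans.Class (lift)
import Control.Monad.Trans.Maybe (MaybeT (..), runMaybeT)
import Control.Monad.Trans.State (State, StateT, modify, runState, runStateT)

-- StateT Int Maybe: on failure the state update disappears entirely.
failOuter :: StateT Int Maybe ()
failOuter = do
  modify (+ 1)
  lift Nothing      -- failure in the inner Maybe wipes out everything

-- MaybeT (State Int): the state survives even after the computation "fails".
failInner :: MaybeT (State Int) ()
failInner = do
  lift (modify (+ 1))
  MaybeT (return Nothing)

main :: IO ()
main = do
  print (runStateT failOuter 0)             -- Nothing
  print (runState (runMaybeT failInner) 0)  -- (Nothing,1)
```

The first computation loses its state update along with everything else when it fails; the second reports failure but keeps the updated state, matching the <code>s -> (Maybe a, s)</code> unpacking above.<br />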
<br />
==Definition and laws==<br />
<br />
All monad transformers should implement the <code>MonadTrans</code> type class, defined in <code>Control.Monad.Trans.Class</code>:<br />
<br />
<haskell><br />
class MonadTrans t where<br />
  lift :: Monad m => m a -> t m a<br />
</haskell><br />
<br />
It allows arbitrary computations in the base monad <code>m</code> to be “lifted” into computations in the transformed monad <code>t m</code>. (Note that type application associates to the left, just like function application, so <code>t m a = (t m) a</code>.)<br />
<br />
<code>lift</code> must satisfy the laws<br />
<haskell><br />
lift . return = return<br />
lift (m >>= f) = lift m >>= (lift . f)<br />
</haskell><br />
which intuitively state that <code>lift</code> transforms <code>m a</code> computations into <code>t m a</code> computations in a "sensible" way, which sends the <code>return</code> and <code>(>>=)</code> of <code>m</code> to the <code>return</code> and <code>(>>=)</code> of <code>t m</code>.<br />
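<br />
These laws can be spot-checked at a concrete instance; in the sketch below (names invented, using the <code>transformers</code> package) the base monad is <code>State Int</code> and the transformer is <code>MaybeT</code>:<br />

```haskell
import Control.Monad.Trans.Class (lift)
import Control.Monad.Trans.Maybe (MaybeT, runMaybeT)
import Control.Monad.Trans.State (State, get, modify, put, runState)

-- Run a transformed computation from a fixed initial state,
-- so that results can be compared for equality.
run :: MaybeT (State Int) a -> (Maybe a, Int)
run t = runState (runMaybeT t) 0

law1 :: Bool
law1 = run (lift (return 'x')) == run (return 'x')

law2 :: Bool
law2 = run (lift (m >>= f)) == run (lift m >>= (lift . f))
  where
    m = modify (+ 1) >> get   -- a State Int computation
    f x = put (x * 10)        -- a Kleisli arrow in State Int

main :: IO ()
main = print (law1 && law2)  -- True
```

This checks only one pair of examples, of course; the laws themselves quantify over all <code>m</code> and <code>f</code>.<br />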
<br />
{{Exercises|<br />
# What is the kind of <code>t</code> in the declaration of <code>MonadTrans</code>?<br />
}}<br />
<br />
==Transformer type classes and "capability" style==<br />
<br />
{{note|The only problem with this scheme is the quadratic number of instances required as the number of standard monad transformers grows—but as the current set of standard monad transformers seems adequate for most common use cases, this may not be that big of a deal.}}<br />
<br />
There are also type classes (provided by the [http://hackage.haskell.org/package/mtl <code>mtl</code> package]) for the operations of each transformer. For example, the <code>MonadState</code> type class provides the state-specific methods <code>get</code> and <code>put</code>, allowing you to conveniently use these methods not only with <code>State</code>, but with any monad which is an instance of <code>MonadState</code>—including <code>MaybeT (State s)</code>, <code>StateT s (ReaderT r IO)</code>, and so on. Similar type classes exist for <code>Reader</code>, <code>Writer</code>, <code>Cont</code>, <code>IO</code>, and others {{noteref}}.<br />
<br />
These type classes serve two purposes. First, they get rid of (most of) the need for explicitly using <code>lift</code>, giving a type-directed way to automatically determine the right number of calls to <code>lift</code>. Simply writing <code>put</code> will be automatically translated into <code>lift . put</code>, <code>lift . lift . put</code>, or something similar depending on what concrete monad stack you are using.<br />
<br />
Second, they give you more flexibility to switch between different concrete monad stacks. For example, if you are writing a state-based algorithm, don't write<br />
<haskell><br />
foo :: State Int Char<br />
foo = modify (*2) >> return 'x'<br />
</haskell><br />
but rather<br />
<haskell><br />
foo :: MonadState Int m => m Char<br />
foo = modify (*2) >> return 'x'<br />
</haskell><br />
Now, if somewhere down the line you realize you need to introduce the possibility of failure, you might switch from <code>State Int</code> to <code>MaybeT (State Int)</code>. The type of the first version of <code>foo</code> would need to be modified to reflect this change, but the second version of <code>foo</code> can still be used as-is.<br />
<br />
However, this sort of "capability-based" style (<i>e.g.</i> specifying that <code>foo</code> works for any monad with the "state capability") quickly runs into problems when you try to naively scale it up: for example, what if you need to maintain two independent states? A framework for solving this and related problems is described by Schrijvers and Oliveira ([http://users.ugent.be/~tschrijv/Research/papers/icfp2011.pdf Monads, zippers and views: virtualizing the monad stack, ICFP 2011]) and is implemented in the [http://hackage.haskell.org/package/Monatron <code>Monatron</code> package].<br />
<br />
==Composing monads==<br />
<br />
Is the composition of two monads always a monad? As hinted previously, the answer is no.<br />
<br />
Since <code>Applicative</code> functors are closed under composition, the problem must lie with <code>join</code>. Indeed, suppose <code>m</code> and <code>n</code> are arbitrary monads; to make a monad out of their composition we would need to be able to implement<br />
<haskell><br />
join :: m (n (m (n a))) -> m (n a)<br />
</haskell><br />
but it is not clear how this could be done in general. The <code>join</code> method for <code>m</code> is no help, because the two occurrences of <code>m</code> are not next to each other (and likewise for <code>n</code>).<br />
<br />
However, one situation in which it can be done is if <code>n</code> ''distributes'' over <code>m</code>, that is, if there is a function<br />
<haskell><br />
distrib :: n (m a) -> m (n a)<br />
</haskell><br />
satisfying certain laws. See Jones and Duponcheel ([http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.42.2605 Composing Monads]); see also the [[#Traversable|section on Traversable]].<br />
<br />
For a much more in-depth discussion and analysis of the failure of monads to be closed under composition, see [http://stackoverflow.com/questions/13034229/concrete-example-showing-that-monads-are-not-closed-under-composition-with-proo?lq=1 this question on StackOverflow].<br />
<br />
{{Exercises|<br />
* Implement <code>join :: M (N (M (N a))) -> M (N a)</code>, given <code>distrib :: N (M a) -> M (N a)</code> and assuming <code>M</code> and <code>N</code> are instances of <code>Monad</code>.<br />
}}<br />
<br />
==Further reading==<br />
<br />
Much of the monad transformer library (originally [http://hackage.haskell.org/package/mtl <code>mtl</code>], now split between <code>mtl</code> and [http://hackage.haskell.org/package/transformers <code>transformers</code>]), including the <code>Reader</code>, <code>Writer</code>, <code>State</code>, and other monads, as well as the monad transformer framework itself, was inspired by Mark Jones’s classic paper [http://web.cecs.pdx.edu/~mpj/pubs/springschool.html Functional Programming with Overloading and Higher-Order Polymorphism]. It’s still very much worth a read—and highly readable—after almost fifteen years.<br />
<br />
See [http://article.gmane.org/gmane.comp.lang.haskell.libraries/17139 Edward Kmett's mailing list message] for a description of the history and relationships among monad transformer packages (<code>mtl</code>, <code>transformers</code>, <code>monads-fd</code>, <code>monads-tf</code>).<br />
<br />
There are two excellent references on monad transformers. Martin Grabmüller’s [http://www.grabmueller.de/martin/www/pub/Transformers.en.html Monad Transformers Step by Step] is a thorough description, with running examples, of how to use monad transformers to elegantly build up computations with various effects. [http://cale.yi.org/index.php/How_To_Use_Monad_Transformers Cale Gibbard’s article] on how to use monad transformers is more practical, describing how to structure code using monad transformers to make writing it as painless as possible. Another good starting place for learning about monad transformers is a [http://blog.sigfpe.com/2006/05/grok-haskell-monad-transformers.html blog post by Dan Piponi].<br />
<br />
The <code>ListT</code> transformer from the <code>transformers</code> package comes with the caveat that <code>ListT m</code> is only a monad when <code>m</code> is ''commutative'', that is, when <code>ma >>= \a -> mb >>= \b -> foo</code> is equivalent to <code>mb >>= \b -> ma >>= \a -> foo</code> (i.e. the order of <code>m</code>'s effects does not matter). For one explanation why, see Dan Piponi's blog post [http://blog.sigfpe.com/2006/11/why-isnt-listt-monad.html "Why isn't <code><nowiki>ListT []</nowiki></code> a monad"]. For more examples, as well as a design for a version of <code>ListT</code> which does not have this problem, see [http://www.haskell.org/haskellwiki/ListT_done_right <code>ListT</code> done right].<br />
<br />
There is an alternative way to compose monads, using coproducts, as described by [http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.8.3581 Lüth and Ghani]. This method is interesting but has not (yet?) seen widespread use. For a more recent alternative, see Kiselyov et al's [http://okmij.org/ftp/Haskell/extensible/exteff.pdf Extensible Effects: An Alternative to Monad Transformers].<br />
<br />
=MonadFix=<br />
<br />
''Note: <code>MonadFix</code> is included here for completeness (and because it is interesting) but seems not to be used much. Skipping this section on a first read-through is perfectly OK (and perhaps even recommended).''<br />
<br />
==<code>mdo</code>/<code>do rec</code> notation==<br />
<br />
{{note|In GHC 7.6, the flag has been changed to <code>-XRecursiveDo</code>.}}<br />
The <code>MonadFix</code> class describes monads which support the special fixpoint operation <code>mfix :: (a -> m a) -> m a</code>, which allows the output of monadic computations to be defined via (effectful) recursion. This is [http://www.haskell.org/ghc/docs/latest/html/users_guide/syntax-extns.html#recursive-do-notation supported in GHC] by a special “recursive do” notation, enabled by the <code>-XDoRec</code> flag{{noteref}}. Within a <code>do</code> block, one may have a nested <code>rec</code> block, like so:<br />
<haskell><br />
do { x <- foo<br />
; rec { y <- baz<br />
; z <- bar<br />
; bob<br />
}<br />
; w <- frob<br />
}<br />
</haskell><br />
Normally (if we had <code>do</code> in place of <code>rec</code> in the above example), <code>y</code> would be in scope in <code>bar</code> and <code>bob</code> but not in <code>baz</code>, and <code>z</code> would be in scope only in <code>bob</code>. With the <code>rec</code>, however, <code>y</code> and <code>z</code> are both in scope in all three of <code>baz</code>, <code>bar</code>, and <code>bob</code>. A <code>rec</code> block is analogous to a <code>let</code> block such as<br />
<haskell><br />
let { y = baz<br />
; z = bar<br />
}<br />
in bob<br />
</haskell><br />
because, in Haskell, every variable bound in a <code>let</code>-block is in scope throughout the entire block. (From this point of view, Haskell's normal <code>do</code> blocks are analogous to Scheme's <code>let*</code> construct.)<br />
<br />
What could such a feature be used for? One of the motivating examples given in the original paper describing <code>MonadFix</code> (see below) is encoding circuit descriptions. A line in a <code>do</code>-block such as <br />
<haskell><br />
x <- gate y z<br />
</haskell><br />
describes a gate whose input wires are labeled <code>y</code> and <code>z</code> and whose output wire is labeled <code>x</code>. Many (most?) useful circuits, however, involve some sort of feedback loop, making them impossible to write in a normal <code>do</code>-block (since some wire would have to be mentioned as an input ''before'' being listed as an output). Using a <code>rec</code> block solves this problem.<br />
<br />
==Examples and intuition==<br />
<br />
Of course, not every monad supports such recursive binding. However, as mentioned above, it suffices to have an implementation of <code>mfix :: (a -> m a) -> m a</code>, satisfying a few laws. Let's try implementing <code>mfix</code> for the <code>Maybe</code> monad. That is, we want to implement a function<br />
<haskell><br />
maybeFix :: (a -> Maybe a) -> Maybe a<br />
</haskell><br />
{{note|Actually, <code>fix</code> is implemented slightly differently for efficiency reasons; but the given definition is equivalent and simpler for the present purpose.}}<br />
Let's think for a moment about the implementation {{noteref}} of the non-monadic <code>fix :: (a -> a) -> a</code>:<br />
<haskell><br />
fix f = f (fix f)<br />
</haskell><br />
Inspired by <code>fix</code>, our first attempt at implementing <code>maybeFix</code> might be something like<br />
<haskell><br />
maybeFix :: (a -> Maybe a) -> Maybe a<br />
maybeFix f = maybeFix f >>= f<br />
</haskell><br />
This has the right type. However, something seems wrong: there is nothing in particular here about <code>Maybe</code>; <code>maybeFix</code> actually has the more general type <code>Monad m => (a -> m a) -> m a</code>. But didn't we just say that not all monads support <code>mfix</code>?<br />
<br />
The answer is that although this implementation of <code>maybeFix</code> has the right type, it does ''not'' have the intended semantics. If we think about how <code>(>>=)</code> works for the <code>Maybe</code> monad (by pattern-matching on its first argument to see whether it is <code>Nothing</code> or <code>Just</code>) we can see that this definition of <code>maybeFix</code> is completely useless: it will just recurse infinitely, trying to decide whether it is going to return <code>Nothing</code> or <code>Just</code>, without ever even so much as a glance in the direction of <code>f</code>.<br />
<br />
The trick is to simply ''assume'' that <code>maybeFix</code> will return <code>Just</code>, and get on with life!<br />
<haskell><br />
maybeFix :: (a -> Maybe a) -> Maybe a<br />
maybeFix f = ma<br />
  where ma = f (fromJust ma)<br />
</haskell><br />
This says that the result of <code>maybeFix</code> is <code>ma</code>, and assuming that <code>ma = Just x</code>, it is defined (recursively) to be equal to <code>f x</code>.<br />
<br />
Why is this OK? Isn't <code>fromJust</code> almost as bad as <code>unsafePerformIO</code>? Well, usually, yes. This is just about the only situation in which it is justified! The interesting thing to note is that <code>maybeFix</code> ''will never crash'' -- although it may, of course, fail to terminate. The only way we could get a crash is if we try to evaluate <code>fromJust ma</code> when we know that <code>ma = Nothing</code>. But how could we know <code>ma = Nothing</code>? Since <code>ma</code> is defined as <code>f (fromJust ma)</code>, it must be that this expression has already been evaluated to <code>Nothing</code> -- in which case there is no reason for us to be evaluating <code>fromJust ma</code> in the first place! <br />
<br />
To see this from another point of view, we can consider three possibilities. First, if <code>f</code> outputs <code>Nothing</code> without looking at its argument, then <code>maybeFix f</code> clearly returns <code>Nothing</code>. Second, if <code>f</code> always outputs <code>Just x</code>, where <code>x</code> depends on its argument, then the recursion can proceed usefully: <code>fromJust ma</code> will be able to evaluate to <code>x</code>, thus feeding <code>f</code>'s output back to it as input. Third, if <code>f</code> tries to use its argument to decide whether to output <code>Just</code> or <code>Nothing</code>, then <code>maybeFix f</code> will not terminate: evaluating <code>f</code>'s argument requires evaluating <code>ma</code> to see whether it is <code>Just</code>, which requires evaluating <code>f (fromJust ma)</code>, which requires evaluating <code>ma</code>, ... and so on.<br />
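<br />
The first two possibilities are easy to exercise with the <code>maybeFix</code> defined above (repeated here so the sketch stands alone):<br />

```haskell
import Data.Maybe (fromJust)

-- As above: assume the result will be Just, and recurse.
maybeFix :: (a -> Maybe a) -> Maybe a
maybeFix f = ma
  where ma = f (fromJust ma)

main :: IO ()
main = do
  -- f ignores its argument and fails: the result is simply Nothing.
  print (maybeFix (const Nothing) :: Maybe Int)            -- Nothing
  -- f always succeeds: its output is lazily fed back as its input,
  -- building an infinite list inside Just.
  print (fmap (take 3) (maybeFix (\xs -> Just (1 : xs))))  -- Just [1,1,1]
```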
<br />
There are also instances of <code>MonadFix</code> for lists (which works analogously to the instance for <code>Maybe</code>), for <code>ST</code>, and for <code>IO</code>. The [http://hackage.haskell.org/packages/archive/base/latest/doc/html/src/System-IO.html#fixIO instance for <code>IO</code>] is particularly amusing: it creates a new (empty) <code>MVar</code>, immediately reads its contents using <code>unsafeInterleaveIO</code> (which delays the actual reading lazily until the value is needed), uses the contents of the <code>MVar</code> to compute a new value, which it then writes back into the <code>MVar</code>. It almost seems, spookily, that <code>mfix</code> is sending a value back in time to itself through the <code>MVar</code> -- though of course what is really going on is that the reading is delayed just long enough (via <code>unsafeInterleaveIO</code>) to get the process bootstrapped.<br />
<br />
{{Exercises|<br />
* Implement a <code>MonadFix</code> instance for <code>[]</code>.<br />
}}<br />
<br />
==GHC 7.6 changes==<br />
<br />
GHC 7.6 reinstated the old <code>mdo</code> syntax, so the example at the start of this section can be written<br />
<br />
<haskell><br />
mdo { x <- foo<br />
; y <- baz<br />
; z <- bar<br />
; bob<br />
; w <- frob<br />
}<br />
</haskell><br />
<br />
which will be translated into the original example (assuming that, say, <code>bar</code> and <code>bob</code> refer to <code>y</code>). The difference is that <code>mdo</code> will analyze the code in order to find minimal recursive blocks, which will be placed in <code>rec</code> blocks, whereas <code>rec</code> blocks desugar directly into calls to <code>mfix</code> without any further analysis.<br />
<br />
==Further reading==<br />
<br />
For more information (such as the precise desugaring rules for <code>rec</code> blocks), see Levent Erkök and John Launchbury's 2002 Haskell workshop paper, [http://sites.google.com/site/leventerkok/recdo.pdf?attredirects=0 A Recursive do for Haskell], or for full details, Levent Erkök’s thesis, [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.15.1543&rep=rep1&type=pdf Value Recursion in Monadic Computations]. (Note, while reading, that <code>MonadFix</code> used to be called <code>MonadRec</code>.) You can also read the [http://www.haskell.org/ghc/docs/latest/html/users_guide/syntax-extns.html#recursive-do-notation GHC user manual section on recursive do-notation].<br />
<br />
=Semigroup=<br />
<br />
A semigroup is a set <math>S\ </math> together with a binary operation <math>\oplus\ </math> which<br />
combines elements from <math>S\ </math>. The <math>\oplus\ </math> operator is required to be associative<br />
(that is, <math>(a \oplus b) \oplus c = a \oplus (b \oplus c)\ </math>, for any<br />
<math>a,b,c\ </math> which are elements of <math>S\ </math>).<br />
<br />
For example, the natural numbers under addition form a semigroup: the sum of any two natural numbers is a natural number, and <math>(a+b)+c = a+(b+c)\ </math> for any natural numbers <math>a\ </math>, <math>b\ </math>, and <math>c\,\ </math>. The integers under multiplication also form a semigroup, as do the integers (or rationals, or reals) under <math>\max\ </math> or <math>\min\ </math>, Boolean values under conjunction and disjunction, lists under concatenation, functions from a set to itself under composition ... Semigroups show up all over the place, once you know to look for them.<br />
<br />
==Definition==<br />
<br />
Semigroups are not (yet?) defined in the base package, but the {{HackagePackage|id=semigroups}} package provides a standard definition.<br />
<br />
The definition of the <code>Semigroup</code> type class ([http://hackage.haskell.org/packages/archive/semigroups/latest/doc/html/Data-Semigroup.html haddock]) is as follows:<br />
<br />
<haskell><br />
class Semigroup a where<br />
  (<>) :: a -> a -> a<br />
<br />
  sconcat :: NonEmpty a -> a<br />
  sconcat (a :| as) = go a as where<br />
    go b (c:cs) = b <> go c cs<br />
    go b []     = b<br />
<br />
  times1p :: Whole n => n -> a -> a<br />
  times1p = ...<br />
</haskell><br />
<br />
The really important method is <code>(<>)</code>, representing the associative binary operation. The other two methods have default implementations in terms of <code>(<>)</code>, and are included in the type class in case some instances can give more efficient implementations than the default. <code>sconcat</code> reduces a nonempty list using <code>(<>)</code>; <code>times1p n</code> is equivalent to (but more efficient than) <code>sconcat . replicate n</code>. See the [http://hackage.haskell.org/packages/archive/semigroups/latest/doc/html/Data-Semigroup.html haddock documentation] for more information on <code>sconcat</code> and <code>times1p</code>.<br />
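<br />
Here is a small standalone sketch of the class in action. (It assumes a recent GHC, where the <code>Semigroup</code> class ships with <code>base</code> itself; the <code>MyMin</code> newtype is invented for illustration.)<br />

```haskell
import Data.List.NonEmpty (NonEmpty ((:|)))
import Data.Semigroup (Semigroup (..))

-- A throwaway newtype: values are combined by taking the minimum,
-- which is associative, so this is a lawful Semigroup.
newtype MyMin a = MyMin { getMyMin :: a } deriving (Eq, Show)

instance Ord a => Semigroup (MyMin a) where
  MyMin x <> MyMin y = MyMin (min x y)

main :: IO ()
main = do
  print (getMyMin (MyMin 3 <> MyMin 1 <> MyMin 2))            -- 1
  -- sconcat folds a nonempty list with (<>):
  print (getMyMin (sconcat (MyMin 3 :| [MyMin 1, MyMin 2])))  -- 1
```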
<br />
==Laws==<br />
<br />
The only law is that <code>(<>)</code> must be associative:<br />
<br />
<haskell><br />
(x <> y) <> z = x <> (y <> z)<br />
</haskell><br />
<br />
=Monoid=<br />
<br />
Many semigroups have a special element <math>e</math> for which the binary operation <math>\oplus</math> is the identity, that is, <math>e \oplus x = x \oplus e = x</math> for every element <math>x</math>. Such a semigroup-with-identity-element is called a ''monoid''.<br />
<br />
==Definition==<br />
<br />
The definition of the <code>Monoid</code> type class (defined in<br />
<code>Data.Monoid</code>; [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Monoid.html haddock]) is:<br />
<br />
<haskell><br />
class Monoid a where<br />
  mempty  :: a<br />
  mappend :: a -> a -> a<br />
<br />
  mconcat :: [a] -> a<br />
  mconcat = foldr mappend mempty<br />
</haskell><br />
<br />
The <code>mempty</code> value specifies the identity element of the monoid, and <code>mappend</code><br />
is the binary operation. The default definition for <code>mconcat</code><br />
“reduces” a list of elements by combining them all with <code>mappend</code>,<br />
using a right fold. It is only in the <code>Monoid</code> class so that specific<br />
instances have the option of providing an alternative, more efficient<br />
implementation; usually, you can safely ignore <code>mconcat</code> when creating<br />
a <code>Monoid</code> instance, since its default definition will work just fine.<br />
<br />
The <code>Monoid</code> methods are rather unfortunately named; they are inspired<br />
by the list instance of <code>Monoid</code>, where indeed <code>mempty = []</code> and <code>mappend = (++)</code>, but this is misleading since many<br />
monoids have little to do with appending (see these [http://thread.gmane.org/gmane.comp.lang.haskell.cafe/50590 Comments from OCaml Hacker Brian Hurt] on the Haskell-cafe mailing list). This was improved in GHC 7.4, where <code>(<>)</code> was added as an alias to <code>mappend</code>.<br />
<br />
==Laws==<br />
<br />
Of course, every <code>Monoid</code> instance should actually be a monoid in the<br />
mathematical sense, which implies these laws:<br />
<br />
<haskell><br />
mempty `mappend` x = x<br />
x `mappend` mempty = x<br />
(x `mappend` y) `mappend` z = x `mappend` (y `mappend` z)<br />
</haskell><br />
<br />
==Instances==<br />
<br />
There are quite a few interesting <code>Monoid</code> instances defined in <code>Data.Monoid</code>.<br />
<br />
<ul><br />
<li><code>[a]</code> is a <code>Monoid</code>, with <code>mempty = []</code> and <code>mappend = (++)</code>. It is not hard to check that <code>(x ++ y) ++ z = x ++ (y ++ z)</code> for any lists <code>x</code>, <code>y</code>, and <code>z</code>, and that the empty list is the identity: <code>[] ++ x = x ++ [] = x</code>.</li><br />
<br />
<li>As noted previously, we can make a monoid out of any numeric type under either addition or multiplication. However, since we can’t have two instances for the same type, <code>Data.Monoid</code> provides two <code>newtype</code> wrappers, <code>Sum</code> and <code>Product</code>, with appropriate <code>Monoid</code> instances.<br />
<br />
<haskell><br />
> getSum (mconcat . map Sum $ [1..5])<br />
15<br />
> getProduct (mconcat . map Product $ [1..5])<br />
120<br />
</haskell><br />
<br />
This example code is silly, of course; we could just write<br />
<code>sum [1..5]</code> and <code>product [1..5]</code>. Nevertheless, these instances are useful in more generalized settings, as we will see in the [[Foldable|section on <code>Foldable</code>]].</li><br />
<br />
<li><code>Any</code> and <code>All</code> are <code>newtype</code> wrappers providing <code>Monoid</code> instances for <code>Bool</code> (under disjunction and conjunction, respectively).</li><br />
<br />
<li> There are three instances for <code>Maybe</code>: a basic instance which lifts a <code>Monoid</code> instance for <code>a</code> to an instance for <code>Maybe a</code>, and two <code>newtype</code> wrappers <code>First</code> and <code>Last</code> for which <code>mappend</code> selects the first (respectively last) non-<code>Nothing</code> item.</li><br />
<br />
<li><code>Endo a</code> is a newtype wrapper for functions <code>a -> a</code>, which form a monoid under composition.</li><br />
<br />
<li>There are several ways to “lift” <code>Monoid</code> instances to instances with additional structure. We have already seen that an instance for <code>a</code> can be lifted to an instance for <code>Maybe a</code>. There are also tuple instances: if <code>a</code> and <code>b</code> are instances of <code>Monoid</code>, then so is <code>(a,b)</code>, using the monoid operations for <code>a</code> and <code>b</code> in the obvious pairwise manner. Finally, if <code>a</code> is a <code>Monoid</code>, then so is the function type <code>e -> a</code> for any <code>e</code>; in particular, <code>g `mappend` h</code> is the function which applies both <code>g</code> and <code>h</code> to its argument and then combines the results using the underlying <code>Monoid</code> instance for <code>a</code>. This can be quite useful and elegant (see [http://thread.gmane.org/gmane.comp.lang.haskell.cafe/52416 example]).</li><br />
<br />
<li>The type <code>Ordering = LT | EQ | GT</code> is a <code>Monoid</code>, defined in such a way that <code>mconcat (zipWith compare xs ys)</code> computes the lexicographic ordering of <code>xs</code> and <code>ys</code> (if <code>xs</code> and <code>ys</code> have the same length). In particular, <code>mempty = EQ</code>, and <code>mappend</code> evaluates to its leftmost non-<code>EQ</code> argument (or <code>EQ</code> if both arguments are <code>EQ</code>). This can be used together with the function instance of <code>Monoid</code> to do some clever things ([http://www.reddit.com/r/programming/comments/7cf4r/monoids_in_my_programming_language/c06adnx example]).</li><br />
<br />
<li>There are also <code>Monoid</code> instances for several standard data structures in the containers library ([http://hackage.haskell.org/packages/archive/containers/0.2.0.0/doc/html/index.html haddock]), including <code>Map</code>, <code>Set</code>, and <code>Sequence</code>.</li><br />
</ul><br />
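The <code>Ordering</code> instance in particular makes multi-key sorting pleasant: since <code>mconcat</code> keeps the leftmost non-<code>EQ</code> result, listing comparators in priority order yields a lexicographic comparison. A small sketch (the <code>sortPairs</code> name is invented for illustration):<br />
<br />
<haskell>
import Data.List (sortBy)
import Data.Monoid (mconcat)

-- Sort by the Int field first, breaking ties on the String field.
sortPairs :: [(String, Int)] -> [(String, Int)]
sortPairs = sortBy (\(a, x) (b, y) -> mconcat [compare x y, compare a b])
</haskell>
<br />
For example, <code>sortPairs [("b",1),("a",2),("a",1)]</code> evaluates to <code>[("a",1),("b",1),("a",2)]</code>.<br />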
<br />
<code>Monoid</code> is also used to enable several other type class instances.<br />
As noted previously, we can use <code>Monoid</code> to make <code>((,) e)</code> an instance of <code>Applicative</code>:<br />
<br />
<haskell><br />
instance Monoid e => Applicative ((,) e) where<br />
  pure x = (mempty, x)<br />
  (u, f) <*> (v, x) = (u `mappend` v, f x)<br />
</haskell><br />
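With <code>String</code> as the monoid, this instance accumulates a log alongside the computation, a first taste of the writer monad discussed below (a small sketch, using the <code>((,) e)</code> instance that GHC's base library does in fact provide):<br />
<br />
<haskell>
-- The first components are mappend-ed (String concatenation here),
-- while the function in the second component is applied to the value.
logged :: (String, Int)
logged = ("inc;", (+1)) <*> ("val;", 41)
-- logged == ("inc;val;", 42)
</haskell>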
<br />
<code>Monoid</code> can be similarly used to make <code>((,) e)</code> an instance of <code>Monad</code> as well; this is known as the ''writer monad''. As we’ve already seen, <code>Writer</code> and <code>WriterT</code> are a newtype wrapper and transformer for this monad, respectively.<br />
<br />
<code>Monoid</code> also plays a key role in the <code>Foldable</code> type class (see section [[#Foldable|Foldable]]).<br />
<br />
==Other monoidal classes: Alternative, MonadPlus, ArrowPlus==<br />
<br />
The <code>Alternative</code> type class ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Applicative.html#g:2 haddock])<br />
is for <code>Applicative</code> functors which also have<br />
a monoid structure:<br />
<br />
<haskell><br />
class Applicative f => Alternative f where<br />
  empty :: f a<br />
  (<|>) :: f a -> f a -> f a<br />
</haskell><br />
<br />
Of course, instances of <code>Alternative</code> should satisfy the monoid laws<br />
<br />
<haskell><br />
empty <|> x = x<br />
x <|> empty = x<br />
(x <|> y) <|> z = x <|> (y <|> z)<br />
</haskell><br />
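The <code>Maybe</code> instance shows the intended reading of <code>(<|>)</code> as "try this, and fall back to that": it keeps the first non-<code>Nothing</code> alternative (a quick sketch):<br />
<br />
<haskell>
import Control.Applicative ((<|>))

-- empty for Maybe is Nothing; <|> keeps the first success.
firstHit :: Maybe Int
firstHit = Nothing <|> Just 2 <|> Just 3
-- firstHit == Just 2
</haskell>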
<br />
Likewise, <code>MonadPlus</code> ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Monad.html#t:MonadPlus haddock])<br />
is for <code>Monad</code>s with a monoid structure:<br />
<br />
<haskell><br />
class Monad m => MonadPlus m where<br />
  mzero :: m a<br />
  mplus :: m a -> m a -> m a<br />
</haskell><br />
<br />
The <code>MonadPlus</code> documentation states that it is intended to model<br />
monads which also support “choice and failure”; in addition to the<br />
monoid laws, instances of <code>MonadPlus</code> are expected to satisfy<br />
<br />
<haskell><br />
mzero >>= f = mzero<br />
v >> mzero = mzero<br />
</haskell><br />
<br />
which explains the sense in which <code>mzero</code> denotes failure. Since<br />
<code>mzero</code> should be the identity for <code>mplus</code>, the computation <code>m1 `mplus` m2</code> succeeds (evaluates to something other than <code>mzero</code>) if<br />
either <code>m1</code> or <code>m2</code> does; so <code>mplus</code> represents choice. The <code>guard</code><br />
function can also be used with instances of <code>MonadPlus</code>; it requires a<br />
condition to be satisfied and fails (using <code>mzero</code>) if it is not. A<br />
simple example of a <code>MonadPlus</code> instance is <code>[]</code>, which is exactly the<br />
same as the <code>Monoid</code> instance for <code>[]</code>: the empty list represents<br />
failure, and list concatenation represents choice. In general,<br />
however, a <code>MonadPlus</code> instance for a type need not be the same as its<br />
<code>Monoid</code> instance; <code>Maybe</code> is an example of such a type. A great<br />
introduction to the <code>MonadPlus</code> type class, with interesting examples<br />
of its use, is Doug Auclair’s ''MonadPlus: What a Super Monad!'' in [http://www.haskell.org/wikiupload/6/6a/TMR-Issue11.pdf the Monad.Reader issue 11].<br />
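The list instance makes <code>guard</code> concrete: it prunes the branches of a nondeterministic computation, since failing the condition produces <code>mzero</code> (the empty list). A sketch of the classic Pythagorean-triple search:<br />
<br />
<haskell>
import Control.Monad (guard)

-- All Pythagorean triples with components up to n.
pythag :: Int -> [(Int, Int, Int)]
pythag n = do
  a <- [1..n]
  b <- [a..n]
  c <- [b..n]
  guard (a*a + b*b == c*c)   -- discard combinations that fail
  return (a, b, c)
</haskell>
<br />
For example, <code>pythag 13</code> evaluates to <code>[(3,4,5),(5,12,13),(6,8,10)]</code>.<br />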
<br />
There used to be a type class called <code>MonadZero</code> containing only<br />
<code>mzero</code>, representing monads with failure. The <code>do</code>-notation requires<br />
some notion of failure to deal with failing pattern matches.<br />
Unfortunately, <code>MonadZero</code> was scrapped in favor of adding the <code>fail</code><br />
method to the <code>Monad</code> class. If we are lucky, someday <code>MonadZero</code> will<br />
be restored, and <code>fail</code> will be banished to the bit bucket where it<br />
belongs (see [[MonadPlus reform proposal]]). The idea is that any<br />
<code>do</code>-block which uses pattern matching (and hence may fail) would require<br />
a <code>MonadZero</code> constraint; otherwise, only a <code>Monad</code> constraint would be<br />
required.<br />
<br />
Finally, <code>ArrowZero</code> and <code>ArrowPlus</code> ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Arrow.html#t:ArrowZero haddock])<br />
represent <code>Arrow</code>s ([[#Arrow|see below]]) with a<br />
monoid structure:<br />
<br />
<haskell><br />
class Arrow arr => ArrowZero arr where<br />
  zeroArrow :: b `arr` c<br />
<br />
class ArrowZero arr => ArrowPlus arr where<br />
  (<+>) :: (b `arr` c) -> (b `arr` c) -> (b `arr` c)<br />
</haskell><br />
<br />
==Further reading==<br />
<br />
Monoids have gotten a fair bit of attention recently, ultimately due<br />
to<br />
[http://enfranchisedmind.com/blog/posts/random-thoughts-on-haskell/ a blog post by Brian Hurt], in which he<br />
complained about the fact that the names of many Haskell type classes<br />
(<code>Monoid</code> in particular) are taken from abstract mathematics. This<br />
resulted in [http://thread.gmane.org/gmane.comp.lang.haskell.cafe/50590 a long Haskell-cafe thread]<br />
arguing the point and discussing monoids in general.<br />
<br />
{{note|May its name live forever.}}<br />
<br />
However, this was quickly followed by several blog posts about<br />
<code>Monoid</code> {{noteref}}. First, Dan Piponi<br />
wrote a great introductory post, [http://blog.sigfpe.com/2009/01/haskell-monoids-and-their-uses.html Haskell Monoids and their Uses]. This was quickly followed by<br />
Heinrich Apfelmus’s [http://apfelmus.nfshost.com/monoid-fingertree.html Monoids and Finger Trees], an accessible exposition of<br />
Hinze and Paterson’s [http://www.soi.city.ac.uk/%7Eross/papers/FingerTree.html classic paper on 2-3 finger trees], which makes very clever<br />
use of <code>Monoid</code> to implement an elegant and generic data structure.<br />
Dan Piponi then wrote two fascinating articles about using <code>Monoids</code><br />
(and finger trees): [http://blog.sigfpe.com/2009/01/fast-incremental-regular-expression.html Fast Incremental Regular Expressions] and [http://blog.sigfpe.com/2009/01/beyond-regular-expressions-more.html Beyond Regular Expressions]<br />
<br />
In a similar vein, David Place’s article on improving <code>Data.Map</code> in<br />
order to compute incremental folds (see [http://www.haskell.org/wikiupload/6/6a/TMR-Issue11.pdf the Monad Reader issue 11])<br />
is also a<br />
good example of using <code>Monoid</code> to generalize a data structure.<br />
<br />
Some other interesting examples of <code>Monoid</code> use include [http://www.reddit.com/r/programming/comments/7cf4r/monoids_in_my_programming_language/c06adnx building elegant list sorting combinators], [http://byorgey.wordpress.com/2008/04/17/collecting-unstructured-information-with-the-monoid-of-partial-knowledge/ collecting unstructured information], [http://izbicki.me/blog/gausian-distributions-are-monoids combining probability distributions], and a brilliant series of posts by Chung-Chieh Shan and Dylan Thurston using <code>Monoid</code>s to [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers1/ elegantly solve a difficult combinatorial puzzle] (followed by [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers2/ part 2], [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers3/ part 3], [http://conway.rutgers.edu/~ccshan/wiki/blog/posts/WordNumbers4/ part 4]).<br />
<br />
As unlikely as it sounds, monads can actually be viewed as a sort of<br />
monoid, with <code>join</code> playing the role of the binary operation and<br />
<code>return</code> the role of the identity; see [http://blog.sigfpe.com/2008/11/from-monoids-to-monads.html Dan Piponi’s blog post].<br />
<br />
=Foldable=<br />
<br />
The <code>Foldable</code> class, defined in the <code>Data.Foldable</code><br />
module ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Foldable.html haddock]), abstracts over containers which can be<br />
“folded” into a summary value. This allows such folding operations<br />
to be written in a container-agnostic way.<br />
<br />
==Definition==<br />
<br />
The definition of the <code>Foldable</code> type class is:<br />
<br />
<haskell><br />
class Foldable t where<br />
  fold    :: Monoid m => t m -> m<br />
  foldMap :: Monoid m => (a -> m) -> t a -> m<br />
<br />
  foldr  :: (a -> b -> b) -> b -> t a -> b<br />
  foldl  :: (a -> b -> a) -> a -> t b -> a<br />
  foldr1 :: (a -> a -> a) -> t a -> a<br />
  foldl1 :: (a -> a -> a) -> t a -> a<br />
</haskell><br />
<br />
This may look complicated, but in fact, to make a <code>Foldable</code> instance<br />
you only need to implement one method: your choice of <code>foldMap</code> or<br />
<code>foldr</code>. All the other methods have default implementations in terms<br />
of these, and are presumably included in the class in case more<br />
efficient implementations can be provided.<br />
<br />
==Instances and examples==<br />
<br />
The type of <code>foldMap</code> should make it clear what it is supposed to do:<br />
given a way to convert the data in a container into a <code>Monoid</code> (a<br />
function <code>a -> m</code>) and a container of <code>a</code>’s (<code>t a</code>), <code>foldMap</code><br />
provides a way to iterate over the entire contents of the container,<br />
converting all the <code>a</code>’s to <code>m</code>’s and combining all the <code>m</code>’s with<br />
<code>mappend</code>. The following code shows two examples: a simple<br />
implementation of <code>foldMap</code> for lists, and a binary tree example<br />
provided by the <code>Foldable</code> documentation.<br />
<br />
<haskell><br />
instance Foldable [] where<br />
  foldMap g = mconcat . map g<br />
<br />
data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)<br />
<br />
instance Foldable Tree where<br />
  foldMap f Empty        = mempty<br />
  foldMap f (Leaf x)     = f x<br />
  foldMap f (Node l k r) = foldMap f l `mappend` f k `mappend` foldMap f r<br />
</haskell><br />
<br />
The <code>foldr</code> function has a type similar to the <code>foldr</code> found in the <code>Prelude</code>, but<br />
more general, since the <code>foldr</code> in the <code>Prelude</code> works only on lists.<br />
<br />
The <code>Foldable</code> module also provides instances for <code>Maybe</code> and <code>Array</code>;<br />
additionally, many of the data structures found in the standard [http://hackage.haskell.org/package/containers containers library] (for example, <code>Map</code>, <code>Set</code>, <code>Tree</code>,<br />
and <code>Sequence</code>) provide their own <code>Foldable</code> instances.<br />
<br />
{{Exercises|<br />
# What is the type of <code>foldMap . foldMap</code>? Or <code>foldMap . foldMap . foldMap</code>, etc.? What do they do?<br />
}}<br />
<br />
==Derived folds==<br />
<br />
Given an instance of <code>Foldable</code>, we can write generic,<br />
container-agnostic functions such as:<br />
<br />
<haskell><br />
-- Compute the size of any container.<br />
containerSize :: Foldable f => f a -> Int<br />
containerSize = getSum . foldMap (const (Sum 1))<br />
<br />
-- Compute a list of elements of a container satisfying a predicate.<br />
filterF :: Foldable f => (a -> Bool) -> f a -> [a]<br />
filterF p = foldMap (\a -> if p a then [a] else [])<br />
<br />
-- Get a list of all the Strings in a container which include the<br />
-- letter a.<br />
aStrings :: Foldable f => f String -> [String]<br />
aStrings = filterF (elem 'a')<br />
</haskell><br />
<br />
The <code>Foldable</code> module also provides a large number of predefined<br />
folds, many of which are generalized versions of <code>Prelude</code> functions of the<br />
same name that only work on lists: <code>concat</code>, <code>concatMap</code>, <code>and</code>,<br />
<code>or</code>, <code>any</code>, <code>all</code>, <code>sum</code>, <code>product</code>, <code>maximum</code>(<code>By</code>),<br />
<code>minimum</code>(<code>By</code>), <code>elem</code>, <code>notElem</code>, and <code>find</code>.<br />
<br />
The important function <code>toList</code> is also provided, which turns any <code>Foldable</code> structure into a list of its elements in left-right order; it works by folding with the list monoid.<br />
<br />
There are also generic functions that work with <code>Applicative</code> or<br />
<code>Monad</code> instances to generate some sort of computation from each<br />
element in a container, and then perform all the side effects from<br />
those computations, discarding the results: <code>traverse_</code>, <code>sequenceA_</code>,<br />
and others. The results must be discarded because the <code>Foldable</code><br />
class is too weak to specify what to do with them: we cannot, in<br />
general, make an arbitrary <code>Applicative</code> or <code>Monad</code> instance into a <code>Monoid</code>, but we can make <code>m ()</code> into a <code>Monoid</code> for any such <code>m</code>. If we do have an <code>Applicative</code> or <code>Monad</code> with a monoid<br />
structure—that is, an <code>Alternative</code> or a <code>MonadPlus</code>—then we can<br />
use the <code>asum</code> or <code>msum</code> functions, which can combine the results as<br />
well. Consult the [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Foldable.html <code>Foldable</code> documentation] for<br />
more details on any of these functions.<br />
<br />
Note that the <code>Foldable</code> operations always forget the structure of<br />
the container being folded. If we start with a container of type <code>t a</code> for some <code>Foldable t</code>, then <code>t</code> will never appear in the output<br />
type of any operations defined in the <code>Foldable</code> module. Many times<br />
this is exactly what we want, but sometimes we would like to be able<br />
to generically traverse a container while preserving its<br />
structure—and this is exactly what the <code>Traversable</code> class provides,<br />
which will be discussed in the next section.<br />
<br />
{{Exercises|<br />
# Implement <code>toList :: Foldable f {{=}}> f a -> [a]</code>.<br />
# Pick some of the following functions to implement: <code>concat</code>, <code>concatMap</code>, <code>and</code>, <code>or</code>, <code>any</code>, <code>all</code>, <code>sum</code>, <code>product</code>, <code>maximum</code>(<code>By</code>), <code>minimum</code>(<code>By</code>), <code>elem</code>, <code>notElem</code>, and <code>find</code>. Figure out how they generalize to <code>Foldable</code> and come up with elegant implementations using <code>fold</code> or <code>foldMap</code> along with appropriate <code>Monoid</code> instances.<br />
}}<br />
<br />
==Foldable actually isn't==<br />
<br />
The generic term "fold" is often used to refer to the more technical concept of [[Catamorphisms|catamorphism]]. Intuitively, given a way to summarize "one level of structure" (where recursive subterms have already been replaced with their summaries), a catamorphism can summarize an entire recursive structure. It is important to realize that <code>Foldable</code> does <i>not</i> correspond to catamorphisms, but to something weaker. In particular, <code>Foldable</code> allows observing only the left-right order of elements within a structure, not the actual structure itself. Put another way, every use of <code>Foldable</code> can be expressed in terms of <code>toList</code>. For example, <code>fold</code> itself is equivalent to <code>mconcat . toList</code>.<br />
<br />
This is sufficient for many tasks, but not all. For example, consider trying to compute the depth of a <code>Tree</code>: try as we might, there is no way to implement it using <code>Foldable</code>. However, it <i>can</i> be implemented as a catamorphism.<br />
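That depth computation can be sketched as a direct structural recursion (a catamorphism) over the <code>Tree</code> type from the earlier examples:<br />
<br />
<haskell>
data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)

-- depth uses the results for *both* subtrees at each Node --
-- information that Foldable's left-right flattening discards.
depth :: Tree a -> Int
depth Empty        = 0
depth (Leaf _)     = 1
depth (Node l _ r) = 1 + max (depth l) (depth r)
</haskell>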
<br />
==Further reading==<br />
<br />
The <code>Foldable</code> class had its genesis in [http://www.soi.city.ac.uk/~ross/papers/Applicative.html McBride and Paterson’s paper]<br />
introducing <code>Applicative</code>, although it has<br />
been fleshed out quite a bit from the form in the paper.<br />
<br />
An interesting use of <code>Foldable</code> (as well as <code>Traversable</code>) can be<br />
found in Janis Voigtländer’s paper [http://doi.acm.org/10.1145/1480881.1480904 Bidirectionalization for free!].<br />
<br />
=Traversable=<br />
<br />
==Definition==<br />
<br />
The <code>Traversable</code> type class, defined in the <code>Data.Traversable</code><br />
module ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Data-Traversable.html haddock]), is:<br />
<br />
<haskell><br />
class (Functor t, Foldable t) => Traversable t where<br />
  traverse  :: Applicative f => (a -> f b) -> t a -> f (t b)<br />
  sequenceA :: Applicative f => t (f a) -> f (t a)<br />
  mapM      :: Monad m => (a -> m b) -> t a -> m (t b)<br />
  sequence  :: Monad m => t (m a) -> m (t a)<br />
</haskell><br />
<br />
As you can see, every <code>Traversable</code> is also a foldable functor. Like<br />
<code>Foldable</code>, there is a lot in this type class, but making instances is<br />
actually rather easy: one need only implement <code>traverse</code> or<br />
<code>sequenceA</code>; the other methods all have default implementations in<br />
terms of these functions. A good exercise is to figure out what the default<br />
implementations should be: given either <code>traverse</code> or <code>sequenceA</code>, how<br />
would you define the other three methods? (Hint for <code>mapM</code>:<br />
<code>Control.Applicative</code> exports the <code>WrapMonad</code> newtype, which makes any<br />
<code>Monad</code> into an <code>Applicative</code>. The <code>sequence</code> function can be implemented in terms<br />
of <code>mapM</code>.)<br />
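If you want to check your answer to that exercise, here is one plausible set of definitions (a sketch; the primed names are invented, and the library's actual defaults may differ):<br />
<br />
<haskell>
import Control.Applicative (Applicative, WrappedMonad(..))
import Data.Traversable (Traversable, traverse, sequenceA)

-- sequenceA in terms of traverse, and vice versa:
sequenceA' :: (Traversable t, Applicative f) => t (f a) -> f (t a)
sequenceA' = traverse id

traverse' :: (Traversable t, Applicative f) => (a -> f b) -> t a -> f (t b)
traverse' g = sequenceA . fmap g

-- mapM via WrapMonad, which makes any Monad into an Applicative:
mapM' :: (Traversable t, Monad m) => (a -> m b) -> t a -> m (t b)
mapM' g = unwrapMonad . traverse (WrapMonad . g)
</haskell>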
<br />
==Intuition==<br />
<br />
The key method of the <code>Traversable</code> class, and the source of its<br />
unique power, is <code>sequenceA</code>. Consider its type:<br />
<haskell><br />
sequenceA :: Applicative f => t (f a) -> f (t a)<br />
</haskell><br />
This answers the fundamental question: when can we commute two<br />
functors? For example, can we turn a tree of lists into a list of<br />
trees?<br />
<br />
The ability to compose two monads depends crucially on this ability to<br />
commute functors. Intuitively, if we want to build a composed monad<br />
<code>M a = m (n a)</code> out of monads <code>m</code> and <code>n</code>, then to be able to<br />
implement <code>join :: M (M a) -> M a</code>, that is,<br />
<code>join :: m (n (m (n a))) -> m (n a)</code>, we have to be able to commute<br />
the <code>n</code> past the <code>m</code> to get <code>m (m (n (n a)))</code>, and then we can use the<br />
<code>join</code>s for <code>m</code> and <code>n</code> to produce something of type <code>m (n a)</code>. See<br />
[http://web.cecs.pdx.edu/~mpj/pubs/springschool.html Mark Jones’s paper] for more details.<br />
<br />
Alternatively, looking at the type of <code>traverse</code>,<br />
<haskell><br />
traverse :: Applicative f => (a -> f b) -> t a -> f (t b)<br />
</haskell><br />
leads us to view <code>Traversable</code> as a generalization of <code>Functor</code>. <code>traverse</code> is an "effectful <code>fmap</code>": it allows us to map over a structure of type <code>t a</code>, applying a function to every element of type <code>a</code> in order to produce a new structure of type <code>t b</code>; but along the way the function may have some effects (captured by the applicative functor <code>f</code>).<br />
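The "effectful <code>fmap</code>" reading is easy to see with <code>Maybe</code> as the applicative functor: if the function fails on any element, the whole traversal fails (a small sketch):<br />
<br />
<haskell>
-- halve succeeds only on even numbers.
halve :: Int -> Maybe Int
halve n = if even n then Just (n `div` 2) else Nothing

allHalved :: [Int] -> Maybe [Int]
allHalved = traverse halve
-- allHalved [2,4,6] == Just [1,2,3]
-- allHalved [2,3,6] == Nothing
</haskell>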
<br />
{{Exercises|<br />
# There are at least two natural ways to turn a tree of lists into a list of trees. What are they, and why?<br />
# Give a natural way to turn a list of trees into a tree of lists.<br />
# What is the type of <code>traverse . traverse</code>? What does it do?<br />
}}<br />
<br />
==Instances and examples==<br />
<br />
What’s an example of a <code>Traversable</code> instance?<br />
The following code shows an example instance for the same<br />
<code>Tree</code> type used as an example in the previous <code>Foldable</code> section. It<br />
is instructive to compare this instance with a <code>Functor</code> instance for<br />
<code>Tree</code>, which is also shown.<br />
<br />
<haskell><br />
data Tree a = Empty | Leaf a | Node (Tree a) a (Tree a)<br />
<br />
instance Traversable Tree where<br />
  traverse g Empty        = pure Empty<br />
  traverse g (Leaf x)     = Leaf <$> g x<br />
  traverse g (Node l x r) = Node <$> traverse g l<br />
                                 <*> g x<br />
                                 <*> traverse g r<br />
<br />
instance Functor Tree where<br />
  fmap g Empty        = Empty<br />
  fmap g (Leaf x)     = Leaf $ g x<br />
  fmap g (Node l x r) = Node (fmap g l)<br />
                             (g x)<br />
                             (fmap g r)<br />
</haskell><br />
<br />
It should be clear that the <code>Traversable</code> and <code>Functor</code> instances for<br />
<code>Tree</code> are almost identical; the only difference is that the <code>Functor</code><br />
instance involves normal function application, whereas the<br />
applications in the <code>Traversable</code> instance take place within an<br />
<code>Applicative</code> context, using <code>(<$>)</code> and <code>(<*>)</code>. In fact, this will<br />
be<br />
true for any type.<br />
<br />
Any <code>Traversable</code> functor is also <code>Foldable</code>, and a <code>Functor</code>. We can see<br />
this not only from the class declaration, but by the fact that we can<br />
implement the methods of both classes given only the <code>Traversable</code><br />
methods.<br />
<br />
The standard libraries provide a number of <code>Traversable</code> instances,<br />
including instances for <code>[]</code>, <code>Maybe</code>, <code>Map</code>, <code>Tree</code>, and <code>Sequence</code>.<br />
Notably, <code>Set</code> is not <code>Traversable</code>, although it is <code>Foldable</code>.<br />
<br />
{{Exercises|<br />
# Implement <code>fmap</code> and <code>foldMap</code> using only the <code>Traversable</code> methods. (Note that the <code>Traversable</code> module provides these implementations as <code>fmapDefault</code> and <code>foldMapDefault</code>.)<br />
}}<br />
<br />
==Laws==<br />
<br />
Any instance of <code>Traversable</code> must satisfy the following two laws, where <code>Identity</code> is the identity functor (as defined in the [http://hackage.haskell.org/packages/archive/transformers/latest/doc/html/Data-Functor-Identity.html <code>Data.Functor.Identity</code> module] from the <code>transformers</code> package), and <code>Compose</code> wraps the composition of two functors (as defined in [http://hackage.haskell.org/packages/archive/transformers/0.3.0.0/doc/html/Data-Functor-Compose.html <code>Data.Functor.Compose</code>]):<br />
<br />
# <code>traverse Identity = Identity</code><br />
# <code>traverse (Compose . fmap g . f) = Compose . fmap (traverse g) . traverse f</code><br />
<br />
The first law essentially says that traversals cannot make up arbitrary effects. The second law explains how doing two traversals in sequence can be collapsed to a single traversal.<br />
<br />
Additionally, suppose <code>eta</code> is an "<code>Applicative</code> morphism", that is,<br />
<haskell><br />
eta :: forall a f g. (Applicative f, Applicative g) => f a -> g a<br />
</haskell><br />
and <code>eta</code> preserves the <code>Applicative</code> operations: <code>eta (pure x) = pure x</code> and <code>eta (x <*> y) = eta x <*> eta y</code>. Then, by parametricity, any instance of <code>Traversable</code> satisfying the above two laws will also satisfy <code>eta . traverse f = traverse (eta . f)</code>.<br />
<br />
==Further reading==<br />
<br />
The <code>Traversable</code> class also had its genesis in [http://www.soi.city.ac.uk/~ross/papers/Applicative.html McBride and Paterson’s <code>Applicative</code> paper],<br />
and is described in more detail in Gibbons and Oliveira, [http://www.comlab.ox.ac.uk/jeremy.gibbons/publications/iterator.pdf The Essence of the Iterator Pattern],<br />
which also contains a wealth of references to related work.<br />
<br />
<code>Traversable</code> forms a core component of Edward Kmett's [http://hackage.haskell.org/package/lens lens library]. Watching [https://vimeo.com/56063074 Edward's talk on the subject] is a highly recommended way to gain better insight into <code>Traversable</code>, <code>Foldable</code>, <code>Applicative</code>, and many other things besides.<br />
<br />
For references on the <code>Traversable</code> laws, see Russell O'Connor's [http://article.gmane.org/gmane.comp.lang.haskell.libraries/17778 mailing list post] (and subsequent thread).<br />
<br />
=Category=<br />
<br />
<code>Category</code> is a relatively recent addition to the Haskell standard libraries. It generalizes the notion of function composition to general “morphisms”.<br />
<br />
{{note|GHC 7.6.1 changed its rules regarding types and type variables. Now, any operator at the type level is treated as a type ''constructor'' rather than a type ''variable''; prior to GHC 7.6.1 it was possible to use <code>(~&gt;)</code> instead of <code>`arr`</code>. For more information, see [http://thread.gmane.org/gmane.comp.lang.haskell.glasgow.user/21350 the discussion on the GHC-users mailing list]. For a new approach to nice arrow notation that works with GHC 7.6.1, see [http://article.gmane.org/gmane.comp.lang.haskell.glasgow.user/22615 this message] and also [http://article.gmane.org/gmane.comp.lang.haskell.glasgow.user/22616 this message] from Edward Kmett, though for simplicity I haven't adopted it here.}}<br />
The definition of the <code>Category</code> type class (from<br />
<code>Control.Category</code>; [http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Category.html haddock]) is shown below. For ease of reading, note that I have used an infix type variable <code>`arr`</code>, in parallel with the infix function type constructor <code>(->)</code>. {{noteref}} This syntax is not part of Haskell 2010. The second definition shown is the one used in the standard libraries. For the remainder of this document, I will use the infix type constructor <code>`arr`</code> for <code>Category</code> as well as <code>Arrow</code>.<br />
<br />
<haskell><br />
class Category arr where<br />
  id  :: a `arr` a<br />
  (.) :: (b `arr` c) -> (a `arr` b) -> (a `arr` c)<br />
<br />
-- The same thing, with a normal (prefix) type constructor<br />
class Category cat where<br />
  id  :: cat a a<br />
  (.) :: cat b c -> cat a b -> cat a c<br />
</haskell><br />
<br />
Note that an instance of <code>Category</code> should be a type constructor which takes two type arguments, that is, something of kind <code>* -> * -> *</code>. It is instructive to imagine the type constructor variable <code>cat</code> replaced by the function constructor <code>(->)</code>: indeed, in this case we recover precisely the familiar identity function <code>id</code> and function composition operator <code>(.)</code> defined in the standard <code>Prelude</code>.<br />
<br />
Of course, the <code>Category</code> module provides exactly such an instance of<br />
<code>Category</code> for <code>(->)</code>. But it also provides one other instance, shown below, which should be familiar from the previous discussion of the <code>Monad</code> laws. <code>Kleisli m a b</code>, as defined in the <code>Control.Arrow</code> module, is just a <code>newtype</code> wrapper around <code>a -> m b</code>.<br />
<br />
<haskell><br />
newtype Kleisli m a b = Kleisli { runKleisli :: a -> m b }<br />
<br />
instance Monad m => Category (Kleisli m) where<br />
  id = Kleisli return<br />
  Kleisli g . Kleisli h = Kleisli (h >=> g)<br />
</haskell><br />
<br />
The only law that <code>Category</code> instances should satisfy is that <code>id</code> and <code>(.)</code> should form a monoid—that is, <code>id</code> should be the identity of <code>(.)</code>, and <code>(.)</code> should be associative.<br />
<br />
Finally, the <code>Category</code> module exports two additional operators:<br />
<code>(<<<)</code>, which is just a synonym for <code>(.)</code>, and <code>(>>>)</code>, which is <code>(.)</code> with its arguments reversed. (In previous versions of the libraries, these operators were defined as part of the <code>Arrow</code> class.)<br />
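<br />
As a quick illustration (a sketch, not part of the original text), both instances can be composed with <code>(>>>)</code>; <code>safeHead</code> here is a hypothetical helper used to build <code>Kleisli</code> arrows:<br />

```haskell
import Control.Category ((>>>))
import Control.Arrow (Kleisli(..))

-- A hypothetical partial accessor, lifted into a Kleisli arrow below.
safeHead :: [a] -> Maybe a
safeHead []    = Nothing
safeHead (x:_) = Just x

main :: IO ()
main = do
  -- Ordinary functions, composed left-to-right:
  print (((+1) >>> show) (41 :: Int))                        -- "42"
  -- Kleisli arrows: composition threads the Maybe effect via (>=>):
  print (runKleisli (Kleisli safeHead >>> Kleisli safeHead)
                    [[1,2],[3]] :: Maybe Int)                -- Just 1
```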
<br />
==Further reading==<br />
<br />
The name <code>Category</code> is a bit misleading, since the <code>Category</code> class cannot represent arbitrary categories, but only categories whose objects are objects of <code>Hask</code>, the category of Haskell types. For a more general treatment of categories within Haskell, see the [http://hackage.haskell.org/package/category-extras category-extras package]. For more about category theory in general, see the excellent [http://en.wikibooks.org/wiki/Haskell/Category_theory Haskell wikibook page],<br />
[http://books.google.com/books/about/Category_theory.html?id=-MCJ6x2lC7oC Steve Awodey’s new book], Benjamin Pierce’s [http://books.google.com/books/about/Basic_category_theory_for_computer_scien.html?id=ezdeaHfpYPwC Basic category theory for computer scientists], or [http://folli.loria.fr/cds/1999/esslli99/courses/barr-wells.html Barr and Wells’s category theory lecture notes]. [http://dekudekuplex.wordpress.com/2009/01/19/motivating-learning-category-theory-for-non-mathematicians/ Benjamin Russell’s blog post]<br />
is another good source of motivation and category theory links. You certainly don’t need to know any category theory to be a successful and productive Haskell programmer, but it does lend itself to much deeper appreciation of Haskell’s underlying theory.<br />
<br />
=Arrow=<br />
<br />
The <code>Arrow</code> class represents another abstraction of computation, in a<br />
similar vein to <code>Monad</code> and <code>Applicative</code>. However, unlike <code>Monad</code><br />
and <code>Applicative</code>, whose types only reflect their output, the type of<br />
an <code>Arrow</code> computation reflects both its input and output. Arrows<br />
generalize functions: if <code>arr</code> is an instance of <code>Arrow</code>, a value of<br />
type <code>b `arr` c</code> can be thought of as a computation which takes values of<br />
type <code>b</code> as input, and produces values of type <code>c</code> as output. In the<br />
<code>(->)</code> instance of <code>Arrow</code> this is just a pure function; in general, however,<br />
an arrow may represent some sort of “effectful” computation.<br />
<br />
==Definition==<br />
<br />
The definition of the <code>Arrow</code> type class, from<br />
<code>Control.Arrow</code> ([http://www.haskell.org/ghc/docs/latest/html/libraries/base/Control-Arrow.html haddock]), is:<br />
<br />
<haskell><br />
class Category arr => Arrow arr where<br />
  arr    :: (b -> c) -> (b `arr` c)<br />
  first  :: (b `arr` c) -> ((b, d) `arr` (c, d))<br />
  second :: (b `arr` c) -> ((d, b) `arr` (d, c))<br />
  (***)  :: (b `arr` c) -> (b' `arr` c') -> ((b, b') `arr` (c, c'))<br />
  (&&&)  :: (b `arr` c) -> (b `arr` c') -> (b `arr` (c, c'))<br />
</haskell><br />
<br />
{{note|In versions of the <code>base</code><br />
package prior to version 4, there is no <code>Category</code> class, and the<br />
<code>Arrow</code> class includes the arrow composition operator <code>(>>>)</code>. It<br />
also includes <code>pure</code> as a synonym for <code>arr</code>, but this was removed<br />
since it conflicts with the <code>pure</code> from <code>Applicative</code>.}}<br />
<br />
The first thing to note is the <code>Category</code> class constraint, which<br />
means that we get identity arrows and arrow composition for free:<br />
given two arrows <code>g :: b `arr` c</code> and <code>h :: c `arr` d</code>, we can form their<br />
composition <code>g >>> h :: b `arr` d</code> {{noteref}}.<br />
<br />
As should be a familiar pattern by now, the only methods which must be<br />
defined when writing a new instance of <code>Arrow</code> are <code>arr</code> and <code>first</code>;<br />
the other methods have default definitions in terms of these, but are<br />
included in the <code>Arrow</code> class so that they can be overridden with more<br />
efficient implementations if desired.<br />
<br />
==Intuition==<br />
<br />
Let’s look at each of the arrow methods in turn. [http://www.haskell.org/arrows/ Ross Paterson’s web page on arrows] has nice diagrams which can help<br />
build intuition.<br />
<br />
* The <code>arr</code> function takes any function <code>b -> c</code> and turns it into a generalized arrow <code>b `arr` c</code>. The <code>arr</code> method justifies the claim that arrows generalize functions, since it says that we can treat any function as an arrow. It is intended that the arrow <code>arr g</code> is “pure” in the sense that it only computes <code>g</code> and has no “effects” (whatever that might mean for any particular arrow type).<br />
<br />
* The <code>first</code> method turns any arrow from <code>b</code> to <code>c</code> into an arrow from <code>(b,d)</code> to <code>(c,d)</code>. The idea is that <code>first g</code> uses <code>g</code> to process the first element of a tuple, and lets the second element pass through unchanged. For the function instance of <code>Arrow</code>, of course, <code>first g (x,y) = (g x, y)</code>.<br />
<br />
* The <code>second</code> function is similar to <code>first</code>, but with the elements of the tuples swapped. Indeed, it can be defined in terms of <code>first</code> using an auxiliary function <code>swap</code>, defined by <code>swap (x,y) = (y,x)</code>.<br />
<br />
* The <code>(***)</code> operator is “parallel composition” of arrows: it takes two arrows and makes them into one arrow on tuples, which has the behavior of the first arrow on the first element of a tuple, and the behavior of the second arrow on the second element. The mnemonic is that <code>g *** h</code> is the ''product'' (hence <code>*</code>) of <code>g</code> and <code>h</code>. For the function instance of <code>Arrow</code>, we define <code>(g *** h) (x,y) = (g x, h y)</code>. The default implementation of <code>(***)</code> is in terms of <code>first</code>, <code>second</code>, and sequential arrow composition <code>(>>>)</code>. The reader may also wish to think about how to implement <code>first</code> and <code>second</code> in terms of <code>(***)</code>.<br />
<br />
* The <code>(&&&)</code> operator is “fanout composition” of arrows: it takes two arrows <code>g</code> and <code>h</code> and makes them into a new arrow <code>g &&& h</code> which supplies its input as the input to both <code>g</code> and <code>h</code>, returning their results as a tuple. The mnemonic is that <code>g &&& h</code> performs both <code>g</code> ''and'' <code>h</code> (hence <code>&</code>) on its input. For functions, we define <code>(g &&& h) x = (g x, h x)</code>.<br />
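<br />
For the <code>(->)</code> instance, all four combinators can be tried out directly; this small sketch (not part of the original text) just evaluates each one on a concrete value:<br />

```haskell
import Control.Arrow (first, second, (***), (&&&))

main :: IO ()
main = do
  print (first  (+1) (3 :: Int, "x"))           -- (4,"x")
  print (second not  (3 :: Int, True))          -- (3,False)
  print (((+1) *** show) (3 :: Int, 4 :: Int))  -- (4,"4")
  print (((+1) &&& show) (3 :: Int))            -- (4,"3")
```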
<br />
==Instances==<br />
<br />
The <code>Arrow</code> library itself only provides two <code>Arrow</code> instances, both<br />
of which we have already seen: <code>(->)</code>, the normal function<br />
constructor, and <code>Kleisli m</code>, which makes functions of<br />
type <code>a -> m b</code> into <code>Arrow</code>s for any <code>Monad m</code>. These instances are:<br />
<br />
<haskell><br />
instance Arrow (->) where<br />
  arr g = g<br />
  first g (x,y) = (g x, y)<br />
<br />
newtype Kleisli m a b = Kleisli { runKleisli :: a -> m b }<br />
<br />
instance Monad m => Arrow (Kleisli m) where<br />
  arr f = Kleisli (return . f)<br />
  first (Kleisli f) = Kleisli (\ ~(b,d) -> do c <- f b<br />
                                              return (c,d) )<br />
</haskell><br />
<br />
==Laws==<br />
<br />
{{note|See [http://dx.doi.org/10.1016/S0167-6423(99)00023-4 John Hughes: Generalising monads to arrows]; [http://homepages.inf.ed.ac.uk/wadler/papers/arrows/arrows.pdf Sam Lindley, Philip Wadler, Jeremy Yallop: The arrow calculus]; [http://www.soi.city.ac.uk/~ross/papers/fop.html Ross Paterson: Programming with Arrows].}}<br />
<br />
There are quite a few laws that instances of <code>Arrow</code> should<br />
satisfy {{noteref}}:<br />
<br />
<haskell><br />
arr id = id<br />
arr (h . g) = arr g >>> arr h<br />
first (arr g) = arr (g *** id)<br />
first (g >>> h) = first g >>> first h<br />
first g >>> arr (id *** h) = arr (id *** h) >>> first g<br />
first g >>> arr fst = arr fst >>> g<br />
first (first g) >>> arr assoc = arr assoc >>> first g<br />
<br />
assoc ((x,y),z) = (x,(y,z))<br />
</haskell><br />
<br />
Note that this version of the laws is slightly different from the laws given in the<br />
first two references above, since several of the laws have now been<br />
subsumed by the <code>Category</code> laws (in particular, the requirements that<br />
<code>id</code> is the identity arrow and that <code>(>>>)</code> is associative). The laws<br />
shown here follow those in Paterson’s Programming with Arrows, which uses the<br />
<code>Category</code> class.<br />
<br />
{{note|Unless category-theory-induced insomnolence is your cup of tea.}}<br />
<br />
The reader is advised not to lose too much sleep over the <code>Arrow</code><br />
laws {{noteref}}, since it is not essential to understand them in order to<br />
program with arrows. There are also laws that <code>ArrowChoice</code>,<br />
<code>ArrowApply</code>, and <code>ArrowLoop</code> instances should satisfy; the interested<br />
reader should consult [http://www.soi.city.ac.uk/~ross/papers/fop.html Paterson: Programming with Arrows].<br />
<br />
==ArrowChoice==<br />
<br />
Computations built using the <code>Arrow</code> class, like those built using<br />
the <code>Applicative</code> class, are rather inflexible: the structure of the computation<br />
is fixed at the outset, and there is no ability to choose between<br />
alternate execution paths based on intermediate results.<br />
The <code>ArrowChoice</code> class provides exactly such an ability:<br />
<br />
<haskell><br />
class Arrow arr => ArrowChoice arr where<br />
  left  :: (b `arr` c) -> (Either b d `arr` Either c d)<br />
  right :: (b `arr` c) -> (Either d b `arr` Either d c)<br />
  (+++) :: (b `arr` c) -> (b' `arr` c') -> (Either b b' `arr` Either c c')<br />
  (|||) :: (b `arr` d) -> (c `arr` d) -> (Either b c `arr` d)<br />
</haskell><br />
<br />
A comparison of <code>ArrowChoice</code> to <code>Arrow</code> will reveal a striking<br />
parallel between <code>left</code>, <code>right</code>, <code>(+++)</code>, <code>(|||)</code> and <code>first</code>,<br />
<code>second</code>, <code>(***)</code>, <code>(&&&)</code>, respectively. Indeed, they are dual:<br />
<code>first</code>, <code>second</code>, <code>(***)</code>, and <code>(&&&)</code> all operate on product types<br />
(tuples), and <code>left</code>, <code>right</code>, <code>(+++)</code>, and <code>(|||)</code> are the<br />
corresponding operations on sum types. In general, these operations<br />
create arrows whose inputs are tagged with <code>Left</code> or <code>Right</code>, and can<br />
choose how to act based on these tags.<br />
<br />
* If <code>g</code> is an arrow from <code>b</code> to <code>c</code>, then <code>left g</code> is an arrow from <code>Either b d</code> to <code>Either c d</code>. On inputs tagged with <code>Left</code>, the <code>left g</code> arrow has the behavior of <code>g</code>; on inputs tagged with <code>Right</code>, it behaves as the identity.<br />
<br />
* The <code>right</code> function, of course, is the mirror image of <code>left</code>. The arrow <code>right g</code> has the behavior of <code>g</code> on inputs tagged with <code>Right</code>.<br />
<br />
* The <code>(+++)</code> operator performs “multiplexing”: <code>g +++ h</code> behaves as <code>g</code> on inputs tagged with <code>Left</code>, and as <code>h</code> on inputs tagged with <code>Right</code>. The tags are preserved. The <code>(+++)</code> operator is the ''sum'' (hence <code>+</code>) of two arrows, just as <code>(***)</code> is the product.<br />
<br />
* The <code>(|||)</code> operator is “merge” or “fanin”: the arrow <code>g ||| h</code> behaves as <code>g</code> on inputs tagged with <code>Left</code>, and <code>h</code> on inputs tagged with <code>Right</code>, but the tags are discarded (hence, <code>g</code> and <code>h</code> must have the same output type). The mnemonic is that <code>g ||| h</code> performs either <code>g</code> ''or'' <code>h</code> on its input.<br />
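<br />
Again the <code>(->)</code> instance lets us observe each combinator directly; this sketch (not part of the original text) shows how the <code>Left</code>/<code>Right</code> tags route values:<br />

```haskell
import Control.Arrow (left, (+++), (|||))

main :: IO ()
main = do
  print (left (+1) (Left 3    :: Either Int String))           -- Left 4
  print (left (+1) (Right "a" :: Either Int String))           -- Right "a"
  print (((+1) +++ length) (Right "abc" :: Either Int String)) -- Right 3
  print ((show ||| id) (Left 42 :: Either Int String))         -- "42"
```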
<br />
The <code>ArrowChoice</code> class allows computations to choose among a finite number of execution paths, based on intermediate results. The possible<br />
execution paths must be known in advance, and explicitly assembled with <code>(+++)</code> or <code>(|||)</code>. However, sometimes more flexibility is<br />
needed: we would like to be able to ''compute'' an arrow from intermediate results, and use this computed arrow to continue the computation. This is the power given to us by <code>ArrowApply</code>.<br />
<br />
==ArrowApply==<br />
<br />
The <code>ArrowApply</code> type class is:<br />
<br />
<haskell><br />
class Arrow arr => ArrowApply arr where<br />
  app :: (b `arr` c, b) `arr` c<br />
</haskell><br />
<br />
If we have computed an arrow as the output of some previous<br />
computation, then <code>app</code> allows us to apply that arrow to an input,<br />
producing its output as the output of <code>app</code>. As an exercise, the<br />
reader may wish to use <code>app</code> to implement an alternative “curried”<br />
version, <code>app2 :: b `arr` ((b `arr` c) `arr` c)</code>.<br />
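<br />
Without spoiling the exercise, the extra power of <code>app</code> can be seen with the <code>(->)</code> instance: the arrow to run can itself be ''computed'' from the input. This is a sketch, not part of the original text; <code>pickAndRun</code> and <code>pick</code> are hypothetical helpers:<br />

```haskell
import Control.Arrow (app, arr, (&&&), (>>>))

-- Choose which arrow to run based on the input itself: pair the input
-- with a computed arrow, then let 'app' apply the arrow to the input.
pickAndRun :: Int -> Int
pickAndRun = (arr pick &&& arr id) >>> app
  where pick n = if even n then (`div` 2) else (* 3)

main :: IO ()
main = do
  print (app ((+1), 41 :: Int))  -- 42
  print (pickAndRun 10)          -- 5
  print (pickAndRun 3)           -- 9
```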
<br />
This notion of being able to ''compute'' a new computation<br />
may sound familiar:<br />
this is exactly what the monadic bind operator <code>(>>=)</code> does. It<br />
should not particularly come as a surprise that <code>ArrowApply</code> and<br />
<code>Monad</code> are exactly equivalent in expressive power. In particular,<br />
<code>Kleisli m</code> can be made an instance of <code>ArrowApply</code>, and any instance<br />
of <code>ArrowApply</code> can be made a <code>Monad</code> (via the <code>newtype</code> wrapper<br />
<code>ArrowMonad</code>). As an exercise, the reader may wish to try<br />
implementing these instances:<br />
<br />
<haskell><br />
instance Monad m => ArrowApply (Kleisli m) where<br />
  app = -- exercise<br />
<br />
newtype ArrowApply a => ArrowMonad a b = ArrowMonad (a () b)<br />
<br />
instance ArrowApply a => Monad (ArrowMonad a) where<br />
  return = -- exercise<br />
  (ArrowMonad a) >>= k = -- exercise<br />
</haskell><br />
<br />
==ArrowLoop==<br />
<br />
The <code>ArrowLoop</code> type class is:<br />
<br />
<haskell><br />
class Arrow a => ArrowLoop a where<br />
  loop :: a (b, d) (c, d) -> a b c<br />
<br />
trace :: ((b,d) -> (c,d)) -> b -> c<br />
trace f b = let (c,d) = f (b,d) in c<br />
</haskell><br />
<br />
It describes arrows that can use recursion to compute results, and is<br />
used to desugar the <code>rec</code> construct in arrow notation (described<br />
below).<br />
<br />
Taken by itself, the type of the <code>loop</code> method does not seem to tell<br />
us much. Its intention, however, is a generalization of the <code>trace</code><br />
function which is also shown. The <code>d</code> component of the first arrow’s<br />
output is fed back in as its own input. In other words, the arrow<br />
<code>loop g</code> is obtained by recursively “fixing” the second component of<br />
the input to <code>g</code>.<br />
<br />
It can be a bit difficult to grok what the <code>trace</code> function is doing.<br />
How can <code>d</code> appear on the left and right sides of the <code>let</code>? Well,<br />
this is Haskell’s laziness at work. There is not space here for a<br />
full explanation; the interested reader is encouraged to study the<br />
standard <code>fix</code> function, and to read [http://www.soi.city.ac.uk/~ross/papers/fop.html Paterson’s arrow tutorial].<br />
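<br />
For the <code>(->)</code> instance of <code>ArrowLoop</code> (whose <code>loop</code> is essentially this <code>trace</code>), the feedback trick can be seen in action. The following is a sketch, not part of the original text; <code>tagWithSum</code> is a hypothetical example:<br />

```haskell
import Control.Arrow (loop)

-- Tag every element of a list with the sum of the whole list.  The sum
-- flows "backwards" through loop's second component: it is produced as
-- output and fed back in as input, which works because the sum does not
-- depend on the fed-back value and Haskell is lazy.
tagWithSum :: [Int] -> [(Int, Int)]
tagWithSum = loop (\(xs, total) -> (map (\x -> (x, total)) xs, sum xs))

main :: IO ()
main = print (tagWithSum [1,2,3])  -- [(1,6),(2,6),(3,6)]
```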
<br />
==Arrow notation==<br />
<br />
Programming directly with the arrow combinators can be painful,<br />
especially when writing complex computations which need to retain<br />
simultaneous reference to a number of intermediate results. With<br />
nothing but the arrow combinators, such intermediate results must be<br />
kept in nested tuples, and it is up to the programmer to remember<br />
which intermediate results are in which components, and to swap,<br />
reassociate, and generally mangle tuples as necessary. This problem<br />
is solved by the special arrow notation supported by GHC, similar to<br />
<code>do</code> notation for monads, that allows names to be assigned to<br />
intermediate results while building up arrow computations. An example<br />
arrow implemented using arrow notation, taken from<br />
Paterson, is:<br />
<br />
<haskell><br />
class ArrowLoop arr => ArrowCircuit arr where<br />
  delay :: b -> (b `arr` b)<br />
<br />
counter :: ArrowCircuit arr => Bool `arr` Int<br />
counter = proc reset -> do<br />
  rec output <- idA -< if reset then 0 else next<br />
      next   <- delay 0 -< output + 1<br />
  idA -< output<br />
</haskell><br />
<br />
This arrow is intended to<br />
represent a recursively defined counter circuit with a reset line.<br />
<br />
There is not space here for a full explanation of arrow notation; the<br />
interested reader should consult<br />
[http://www.soi.city.ac.uk/~ross/papers/notation.html Paterson’s paper introducing the notation], or his later [http://www.soi.city.ac.uk/~ross/papers/fop.html tutorial which presents a simplified version].<br />
<br />
==Further reading==<br />
<br />
An excellent starting place for the student of arrows is the [http://www.haskell.org/arrows/ arrows web page], which contains an<br />
introduction and many references. Some key papers on arrows include<br />
Hughes’s original paper introducing arrows, [http://dx.doi.org/10.1016/S0167-6423(99)00023-4 Generalising monads to arrows], and [http://www.soi.city.ac.uk/~ross/papers/notation.html Paterson’s paper on arrow notation].<br />
<br />
Both Hughes and Paterson later wrote accessible tutorials intended for a broader<br />
audience: [http://www.soi.city.ac.uk/~ross/papers/fop.html Paterson: Programming with Arrows] and [http://www.cse.chalmers.se/~rjmh/afp-arrows.pdf Hughes: Programming with Arrows].<br />
<br />
Although Hughes’s goal in defining the <code>Arrow</code> class was to<br />
generalize <code>Monad</code>s, and it has been said that <code>Arrow</code> lies “between<br />
<code>Applicative</code> and <code>Monad</code>” in power, they are not directly<br />
comparable. The precise relationship remained in some confusion until<br />
[http://homepages.inf.ed.ac.uk/wadler/papers/arrows-and-idioms/arrows-and-idioms.pdf analyzed by Lindley, Wadler, and Yallop], who<br />
also invented a new calculus of arrows, based on the lambda calculus,<br />
which considerably simplifies the presentation of the arrow laws<br />
(see [http://homepages.inf.ed.ac.uk/wadler/papers/arrows/arrows.pdf The arrow calculus]). There is also a precise technical sense in which [http://just-bottom.blogspot.de/2010/04/programming-with-effects-story-so-far.html <code>Arrow</code> can be seen as the intersection of <code>Applicative</code> and <code>Category</code>].<br />
<br />
Some examples of <code>Arrow</code>s include [http://www.haskell.org/yampa/ Yampa], the<br />
[http://www.fh-wedel.de/~si/HXmlToolbox/ Haskell XML Toolkit], and the functional GUI library [[Grapefruit]].<br />
<br />
Some extensions to arrows have been explored; for example, the<br />
[http://www.cs.ru.nl/A.vanWeelden/bi-arrows/ <code>BiArrow</code>s of Alimarine et al.], for two-way instead of one-way<br />
computation.<br />
<br />
The Haskell wiki has [[Research papers/Monads and Arrows|links to many additional research papers relating to <code>Arrow</code>s]].<br />
<br />
=Comonad=<br />
<br />
The final type class we will examine is <code>Comonad</code>. The <code>Comonad</code> class<br />
is the categorical dual of <code>Monad</code>; that is, <code>Comonad</code> is like <code>Monad</code><br />
but with all the function arrows flipped. It is not actually in the<br />
standard Haskell libraries, but it has seen some interesting uses<br />
recently, so we include it here for completeness.<br />
<br />
==Definition==<br />
<br />
The <code>Comonad</code> type class, defined in the <code>Control.Comonad</code> module of<br />
the [http://hackage.haskell.org/package/comonad comonad library], is:<br />
<br />
<haskell><br />
class Functor w => Comonad w where<br />
  extract :: w a -> a<br />
<br />
  duplicate :: w a -> w (w a)<br />
  duplicate = extend id<br />
<br />
  extend :: (w a -> b) -> w a -> w b<br />
  extend f = fmap f . duplicate<br />
</haskell><br />
<br />
As you can see, <code>extract</code> is the dual of <code>return</code>, <code>duplicate</code> is the dual of <code>join</code>, and <code>extend</code> is the dual of <code>(=<<)</code>. The definition of <code>Comonad</code> is a bit redundant: the programmer may implement either <code>extend</code> or <code>duplicate</code>, and the other then has a default implementation in terms of it.<br />
<br />
A prototypical example of a <code>Comonad</code> instance is:<br />
<br />
<haskell><br />
-- Infinite lazy streams<br />
data Stream a = Cons a (Stream a)<br />
<br />
-- 'duplicate' is like the list function 'tails'<br />
-- 'extend' computes a new Stream from an old, where the element<br />
-- at position n is computed as a function of everything from<br />
-- position n onwards in the old Stream<br />
instance Comonad Stream where<br />
  extract (Cons x _) = x<br />
  duplicate s@(Cons x xs) = Cons s (duplicate xs)<br />
  extend g s@(Cons x xs)  = Cons (g s) (extend g xs)<br />
                       -- = fmap g (duplicate s)<br />
</haskell><br />
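<br />
To see <code>extend</code> at work, here is a self-contained sketch (not part of the original text): it defines a minimal local stand-in for the <code>Comonad</code> class rather than depending on the comonad package, and <code>nats</code>, <code>takeS</code>, and <code>sumNext2</code> are hypothetical helpers:<br />

```haskell
-- A minimal local stand-in for the Comonad class.
class Comonad w where
  extract :: w a -> a
  extend  :: (w a -> b) -> w a -> w b

data Stream a = Cons a (Stream a)

instance Comonad Stream where
  extract (Cons x _)     = x
  extend g s@(Cons _ xs) = Cons (g s) (extend g xs)

-- At each position, a function of "everything from here onwards":
-- here, the sum of the current element and the next one.
sumNext2 :: Stream Int -> Int
sumNext2 (Cons x (Cons y _)) = x + y

nats :: Stream Int
nats = go 0 where go n = Cons n (go (n + 1))

takeS :: Int -> Stream a -> [a]
takeS n (Cons x xs)
  | n <= 0    = []
  | otherwise = x : takeS (n - 1) xs

main :: IO ()
main = print (takeS 5 (extend sumNext2 nats))  -- [1,3,5,7,9]
```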
<br />
==Further reading==<br />
<br />
Dan Piponi explains in a blog post what [http://blog.sigfpe.com/2006/12/evaluating-cellular-automata-is.html cellular automata have to do with comonads]. In another blog post, Conal Elliott has examined [http://conal.net/blog/posts/functional-interactive-behavior/ a comonadic formulation of functional reactive programming]. Sterling Clover’s blog post [http://fmapfixreturn.wordpress.com/2008/07/09/comonads-in-everyday-life/ Comonads in everyday life] explains the relationship between comonads and zippers, and how comonads can be used to design a menu system for a web site.<br />
<br />
Uustalu and Vene have a number of papers exploring ideas related to comonads and functional programming:<br />
* [http://dx.doi.org/10.1016/j.entcs.2008.05.029 Comonadic Notions of Computation]<br />
* [http://www.ioc.ee/~tarmo/papers/sfp01-book.pdf The dual of substitution is redecoration] (Also available as [http://www.cs.ut.ee/~varmo/papers/sfp01-book.ps.gz ps.gz].)<br />
* [http://dx.doi.org/10.1016/j.ic.2005.08.005 Recursive coalgebras from comonads]<br />
* [http://www.fing.edu.uy/~pardo/papers/njc01.ps.gz Recursion schemes from comonads]<br />
* [http://cs.ioc.ee/~tarmo/papers/essence.pdf The Essence of Dataflow Programming].<br />
<br />
Gabriel Gonzalez's [http://www.haskellforall.com/2013/02/you-could-have-invented-comonads.html Comonads are objects] points out similarities between comonads and object-oriented programming.<br />
<br />
The [http://hackage.haskell.org/package/comonad-transformers comonad-transformers] package contains comonad transformers.<br />
<br />
=Acknowledgements=<br />
<br />
A special thanks to all of those who taught me about standard Haskell<br />
type classes and helped me develop good intuition for them,<br />
particularly Jules Bean (quicksilver), Derek Elkins (ddarius), Conal<br />
Elliott (conal), Cale Gibbard (Cale), David House, Dan Piponi<br />
(sigfpe), and Kevin Reid (kpreid).<br />
<br />
I also thank the many people who provided a mountain of helpful<br />
feedback and suggestions on a first draft of the Typeclassopedia: David Amos,<br />
Kevin Ballard, Reid Barton, Doug Beardsley, Joachim Breitner, Andrew<br />
Cave, David Christiansen, Gregory Collins, Mark Jason Dominus, Conal<br />
Elliott, Yitz Gale, George Giorgidze, Steven Grady, Travis Hartwell,<br />
Steve Hicks, Philip Hölzenspies, Edward Kmett, Eric Kow, Serge Le<br />
Huitouze, Felipe Lessa, Stefan Ljungstrand, Eric Macaulay, Rob MacAulay, Simon Meier,<br />
Eric Mertens, Tim Newsham, Russell O’Connor, Conrad Parker, Walt<br />
Rorie-Baety, Colin Ross, Tom Schrijvers, Aditya Siram, C. Smith,<br />
Martijn van Steenbergen, Joe Thornber, Jared Updike, Rob Vollmert,<br />
Andrew Wagner, Louis Wasserman, and Ashley Yakeley, as well as a few<br />
only known to me by their IRC nicks: b_jonas, maltem, tehgeekmeister,<br />
and ziman. I have undoubtedly omitted a few inadvertently, which in<br />
no way diminishes my gratitude.<br />
<br />
Finally, I would like to thank Wouter Swierstra for his fantastic work<br />
editing the Monad.Reader, and my wife Joyia for her patience during<br />
the process of writing the Typeclassopedia.<br />
<br />
=About the author=<br />
<br />
Brent Yorgey ([http://byorgey.wordpress.com/ blog], [http://www.cis.upenn.edu/~byorgey/ homepage]) is (as of November 2011) a fourth-year Ph.D. student in the [http://www.cis.upenn.edu/~plclub/ programming languages group] at the [http://www.upenn.edu University of Pennsylvania]. He enjoys teaching, creating EDSLs, playing Bach fugues, musing upon category theory, and cooking tasty lambda-treats for the denizens of #haskell.<br />
<br />
=Colophon=<br />
<br />
The Typeclassopedia was written by Brent Yorgey and initially published in March 2009. Painstakingly converted to wiki syntax by [[User:Geheimdienst]] in November 2011, after asking Brent’s permission.<br />
<br />
If something like this TeX to wiki syntax conversion ever needs to be done again, here are some vim commands that helped:<br />
<br />
* <nowiki>%s/\\section{\([^}]*\)}/=\1=/gc</nowiki><br />
* <nowiki>%s/\\subsection{\([^}]*\)}/==\1==/gc</nowiki><br />
* <nowiki>%s/^ *\\item /\r* /gc</nowiki><br />
* <nowiki>%s/---/—/gc</nowiki><br />
* <nowiki>%s/\$\([^$]*\)\$/<math>\1\\ <\/math>/gc</nowiki> ''Appending “\ ” forces images to be rendered. Otherwise, MediaWiki would go back and forth between one font for short <nowiki><math></nowiki> tags, and another more TeX-like font for longer tags (containing more than a few characters).''<br />
* <nowiki>%s/|\([^|]*\)|/<code>\1<\/code>/gc</nowiki><br />
* <nowiki>%s/\\dots/.../gc</nowiki><br />
* <nowiki>%s/^\\label{.*$//gc</nowiki><br />
* <nowiki>%s/\\emph{\([^}]*\)}/''\1''/gc</nowiki><br />
* <nowiki>%s/\\term{\([^}]*\)}/''\1''/gc</nowiki><br />
<br />
The biggest issue was taking the academic-paper-style citations and turning them into hyperlinks with an appropriate title and an appropriate target. In most cases there was an obvious thing to do (e.g. online PDFs of the cited papers or CiteSeer entries). Sometimes, however, it’s less clear and you might want to check the<br />
[[Media:Typeclassopedia.pdf|original Typeclassopedia PDF]]<br />
with the<br />
[http://code.haskell.org/~byorgey/TMR/Issue13/typeclassopedia.bib original bibliography file].<br />
<br />
To get all the citations into the main text, I first tried processing the source with TeX or LyX. This didn’t work due to missing packages that I couldn’t find, syntax errors, and my general ineptitude with TeX.<br />
<br />
I then went for the next best solution, which seemed to be extracting all instances of “\cite{something}” from the source and ''in that order'' pulling the referenced entries from the .bib file. This way you can go through the source file and sorted-references file in parallel, copying over what you need, without searching back and forth in the .bib file. I used:<br />
<br />
* <nowiki>egrep -o "\cite\{[^\}]*\}" ~/typeclassopedia.lhs | cut -c 6- | tr "," "\n" | tr -d "}" > /tmp/citations</nowiki><br />
* <nowiki>for i in $(cat /tmp/citations); do grep -A99 "$i" ~/typeclassopedia.bib|egrep -B99 '^\}$' -m1 ; done > ~/typeclasso-refs-sorted</nowiki><br />
<br />
[[Category:Applicative Functor]]<br />
[[Category:Arrow]]<br />
[[Category:Functor]]<br />
[[Category:Monad]]<br />
[[Category:Standard classes]]<br />
[[Category:Standard libraries]]<br />
[[Category:Standard packages]]<br />
[[Category:Standard types]]</div>Imzhttps://wiki.haskell.org/index.php?title=Hitchhikers_guide_to_Haskell&diff=39270Hitchhikers guide to Haskell2011-03-30T22:34:43Z<p>Imz: /* Chapter 3: Packing the knapsack and testing it with class, too (and don't forget your towel!) */ +wikilink</p>
<hr />
<div>== Preface: DON'T PANIC! ==<br />
[[Category:Tutorials]]<br />
Recent experiences from a few of my fellow C++/Java programmers<br />
indicate that they read various Haskell tutorials with "exponential<br />
speedup" (think about how TCP/IP session starts up). They start slow<br />
and cautious, but when they see that the first 3-5 pages do not<br />
contain "anything interesting" in terms of code and examples, they<br />
begin skipping paragraphs, then chapters, then whole pages, only to<br />
slow down - often to a complete halt - somewhere on page 50, finding<br />
themselves in the thick of concepts like "type classes", "type<br />
constructors", "monadic IO", at which point they usually panic, think<br />
of a perfectly rational excuse not to read any further, and<br />
happily forget this sad and scary encounter with Haskell (as human<br />
beings usually tend to forget sad and scary things).<br />
<br />
This text intends to introduce the reader to the practical aspects of Haskell<br />
from the very beginning (plans for the first chapters include: I/O, darcs,<br />
Parsec, QuickCheck, profiling and debugging, to mention a few). The reader<br />
is expected to know (where to find) at least the basics of Haskell: how to run<br />
"hugs" or "ghci", '''that layout is 2-dimensional''', etc. Other than that, we do<br />
not plan to take radical leaps, and will go one step at a time in order not to<br />
lose the reader along the way. So DON'T PANIC, take your towel with you and<br />
read along.<br />
<br />
'''In case you've skipped over the previous paragraph''', I would like<br />
to stress once again that Haskell is sensitive to indentation and<br />
spacing, so pay attention to that when copy-pasting or when manually<br />
aligning code in a text editor with proportional fonts.<br />
<br />
Oh, almost forgot: the author is very interested in ANY feedback. Drop him a line<br />
or a word (see [[User:Adept|Adept]] for contact info) or submit<br />
patches to the tutorial via darcs (<br />
[http://adept.linux.kiev.ua:8080/repos/hhgtth/ repository is here]) or directly to this<br />
Wiki. <br />
<br />
== Chapter 1: Ubiquitous "Hello world!" and other ways to do IO in Haskell ==<br />
<br />
Each chapter will be dedicated to one small real-life task which we will<br />
complete from the ground up.<br />
<br />
So here is the task for this chapter: in order to free up space on<br />
your hard drive for all the Haskell code you are going to write in the<br />
nearest future, you are going to archive some of the old and dusty<br />
information on CDs and DVDs. While CD (or DVD) burning itself is easy<br />
these days, it usually takes some (or quite a lot of) time to decide<br />
how to put several GB of digital photos on CD-Rs, when directories<br />
with images range from 10 to 300 MB in size, and you don't want to<br />
burn half-full (or half-empty) CD-Rs.<br />
<br />
So, the task is to write a program which will help us put a given<br />
collection of directories on the minimum possible amount of media,<br />
while packing the media as tightly as possible. Let's name this program<br />
"cd-fit".<br />
<br />
Oh. Wait. Let's do the usual "hello world" thing, before we forget about it,<br />
and then move on to more interesting things:<br />
<br />
<haskell><br />
-- Taken from 'hello.hs'<br />
-- From now on, a comment at the beginning of the code snippet<br />
-- will specify the file which contains the full program from<br />
-- which the snippet is taken. You can get the code from the darcs<br />
-- repository "http://adept.linux.kiev.ua:8080/repos/hhgtth" by issuing<br />
-- command "darcs get http://adept.linux.kiev.ua:8080/repos/hhgtth"<br />
module Main where<br />
main = putStrLn "Hello world!"<br />
</haskell><br />
<br />
Run it:<br />
<br />
$ runhaskell ./hello.hs<br />
Hello world!<br />
<br />
OK, we've done it. Move along now, nothing interesting here :)<br />
<br />
Any serious development must be done with the help of a version control<br />
system, and we will not make an exception. We will use the modern<br />
distributed version control system "darcs". "Modern" means that it is<br />
written in Haskell, "distributed" means that each working copy is<br />
a repository in itself.<br />
<br />
First, let's create an empty directory for all our code, and invoke<br />
"darcs init" there, which will create subdirectory "_darcs" to store<br />
all version-control-related stuff there.<br />
<br />
Fire up your favorite editor and create a new file called "cd-fit.hs"<br />
in our working directory. Now let's think for a moment about how our<br />
program will operate and express it in pseudocode:<br />
<br />
<haskell><br />
main = Read list of directories and their sizes.<br />
Decide how to fit them on CD-Rs.<br />
Print solution.<br />
</haskell><br />
<br />
Sounds reasonable? I thought so.<br />
<br />
Let's simplify our life a little and assume for now that we will<br />
compute directory sizes somewhere outside our program (for example,<br />
with "du -sb *") and read this information from stdin.<br />
Now let me convert all this to Haskell:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-1-1.hs'<br />
module Main where<br />
<br />
main = do input <- getContents<br />
          putStrLn ("DEBUG: got input " ++ input)<br />
          -- compute solution and print it<br />
</haskell><br />
<br />
Not really working, but pretty close to plain English, eh? Let's stop<br />
for a moment and look more closely at what's written here line-by-line<br />
<br />
Let's begin from the top:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-1-1.hs'<br />
input <- getContents<br />
</haskell><br />
<br />
This is an example of the Haskell syntax for doing IO (namely, input). This<br />
line is an instruction to read all the information available from stdin,<br />
return it as a single string, and bind it to the symbol "input", so we can<br />
process this string any way we want.<br />
<br />
How did I know that? Did I memorize all the functions by heart? Of course not!<br />
Each function has a type, which, along with the function's name, usually tells a<br />
lot about what the function will do.<br />
<br />
Let's fire up an interactive Haskell environment and examine this function<br />
up close:<br />
<br />
$ ghci<br />
___ ___ _<br />
/ _ \ /\ /\/ __(_)<br />
/ /_\// /_/ / / | | GHC Interactive, version 6.4.1, for Haskell 98.<br />
/ /_\\/ __ / /___| | http://www.haskell.org/ghc/<br />
\____/\/ /_/\____/|_| Type :? for help.<br />
<br />
Loading package base-1.0 ... linking ... done.<br />
Prelude> :type getContents<br />
getContents :: IO String<br />
Prelude> <br />
<br />
We see that "getContents" is a function without arguments, that will return<br />
"IO String". Prefix "IO" meant that this is an IO action. It will return<br />
String, when evaluated. Action will be evaluated as soon as we use "<-" to<br />
bind its result to some symbol.<br />
<br />
Note that "<-" is not a fancy way to assign value to variable. It is a way to<br />
evaluate (execute) IO actions, in other words - to actually do some I/O and<br />
return its result (if any). <br />
<br />
We can choose not to evaluate the action obtained from "getContents", but rather carry it around a bit and evaluate it later:<br />
<br />
<haskell><br />
let x = getContents<br />
-- 300 lines of code here<br />
input <- x<br />
</haskell><br />
<br />
So, as you see, IO actions can act like ordinary values. Suppose that we<br />
have built a list of IO actions and have found a way to execute them one by one.<br />
This would be a way to simulate imperative programming with its notion of<br />
"order of execution".<br />
<br />
Haskell allows you to do better than that. <br />
<br />
The standard language library (named "Prelude", by the way) provides<br />
us with lots of functions that return useful primitive IO actions. In<br />
order to combine them to produce even more complex actions, we use "do":<br />
<br />
<haskell><br />
c = do a <- someAction<br />
       b <- someOtherAction<br />
       print (bar b)<br />
       print (foo a)<br />
       putStrLn "done"<br />
</haskell><br />
<br />
Here we '''bind''' "c" to an action with the following "scenario":<br />
* '''evaluate''' action "someAction" and '''bind''' its result to "a"<br />
* then, '''evaluate''' "someOtherAction" and '''bind''' its result to "b"<br />
* then, process "b" with function "bar" and print result<br />
* then, process "a" with function "foo" and print result<br />
* then, print the word "done"<br />
<br />
When will all this actually be executed? Answer: as soon as we evaluate "c"<br />
using the "<-" (if it returns result, as "getContents" does) or just<br />
by using it as a function name (if it does not return a result, as "print"<br />
does):<br />
<br />
<haskell><br />
process = do putStrLn "Will do some processing"<br />
             c<br />
             putStrLn "Done"<br />
</haskell><br />
<br />
Notice that we took a bunch of functions ("someAction", "someOtherAction",<br />
"print", "putStrLn") and using "do" created from them a new function, which we<br />
bound to symbol "c". Now we could use "c" as a building block to produce an even<br />
more complex function, "process", and we could carry this on and on.<br />
Eventually, some of these actions will be mentioned in the code of the function<br />
"main", to which the ultimate, topmost IO action of any Haskell program is bound.<br />
<br />
When will the "main" be executed/evaluated/forced? As soon as we run the<br />
program. Read this twice and try to comprehend: <br />
<br />
''The execution of a Haskell program is an evaluation of the symbol "main" to<br />
which we have bound an IO action. Via evaluation we obtain the result of that<br />
action''. <br />
<br />
Readers familiar with advanced C++ or Java programming and that arcane body of<br />
knowledge named "OOP Design Patterns" might note that "building actions from<br />
actions" and "evaluating actions to get a result" is essentially the "Command"<br />
and "Composite" patterns combined. Good news: in Haskell you get them<br />
for all your IO, and you get them '''for free''' :)<br />
<br />
----<br />
'''Exercise:'''<br />
Consider the following code:<br />
<br />
<haskell><br />
-- Taken from 'exercise-1-1.hs'<br />
module Main where<br />
c = putStrLn "C!"<br />
<br />
combine before after =<br />
  do before<br />
     putStrLn "In the middle"<br />
     after<br />
<br />
main = do combine c c<br />
          let b = combine (putStrLn "Hello!") (putStrLn "Bye!")<br />
          let d = combine (b) (combine c c)<br />
          putStrLn "So long!"<br />
</haskell><br />
<br />
Notice how we carefully indent lines so that the source looks neat?<br />
Actually, Haskell code has to be aligned this way, or it will not<br />
compile. If you use tabulation to indent your sources, take into<br />
account that Haskell compilers assume that tabstop is 8 characters<br />
wide.<br />
<br />
Often people complain that it is very difficult to write Haskell<br />
because it requires them to align code. Actually, this is not true. If<br />
you align your code, the compiler will infer the beginnings and endings of<br />
syntactic blocks. However, if you don't want to indent your code, you<br />
can explicitly mark the end of each and every expression and use<br />
arbitrary layout, as in this example:<br />
<haskell><br />
-- Taken from 'exercise-1-2.hs'<br />
combine before after = <br />
do { before; <br />
putStrLn "In the middle"; <br />
after; };<br />
<br />
main = <br />
do { combine c c; let { b = combine (putStrLn "Hello!") (putStrLn "Bye!")};<br />
let {d = combine (b) (combine c c)}; <br />
putStrLn "So long!" };<br />
</haskell><br />
<br />
Back to the exercise - see how we construct code out of thin air? Try<br />
to imagine what this code will do, then run it and check yourself.<br />
<br />
Do you understand why "Hello!" and "Bye!" are not printed?<br />
----<br />
<br />
Let's examine our "main" function closer:<br />
<br />
Prelude> :load cd-fit.hs<br />
Compiling Main ( ./cd-fit.hs, interpreted )<br />
Ok, modules loaded: Main.<br />
*Main> :type main<br />
main :: IO ()<br />
*Main> <br />
<br />
We see that "main" is indeed an IO action which will return nothing<br />
when evaluated. When combining actions with "do", the type of the<br />
result will be the type of the last action, and "putStrLn something" has type<br />
"IO ()": <br />
<br />
*Main> :type putStrLn "Hello world!"<br />
putStrLn "Hello world!" :: IO ()<br />
*Main> <br />
<br />
Oh, by the way: have you noticed that we actually compiled our first<br />
Haskell program in order to examine "main"? :)<br />
<br />
Let's celebrate that by putting it under version control: execute<br />
"darcs add cd-fit.hs" and "darcs record", answer "y" to all questions,<br />
and provide the commit comment "Skeleton of cd-fit.hs".<br />
<br />
Let's try to run it:<br />
<br />
$ echo "foo" | runhaskell cd-fit.hs<br />
DEBUG: got input foo<br />
<br />
----<br />
'''Exercises''':<br />
<br />
* Try to write a program that takes your name from the stdin and greets you (keywords: getLine, putStrLn);<br />
<br />
* Try to write a program that asks for you name, reads it, greets you, asks for your favorite color, and prints it back (keywords: getLine, putStrLn).<br />
<br />
== Chapter 2: Parsing the input ==<br />
<br />
OK, now that we have a proper understanding of the powers of Haskell IO<br />
(and are awed by them, I hope), let's forget about IO and actually do<br />
some useful work. <br />
<br />
As you remember, we set forth to pack some CD-Rs as tightly as<br />
possible with data scattered in several input directories. We assume<br />
that "du -sb" will compute the sizes of input directories and output<br />
something like:<br />
<br />
65572 /home/adept/photos/raw-to-burn/dir1<br />
68268 /home/adept/photos/raw-to-burn/dir2<br />
53372 /home/adept/photos/raw-to-burn/dir3<br />
713124 /home/adept/photos/raw-to-burn/dir4<br />
437952 /home/adept/photos/raw-to-burn/dir5<br />
<br />
Our next task is to parse that input into some suitable internal<br />
representation.<br />
<br />
For that we will use powerful library of '''parsing combinators''' named<br />
"[[Parsec]]" which ships with most Haskell implementations.<br />
<br />
Much like the IO facilities we saw in the first chapter, this<br />
library provides a set of basic parsers and means to combine them<br />
into more complex parsing constructs.<br />
<br />
Unlike other tools in this area (lex/yacc or JavaCC to name a few),<br />
[[Parsec]] parsers do not require a separate preprocessing stage. Since in<br />
Haskell we can return a function as the result of another function and thus<br />
construct functions "out of thin air", there is no need for a separate<br />
syntax for parser descriptions. But enough advertisement - let's actually<br />
do some parsing:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-2-1.hs'<br />
import Text.ParserCombinators.Parsec<br />
<br />
-- parseInput parses output of "du -sb", which consists of many lines,<br />
-- each of which describes single directory<br />
parseInput =<br />
  do dirs <- many dirAndSize<br />
     eof<br />
     return dirs<br />
<br />
-- Datatype Dir holds information about single directory - its size and name<br />
data Dir = Dir Int String deriving Show<br />
<br />
-- `dirAndSize` parses information about single directory, which is:<br />
-- a size in bytes (number), some spaces, then directory name, which extends till newline<br />
dirAndSize =<br />
  do size <- many1 digit<br />
     spaces<br />
     dir_name <- anyChar `manyTill` newline<br />
     return (Dir (read size) dir_name)<br />
</haskell><br />
<br />
Just add those lines to "cd-fit.hs", between the declaration of <br />
the Main module and the definition of main.<br />
<br />
Here we see quite a lot of new<br />
things, and several that we already know.<br />
First of all, note the familiar "do" construct, which, as we know, is<br />
used to combine IO actions to produce new IO actions. Here we use it<br />
to combine "parsing" actions into new "parsing" actions. Does this<br />
mean that "parsing" implies "doing IO"? Not at all. Thing is, I must<br />
admit that I lied to you - "do" is used not only to combine IO<br />
actions. "Do" is used to combine any kind of so-called ''monadic<br />
actions'' or ''monadic values'' together.<br />
<br />
Think about [[monad]] as a "[[:Category:Idioms|design pattern]]" in the functional world.<br />
[[Monad]] is a way to hide from the user (programmer) all the machinery<br />
required for complex functionality to operate.<br />
<br />
As you might have heard, Haskell has no notion of "assignment",<br />
"mutable state", "variables", and is a "pure functional language",<br />
which means that every function called with the same input parameters<br />
will return exactly the same result. Meanwhile "doing IO" requires<br />
hauling around file handles and their states and dealing with IO<br />
errors. "Parsing" requires to track position in the input and dealing<br />
with parsing errors.<br />
<br />
In both cases the Wise Men Who Wrote the Libraries cared for our needs and<br />
hid all the underlying complexity from us, exposing the [http://en.wikipedia.org/wiki/Application_programming_interface API] of their<br />
libraries (IO and parsing) in the form of "monadic action" which we<br />
are free to combine as we see fit. <br />
<br />
Think of programming with monads as doing remodelling with the help of a<br />
professional remodelling crew. You describe a sequence of actions on a piece<br />
of paper (that's us writing in "do" notation), and then, when required, that<br />
sequence is carried out by the remodelling crew ("in the monad"), which<br />
provides you with the end result, hiding all the underlying complexity (how<br />
to prepare the paint, which nails to choose, etc.) from you.<br />
<br />
Let's use the interactive Haskell environment to decipher all the<br />
instructions we've written for the parsing library. As usual, we'll<br />
go top-down:<br />
<br />
*Main> :reload<br />
Ok, modules loaded: Main.<br />
*Main> :t parseInput<br />
parseInput :: GenParser Char st [Dir]<br />
*Main> :t dirAndSize<br />
dirAndSize :: GenParser Char st Dir<br />
*Main> <br />
<br />
Assuming (well, take my word for it) that "GenParser Char st" is our<br />
parsing monad, we can see that "parseInput", when evaluated, will<br />
produce a list of "Dir"s, and "dirAndSize", when evaluated, will<br />
produce a "Dir". Assuming that "Dir" somehow represents information<br />
about a single directory, that is pretty much what we wanted, isn't it?<br />
<br />
Let's see what a "Dir" means. We defined ''data[[type]]'' Dir as a record,<br />
which holds an Int and a String:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-2-1.hs'<br />
data Dir = Dir Int String deriving Show<br />
</haskell><br />
<br />
In order to construct such records, we must use ''data [[constructor]]''<br />
Dir:<br />
<br />
*Main> :t Dir 1 "foo"<br />
Dir 1 "foo" :: Dir<br />
<br />
In order to reduce confusion for newbies, we could have written:<br />
<haskell><br />
data Dir = D Int String deriving Show<br />
</haskell><br />
<br />
, which would define ''data[[type]]'' "Dir" with ''data [[constructor]]'' "D".<br />
However, traditionally the name of the data[[type]] and its [[constructor]] are<br />
chosen to be the same.<br />
<br />
Clause "[[deriving]] Show" instructs the compiler to make enough code "behind<br />
the curtains" to make this ''datatype'' conform to the interface of<br />
the ''type [[class]]'' Show. We will explain ''type [[class]]es'' later, for<br />
now let's just say that this will allow us to "print" instances of<br />
"Dir".<br />
<br />
'''Exercises:''' <br />
* examine types of "digit", "anyChar", "many", "many1" and "manyTill" to see how they are used to build more complex parsers from single ones.<br />
<br />
* compare types of "manyTill", "manyTill anyChar" and "manyTill anyChar newline". Note that "anyChar `manyTill` newline" is just another syntax sugar. Note that when function is supplied with less arguments that it actually needs, we get not a value, but a new function, which is called ''partial application''.<br />
<br />
<br />
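To make partial application concrete, here is a small sketch of ours (the names are made up for illustration):<br />
<br />
<haskell>
-- A three-argument function...
add3 :: Int -> Int -> Int -> Int
add3 a b c = a + b + c

-- ...supplied with only two arguments yields a new one-argument function
addToTen :: Int -> Int
addToTen = add3 7 3
-- addToTen 5 evaluates to 15
</haskell>
<br />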
OK. So, we combined a lot of primitive parsing actions to get ourselves a<br />
parser for output of "du -sb". How can we actually parse something? the [[Parsec]] library supplies us with function "parse":<br />
<br />
*Main> :t parse<br />
parse :: GenParser tok () a<br />
-> SourceName<br />
-> [tok]<br />
-> Either ParseError a<br />
*Main> :t parse parseInput<br />
parse parseInput :: SourceName -> [Char] -> Either ParseError [Dir]<br />
*Main> <br />
<br />
At first the [[type]] might be a bit cryptic, but once we supply "parse" with the parser we made, the compiler gets more information and presents us with a more concise [[type]].<br />
<br />
Stop and consider this for a moment. The compiler figured out type of the function without a single type annotation supplied by us! Imagine if a Java compiler deduced types for you, and you wouldn't have to specify types of arguments and return values of methods, ever.<br />
<br />
OK, back to the code. We can observe that "parse" is a function which,<br />
given a parser, the name of the source file or channel (e.g. "stdin"), and<br />
the source data (a String, which is a list of "Char"s, written "[Char]"),<br />
will either produce a parse error or parse us a list of "Dir"s.<br />
<br />
Datatype "Either" is an example of datatype whose constructor has name, different<br />
from the name of the datatype. In fact, "Either" has two constructors:<br />
<br />
<haskell><br />
data Either a b = Left a | Right b<br />
</haskell><br />
<br />
To understand better what this means, consider the following<br />
example:<br />
<br />
*Main> :t Left 'a'<br />
Left 'a' :: Either Char b<br />
*Main> :t Right "aaa"<br />
Right "aaa" :: Either a [Char]<br />
*Main> <br />
<br />
You see that "Either" is a ''union'' (much like the C/C++ "union") which could<br />
hold value of one of the two distinct types. However, unlike C/C++ "union",<br />
when presented with value of type "Either Int Char" we could immediately see<br />
whether its an Int or a Char - by looking at the constructor which was used to<br />
produce the value. Such datatypes are called "tagged unions", and they are<br />
another [[:Category:Idioms|power tool]] in the Haskell toolset.<br />
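<br />
A small sketch (ours, not part of cd-fit) of how a tagged union is consumed - by pattern matching on its constructors:<br />
<br />
<haskell>
-- The constructor tells us which branch of the union we are in
describeValue :: Either Int Char -> String
describeValue (Left n)  = "got an Int: " ++ show n
describeValue (Right c) = "got a Char: " ++ [c]
-- describeValue (Left 42)   evaluates to "got an Int: 42"
-- describeValue (Right 'x') evaluates to "got a Char: x"
</haskell>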
<br />
Did you also notice that we provide "parse" with a parser, which is a monadic<br />
value, but receive not a new monadic value but a parsing result? That is<br />
because "parse" is an evaluator for the "Parser" monad, much like the [[GHC]] or [[Hugs]] runtime is an evaluator for the IO monad. The function "parse" implements all the monadic machinery: it tracks errors and positions in the input, implements backtracking and lookahead, etc.<br />
<br />
let's extend our "main" function to use "parse" and actually parse the input<br />
and show us the parsed data structures:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-2-1.hs'<br />
main = do input <- getContents<br />
          putStrLn ("DEBUG: got input " ++ input)<br />
          let dirs = case parse parseInput "stdin" input of<br />
                          Left err -> error $ "Input:\n" ++ show input ++<br />
                                              "\nError:\n" ++ show err<br />
                          Right result -> result<br />
          putStrLn "DEBUG: parsed:"; print dirs<br />
</haskell><br />
<br />
'''Exercise:'''<br />
<br />
* In order to understand this snippet of code better, examine (with ghci or hugs) the difference between 'drop 1 ( drop 1 ( drop 1 ( drop 1 ( drop 1 "foobar" ))))' and 'drop 1 $ drop 1 $ drop 1 $ drop 1 $ drop 1 "foobar"'. Examine type of ($).<br />
* Try putStrLn "aaa" and print "aaa" and see the difference, examine their types.<br />
* Try print (Dir 1 "foo") and putStrLn (Dir 1 "foo"). Examine types of print and putStrLn to understand the behavior in both cases.<br />
<br />
Let's try to run what we have so far:<br />
<br />
$ du -sb * | runhaskell ./cd-fit.hs<br />
<br />
DEBUG: got input 22325 Article.txt<br />
18928 Article.txt~<br />
1706 cd-fit.hs<br />
964 cd-fit.hs~<br />
61609 _darcs<br />
<br />
DEBUG: parsed:<br />
[Dir 22325 "Article.txt",Dir 18928 "Article.txt~",<br />
Dir 1706 "cd-fit.hs",Dir 964 "cd-fit.hs~",Dir 61609 "_darcs"]<br />
<br />
Seems to be doing exactly as planned. Now let's try some erroneous<br />
input:<br />
<br />
$ echo "foo" | runhaskell cd-fit.hs<br />
DEBUG: got input foo<br />
<br />
DEBUG: parsed:<br />
*** Exception: Input:<br />
"foo\n"<br />
Error:<br />
"stdin" (line 1, column 1):<br />
unexpected "f"<br />
expecting digit or end of input<br />
<br />
Seems to be doing fine. <br />
<br />
If you followed advice to put your code under version control, you<br />
could now use "darcs whatsnew" or "darcs diff -u" to examine your<br />
changes to the previous version. Use "darcs record" to commit them. As<br />
an exercise, first record the changes "outside" of function "main" and<br />
then record the changes in "main". Do "darcs changes" to examine a<br />
list of changes you've recorded so far.<br />
<br />
== Chapter 3: Packing the knapsack and testing it with class, too (and don't forget your towel!) ==<br />
<br />
Enough preliminaries already. Let's go pack some CDs.<br />
<br />
As you might already have recognized, our problem is a classical one. It is<br />
called the "knapsack problem" ([http://www.google.com/search?q=knapsack+problem google it], if you don't already know what it<br />
is; there are more than 100,000 links).<br />
<br />
Let's start with the greedy solution, but first let's slightly modify our "Dir"<br />
datatype to allow easy extraction of its components:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-1.hs'<br />
data Dir = Dir {dir_size::Int, dir_name::String} deriving Show<br />
</haskell><br />
<br />
----<br />
'''Exercise:''' examine types of "Dir", "dir_size" and "dir_name"<br />
----<br />
<br />
From now on, we can use "dir_size d" to get the size of a directory, and<br />
"dir_name d" to get its name, provided that "d" is of type "Dir".<br />
<br />
The greedy algorithm sorts directories from the biggest down and tries to put<br />
them on the CD one by one until there is no room for more. We will need to track<br />
which directories we have added to the CD, so let's add another datatype and code<br />
this simple packing algorithm:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-1.hs'<br />
import Data.List (sortBy)<br />
<br />
-- DirPack holds a set of directories which are to be stored on single CD.<br />
-- 'pack_size' could be calculated, but we will store it separately to reduce<br />
-- amount of calculation<br />
data DirPack = DirPack {pack_size::Int, dirs::[Dir]} deriving Show<br />
<br />
-- For simplicity, let's assume that we deal with standard 700 Mb CDs for now<br />
media_size = 700*1024*1024<br />
<br />
-- Greedy packer tries to add directories one by one to initially empty 'DirPack'<br />
greedy_pack dirs = foldl maybe_add_dir (DirPack 0 []) $ sortBy cmpSize dirs<br />
  where<br />
  -- sort in descending order of size, so we try the biggest directories first<br />
  cmpSize d1 d2 = compare (dir_size d2) (dir_size d1)<br />
<br />
-- Helper function, which only adds directory "d" to the pack "p" when the new<br />
-- total size does not exceed media_size<br />
maybe_add_dir p d =<br />
  let new_size = pack_size p + dir_size d<br />
      new_dirs = d:(dirs p)<br />
  in if new_size > media_size then p else DirPack new_size new_dirs<br />
</haskell><br />
<br />
----<br />
I'll highlight the areas which you could explore on your own (using other nice<br />
tutorials out there, of which I especially recommend "[[Yet Another Haskell Tutorial]]" by Hal Daume):<br />
* We choose to import a single function "sortBy" from a module [[Data.List]], not the whole thing.<br />
* Instead of coding case-by-case recursive definition of "greedy_pack", we go with higher-order approach, choosing "foldl" as a vehicle for list traversal. Examine its type. Other useful function from the same category are "map", "foldr", "scanl" and "scanr". Look them up!<br />
* To sort list of "Dir" by size only, we use custom sort function and parametrized sort - "sortBy". This sort of setup where the user may provide a custom "modifier" for a generic library function is quite common: look up "deleteBy", "deleteFirstsBy", "groupBy", "insertBy", "intersectBy", "maximumBy", "minimumBy", "sortBy", "unionBy".<br />
* To code the quite complex function "maybe_add_dir", we introduced several '''local definitions''' in the "let" clause, which we can reuse within the function body. We used a "where" clause in the "greedy_pack" function to achieve the same effect. Read about "let" and "where" clauses and the differences between them.<br />
* Note that in order to construct a new value of type "DirPack" (in function "maybe_add_dir") we haven't used the helper accessor functions "pack_size" and "dirs"<br />
----<br />
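<br />
As a quick illustration of the "foldl" mentioned above (a standalone sketch of ours, not the cd-fit code): foldl threads an accumulator through a list from the left:<br />
<br />
<haskell>
-- foldl (+) 0 [1,2,3,4] evaluates as ((((0 + 1) + 2) + 3) + 4)
total :: Int
total = foldl (+) 0 [1, 2, 3, 4]
-- total evaluates to 10
</haskell>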
<br />
In order to actually use our greedy packer we must call it from our "main"<br />
function, so let's add these lines:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-1.hs'<br />
main = do ...<br />
-- compute solution and print it<br />
putStrLn "Solution:" ; print (greedy_pack dirs)<br />
</haskell><br />
<br />
Verify integrity of our definitions by (re)loading our code in ghci. Compiles?<br />
Thought so :) Now, do "darcs record" and add some sensible commit message.<br />
<br />
Now it is time to test our creation. We could do it by actually running it in<br />
the wild like this:<br />
<br />
$ du -sb ~/DOWNLOADS/* | runhaskell ./cd-fit.hs<br />
<br />
This will prove that our code seems to be working. At least, this once. How<br />
about establishing with a reasonable degree of certainty that our code, parts<br />
and whole, works properly, and doing so in a re-usable manner? In other<br />
words, how about writing some tests?<br />
<br />
Java programmers used to JUnit probably thought about screens of boiler-plate<br />
code and hand-coded method invocations. Never fear, we will not do anything as<br />
silly :)<br />
<br />
Enter '''[[QuickCheck]]'''.<br />
<br />
[[QuickCheck]] is a tool to do automated testing of your functions using<br />
(semi)random input data. In the spirit of "100b of code examples is worth 1kb of<br />
praise" let's show the code for testing the following ''property'': An attempt to pack directories returned by "greedy_pack" should return "DirPack" of exactly the same pack:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-2.hs'<br />
import Test.QuickCheck<br />
import Control.Monad (liftM2, replicateM)<br />
<br />
-- We must teach QuickCheck how to generate arbitrary "Dir"s<br />
instance Arbitrary Dir where<br />
  -- Let's just skip "coarbitrary" for now, ok?<br />
  -- I promise, we will get back to it later :)<br />
  coarbitrary = undefined<br />
  -- We generate an arbitrary "Dir" by generating a random size and a random name<br />
  -- and stuffing them inside "Dir"<br />
  arbitrary = liftM2 Dir gen_size gen_name<br />
    -- Generate a random size between 10 and 1400 Mb<br />
    where gen_size = do s <- choose (10,1400)<br />
                        return (s*1024*1024)<br />
          -- Generate a random name 1 to 300 chars long, consisting of symbols "fubar/"<br />
          gen_name = do n <- choose (1,300)<br />
                        replicateM n (elements "fubar/")<br />
<br />
-- For convenience and by tradition, all QuickCheck tests begin with prefix "prop_".<br />
-- Assume that "ds" will be a random list of "Dir"s and code your test.<br />
prop_greedy_pack_is_fixpoint ds =<br />
  let pack = greedy_pack ds<br />
  in pack_size pack == pack_size (greedy_pack (dirs pack))<br />
</haskell><br />
<br />
Let's run the test, after which I'll explain how it all works:<br />
<br />
Prelude> :r<br />
Compiling Main ( ./cd-fit.hs, interpreted )<br />
Ok, modules loaded: Main.<br />
*Main> quickCheck prop_greedy_pack_is_fixpoint<br />
[numbers spinning]<br />
OK, passed 100 tests.<br />
*Main> <br />
<br />
We've just seen our "greedy_pack" run on a 100 completely (well, almost<br />
completely) random lists of "Dir"s, and it seems that property indeed holds.<br />
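<br />
Properties in this style need not be about cd-fit at all; here is another tiny example of ours, stating that reversing a list twice gives back the original list:<br />
<br />
<haskell>
-- QuickCheck will feed this function 100 random lists of Ints;
-- the property holds if the function returns True for all of them
prop_reverse_twice :: [Int] -> Bool
prop_reverse_twice xs = reverse (reverse xs) == xs
-- In ghci: quickCheck prop_reverse_twice
</haskell>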
<br />
Let's dissect the code. The most intriguing part is "instance Arbitrary Dir<br />
where", which declares that "Dir" is an '''[[instance]]''' of the '''type[[class]]''' "Arbitrary". Whoa, that's a whole lot of unknown words! :) Let's slow down a<br />
bit.<br />
<br />
What is a '''type[[class]]'''? A typeclass is a Haskell way of dealing with the<br />
following situation: suppose that you are writing a library of useful<br />
functions and you don't know in advance how exactly they will be used, so you<br />
want to make them generic. Now, on one hand, you don't want to restrict your<br />
users to a certain type (e.g. String). On the other hand, you want to enforce<br />
the convention that arguments for your function must satisfy a certain set of<br />
constraints. That is where '''typeclass''' comes in handy. <br />
<br />
Think of a typeclass as a '''contract''' (or "interface", in Java terms) that<br />
your type must fulfill in order to be admitted as an argument to certain<br />
functions. <br />
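For instance, here is a tiny stand-alone sketch of such a "contract" (the names <tt>Sized</tt>, <tt>Photo</tt> and <tt>fitsOnFloppy</tt> are made up for illustration): any type that fulfills the contract by providing <tt>sizeInBytes</tt> is accepted by the generic function:<br />

```haskell
-- A hypothetical typeclass: the "contract" is one function.
class Sized a where
  sizeInBytes :: a -> Int

-- A hypothetical type fulfilling the contract:
data Photo = Photo { width :: Int, height :: Int }

instance Sized Photo where
  sizeInBytes p = width p * height p * 3   -- 3 bytes per pixel

-- A generic function: works for ANY instance of Sized.
fitsOnFloppy :: Sized a => a -> Bool
fitsOnFloppy x = sizeInBytes x <= 1440 * 1024

main :: IO ()
main = print (fitsOnFloppy (Photo 640 480))
```

Any other type (a song, a document) can be admitted to <tt>fitsOnFloppy</tt> simply by writing its own <tt>instance Sized</tt>.<br />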
<br />
Let's examine the typeclass "Arbitrary":<br />
<br />
*Main> :i Arbitrary<br />
class Arbitrary a where<br />
arbitrary :: Gen a<br />
coarbitrary :: a -> Gen b -> Gen b<br />
-- Imported from Test.QuickCheck<br />
instance Arbitrary Dir<br />
-- Defined at ./cd-fit.hs:61:0<br />
instance Arbitrary Bool -- Imported from Test.QuickCheck<br />
instance Arbitrary Double -- Imported from Test.QuickCheck<br />
instance Arbitrary Float -- Imported from Test.QuickCheck<br />
instance Arbitrary Int -- Imported from Test.QuickCheck<br />
instance Arbitrary Integer -- Imported from Test.QuickCheck<br />
-- rest skipped --<br />
<br />
It could be read this way: "Any [[type]] (let's name it 'a') can be a member of the [[class]] Arbitrary as soon as we define two functions for it: "arbitrary" and "coarbitrary", with the signatures shown. For the types Dir, Bool, Double, Float, Int and Integer such definitions were provided, so all those types are instances of the class Arbitrary".<br />
<br />
Now, if you write a function which operates on its arguments solely by means<br />
of "arbitrary" and "coarbitrary", you can be sure that this function will work<br />
on any type which is an instance of "Arbitrary"!<br />
<br />
Let's say it again. Someone (maybe even you) writes code (an API or library)<br />
which requires that input values implement a certain ''interface'', described<br />
in terms of functions. Once you show how your type implements this<br />
''interface'', you are free to use the API or library.<br />
<br />
Consider the function "sort" from standard library:<br />
<br />
*Main> :t Data.List.sort<br />
Data.List.sort :: (Ord a) => [a] -> [a]<br />
<br />
We see that it sorts lists of any values which are instance of typeclass<br />
"Ord". Let's examine that class:<br />
<br />
*Main> :i Ord<br />
class Eq a => Ord a where<br />
compare :: a -> a -> Ordering<br />
(<) :: a -> a -> Bool<br />
(>=) :: a -> a -> Bool<br />
(>) :: a -> a -> Bool<br />
(<=) :: a -> a -> Bool<br />
max :: a -> a -> a<br />
min :: a -> a -> a<br />
-- skip<br />
instance Ord Double -- Imported from GHC.Float<br />
instance Ord Float -- Imported from GHC.Float<br />
instance Ord Bool -- Imported from GHC.Base<br />
instance Ord Char -- Imported from GHC.Base<br />
instance Ord Integer -- Imported from GHC.Num<br />
instance Ord Int -- Imported from GHC.Base<br />
-- skip<br />
*Main> <br />
<br />
We see a couple of interesting things: first, there is an additional<br />
requirement listed: in order to be an instance of "Ord", a type must first be an<br />
instance of the typeclass "Eq". Then, we see that there is an awful lot of<br />
functions to define in order to be an instance of "Ord". Wait a second, isn't<br />
it silly to define both (<) and (>) when one could be expressed via the other? <br />
<br />
Right you are! Usually, a typeclass contains several "default" implementations<br />
for its functions, when it is possible to express them through each other (as<br />
it is with "Ord"). In this case it is possible to supply only a minimal<br />
definition (which in the case of "Ord" consists of any single function) and the others<br />
will be derived automatically. If you supply fewer functions than are required<br />
for the minimal definition, the compiler/interpreter will say so and<br />
explain which functions you still have to define.<br />
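To see the minimal-definition machinery at work, here is a stand-alone sketch (the type <tt>Priority</tt> is made up for illustration): we supply only <tt>compare</tt>, and (<), max, sorting and the rest come for free from the default implementations:<br />

```haskell
import Data.List (sort)

data Priority = Low | Medium | High deriving (Eq, Show)

rank :: Priority -> Int
rank Low    = 0
rank Medium = 1
rank High   = 2

-- Minimal definition: compare only; all other Ord methods
-- are filled in by their default implementations.
instance Ord Priority where
  compare a b = compare (rank a) (rank b)

main :: IO ()
main = do
  print (Low < High)             -- (<) derived from compare
  print (sort [High, Low, Medium])
```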
<br />
Once again, we see that a lot of [[type]]s are already instances of typeclass Ord, and thus we are able to sort them.<br />
<br />
Now, let's take a look back at the definition of "Dir":<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-2.hs'<br />
data Dir = Dir {dir_size::Int, dir_name::String} deriving Show<br />
</haskell><br />
<br />
See that "[[deriving]]" clause? It instructs the compiler to automatically generate code to make "Dir" an instance of the typeclass Show. The compiler knows about a bunch of standard typeclasses (Eq, Ord, Show, Enum, Bounded, Typeable to name a few) and knows how to make a type into a "suitably good" instance of any of them. If you want to derive instances of more than one typeclass, say it this way: "deriving (Eq,Ord,Show)". Voila! Now we can compare, sort and print data of<br />
that type!<br />
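Here is a stand-alone sketch of exactly that (note that the derived Ord compares fields left to right, so <tt>dir_size</tt> dominates):<br />

```haskell
import Data.List (sort)

-- One deriving clause buys us comparison, ordering and printing:
data Dir = Dir { dir_size :: Int, dir_name :: String }
           deriving (Eq, Ord, Show)

main :: IO ()
main = do
  let ds = [Dir 300 "music", Dir 100 "docs", Dir 200 "photos"]
  print (Dir 100 "docs" == Dir 100 "docs")  -- Eq, derived
  print (sort ds)                           -- Ord, derived: dir_size first
```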
<br />
Side note for Java programmers: just imagine a Java compiler which derives the code<br />
for "implements Storable" for you...<br />
<br />
Side note for C++ programmers: just imagine that deep copy constructors are<br />
being written for you by the compiler...<br />
<br />
----<br />
'''Exercises:'''<br />
* Examine typeclasses Eq and Show<br />
* Examine types of (==) and "print"<br />
* Try to make "Dir" an instance of "Eq"<br />
----<br />
<br />
OK, back to our tests. So, what did we have to do in order to make "Dir" an<br />
instance of "Arbitrary"? The minimal definition consists of "arbitrary". Let's<br />
examine it up close:<br />
<br />
*Main> :t arbitrary<br />
arbitrary :: (Arbitrary a) => Gen a<br />
<br />
See that "Gen a"? Reminds you of something? Right! Think of "IO a" and "Parser<br />
a", which we've seen already. This is yet another example of an action-returning<br />
function, which can be used inside "do"-notation. (You might ask yourself:<br />
wouldn't it be useful to generalize that convenient concept of actions and<br />
"do"? Of course! It has already been done; the concept is called "[[Monad]]" and we will talk about it in Chapter 400 :) )<br />
<br />
Since 'a' here is a [[type variable]] which is an instance of "Arbitrary", we can substitute "Dir" for it. So, how can we make and return an action of type "Gen Dir"?<br />
<br />
Let's look at the code:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-2.hs'<br />
  arbitrary = liftM2 Dir gen_size gen_name<br />
    -- Generate random size between 10 and 1400 Mb<br />
    where gen_size = do s <- choose (10,1400)<br />
                        return (s*1024*1024)<br />
          -- Generate random name, 1 to 300 chars long, consisting of symbols "fubar/"<br />
          gen_name = do n <- choose (1,300)<br />
                        replicateM n (elements "fubar/")<br />
</haskell><br />
<br />
We have used the library-provided functions "choose" and "elements" to build up<br />
"gen_size :: Gen Int" and "gen_name :: Gen String" (exercise: don't take my<br />
word on that; find a way to check the types of "gen_name" and "gen_size"). Since<br />
"Int" and "String" are the components of "Dir", we surely must be able to use "Gen<br />
Int" and "Gen String" to build "Gen Dir". But where is the "do" block for<br />
that? There is none; there is only a single call to "liftM2". <br />
<br />
Let's examine it:<br />
<br />
*Main> :t liftM2<br />
liftM2 :: (Monad m) => (a1 -> a2 -> r) -> m a1 -> m a2 -> m r<br />
<br />
Kind of scary, right? Let's provide the typechecker with more context:<br />
<br />
*Main> :t liftM2 Dir<br />
liftM2 Dir :: (Monad m) => m Int -> m String -> m Dir<br />
<br />
Since you already heard that "Gen" is a "Monad", you can substitute "Gen" for "m" here, obtaining "liftM2 Dir :: Gen Int -> Gen String -><br />
Gen Dir". Exactly what we wanted!<br />
<br />
Consider "liftM2" to be "advanced topic" of this chapter (which we will cover<br />
later) and just note for now that:<br />
* "2" is the number of arguments taken by the data constructor "Dir", and we have used "liftM2" to construct "Gen Dir" out of "Dir"<br />
* There are also "liftM", "liftM3", "liftM4", "liftM5"<br />
* "liftM2" is defined as "liftM2 f a1 a2 = do x<-a1; y<-a2; return (f x y)"<br />
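Before tackling "Gen", you can watch <tt>liftM2</tt> at work in the more familiar Maybe monad; this little sketch shows the call and its equivalent hand-written do-block side by side:<br />

```haskell
import Control.Monad (liftM2)

-- liftM2 f a1 a2 = do { x <- a1; y <- a2; return (f x y) }
main :: IO ()
main = do
  print (liftM2 (+) (Just 1) (Just 2))               -- Just 3
  print (liftM2 (+) (Nothing :: Maybe Int) (Just 2)) -- Nothing
  -- the same thing spelled out as a do-block:
  print (do x <- Just 1; y <- Just 2; return (x + y))
```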
<br />
Hopefully, this will all make sense after you read it for the third<br />
time ;)<br />
<br />
Oh, by the way - don't forget to "darcs record" your changes!<br />
<br />
== Chapter 4: REALLY packing the knapsack this time == <br />
<br />
In this chapter we are going to write another not-so-trivial packing<br />
method, compare the efficiency of the packing methods, and learn something new<br />
about debugging and profiling Haskell programs along the way.<br />
<br />
It might not be immediately obvious whether our packing algorithm is<br />
effective, and if so, in which particular way: is its runtime acceptable,<br />
is its memory consumption reasonable, are its results of sufficient quality,<br />
are there any alternative algorithms, and how do they compare to each other?<br />
<br />
Let's code another solution to the knapsack packing problem, called the "dynamic programming method" and put both variants to the test.<br />
<br />
This time, I'll not dissect the listing and explain it bit by bit. Instead, comments are provided in the code:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-1.hs'<br />
----------------------------------------------------------------------------------<br />
-- Dynamic programming solution to the knapsack (or, rather, disk) packing problem<br />
--<br />
-- Let the `bestDisk x' be the "most tightly packed" disk of total <br />
-- size no more than `x'.<br />
precomputeDisksFor :: [Dir] -> [DirPack]<br />
precomputeDisksFor dirs =<br />
      -- By calculating `bestDisk' for all possible disk sizes, we can<br />
      -- obtain a solution for a particular case by a simple lookup in our list of<br />
      -- solutions :)<br />
  let precomp = map bestDisk [0..]<br />
<br />
      -- How to calculate `bestDisk'? Let's opt for a recursive definition:<br />
      -- Recursion base: the best-packed disk of size 0 is empty<br />
      bestDisk 0 = DirPack 0 []<br />
      -- Recursion step: for size `limit', bigger than 0, the best-packed disk is<br />
      -- computed as follows:<br />
      bestDisk limit =<br />
        -- 1. Take all non-empty dirs that could possibly fit on that disk by themselves.<br />
        -- Consider them one by one. Let the size of a particular dir be `dir_size d'.<br />
        -- Let's add it to the best-packed disk of size <= (limit - dir_size d), thus<br />
        -- producing a disk of size <= limit. Let's do that for all "candidate"<br />
        -- dirs that are not yet on our disk:<br />
        case [ DirPack (dir_size d + s) (d:ds)<br />
             | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
             , dir_size d > 0<br />
             , let (DirPack s ds) = precomp!!(limit - dir_size d)<br />
             , d `notElem` ds<br />
             ] of<br />
          -- We either fail to add any dirs (probably because all of them are too big).<br />
          -- Well, just report that the disk must be left empty:<br />
          [] -> DirPack 0 []<br />
          -- Or we produce some alternative packings. Let's choose the best of them all:<br />
          packs -> maximumBy cmpSize packs<br />
<br />
      cmpSize a b = compare (pack_size a) (pack_size b)<br />
<br />
  in precomp<br />
<br />
-- When we have precomputed disks of all possible sizes for the given set of dirs, the<br />
-- solution to a particular problem is simple: just take the solution for the required<br />
-- 'media_size' and that's it!<br />
dynamic_pack dirs = (precomputeDisksFor dirs)!!media_size<br />
</haskell><br />
<br />
Notice that it took almost the same amount of text to describe the algorithm and to write an implementation of it. Nice, eh?<br />
<br />
----<br />
<br />
'''Exercises:'''<br />
* Make all necessary amendments to the previously written code to make this example compile. Hints: browse modules Data.List and Data.Ix for functions that are "missing" - maybe you will find them there (use ":browse Module.Name" at ghci prompt). Have you had to define some new instances of some classes? How did you do that?<br />
* <tt>[ other_function local_binding | x <- some_list, x > 0, let local_binding = some_function x ]</tt> is called a "list comprehension". This is another example of "syntactic sugar", which can lead to nicely readable code but, when abused, can lead to syntactic caries :) Do you understand what this sample does: <tt>let solve x = [ y | x <- [0..], y<-[0..], y == x * x ]</tt>? Could you write (with the help of a decent tutorial) a de-sugared version of it? (Yes, I know that finding a square root does not require list traversals, but for the sake of self-education try and do it)<br />
* Notice that in order to code the quite complex implementation of <tt>precomputeDisksFor</tt> we split it up into several smaller pieces and put them as '''local bindings''' inside a '''let''' clause.<br />
* Notice that we use '''pattern matching''' both to define <tt>bestDisk</tt> on a case-by-case basis and to "peer into" ('''de-construct''') <tt>DirPack</tt> in the <tt>let (DirPack s ds)=precomp!!(limit - dir_size d)</tt> line<br />
* Notice how we use function composition to build a complex condition for filtering the list of dirs<br />
<br />
---- <br />
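As a warm-up for the list-comprehension exercise above, here is a simpler comprehension de-sugared by hand via <tt>concatMap</tt> (the function names are made up for illustration):<br />

```haskell
-- A comprehension with a guard and a local binding ...
squaresOfPositives :: [Int] -> [Int]
squaresOfPositives xs = [ y | x <- xs, x > 0, let y = x * x ]

-- ... and one possible hand-desugared equivalent: each generator
-- becomes a concatMap, each guard becomes an if, each `let` stays a let.
squaresOfPositives' :: [Int] -> [Int]
squaresOfPositives' xs =
  concatMap (\x -> if x > 0 then let y = x * x in [y] else []) xs

main :: IO ()
main = do
  print (squaresOfPositives  [-2, 1, 3])
  print (squaresOfPositives' [-2, 1, 3])
```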
<br />
Before we move any further, let's make a small cosmetic change to our<br />
code. Right now our solution uses 'Int' to store directory sizes. In<br />
Haskell, 'Int' is a platform-dependent integer type, which imposes certain<br />
limitations on the values of this type. An attempt to compute a value<br />
of type 'Int' that exceeds the bounds will result in an overflow.<br />
The standard Haskell libraries have a special typeclass<br />
<hask>Bounded</hask>, which allows us to define and examine such bounds:<br />
<br />
Prelude> :i Bounded <br />
class Bounded a where<br />
minBound :: a<br />
maxBound :: a<br />
-- skip --<br />
instance Bounded Int -- Imported from GHC.Enum<br />
<br />
We see that 'Int' is indeed bounded. Let's examine the bounds:<br />
<br />
Prelude> minBound :: Int <br />
-2147483648<br />
Prelude> maxBound :: Int<br />
2147483647<br />
Prelude> <br />
<br />
Those of you who are C-literate will spot at once that in this case<br />
'Int' is a so-called "signed 32-bit integer", which means that we<br />
would run into errors trying to operate on directories/directory packs<br />
bigger than 2 GB.<br />
<br />
Luckily for us, Haskell has integers of arbitrary precision (limited<br />
only by the amount of available memory). The appropriate type is<br />
called 'Integer':<br />
<br />
Prelude> (2^50) :: Int<br />
0 -- overflow<br />
Prelude> (2^50) :: Integer<br />
1125899906842624 -- no overflow<br />
Prelude><br />
<br />
Let's change the definitions of 'Dir' and 'DirPack' to allow for bigger<br />
directory sizes:<br />
<haskell><br />
-- Taken from 'cd-fit-4-2.hs'<br />
data Dir = Dir {dir_size::Integer, dir_name::String} deriving (Eq,Show)<br />
data DirPack = DirPack {pack_size::Integer, dirs::[Dir]} deriving Show<br />
</haskell><br />
<br />
Try to compile the code or load it into ghci. You will get the<br />
following errors:<br />
<br />
cd-fit-4-2.hs:73:79:<br />
Couldn't match `Int' against `Integer'<br />
Expected type: Int<br />
Inferred type: Integer<br />
In the expression: limit - (dir_size d)<br />
In the second argument of `(!!)', namely `(limit - (dir_size d))'<br />
<br />
cd-fit-4-2.hs:89:47:<br />
Couldn't match `Int' against `Integer'<br />
Expected type: Int<br />
Inferred type: Integer<br />
In the second argument of `(!!)', namely `media_size'<br />
In the definition of `dynamic_pack':<br />
dynamic_pack dirs = (precomputeDisksFor dirs) !! media_size<br />
<br />
<br />
It seems like Haskell has some trouble using 'Integer' with '(!!)'.<br />
Let's see why:<br />
<br />
Prelude> :t (!!)<br />
(!!) :: [a] -> Int -> a<br />
<br />
It seems the definition of '(!!)' demands that the index be an 'Int', not<br />
an 'Integer'. Haskell never converts any type to another type<br />
automatically; the programmer has to ask for that explicitly.<br />
<br />
I will not repeat the section "Standard Haskell Classes" from<br />
[http://haskell.org/onlinereport/basic.html the Haskell Report] and<br />
explain why the typeclasses for various numbers are organized the way<br />
they are. I will just say that the standard typeclass<br />
<hask>Num</hask> demands that numeric types implement the method<br />
<hask>fromInteger</hask>:<br />
<br />
Prelude> :i Num<br />
class (Eq a, Show a) => Num a where<br />
(+) :: a -> a -> a<br />
(*) :: a -> a -> a<br />
(-) :: a -> a -> a<br />
negate :: a -> a<br />
abs :: a -> a<br />
signum :: a -> a<br />
fromInteger :: Integer -> a<br />
-- Imported from GHC.Num<br />
instance Num Float -- Imported from GHC.Float<br />
instance Num Double -- Imported from GHC.Float<br />
instance Num Integer -- Imported from GHC.Num<br />
instance Num Int -- Imported from GHC.Num<br />
<br />
We see that <hask>Integer</hask> is a member of the typeclass<br />
<hask>Num</hask>, thus we can use <hask>fromInteger</hask> to make<br />
the type errors go away:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-2.hs'<br />
-- snip<br />
case [ DirPack (dir_size d + s) (d:ds)<br />
     | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
     , dir_size d > 0<br />
     , let (DirPack s ds) = precomp!!(fromInteger (limit - dir_size d))<br />
     , d `notElem` ds<br />
     ] of<br />
-- snip<br />
dynamic_pack dirs = (precomputeDisksFor dirs)!!(fromInteger media_size)<br />
-- snip<br />
</haskell><br />
<br />
The type errors went away, but a careful reader will spot at once that when<br />
the expression <hask>(limit - dir_size d)</hask> exceeds the bounds<br />
of <hask>Int</hask>, an overflow will occur and we will not access the<br />
correct list element. Don't worry, we will deal with this in a short while.<br />
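To make the conversion concrete, here is a tiny stand-alone sketch (not part of the tutorial's code) of using <hask>fromInteger</hask>, and its more general cousin <hask>fromIntegral</hask>, to turn an 'Integer' into a valid list index:<br />

```haskell
main :: IO ()
main = do
  let idx = 5 :: Integer
      xs  = "abcdef"
  -- `xs !! idx` would be rejected: (!!) wants an Int index.
  -- fromInteger converts an Integer to any Num type (here, Int):
  print (xs !! fromInteger idx)
  -- fromIntegral converts between any two integral types:
  print (xs !! fromIntegral idx)
```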
<br />
Now, let's code a QuickCheck test for this function along the lines of the test for <tt>greedy_pack</tt>:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-2.hs'<br />
prop_dynamic_pack_is_fixpoint ds =<br />
  let pack = dynamic_pack ds<br />
  in pack_size pack == pack_size (dynamic_pack (dirs pack))<br />
</haskell><br />
<br />
Now, let's try to run it (DON'T PANIC, and save all your work in other applications first!):<br />
<br />
*Main> quickCheck prop_dynamic_pack_is_fixpoint<br />
<br />
Now, you took my advice seriously, didn't you? And you did have your '''Ctrl-C''' handy, didn't you? Most probably, the attempt to run the test resulted in all your memory being taken up by the <tt>ghci</tt> process, which you hopefully interrupted soon enough by pressing '''Ctrl-C'''.<br />
<br />
What happened? Who ate all the memory? How do we debug this problem? GHC comes with profiling abilities, but we cannot use them here: they produce a report after the program terminates, and ours doesn't seem to do so without consuming several terabytes of memory first. Still, there is a lot of room for maneuver.<br />
<br />
Let's see. Since we called <tt>dynamic_pack</tt> and it ate all the memory, let's not do that again. Instead, let's see what this function does and tweak it a bit to explore its behavior.<br />
<br />
Since we already know that the random lists of "Dir"s generated for our QuickCheck tests are of modest size (after all, <tt>greedy_pack</tt> munches them without significant memory consumption), the size of the input most probably is not the issue. However, <tt>dynamic_pack</tt> builds quite a huge list internally (via <tt>precomputeDisksFor</tt>). Could this be a problem?<br />
<br />
Let's turn on the timing/memory stats (":set +s" at the ghci prompt) and try to peek at various elements of the list returned by <tt>precomputeDisksFor</tt>:<br />
<br />
Prelude> :l cd-fit.hs<br />
Compiling Main ( cd-fit.hs, interpreted )<br />
Ok, modules loaded: Main.<br />
*Main> :set +s<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 0<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.06 secs, 1277972 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 10<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.00 secs, 0 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 100<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.01 secs, 1519064 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 1000<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.03 secs, 1081808 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 10000<br />
DirPack {pack_size = 0, dirs = []}<br />
(1.39 secs, 12714088 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 100000<br />
Interrupted.<br />
<br />
Aha! This seems to be the problem: computation of element 100000 fails to terminate in "reasonable" time, and to think that we have tried to compute the <tt>700*1024*1024</tt>th element...<br />
<br />
Let's modify our code a bit to allow the disk size to be tweaked:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-3.hs'<br />
dynamic_pack limit dirs = (precomputeDisksFor dirs)!!(fromInteger limit)<br />
<br />
prop_dynamic_pack_is_fixpoint ds =<br />
  let pack = dynamic_pack media_size ds<br />
  in pack_size pack == pack_size (dynamic_pack media_size (dirs pack))<br />
<br />
prop_dynamic_pack_small_disk ds =<br />
  let pack = dynamic_pack 50000 ds<br />
  in pack_size pack == pack_size (dynamic_pack 50000 (dirs pack))<br />
<br />
-- rename "old" main to "moin"<br />
main = quickCheck prop_dynamic_pack_small_disk<br />
</haskell><br />
<br />
Compile a profiling version of your code with <tt>ghc -O --make -prof -auto-all -o cd-fit cd-fit.hs</tt> and run it like this: <br />
<br />
$ ./cd-fit +RTS -p<br />
OK, passed 100 tests.<br />
<br />
First, note that our code satisfies at least one simple property. Good. Now let's examine the profile. Look into the file "cd-fit.prof", which was produced in your current directory. <br />
<br />
Most probably, you'll see something like this:<br />
<br />
cd-fit +RTS -p -RTS<br />
<br />
total time = 2.18 secs (109 ticks @ 20 ms)<br />
total alloc = 721,433,008 bytes (excludes profiling overheads)<br />
<br />
COST CENTRE MODULE %time %alloc<br />
<br />
precomputeDisksFor Main 88.1 99.8<br />
dynamic_pack Main 11.0 0.0<br />
<br />
individual inherited<br />
COST CENTRE MODULE no. entries %time %alloc %time %alloc<br />
<br />
MAIN MAIN 1 0 0.0 0.0 100.0 100.0<br />
CAF Main 174 11 0.9 0.2 100.0 100.0<br />
prop_dynamic_pack_small_disk Main 181 100 0.0 0.0 99.1 99.8<br />
dynamic_pack Main 182 200 11.0 0.0 99.1 99.8<br />
precomputeDisksFor Main 183 200 88.1 99.8 88.1 99.8<br />
main Main 180 1 0.0 0.0 0.0 0.0<br />
<br />
Examine the column "individual %alloc". As we thought, all memory was<br />
allocated within <tt>precomputeDisksFor</tt>. However, the amount of<br />
memory allocated (more than 700 MB, according to the "total<br />
alloc" line) seems to be a little too much for our simple task. We will dig<br />
deeper and find where we are wasting it.<br />
<br />
Let's examine memory consumption a little closer via so-called "heap<br />
profiles". Run <tt>./cd-fit +RTS -hb</tt>. This produces a "biographical<br />
heap profile", which tells us how various parts of the memory were<br />
used during the program's run time. The heap profile is saved to<br />
"cd-fit.hp". It is next to impossible to read and comprehend as is,<br />
so use "hp2ps cd-fit.hp" to produce a nice PostScript picture which<br />
is worth a thousand words. View it with "gv" or "ghostview" or "full<br />
Adobe Acrobat (not Reader)". (This and subsequent pictures are<br />
'''not''' attached here.)<br />
<br />
Notice that most of the graph is taken up by the region marked "VOID". <br />
This means that the memory allocated was never used. Notice that there are<br />
'''no''' areas marked "USE", "LAG" or "DRAG". It seems our<br />
program hardly uses '''any''' of the allocated memory at all. Wait a<br />
minute! How could that be? Surely it must use something when it packs<br />
those randomly-generated directories, 10 to 1400 Mb in size, onto<br />
imaginary disks of 50000 bytes... Oops. Severe size<br />
mismatch. We should have spotted it earlier, when we were timing<br />
<tt>precomputeDisksFor</tt>. Scroll back and observe how each run<br />
returned the very same result: an empty directory set.<br />
<br />
Our random directories are too big, but the code nevertheless spends time<br />
and memory trying to "pack" them. Obviously,<br />
<tt>precomputeDisksFor</tt> (which is responsible for 90% of the total<br />
memory consumption and run time) is flawed in some way.<br />
<br />
Let's take a closer look at what takes up so much memory. Run<br />
<tt>./cd-fit +RTS -h -hbvoid</tt> and produce a PostScript picture for<br />
this memory profile. This will give us a detailed breakdown of all<br />
memory whose "biography" shows that it has been "VOID" (unused). My<br />
picture (and, I presume, yours as well) shows that the VOID memory<br />
consists of "thunks" labeled "precomputeDisksFor/pre...". We can<br />
safely assume that the second word is "precomp". (You wonder why?<br />
Look again at the code and try to find a function named "pre.*" which is<br />
called from inside <tt>precomputeDisksFor</tt>.)<br />
<br />
This means that the memory has been taken by the list generated inside<br />
"precomp". Rumor has it that memory leaks in Haskell are caused by<br />
either too little laziness or too much laziness. It seems we have<br />
too little laziness here: we evaluate more elements of the list than<br />
we actually need and keep them from being garbage-collected. <br />
<br />
Note how we look up an element of "precomp" in this piece of code:<br />
<br />
<haskell><br />
case [ DirPack (dir_size d + s) (d:ds)<br />
     | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
     , dir_size d > 0<br />
     , let (DirPack s ds) = precomp!!(fromInteger (limit - dir_size d))<br />
     , d `notElem` ds<br />
     ] of<br />
</haskell><br />
<br />
<br />
Obviously, the whole list generated by "precomp" must be kept in<br />
memory for such lookups, since we can't be sure that some element<br />
can be garbage-collected and will not be needed again.<br />
<br />
Let's rewrite the code to eliminate the list (incidentally, this will also deal with the possible Int overflow when accessing "precomp" via the (!!) operator):<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-4.hs'<br />
-- Let the `bestDisk x' be the "most tightly packed" disk of total <br />
-- size no more than `x'.<br />
-- How to calculate `bestDisk'? Let's opt for a recursive definition:<br />
-- Recursion base: the best-packed disk of size 0 is empty, and the best-packed<br />
-- disk for an empty list of directories is also empty.<br />
bestDisk 0 _ = DirPack 0 []<br />
bestDisk _ [] = DirPack 0 []<br />
-- Recursion step: for size `limit', bigger than 0, the best-packed disk is<br />
-- computed as follows:<br />
bestDisk limit dirs =<br />
  -- Take all non-empty dirs that could possibly fit on that disk by themselves.<br />
  -- Consider them one by one. Let the size of a particular dir be `dir_size d'.<br />
  -- Let's add it to the best-packed disk of size <= (limit - dir_size d), thus<br />
  -- producing a disk of size <= limit. Let's do that for all "candidate"<br />
  -- dirs that are not yet on our disk:<br />
  case [ DirPack (dir_size d + s) (d:ds)<br />
       | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
       , dir_size d > 0<br />
       , let (DirPack s ds) = bestDisk (limit - dir_size d) dirs<br />
       , d `notElem` ds<br />
       ] of<br />
    -- We either fail to add any dirs (probably because all of them are too big).<br />
    -- Well, just report that the disk must be left empty:<br />
    [] -> DirPack 0 []<br />
    -- Or we produce some alternative packings. Let's choose the best of them all:<br />
    packs -> maximumBy cmpSize packs<br />
<br />
cmpSize a b = compare (pack_size a) (pack_size b)<br />
<br />
dynamic_pack limit dirs = bestDisk limit dirs<br />
</haskell><br />
<br />
<br />
Compile the profiling version of this code and obtain the overall<br />
execution profile (with "+RTS -p"). You'll get something like this:<br />
<br />
cd-fit +RTS -p -RTS<br />
<br />
total time = 0.00 secs (0 ticks @ 20 ms)<br />
total alloc = 1,129,520 bytes (excludes profiling overheads)<br />
<br />
COST CENTRE MODULE %time %alloc<br />
<br />
CAF GHC.Float 0.0 4.4<br />
main Main 0.0 93.9<br />
<br />
individual inherited<br />
COST CENTRE MODULE no. entries %time %alloc %time %alloc<br />
MAIN MAIN 1 0 0.0 0.0 0.0 100.0<br />
main Main 180 1 0.0 93.9 0.0 94.2<br />
prop_dynamic_pack_small_disk Main 181 100 0.0 0.0 0.0 0.3<br />
dynamic_pack Main 182 200 0.0 0.2 0.0 0.3<br />
bestDisk Main 183 200 0.0 0.1 0.0 0.1<br />
<br />
We achieved a major improvement: memory consumption is reduced by a factor<br />
of 700! Now we can test the code on the "real task". Change the<br />
code to run the test for packing a full-sized disk:<br />
<br />
<haskell><br />
main = quickCheck prop_dynamic_pack_is_fixpoint<br />
</haskell><br />
<br />
Compile with profiling and run (with "+RTS -p"). If you are unlucky<br />
and a considerably big test set is randomly generated for your<br />
run, you'll have to wait. And wait even more. And more.<br />
<br />
Go make some tea. Drink it. Read some Tolstoy (do you have "War and<br />
Peace" handy?). Chances are that by the time you are done with<br />
Tolstoy, the program will still be running (just take my word for it, don't<br />
check).<br />
<br />
If you are lucky, your program will finish fast enough to leave you<br />
with a profile. According to it, the program spends 99% of its time<br />
inside <tt>bestDisk</tt>. Could we speed up <tt>bestDisk</tt> somehow?<br />
<br />
Note that <tt>bestDisk</tt> performs several simple calculations for<br />
which it must call itself. However, this is done rather inefficiently:<br />
each time we pass <tt>bestDisk</tt> the exact same set of<br />
directories it was originally called with, even if we have already "packed"<br />
some of them. Let's amend this:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-5.hs'<br />
case [ DirPack (dir_size d + s) (d:ds)<br />
     | let small_enough = filter ( (inRange (0,limit)).dir_size ) dirs<br />
     , d <- small_enough<br />
     , dir_size d > 0<br />
     , let (DirPack s ds) = bestDisk (limit - dir_size d) (delete d small_enough)<br />
     ] of<br />
</haskell><br />
<br />
Recompile and run again. Runtimes could be lengthy, but bearable, and<br />
the number of times <tt>bestDisk</tt> is called (according to the profile)<br />
should decrease significantly. <br />
<br />
Finally, let's compare both packing algorithms. Intuitively, we feel<br />
that the greedy algorithm should produce worse results, don't we? Let's put<br />
this feeling to the test:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-5.hs'<br />
prop_greedy_pack_is_no_better_than_dynamic_pack ds =<br />
pack_size (greedy_pack ds) <= pack_size (dynamic_pack media_size ds)<br />
</haskell><br />
<br />
Verify that it is indeed so by running <tt>quickCheck</tt> on this<br />
test several times. I feel that this concludes our knapsacking<br />
exercises. <br />
<br />
Adventurous readers can continue further by implementing so-called<br />
"scaling" for <tt>dynamic_pack</tt>, where we divide all directory<br />
sizes and the medium size by the size of the smallest directory, to proceed<br />
with smaller numbers (which promises faster runtimes).<br />
<br />
== Chapter 5: (Ab)using monads and destructing constructors for fun and profit ==<br />
<br />
We have already mentioned monads quite a few times. They are described in<br />
numerous articles and tutorials (see Chapter 400). It's hard to read a<br />
daily dose of any Haskell mailing list without coming across the word<br />
"monad" a dozen times.<br />
<br />
Since we have already made quite a lot of progress with Haskell, it's time we<br />
revisit monads once again. I will let the other sources teach you the<br />
theory behind monads, the overall usefulness of the concept, etc.<br />
Instead, I will focus on providing you with examples.<br />
<br />
Let's take a part of a real-world program which involves XML<br />
processing. We will work with XML tag attributes, which are<br />
essentially named values:<br />
<haskell><br />
-- Taken from 'chapter5-1.hs'<br />
type Attribute = (Name, AttValue)<br />
</haskell><br />
<br />
'Name' is a plain string, and the value can be '''either''' a string or<br />
a list of references (also strings) to other attributes which hold the actual<br />
values (now, this is not a valid XML thing, but for the sake of<br />
providing a nice example, let's accept it). The word "either" suggests<br />
that we use the 'Either' datatype:<br />
<haskell><br />
type AttValue = Either Value [Reference]<br />
type Name = String<br />
type Value = String<br />
type Reference = String<br />
<br />
-- Sample list of simple attributes:<br />
simple_attrs = [ ( "xml:lang", Left "en" )<br />
, ( "xmlns", Left "jabber:client" )<br />
, ( "xmlns:stream", Left "http://etherx.jabber.org/streams" ) ]<br />
<br />
-- Sample list of attributes with references:<br />
complex_attrs = [ ( "xml:lang", Right ["lang"] )<br />
, ( "lang", Left "en" )<br />
, ( "xmlns", Right ["ns","subns"] )<br />
, ( "ns", Left "jabber" )<br />
, ( "subns", Left "client" )<br />
, ( "xmlns:stream", Left "http://etherx.jabber.org/streams" ) ]<br />
</haskell><br />
<br />
'''Our task is:''' to write a function that will look up the value of an<br />
attribute by its name in the given list of attributes. When an<br />
attribute contains reference(s), we resolve them (looking up the<br />
referenced attributes in the same list) and concatenate their values,<br />
separated by a colon. Thus, looking up the attribute "xmlns" from both<br />
sample sets of attributes should return the same value.<br />
<br />
Following the example set by the <hask>Data.List.lookup</hask> from<br />
the standard libraries, we will call our function<br />
<hask>lookupAttr</hask> and it will return <hask>Maybe Value</hask>,<br />
allowing for lookup errors:<br />
<br />
<haskell><br />
-- Taken from 'chapter5-1.hs'<br />
lookupAttr :: Name -> [Attribute] -> Maybe Value<br />
-- Since we don't have code for 'lookupAttr' yet, but want<br />
-- the code to compile already, we use the function 'undefined' to<br />
-- provide a default, "always-fail-with-runtime-error" function body.<br />
lookupAttr = undefined<br />
</haskell><br />
<br />
Let's try to code <hask>lookupAttr</hask> using <hask>lookup</hask> in<br />
a very straightforward way:<br />
<br />
<haskell><br />
-- Taken from 'chapter5-1.hs'<br />
import Data.List<br />
<br />
lookupAttr :: Name -> [Attribute] -> Maybe Value<br />
lookupAttr nm attrs = <br />
-- First, we lookup 'Maybe AttValue' by name and<br />
-- check whether we are successful:<br />
case (lookup nm attrs) of<br />
-- Pass the lookup error through.<br />
Nothing -> Nothing <br />
-- If the given name exists, see whether it is a value or a reference:<br />
Just attv -> case attv of<br />
-- It's a value. Return it!<br />
Left val -> Just val<br />
-- It's a list of references :(<br />
-- We have to look them up, accounting for<br />
-- possible failures.<br />
-- First, we will perform lookup of all references ...<br />
Right refs -> <br />
let vals = [ lookupAttr ref attrs | ref <- refs ]<br />
-- .. then, we will exclude lookup failures<br />
wo_failures = filter (/=Nothing) vals<br />
-- ... find a way to remove annoying 'Just' wrapper<br />
stripJust (Just v) = v<br />
-- ... use it to extract all lookup results as strings<br />
strings = map stripJust wo_failures<br />
in<br />
-- ... finally, combine them into single String. <br />
-- If all lookups failed, we should pass failure to caller.<br />
case null strings of<br />
True -> Nothing<br />
False -> Just (concat (intersperse ":" strings))<br />
</haskell><br />
<br />
Testing:<br />
<br />
*Main> lookupAttr "xmlns" complex_attrs<br />
Just "jabber:client"<br />
*Main> lookupAttr "xmlns" simple_attrs<br />
Just "jabber:client"<br />
*Main><br />
<br />
It works, but ... it seems strange that such a boatload of code is<br />
required for quite a simple task. If you examine the code closely,<br />
you'll see that the code bloat is caused by:<br />
<br />
* the fact that after each step we check whether an error occurred<br />
<br />
* unwrapping Strings from <hask>Maybe</hask> and <hask>Either</hask> data constructors and wrapping them back.<br />
<br />
At this point C++/Java programmers would say that since we just pass<br />
errors upstream, all those cases could be replaced by a single "try<br />
... catch ..." block, and they would be right. Does this mean that<br />
Haskell programmers are reduced to using "case"s, which were already<br />
obsolete 10 years ago?<br />
<br />
Monads to the rescue! As you can read elsewhere (see Chapter 400),<br />
monads are used in advanced ways to construct computations from other<br />
computations. Just what we need - we want to combine several simple<br />
steps (lookup value, lookup reference, ...) into function<br />
<hask>lookupAttr</hask> in a way that would take into account possible<br />
failures.<br />
<br />
Let's start with the code and dissect it afterwards:<br />
<haskell><br />
-- Taken from 'chapter5-2.hs'<br />
import Control.Monad<br />
<br />
lookupAttr' nm attrs = do<br />
-- First, we lookup 'AttValue' by name<br />
attv <- lookup nm attrs<br />
-- See whether it is a value or a reference:<br />
case attv of<br />
-- It's a value. Return it!<br />
Left val -> Just val<br />
-- It's a list of references :(<br />
-- We have to look them up, accounting for<br />
-- possible failures.<br />
-- First, we will perform lookup of all references ...<br />
Right refs -> do vals <- sequence $ map (flip lookupAttr' attrs) refs<br />
-- ... since all failures are already excluded by "monad magic",<br />
-- ... and all 'Just' wrappers have been removed likewise,<br />
-- ... we just combine the values into a single String,<br />
-- ... and return failure if it is empty. <br />
guard (not (null vals))<br />
return (concat (intersperse ":" vals))<br />
</haskell><br />
<br />
'''Exercise''': compile the code, test that <hask>lookupAttr</hask><br />
and <hask>lookupAttr'</hask> really behave in the same way. Try to<br />
write a QuickCheck test for that, defining the <br />
<hask>instance Arbitrary Name</hask> such that arbitrary names will be taken from<br />
names available in <hask>simple_attrs</hask>.<br />
<br />
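In case you want a head start on that exercise, here is one possible shape for the <hask>Arbitrary</hask> plumbing (a sketch, not the full answer: since <hask>Name</hask> is just <hask>String</hask>, we wrap it in a newtype <hask>AttrName</hask> of our own invention to avoid clashing with existing instances, and the placeholder property merely checks that generated names resolve):<br />
<br />
```haskell
import Test.QuickCheck

type Name  = String
type Value = String

-- A cut-down copy of the sample data from this chapter:
simple_attrs :: [(Name, Either Value [Name])]
simple_attrs = [ ("xml:lang", Left "en")
               , ("xmlns", Left "jabber:client") ]

-- 'Name' is a plain String, so we wrap it in a newtype and generate
-- only names that actually occur in 'simple_attrs':
newtype AttrName = AttrName Name deriving Show

instance Arbitrary AttrName where
  arbitrary = fmap AttrName (elements (map fst simple_attrs))

-- With lookupAttr and lookupAttr' in scope, you would compare them here;
-- as a placeholder, we merely check that every generated name resolves:
prop_names_resolve :: AttrName -> Bool
prop_names_resolve (AttrName nm) = lookup nm simple_attrs /= Nothing

main :: IO ()
main = quickCheck prop_names_resolve
```
<br />
Replacing the placeholder property body with <hask>lookupAttr nm simple_attrs == lookupAttr' nm simple_attrs</hask> turns it into the comparison the exercise asks for.<br />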
Well, back to the story. Did you notice the drastic reduction in code size?<br />
If you drop the comments, the code occupies a mere 7 lines instead of 13<br />
- almost a two-fold reduction. How did we achieve this?<br />
<br />
First, notice that we never ever check whether some computation<br />
returns <hask>Nothing</hask> anymore. Yet, try to look up some<br />
non-existent attribute name, and <hask>lookupAttr'</hask> will return<br />
<hask>Nothing</hask>. How does this happen? The secret lies in the fact<br />
that the type constructor <hask>Maybe</hask> is a "monad".<br />
<br />
We use the keyword <hask>do</hask> to indicate that the following block of<br />
code is a sequence of '''monadic actions''', where '''monadic magic'''<br />
has to happen when we use '<-', 'return' or move from one action to<br />
another.<br />
<br />
Different monads have different '''magic'''. The library code says that<br />
the type constructor <hask>Maybe</hask> is such a monad that we can use<br />
<hask><-</hask> to "extract" values from the wrapper <hask>Just</hask> and<br />
use <hask>return</hask> to put them back in the form of<br />
<hask>Just some_value</hask>. When we move from one action in the "do" block to<br />
another, a check happens. If the action returned <hask>Nothing</hask>,<br />
all subsequent computations will be skipped and the whole "do" block<br />
will return <hask>Nothing</hask>.<br />
<br />
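This check is not black magic: it is simply how the bind operation of <hask>Maybe</hask> is defined. Here is a simplified sketch, with primed names so that it does not clash with the real Prelude definitions:<br />
<br />
```haskell
-- A simplified sketch of the Maybe monad's plumbing (the real
-- definitions live in the Prelude as part of 'instance Monad Maybe').
bind' :: Maybe a -> (a -> Maybe b) -> Maybe b
bind' Nothing  _ = Nothing   -- failure: skip the rest of the block
bind' (Just v) f = f v       -- success: feed the value to the next step

return' :: a -> Maybe a
return' v = Just v           -- wrap a plain value back into 'Just'

main :: IO ()
main = do print (bind' (Just 5) (\v -> return' (v + 1)))  -- Just 6
          print (bind' Nothing  (\v -> return' (v + 1)))  -- Nothing
```
<br />
The "do" block in <hask>lookupAttr'</hask> is shorthand for a chain of exactly such binds.<br />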
Try this to understand it all better:<br />
<haskell><br />
*Main> let foo x = do v <- x; return (v+1) in foo (Just 5)<br />
Just 6<br />
*Main> let foo x = do v <- x; return (v+1) in foo Nothing <br />
Nothing<br />
*Main> let foo x = do v <- x; return (Data.Char.ord v) in foo (Just 'a')<br />
Just 97<br />
*Main> let foo x = do v <- x; return (Data.Char.ord v) in foo Nothing <br />
Nothing<br />
*Main> <br />
</haskell><br />
<br />
Do not mind <hask>sequence</hask> and <hask>guard</hask> just for now<br />
- we will get to them in a little while.<br />
<br />
Since we already removed one reason for code bloat, it is time to deal<br />
with the other one. Notice that we have to use <hask>case</hask> to<br />
'''deconstruct''' the value of type <hask>Either Value<br />
[Reference]</hask>. Surely we are not the first to do this, and such<br />
a use case has to be quite a common one. <br />
<br />
Indeed, there is a simple remedy for our case, and it is called<br />
<hask>either</hask>:<br />
<br />
*Main> :t either<br />
either :: (a -> c) -> (b -> c) -> Either a b -> c<br />
<br />
Scary type signature, but here are examples to help you grok it:<br />
<br />
*Main> :t either (+1) (length) <br />
either (+1) (length) :: Either Int [a] -> Int<br />
*Main> either (+1) (length) (Left 5)<br />
6<br />
*Main> either (+1) (length) (Right "foo")<br />
3<br />
*Main> <br />
<br />
Seems like this is exactly our case. Let's replace the<br />
<hask>case</hask> with an invocation of <hask>either</hask>:<br />
<br />
<haskell><br />
-- Taken from 'chapter5-3.hs'<br />
lookupAttr'' nm attrs = do<br />
attv <- lookup nm attrs<br />
either Just (dereference attrs) attv<br />
where<br />
dereference attrs refs = do <br />
vals <- sequence $ map (flip lookupAttr'' attrs) refs<br />
guard (not (null vals))<br />
return (concat (intersperse ":" vals))<br />
</haskell><br />
<br />
It keeps getting better and better :)<br />
<br />
Now, as a semi-exercise, try to understand the meaning of "sequence",<br />
"guard" and "flip" by looking at the following ghci sessions:<br />
<br />
*Main> :t sequence<br />
sequence :: (Monad m) => [m a] -> m [a]<br />
*Main> :t [Just 'a', Just 'b', Nothing, Just 'c']<br />
[Just 'a', Just 'b', Nothing, Just 'c'] :: [Maybe Char]<br />
*Main> :t sequence [Just 'a', Just 'b', Nothing, Just 'c']<br />
sequence [Just 'a', Just 'b', Nothing, Just 'c'] :: Maybe [Char]<br />
<br />
*Main> sequence [Just 'a', Just 'b', Nothing, Just 'c']<br />
Nothing<br />
*Main> sequence [Just 'a', Just 'b', Nothing]<br />
Nothing<br />
*Main> sequence [Just 'a', Just 'b']<br />
Just "ab"<br />
<br />
*Main> :t [putStrLn "a", putStrLn "b"]<br />
[putStrLn "a", putStrLn "b"] :: [IO ()]<br />
*Main> :t sequence [putStrLn "a", putStrLn "b"]<br />
sequence [putStrLn "a", putStrLn "b"] :: IO [()]<br />
*Main> sequence [putStrLn "a", putStrLn "b"]<br />
a<br />
b<br />
<br />
*Main> :t [putStrLn "a", fail "stop here", putStrLn "b"]<br />
[putStrLn "a", fail "stop here", putStrLn "b"] :: [IO ()]<br />
*Main> :t sequence [putStrLn "a", fail "stop here", putStrLn "b"]<br />
sequence [putStrLn "a", fail "stop here", putStrLn "b"] :: IO [()]<br />
*Main> sequence [putStrLn "a", fail "stop here", putStrLn "b"]<br />
a<br />
*** Exception: user error (stop here)<br />
<br />
Notice that for the <hask>Maybe</hask> monad, sequence continues execution<br />
only until the first <hask>Nothing</hask>. The same behavior can be<br />
observed for the IO monad. Take into account that these different behaviors are<br />
not hardcoded into the definition of <hask>sequence</hask>!<br />
<br />
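Indeed, <hask>sequence</hask> is written once and for all, in terms of the monad operations alone. A simplified sketch, equivalent in spirit to the library version:<br />
<br />
```haskell
-- A simplified sketch of the library function 'sequence', renamed so
-- it does not clash with the Prelude. It uses only 'do' and 'return',
-- so each monad's own bind decides what "failure" means.
sequence' :: Monad m => [m a] -> m [a]
sequence' []     = return []
sequence' (x:xs) = do v  <- x            -- run the first action
                      vs <- sequence' xs -- then all the remaining ones
                      return (v:vs)      -- collect the results

main :: IO ()
main = do print (sequence' [Just 'a', Just 'b'])          -- Just "ab"
          print (sequence' [Just 'a', Nothing, Just 'c']) -- Nothing
```
<br />
For <hask>Maybe</hask> the recursion stops at the first <hask>Nothing</hask> simply because bind short-circuits there; for IO the very same code performs the actions one after another.<br />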
Now, let's examine <hask>guard</hask>:<br />
<br />
*Main> let foo x = do v <- x; guard (v/=5); return (v+1) in map foo [Just 4, Just 5, Just 6] <br />
[Just 5,Nothing,Just 7]<br />
<br />
As you can see, it's just a simple way to "stop" execution on some<br />
condition.<br />
<br />
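For <hask>Maybe</hask>, the behavior of <hask>guard</hask> boils down to two equations. The real library version is more general (it works in any MonadPlus), but this Maybe-only sketch is faithful for our purposes:<br />
<br />
```haskell
-- A Maybe-only sketch of the library function 'guard':
guard' :: Bool -> Maybe ()
guard' True  = Just ()   -- condition holds: carry on with the block
guard' False = Nothing   -- condition fails: abort the whole block

demo :: Maybe Int -> Maybe Int
demo x = do v <- x
            guard' (v /= 5)
            return (v + 1)

main :: IO ()
main = print (map demo [Just 4, Just 5, Just 6])  -- [Just 5,Nothing,Just 7]
```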
If you have been hooked on monads, I urge you to read "All About<br />
Monads" right now (link in Chapter 400).<br />
<br />
== Chapter 6: Where do you want to go tomorrow? ==<br />
<br />
As the name implies, the author is open to proposals - where should<br />
we go next? I had networking + xml/xmpp in mind, but it might be too<br />
heavy and too narrow for most of the readers.<br />
<br />
What do you think? Drop me a line.<br />
<br />
== Chapter 400: Monads up close ==<br />
<br />
Read [http://en.wikibooks.org/wiki/Haskell/Understanding_monads this wikibook chapter]. <br />
Then, read [http://horna.org.ua/books/All_About_Monads.pdf "All about monads"] (PDF).<br />
'Nuff said :)<br />
<br />
== Chapter 500: IO up close ==<br />
<br />
Shows that:<br />
<br />
<haskell><br />
c = do a <- someAction<br />
b <- someOtherAction<br />
print (bar b)<br />
print (foo a)<br />
print "done"<br />
</haskell><br />
<br />
is really just syntactic sugar for:<br />
<br />
<haskell><br />
c = someAction >>= \a -><br />
someOtherAction >>= \b -><br />
print (bar b) >><br />
print (foo a) >><br />
print "done"<br />
</haskell><br />
<br />
and explains ">>=" and ">>". Oh wait. This was already explained<br />
in Chapter 400 :)<br />
<br />
== Chapter 9999: Installing Haskell Compiler/Interpreter and all necessary software ==<br />
<br />
There is plenty of material on this on the web and on this wiki. Just go get<br />
yourself an installation of [[GHC]] (6.4 or above) or [[Hugs]] (v200311 or<br />
above) and "[[darcs]]", which we will use for version control.<br />
<br />
== Chapter 10000: Thanks! ==<br />
<br />
Thanks for comments, proofreading, good advice and kind words go to:<br />
Helge, alt, dottedmag, Paul Moore, Ben Rudiak-Gould, Jim Wilkinson,<br />
Andrew Zhdanov (avalez), Martin Percossi, SpellingNazi, Davor<br />
Cubranic, Brett Giles, Stdrange, Brian Chrisman, Nathan Collins,<br />
Anastasia Gornostaeva (ermine), Remi, Ptolomy, Zimbatm,<br />
HenkJanVanTuyl, Miguel, Mforbes, Kartik Agaram, Jake Luck, Ketil<br />
Malde, Mike Mimic, Jens Kubieziel.<br />
<br />
If I should have mentioned YOU and forgot - tell me so.<br />
<br />
Without you I would have stopped after Chapter 1 :)<br />
<br />
Languages: [[Haskellへのヒッチハイカーガイド|jp]], [[Es/Guía de Haskell para autoestopistas|es]]</div>Imzhttps://wiki.haskell.org/index.php?title=Es/Gu%C3%ADa_de_Haskell_para_autoestopistas&diff=39268Es/Guía de Haskell para autoestopistas2011-03-30T19:10:16Z<p>Imz: /* Capítulo 1: Omnipresente “¡Hola mundo!” y otras formas de hacer IO en Haskell */ The pseudocode as human-language sentences: this way it's more obvious each of the 3 lines is an item of our plan, and the plan has an imperative spirit for now</p>
<hr />
<div>{{Es/Traducción en progreso|titulo=Guía de Haskell para autoestopistas|original=Hitchhikers guide to Haskell}}<br />
<br />
== Preámbulo: ¡NO CORRER! ==<br />
<br />
Experiencias recientes de algunos compañeros programadores de C++/Java indican que leyeron varios tutoriales sobre Haskell con "velocidad exponencial" (piensa en cómo las sesiones de TCP/IP aumentan su velocidad). Al principio son pocas y prudentes, pero cuando ven que las primeras 3-5 páginas no contienen "nada interesante" en términos de código y ejemplos, empiezan a saltarse párrafos, después capítulos, después páginas enteras, tan sólo decelerando - a menudo hasta parar completamente - alrededor de la página 50, encontrándose con el grueso de conceptos como "clases de tipos", "constructores de tipos", "IO monádica", punto en el que normalmente les entra el pánico, piensan en una excusa perfectamente racional para no seguir leyendo, y se olvidan alegremente de este triste y escalofriante encuentro con Haskell (ya que los seres humanos tendemos a olvidar las cosas tristes y escalofriantes).<br />
<br />
Este texto pretende introducir al lector a los aspectos prácticos de Haskell desde el principio del todo (los planes para el primer capítulo incluyen: I/O, darcs, Parsec, QuickCheck, depurar y perfilar, por mencionar algunos). Se supone que el lector sabe (o sabe dónde encontrar) por lo menos los primeros pasos de Haskell: cómo ejecutar "hugs" o "ghci", '''que el diseño del código es 2-dimensional''', etc. Aparte de eso, no esperamos avanzar radicalmente, e iremos paso por paso para no perder a los lectores por el camino. Así que NO CORRER, coge la toalla contigo y continúa leyendo.<br />
<br />
'''Si te has saltado el párrafo anterior''', me gustaría destacar una vez más que Haskell es sensible a la indentación y espaciado, así que presta atención cuando hagas copias o alineación manual de código en el editor de texto con fuentes proporcionales.<br />
<br />
Ah, casi me olvido: el autor está muy interesado en CUALQUIER opinión. Escríbele algunas líneas o palabras (mira [http://www.haskell.org/haskellwiki/User:Adept Adept] para la información de contacto) o propón cambios al tutorial mediante darcs ([http://adept.linux.kiev.ua/repos/hhgtth/ el repositorio está aquí]) o directamente a este Wiki.<br />
<br />
== Capítulo 1: Omnipresente “¡Hola mundo!” y otras formas de hacer IO en Haskell ==<br />
<br />
Cada capítulo será dedicado a una pequeña tarea real que completaremos desde el principio.<br />
<br />
Así que aquí está la tarea para este capítulo: para liberar espacio en tu disco duro para todo el código de Haskell que vas a escribir en un futuro cercano, vas a guardar algo de la vieja y polvorienta información en CDs y DVDs. Mientras que grabar CDs (o DVDs) es fácil hoy en día, normalmente lleva algo (o mucho) de tiempo decidir cómo grabar algunos GB de fotos digitales en CD-Rs, cuando los directorios con las imágenes varían desde 10 a 300 Mb de espacio, y no quieres grabar CDs medio llenos (o medio vacíos).<br />
<br />
El ejercicio consiste en escribir un programa que nos ayude a poner un conjunto de directorios en la cantidad mínima posible de discos, al mismo tiempo que aprovechamos cada disco lo máximo posible. Llamemos a este programa “cd-fit”.<br />
<br />
Oh. Espera. Hagamos el usual programa “hola mundo”, antes de que nos<br />
olvidemos, y después sigamos con cosas más interesantes:<br />
<br />
<haskell><br />
-- Cogido de 'hola.hs'<br />
-- A partir de ahora, un comentario al principio del trozo de código<br />
-- especificará el archivo que contiene el programa entero del que fue cogido<br />
-- el trozo. Puedes coger el código del repositorio de darcs<br />
-- "http://adept.linux.kiev.ua/repos/hhgtth" introduciendo el comando <br />
-- "darcs get http://adept.linux.kiev.ua/repos/hhgtth"<br />
module Main where<br />
main = putStrLn "¡Hola mundo!"<br />
</haskell><br />
<br />
Ejecútalo:<br />
<br />
$ runhaskell ./hola.hs<br />
¡Hola mundo!<br />
<br />
Vale, ya lo hicimos. Sigamos ahora, no hay nada interesante aquí :)<br />
<br />
Cualquier desarrollo serio se debe hacer con un sistema de control de versiones, y no haremos una excepción. Usaremos el moderno sistema de control de versiones distribuido “darcs”. “Moderno” significa que está escrito en Haskell, “distribuido” significa que cada copia que funcione es un repositorio en si mismo.<br />
<br />
Primero, creemos un directorio vacío para todo nuestro código, y ejecuta "darcs init" en él, que creará un subdirectorio "_darcs" para guardar todo lo relacionado con el control de versiones dentro de él.<br />
<br />
Inicia tu editor favorito y crea un fichero nuevo llamado "cd-fit.hs" en nuestro directorio de trabajo. Ahora pensemos por un momento cómo funcionará nuestro programa y expresémoslo en pseudocódigo:<br />
<br />
<haskell><br />
main = Leer lista de directorio y sus tamaños.<br />
Decidir como ajustarlos a los CD-Rs.<br />
Imprimir la solución.<br />
</haskell><br />
<br />
¿Parece lógico? Pienso que sí.<br />
<br />
Vamos a simplificar nuestra vida un poco y asumir por ahora que vamos a calcular el espacio de los directorios de alguna forma fuera de nuestro programa (por ejemplo, con "du -sb *") y leer esa información desde stdin. Convirtamos esto a Haskell:<br />
<br />
<haskell><br />
-- Cogido de 'cd-fit-1-1.hs'<br />
module Main where<br />
<br />
main = do input <- getContents<br />
putStrLn ("DEBUG: tengo entrada" ++ input)<br />
-- calcular solución e imprimirla<br />
</haskell><br />
<br />
No funciona en realidad, pero es bastante parecido al español, ¿no? Paremos por un momento y miremos más de cerca qué hemos escrito, línea por línea.<br />
<br />
Empecemos desde arriba:<br />
<br />
<haskell><br />
-- Cogido de 'cd-fit-1-1.hs'<br />
input <- getContents<br />
</haskell><br />
<br />
Esto es un ejemplo de la sintaxis de Haskell para hacer IO (en este caso, entrada de datos). Esta línea es una instrucción para leer toda la información disponible desde stdin, devolverla como una única cadena, y unirla al símbolo “input”, de forma que podemos procesar esta cadena de la forma que queramos.<br />
<br />
¿Cómo lo sabía? ¿Memoricé todas las funciones? ¡Claro que no! Cada función tiene un tipo, que con su nombre, tiende a decir mucho sobre lo que hace la función.<br />
<br />
Lanza un entorno interactivo de Haskell y examinemos esta función de cerca:<br />
<br />
$ ghci<br />
___ ___ _<br />
/ _ \ /\ /\/ __(_)<br />
/ /_\// /_/ / / | | GHC Interactive, version 6.4.1, for Haskell 98.<br />
/ /_\\/ __ / /___| | http://www.haskell.org/ghc/<br />
\____/\/ /_/\____/|_| Type :? for help.<br />
<br />
Loading package base-1.0 ... linking ... done.<br />
Prelude> :type getContents<br />
getContents :: IO String<br />
Prelude> <br />
<br />
<br />
Vemos que “getContents” es una función sin argumentos, que devuelve un “IO String”. El prefijo “IO” significa que es una acción de IO. Devolverá un String, cuando se evalúe. La acción será evaluada cuando usemos “<-” para enlazar su resultado a un símbolo.<br />
<br />
Ten en cuenta que “<-” no es una forma bonita de asignar un valor a una variable. Es una forma de evaluar (ejecutar) acciones de IO, en otras palabras - para hacer alguna operación de I/O y devolver su resultado (si es que tiene).<br />
<br />
Podemos escoger no evaluar la acción que obtenemos de “getContents”, y en vez de eso dejarla por ahí y evaluarla más tarde:<br />
<br />
<haskell><br />
let x = getContents<br />
-- 300 líneas de código<br />
input <- x<br />
</haskell><br />
<br />
Como puedes ver, las acciones de IO pueden ser usadas como valores ordinarios. Supón que hemos construído una lista de acciones IO y hemos encontrado una forma de ejecutarlas una por una. Sería una forma de simular programación imperativa con su notación de “orden de ejecución”.<br />
<br />
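Esa idea de ejecutar la lista de acciones una por una se puede escribir literalmente. Un bosquejo autocontenido (la librería estándar ya ofrece esta función con el nombre <hask>sequence_</hask>; el nombre <hask>ejecutarTodas</hask> es solo ilustrativo):<br />
<br />
```haskell
-- Ejecuta una lista de acciones de IO, una por una, descartando
-- sus resultados (la función estándar 'sequence_' hace esto mismo).
ejecutarTodas :: [IO ()] -> IO ()
ejecutarTodas []     = return ()           -- lista vacía: nada que hacer
ejecutarTodas (a:as) = do a                -- evalúa la primera acción...
                          ejecutarTodas as -- ...y después todas las demás

main :: IO ()
main = ejecutarTodas [putStrLn "uno", putStrLn "dos", putStrLn "tres"]
```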
Haskell te permite hacerlo mejor.<br />
<br />
La librería estándar del lenguaje (llamada “Prelude”) nos da acceso a muchas funciones que devuelven primitivas de acciones de IO muy útiles. Para combinarlas entre ellas y producir acciones más complejas, usamos “do”:<br />
<br />
<haskell><br />
c = do a <- algunaAcción<br />
b <- algunaOtraAcción<br />
print (bar b)<br />
print (foo a)<br />
putStrLn "hecho"<br />
</haskell><br />
<br />
De esta forma '''asociamos''' “c” a una acción con el siguiente “escenario”:<br />
* '''evalua''' la acción “algunaAcción” y '''asocia''' su resultado a “a”<br />
* después, '''evalua''' “algunaOtraAcción” y '''asocia''' su resultado a “b”<br />
* después, procesa “b” con la función “bar” e imprime su resultado<br />
* después, procesa “a” con la función “foo” e imprime su resultado<br />
* después, imprime la palabra “hecho”<br />
<br />
¿Cuándo se ejecutará todo esto en realidad? Respuesta: tan pronto como evaluemos “c” usando “<-” (si devuelve un resultado, como hace “getContents”) o usándola como el nombre de una función (si no devuelve un resultado, como hace “print”):<br />
<br />
<haskell><br />
process = do putStrLn "Procesará algo"<br />
c<br />
putStrLn "Hecho"<br />
</haskell><br />
<br />
Date cuenta de que hemos cogido unas cuantas funciones (“algunaAcción”, “algunaOtraAcción”, “print”, “putStrLn”) y usando “do” creamos a partir de ellas una nueva función, que enlazamos al símbolo “c”. Ahora podemos usar “c” como una pieza de construcción para construir una función más compleja “process”, y podríamos continuar haciendo esto más y más. Finalmente, algunas de las funciones las mencionaremos en el código de la función “main”, función a la cual está asociada la última y más importante acción de IO de cualquier programa de Haskell.<br />
<br />
¿Cuándo se ejecutará/evaluará/forzará “main”? Tan pronto como ejecutemos el programa. Lee esto dos veces y trata de comprenderlo:<br />
<br />
''La ejecución de un programa de Haskell es una evaluación del símbolo “main” a la que hemos asociado una acción de IO. Mediante esta evaluación obtenemos el resultado de la acción''.<br />
<br />
Los lectores que estén familiarizados con programación en C++ o Java avanzado y ese conjunto de conocimiento arcano llamado “Patrones de Diseño OOP” se darán cuenta de que “construir acciones a partir de acciones” y “evaluar acciones para obtener resultados” son esencialmente un “Patrón de comando” y un “Patrón de composición” combinados. Buenas noticias: en Haskell los tienes para todo tu IO, y los tienes '''gratis''' :)<br />
<br />
----<br />
<br />
'''Ejercicio:''' Fíjate en el siguiente código:<br />
<br />
<haskell><br />
-- Cogido de 'exercise-1-1.hs'<br />
module Main where<br />
c = putStrLn "C!"<br />
<br />
combine before after =<br />
do before<br />
putStrLn "En el medio"<br />
after<br />
<br />
main = do combine c c<br />
let b = combine (putStrLn "¡Hola!") (putStrLn "¡Adios!")<br />
let d = combine (b) (combine c c)<br />
putStrLn "¡Tanto tiempo!"<br />
</haskell><br />
<br />
¿Te das cuenta de cómo indentamos las líneas con cuidado para que el código parezca limpio? En realidad, el código de Haskell tiene que estar alineado de esta forma, o si no, no compilará. Si usas tabuladores para indentar tu código<br />
fuente, ten en cuenta que los compiladores de Haskell asumen que el tabstop tiene un ancho de 8 caracteres.<br />
<br />
A menudo las personas se quejan de que es muy difícil escribir en Haskell porque requiere que alinees el código. En realidad, no es verdad. Si alineas tu código, el compilador adivinará cuál es el principio y el final de los bloques sintácticos. Sin embargo, si no indentas tu código, puedes especificarlos explícitamente en cada una de las expresiones y usar una distribución arbitraria como en este ejemplo:<br />
<br />
<haskell><br />
-- Cogido de 'exercise-1-2.hs'<br />
combine before after = <br />
do { before; <br />
putStrLn "En el medio"; <br />
after; };<br />
<br />
main = <br />
do { combine c c; let { b = combine (putStrLn "¡Hola!") (putStrLn "¡Adios!")};<br />
let {d = combine (b) (combine c c)}; <br />
putStrLn "¡Tanto tiempo!" };<br />
</haskell><br />
<br />
De vuelta al ejercicio - ¿ves como hacemos código desde la nada? Trata de imaginar que hará este código, después ejecútalo y compruébalo por ti mismo.<br />
<br />
¿Entiendes por qué "¡Hola!" y "¡Adiós!" no se imprimen?<br />
<br />
----<br />
<br />
Examinemos nuestra función “main” más de cerca:<br />
<br />
Prelude> :load cd-fit.hs<br />
Compiling Main ( ./cd-fit.hs, interpreted )<br />
Ok, modules loaded: Main.<br />
*Main> :type main<br />
main :: IO ()<br />
*Main> <br />
<br />
Vemos que “main” es en realidad una acción IO que no devuelve nada cuando la evaluamos. Cuando combinamos acciones con “do”, el tipo del resultado será el tipo de la última acción, y “putStrLn algo” tiene tipo “IO ()”:<br />
<br />
*Main> :type putStrLn "¡Hola mundo!"<br />
putStrLn "¡Hola mundo!" :: IO ()<br />
*Main> <br />
<br />
Ah, por cierto: ¿te has dado cuenta que hemos compilado nuestro primer programa de Haskell para examinar “main”? :)<br />
<br />
Celebrémoslo poniéndolo bajo control de versiones: ejecuta “darcs add cd-fit.hs” y “darcs record”, responde “y” a todas las preguntas y dale al parche el comentario “Esqueleto de cd-fit.hs”.<br />
<br />
Vamos a probar a ejecutarlo:<br />
<br />
$ echo "foo" | runhaskell cd-fit.hs<br />
DEBUG: tengo entrada foo<br />
<br />
----<br />
<br />
'''Ejercicios''':<br />
* Trata de escribir un programa que coja tu nombre de stdin y te felicite (palabras clave: getLine, putStrLn);<br />
* Trata de escribir un programa que pregunte por tu nombre, lo lea, te felicite, pregunte por tu color favorito, y te lo repita (palabras clave: getLine, putStrLn).<br />
<br />
== Capítulo 2: procesando la entrada ==<br />
<br />
Bien, ahora que tenemos un entendimiento cabal de los poderes de ES en<br />
Haskell (y estamos deslumbrados por ellos, espero), nos olvidemos un<br />
poco de ES y hagamos algún trabajo útil.<br />
<br />
Como tu recordarás, nos habíamos propuesto poner datos tomados de<br />
varios directorios en tan pocos discos CD-Rs como sea<br />
posible. Asumimos que "du -sb" calculará el tamaño de los directorios<br />
de entrada y nos dará como salida algo como lo siguiente:<br />
<br />
65572 /home/adept/photos/raw-to-burn/dir1<br />
68268 /home/adept/photos/raw-to-burn/dir2<br />
53372 /home/adept/photos/raw-to-burn/dir3<br />
713124 /home/adept/photos/raw-to-burn/dir4<br />
437952 /home/adept/photos/raw-to-burn/dir5<br />
<br />
Nuestra siguiente tarea es procesar esa entrada y convertirla en<br />
alguna representación interna más cómoda.<br />
<br />
Para ello usaremos la poderosa librería de '''combinadores de<br />
parsers''' llamada "[[Parsec]]", que está presente en la mayoría de<br />
las implementaciones de Haskell.<br />
<br />
Como muchas de las facilidades para ES que hemos visto en el capítulo<br />
anterior, esta libreria provee un conjunto de parsers básicos y medios de <br />
combinarlos para obtener construcciones de parseo más complejas.<br />
<br />
A diferencia de otras herramientas en esta área (lex/yacc o JavaCC,<br />
para nombrar algunas), los parsers de [[Parsec]] no requieren una<br />
etapa de preprocesamiento previa. Dado que en Haskell podemos devolver<br />
funciones como resultado de funciones y de esta manera construir<br />
funciones a partir de "mero aire", no hay necesidad de una sintaxis<br />
separada para la descripción de parsers. Pero ya hicimos suficiente<br />
propaganda, hagamos algún parseo:<br />
<br />
<haskell><br />
-- Tomado de 'cd-fit-2-1.hs'<br />
import Text.ParserCombinators.Parsec<br />
<br />
-- parseInput parsea la entrada de "du -sb", que consiste de muchas líneas,<br />
-- cada una de ellas describe un solo directorio<br />
parseInput = <br />
do dirs <- many dirAndSize<br />
eof<br />
return dirs<br />
<br />
-- El tipo de dato Dir tiene información sobre un solo directorio:<br />
-- su tamaño y su nombre<br />
data Dir = Dir Int String deriving Show<br />
<br />
-- `dirAndSize` parsea la información de un solo directorio, que es:<br />
-- un tamaño en bytes (número), algunos espacios, y luego el nombre del<br />
-- directorio que llega hasta el fin de la línea<br />
dirAndSize = <br />
do size <- many1 digit<br />
spaces<br />
dir_name <- anyChar `manyTill` newline<br />
return (Dir (read size) dir_name)<br />
</haskell><br />
<br />
Simplemente agrega esas líneas al inicio de "cd-fit.hs". Aquí vemos muchas<br />
cosas nuevas, y muchas que ya conocíamos.<br />
<br />
Primero que nada, notemos la conocida construcción "do", que, como<br />
sabemos, se usa para combinar acciones de ES para producir nuevas<br />
acciones de ES. Aquí la usamos para combinar acciones de "parseo" en<br />
nuevas acciones de "parseo". ¿Significa eso que "parseo" implica "hacer<br />
ES"? Absolutamente no. Es decir, tengo que admitir que te mentí - "do"<br />
no sólo se usa para combinar acciones de ES. "do" se usa para combinar<br />
cualquier tipo de las así llamadas ''acciones monádicas'' o ''valores<br />
monádicos''.<br />
<br />
Piensa en las [[monad|mónadas]] como un "[[:Category:Idioms|patrón de<br />
diseño]]" en el mundo funcional. Las [[monad|mónadas]] son una forma<br />
de ocultar al usuario (programador) toda la maquinaria necesaria para <br />
que operen funcionalidades complejas.<br />
<br />
Como habrás oído, Haskell no tiene ninguna noción de "asignación",<br />
"estado mutable" ni "variables", y es un "lenguaje funcional puro",<br />
lo que significa que toda función invocada con los mismos parámetros<br />
devolverá exactamente los mismos resultados. A la vez, "hacer ES"<br />
requiere acarrear descriptores de ficheros y sus estados y lidiar<br />
con errores de ES. El "parseo" requiere llevar registro de la posición<br />
en la entrada y lidiar con errores de parseo.<br />
<br />
En ambos casos Hombres Sabios que Escribieron Librerías tomaron las<br />
precauciones por nosotros y nos ocultaron toda la complejidad,<br />
exponiendo la<br />
[http://es.wikipedia.org/wiki/Application_programming_interface API]<br />
de sus librerías (ES y parseo) en la forma de "acciones monádicas",<br />
que nosotros podemos combinar libremente como nos parezca.<br />
<br />
Think of programming with monads as doing a remodeling job with the<br />
help of a crew of remodeling professionals. You describe the sequence<br />
of actions on paper (that's us writing in the "do" notation), and<br />
later, when needed, that sequence is carried out by the professionals<br />
("in the monad"), which gives you the final result while hiding all<br />
the underlying complexity from you (how to mix the paint, which nails<br />
to choose, etc).<br />
<br />
Let's use the interactive Haskell environment to decipher all the<br />
instructions we have written for the parsing library. As usual, we<br />
will go from top to bottom:<br />
<br />
*Main> :reload<br />
Ok, modules loaded: Main.<br />
*Main> :t parseInput<br />
parseInput :: GenParser Char st [Dir]<br />
*Main> :t dirAndSize<br />
dirAndSize :: GenParser Char st Dir<br />
*Main> <br />
<br />
<br />
Assuming (well, take my word for it) that "GenParser Char st" is our<br />
parsing monad, we can see that "parseInput", when evaluated, will<br />
produce a list of "Dir"s, and that "dirAndSize", when evaluated, will<br />
produce a "Dir". Assuming that "Dir" somehow represents the<br />
information about a single directory, this is almost what we wanted,<br />
isn't it?<br />
<br />
Let's see what a "Dir" means. We defined the Dir ''data [[type]]'' as<br />
a record containing an Int and a String:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-2-1.hs'<br />
data Dir = Dir Int String deriving Show<br />
</haskell><br />
<br />
To build such records, we use the Dir ''data [[constructor]]'':<br />
<br />
*Main> :t Dir 1 "foo"<br />
Dir 1 "foo" :: Dir<br />
<br />
To avoid confusing newcomers, we could have written:<br />
<br />
<haskell><br />
data Dir = D Int String deriving Show<br />
</haskell><br />
<br />
, which defines the "Dir" ''data [[type]]'' with the ''data<br />
[[constructor]]'' "D". However, traditionally the name of a data<br />
[[type]] and of its [[constructor]] are the same.<br />
<br />
The "[[deriving]] Show" clause tells the compiler to generate, behind<br />
the scenes, all the code needed for this ''data type'' to comply with<br />
the interface of the Show ''[[class|typeclass]]''. We will explain<br />
''[[class|typeclasses]]'' later; for now let's just say that this<br />
allows us to "print" values of type "Dir".<br />
<br />
'''Exercises:'''<br />
* examine the types of "digit", "anyChar", "many", "many1" and "manyTill" to see how they are used to build complex parsers out of simpler ones.<br />
<br />
* compare the types of "manyTill", "manyTill anyChar" and "manyTill anyChar newline". Note that "anyChar `manyTill` newline" is just syntactic sugar for the latter. Note that when we do not supply all the arguments a function needs, we do not get a value, but a new function; this is called ''partial application''.<br />
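Partial application works for any function, not only parsers. A minimal sketch with a plain function (no Parsec needed; the names are made up for illustration):<br />
<br />
```haskell
add :: Int -> Int -> Int
add x y = x + y

-- Supplying only the first argument yields a new function of one argument:
addFive :: Int -> Int
addFive = add 5

main :: IO ()
main = print (addFive 37)   -- prints 42
```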
Fine, so far we have combined several primitive parsing actions to get<br />
ourselves a parser for the output of "du -sb". Now, how do we actually<br />
parse something? The [[Parsec]] library provides the "parse" function:<br />
<br />
*Main> :t parse<br />
parse :: GenParser tok () a<br />
-> SourceName<br />
-> [tok]<br />
-> Either ParseError a<br />
*Main> :t parse parseInput<br />
parse parseInput :: SourceName -> [Char] -> Either ParseError [Dir]<br />
*Main> <br />
<br />
At first the [[type]] may look a bit cryptic, but once we supply the<br />
parser we built, the compiler has more information and shows us a more<br />
specific [[type]].<br />
<br />
Stop here and consider this for a moment: the compiler figured out the<br />
type without us writing a single type annotation! Imagine if a Java<br />
compiler deduced types for us, and we never had to specify the types<br />
of arguments and return values.<br />
<br />
Now back to the code. We can see that "parse" is a function which,<br />
given a parser, the name of a file or channel (e.g. "stdin") and input<br />
data (a String, which is a list of "Char"s, written "[Char]"), will<br />
produce either a parse error or a list of "Dir"s.<br />
<br />
The "Either" data type is an example of a data type whose constructors<br />
have names different from the name of the data type itself. In fact,<br />
"Either" has two constructors:<br />
<br />
<haskell><br />
data Either a b = Left a | Right b<br />
</haskell><br />
<br />
To better understand what this means, consider the following example:<br />
<br />
*Main> :t Left 'a'<br />
Left 'a' :: Either Char b<br />
*Main> :t Right "aaa"<br />
Right "aaa" :: Either a [Char]<br />
*Main> <br />
<br />
You can see that "Either" is a ''disjoint union'' (quite similar to a<br />
"union" in C/C++), which can hold a value of one of two data types.<br />
However, unlike a C/C++ "union", if we had a value of type<br />
"Either Int Char" we could tell immediately whether it is an Int or a<br />
Char - by looking at the constructor used to produce the value. Such<br />
data types are called "disjoint unions" and are a<br />
[[:Category:Idioms|powerful tool]] in the Haskell toolbox.<br />
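The constructor tags are what make the two cases distinguishable at run time; here is a small sketch that pattern-matches on an Either value (the `describe` helper is invented for illustration):<br />
<br />
```haskell
-- The constructor (Left or Right) tells us which type the value holds:
describe :: Either Int Char -> String
describe (Left n)  = "an Int: " ++ show n
describe (Right c) = "a Char: " ++ [c]

main :: IO ()
main = do putStrLn (describe (Left 42))
          putStrLn (describe (Right 'x'))
```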
<br />
Have you noticed that we gave "parse" a parser, which is a monadic<br />
value, yet did not get a new monadic value back, but a parsing result?<br />
That is because "parse" is an evaluator for the "Parser" monad, just<br />
as the [[GHC]] or [[Hugs]] runtime is an evaluator for the IO monad.<br />
The "parse" function implements all the monadic machinery: it keeps<br />
track of errors and positions in the input, implements backtracking<br />
and lookahead, etc.<br />
<br />
<br />
Let's extend our "main" function so that it uses "parse", actually<br />
parses the input and shows us the parsed structures:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-2-1.hs'<br />
main = do input <- getContents<br />
          putStrLn ("DEBUG: got input " ++ input)<br />
          let dirs = case parse parseInput "stdin" input of<br />
                          Left err -> error $ "Input:\n" ++ show input ++<br />
                                              "\nError:\n" ++ show err<br />
                          Right result -> result<br />
          putStrLn "DEBUG: parsed:"; print dirs<br />
</haskell><br />
<br />
'''Exercises:'''<br />
<br />
* To better understand the following piece of code, examine (with ghci<br />
or hugs) the difference between 'drop 1 ( drop 1 ( drop 1 ( drop 1 ( drop 1 "foobar" ))))' and 'drop 1 $ drop 1 $ drop 1 $ drop 1 $ drop 1 "foobar"'. Examine the type of ($).<br />
<br />
* Try 'putStrLn "aaa"' and 'print "aaa"', look at the difference, and examine their types.<br />
<br />
* Try 'print (Dir 1 "foo")' and 'putStrLn (Dir 1 "foo")'. Examine the types of print and putStrLn to understand the behavior in each case.<br />
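For the first exercise, here is a small sketch showing both spellings side by side; ($) is just function application with the lowest precedence, so it saves the nested parentheses:<br />
<br />
```haskell
main :: IO ()
main = do print (drop 1 (drop 1 (drop 1 "foobar")))  -- "bar"
          print $ drop 1 $ drop 1 $ drop 1 "foobar"  -- "bar" again
```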
<br />
Let's try to run what we have so far:<br />
<br />
$ du -sb * | runhaskell ./cd-fit.hs<br />
<br />
DEBUG: got input 22325 Article.txt<br />
18928 Article.txt~<br />
1706 cd-fit.hs<br />
964 cd-fit.hs~<br />
61609 _darcs<br />
<br />
DEBUG: parsed:<br />
[Dir 22325 "Article.txt",Dir 18928 "Article.txt~",Dir 1706 "cd-fit.hs",Dir 964 "cd-fit.hs~",Dir 61609 "_darcs"]<br />
<br />
It seems to be doing exactly what we planned. Now let's try some<br />
invalid input:<br />
<br />
$ echo "foo" | runhaskell cd-fit.hs<br />
DEBUG: got input foo<br />
<br />
DEBUG: parsed:<br />
*** Exception: Input:<br />
"foo\n"<br />
Error:<br />
"stdin" (line 1, column 1):<br />
unexpected "f"<br />
expecting digit or end of input<br />
<br />
It seems to behave properly.<br />
<br />
If you followed the advice to put your code under version control,<br />
you can use "darcs whatsnew" or "darcs diff -u" to examine the<br />
changes since the previous version. Using "darcs record" you create a<br />
new version with the changes. As an exercise, first record the<br />
changes outside the "main" function, and then record the ones inside<br />
"main". Use "darcs changes" to examine the list of changes recorded<br />
so far.<br />
<br />
== Chapter 3: Packing the knapsack and checking it with class, too (and don't forget your towel!) ==<br />
<br />
We've had enough preamble. Let's make some CDs.<br />
<br />
As you may have noticed, our problem is a classic one. It is called<br />
the "knapsack problem"<br />
([http://www.google.com/search?hl=es&q=problema+mochila google it], if you don't know it. There are more than a million results.)<br />
<br />
Let's start with the greedy solution, but first let's slightly modify<br />
our "Dir" data type to make extracting its components easier:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-1.hs'<br />
data Dir = Dir {dir_size::Int, dir_name::String} deriving Show<br />
</haskell><br />
<br />
----<br />
'''Exercise:''' examine the types of "Dir", "dir_size" and "dir_name".<br />
----<br />
<br />
From now on, we can use "dir_size d" to get the size of a directory<br />
and "dir_name d" to get its name, when "d" is of type "Dir".<br />
<br />
The greedy algorithm sorts the directories by size and tries to put<br />
them on the CD one by one, until there is no more room. We will have<br />
to keep track of the directories added to the CD, so let's add<br />
another data type and code this simple packing algorithm:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-1.hs'<br />
import Data.List (sortBy)<br />
<br />
-- DirPack holds a set of directories which are to be stored on single CD.<br />
-- 'pack_size' could be calculated, but we will store it separately to reduce<br />
-- amount of calculation<br />
data DirPack = DirPack {pack_size::Int, dirs::[Dir]} deriving Show<br />
<br />
-- For simplicity, let's assume that we deal with standard 700 Mb CDs for now<br />
media_size = 700*1024*1024<br />
<br />
-- Greedy packer tries to add directories one by one to initially empty 'DirPack'<br />
greedy_pack dirs = foldl maybe_add_dir (DirPack 0 []) $ sortBy cmpSize dirs<br />
  where cmpSize d1 d2 = compare (dir_size d1) (dir_size d2)<br />
<br />
-- Helper function, which only adds directory "d" to the pack "p" when new<br />
-- total size does not exceed media_size<br />
maybe_add_dir p d =<br />
   let new_size = pack_size p + dir_size d<br />
       new_dirs = d:(dirs p)<br />
   in if new_size > media_size then p else DirPack new_size new_dirs<br />
</haskell><br />
<br />
----<br />
I will highlight some areas that you can explore on your own (using<br />
other nice tutorials; I especially recommend "Yet Another Haskell<br />
Tutorial" by Hal Daume):<br />
* We chose to import a single function, "sortBy", from the [[Data.List]] module, not the whole module.<br />
<br />
* Instead of coding the recursive definition of "greedy_pack" case by case, we used a higher-order approach and chose "foldl" as a way to traverse a list. Examine its type. Other useful functions in that category are "map", "foldr", "scanl" and "scanr". Look them up!<br />
<br />
* To sort a list of "Dir"s by size we used a sorting function with a parameterizable ordering - "sortBy". This kind of function, where you supply a custom modifier for a generic library function, is quite common: look up "deleteBy", "deleteFirstsBy", "groupBy", "insertBy", "intersectBy", "maximumBy", "minimumBy", "sortBy", "unionBy".<br />
<br />
* To code the more complex function "maybe_add_dir" we introduced '''local bindings''' with a "let" clause; these definitions can then be used in the body of the function. We used a "where" clause in "greedy_pack" for the same purpose. Read up on "let" and "where" clauses and the differences between them.<br />
<br />
* Note that in order to build a new value of type "DirPack" (in "maybe_add_dir") we did not use the accessor functions "pack_size" and "dirs".<br />
----<br />
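The difference between "let" and "where" can be seen in a tiny sketch; both functions below compute the same thing (the names are invented for illustration):<br />
<br />
```haskell
-- Local binding with 'let ... in':
areaLet :: Double -> Double
areaLet r =
  let sq x = x * x
  in  pi * sq r

-- The same local binding with 'where':
areaWhere :: Double -> Double
areaWhere r = pi * sq r
  where sq x = x * x

main :: IO ()
main = print (areaLet 2 == areaWhere 2)  -- True
```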
<br />
In order to use our greedy packer we must call it from our "main"<br />
function, so let's add a few lines:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-1.hs'<br />
main = do ...<br />
          -- compute solution and print it<br />
          putStrLn "Solution:" ; print (greedy_pack dirs)<br />
</haskell><br />
<br />
Check the sanity of the definitions by (re)loading the code in ghci. Does it compile? Of course it does :) Now, do "darcs record" and add some reasonable message to document the changes.<br />
<br />
It's time to try out our creation. We can do that by running it directly like this:<br />
<br />
$ du -sb ~/DOWNLOADS/* | runhaskell ./cd-fit.hs<br />
<br />
This will demonstrate that our code seems to work. At least this time. How about<br />
establishing with some degree of certainty that our code works correctly, partially and completely, and doing so in a reusable way? In other words, how about writing<br />
some tests?<br />
<br />
Java programmers, accustomed to JUnit, probably just thought of screens and screens of hand-written boilerplate and method invocations. Fear not, we won't be doing anything that silly :)<br />
<br />
Presenting '''[[QuickCheck]]'''.<br />
<br />
[[QuickCheck]] is a tool that performs automated testing of your functions using (semi) random input data. Following the idea that "100 bytes of code examples are worth 1k of praise", let's show the code that verifies the following "property": an attempt to pack the directories returned by "greedy_pack" must produce a "DirPack" of exactly the same size.<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-2.hs'<br />
import Test.QuickCheck<br />
import Control.Monad (liftM2, replicateM)<br />
<br />
-- We must teach QuickCheck how to generate arbitrary "Dir"s<br />
instance Arbitrary Dir where<br />
  -- Let's just skip "coarbitrary" for now, ok?<br />
  -- I promise, we will get back to it later :)<br />
  coarbitrary = undefined<br />
  -- We generate arbitrary "Dir" by generating random size and random name<br />
  -- and stuffing them inside "Dir"<br />
  arbitrary = liftM2 Dir gen_size gen_name<br />
    -- Generate random size between 10 and 1400 Mb<br />
    where gen_size = do s <- choose (10,1400)<br />
                        return (s*1024*1024)<br />
          -- Generate random name 1 to 300 chars long, consisting of symbols "fubar/"<br />
          gen_name = do n <- choose (1,300)<br />
                        replicateM (n*10+1) (elements "fubar/")<br />
<br />
-- For convenience and by tradition, all QuickCheck tests begin with prefix "prop_".<br />
-- Assume that "ds" will be a random list of "Dir"s and code your test.<br />
prop_greedy_pack_is_fixpoint ds =<br />
   let pack = greedy_pack ds<br />
   in pack_size pack == pack_size (greedy_pack (dirs pack))<br />
</haskell><br />
<br />
Let's run the test, after which I'll explain how it works:<br />
<br />
Prelude> :r<br />
Compiling Main ( ./cd-fit.hs, interpreted )<br />
Ok, modules loaded: Main.<br />
*Main> quickCheck prop_greedy_pack_is_fixpoint<br />
[numbers spinning]<br />
OK, passed 100 tests.<br />
*Main> <br />
<br />
We've seen that our "greedy_pack" was run on 100 completely (well, almost completely) random lists of "Dir"s, and it seems that the property does indeed hold.<br />
<br />
Let's dissect the code. The most intriguing part is "instance Arbitrary Dir where", which declares that "Dir" is an '''[[instance]]''' of the "Arbitrary" '''type[[class]]'''. Whoa, that's a lot of unfamiliar words! :) Let's slow down.<br />
<br />
What is a '''type[[class]]'''? A typeclass is Haskell's way of dealing with the following situation: suppose you are writing a library of useful functions and you don't know in advance exactly how they will be used, so you want to make them generic. On the one hand, you don't want to restrict your users to a certain type (e.g. String). On the other hand, you want to enforce that the arguments to your function satisfy a certain set of constraints. This is where a '''typeclass''' comes in handy.<br />
<br />
Think of a typeclass as a '''contract''' (or "interface", in Java terms) that a type must fulfill in order to be accepted as an argument to certain functions.<br />
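A toy sketch of the "contract" idea (the class and type names below are invented for illustration):<br />
<br />
```haskell
-- The "contract": any type wishing to join must provide 'describe'
class Describable a where
  describe :: a -> String

data Fruit = Apple | Pear

-- Fruit fulfills the contract...
instance Describable Fruit where
  describe Apple = "an apple"
  describe Pear  = "a pear"

-- ...so it may be passed to any function that only demands Describable:
announce :: Describable a => a -> String
announce x = "Here is " ++ describe x

main :: IO ()
main = putStrLn (announce Apple)
```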
<br />
Let's examine the "Arbitrary" typeclass:<br />
<br />
*Main> :i Arbitrary<br />
class Arbitrary a where<br />
arbitrary :: Gen a<br />
coarbitrary :: a -> Gen b -> Gen b<br />
-- Imported from Test.QuickCheck<br />
instance Arbitrary Dir<br />
-- Defined at ./cd-fit.hs:61:0<br />
instance Arbitrary Bool -- Imported from Test.QuickCheck<br />
instance Arbitrary Double -- Imported from Test.QuickCheck<br />
instance Arbitrary Float -- Imported from Test.QuickCheck<br />
instance Arbitrary Int -- Imported from Test.QuickCheck<br />
instance Arbitrary Integer -- Imported from Test.QuickCheck<br />
-- rest skipped --<br />
<br />
It can be read like this: "Any [[type]] (let's call it 'a') can be a member of the [[class]] Arbitrary as long as we define two functions for it: "arbitrary" and "coarbitrary", with the type signatures shown. For the types Dir, Bool, Double, Float, Int and Integer such definitions exist, therefore all those types are instances of the Arbitrary class".<br />
<br />
Now, if you write a function that operates on its arguments only by means of "arbitrary" and "coarbitrary", you can be sure that this function will work on any type that is an instance of "Arbitrary"!<br />
<br />
Let's say it once more. Somebody (maybe you) writes code (an API or library) that requires the input values to implement certain "interfaces", described in terms of functions. Once you show how your type implements that "interface", you are free to use the API or library.<br />
<br />
Consider the function "sort" from the standard library:<br />
<br />
*Main> :t Data.List.sort<br />
Data.List.sort :: (Ord a) => [a] -> [a]<br />
<br />
We see that it sorts lists of any values that are instances of the "Ord" typeclass. Let's examine that class:<br />
<br />
*Main> :i Ord<br />
class Eq a => Ord a where<br />
compare :: a -> a -> Ordering<br />
(<) :: a -> a -> Bool<br />
(>=) :: a -> a -> Bool<br />
(>) :: a -> a -> Bool<br />
(<=) :: a -> a -> Bool<br />
max :: a -> a -> a<br />
min :: a -> a -> a<br />
-- skip<br />
instance Ord Double -- Imported from GHC.Float<br />
instance Ord Float -- Imported from GHC.Float<br />
instance Ord Bool -- Imported from GHC.Base<br />
instance Ord Char -- Imported from GHC.Base<br />
instance Ord Integer -- Imported from GHC.Num<br />
instance Ord Int -- Imported from GHC.Base<br />
-- skip<br />
*Main> <br />
<br />
We see a couple of interesting things: first, there is an additional requirement: in order to be an instance of "Ord", a type must first be an instance of the "Eq" typeclass. Second, we see that there is quite a number of functions that must be defined in order to be an instance of "Ord". Wait a moment, isn't it somewhat silly to have to define (<) and (>) when one can be expressed in terms of the other?<br />
<br />
Of course it is! Typically, typeclasses contain several "default" implementations of their functions, when it is possible to express one through another (as is the case for "Ord"). In that case it suffices to provide a minimal definition (which, in the case of "Ord", consists of any single comparison function), and the rest will be derived automatically. If you supply fewer functions than are required for a minimal implementation, the compiler/interpreter will flag this and explain which functions are still missing.<br />
<br />
Once again, we see that many [[type]]s are already instances of the Ord typeclass, which is why it is possible to sort them.<br />
<br />
Let's look at the definition of "Dir" again:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-2.hs'<br />
data Dir = Dir {dir_size::Int, dir_name::String} deriving Show<br />
</haskell><br />
<br />
Notice the ''[[deriving]]'' clause? It tells the compiler to automatically derive code to make "Dir" an instance of the Show typeclass. The compiler knows about a handful of standard typeclasses (Eq, Ord, Show, Enum, Bounded, Typeable, to name a few) and knows how to make a type a "good enough" instance of any of them. If you want to derive instances of more than one typeclass, say it like this: "deriving (Eq,Ord,Show)". Voila! Now we can compare, sort and print data of that type.<br />
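A quick sketch of deriving several classes at once (a made-up type, just for illustration):<br />
<br />
```haskell
data Color = Red | Green | Blue deriving (Eq, Ord, Show)

main :: IO ()
main = do print (Red == Red)                 -- Eq at work
          print (Red < Blue)                 -- Ord: constructors compare in declaration order
          print (maximum [Green, Blue, Red]) -- Ord again, Show renders the result
```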
<br />
A note for Java programmers: imagine a Java compiler automatically deriving code for "implements Storable"...<br />
<br />
A note for C++ programmers: imagine copy constructors being written for you by the compiler...<br />
<br />
----<br />
'''Exercises:'''<br />
* Examine typeclasses Eq and Show<br />
* Examine types of (==) and "print"<br />
* Try to make "Dir" instance of "Eq"<br />
----<br />
<br />
OK, back to the tests. So, what did we have to do to make "Dir" an instance of "Arbitrary"? The minimal definition consists of "arbitrary". Let's take a closer look at it:<br />
<br />
<br />
*Main> :t arbitrary<br />
arbitrary :: (Arbitrary a) => Gen a<br />
<br />
Notice that "Gen a"? Does it remind you of anything? Right! Think of "IO a" and "Parser a", which we have already seen. This is yet another example of a function that returns actions, which can be used inside the "do" notation. (You might ask yourself, wouldn't it be useful to generalize this convenient concept of actions and "do"? Sure! It's been done already; the concept is called a "[[Monad]]" and we will touch on it in chapter 400 :) )<br />
<br />
Since 'a' here is a [[type variable]] that is an instance of "Arbitrary", we can substitute "Dir" for it. So, how do we create and return an action of type "Gen Dir"?<br />
<br />
Let's look at the code:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-2.hs'<br />
arbitrary = liftM2 Dir gen_size gen_name<br />
  -- Generate random size between 10 and 1400 Mb<br />
  where gen_size = do s <- choose (10,1400)<br />
                      return (s*1024*1024)<br />
        -- Generate random name 1 to 300 chars long, consisting of symbols "fubar/"<br />
        gen_name = do n <- choose (1,300)<br />
                      replicateM (n*10+1) (elements "fubar/")<br />
</haskell><br />
<br />
We used the library functions "choose" and "elements" to build "gen_size :: Gen Int" and "gen_name :: Gen String" (exercise: don't take my word for it. Find a way to verify the types of "gen_name" and "gen_size"). Since "Int" and "String" are the components of "Dir", surely we should be able to use "Gen Int" and "Gen String" to build "Gen Dir". But where is the "do" block for that? There is none; there is just a single call to "liftM2".<br />
<br />
Let's examine it:<br />
<br />
*Main> :t liftM2<br />
liftM2 :: (Monad m) => (a1 -> a2 -> r) -> m a1 -> m a2 -> m r<br />
<br />
Scary, isn't it? Let's give the typechecker some context:<br />
<br />
*Main> :t liftM2 Dir<br />
liftM2 Dir :: (Monad m) => m Int -> m String -> m Dir<br />
<br />
Since you have already heard that "Gen" is a "Monad", you can substitute "Gen" for "m" here, obtaining "liftM2 Dir :: Gen Int -> Gen String -> Gen Dir".<br />
Exactly what we wanted!<br />
<br />
Consider "liftM2" an "advanced topic" of this chapter (we will cover it later) and for now just note that:<br />
<br />
* "2" is the number of arguments of the data constructor "Dir", and we used "liftM2" to build "Gen Dir" out of "Dir"<br />
* There are also "liftM", "liftM3", "liftM4" and "liftM5"<br />
* "liftM2" is defined as "liftM2 f a1 a2 = do x<-a1; y<-a2; return (f x y)"<br />
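To get a feel for "liftM2" without QuickCheck, you can try it in a more familiar monad, Maybe:<br />
<br />
```haskell
import Control.Monad (liftM2)

main :: IO ()
main = do print (liftM2 (+) (Just 1) (Just 2))             -- both sides present: Just 3
          print (liftM2 (+) Nothing (Just 2) :: Maybe Int) -- one side missing: Nothing
```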
<br />
Hopefully all this will make sense after you read it for the third time ;)<br />
<br />
Oh, and by the way - don't forget to "darcs record" your changes!<br />
<br />
== Chapter 4: REALLY packing the knapsack this time ==<br />
<br />
In this chapter we will write another, less trivial, packing method, compare the efficiency of the packing methods and, along the way, learn new things about debugging and profiling Haskell programs.<br />
<br />
It may not be immediately obvious whether our packing algorithm is efficient, and, if it is, in which particular way. How long does it take to complete, how much memory does it consume, is the result it produces of sufficient quality? Are there other algorithms, and how do they compare to each other?<br />
<br />
Let's write another solution to the knapsack packing problem, called the "dynamic programming method", and put both variants to the test.<br />
<br />
This time I will not split the listing up to explain it piece by piece. Instead, I have included comments in the code:<br />
<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-1.hs'<br />
----------------------------------------------------------------------------------<br />
-- Dynamic-programming solution to the problem of packing<br />
-- a knapsack (or rather a disk)<br />
--<br />
-- Let `bestDisk x' be the "most tightly packed" disk of<br />
-- total size no greater than `x'.<br />
precomputeDisksFor :: [Dir] -> [DirPack]<br />
precomputeDisksFor dirs =<br />
      -- By computing `bestDisk' for all possible disk sizes, we can<br />
      -- obtain the solution for a particular case simply by looking<br />
      -- it up in the list of solutions :)<br />
  let precomp = map bestDisk [0..]<br />
<br />
      -- How do we compute `bestDisk'? Let's go for a recursive definition:<br />
      -- Base case of the recursion: the best-packed disk of size 0<br />
      -- is the empty one<br />
      bestDisk 0 = DirPack 0 []<br />
      -- Recursive step: for a size `limit' greater than 0, the best-packed<br />
      -- disk is computed as follows:<br />
      bestDisk limit =<br />
         -- 1. Take all non-empty directories that could possibly fit on that<br />
         -- disk by themselves. Consider them one by one. Let the size of a<br />
         -- particular directory be `dir_size d'. Add it to the best-packed<br />
         -- disk of size <= (limit - dir_size d), producing a disk of<br />
         -- size <= limit. Do this for all "candidate" directories that are<br />
         -- not yet on our disk:<br />
         case [ DirPack (dir_size d + s) (d:ds) | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
              , dir_size d > 0<br />
              , let (DirPack s ds)=precomp!!(limit - dir_size d)<br />
              , d `notElem` ds<br />
              ] of<br />
           -- Either we could not add any directory (probably because they<br />
           -- are all too big); well, report that the disk must stay empty:<br />
           [] -> DirPack 0 []<br />
           -- Or we produced several different packs; select the best of them all:<br />
           packs -> maximumBy cmpSize packs<br />
<br />
      cmpSize a b = compare (pack_size a) (pack_size b)<br />
<br />
  in precomp<br />
<br />
-- Once we precompute disks of all possible sizes for the given set of<br />
-- directories, the solution to a particular problem is simple: just take the<br />
-- solution for the required 'media_size', and that's it.<br />
<br />
dynamic_pack dirs = (precomputeDisksFor dirs) !! media_size<br />
</haskell><br />
<br />
Note that it took about the same amount of text to describe the algorithm and to implement it. Nice, isn't it?<br />
<br />
----<br />
<br />
'''Exercises:'''<br />
* Make all the necessary amendments to the previously written code to make this example compile. Hints: browse the modules Data.List and Data.Ix for the functions that are "missing" - maybe you will find them there (use ":browse Module.Name" at the ghci prompt). Did you have to define new instances of some classes? How did you do it?<br />
* <tt>[ other_function local_binding | x <- some_list, x > 0, let local_binding = some_function x ]</tt> is called a "list comprehension". This is another example of "syntactic sugar", which can lead to nicely readable code but, when abused, can lead to syntactic caries :) Do you understand what this sample does: <tt>let solve x = [ y | x <- [0..], y<-[0..], y == x * x ]</tt>? Could you write (with the help of a decent tutorial) a de-sugared version of it? (Yes, I know that finding a square root does not require list traversals, but for the sake of self-education try and do it)<br />
* Notice that in order to code the quite complex implementation of <tt>precomputeDisksFor</tt> we split it up into several smaller pieces and put them in as '''local bindings''' inside a '''let''' clause.<br />
* Notice that we use '''pattern matching''' both to define <tt>bestDisk</tt> on a case-by-case basis and to "peer into" ('''de-construct''') <tt>DirPack</tt> in the <tt>let (DirPack s ds)=precomp!!(limit - dir_size d)</tt> line<br />
* Notice how we use function composition to compose a complex condition for filtering the list of dirs<br />
<br />
---- <br />
<br />
Before going any further, let's make a small cosmetic change to our code. Currently our solution uses 'Int' to store directory sizes. In Haskell, 'Int' is a platform-dependent integer, which imposes certain limits on values of this type. Trying to compute values of type 'Int' that exceed those limits results in overflow errors. The standard Haskell libraries have the special <hask>Bounded</hask> typeclass, which allows defining and examining those limits:<br />
<br />
Prelude> :i Bounded <br />
class Bounded a where<br />
minBound :: a<br />
maxBound :: a<br />
-- skip --<br />
instance Bounded Int -- Imported from GHC.Enum<br />
<br />
We see that 'Int' is indeed bounded. Let's examine its bounds:<br />
<br />
Prelude> minBound :: Int <br />
-2147483648<br />
Prelude> maxBound :: Int<br />
2147483647<br />
Prelude> <br />
<br />
For those who know C, let's say that in this case 'Int' is a "32-bit signed integer", which means that we will run into errors if we try to operate on directories or directory packs larger than 2 Gb.<br />
<br />
Fortunately for us, Haskell has arbitrary-precision integers (limited only by the amount of available memory). The appropriate type is called 'Integer':<br />
<br />
Prelude> (2^50) :: Int<br />
0 -- overflow<br />
Prelude> (2^50) :: Integer<br />
1125899906842624 -- no overflow<br />
Prelude><br />
<br />
Let's change the definitions of 'Dir' and 'DirPack' to allow for larger directory sizes:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-2.hs'<br />
data Dir = Dir {dir_size::Integer, dir_name::String} deriving (Eq,Show)<br />
data DirPack = DirPack {pack_size::Integer, dirs::[Dir]} deriving Show<br />
</haskell><br />
<br />
Try to compile the code or load it into ghci. You will get the following errors:<br />
<br />
cd-fit-4-2.hs:73:79:<br />
Couldn't match `Int' against `Integer'<br />
Expected type: Int<br />
Inferred type: Integer<br />
In the expression: limit - (dir_size d)<br />
In the second argument of `(!!)', namely `(limit - (dir_size d))'<br />
<br />
cd-fit-4-2.hs:89:47:<br />
Couldn't match `Int' against `Integer'<br />
Expected type: Int<br />
Inferred type: Integer<br />
In the second argument of `(!!)', namely `media_size'<br />
In the definition of `dynamic_pack':<br />
dynamic_pack dirs = (precomputeDisksFor dirs) !! media_size<br />
<br />
It seems that Haskell has some trouble using 'Integer' with '(!!)'. Let's see why:<br />
<br />
Prelude> :t (!!)<br />
(!!) :: [a] -> Int -> a<br />
<br />
Parece que la definición de '(!!)' exige que el índice sea un 'Int', no un 'Integer'. Haskell nunca convierte un tipo en otro automáticamente - el programador debe solicitarlo de forma explícita:<br />
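Un par de ejemplos de esa conversión explícita (nota al margen, no es parte del código del tutorial): además de convertir el índice con <hask>fromIntegral</hask>, la biblioteca estándar ofrece <hask>Data.List.genericIndex</hask>, una variante de <hask>(!!)</hask> que acepta cualquier índice <hask>Integral</hask>:<br />

```haskell
import Data.List (genericIndex)

-- Conversión explícita entre tipos numéricos:
indiceInt :: Int
indiceInt = fromIntegral (3 :: Integer)

-- genericIndex es como (!!) pero acepta índices Integer directamente:
elemento :: Char
elemento = "haskell" `genericIndex` (3 :: Integer)
```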
<br />
No voy a repetir la sección "Standard Haskell Classes" de<br />
[http://haskell.org/onlinereport/basic.html the Haskell Report] ni explicar por qué las clases de tipos para los distintos tipos numéricos están organizadas como están. Solamente diré que la clase de tipos estándar <hask>Num</hask> requiere que los tipos numéricos implementen el método <hask>fromInteger</hask>:<br />
<br />
Prelude> :i Num<br />
class (Eq a, Show a) => Num a where<br />
(+) :: a -> a -> a<br />
(*) :: a -> a -> a<br />
(-) :: a -> a -> a<br />
negate :: a -> a<br />
abs :: a -> a<br />
signum :: a -> a<br />
fromInteger :: Integer -> a<br />
-- Imported from GHC.Num<br />
instance Num Float -- Imported from GHC.Float<br />
instance Num Double -- Imported from GHC.Float<br />
instance Num Integer -- Imported from GHC.Num<br />
instance Num Int -- Imported from GHC.Num<br />
<br />
Vemos que <hask>Integer</hask> es miembro de la clase de tipos <hask>Num</hask>, y por lo tanto podemos utilizar <hask>fromInteger</hask> para hacer que desaparezcan los errores de tipo:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-2.hs'<br />
-- snip<br />
case [ DirPack (dir_size d + s) (d:ds) | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
, dir_size d > 0<br />
, let (DirPack s ds)=precomp!!(fromInteger (limit - dir_size d))<br />
, d `notElem` ds<br />
] of<br />
-- snip<br />
dynamic_pack dirs = (precomputeDisksFor dirs)!!(fromInteger media_size)<br />
-- snip<br />
</haskell><br />
<br />
Los errores de tipo desaparecieron, pero el lector atento se dará cuenta de que cuando la expresión <hask>(limit - dir_size d)</hask> exceda los límites de <hask>Int</hask>, ocurrirá un sobreflujo, y no podremos acceder al elemento correcto en la lista. No te preocupes, resolveremos esto en un momento.<br />
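Como apunte (no es la solución que seguirá el tutorial): desde base 4.8, <hask>Data.Bits.toIntegralSized</hask> permite detectar ese sobreflujo, devolviendo <hask>Nothing</hask> cuando el valor no cabe en el tipo destino. Un bosquejo:<br />

```haskell
import Data.Bits (toIntegralSized)

-- Conversión segura de Integer a Int: Nothing indica que no cabe.
cabe :: Maybe Int
cabe = toIntegralSized (42 :: Integer)

noCabe :: Maybe Int
noCabe = toIntegralSized (2 ^ 100 :: Integer)
```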
<br />
Ahora, escribamos la prueba QuickCheck para esta función de la misma forma que la prueba para <tt>greedy_pack</tt>:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-2.hs'<br />
prop_dynamic_pack_is_fixpoint ds =<br />
let pack = dynamic_pack ds <br />
in pack_size pack == pack_size (dynamic_pack (dirs pack))<br />
</haskell><br />
<br />
Ahora, intentemos ejecutar (¡NO ENTRAR EN PÁNICO y guarda primero lo que estés haciendo en otras aplicaciones!):<br />
<br />
*Main> quickCheck prop_dynamic_pack_is_fixpoint<br />
<br />
<br />
Tomaste en serio mi consejo, ¿verdad? Y tenías '''Ctrl-C''' listo, ¿no? Lo más probable es que el intento de ejecutar la prueba haya hecho que el proceso <tt>ghci</tt> saturara toda tu memoria; si fuiste suficientemente rápido, pudiste interrumpirlo presionando '''Ctrl-C'''.<br />
<br />
¿Qué sucedió? ¿Quién se comió toda la memoria? ¿Cómo depuramos este problema? GHC puede medir el desempeño y decir donde se ocupó la memoria, pero no podemos hacerlo ahora - el reporte se produce después de que el programa finaliza, y el nuestro no parece querer finalizar sin antes consumir varios terabytes de memoria. Aún así, hay mucho terreno donde maniobrar.<br />
<br />
Veamos. Como llamamos a <tt>dynamic_pack</tt> y se comió toda la memoria, no lo hagamos de nuevo. En lugar de eso, veamos qué hace esa función y alterémosla un poco para modificar su comportamiento.<br />
<br />
Como ya sabemos que las listas aleatorias de "Dir"s generadas para nuestras pruebas QuickCheck son de tamaño mediano (después de todo, <tt>greedy_pack</tt> las mastica sin consumir demasiada memoria), lo más probable es que el problema no sea el tamaño de la entrada. Sin embargo, <tt>dynamic_pack_is_fixpoint</tt> está construyendo internamente una lista enorme (vía <tt>precomputeDisksFor</tt>). ¿Podría ser ese el problema?<br />
<br />
Activemos las estadísticas de medición de tiempo y memoria (":set +s" dentro de ghci) e intentemos espiar dentro de varios elementos de la lista devuelta por <tt>precomputeDisksFor</tt>:<br />
<br />
Prelude> :l cd-fit.hs<br />
Compiling Main ( cd-fit.hs, interpreted )<br />
Ok, modules loaded: Main.<br />
*Main> :set +s<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 0<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.06 secs, 1277972 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 10<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.00 secs, 0 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 100<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.01 secs, 1519064 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 1000<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.03 secs, 1081808 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 10000<br />
DirPack {pack_size = 0, dirs = []}<br />
(1.39 secs, 12714088 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 100000<br />
Interrupted.<br />
<br />
¡Aha! Parece que aquí hay un problema, porque el cálculo de 100000 no finaliza en un tiempo "razonable". Y pensar que hemos tratado de calcular el elemento número <tt>700*1024*1024</tt>...<br />
<br />
Modifiquemos un poco el código, para permitir alterar el tamaño del disco:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-3.hs'<br />
dynamic_pack limit dirs = (precomputeDisksFor dirs)!!(fromInteger limit)<br />
<br />
prop_dynamic_pack_is_fixpoint ds =<br />
let pack = dynamic_pack media_size ds <br />
in pack_size pack == pack_size (dynamic_pack media_size (dirs pack))<br />
<br />
prop_dynamic_pack_small_disk ds =<br />
let pack = dynamic_pack 50000 ds<br />
in pack_size pack == pack_size (dynamic_pack 50000 (dirs pack))<br />
<br />
-- rename "old" main to "moin"<br />
main = quickCheck prop_dynamic_pack_small_disk<br />
</haskell><br />
<br />
Compila una versión con soporte para medir el desempeño con <tt>ghc -O --make -prof -auto-all -o cd-fit cd-fit.hs</tt> y ejecútala así:<br />
<br />
$ ./cd-fit +RTS -p<br />
OK, passed 100 tests.<br />
<br />
Primero, notemos que nuestro código satisface al menos una propiedad simple. Bien. Ahora examinemos el reporte de desempeño. Mira el archivo "cd-fit.prof", que ha sido generado en el directorio actual.<br />
<br />
Seguramente vas a ver algo parecido a esto:<br />
<br />
<br />
cd-fit +RTS -p -RTS<br />
<br />
total time = 2.18 secs (109 ticks @ 20 ms)<br />
total alloc = 721,433,008 bytes (excludes profiling overheads)<br />
<br />
COST CENTRE MODULE %time %alloc<br />
<br />
precomputeDisksFor Main 88.1 99.8<br />
dynamic_pack Main 11.0 0.0<br />
individual inherited<br />
COST CENTRE MODULE no. entries %time %alloc %time %alloc<br />
<br />
MAIN MAIN 1 0 0.0 0.0 100.0 100.0<br />
CAF Main 174 11 0.9 0.2 100.0 100.0<br />
prop_dynamic_pack_small_disk Main 181 100 0.0 0.0 99.1 99.8<br />
dynamic_pack Main 182 200 11.0 0.0 99.1 99.8<br />
precomputeDisksFor Main 183 200 88.1 99.8 88.1 99.8<br />
main Main 180 1 0.0 0.0 0.0 0.0<br />
<br />
<br />
Examina la columna "individual %alloc". Como lo pensamos, toda la memoria ha sido ubicada dentro de <tt>precomputeDisksFor</tt>. Sin embargo, la cantidad de memoria ubicada (más de 700 Mb, de acuerdo a la línea "total alloc") parece ser demasiado para resolver nuestro problema simple. Investiguemos más profundo para averiguar en donde estamos desperdiciando.<br />
<br />
Examinemos el consumo de memoria más de cerca empleando "heap profiles". Ejecuta <tt>./cd-fit +RTS -hb</tt>. Eso produce "perfiles de memoria biográficos", que nos dicen cómo fueron utilizadas las distintas partes de la memoria durante la ejecución del programa. El perfil queda almacenado en "cd-fit.hp". Es casi imposible de leer e interpretar tal como está; emplearemos "hp2ps cd-fit.hp" para producir una imagen en PostScript que vale más que mil palabras. Visualízala con "gv", "ghostview" o el "Adobe Acrobat" completo (no el "Reader"). (Esta y las siguientes imágenes '''no''' están incluidas aquí)<br />
<br />
Nota que la mayor parte de la gráfica está ocupada por la región marcada "VOID" (vacío). Eso significa que la memoria reservada nunca fue usada. Nota que '''no''' hay áreas marcadas como "USE", "LAG" o "DRAG". Parece que nuestro programa no usa '''casi nada''' de la memoria que ha reservado. ¡Un momento! ¿Cómo es posible? Tiene que estar usando algo cuando empaca en los discos imaginarios de 50000 bytes esos directorios generados al azar que miden de 10 a 1400 Mb... Oops. Es una enorme diferencia de tamaños. Debimos habernos dado cuenta antes, cuando estábamos midiendo <tt>precomputeDisksFor</tt>. Regresa y observa cómo todas las ejecuciones regresan exactamente el mismo resultado: el conjunto vacío de directorios.<br />
<br />
Nuestros directorios al azar son demasiado grandes, pero de todas formas el código consume tiempo y memoria intentando "empacarlos". Obviamente, <tt>precomputeDisksFor</tt> (que es responsable del 90% del consumo de tiempo y memoria) tiene algún error.<br />
<br />
Miremos más de cerca qué consume tanta memoria. Ejecuta <tt>./cd-fit +RTS -h -hbvoid</tt> y genera el PostScript para este perfil de memoria. Esto nos dará un informe detallado de toda la memoria cuya "biografía" muestra que ha sido "VOID" (no utilizada). Mi imagen (y me imagino que la tuya también) muestra que la memoria VOID consiste en pedazos etiquetados "precomputeDisksFor/pre...". Podemos asumir que la segunda palabra debe ser "precomp" (¿quieres saber por qué? Mira el código y trata de encontrar funciones con nombres que empiecen con "pre" que sean llamadas desde dentro de <tt>precomputeDisksFor</tt>).<br />
<br />
Esto significa que la memoria ha sido ocupada por la lista generada dentro de "precomp". Los rumores dicen que las fugas de memoria en Haskell se producen por falta de flojera o por demasiada flojera. Parece que aquí tenemos muy poca flojera: estamos evaluando más elementos de la lista de los que realmente necesitamos y eso impide que sean liberados por el recolector de basura.<br />
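Para ilustrar el punto con un ejemplo mínimo (independiente del código del tutorial): al pedir el elemento n-ésimo de una lista perezosa, <hask>(!!)</hask> fuerza la construcción de todas las celdas anteriores, y esas celdas quedan retenidas mientras conservemos una referencia a la cabeza de la lista:<br />

```haskell
-- La lista es infinita, pero solo se evalúa hasta el índice pedido.
cuadrados :: [Integer]
cuadrados = map (\n -> n * n) [0 ..]

quinto :: Integer
quinto = cuadrados !! 5   -- fuerza las celdas 0..5 de la lista
```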
<br />
Nota cómo buscamos elementos de "precomp" en esta porción de código:<br />
<br />
<haskell><br />
case [ DirPack (dir_size d + s) (d:ds) | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
, dir_size d > 0<br />
, let (DirPack s ds)=precomp!!(fromInteger (limit - dir_size d))<br />
, d `notElem` ds<br />
</haskell><br />
<br />
<br />
Está claro que la lista completa generada por "precomp" debe ser mantenida en memoria para hacer esas búsquedas, dado que no podemos estar seguros de si algún elemento ya no va a ser necesario y puede ser retirado de la memoria.<br />
<br />
Escribamos el código de nuevo para eliminar la lista:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-4.hs'<br />
-- Sea `bestDisk x' el disco "más compactamente empacado" de <br />
-- tamaño total no mayor a `x'.<br />
-- ¿Cómo calcular `bestDisk'? Optemos por una definición recursiva:<br />
-- Caso base de la recursión: el disco mejor empacado para tamaño 0 <br />
-- es vacío y el disco mejor empacado para una lista vacía de directorios <br />
-- también es el vacío<br />
bestDisk 0 _ = DirPack 0 []<br />
bestDisk _ [] = DirPack 0 []<br />
-- Paso recursivo: para el tamaño `limit' mayor que cero, el disco mejor<br />
-- empacado se calcula de la manera siguiente:<br />
<br />
bestDisk limit dirs =<br />
-- Toma todos los directorios no vacíos que quepan por sí mismos en ese disco,<br />
-- uno por uno. Sea el tamaño de un directorio d en particular `dir_size d'. <br />
-- Agreguémoslo al disco mejor empacado de tamaño <= (limit - dir_size d),<br />
-- produciendo un disco de tamaño <= limit. Hagamos esto para todos los<br />
--directorios "candidato" que no están aún en el disco:<br />
case [ DirPack (dir_size d + s) (d:ds) | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
, dir_size d > 0<br />
, let (DirPack s ds)= bestDisk (limit - dir_size d) dirs <br />
, d `notElem` ds<br />
] of<br />
-- O no podemos agregar ningún directorio (probablemente porque todos<br />
-- son muy grandes); bueno, reportemos que ese disco se debe quedar vacío:<br />
[] -> DirPack 0 []<br />
-- O creamos otros empacamientos diferentes, y seleccionamos el mejor de todos:<br />
packs -> maximumBy cmpSize packs<br />
<br />
cmpSize a b = compare (pack_size a) (pack_size b)<br />
<br />
dynamic_pack limit dirs = bestDisk limit dirs<br />
</haskell><br />
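Por cierto, <hask>cmpSize</hask> se puede escribir de forma más idiomática con <hask>Data.Ord.comparing</hask>, que construye la comparación a partir de una función de proyección. Un bosquejo autocontenido (usamos pares como sustituto hipotético de <hask>DirPack</hask>):<br />

```haskell
import Data.List (maximumBy)
import Data.Ord (comparing)

-- Equivalente a: cmpSize a b = compare (fst a) (fst b)
mejor :: (Int, String)
mejor = maximumBy (comparing fst) [(3, "a"), (7, "b"), (5, "c")]
```

En el tutorial, <hask>maximumBy cmpSize packs</hask> quedaría como <hask>maximumBy (comparing pack_size) packs</hask>.<br />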
<br />
Compila la versión con evaluación de desempeño de este código y obtén el perfil de ejecución general (con "+RTS -p"). Vas a conseguir algo parecido a esto:<br />
<br />
cd-fit +RTS -p -RTS<br />
<br />
total time = 0.00 secs (0 ticks @ 20 ms)<br />
total alloc = 1,129,520 bytes (excludes profiling overheads)<br />
<br />
COST CENTRE MODULE %time %alloc<br />
<br />
CAF GHC.Float 0.0 4.4<br />
main Main 0.0 93.9<br />
<br />
individual inherited<br />
COST CENTRE MODULE no. entries %time %alloc %time %alloc<br />
MAIN MAIN 1 0 0.0 0.0 0.0 100.0<br />
main Main 180 1 0.0 93.9 0.0 94.2<br />
prop_dynamic_pack_small_disk Main 181 100 0.0 0.0 0.0 0.3<br />
dynamic_pack Main 182 200 0.0 0.2 0.0 0.3<br />
bestDisk Main 183 200 0.0 0.1 0.0 0.1<br />
<br />
Obtuvimos una gran mejora: ¡el consumo de memoria se reduce por un factor de 700! Ya podemos probar el código en el problema real - modifica el código para ejecutar la prueba para empacar el disco de tamaño completo:<br />
<br />
<haskell><br />
main = quickCheck prop_dynamic_pack_is_fixpoint<br />
</haskell><br />
<br />
Compila con evaluación de desempeño y ejecuta (con "+RTS -p"). Si no tienes suerte y se produce al azar un conjunto de pruebas considerablemente grande, tendrás que esperar. Y esperar aún más. Y más.<br />
<br />
Ve a preparar té. Tómate el té. Lee algo de Tolstoi (¿tienes "La Guerra y La Paz" a la mano?). Lo más probable es que para cuando termines con Tolstoi el programa siga corriendo (mejor créeme, no hagas la prueba).<br />
<br />
Si tienes suerte, tu programa finalizará suficientemente rápido y te producirá un perfil. De acuerdo con un perfil, el programa pasa el 99% del tiempo dentro de <tt>bestDisk</tt>. ¿Podemos mejorar de alguna forma el desempeño de <tt>bestDisk</tt>?<br />
<br />
Nota que <tt>bestDisk</tt> realiza varios cálculos simples para los que se debe llamar a sí mismo. Sin embargo, lo hace de forma ineficiente - cada vez le pasamos a <tt>bestDisk</tt> exactamente el mismo conjunto de directorios, aún cuando ya hemos "empacado" algunos. Arreglemos eso:<br />
<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-5.hs'<br />
case [ DirPack (dir_size d + s) (d:ds) | let small_enough = filter ( (inRange (0,limit)).dir_size ) dirs<br />
, d <- small_enough<br />
, dir_size d > 0<br />
, let (DirPack s ds)= bestDisk (limit - dir_size d) (delete d small_enough)<br />
] of<br />
</haskell><br />
<br />
Recompila y ejecuta de nuevo. Los tiempos pueden ser prolongados, pero soportables, y el número de veces que llama a <tt>bestDisk</tt> (de acuerdo al perfil) debe disminuir significativamente.<br />
<br />
Finalmente, comparemos ambos algoritmos de empacado. Intuitivamente sentimos que el algoritmo voraz debe producir peores resultados, ¿o no? Hagamos pruebas para verificarlo:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-5.hs'<br />
prop_greedy_pack_is_no_better_than_dynamic_pack ds =<br />
pack_size (greedy_pack ds) <= pack_size (dynamic_pack media_size ds)<br />
</haskell><br />
<br />
Ejecuta <tt>quickCheck</tt> con esta prueba varias veces para hacer la comparación. Yo siento que con esto concluyen nuestros ejercicios empacando la mochila.<br />
<br />
El lector con sed de aventura puede ir más lejos implementando "escalamiento" para <tt>dynamic_pack</tt>, que es cuando se dividen los directorios y los discos por tamaño y se comienza empacando los más pequeños (lo que promete que corre más rápido).<br />
<br />
== Capítulo 5: (Ab)usando mónadas y destruyendo constructores por negocio y diversión ==<br />
<br />
Ya hemos mencionado las mónadas varias veces. Están descritas en numerosos artículos y tutoriales (ver el Capítulo 400). Es difícil leer una lista de correos de Haskell y no cruzarse con la palabra "monad" una docena de veces.<br />
<br />
Como ya hemos hecho avances con Haskell, es momento de revisitar las mónadas una vez más. Dejaré que otras fuentes te enseñen la teoría detrás de las mónadas, la utilidad del concepto, etc; en lugar de eso, me enfocaré en mostrar ejemplos.<br />
<br />
Tomemos una parte de un programa de mundo real que involucra procesamiento XML. Trabajaremos con atributos de etiqueta XML, que son esencialmente valores con nombre:<br />
<haskell><br />
-- Taken from 'chapter5-1.hs'<br />
type Attribute = (Name, AttValue)<br />
</haskell><br />
<br />
'Name' es una cadena, y AttValue puede ser una cadena '''o''' referencias (también cadenas) a otros atributos que guardan el valor real (esto no es algo válido en XML, pero, para el ejemplo, lo haremos así). Decir "o" sugiere que usemos el tipo de dato 'Either' ("uno de"):<br />
<haskell><br />
type AttValue = Either Value [Reference]<br />
type Name = String<br />
type Value = String<br />
type Reference = String<br />
<br />
-- Lista de atributos simples muestra:<br />
simple_attrs = [ ( "xml:lang", Left "en" )<br />
, ( "xmlns", Left "jabber:client" )<br />
, ( "xmlns:stream", Left "http://etherx.jabber.org/streams" ) ]<br />
<br />
-- Lista de atributos muestra con referencias:<br />
complex_attrs = [ ( "xml:lang", Right ["lang"] )<br />
, ( "lang", Left "en" )<br />
, ( "xmlns", Right ["ns","subns"] )<br />
, ( "ns", Left "jabber" )<br />
, ( "subns", Left "client" )<br />
, ( "xmlns:stream", Left "http://etherx.jabber.org/streams" ) ]<br />
</haskell><br />
<br />
'''Nuestro objetivo es:''' escribir una función que busque el valor de un atributo por nombre en una lista dada de atributos. Cuando el atributo contenga referencias, las resolvemos (buscando el atributo referenciado en la misma lista) y concatenamos sus valores, separados por dos puntos (':'). Entonces, la búsqueda del atributo "xmlns" en ambos conjuntos muestra debe regresar el mismo valor.<br />
<br />
Siguiendo el ejemplo de <hask>Data.List.lookup</hask> de la biblioteca estándar, llamaremos a nuestra función <hask>lookupAttr</hask> que regresará <hask>Maybe Value</hask>, permitiendo manejar errores en la búsqueda:<br />
<br />
<haskell><br />
-- Taken from 'chapter5-1.hs'<br />
lookupAttr :: Name -> [Attribute] -> Maybe Value<br />
-- Como no tenemos código para 'lookupAttr', pero queremos<br />
-- que compile, usamos la función 'undefined' para<br />
-- indicar un cuerpo de función que siempre falla al ser ejecutado.<br />
lookupAttr = undefined<br />
</haskell><br />
<br />
Intentemos escribir <hask>lookupAttr</hask> usando <hask>lookup</hask> de forma directa:<br />
<br />
<haskell><br />
-- Taken from 'chapter5-1.hs'<br />
import Data.List<br />
<br />
lookupAttr :: Name -> [Attribute] -> Maybe Value<br />
lookupAttr nm attrs = <br />
-- Primero, buscamos 'Maybe AttValue' por nombre y<br />
-- vemos si hemos tenido éxito:<br />
case (lookup nm attrs) of<br />
-- Simplemente propaga el error.<br />
Nothing -> Nothing <br />
-- Si el nombre existe, ver si es valor o referencia:<br />
Just attv -> case attv of<br />
-- Es un valor; regrésalo.<br />
Left val -> Just val<br />
-- Es una lista de referencias :(<br />
-- Tenemos que seguirlas, y tener cuidado con<br />
-- los posibles errores.<br />
-- Primero, hacemos búsqueda en todas las referencias...<br />
Right refs -> let vals = [ lookupAttr ref attrs | ref <- refs ]<br />
-- ...luego, excluimos los errores<br />
wo_failures = filter (/=Nothing) vals<br />
-- ...buscamos forma de sacar los datos del contenedor 'Just'<br />
stripJust (Just v) = v<br />
-- ...la usamos para extraer los resultados como cadenas<br />
strings = map stripJust wo_failures<br />
in<br />
-- ...finalmente los combinamos en una sola cadena.<br />
-- Si todas las búsquedas fallaron, debemos propagar un error.<br />
case null strings of<br />
True -> Nothing<br />
False -> Just (concat (intersperse ":" strings))<br />
</haskell><br />
<br />
Probando:<br />
<br />
*Main> lookupAttr "xmlns" complex_attrs<br />
Just "jabber:client"<br />
*Main> lookupAttr "xmlns" simple_attrs<br />
Just "jabber:client"<br />
*Main><br />
<br />
Funciona, pero... parece extraño que haga falta tanto código para hacer algo tan sencillo. Si examinas el código de cerca, verás que el exceso de código es causado por:<br />
<br />
* el hecho de que verificamos si un error ocurrió después de cada paso<br />
<br />
* sacar Strings de los constructores de datos <hask>Maybe</hask> y <hask>Either</hask> para volverlas a meter.<br />
<br />
En este punto los programadores Java/C++ dirían que, como estamos pasando los errores hacia arriba, todos esos casos se pueden reemplazar con un bloque "try ... catch ...", y tendrían razón. ¿Significa eso que los programadores Haskell están limitados a usar "case", que lleva más de 10 años siendo obsoleto?<br />
<br />
¡Mónadas al rescate! Como puedes leer en otras partes (ve el Capítulo 400), las mónadas sirven para construir cálculos a partir de otros cálculos. Exactamente lo que necesitamos: queremos combinar varios pasos simples (búsqueda de valores, búsqueda de referencias...) en la función <hask>lookupAttr</hask> de forma que podamos tomar en cuenta los posibles errores.<br />
<br />
<br />
Comencemos con el código y lo analizaremos luego:<br />
<haskell><br />
-- Taken from 'chapter5-2.hs'<br />
import Control.Monad<br />
<br />
lookupAttr' nm attrs = do<br />
-- Buscamos 'AttValue' por nombre<br />
attv <- lookup nm attrs<br />
-- Vemos si es valor o referencia<br />
case attv of<br />
-- Es valor; lo regresamos<br />
Left val -> Just val<br />
-- Es lista de referencias<br />
-- Las buscamos, teniendo cuidado con los errores<br />
-- Buscamos todas las referencias...<br />
Right refs -> do vals <- sequence $ map (flip lookupAttr' attrs) refs<br />
-- ...como todos los errores fueron filtrados por "Magia Monádica",<br />
-- ...y todos los 'Just' fueron también retirados,<br />
-- ...sólo combinamos los valores en una sola cadena<br />
-- ...y regresamos un error si está vacía.<br />
guard (not (null vals))<br />
return (concat (intersperse ":" vals))<br />
</haskell><br />
<br />
'''Ejercicio''': compila el código, y comprueba que <hask>lookupAttr</hask> y <hask>lookupAttr'</hask> de verdad funcionan igual. Trata de hacerlo escribiendo una prueba QuickCheck, definiendo <hask>instance Arbitrary Name</hask> para que los nombres arbitrarios se tomen de los nombres disponibles en <hask>simple_attrs</hask>.<br />
<br />
Bueno, de regreso a la historia. ¿Notaste la drástica reducción en tamaño de código? Sin los comentarios, el código ocupa 7 líneas en lugar de 13 - poco más de la mitad. ¿Cómo conseguimos esto?<br />
<br />
Primero, date cuenta de que nunca verificamos si algún cálculo regresa <hask>Nothing</hask>. Aún así, trata de buscar un nombre de atributo que no exista, y <hask>lookupAttr'</hask> va a regresar <hask>Nothing</hask>. ¿Cómo puede ser? El secreto está en que el constructor de tipo <hask>Maybe</hask> es una "mónada".<br />
<br />
Empleamos la palabra clave <hask>do</hask> para indicar que el bloque de código a continuación es una secuencia de '''acciones monádicas''', donde tiene que suceder '''magia monádica''' cuando usemos '<-', 'return' o pasemos de una acción a otra.<br />
<br />
Diferentes mónadas tienen diferente '''magia'''. El código de biblioteca dice que el constructor de tipo <hask>Maybe</hask> es una mónada en la que podemos usar <hask><-</hask> para "extraer" valores del contenedor <hask>Just</hask> y usar <hask>return</hask> para volver a meterlos en forma de <hask>Just some_value</hask>. Cuando pasamos de una acción a otra en el bloque "do" ocurre una verificación: si la acción regresa <hask>Nothing</hask>, todos los cálculos de ahí en adelante serán omitidos y todo el bloque "do" regresará <hask>Nothing</hask>.<br />
<br />
Intenta hacer esto para entenderlo mejor:<br />
<br />
<haskell><br />
*Main> let foo x = do v <- x; return (v+1) in foo (Just 5)<br />
Just 6<br />
*Main> let foo x = do v <- x; return (v+1) in foo Nothing <br />
Nothing<br />
*Main> let foo x = do v <- x; return (Data.Char.ord v) in foo (Just 'a')<br />
Just 97<br />
*Main> let foo x = do v <- x; return (Data.Char.ord v) in foo Nothing <br />
Nothing<br />
*Main> <br />
</haskell><br />
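Detrás de la notación <hask>do</hask> no hay magia adicional: el bloque <hask>do v <- x; return (v+1)</hask> es azúcar sintáctica sobre el operador <hask>>>=</hask> de la mónada. Para <hask>Maybe</hask>, ambas formas son equivalentes:<br />

```haskell
-- Versión con notación do:
fooDo :: Maybe Int -> Maybe Int
fooDo x = do
  v <- x
  return (v + 1)

-- La misma función, desazucarada a mano:
fooBind :: Maybe Int -> Maybe Int
fooBind x = x >>= \v -> return (v + 1)
```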
<br />
Por ahora no te fijes en <hask>sequence</hask> y <hask>guard</hask>; más tarde veremos esa parte.<br />
<br />
Como ya retiramos una razón para el exceso de código, es momento de atacar la otra. Nota que hemos tenido que usar <hask>case</hask> para '''deconstruir''' el valor de tipo <hask>Either Value [Reference]</hask>. Seguramente no somos los primeros que tenemos que hacer esto; ese caso de uso debe de ser muy común.<br />
<br />
En efecto, hay un remedio simple para nuestro caso, y se llama <hask>either</hask>:<br />
<br />
*Main> :t either<br />
either :: (a -> c) -> (b -> c) -> Either a b -> c<br />
<br />
La declaración de tipo se ve complicada, pero aquí hay algunos ejemplos para ayudar a entenderla:<br />
<br />
*Main> :t either (+1) (length) <br />
either (+1) (length) :: Either Int [a] -> Int<br />
*Main> either (+1) (length) (Left 5)<br />
6<br />
*Main> either (+1) (length) (Right "foo")<br />
3<br />
*Main> <br />
<br />
Parece que este es exactamente el caso. Reemplacemos <hask>case</hask> con una invocación a <hask>either</hask>:<br />
<br />
<haskell><br />
-- Taken from 'chapter5-3.hs'<br />
lookupAttr'' nm attrs = do<br />
attv <- lookup nm attrs<br />
either Just (dereference attrs) attv<br />
where<br />
dereference attrs refs = do <br />
vals <- sequence $ map (flip lookupAttr'' attrs) refs<br />
guard (not (null vals))<br />
return (concat (intersperse ":" vals))<br />
</haskell><br />
<br />
Se va poniendo mejor :)<br />
<br />
Ahora, como semi-ejercicio, intenta entender el significado de "sequence", "guard" y "flip" a partir de la siguiente sesión en ghci:<br />
<br />
*Main> :t sequence<br />
sequence :: (Monad m) => [m a] -> m [a]<br />
*Main> :t [Just 'a', Just 'b', Nothing, Just 'c']<br />
[Just 'a', Just 'b', Nothing, Just 'c'] :: [Maybe Char]<br />
*Main> :t sequence [Just 'a', Just 'b', Nothing, Just 'c']<br />
sequence [Just 'a', Just 'b', Nothing, Just 'c'] :: Maybe [Char]<br />
<br />
*Main> sequence [Just 'a', Just 'b', Nothing, Just 'c']<br />
Nothing<br />
*Main> sequence [Just 'a', Just 'b', Nothing]<br />
Nothing<br />
*Main> sequence [Just 'a', Just 'b']<br />
Just "ab"<br />
<br />
*Main> :t [putStrLn "a", putStrLn "b"]<br />
[putStrLn "a", putStrLn "b"] :: [IO ()]<br />
*Main> :t sequence [putStrLn "a", putStrLn "b"]<br />
sequence [putStrLn "a", putStrLn "b"] :: IO [()]<br />
*Main> sequence [putStrLn "a", putStrLn "b"]<br />
a<br />
b<br />
<br />
*Main> :t [putStrLn "a", fail "stop here", putStrLn "b"]<br />
[putStrLn "a", fail "stop here", putStrLn "b"] :: [IO ()]<br />
*Main> :t sequence [putStrLn "a", fail "stop here", putStrLn "b"]<br />
sequence [putStrLn "a", fail "stop here", putStrLn "b"] :: IO [()]<br />
*Main> sequence [putStrLn "a", fail "stop here", putStrLn "b"]<br />
a<br />
*** Exception: user error (stop here)<br />
<br />
Nota que para la mónada <hask>Maybe</hask> sequence continúa la ejecución hasta el primer <hask>Nothing</hask>. Se puede observar el mismo comportamiento para la mónada IO. ¡Toma en cuenta que la definición de <hask>sequence</hask> no incluye ese comportamiento!<br />
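En efecto: <hask>sequence</hask> se define (esencialmente) como un plegado con <hask>>>=</hask>, y es la mónada concreta la que aporta el "corte" ante <hask>Nothing</hask>. Un bosquejo de esa definición (llamada aquí <hask>miSequence</hask> para no chocar con la del Prelude):<br />

```haskell
-- Encadena las acciones de izquierda a derecha y recoge los resultados.
miSequence :: Monad m => [m a] -> m [a]
miSequence = foldr paso (return [])
  where
    paso accion resto = do
      x  <- accion
      xs <- resto
      return (x : xs)
```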
<br />
Ahora, examinemos <hask>guard</hask>:<br />
<br />
*Main> let foo x = do v <- x; guard (v/=5); return (v+1) in map foo [Just 4, Just 5, Just 6] <br />
[Just 5,Nothing,Just 7]<br />
<br />
Como puedes ver, es solamente una forma simple de "detener" la ejecución cuando se cumple alguna condición.<br />
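La definición de <hask>guard</hask> es igual de simple: en <hask>Maybe</hask> (y en cualquier mónada con "fallo"), <hask>guard True</hask> es una acción que tiene éxito sin producir nada, y <hask>guard False</hask> corta el cálculo. El ejemplo anterior, empaquetado como función (el nombre <hask>sinCinco</hask> es ilustrativo):<br />

```haskell
import Control.Monad (guard)

-- guard b se comporta como: if b then return () else Nothing
sinCinco :: Int -> Maybe Int
sinCinco v = do
  guard (v /= 5)
  return (v + 1)
```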
<br />
<br />
Si las mónadas ya te han enganchado, te recomiendo leer "All About<br />
Monads" ahora mismo (el enlace está en el Capítulo 400).<br />
<br />
== Chapter 6: Where do you want to go tomorrow? ==<br />
<br />
As the name implies, the author is open for proposals - where should<br />
we go next? I had networking + xml/xmpp in mind, but it might be too<br />
heavy and too narrow for most of the readers.<br />
<br />
What do you think? Drop me a line.<br />
<br />
== Chapter 400: Monads up close ==<br />
<br />
Read [http://en.wikibooks.org/wiki/Haskell/Understanding_monads this wikibook chapter]. <br />
Then, read [http://www.nomaware.com/monads "All about monads"].<br />
'Nuff said :)<br />
<br />
== Chapter 500: IO up close ==<br />
<br />
Shows that:<br />
<br />
<haskell><br />
c = do a <- someAction<br />
b <- someOtherAction<br />
print (bar b)<br />
print (foo a)<br />
print "done"<br />
</haskell><br />
<br />
really is just a syntax sugar for:<br />
<br />
<haskell><br />
c = someAction >>= \a -><br />
someOtherAction >>= \b -><br />
print (bar b) >><br />
print (foo a) >><br />
print "done"<br />
</haskell><br />
<br />
and explains about ">>=" and ">>". Oh wait. This was already explained<br />
in Chapter 400 :)<br />
<br />
== Chapter 9999: Installing Haskell Compiler/Interpreter and all necessary software ==<br />
<br />
Plenty of material on this on the web and this wiki. Just go get<br />
yourself installation of [[GHC]] (6.4 or above) or [[Hugs]] (v200311 or<br />
above) and "[[darcs]]", which we will use for version control.<br />
<br />
== Chapter 10000: Thanks! ==<br />
<br />
Thanks for comments, proofreading, good advice and kind words go to:<br />
Helge, alt, dottedmag, Paul Moore, Ben Rudiak-Gould, Jim Wilkinson,<br />
Andrew Zhdanov (avalez), Martin Percossi, SpellingNazi, Davor<br />
Cubranic, Brett Giles, Stdrange, Brian Chrisman, Nathan Collins,<br />
Anastasia Gornostaeva (ermine), Remi, Ptolomy, Zimbatm,<br />
HenkJanVanTuyl, Miguel, Mforbes, Kartik Agaram.<br />
<br />
If I should have mentioned YOU and forgot - tell me so.<br />
<br />
Without you I would have stopped after Chapter 1 :)<br />
<br />
{{traduccion|titulo=Hitchhikers guide to Haskell}}<br />
[[Category:Es/Tutoriales|Guía de Haskell para autoestopistas]]</div>Imzhttps://wiki.haskell.org/index.php?title=Es/Haskell&diff=39267Es/Haskell2011-03-30T19:06:32Z<p>Imz: /* Tutoriales */ remove "garbage" from how the wikilink has been printed out.</p>
<hr />
<div>Bienvenido a la página en español acerca del lenguaje de programación Haskell.<br />
<br />
La idea es recopilar información en español acerca de este lenguaje puramente funcional, y de esa forma promover su uso entre hispano-hablantes.<br />
<br />
== Acerca de Haskell ==<br />
<br />
* [[Introducción]]<br />
* [[Librerías y Herramientas]]<br />
* [[Haskell en 5 pasos]]<br />
* [[Es/Implementaciones | Implementaciones]]<br />
<br />
----<br />
<br />
==Boletín Semanal de Haskell (HWN)==<br />
<br />
Última entrega: [http://www.haskell.org/haskellwiki/HWN/es/2006-10-31 2006-10-31]<br />
<br />
Visita el [http://www.haskell.org/haskellwiki/HWN/es Boletín Semanal Haskell]<br />
para traducciones de la 'Haskell Weekly Newsletter'. Una publicación semanal ofreciendo todos los desarrollos y acontecimientos que se llevan a cabo en la comunidad de Haskell.<br />
<br />
----<br />
==Libros==<br />
<br />
Blas C. Ruiz, Francisco Gutiérrez, Pablo Guerrero y José E. Gallardo. [http://www.lcc.uma.es/~pepeg/pfHaskell/index.html Razonando con Haskell]: Thompson 2004. ISBN 84-9732-277-0.<br />
<blockquote><br />
<b>Descripción</b><br />
El objetivo principal de este libro es el de servir como libro de texto de las asignaturas de Programación Declarativa correspondientes a los estudios de Informática o Ciencias de la Computación, y otras ciencias en general ( Matemáticas, Física, etc.).<br />
<br />
El texto es fruto de una larga experiencia docente de los autores dentro de las distintas asignaturas que desarrollan la Programación Funcional en distintas titulaciones de la Universidad de Málaga. Aún así, su lectura no queda condicionada a un conocimiento previo sobre lenguajes de programación (de computadores), ni sobre Informática. De esta forma, el libro puede ser utilizado por todo aquel que desee tener un conocimiento amplio sobre la Programación Funcional.<br />
</blockquote><br />
<br />
----<br />
<br />
==Tutoriales==<br />
<br />
;[http://www.lcc.uma.es/~blas/pfHaskell/gentle/ Una introducción agradable a Haskell]: Traducción en español del famoso tutorial en inglés "A Gentle Introduction to Haskell".<br />
<br />
;[http://www.cs.uu.nl/people/jeroen/courses/fp-sp.pdf Programación Funcional]: Tutorial escrito por Jeroen Fokker del Departamento de Informática de la Universidad de Utrecht. Este tutorial cubre de forma muy didáctica los aspectos básicos para empezar a entender Haskell.<br />
<br />
;[[Es/Guía de Haskell para autoestopistas|Guía de Haskell para autoestopistas]]: Traducción (en progreso) en español de "[[Hitchhikers guide to Haskell]]".<br />
<br />
;[http://www.muitovar.com/glade/es-index.html Tutorial de Glade]:<br />
Este tutorial intenta proporcionar una guía paso a paso para los desarrolladores de Haskell que quieren escribir aplicaciones GTK+ usando Glade. Asumimos que estás usando Linux, aunque tanto el conjunto de herramientas GTK+ como el diseñador de interfaces Glade y Gtk2Hs están disponibles en otras plataformas. Esta página tutorial es una adaptación para Haskell y Gtk2Hs de un tutorial original para C y la API C de GTK+.<br />
<br />
;[http://darcs.haskell.org/gtk2hs/docs/tutorial/Tutorial_Port/es-index.xhtml Tutorial básico de Gtk2Hs]:<br />
Capítulos: Introducción, Empezando, Empaquetando Widgets, Programa de demostración de empaquetado y Empaquetado usando tablas, El Widget botón, Ajustes, Escala y Rango, Etiquetas, Flechas y Tooltips,<br />
Diálogos, elementos disponibles y barras de progreso, Entradas de texto y barras de estado, Botones de Spin, Calendario, Selección de fichero, Selección de Fuente y Color, Bloc de notas, Ventanas con desplazamiento (scroll), Cajas de evento y cajas de botones, El contenedor de layout, Ventanas con paneles y marcos de ratio constante, Menús y Barras de herramientas, Menús Popup, acciones de radio y acciones Toggle. Apéndice: Dibujando con Cairo: Empezando...<br />
<br />
----<br />
<br />
== Principiantes Haskell ==<br />
;[http://www2.ucsp.edu.pe/%7Eapaz/apuntes/ Todo Haskell en español]:<br />
Texto guía para principiantes, con pequeñas definiciones y ejemplos de programación funcional en Haskell, en un sentido académico muy sencillo de entender y comprender.<br />
<br />
----<br />
<br />
==Canal de IRC==<br />
<br />
* #haskell.es : Canal oficial de la comunidad hispano-hablante de Haskell en la red irc.freenode.net.<br />
<br />
----<br />
<br />
== Paquetes recientes ==<br />
<br />
{{Main/News}}<br />
<br />
----<br />
<br />
== TODO ==<br />
<br />
* Traducir las páginas de [[Special:Popularpages]] más importantes.<br />
** [[:Category:Es/Traducción en progreso| Traducciones en progreso]]<br />
* Agregar cualquier información referente a Haskell en Español.<br />
* Agregar proyectos desarrollados por hispano-hablantes.<br />
* Corregir errores que puedan presentar las traducciones.<br />
* Colaborar con las traducciones de la HWN.<br />
<br />
[[Category:Community]]</div>Imzhttps://wiki.haskell.org/index.php?title=Es/Haskell&diff=39266Es/Haskell2011-03-30T19:04:28Z<p>Imz: /* Tutoriales */ Linked to the translation of "Hitchhikers guide to Haskell", not yet finished. But why should the work get lost without links to it?..</p>
<hr />
<div>
[[Category:Community]]</div>Imzhttps://wiki.haskell.org/index.php?title=Hitchhikers_guide_to_Haskell&diff=39265Hitchhikers guide to Haskell2011-03-30T18:57:19Z<p>Imz: /* Chapter 10000: Thanks! */ Linked the Spanish translation, not fnished yet.</p>
<hr />
<div>== Preface: DON'T PANIC! ==<br />
[[Category:Tutorials]]<br />
Recent experiences from a few of my fellow C++/Java programmers<br />
indicate that they read various Haskell tutorials with "exponential<br />
speedup" (think about how TCP/IP session starts up). They start slow<br />
and cautious, but when they see that the first 3-5 pages do not<br />
contain "anything interesting" in terms of code and examples, they<br />
begin skipping paragraphs, then chapters, then whole pages, only to<br />
slow down - often to a complete halt - somewhere on page 50, finding<br />
themselves in the thick of concepts like "type classes", "type<br />
constructors", "monadic IO", at which point they usually panic, think<br />
of a perfectly rational excuse not to read further anymore, and<br />
happily forget this sad and scary encounter with Haskell (as human<br />
beings usually tend to forget sad and scary things).<br />
<br />
This text intends to introduce the reader to the practical aspects of Haskell<br />
from the very beginning (plans for the first chapters include: I/O, darcs,<br />
Parsec, QuickCheck, profiling and debugging, to mention a few). The reader<br />
is expected to know (where to find) at least the basics of Haskell: how to run<br />
"hugs" or "ghci", '''that layout is 2-dimensional''', etc. Other than that, we do<br />
not plan to take radical leaps, and will go one step at a time in order not to<br />
lose the reader along the way. So DON'T PANIC, take your towel with you and<br />
read along.<br />
<br />
'''In case you've skipped over the previous paragraph''', I would like<br />
to stress once again that Haskell is sensitive to indentation and<br />
spacing, so pay attention to that during cut-and-paste or manual<br />
alignment of code in a text editor with a proportional font.<br />
<br />
Oh, almost forgot: the author is very interested in ANY feedback. Drop him a line<br />
or a word (see [[User:Adept|Adept]] for contact info) or submit<br />
patches to the tutorial via darcs (<br />
[http://adept.linux.kiev.ua:8080/repos/hhgtth/ repository is here]) or directly to this<br />
Wiki. <br />
<br />
== Chapter 1: Ubiquitous "Hello world!" and other ways to do IO in Haskell ==<br />
<br />
Each chapter will be dedicated to one small real-life task which we will<br />
complete from the ground up.<br />
<br />
So here is the task for this chapter: in order to free up space on<br />
your hard drive for all the Haskell code you are going to write in the<br />
nearest future, you are going to archive some of the old and dusty<br />
information on CDs and DVDs. While CD (or DVD) burning itself is easy<br />
these days, it usually takes some (or quite a lot of) time to decide<br />
how to put several GB of digital photos on CD-Rs, when directories<br />
with images range from 10 to 300 MB in size, and you don't want to<br />
burn half-full (or half-empty) CD-Rs.<br />
<br />
So, the task is to write a program which will help us put a given<br />
collection of directories on the minimum possible amount of media,<br />
while packing the media as tightly as possible. Let's name this program<br />
"cd-fit".<br />
<br />
Oh. Wait. Let's do the usual "hello world" thing, before we forget about it,<br />
and then move on to more interesting things:<br />
<br />
<haskell><br />
-- Taken from 'hello.hs'<br />
-- From now on, a comment at the beginning of the code snippet<br />
-- will specify the file which contain the full program from<br />
-- which the snippet is taken. You can get the code from the darcs<br />
-- repository "http://adept.linux.kiev.ua:8080/repos/hhgtth" by issuing<br />
-- command "darcs get http://adept.linux.kiev.ua:8080/repos/hhgtth"<br />
module Main where<br />
main = putStrLn "Hello world!"<br />
</haskell><br />
<br />
Run it:<br />
<br />
$ runhaskell ./hello.hs<br />
Hello world!<br />
<br />
OK, we've done it. Move along now, nothing interesting here :)<br />
<br />
Any serious development must be done with the help of a version control<br />
system, and we will not make an exception. We will use the modern<br />
distributed version control system "darcs". "Modern" means that it is<br />
written in Haskell, "distributed" means that each working copy is<br />
a repository in itself.<br />
<br />
First, let's create an empty directory for all our code, and invoke<br />
"darcs init" there, which will create subdirectory "_darcs" to store<br />
all version-control-related stuff there.<br />
<br />
Fire up your favorite editor and create a new file called "cd-fit.hs"<br />
in our working directory. Now let's think for a moment about how our<br />
program will operate and express it in pseudocode:<br />
<br />
<haskell><br />
main = Read list of directories and their sizes.<br />
Decide how to fit them on CD-Rs.<br />
Print solution.<br />
</haskell><br />
<br />
Sounds reasonable? I thought so.<br />
<br />
Let's simplify our life a little and assume for now that we will<br />
compute directory sizes somewhere outside our program (for example,<br />
with "du -sb *") and read this information from stdin.<br />
Now let me convert all this to Haskell:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-1-1.hs'<br />
module Main where<br />
<br />
main = do input <- getContents<br />
putStrLn ("DEBUG: got input " ++ input)<br />
-- compute solution and print it<br />
</haskell><br />
<br />
Not really working, but pretty close to plain English, eh? Let's stop<br />
for a moment and look more closely at what's written here line by line.<br />
<br />
Let's begin from the top:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-1-1.hs'<br />
input <- getContents<br />
</haskell><br />
<br />
This is an example of the Haskell syntax for doing IO (namely, input). This<br />
line is an instruction to read all the information available from stdin,<br />
return it as a single string, and bind it to the symbol "input", so that we can<br />
process this string any way we want.<br />
<br />
How did I know that? Did I memorize all the functions by heart? Of course not!<br />
Each function has a type, which, along with the function's name, usually tells a<br />
lot about what the function will do.<br />
<br />
Let's fire up an interactive Haskell environment and examine this function<br />
up close:<br />
<br />
$ ghci<br />
___ ___ _<br />
/ _ \ /\ /\/ __(_)<br />
/ /_\// /_/ / / | | GHC Interactive, version 6.4.1, for Haskell 98.<br />
/ /_\\/ __ / /___| | http://www.haskell.org/ghc/<br />
\____/\/ /_/\____/|_| Type :? for help.<br />
<br />
Loading package base-1.0 ... linking ... done.<br />
Prelude> :type getContents<br />
getContents :: IO String<br />
Prelude> <br />
<br />
We see that "getContents" is a function without arguments that will return<br />
an "IO String". The prefix "IO" means that this is an IO action. It will return<br />
a String when evaluated. The action will be evaluated as soon as we use "<-" to<br />
bind its result to some symbol.<br />
<br />
Note that "<-" is not a fancy way to assign a value to a variable. It is a way to<br />
evaluate (execute) IO actions, in other words - to actually do some I/O and<br />
return its result (if any). <br />
<br />
We can choose not to evaluate the action obtained from "getContents", but rather carry it around a bit and evaluate later:<br />
<br />
<haskell><br />
let x = getContents<br />
-- 300 lines of code here<br />
input <- x<br />
</haskell><br />
<br />
So, as you see, IO actions can act like ordinary values. Suppose that we<br />
have built a list of IO actions and have found a way to execute them one by one.<br />
This would be a way to simulate imperative programming with its notion of<br />
"order of execution".<br />
<br />
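Here is a minimal runnable sketch of that idea (our own illustration, not part of cd-fit): a list of IO actions is built as an ordinary value, and nothing at all happens until we execute them one by one with "sequence_" from the Prelude:<br />
<br />
```haskell
-- A list of IO actions, carried around as ordinary values.
-- Nothing is printed at this point: building the list does no IO.
actions :: [IO ()]
actions = [putStrLn "first", putStrLn "second", putStrLn "third"]

main :: IO ()
main = sequence_ actions  -- execute the actions one by one, in list order
```
<br />
Running the program prints the three lines in list order - exactly the imperative "order of execution" simulated with a list of actions.<br />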
Haskell allows you to do better than that. <br />
<br />
The standard language library (named "Prelude", by the way) provides<br />
us with lots of functions that return useful primitive IO actions. In<br />
order to combine them to produce even more complex actions, we use "do":<br />
<br />
<haskell><br />
c = do a <- someAction<br />
b <- someOtherAction<br />
print (bar b)<br />
print (foo a)<br />
putStrLn "done"<br />
</haskell><br />
<br />
Here we '''bind''' "c" to an action with the following "scenario":<br />
* '''evaluate''' action "someAction" and '''bind''' its result to "a"<br />
* then, '''evaluate''' "someOtherAction" and '''bind''' its result to "b"<br />
* then, process "b" with function "bar" and print result<br />
* then, process "a" with function "foo" and print result<br />
* then, print the word "done"<br />
<br />
When will all this actually be executed? Answer: as soon as we evaluate "c"<br />
using the "<-" (if it returns a result, as "getContents" does) or just<br />
by using it as a function name (if it does not return a result, as "print"<br />
does):<br />
<br />
<haskell><br />
process = do putStrLn "Will do some processing"<br />
c<br />
putStrLn "Done"<br />
</haskell><br />
<br />
Notice that we took a bunch of functions ("someAction", "someOtherAction",<br />
"print", "putStrLn") and, using "do", created from them a new function, which we<br />
bound to the symbol "c". Now we can use "c" as a building block to produce an even<br />
more complex function, "process", and we can carry this on and on.<br />
Eventually, some of these functions will be mentioned in the code of the function<br />
"main", which is bound to the ultimate, topmost IO action of any Haskell program.<br />
<br />
When will the "main" be executed/evaluated/forced? As soon as we run the<br />
program. Read this twice and try to comprehend: <br />
<br />
''The execution of a Haskell program is an evaluation of the symbol "main" to<br />
which we have bound an IO action. Via evaluation we obtain the result of that<br />
action''. <br />
<br />
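To see all of this run, here is a complete sketch. Since "someAction", "someOtherAction", "foo" and "bar" were left undefined above, we invent trivial stand-ins for them (the stand-ins are our own assumption, not part of the tutorial's program):<br />
<br />
```haskell
-- Trivial stand-ins (our own choices) for the undefined names above:
someAction :: IO Int
someAction = return 1

someOtherAction :: IO String
someOtherAction = return "two"

foo :: Int -> Int
foo = (* 10)

bar :: String -> String
bar = reverse

-- The combined action "c", with the same "scenario" as above:
c :: IO ()
c = do a <- someAction
       b <- someOtherAction
       print (bar b)    -- prints "owt"
       print (foo a)    -- prints 10
       putStrLn "done"

-- "c" is executed only because it is reachable from "main":
main :: IO ()
main = c
```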
Readers familiar with advanced C++ or Java programming and with that arcane body of<br />
knowledge named "OOP Design Patterns" might note that "build actions from<br />
actions" and "evaluate actions to get result" is essentially a "Command<br />
pattern" and "Composition pattern" combined. Good news: in Haskell you get them<br />
for all your IO, and get them '''for free''' :)<br />
<br />
----<br />
'''Exercise:'''<br />
Consider the following code:<br />
<br />
<haskell><br />
-- Taken from 'exercise-1-1.hs'<br />
module Main where<br />
c = putStrLn "C!"<br />
<br />
combine before after =<br />
do before<br />
putStrLn "In the middle"<br />
after<br />
<br />
main = do combine c c<br />
let b = combine (putStrLn "Hello!") (putStrLn "Bye!")<br />
let d = combine (b) (combine c c)<br />
putStrLn "So long!"<br />
</haskell><br />
<br />
Notice how we carefully indent lines so that the source looks neat?<br />
Actually, Haskell code has to be aligned this way, or it will not<br />
compile. If you use tabs to indent your sources, take into<br />
account that Haskell compilers assume that a tab stop is 8 characters<br />
wide.<br />
<br />
Often people complain that it is very difficult to write Haskell<br />
because it requires them to align code. Actually, this is not true. If<br />
you align your code, the compiler will infer the beginnings and endings of<br />
syntactic blocks. However, if you don't want to indent your code, you<br />
can explicitly mark the end of each and every expression and use<br />
arbitrary layout, as in this example: <br />
<haskell><br />
-- Taken from 'exercise-1-2.hs'<br />
combine before after = <br />
do { before; <br />
putStrLn "In the middle"; <br />
after; };<br />
<br />
main = <br />
do { combine c c; let { b = combine (putStrLn "Hello!") (putStrLn "Bye!")};<br />
let {d = combine (b) (combine c c)}; <br />
putStrLn "So long!" };<br />
</haskell><br />
<br />
Back to the exercise - see how we construct code out of thin air? Try<br />
to imagine what this code will do, then run it and check yourself.<br />
<br />
Do you understand why "Hello!" and "Bye!" are not printed?<br />
----<br />
<br />
Let's examine our "main" function closer:<br />
<br />
Prelude> :load cd-fit.hs<br />
Compiling Main ( ./cd-fit.hs, interpreted )<br />
Ok, modules loaded: Main.<br />
*Main> :type main<br />
main :: IO ()<br />
*Main> <br />
<br />
We see that "main" is indeed an IO action which will return nothing<br />
when evaluated. When combining actions with "do", the type of the<br />
result will be the type of the last action, and "putStrLn something" has type<br />
"IO ()": <br />
<br />
*Main> :type putStrLn "Hello world!"<br />
putStrLn "Hello world!" :: IO ()<br />
*Main> <br />
<br />
Oh, by the way: have you noticed that we actually compiled our first<br />
Haskell program in order to examine "main"? :)<br />
<br />
Let's celebrate that by putting it under version control: execute<br />
"darcs add cd-fit.hs" and "darcs record", answer "y" to all questions,<br />
and provide the commit comment "Skeleton of cd-fit.hs".<br />
<br />
Let's try to run it:<br />
<br />
$ echo "foo" | runhaskell cd-fit.hs<br />
DEBUG: got input foo<br />
<br />
----<br />
'''Exercises''':<br />
<br />
* Try to write a program that takes your name from the stdin and greets you (keywords: getLine, putStrLn);<br />
<br />
* Try to write a program that asks for you name, reads it, greets you, asks for your favorite color, and prints it back (keywords: getLine, putStrLn).<br />
<br />
== Chapter 2: Parsing the input ==<br />
<br />
OK, now that we have a proper understanding of the powers of Haskell IO<br />
(and are awed by them, I hope), let's forget about IO and actually do<br />
some useful work. <br />
<br />
As you remember, we set forth to pack some CD-Rs as tightly as<br />
possible with data scattered in several input directories. We assume<br />
that "du -sb" will compute the sizes of input directories and output<br />
something like:<br />
<br />
65572 /home/adept/photos/raw-to-burn/dir1<br />
68268 /home/adept/photos/raw-to-burn/dir2<br />
53372 /home/adept/photos/raw-to-burn/dir3<br />
713124 /home/adept/photos/raw-to-burn/dir4<br />
437952 /home/adept/photos/raw-to-burn/dir5<br />
<br />
Our next task is to parse that input into some suitable internal<br />
representation.<br />
<br />
For that we will use powerful library of '''parsing combinators''' named<br />
"[[Parsec]]" which ships with most Haskell implementations.<br />
<br />
Much like the IO facilities we have seen in the first chapter, this<br />
library provides a set of basic parsers and means to combine them into<br />
more complex parsing constructs.<br />
<br />
Unlike other tools in this area (lex/yacc or JavaCC, to name a few),<br />
[[Parsec]] parsers do not require a separate preprocessing stage. Since in<br />
Haskell we can return a function as the result of a function and thus<br />
construct functions out of thin air, there is no need for a separate<br />
syntax for parser description. But enough advertisements, let's actually<br />
do some parsing:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-2-1.hs'<br />
import Text.ParserCombinators.Parsec<br />
<br />
-- parseInput parses output of "du -sb", which consists of many lines,<br />
-- each of which describes single directory<br />
parseInput = <br />
do dirs <- many dirAndSize<br />
eof<br />
return dirs<br />
<br />
-- Datatype Dir holds information about single directory - its size and name<br />
data Dir = Dir Int String deriving Show<br />
<br />
-- `dirAndSize` parses information about single directory, which is:<br />
-- a size in bytes (number), some spaces, then directory name, which extends till newline<br />
dirAndSize = <br />
do size <- many1 digit<br />
spaces<br />
dir_name <- anyChar `manyTill` newline<br />
return (Dir (read size) dir_name)<br />
</haskell><br />
<br />
Just add those lines to "cd-fit.hs", between the declaration of <br />
the Main module and the definition of main.<br />
<br />
Here we see quite a lot of new<br />
things, and several that we know already. <br />
First of all, note the familiar "do" construct, which, as we know, is<br />
used to combine IO actions to produce new IO actions. Here we use it<br />
to combine "parsing" actions into new "parsing" actions. Does this<br />
mean that "parsing" implies "doing IO"? Not at all. The thing is, I must<br />
admit that I lied to you - "do" is used not only to combine IO<br />
actions. "Do" is used to combine any kind of so-called ''monadic<br />
actions'' or ''monadic values'' together.<br />
<br />
Think about [[monad]] as a "[[:Category:Idioms|design pattern]]" in the functional world.<br />
[[Monad]] is a way to hide from the user (programmer) all the machinery<br />
required for complex functionality to operate.<br />
<br />
As you might have heard, Haskell has no notion of "assignment",<br />
"mutable state", or "variables", and is a "pure functional language",<br />
which means that every function called with the same input parameters<br />
will return exactly the same result. Meanwhile, "doing IO" requires<br />
hauling around file handles and their states and dealing with IO<br />
errors. "Parsing" requires tracking the position in the input and dealing<br />
with parsing errors.<br />
<br />
In both cases the Wise Men Who Wrote Libraries cared for our needs and<br />
hid all the underlying complexity from us, exposing the [http://en.wikipedia.org/wiki/Application_programming_interface API] of their<br />
libraries (IO and parsing) in the form of "monadic actions" which we<br />
are free to combine as we see fit. <br />
<br />
Think of programming with monads as doing a remodelling job with the<br />
help of a professional remodelling crew. You describe the sequence of<br />
actions on a piece of paper (that's us writing in "do" notation),<br />
and then, when required, that sequence will be evaluated by the<br />
remodelling crew ("in the monad"), which will provide you with the end<br />
result, hiding all the underlying complexity (how to prepare the<br />
paint, which nails to choose, etc.) from you.<br />
<br />
Let's use the interactive Haskell environment to decipher all the<br />
instructions we've written for the parsing library. As usual, we'll<br />
go top-down:<br />
<br />
*Main> :reload<br />
Ok, modules loaded: Main.<br />
*Main> :t parseInput<br />
parseInput :: GenParser Char st [Dir]<br />
*Main> :t dirAndSize<br />
dirAndSize :: GenParser Char st Dir<br />
*Main> <br />
<br />
Assuming (well, take my word for it) that "GenParser Char st" is our<br />
parsing monad, we can see that "parseInput", when evaluated, will<br />
produce a list of "Dir", and "dirAndSize", when evaluated, will<br />
produce a "Dir". Assuming that "Dir" somehow represents information<br />
about a single directory, that is pretty much what we wanted, isn't it?<br />
<br />
Let's see what "Dir" means. We defined the ''data[[type]]'' Dir as a record<br />
which holds an Int and a String:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-2-1.hs'<br />
data Dir = Dir Int String deriving Show<br />
</haskell><br />
<br />
In order to construct such records, we must use ''data [[constructor]]''<br />
Dir:<br />
<br />
*Main> :t Dir 1 "foo"<br />
Dir 1 "foo" :: Dir<br />
<br />
In order to reduce confusion for newbies, we could have written:<br />
<haskell><br />
data Dir = D Int String deriving Show<br />
</haskell><br />
<br />
, which would define ''data[[type]]'' "Dir" with ''data [[constructor]]'' "D".<br />
However, traditionally name of the data[[type]] and its [[constructor]] are<br />
chosen to be the same.<br />
<br />
The clause "[[deriving]] Show" instructs the compiler to generate enough code "behind<br />
the curtains" to make this ''datatype'' conform to the interface of<br />
the ''type [[class]]'' Show. We will explain ''type [[class]]es'' later; for<br />
now, let's just say that this allows us to "print" instances of<br />
"Dir".<br />
<br />
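As a quick sketch of what "deriving Show" gives us (our own example, reusing the Dir type defined above):<br />
<br />
```haskell
-- Dir as defined in the tutorial; "deriving Show" generates show for us.
data Dir = Dir Int String deriving Show

main :: IO ()
main = putStrLn (show (Dir 4096 "photos"))  -- prints: Dir 4096 "photos"
```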
'''Exercises:''' <br />
* examine types of "digit", "anyChar", "many", "many1" and "manyTill" to see how they are used to build more complex parsers from single ones.<br />
<br />
* compare types of "manyTill", "manyTill anyChar" and "manyTill anyChar newline". Note that "anyChar `manyTill` newline" is just another syntax sugar. Note that when function is supplied with less arguments that it actually needs, we get not a value, but a new function, which is called ''partial application''.<br />
<br />
<br />
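Partial application itself is easy to observe with a plain function of our own (a hypothetical example, not from the Parsec library):<br />
<br />
```haskell
-- add3 needs three arguments; supplying fewer yields a new function.
add3 :: Int -> Int -> Int -> Int
add3 a b c = a + b + c

addToThree :: Int -> Int
addToThree = add3 1 2   -- add3 partially applied to its first two arguments

main :: IO ()
main = print (addToThree 39)  -- prints 42
```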
OK. So, we combined a lot of primitive parsing actions to get ourselves a<br />
parser for the output of "du -sb". How can we actually parse something? The [[Parsec]] library supplies us with the function "parse":<br />
<br />
*Main> :t parse<br />
parse :: GenParser tok () a<br />
-> SourceName<br />
-> [tok]<br />
-> Either ParseError a<br />
*Main> :t parse parseInput<br />
parse parseInput :: SourceName -> [Char] -> Either ParseError [Dir]<br />
*Main> <br />
<br />
At first the [[type]] might be a bit cryptic, but once we supply "parse" with the parser we made, the compiler gets more information and presents us with a more concise [[type]].<br />
<br />
Stop and consider this for a moment. The compiler figured out type of the function without a single type annotation supplied by us! Imagine if a Java compiler deduced types for you, and you wouldn't have to specify types of arguments and return values of methods, ever.<br />
<br />
OK, back to the code. We can observe that "parse" is a function which,<br />
given a parser, the name of the source file or channel (e.g. "stdin"), and<br />
source data (a String, which is a list of "Char"s, written "[Char]"),<br />
will either produce a parse error or parse us a list of "Dir".<br />
<br />
The datatype "Either" is an example of a datatype whose constructors have names<br />
different from the name of the datatype. In fact, "Either" has two constructors:<br />
<br />
<haskell><br />
data Either a b = Left a | Right b<br />
</haskell><br />
<br />
To understand better what this means, consider the following<br />
example:<br />
<br />
*Main> :t Left 'a'<br />
Left 'a' :: Either Char b<br />
*Main> :t Right "aaa"<br />
Right "aaa" :: Either a [Char]<br />
*Main> <br />
<br />
You see that "Either" is a ''union'' (much like the C/C++ "union") which can<br />
hold a value of one of two distinct types. However, unlike a C/C++ "union",<br />
when presented with a value of type "Either Int Char" we can immediately see<br />
whether it's an Int or a Char - by looking at the constructor which was used to<br />
produce the value. Such datatypes are called "tagged unions", and they are<br />
another [[:Category:Idioms|power tool]] in the Haskell toolset.<br />
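A small sketch of our own shows how such a tagged union is taken apart by pattern matching on its constructors:<br />
<br />
```haskell
-- Pattern matching reveals which constructor produced the value:
describe :: Either Int Char -> String
describe (Left n)  = "an Int: " ++ show n
describe (Right c) = "a Char: " ++ [c]

main :: IO ()
main = do putStrLn (describe (Left 42))    -- prints: an Int: 42
          putStrLn (describe (Right 'x'))  -- prints: a Char: x
```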
<br />
Did you also notice that we provide "parse" with a parser, which is a monadic<br />
value, but receive not a new monadic value, but a parsing result? That is<br />
because "parse" is an evaluator for the "Parser" monad, much like the [[GHC]] or [[Hugs]] runtime is an evaluator for the IO monad. The function "parse" implements all the monadic machinery: it tracks errors and positions in the input, implements backtracking and lookahead, etc.<br />
<br />
Let's extend our "main" function to use "parse", actually parse the input,<br />
and show us the parsed data structures:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-2-1.hs'<br />
main = do input <- getContents<br />
          putStrLn ("DEBUG: got input " ++ input)<br />
          let dirs = case parse parseInput "stdin" input of<br />
                          Left err -> error $ "Input:\n" ++ show input ++<br />
                                              "\nError:\n" ++ show err<br />
                          Right result -> result<br />
          putStrLn "DEBUG: parsed:"; print dirs<br />
</haskell><br />
<br />
'''Exercise:'''<br />
<br />
* In order to understand this snippet of code better, examine (with ghci or hugs) the difference between 'drop 1 ( drop 1 ( drop 1 ( drop 1 ( drop 1 "foobar" ))))' and 'drop 1 $ drop 1 $ drop 1 $ drop 1 $ drop 1 "foobar"'. Examine type of ($).<br />
* Try putStrLn "aaa" and print "aaa" and see the difference, examine their types.<br />
* Try print (Dir 1 "foo") and putStrLn (Dir 1 "foo"). Examine types of print and putStrLn to understand the behavior in both cases.<br />
<br />
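As a quick illustration of the first exercise, both spellings below denote the same computation; ($) is just low-precedence function application:<br />

```haskell
-- Dropping one character five times: with parentheses and with ($)
withParens, withDollar :: String
withParens = drop 1 (drop 1 (drop 1 (drop 1 (drop 1 "foobar"))))
withDollar = drop 1 $ drop 1 $ drop 1 $ drop 1 $ drop 1 "foobar"
```

Both evaluate to "r"; ($) merely saves you the closing parentheses.<br />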
Let's try to run what we have so far:<br />
<br />
$ du -sb * | runhaskell ./cd-fit.hs<br />
<br />
DEBUG: got input 22325 Article.txt<br />
18928 Article.txt~<br />
1706 cd-fit.hs<br />
964 cd-fit.hs~<br />
61609 _darcs<br />
<br />
DEBUG: parsed:<br />
[Dir 22325 "Article.txt",Dir 18928 "Article.txt~",<br />
Dir 1706 "cd-fit.hs",Dir 964 "cd-fit.hs~",Dir 61609 "_darcs"]<br />
<br />
Seems to be doing exactly as planned. Now let's try some erroneous<br />
input:<br />
<br />
$ echo "foo" | runhaskell cd-fit.hs<br />
DEBUG: got input foo<br />
<br />
DEBUG: parsed:<br />
*** Exception: Input:<br />
"foo\n"<br />
Error:<br />
"stdin" (line 1, column 1):<br />
unexpected "f"<br />
expecting digit or end of input<br />
<br />
Seems to be doing fine. <br />
<br />
If you followed advice to put your code under version control, you<br />
could now use "darcs whatsnew" or "darcs diff -u" to examine your<br />
changes to the previous version. Use "darcs record" to commit them. As<br />
an exercise, first record the changes "outside" of function "main" and<br />
then record the changes in "main". Do "darcs changes" to examine a<br />
list of changes you've recorded so far.<br />
<br />
== Chapter 3: Packing the knapsack and testing it with class, too (and don't forget your towel!) ==<br />
<br />
Enough preliminaries already. Let's go pack some CDs.<br />
<br />
As you might already have recognized, our problem is a classical one. It is<br />
called the "knapsack problem" ([http://www.google.com/search?q=knapsack+problem google it], if you don't already know what it<br />
is; there are more than 100000 links).<br />
<br />
Let's start with the greedy solution, but first let's slightly modify our "Dir"<br />
datatype to allow easy extraction of its components:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-1.hs'<br />
data Dir = Dir {dir_size::Int, dir_name::String} deriving Show<br />
</haskell><br />
<br />
----<br />
'''Exercise:''' examine types of "Dir", "dir_size" and "dir_name"<br />
----<br />
<br />
From now on, we can use "dir_size d" to get the size of a directory, and<br />
"dir_name d" to get its name, provided that "d" is of type "Dir".<br />
<br />
The greedy algorithm sorts directories from the biggest down and tries to put<br />
them on the CD one by one until there is no room for more. We will need to track<br />
which directories we have added to the CD, so let's add another datatype and code this<br />
simple packing algorithm:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-1.hs'<br />
import Data.List (sortBy)<br />
<br />
-- DirPack holds a set of directories which are to be stored on single CD.<br />
-- 'pack_size' could be calculated, but we will store it separately to reduce<br />
-- amount of calculation<br />
data DirPack = DirPack {pack_size::Int, dirs::[Dir]} deriving Show<br />
<br />
-- For simplicity, let's assume that we deal with standard 700 Mb CDs for now<br />
media_size = 700*1024*1024<br />
<br />
-- Greedy packer tries to add directories one by one to an initially empty 'DirPack'<br />
greedy_pack dirs = foldl maybe_add_dir (DirPack 0 []) $ sortBy cmpSize dirs<br />
    -- comparing d2 against d1 sorts biggest-first, as described above<br />
    where cmpSize d1 d2 = compare (dir_size d2) (dir_size d1)<br />
<br />
-- Helper function, which adds directory "d" to the pack "p" only when the new<br />
-- total size does not exceed media_size<br />
maybe_add_dir p d =<br />
    let new_size = pack_size p + dir_size d<br />
        new_dirs = d : dirs p<br />
    in if new_size > media_size then p else DirPack new_size new_dirs<br />
</haskell><br />
<br />
----<br />
I'll highlight the areas which you could explore on your own (using other nice<br />
tutorials out there, of which I especially recommend "Yet Another Haskell<br />
Tutorial" by Hal Daume):<br />
* We choose to import a single function "sortBy" from a module [[Data.List]], not the whole thing.<br />
* Instead of coding a case-by-case recursive definition of "greedy_pack", we go with a higher-order approach, choosing "foldl" as a vehicle for list traversal. Examine its type. Other useful functions from the same category are "map", "foldr", "scanl" and "scanr". Look them up!<br />
* To sort a list of "Dir"s by size only, we use a custom comparison function and a parametrized sort - "sortBy". This sort of setup, where the user may provide a custom "modifier" for a generic library function, is quite common: look up "deleteBy", "deleteFirstsBy", "groupBy", "insertBy", "intersectBy", "maximumBy", "minimumBy", "sortBy", "unionBy".<br />
* To code the quite complex function "maybe_add_dir", we introduced several '''local definitions''' in the "let" clause, which we can reuse within the function body. We used a "where" clause in the "greedy_pack" function to achieve the same effect. Read about "let" and "where" clauses and the differences between them.<br />
* Note that in order to construct a new value of type "DirPack" (in the function "maybe_add_dir") we haven't used the helper accessor functions "pack_size" and "dirs".<br />
----<br />
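A minimal sketch of the two workhorses mentioned above, "foldl" and "sortBy" (the example data is mine, not from cd-fit):<br />

```haskell
import Data.List (sortBy)

-- foldl threads an accumulator through the list, left to right
total :: Int
total = foldl (+) 0 [1, 2, 3, 4]

-- sortBy takes a custom comparison; here we sort pairs by their second field
bySecond :: [(String, Int)]
bySecond = sortBy (\a b -> compare (snd a) (snd b)) [("a",3),("b",1),("c",2)]
```

Evaluate <tt>total</tt> and <tt>bySecond</tt> in ghci to see the fold produce 10 and the list come back ordered by the numbers.<br />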
<br />
In order to actually use our greedy packer we must call it from our "main"<br />
function, so let's add a few lines:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-1.hs'<br />
main = do ...<br />
-- compute solution and print it<br />
putStrLn "Solution:" ; print (greedy_pack dirs)<br />
</haskell><br />
<br />
Verify the integrity of our definitions by (re)loading our code in ghci. Compiles?<br />
Thought so :) Now, do "darcs record" and add a sensible commit message.<br />
<br />
Now it is time to test our creation. We could do it by actually running it in<br />
the wild like this:<br />
<br />
$ du -sb ~/DOWNLOADS/* | runhaskell ./cd-fit.hs<br />
<br />
This will prove that our code seems to be working. At least, this once. How<br />
about establishing with a reasonable degree of certainty that our code, in parts<br />
and as a whole, works properly, and doing so in a re-usable manner? In other<br />
words, how about writing some tests?<br />
<br />
Java programmers used to JUnit probably thought of screens of boilerplate<br />
code and hand-coded method invocations. Never fear - we will not do anything as<br />
silly :)<br />
<br />
Enter '''[[QuickCheck]]'''.<br />
<br />
[[QuickCheck]] is a tool for automated testing of your functions using<br />
(semi)random input data. In the spirit of "100b of code examples is worth 1kb of<br />
praise", let's show the code for testing the following ''property'': an attempt to re-pack the directories returned by "greedy_pack" should return a "DirPack" of exactly the same size:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-2.hs'<br />
import Test.QuickCheck<br />
import Control.Monad (liftM2, replicateM)<br />
<br />
-- We must teach QuickCheck how to generate arbitrary "Dir"s<br />
instance Arbitrary Dir where<br />
    -- Let's just skip "coarbitrary" for now, ok?<br />
    -- I promise, we will get back to it later :)<br />
    coarbitrary = undefined<br />
    -- We generate an arbitrary "Dir" by generating a random size and a random name<br />
    -- and stuffing them inside "Dir"<br />
    arbitrary = liftM2 Dir gen_size gen_name<br />
        -- Generate a random size between 10 and 1400 Mb<br />
        where gen_size = do s <- choose (10,1400)<br />
                            return (s*1024*1024)<br />
              -- Generate a random name 1 to 300 chars long, consisting of the symbols "fubar/"<br />
              gen_name = do n <- choose (1,300)<br />
                            replicateM n (elements "fubar/")<br />
<br />
-- For convenience and by tradition, all QuickCheck tests begin with prefix "prop_".<br />
-- Assume that "ds" will be a random list of "Dir"s and code your test.<br />
prop_greedy_pack_is_fixpoint ds =<br />
    let pack = greedy_pack ds<br />
    in pack_size pack == pack_size (greedy_pack (dirs pack))<br />
</haskell><br />
<br />
Let's run the test, after which I'll explain how it all works:<br />
<br />
Prelude> :r<br />
Compiling Main ( ./cd-fit.hs, interpreted )<br />
Ok, modules loaded: Main.<br />
*Main> quickCheck prop_greedy_pack_is_fixpoint<br />
[numbers spinning]<br />
OK, passed 100 tests.<br />
*Main> <br />
<br />
We've just seen our "greedy_pack" run on 100 completely (well, almost<br />
completely) random lists of "Dir"s, and it seems that the property indeed holds.<br />
<br />
Let's dissect the code. The most intriguing part is "instance Arbitrary Dir<br />
where", which declares that "Dir" is an '''[[instance]]''' of the '''type[[class]]''' "Arbitrary". Whoa, that's a whole lot of unknown words! :) Let's slow down a<br />
bit.<br />
<br />
What is a '''type[[class]]'''? A typeclass is the Haskell way of dealing with the<br />
following situation: suppose that you are writing a library of useful<br />
functions and you don't know in advance exactly how they will be used, so you<br />
want to make them generic. On the one hand, you don't want to restrict your<br />
users to a certain type (e.g. String). On the other hand, you want to enforce<br />
the convention that arguments to your functions must satisfy a certain set of<br />
constraints. That is where a '''typeclass''' comes in handy.<br />
<br />
Think of typeclass as a '''contract''' (or "interface", in Java terms) that<br />
your type must fulfill in order to be admitted as an argument to certain<br />
functions. <br />
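A tiny illustrative contract of our own (the class and type names here are made up for this sketch, not part of cd-fit):<br />

```haskell
-- A "contract": any type wishing to join must provide 'describe'
class Describable a where
  describe :: a -> String

data Color = Red | Green

-- Fulfilling the contract admits Color to functions expecting a Describable
instance Describable Color where
  describe Red   = "red"
  describe Green = "green"

-- A generic function: it works for ANY instance of Describable
announce :: Describable a => a -> String
announce x = "got " ++ describe x
```

Here <tt>announce</tt> never names "Color"; it relies only on the contract, so any future instance works with it unchanged.<br />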
<br />
Let's examine the typeclass "Arbitrary":<br />
<br />
*Main> :i Arbitrary<br />
class Arbitrary a where<br />
arbitrary :: Gen a<br />
coarbitrary :: a -> Gen b -> Gen b<br />
-- Imported from Test.QuickCheck<br />
instance Arbitrary Dir<br />
-- Defined at ./cd-fit.hs:61:0<br />
instance Arbitrary Bool -- Imported from Test.QuickCheck<br />
instance Arbitrary Double -- Imported from Test.QuickCheck<br />
instance Arbitrary Float -- Imported from Test.QuickCheck<br />
instance Arbitrary Int -- Imported from Test.QuickCheck<br />
instance Arbitrary Integer -- Imported from Test.QuickCheck<br />
-- rest skipped --<br />
<br />
It can be read this way: "Any [[type]] (let's name it 'a') can become a member of the [[class]] Arbitrary as soon as we define two functions for it - "arbitrary" and "coarbitrary" - with the signatures shown. For the types Dir, Bool, Double, Float, Int and Integer such definitions have been provided, so all those types are instances of the class Arbitrary".<br />
<br />
Now, if you write a function which operates on its arguments solely by means<br />
of "arbitrary" and "coarbitrary", you can be sure that this function will work<br />
on any type which is an instance of "Arbitrary"!<br />
<br />
Let's say it again. Someone (maybe even you) writes code (an API or library)<br />
which requires that input values implement a certain ''interface'', which is<br />
described in terms of functions. Once you show how your type implements this<br />
''interface'', you are free to use the API or library.<br />
<br />
Consider the function "sort" from the standard library:<br />
<br />
*Main> :t Data.List.sort<br />
Data.List.sort :: (Ord a) => [a] -> [a]<br />
<br />
We see that it sorts lists of any values which are instances of the typeclass<br />
"Ord". Let's examine that class:<br />
<br />
*Main> :i Ord<br />
class Eq a => Ord a where<br />
compare :: a -> a -> Ordering<br />
(<) :: a -> a -> Bool<br />
(>=) :: a -> a -> Bool<br />
(>) :: a -> a -> Bool<br />
(<=) :: a -> a -> Bool<br />
max :: a -> a -> a<br />
min :: a -> a -> a<br />
-- skip<br />
instance Ord Double -- Imported from GHC.Float<br />
instance Ord Float -- Imported from GHC.Float<br />
instance Ord Bool -- Imported from GHC.Base<br />
instance Ord Char -- Imported from GHC.Base<br />
instance Ord Integer -- Imported from GHC.Num<br />
instance Ord Int -- Imported from GHC.Base<br />
-- skip<br />
*Main> <br />
<br />
We see a couple of interesting things. First, there is an additional<br />
requirement listed: in order to be an instance of "Ord", a type must first be an<br />
instance of the typeclass "Eq". Then, we see that there is an awful lot of<br />
functions to define in order to be an instance of "Ord". Wait a second, isn't<br />
it silly to define both (<) and (>) when one can be expressed via the other?<br />
<br />
Right you are! Usually, a typeclass contains several "default" implementations<br />
for its functions, when it is possible to express them through each other (as<br />
it is with "Ord"). In this case it is possible to supply only a minimal<br />
definition (which in the case of "Ord" consists of either "compare" or "(<=)") and the others<br />
will be automatically derived. If you supply fewer functions than are required<br />
for the minimal implementation, the compiler/interpreter will say so and<br />
explain which functions you still have to define.<br />
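For instance, supplying only "compare" is enough to get the rest of "Ord" for free (a sketch with a made-up type, not part of cd-fit):<br />

```haskell
data Size = Small | Large deriving (Eq, Show)

-- We define only 'compare'; (<), (>), max, min etc. fall out of the defaults
instance Ord Size where
  compare Small Small = EQ
  compare Large Large = EQ
  compare Small Large = LT
  compare Large Small = GT
```

After loading this, <tt>Small < Large</tt> and <tt>max Small Large</tt> already work, even though we never wrote them.<br />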
<br />
Once again, we see that a lot of [[type]]s are already instances of typeclass Ord, and thus we are able to sort them.<br />
<br />
Now, let's take a look back to the definition of "Dir":<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-2.hs'<br />
data Dir = Dir {dir_size::Int, dir_name::String} deriving Show<br />
</haskell><br />
<br />
See that "[[deriving]]" clause? It instructs the compiler to automatically derive code to make "Dir" an instance of typeclass Show. The compiler knows about a bunch of standard typeclasses (Eq, Ord, Show, Enum, Bound, Typeable to name a few) and knows how to make a type into a "suitably good" instance of any of them. If you want to derive instances of more than one typeclass, say it this way: "deriving (Eq,Ord,Show)". Voila! Now we can compare, sort and print data of<br />
that type!<br />
<br />
Side note for Java programmers: just imagine java compiler which derives code<br />
for "implements Storable" for you...<br />
<br />
Side note for C++ programmers: just imagine that deep copy constructors are<br />
being written for you by compiler....<br />
<br />
----<br />
'''Exercises:'''<br />
* Examine typeclasses Eq and Show<br />
* Examine types of (==) and "print"<br />
* Try to make "Dir" instance of "Eq"<br />
----<br />
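One possible answer to the last exercise - just extend the deriving clause:<br />

```haskell
-- Deriving Eq (and Ord, while we are at it) alongside Show
data Dir = Dir {dir_size :: Int, dir_name :: String} deriving (Eq, Ord, Show)
```

With this in place, <tt>Dir 1 "foo" == Dir 1 "foo"</tt> evaluates to True, and records compare field by field, in declaration order.<br />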
<br />
OK, back to our tests. So, what did we have to do in order to make "Dir" an<br />
instance of "Arbitrary"? The minimal definition consists of "arbitrary". Let's<br />
examine it up close:<br />
<br />
*Main> :t arbitrary<br />
arbitrary :: (Arbitrary a) => Gen a<br />
<br />
See that "Gen a"? Reminds you of something? Right! Think of "IO a" and "Parser<br />
a" which we've seen already. This is yet another example of action-returning<br />
function, which could be used inside "do"-notation. (You might ask yourself,<br />
wouldn't it be useful to generalize that convenient concept of actions and<br />
"do"? Of course! It is already done, the concept is called "[[Monad]]" and we will talk about it in Chapter 400 :) )<br />
<br />
Since 'a' here is a [[type variable]] which is an instance of "Arbitrary", we can substitute "Dir" for it. So, how can we make and return an action of type "Gen Dir"?<br />
<br />
Let's look at the code:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-2.hs'<br />
arbitrary = liftM2 Dir gen_size gen_name<br />
    -- Generate a random size between 10 and 1400 Mb<br />
    where gen_size = do s <- choose (10,1400)<br />
                        return (s*1024*1024)<br />
          -- Generate a random name 1 to 300 chars long, consisting of the symbols "fubar/"<br />
          gen_name = do n <- choose (1,300)<br />
                        replicateM n (elements "fubar/")<br />
</haskell><br />
<br />
We have used the library-provided functions "choose" and "elements" to build up<br />
"gen_size :: Gen Int" and "gen_name :: Gen String" (exercise: don't take my<br />
word on that - find a way to check the types of "gen_name" and "gen_size"). Since<br />
"Int" and "String" are components of "Dir", we surely must be able to use "Gen<br />
Int" and "Gen String" to build "Gen Dir". But where is the "do" block for<br />
that? There is none - there is only a single call to "liftM2".<br />
<br />
Let's examine it:<br />
<br />
*Main> :t liftM2<br />
liftM2 :: (Monad m) => (a1 -> a2 -> r) -> m a1 -> m a2 -> m r<br />
<br />
Kind of scary, right? Let's provide typechecker with more context:<br />
<br />
*Main> :t liftM2 Dir<br />
liftM2 Dir :: (Monad m) => m Int -> m String -> m Dir<br />
<br />
Since you have already heard that "Gen" is a "Monad", you can substitute "Gen" for "m" here, obtaining "liftM2 Dir :: Gen Int -> Gen String -><br />
Gen Dir". Exactly what we wanted!<br />
<br />
Consider "liftM2" to be "advanced topic" of this chapter (which we will cover<br />
later) and just note for now that:<br />
* "2" is a number of arguments for data constructor "Dir" and we have used "liftM2" to construct "Gen Dir" out of "Dir"<br />
* There are also "liftM", "liftM3", "liftM4", "liftM5"<br />
* "liftM2" is defined as "liftM2 f a1 a2 = do x<-a1; y<-a2; return (f x y)"<br />
<br />
Hopefully, this will all make sense after you read it for the third<br />
time ;)<br />
<br />
Oh, by the way - don't forget to "darcs record" your changes!<br />
<br />
== Chapter 4: REALLY packing the knapsack this time == <br />
<br />
In this chapter we are going to write another not-so-trivial packing<br />
method, compare packing methods efficiency, and learn something new<br />
about debugging and profiling of the Haskell programs along the way.<br />
<br />
It might not be immediately obvious whether our packing algorithm is<br />
effective, and if so - in which particular way. Are its runtime,<br />
memory consumption and quality of results sufficient? Are there any<br />
alternative algorithms, and how do they compare to each other?<br />
<br />
Let's code another solution to the knapsack packing problem, called the "dynamic programming method" and put both variants to the test.<br />
<br />
This time, I'll not dissect the listing and explain it bit by bit. Instead, comments are provided in the code:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-1.hs'<br />
----------------------------------------------------------------------------------<br />
-- Dynamic programming solution to the knapsack (or, rather, disk) packing problem<br />
--<br />
-- Let the `bestDisk x' be the "most tightly packed" disk of total <br />
-- size no more than `x'.<br />
precomputeDisksFor :: [Dir] -> [DirPack]<br />
precomputeDisksFor dirs =<br />
   -- By calculating `bestDisk' for all possible disk sizes, we can<br />
   -- obtain the solution for a particular case by a simple lookup in our list of<br />
   -- solutions :)<br />
   let precomp = map bestDisk [0..]<br />
<br />
       -- How to calculate `bestDisk'? Let's opt for a recursive definition:<br />
       -- Recursion base: the best-packed disk of size 0 is empty<br />
       bestDisk 0 = DirPack 0 []<br />
       -- Recursion step: for a size `limit' bigger than 0, the best-packed disk is<br />
       -- computed as follows:<br />
       bestDisk limit =<br />
          -- 1. Take all non-empty dirs that could possibly fit on that disk by themselves.<br />
          -- Consider them one by one. Let the size of a particular dir be `dir_size d'.<br />
          -- Add it to the best-packed disk of size <= (limit - dir_size d), thus<br />
          -- producing a disk of size <= limit. Do that for all "candidate"<br />
          -- dirs that are not yet on our disk:<br />
          case [ DirPack (dir_size d + s) (d:ds)<br />
               | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
               , dir_size d > 0<br />
               , let (DirPack s ds) = precomp!!(limit - dir_size d)<br />
               , d `notElem` ds<br />
               ] of<br />
            -- Either we fail to add any dirs (probably because all of them are too big) -<br />
            -- then just report that the disk must be left empty:<br />
            [] -> DirPack 0 []<br />
            -- or we produce some alternative packings; let's choose the best of them all:<br />
            packs -> maximumBy cmpSize packs<br />
<br />
       cmpSize a b = compare (pack_size a) (pack_size b)<br />
<br />
   in precomp<br />
<br />
-- Once we have precomputed disks of all possible sizes for the given set of dirs,<br />
-- the solution to a particular problem is simple: just take the solution for the<br />
-- required 'media_size', and that's it!<br />
dynamic_pack dirs = (precomputeDisksFor dirs)!!media_size<br />
</haskell><br />
<br />
Notice that it took almost the same amount of text to describe the algorithm as to write an implementation of it. Nice, eh?<br />
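The self-referential "precomp = map bestDisk [0..]" trick - memoizing a recursive function through a lazy list - deserves a note of its own. Here is the same idea applied to Fibonacci numbers (a sketch of mine, not part of cd-fit):<br />

```haskell
-- 'fibs' memoizes 'fib' by storing every result in a lazy list;
-- the recursive calls look answers up in the list instead of recomputing them
fibs :: [Integer]
fibs = map fib [0..]
  where fib 0 = 0
        fib 1 = 1
        fib n = fibs !! (n - 1) + fibs !! (n - 2)
```

Ask ghci for <tt>fibs !! 10</tt> and you get 55; each element is computed at most once, exactly as each `bestDisk limit' is.<br />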
<br />
----<br />
<br />
'''Exercises:'''<br />
* Make all necessary amendments to the previously written code to make this example compile. Hints: browse the modules Data.List and Data.Ix for functions that are "missing" - maybe you will find them there (use ":browse Module.Name" at the ghci prompt). Did you have to define new instances of some classes? How did you do that?<br />
* <tt>[ other_function local_binding | x <- some_list, x > 0, let local_binding = some_function x ]</tt> is called a "list comprehension". This is another example of "syntactic sugar", which can lead to nicely readable code but, when abused, can lead to syntactic caries :) Do you understand what this sample does: <tt>let solve x = [ y | x <- [0..], y<-[0..], y == x * x ]</tt>? Could you (with the help of a decent tutorial) write a de-sugared version of it? (Yes, I know that finding a square root does not require list traversals, but for the sake of self-education try to do it)<br />
* Notice that in order to code the quite complex implementation of <tt>precomputeDisksFor</tt> we split it up into several smaller pieces and put them as '''local bindings''' inside the '''let''' clause.<br />
* Notice that we use '''pattern matching''' both to define <tt>bestDisk</tt> on a case-by-case basis and to "peer into" ('''de-construct''') <tt>DirPack</tt> in the <tt>let (DirPack s ds)=precomp!!(limit - dir_size d)</tt> line.<br />
* Notice how we use function composition to build a complex condition for filtering the list of dirs.<br />
<br />
---- <br />
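As a warm-up for the de-sugaring exercise, here is a simpler comprehension next to one possible hand-expanded equivalent (the example is mine):<br />

```haskell
-- squares of the odd numbers in [1..5]
sugared :: [Int]
sugared = [ y * y | y <- [1..5], odd y ]

-- one way to de-sugar it: concatMap plus an explicit guard
desugared :: [Int]
desugared = concatMap (\y -> if odd y then [y * y] else []) [1..5]
```

Both yield [1,9,25]; generators become concatMap, guards become the if-then-[] pattern.<br />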
<br />
Before we move any further, let's make a small cosmetic change to our<br />
code. Right now our solution uses 'Int' to store directory sizes. In<br />
Haskell, 'Int' is a platform-dependent integer, which imposes certain<br />
limitations on the values of this type. An attempt to compute a value<br />
of type 'Int' that exceeds the bounds will result in an overflow.<br />
The standard Haskell libraries have a special typeclass<br />
<hask>Bounded</hask>, which allows one to define and examine such bounds:<br />
<br />
Prelude> :i Bounded <br />
class Bounded a where<br />
minBound :: a<br />
maxBound :: a<br />
-- skip --<br />
instance Bounded Int -- Imported from GHC.Enum<br />
<br />
We see that 'Int' is indeed bounded. Let's examine the bounds:<br />
<br />
Prelude> minBound :: Int <br />
-2147483648<br />
Prelude> maxBound :: Int<br />
2147483647<br />
Prelude> <br />
<br />
Those of you who are C-literate will spot at once that in this case<br />
'Int' is a so-called "signed 32-bit integer", which means that we<br />
would run into errors trying to operate on directories/directory packs<br />
which are bigger than 2 GB.<br />
<br />
Luckily for us, Haskell has integers of arbitrary precision (limited<br />
only by the amount of available memory). The appropriate type is<br />
called 'Integer':<br />
<br />
Prelude> (2^50) :: Int<br />
0 -- overflow<br />
Prelude> (2^50) :: Integer<br />
1125899906842624 -- no overflow<br />
Prelude><br />
<br />
Let's change the definitions of 'Dir' and 'DirPack' to allow for bigger<br />
directory sizes:<br />
<haskell><br />
-- Taken from 'cd-fit-4-2.hs'<br />
data Dir = Dir {dir_size::Integer, dir_name::String} deriving (Eq,Show)<br />
data DirPack = DirPack {pack_size::Integer, dirs::[Dir]} deriving Show<br />
</haskell><br />
<br />
Try to compile the code or load it into ghci. You will get the<br />
following errors:<br />
<br />
cd-fit-4-2.hs:73:79:<br />
Couldn't match `Int' against `Integer'<br />
Expected type: Int<br />
Inferred type: Integer<br />
In the expression: limit - (dir_size d)<br />
In the second argument of `(!!)', namely `(limit - (dir_size d))'<br />
<br />
cd-fit-4-2.hs:89:47:<br />
Couldn't match `Int' against `Integer'<br />
Expected type: Int<br />
Inferred type: Integer<br />
In the second argument of `(!!)', namely `media_size'<br />
In the definition of `dynamic_pack':<br />
dynamic_pack dirs = (precomputeDisksFor dirs) !! media_size<br />
<br />
<br />
It seems that Haskell has some trouble using 'Integer' with '(!!)'.<br />
Let's see why:<br />
<br />
Prelude> :t (!!)<br />
(!!) :: [a] -> Int -> a<br />
<br />
It seems the definition of '(!!)' demands that the index be an 'Int', not<br />
an 'Integer'. Haskell never converts any type to some other type<br />
automatically - the programmer has to ask for that explicitly.<br />
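An explicit conversion looks like this (a tiny sketch of mine; 'fromInteger' converts an 'Integer' to any 'Num' type, including 'Int'):<br />

```haskell
n :: Integer
n = 5

-- (!!) wants an Int index, so we convert explicitly with fromInteger
picked :: Char
picked = "abcdef" !! fromInteger n
```

The compiler infers that <tt>fromInteger n</tt> must produce an 'Int' here, because that is what '(!!)' demands.<br />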
<br />
I will not repeat the section "Standard Haskell Classes" from<br />
[http://haskell.org/onlinereport/basic.html the Haskell Report] and<br />
explain why the typeclasses for the various numbers are organized the way they<br />
are. I will just say that the standard typeclass<br />
<hask>Num</hask> demands that numeric types implement the method<br />
<hask>fromInteger</hask>:<br />
<br />
Prelude> :i Num<br />
class (Eq a, Show a) => Num a where<br />
(+) :: a -> a -> a<br />
(*) :: a -> a -> a<br />
(-) :: a -> a -> a<br />
negate :: a -> a<br />
abs :: a -> a<br />
signum :: a -> a<br />
fromInteger :: Integer -> a<br />
-- Imported from GHC.Num<br />
instance Num Float -- Imported from GHC.Float<br />
instance Num Double -- Imported from GHC.Float<br />
instance Num Integer -- Imported from GHC.Num<br />
instance Num Int -- Imported from GHC.Num<br />
<br />
We see that <hask>Integer</hask> is a member of the typeclass<br />
<hask>Num</hask>, thus we can use <hask>fromInteger</hask> to make<br />
the type errors go away:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-2.hs'<br />
-- snip<br />
case [ DirPack (dir_size d + s) (d:ds)<br />
     | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
     , dir_size d > 0<br />
     , let (DirPack s ds) = precomp!!(fromInteger (limit - dir_size d))<br />
     , d `notElem` ds<br />
     ] of<br />
-- snip<br />
dynamic_pack dirs = (precomputeDisksFor dirs)!!(fromInteger media_size)<br />
-- snip<br />
</haskell><br />
<br />
The type errors went away, but a careful reader will spot at once that when<br />
the expression <hask>(limit - dir_size d)</hask> exceeds the bounds<br />
of <hask>Int</hask>, an overflow will occur and we will not access the<br />
correct list element. Don't worry, we will deal with this in a short while.<br />
<br />
Now, let's code the QuickCheck test for this function along the lines of the test for <tt>greedy_pack</tt>:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-2.hs'<br />
prop_dynamic_pack_is_fixpoint ds =<br />
    let pack = dynamic_pack ds<br />
    in pack_size pack == pack_size (dynamic_pack (dirs pack))<br />
</haskell><br />
<br />
Now, let's try to run it (DON'T PANIC, and save all your work in other applications first!):<br />
<br />
*Main> quickCheck prop_dynamic_pack_is_fixpoint<br />
<br />
Now, you took my advice seriously, didn't you? And you did have '''Ctrl-C''' handy, didn't you? Most probably, the attempt to run the test resulted in all your memory being consumed by the <tt>ghci</tt> process, which you hopefully interrupted soon enough by pressing '''Ctrl-C'''.<br />
<br />
What happened? Who ate all the memory? How do we debug this problem? GHC comes with profiling abilities, but we cannot use them here - they produce a report after the program terminates, and ours doesn't seem to terminate without consuming several terabytes of memory first. Still, there is a lot of room for maneuver.<br />
<br />
Let's see. Since we called <tt>dynamic_pack</tt> and it ate all the memory, let's not do that again. Instead, let's see what this function does and tweak it a bit to explore its behavior.<br />
<br />
Since we already know that the random lists of "Dir"s generated for our QuickCheck tests are of modest size (after all, <tt>greedy_pack</tt> munches them without significant memory consumption), the size of the input most probably is not the issue. However, <tt>dynamic_pack</tt> builds quite a huge list internally (via <tt>precomputeDisksFor</tt>). Could this be the problem?<br />
<br />
Let's turn the timing/memory stats on (":set +s" at the ghci prompt) and try to peek at various elements of the list returned by <tt>precomputeDisksFor</tt>:<br />
<br />
Prelude> :l cd-fit.hs<br />
Compiling Main ( cd-fit.hs, interpreted )<br />
Ok, modules loaded: Main.<br />
*Main> :set +s<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 0<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.06 secs, 1277972 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 10<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.00 secs, 0 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 100<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.01 secs, 1519064 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 1000<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.03 secs, 1081808 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 10000<br />
DirPack {pack_size = 0, dirs = []}<br />
(1.39 secs, 12714088 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 100000<br />
Interrupted.<br />
<br />
Aha! This seems to be the problem, since the computation of element 100000 fails to terminate in "reasonable" time - and to think that we tried to compute the <tt>700*1024*1024</tt>th element...<br />
<br />
Let's modify our code a bit to allow the disk size to be tweaked:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-3.hs'<br />
dynamic_pack limit dirs = (precomputeDisksFor dirs)!!(fromInteger limit)<br />
<br />
prop_dynamic_pack_is_fixpoint ds =<br />
    let pack = dynamic_pack media_size ds<br />
    in pack_size pack == pack_size (dynamic_pack media_size (dirs pack))<br />
<br />
prop_dynamic_pack_small_disk ds =<br />
    let pack = dynamic_pack 50000 ds<br />
    in pack_size pack == pack_size (dynamic_pack 50000 (dirs pack))<br />
<br />
-- rename the "old" main to "moin", then:<br />
main = quickCheck prop_dynamic_pack_small_disk<br />
</haskell><br />
<br />
Compile a profiling version of your code with <tt>ghc -O --make -prof -auto-all -o cd-fit cd-fit.hs</tt> and run it like this:<br />
<br />
$ ./cd-fit +RTS -p<br />
OK, passed 100 tests.<br />
<br />
First, note that our code satisfies at least one simple property. Good. Now let's examine the profile. Look into the file "cd-fit.prof", which was produced in your current directory.<br />
<br />
Most probably, you'll see something like this:<br />
<br />
cd-fit +RTS -p -RTS<br />
<br />
total time = 2.18 secs (109 ticks @ 20 ms)<br />
total alloc = 721,433,008 bytes (excludes profiling overheads)<br />
<br />
COST CENTRE MODULE %time %alloc<br />
<br />
precomputeDisksFor Main 88.1 99.8<br />
dynamic_pack Main 11.0 0.0<br />
<br />
individual inherited<br />
COST CENTRE MODULE no. entries %time %alloc %time %alloc<br />
<br />
MAIN MAIN 1 0 0.0 0.0 100.0 100.0<br />
CAF Main 174 11 0.9 0.2 100.0 100.0<br />
prop_dynamic_pack_small_disk Main 181 100 0.0 0.0 99.1 99.8<br />
dynamic_pack Main 182 200 11.0 0.0 99.1 99.8<br />
precomputeDisksFor Main 183 200 88.1 99.8 88.1 99.8<br />
main Main 180 1 0.0 0.0 0.0 0.0<br />
<br />
Examine the column "individual %alloc". As we thought, all the memory was<br />
allocated within <tt>precomputeDisksFor</tt>. However, the amount of<br />
memory allocated (more than 700 MB, according to the line "total<br />
alloc") seems to be a little too much for our simple task. We will dig<br />
deeper and find where we are wasting it.<br />
<br />
Let's examine memory consumption a little closer via so-called "heap<br />
profiles". Run <tt>./cd-fit +RTS -hb</tt>. This produces a "biographical<br />
heap profile", which tells us how various parts of the memory were<br />
used during the program's run time. The heap profile is saved to<br />
"cd-fit.hp". It is next to impossible to read and comprehend as is,<br />
so use "hp2ps cd-fit.hp" to produce a nice PostScript picture which<br />
is worth a thousand words. View it with "gv" or "ghostview" or "full<br />
Adobe Acrobat (not Reader)". (This and subsequent pictures are<br />
'''not''' attached here).<br />
<br />
Notice that most of the graph is taken up by the region marked "VOID". <br />
This means that the memory allocated was never used. Notice that there<br />
are '''no''' areas marked "USE", "LAG" or "DRAG". It seems like our<br />
program hardly uses '''any''' of the allocated memory at all. Wait a<br />
minute! How could that be? Surely it must use something when it packs<br />
those randomly-generated directories, which are 10 to 1400 Mb in size,<br />
onto the imaginary disks of 50000 bytes.... Oops. Severe size<br />
mismatch. We should have spotted it earlier, when we were timing<br />
<tt>precomputeDisksFor</tt>. Scroll back and observe how each run<br />
returned the very same result: an empty directory set.<br />
<br />
Our random directories are too big, but the code nevertheless spends time<br />
and memory trying to "pack" them. Obviously,<br />
<tt>precomputeDisksFor</tt> (which is responsible for 90% of total<br />
memory consumption and run time) is flawed in some way.<br />
<br />
Let's take a closer look at what takes up so much memory. Run<br />
<tt>./cd-fit +RTS -h -hbvoid</tt> and produce PostScript picture for<br />
this memory profile. This will give us a detailed breakdown of all<br />
memory whose "biography" shows that it's been "VOID" (unused). My<br />
picture (and I presume yours as well) shows that the VOID memory<br />
consists of "thunks" labeled "precomputeDisksFor/pre...". We can<br />
safely assume that the second word is "precomp". (You wonder why?<br />
Look again at the code and try to find a function named "pre.*" which is<br />
called from inside <tt>precomputeDisksFor</tt>.)<br />
<br />
This means that the memory has been taken by the list generated inside<br />
"precomp". Rumor has it that memory leaks in Haskell are caused by<br />
either too little laziness or too much laziness. It seems like we have<br />
too little laziness here: we evaluate more elements of the list than<br />
we actually need and keep them from being garbage-collected. <br />
<br />
Note how we look up an element of "precomp" in this piece of code:<br />
<br />
<haskell><br />
case [ DirPack (dir_size d + s) (d:ds) <br />
| d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
, dir_size d > 0<br />
, let (DirPack s ds)=precomp!!(fromInteger (limit - dir_size d))<br />
, d `notElem` ds<br />
</haskell><br />
<br />
<br />
Obviously, the whole list generated by "precomp" must be kept in<br />
memory for such lookups: since we can't be sure that some element<br />
will not be needed again, it cannot be garbage-collected.<br />
<br />
Let's rewrite the code to eliminate the list (incidentally, this will also deal with the possible Int overflow when accessing "precomp" via the (!!) operator):<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-4.hs'<br />
-- Let the `bestDisk x' be the "most tightly packed" disk of total <br />
-- size no more than `x'.<br />
-- How to calculate `bestDisk'? Lets opt for a recursive definition:<br />
-- Recursion base: best packed disk of size 0 is empty and best-packed<br />
-- disk for empty list of directories on it is also empty.<br />
bestDisk 0 _ = DirPack 0 []<br />
bestDisk _ [] = DirPack 0 []<br />
-- Recursion step: for size `limit`, bigger than 0, best packed disk is<br />
-- computed as follows:<br />
bestDisk limit dirs =<br />
-- Take all non-empty dirs that could possibly fit on that disk by themselves.<br />
-- Consider them one by one. Let the size of a particular dir be `dir_size d'.<br />
-- Let's add it to the best-packed disk of size <= (limit - dir_size d), thus<br />
-- producing a disk of size <= limit. Let's do that for all "candidate"<br />
-- dirs that are not yet on our disk:<br />
case [ DirPack (dir_size d + s) (d:ds) <br />
| d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
, dir_size d > 0<br />
, let (DirPack s ds)= bestDisk (limit - dir_size d) dirs <br />
, d `notElem` ds<br />
] of<br />
-- We either fail to add any dirs (probably because all of them are too big).<br />
-- Well, just report that disk must be left empty:<br />
[] -> DirPack 0 []<br />
-- Or we produce some alternative packings. Let's choose the best of them all:<br />
packs -> maximumBy cmpSize packs<br />
<br />
cmpSize a b = compare (pack_size a) (pack_size b)<br />
<br />
dynamic_pack limit dirs = bestDisk limit dirs<br />
</haskell><br />
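Incidentally, the <hask>cmpSize</hask> helper above is such a common pattern that the standard <hask>Data.Ord</hask> module packages it as <hask>comparing</hask>. A small standalone sketch (using <hask>length</hask> in place of <hask>pack_size</hask>, so the example compiles on its own):<br />

```haskell
import Data.List (maximumBy)
import Data.Ord  (comparing)

-- comparing f a b == compare (f a) (f b), so `cmpSize` above
-- could be written simply as `comparing pack_size`.
longestName :: [String] -> String
longestName = maximumBy (comparing length)
```

So <hask>maximumBy cmpSize packs</hask> could equally be written <hask>maximumBy (comparing pack_size) packs</hask>.<br />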
<br />
<br />
Compile the profiling version of this code and obtain the overall<br />
execution profile (with "+RTS -p"). You'll get something like this:<br />
<br />
cd-fit +RTS -p -RTS<br />
<br />
total time = 0.00 secs (0 ticks @ 20 ms)<br />
total alloc = 1,129,520 bytes (excludes profiling overheads)<br />
<br />
COST CENTRE MODULE %time %alloc<br />
<br />
CAF GHC.Float 0.0 4.4<br />
main Main 0.0 93.9<br />
<br />
individual inherited<br />
COST CENTRE MODULE no. entries %time %alloc %time %alloc<br />
MAIN MAIN 1 0 0.0 0.0 0.0 100.0<br />
main Main 180 1 0.0 93.9 0.0 94.2<br />
prop_dynamic_pack_small_disk Main 181 100 0.0 0.0 0.0 0.3<br />
dynamic_pack Main 182 200 0.0 0.2 0.0 0.3<br />
bestDisk Main 183 200 0.0 0.1 0.0 0.1<br />
<br />
We achieved a major improvement: memory consumption is reduced by a factor<br />
of 700! Now we can test the code on the "real task": change the<br />
code to run the test for packing the full-sized disk:<br />
<br />
<haskell><br />
main = quickCheck prop_dynamic_pack_is_fixpoint<br />
</haskell><br />
<br />
Compile with profiling and run (with "+RTS -p"). If you are not lucky<br />
and a considerably big test set is randomly generated for your<br />
runs, you'll have to wait. And wait even more. And more.<br />
<br />
Go make some tea. Drink it. Read some Tolstoy (do you have "War and<br />
Peace" handy?). Chances are that by the time you are done with<br />
Tolstoy, the program will still be running (just take my word for it, don't<br />
check).<br />
<br />
If you are lucky, your program will finish fast enough and leave you<br />
with a profile. According to the profile, the program spends 99% of its time<br />
inside <tt>bestDisk</tt>. Could we speed up <tt>bestDisk</tt> somehow?<br />
<br />
Note that <tt>bestDisk</tt> performs several simple calculations for<br />
which it must call itself. However, this is done rather inefficiently:<br />
each time we pass <tt>bestDisk</tt> the exact same set of<br />
directories it was called with, even if we have already "packed"<br />
some of them. Let's amend this:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-5.hs'<br />
case [ DirPack (dir_size d + s) (d:ds) <br />
| let small_enough = filter ( (inRange (0,limit)).dir_size ) dirs<br />
, d <- small_enough<br />
, dir_size d > 0<br />
, let (DirPack s ds)= bestDisk (limit - dir_size d) (delete d small_enough)<br />
] of<br />
</haskell><br />
<br />
Recompile and run again. Runtimes could be lengthy but bearable, and<br />
the number of times <tt>bestDisk</tt> is called (according to the profile)<br />
should decrease significantly. <br />
<br />
Finally, let's compare both packing algorithms. Intuitively, we feel<br />
that the greedy algorithm should produce worse results, don't we? Let's put<br />
this feeling to the test:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-5.hs'<br />
prop_greedy_pack_is_no_better_than_dynamic_pack ds =<br />
pack_size (greedy_pack ds) <= pack_size (dynamic_pack media_size ds)<br />
</haskell><br />
<br />
Verify that it is indeed so by running <tt>quickCheck</tt> for this<br />
test several times. I feel that this concludes our knapsacking<br />
exercises. <br />
<br />
Adventurous readers could continue further by implementing so-called<br />
"scaling" for <tt>dynamic_pack</tt>, where we divide all directory<br />
sizes and the medium size by the size of the smallest directory to proceed<br />
with smaller numbers (which promises faster runtimes).<br />
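To hint at what such scaling might look like, here is a standalone sketch over plain <hask>Integer</hask> sizes (not the tutorial's directory type; <hask>scaleSizes</hask> is a made-up helper, and it assumes at least one positive size). Rounding directory sizes up and the medium size down guarantees that any pack valid in scaled units is still valid in bytes:<br />

```haskell
-- Hypothetical helper: scale a medium size and a list of directory
-- sizes down by the smallest positive directory size. Directory
-- sizes round up and the limit rounds down, so a pack that fits in
-- scaled units also fits in bytes (we may only lose some packing
-- opportunities, never produce an overfull disk).
scaleSizes :: Integer -> [Integer] -> (Integer, [Integer])
scaleSizes limit sizes = (limit `div` unit, map up sizes)
  where unit = minimum (filter (> 0) sizes)  -- assumes one exists
        up s = (s + unit - 1) `div` unit     -- round up
```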
<br />
== Chapter 5: (Ab)using monads and destructing constructors for fun and profit ==<br />
<br />
We already mentioned monads quite a few times. They are described in<br />
numerous articles and tutorials (see Chapter 400). It's hard to read a<br />
daily dose of any Haskell mailing list and not come across the word<br />
"monad" a dozen times.<br />
<br />
Since we have already made quite some progress with Haskell, it's time we<br />
revisited monads once again. I will let other sources teach you<br />
the theory behind monads, the overall usefulness of the concept, etc.<br />
Instead, I will focus on providing you with examples.<br />
<br />
Let's take a part of the real world program which involves XML<br />
processing. We will work with XML tag attributes, which are<br />
essentially named values:<br />
<haskell><br />
-- Taken from 'chapter5-1.hs'<br />
type Attribute = (Name, AttValue)<br />
</haskell><br />
<br />
'Name' is a plain string, and a value can be '''either''' a string or<br />
references (also strings) to other attributes which hold the actual<br />
value (now, this is not a valid XML thing, but for the sake of<br />
providing a nice example, let's accept it). The word "either" suggests<br />
that we use the 'Either' datatype:<br />
<haskell><br />
type AttValue = Either Value [Reference]<br />
type Name = String<br />
type Value = String<br />
type Reference = String<br />
<br />
-- Sample list of simple attributes:<br />
simple_attrs = [ ( "xml:lang", Left "en" )<br />
, ( "xmlns", Left "jabber:client" )<br />
, ( "xmlns:stream", Left "http://etherx.jabber.org/streams" ) ]<br />
<br />
-- Sample list of attributes with references:<br />
complex_attrs = [ ( "xml:lang", Right ["lang"] )<br />
, ( "lang", Left "en" )<br />
, ( "xmlns", Right ["ns","subns"] )<br />
, ( "ns", Left "jabber" )<br />
, ( "subns", Left "client" )<br />
, ( "xmlns:stream", Left "http://etherx.jabber.org/streams" ) ]<br />
</haskell><br />
<br />
'''Our task is:''' to write a function that will look up the value of an<br />
attribute by its name in a given list of attributes. When an<br />
attribute contains reference(s), we resolve them (looking up the<br />
referenced attributes in the same list) and concatenate their values,<br />
separated by colons. Thus, looking up the attribute "xmlns" in both<br />
sample sets of attributes should return the same value.<br />
<br />
Following the example set by the <hask>Data.List.lookup</hask> from<br />
the standard libraries, we will call our function<br />
<hask>lookupAttr</hask> and it will return <hask>Maybe Value</hask>,<br />
allowing for lookup errors:<br />
<br />
<haskell><br />
-- Taken from 'chapter5-1.hs'<br />
lookupAttr :: Name -> [Attribute] -> Maybe Value<br />
-- Since we don't have the code for 'lookupAttr' yet, but want<br />
-- to compile the code already, we use the function 'undefined' to<br />
-- provide a default, "always-fail-with-runtime-error" function body.<br />
lookupAttr = undefined<br />
</haskell><br />
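For reference, here is how the standard <hask>lookup</hask> we are imitating behaves (a standalone toy; <hask>findPort</hask> and its table are made up for illustration):<br />

```haskell
-- Prelude's lookup :: Eq a => a -> [(a, b)] -> Maybe b returns the
-- value of the first pair with a matching key, or Nothing when no
-- key matches.
findPort :: String -> Maybe Int
findPort name = lookup name [("http", 80), ("ssh", 22)]
```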
<br />
Let's try to code <hask>lookupAttr</hask> using <hask>lookup</hask> in<br />
a very straightforward way:<br />
<br />
<haskell><br />
-- Taken from 'chapter5-1.hs'<br />
import Data.List<br />
<br />
lookupAttr :: Name -> [Attribute] -> Maybe Value<br />
lookupAttr nm attrs = <br />
-- First, we lookup 'Maybe AttValue' by name and<br />
-- check whether we are successful:<br />
case (lookup nm attrs) of<br />
-- Pass the lookup error through.<br />
Nothing -> Nothing <br />
-- If the given name exists, see if it is a value or a reference:<br />
Just attv -> case attv of<br />
-- It's a value. Return it!<br />
Left val -> Just val<br />
-- It's a list of references :(<br />
-- We have to look them up, accounting for<br />
-- possible failures.<br />
-- First, we will perform lookup of all references ...<br />
Right refs -> <br />
let vals = [ lookupAttr ref attrs | ref <- refs ]<br />
-- .. then, we will exclude lookup failures<br />
wo_failures = filter (/=Nothing) vals<br />
-- ... find a way to remove annoying 'Just' wrapper<br />
stripJust (Just v) = v<br />
-- ... use it to extract all lookup results as strings<br />
strings = map stripJust wo_failures<br />
in<br />
-- ... finally, combine them into single String. <br />
-- If all lookups failed, we should pass failure to caller.<br />
case null strings of<br />
True -> Nothing<br />
False -> Just (concat (intersperse ":" strings))<br />
</haskell><br />
<br />
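(A side note on the last line: <hask>concat (intersperse ":" strings)</hask> is itself a common enough combination that <hask>Data.List</hask> packages it as <hask>intercalate</hask>; a standalone check:)<br />

```haskell
import Data.List (intercalate)

-- intercalate sep xs == concat (intersperse sep xs)
joinColon :: [String] -> String
joinColon = intercalate ":"
```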
Testing:<br />
<br />
*Main> lookupAttr "xmlns" complex_attrs<br />
Just "jabber:client"<br />
*Main> lookupAttr "xmlns" simple_attrs<br />
Just "jabber:client"<br />
*Main><br />
<br />
It works, but ... it seems strange that such a boatload of code is<br />
required for quite a simple task. If you examine the code closely,<br />
you'll see that the code bloat is caused by:<br />
<br />
* the fact that after each step we check whether an error occurred<br />
<br />
* unwrapping Strings from <hask>Maybe</hask> and <hask>Either</hask> data constructors and wrapping them back.<br />
<br />
At this point C++/Java programmers would say that since we just pass<br />
errors upstream, all those cases could be replaced by the single "try<br />
... catch ..." block, and they would be right. Does this mean that<br />
Haskell programmers are reduced to using "case"s, which were already<br />
obsolete 10 years ago?<br />
<br />
Monads to the rescue! As you can read elsewhere (see section 400),<br />
monads are used in advanced ways to construct computations from other<br />
computations. Just what we need - we want to combine several simple<br />
steps (lookup value, lookup reference, ...) into function<br />
<hask>lookupAttr</hask> in a way that would take into account possible<br />
failures.<br />
<br />
Let's start with the code and dissect it afterwards:<br />
<haskell><br />
-- Taken from 'chapter5-2.hs'<br />
import Control.Monad<br />
<br />
lookupAttr' nm attrs = do<br />
-- First, we lookup 'AttValue' by name<br />
attv <- lookup nm attrs<br />
-- See if it is a value or a reference:<br />
case attv of<br />
-- It's a value. Return it!<br />
Left val -> Just val<br />
-- It's a list of references :(<br />
-- We have to look them up, accounting for<br />
-- possible failures.<br />
-- First, we will perform lookup of all references ...<br />
Right refs -> do vals <- sequence $ map (flip lookupAttr' attrs) refs<br />
-- ... since all failures are already excluded by "monad magic",<br />
-- ... and all 'Just's have been removed likewise,<br />
-- ... we just combine values into single String,<br />
-- ... and return failure if it is empty. <br />
guard (not (null vals))<br />
return (concat (intersperse ":" vals))<br />
</haskell><br />
<br />
'''Exercise''': compile the code, test that <hask>lookupAttr</hask><br />
and <hask>lookupAttr'</hask> really behave in the same way. Try to<br />
write a QuickCheck test for that, defining the <br />
<hask>instance Arbitrary Name</hask> such that arbitrary names will be taken from<br />
names available in <hask>simple_attrs</hask>.<br />
<br />
Well, back to the story. Noticed the drastic reduction in code size?<br />
If you drop comments, the code occupies a mere 7 lines instead of 13,<br />
an almost two-fold reduction. How did we achieve this?<br />
<br />
First, notice that we never ever check whether some computation<br />
returns <hask>Nothing</hask> anymore. Yet, try to look up some<br />
non-existent attribute name, and <hask>lookupAttr'</hask> will return<br />
<hask>Nothing</hask>. How does this happen? The secret lies in the fact<br />
that the type constructor <hask>Maybe</hask> is a "monad".<br />
<br />
We use the keyword <hask>do</hask> to indicate that the following block of<br />
code is a sequence of '''monadic actions''', where '''monadic magic'''<br />
has to happen when we use '<-', 'return' or move from one action to<br />
another.<br />
<br />
Different monads have different '''magic'''. Library code says that the<br />
type constructor <hask>Maybe</hask> is a monad in which we can use<br />
<hask><-</hask> to "extract" values from the wrapper <hask>Just</hask> and<br />
use <hask>return</hask> to put them back in the form of<br />
<hask>Just some_value</hask>. When we move from one action in the "do" block to<br />
the next, a check happens: if the action returned <hask>Nothing</hask>,<br />
all subsequent computations will be skipped and the whole "do" block<br />
will return <hask>Nothing</hask>.<br />
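That "check between actions" can be written out directly. Here is a simplified sketch of what the bind operation of <hask>Maybe</hask> does (<hask>bindMaybe</hask> is our own name for illustration; the real class method is <hask>(>>=)</hask>):<br />

```haskell
-- Nothing short-circuits the rest of the chain; Just feeds the
-- wrapped value onward to the next step.
bindMaybe :: Maybe a -> (a -> Maybe b) -> Maybe b
bindMaybe Nothing  _ = Nothing
bindMaybe (Just v) f = f v
```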
<br />
Try this to understand it all better:<br />
<haskell><br />
*Main> let foo x = do v <- x; return (v+1) in foo (Just 5)<br />
Just 6<br />
*Main> let foo x = do v <- x; return (v+1) in foo Nothing <br />
Nothing<br />
*Main> let foo x = do v <- x; return (Data.Char.ord v) in foo (Just 'a')<br />
Just 97<br />
*Main> let foo x = do v <- x; return (Data.Char.ord v) in foo Nothing <br />
Nothing<br />
*Main> <br />
</haskell><br />
<br />
Do not mind <hask>sequence</hask> and <hask>guard</hask> for now;<br />
we will get to them in a little while.<br />
<br />
Since we have already removed one reason for code bloat, it is time to deal<br />
with the other one. Notice that we have to use <hask>case</hask> to<br />
'''deconstruct''' the value of type <hask>Either Value<br />
[Reference]</hask>. Surely we are not the first to do this, and such<br />
a use case has to be quite a common one. <br />
<br />
Indeed, there is a simple remedy for our case, and it is called<br />
<hask>either</hask>:<br />
<br />
*Main> :t either<br />
either :: (a -> c) -> (b -> c) -> Either a b -> c<br />
<br />
Scary type signature, but here are examples to help you grok it:<br />
<br />
*Main> :t either (+1) (length) <br />
either (+1) (length) :: Either Int [a] -> Int<br />
*Main> either (+1) (length) (Left 5)<br />
6<br />
*Main> either (+1) (length) (Right "foo")<br />
3<br />
*Main> <br />
<br />
It seems like this is exactly our case. Let's replace the<br />
<hask>case</hask> with an invocation of <hask>either</hask>:<br />
<br />
<haskell><br />
-- Taken from 'chapter5-3.hs'<br />
lookupAttr'' nm attrs = do<br />
attv <- lookup nm attrs<br />
either Just (dereference attrs) attv<br />
where<br />
dereference attrs refs = do <br />
vals <- sequence $ map (flip lookupAttr'' attrs) refs<br />
guard (not (null vals))<br />
return (concat (intersperse ":" vals))<br />
</haskell><br />
<br />
It keeps getting better and better :)<br />
<br />
Now, as a semi-exercise, try to understand the meaning of "sequence",<br />
"guard" and "flip" by looking at the following ghci sessions:<br />
<br />
*Main> :t sequence<br />
sequence :: (Monad m) => [m a] -> m [a]<br />
*Main> :t [Just 'a', Just 'b', Nothing, Just 'c']<br />
[Just 'a', Just 'b', Nothing, Just 'c'] :: [Maybe Char]<br />
*Main> :t sequence [Just 'a', Just 'b', Nothing, Just 'c']<br />
sequence [Just 'a', Just 'b', Nothing, Just 'c'] :: Maybe [Char]<br />
<br />
*Main> sequence [Just 'a', Just 'b', Nothing, Just 'c']<br />
Nothing<br />
*Main> sequence [Just 'a', Just 'b', Nothing]<br />
Nothing<br />
*Main> sequence [Just 'a', Just 'b']<br />
Just "ab"<br />
<br />
*Main> :t [putStrLn "a", putStrLn "b"]<br />
[putStrLn "a", putStrLn "b"] :: [IO ()]<br />
*Main> :t sequence [putStrLn "a", putStrLn "b"]<br />
sequence [putStrLn "a", putStrLn "b"] :: IO [()]<br />
*Main> sequence [putStrLn "a", putStrLn "b"]<br />
a<br />
b<br />
<br />
*Main> :t [putStrLn "a", fail "stop here", putStrLn "b"]<br />
[putStrLn "a", fail "stop here", putStrLn "b"] :: [IO ()]<br />
*Main> :t sequence [putStrLn "a", fail "stop here", putStrLn "b"]<br />
sequence [putStrLn "a", fail "stop here", putStrLn "b"] :: IO [()]<br />
*Main> sequence [putStrLn "a", fail "stop here", putStrLn "b"]<br />
a<br />
*** Exception: user error (stop here)<br />
<br />
Notice that for the <hask>Maybe</hask> monad, sequence continues execution<br />
until the first <hask>Nothing</hask>. The same behavior can be<br />
observed for the IO monad. Take into account that these different behaviors are<br />
not hardcoded into the definition of <hask>sequence</hask>!<br />
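Indeed, <hask>sequence</hask> is defined once, for any monad; the per-monad behavior comes entirely from that monad's bind hidden inside the <hask>do</hask>-block. A simplified sketch of the standard definition:<br />

```haskell
-- sequence', a simplified version of the Prelude's sequence:
-- run the actions left to right and collect their results.
-- For Maybe, the do-block's hidden (>>=) short-circuits on the
-- first Nothing; for IO, it just runs the actions in order.
sequence' :: Monad m => [m a] -> m [a]
sequence' []       = return []
sequence' (x : xs) = do v  <- x
                        vs <- sequence' xs
                        return (v : vs)
```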
<br />
Now, let's examine <hask>guard</hask>:<br />
<br />
*Main> let foo x = do v <- x; guard (v/=5); return (v+1) in map foo [Just 4, Just 5, Just 6] <br />
[Just 5,Nothing,Just 7]<br />
<br />
As you can see, it's just a simple way to "stop" execution at some<br />
condition.<br />
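For the <hask>Maybe</hask> monad the whole mechanism is tiny. Below is a simplified sketch of <hask>Control.Monad.guard</hask> specialized to <hask>Maybe</hask> (the real one works in any <hask>MonadPlus</hask>), together with the example from the ghci session above written against it:<br />

```haskell
-- guardMaybe succeeds with a dummy () on True and short-circuits
-- the rest of the do-block on False.
guardMaybe :: Bool -> Maybe ()
guardMaybe True  = Just ()
guardMaybe False = Nothing

-- The ghci example, restated with our sketch:
skipFives :: Maybe Int -> Maybe Int
skipFives x = do v <- x
                 guardMaybe (v /= 5)
                 return (v + 1)
```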
<br />
If you have been hooked on monads, I urge you to read "All About<br />
Monads" right now (link in Chapter 400).<br />
<br />
== Chapter 6: Where do you want to go tomorrow? ==<br />
<br />
As the name implies, the author is open for proposals - where should<br />
we go next? I had networking + xml/xmpp in mind, but it might be too<br />
heavy and too narrow for most of the readers.<br />
<br />
What do you think? Drop me a line.<br />
<br />
== Chapter 400: Monads up close ==<br />
<br />
Read [http://en.wikibooks.org/wiki/Haskell/Understanding_monads this wikibook chapter]. <br />
Then, read [http://horna.org.ua/books/All_About_Monads.pdf "All about monads"] (PDF).<br />
'Nuff said :)<br />
<br />
== Chapter 500: IO up close ==<br />
<br />
Shows that:<br />
<br />
<haskell><br />
c = do a <- someAction<br />
b <- someOtherAction<br />
print (bar b)<br />
print (foo a)<br />
print "done"<br />
</haskell><br />
<br />
is really just syntactic sugar for:<br />
<br />
<haskell><br />
c = someAction >>= \a -><br />
someOtherAction >>= \b -><br />
print (bar b) >><br />
print (foo a) >><br />
print "done"<br />
</haskell><br />
<br />
and explains about ">>=" and ">>". Oh wait. This was already explained<br />
in Chapter 400 :)<br />
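For completeness, <hask>(>>)</hask> itself is just <hask>(>>=)</hask> with the first result ignored; a simplified sketch of that standard relationship (<hask>andThen</hask> is our own name for it):<br />

```haskell
-- Run the first action, throw away its result, run the second.
-- This is how (>>) relates to (>>=) in any monad.
andThen :: Monad m => m a -> m b -> m b
andThen a b = a >>= \_ -> b
```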
<br />
== Chapter 9999: Installing Haskell Compiler/Interpreter and all necessary software ==<br />
<br />
There is plenty of material on this on the web and this wiki. Just go get<br />
yourself an installation of [[GHC]] (6.4 or above) or [[Hugs]] (v200311 or<br />
above) and "[[darcs]]", which we will use for version control.<br />
<br />
== Chapter 10000: Thanks! ==<br />
<br />
Thanks for comments, proofreading, good advice and kind words go to:<br />
Helge, alt, dottedmag, Paul Moore, Ben Rudiak-Gould, Jim Wilkinson,<br />
Andrew Zhdanov (avalez), Martin Percossi, SpellingNazi, Davor<br />
Cubranic, Brett Giles, Stdrange, Brian Chrisman, Nathan Collins,<br />
Anastasia Gornostaeva (ermine), Remi, Ptolomy, Zimbatm,<br />
HenkJanVanTuyl, Miguel, Mforbes, Kartik Agaram, Jake Luck, Ketil<br />
Malde, Mike Mimic, Jens Kubieziel.<br />
<br />
If I should have mentioned YOU and forgot - tell me so.<br />
<br />
Without you I would have stopped after Chapter 1 :)<br />
<br />
Languages: [[Haskellへのヒッチハイカーガイド|jp]], [[Es/Guía de Haskell para autoestopistas|es]]</div>Imzhttps://wiki.haskell.org/index.php?title=Hitchhikers_guide_to_Haskell&diff=39264Hitchhikers guide to Haskell2011-03-30T15:56:11Z<p>Imz: /* Chapter 1: Ubiquitous "Hello world!" and other ways to do IO in Haskell */ The pseudocode as human-language sentences: this way it's more obvious each of the 3 lines is an item of our plan, and the plan has an imperative spirit for now.</p>
<hr />
<div>== Preface: DON'T PANIC! ==<br />
[[Category:Tutorials]]<br />
Recent experiences from a few of my fellow C++/Java programmers<br />
indicate that they read various Haskell tutorials with "exponential<br />
speedup" (think about how TCP/IP session starts up). They start slow<br />
and cautious, but when they see that the first 3-5 pages do not<br />
contain "anything interesting" in terms of code and examples, they<br />
begin skipping paragraphs, then chapters, then whole pages, only to<br />
slow down - often to a complete halt - somewhere on page 50, finding<br />
themselves in the thick of concepts like "type classes", "type<br />
constructors", "monadic IO", at which point they usually panic, think<br />
of a perfectly rational excuse not to read further anymore, and<br />
happily forget this sad and scary encounter with Haskell (as human<br />
beings usually tend to forget sad and scary things).<br />
<br />
This text intends to introduce the reader to the practical aspects of Haskell<br />
from the very beginning (plans for the first chapters include: I/O, darcs,<br />
Parsec, QuickCheck, profiling and debugging, to mention a few). The reader<br />
is expected to know (where to find) at least the basics of Haskell: how to run<br />
"hugs" or "ghci", '''that layout is 2-dimensional''', etc. Other than that, we do<br />
not plan to take radical leaps, and will go one step at a time in order not to<br />
lose the reader along the way. So DON'T PANIC, take your towel with you and<br />
read along.<br />
<br />
'''In case you've skipped over the previous paragraph''', I would like<br />
to stress once again that Haskell is sensitive to indentation and<br />
spacing, so pay attention to that when cutting and pasting or manually<br />
aligning code in a text editor with proportional fonts.<br />
<br />
Oh, almost forgot: the author is very interested in ANY feedback. Drop him a line<br />
or a word (see [[User:Adept|Adept]] for contact info) or submit<br />
patches to the tutorial via darcs (<br />
[http://adept.linux.kiev.ua:8080/repos/hhgtth/ repository is here]) or directly to this<br />
Wiki. <br />
<br />
== Chapter 1: Ubiquitous "Hello world!" and other ways to do IO in Haskell ==<br />
<br />
Each chapter will be dedicated to one small real-life task which we will<br />
complete from the ground up.<br />
<br />
So here is the task for this chapter: in order to free up space on<br />
your hard drive for all the Haskell code you are going to write in the<br />
near future, you are going to archive some of the old and dusty<br />
information on CDs and DVDs. While CD (or DVD) burning itself is easy<br />
these days, it usually takes some (or quite a lot of) time to decide<br />
how to put several GB of digital photos on CD-Rs, when directories<br />
with images range from 10 to 300 Mb's in size, and you don't want to<br />
burn half-full (or half-empty) CD-Rs.<br />
<br />
So, the task is to write a program which will help us put a given<br />
collection of directories on the minimum possible amount of media,<br />
while packing the media as tightly as possible. Let's name this program<br />
"cd-fit".<br />
<br />
Oh. Wait. Let's do the usual "hello world" thing, before we forget about it,<br />
and then move on to more interesting things:<br />
<br />
<haskell><br />
-- Taken from 'hello.hs'<br />
-- From now on, a comment at the beginning of the code snippet<br />
-- will specify the file which contain the full program from<br />
-- which the snippet is taken. You can get the code from the darcs<br />
-- repository "http://adept.linux.kiev.ua:8080/repos/hhgtth" by issuing<br />
-- command "darcs get http://adept.linux.kiev.ua:8080/repos/hhgtth"<br />
module Main where<br />
main = putStrLn "Hello world!"<br />
</haskell><br />
<br />
Run it:<br />
<br />
$ runhaskell ./hello.hs<br />
Hello world!<br />
<br />
OK, we've done it. Move along now, nothing interesting here :)<br />
<br />
Any serious development must be done with the help of a version control<br />
system, and we will not make an exception. We will use the modern<br />
distributed version control system "darcs". "Modern" means that it is<br />
written in Haskell, "distributed" means that each working copy is<br />
a repository in itself.<br />
<br />
First, let's create an empty directory for all our code, and invoke<br />
"darcs init" there, which will create subdirectory "_darcs" to store<br />
all version-control-related stuff there.<br />
<br />
Fire up your favorite editor and create a new file called "cd-fit.hs"<br />
in our working directory. Now let's think for a moment about how our<br />
program will operate and express it in pseudocode:<br />
<br />
<haskell><br />
main = Read list of directories and their sizes.<br />
Decide how to fit them on CD-Rs.<br />
Print solution.<br />
</haskell><br />
<br />
Sounds reasonable? I thought so.<br />
<br />
Let's simplify our life a little and assume for now that we will<br />
compute directory sizes somewhere outside our program (for example,<br />
with "du -sb *") and read this information from stdin.<br />
Now let me convert all this to Haskell:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-1-1.hs'<br />
module Main where<br />
<br />
main = do input <- getContents<br />
putStrLn ("DEBUG: got input " ++ input)<br />
-- compute solution and print it<br />
</haskell><br />
<br />
Not really working, but pretty close to plain English, eh? Let's stop<br />
for a moment and look more closely at what's written here, line by line.<br />
<br />
Let's begin from the top:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-1-1.hs'<br />
input <- getContents<br />
</haskell><br />
<br />
This is an example of the Haskell syntax for doing IO (namely, input). This<br />
line is an instruction to read all the information available from the stdin,<br />
return it as a single string, and bind it to the symbol "input", so we can<br />
process this string any way we want.<br />
<br />
How did I know that? Did I memorize all the functions by heart? Of course not!<br />
Each function has a type, which, along with the function's name, usually tells a<br />
lot about what the function will do.<br />
<br />
Let's fire up an interactive Haskell environment and examine this function<br />
up close:<br />
<br />
$ ghci<br />
___ ___ _<br />
/ _ \ /\ /\/ __(_)<br />
/ /_\// /_/ / / | | GHC Interactive, version 6.4.1, for Haskell 98.<br />
/ /_\\/ __ / /___| | http://www.haskell.org/ghc/<br />
\____/\/ /_/\____/|_| Type :? for help.<br />
<br />
Loading package base-1.0 ... linking ... done.<br />
Prelude> :type getContents<br />
getContents :: IO String<br />
Prelude> <br />
<br />
We see that "getContents" is a function without arguments that will return<br />
an "IO String". The prefix "IO" means that this is an IO action which will<br />
return a String when evaluated. The action will be evaluated as soon as we use "<-" to<br />
bind its result to some symbol.<br />
<br />
Note that "<-" is not a fancy way to assign a value to a variable. It is a way to<br />
evaluate (execute) IO actions; in other words, to actually do some I/O and<br />
return its result (if any). <br />
<br />
We can choose not to evaluate the action obtained from "getContents", but rather carry it around a bit and evaluate later:<br />
<br />
<haskell><br />
let x = getContents<br />
-- 300 lines of code here<br />
input <- x<br />
</haskell><br />
<br />
So, as you see, IO actions can act like ordinary values. Suppose that we<br />
have built a list of IO actions and have found a way to execute them one by one.<br />
This would be a way to simulate imperative programming with its notion of<br />
"order of execution".<br />
<br />
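In fact, that "execute them one by one" combinator already exists in the standard library as <hask>sequence_</hask> (a small demonstration; the messages simply echo our pseudocode plan):<br />

```haskell
-- A list of IO actions, built as ordinary values first...
steps :: [IO ()]
steps = [ putStrLn "Read list of directories and their sizes."
        , putStrLn "Decide how to fit them on CD-Rs."
        , putStrLn "Print solution." ]

-- ...and executed in order later; sequence_ runs each action
-- in the list and discards the results.
runSteps :: IO ()
runSteps = sequence_ steps
```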
Haskell allows you to do better than that. <br />
<br />
The standard language library (named "Prelude", by the way) provides<br />
us with lots of functions that return useful primitive IO actions. In<br />
order to combine them to produce even more complex actions, we use "do":<br />
<br />
<haskell><br />
c = do a <- someAction<br />
b <- someOtherAction<br />
print (bar b)<br />
print (foo a)<br />
putStrLn "done"<br />
</haskell><br />
<br />
Here we '''bind''' "c" to an action with the following "scenario":<br />
* '''evaluate''' action "someAction" and '''bind''' its result to "a"<br />
* then, '''evaluate''' "someOtherAction" and '''bind''' its result to "b"<br />
* then, process "b" with function "bar" and print result<br />
* then, process "a" with function "foo" and print result<br />
* then, print the word "done"<br />
<br />
When will all this actually be executed? Answer: as soon as we evaluate "c"
using "<-" (if it returns a result, as "getContents" does) or just
by using its name as a statement on its own (if it does not return a result,
as "print" does):
<br />
<haskell><br />
process = do putStrLn "Will do some processing"
             c
             putStrLn "Done"
</haskell><br />
<br />
Notice that we took a bunch of actions ("someAction", "someOtherAction",
"print", "putStrLn") and, using "do", created from them a new action, which we
bound to the symbol "c". Now we can use "c" as a building block to produce an even
more complex action, "process", and we can carry this on and on.
Eventually, some of these actions will be mentioned in the code of the function
"main", the topmost IO action to which every Haskell program is bound.
<br />
When will the "main" be executed/evaluated/forced? As soon as we run the<br />
program. Read this twice and try to comprehend: <br />
<br />
''The execution of a Haskell program is an evaluation of the symbol "main" to<br />
which we have bound an IO action. Via evaluation we obtain the result of that<br />
action''. <br />
<br />
Readers familiar with advanced C++ or Java programming and that arcane body of<br />
knowledge named "OOP Design Patterns" might note that "build actions from<br />
actions" and "evaluate actions to get result" is essentially a "Command<br />
pattern" and "Composition pattern" combined. Good news: in Haskell you get them<br />
for all your IO, and get them '''for free''' :)<br />
<br />
----<br />
'''Exercise:'''<br />
Consider the following code:<br />
<br />
<haskell><br />
-- Taken from 'exercise-1-1.hs'<br />
module Main where<br />
c = putStrLn "C!"<br />
<br />
combine before after =
  do before
     putStrLn "In the middle"
     after

main = do combine c c
          let b = combine (putStrLn "Hello!") (putStrLn "Bye!")
          let d = combine (b) (combine c c)
          putStrLn "So long!"
</haskell><br />
<br />
Notice how we carefully indent lines so that the source looks neat?
Actually, Haskell code has to be aligned this way, or it will not
compile. If you use tabs to indent your sources, take into
account that Haskell compilers assume that a tabstop is 8 characters
wide.
<br />
Often people complain that it is very difficult to write Haskell
because it requires them to align code. Actually, this is not true. If
you align your code, the compiler will infer the beginnings and endings of
syntactic blocks. However, if you don't want to indent your code, you
can explicitly mark the end of each and every expression and use
arbitrary layout, as in this example:
<haskell><br />
-- Taken from 'exercise-1-2.hs'<br />
combine before after =
  do { before;
       putStrLn "In the middle";
       after; }

main =
  do { combine c c; let { b = combine (putStrLn "Hello!") (putStrLn "Bye!")};
       let {d = combine (b) (combine c c)};
       putStrLn "So long!" }
</haskell><br />
<br />
Back to the exercise - see how we construct code out of thin air? Try<br />
to imagine what this code will do, then run it and check yourself.<br />
<br />
Do you understand why "Hello!" and "Bye!" are not printed?<br />
----<br />
<br />
Let's examine our "main" function closer:<br />
<br />
 Prelude> :load cd-fit.hs
 Compiling Main             ( ./cd-fit.hs, interpreted )
 Ok, modules loaded: Main.
 *Main> :type main
 main :: IO ()
 *Main>
<br />
We see that "main" is indeed an IO action which will return nothing<br />
when evaluated. When combining actions with "do", the type of the<br />
result will be the type of the last action, and "putStrLn something" has type<br />
"IO ()": <br />
<br />
 *Main> :type putStrLn "Hello world!"
 putStrLn "Hello world!" :: IO ()
 *Main>
<br />
Oh, by the way: have you noticed that we actually compiled our first<br />
Haskell program in order to examine "main"? :)<br />
<br />
Let's celebrate that by putting it under version control: execute
"darcs add cd-fit.hs" and "darcs record", answer "y" to all questions
and provide the commit comment "Skeleton of cd-fit.hs".
<br />
Let's try to run it:<br />
<br />
 $ echo "foo" | runhaskell cd-fit.hs
 DEBUG: got input foo
<br />
----<br />
'''Exercises''':<br />
<br />
* Try to write a program that takes your name from the stdin and greets you (keywords: getLine, putStrLn);<br />
<br />
* Try to write a program that asks for you name, reads it, greets you, asks for your favorite color, and prints it back (keywords: getLine, putStrLn).<br />
<br />
== Chapter 2: Parsing the input ==<br />
<br />
OK, now that we have a proper understanding of the powers of Haskell IO
(and are awed by them, I hope), let's forget about IO and actually do
some useful work.
<br />
As you remember, we set out to pack some CD-Rs as tightly as
possible with data scattered across several input directories. We assume
that "du -sb" will compute the sizes of the input directories and output
something like:
<br />
 65572   /home/adept/photos/raw-to-burn/dir1
 68268   /home/adept/photos/raw-to-burn/dir2
 53372   /home/adept/photos/raw-to-burn/dir3
 713124  /home/adept/photos/raw-to-burn/dir4
 437952  /home/adept/photos/raw-to-burn/dir5
<br />
Our next task is to parse that input into some suitable internal<br />
representation.<br />
<br />
For that we will use a powerful library of '''parsing combinators''' named
"[[Parsec]]", which ships with most Haskell implementations.
<br />
Much like the IO facilities we have seen in the first chapter, this
library provides a set of basic parsers and means to combine them into
more complex parsing constructs.
<br />
Unlike other tools in this area (lex/yacc or JavaCC, to name a few),
[[Parsec]] parsers do not require a separate preprocessing stage. Since in
Haskell we can return a function as the result of another function and thus
construct functions out of thin air, there is no need for a separate
syntax for parser descriptions. But enough advertisements, let's actually
do some parsing:
<br />
<haskell><br />
-- Taken from 'cd-fit-2-1.hs'<br />
import Text.ParserCombinators.Parsec<br />
<br />
-- parseInput parses the output of "du -sb", which consists of many lines,
-- each of which describes a single directory
parseInput =
  do dirs <- many dirAndSize
     eof
     return dirs
<br />
-- The datatype Dir holds information about a single directory - its size and name
data Dir = Dir Int String deriving Show

-- `dirAndSize` parses information about a single directory, which is:
-- a size in bytes (a number), some spaces, then the directory name, which extends to the newline
dirAndSize =
  do size <- many1 digit
     spaces
     dir_name <- anyChar `manyTill` newline
     return (Dir (read size) dir_name)
</haskell><br />
<br />
Just add those lines to "cd-fit.hs", between the declaration of <br />
the Main module and the definition of main.<br />
<br />
Here we see quite a lot of new
things, and several that we know already.
First of all, note the familiar "do" construct, which, as we know, is
used to combine IO actions to produce new IO actions. Here we use it
to combine "parsing" actions into new "parsing" actions. Does this
mean that "parsing" implies "doing IO"? Not at all. The thing is, I must
admit that I lied to you - "do" is used not only to combine IO
actions. "Do" is used to combine any kind of so-called ''monadic
actions'' or ''monadic values'' together.
<br />
Think about [[monad]] as a "[[:Category:Idioms|design pattern]]" in the functional world.<br />
[[Monad]] is a way to hide from the user (programmer) all the machinery<br />
required for complex functionality to operate.<br />
<br />
As you might have heard, Haskell has no notion of "assignment",
"mutable state" or "variables", and is a "pure functional language",
which means that every function called with the same input parameters
will return exactly the same result. Meanwhile, "doing IO" requires
hauling around file handles and their states and dealing with IO
errors. "Parsing" requires tracking the position in the input and dealing
with parsing errors.
<br />
In both cases the Wise Men Who Wrote Libraries cared for our needs and
hid all the underlying complexity from us, exposing the [http://en.wikipedia.org/wiki/Application_programming_interface API] of their
libraries (IO and parsing) in the form of "monadic actions" which we
are free to combine as we see fit.
<br />
Think of programming with monads as doing remodelling with the
help of a professional remodelling crew. You describe a sequence of
actions on a piece of paper (that's us writing in "do" notation),
and then, when required, that sequence will be carried out by the
remodelling crew ("in the monad"), which will provide you with the end
result, hiding all the underlying complexity (how to prepare the
paint, which nails to choose, etc.) from you.
<br />
Let's use the interactive Haskell environment to decipher all the
instructions we've written for the parsing library. As usual, we'll
go top-down:
<br />
 *Main> :reload
 Ok, modules loaded: Main.
 *Main> :t parseInput
 parseInput :: GenParser Char st [Dir]
 *Main> :t dirAndSize
 dirAndSize :: GenParser Char st Dir
 *Main>
<br />
Assuming (well, take my word for it) that "GenParser Char st" is our
parsing monad, we can see that "parseInput", when evaluated, will
produce a list of "Dir", and "dirAndSize", when evaluated, will
produce a "Dir". Assuming that "Dir" somehow represents information
about a single directory, that is pretty much what we wanted, isn't it?
<br />
Let's see what "Dir" means. We defined the ''data[[type]]'' Dir as a record
which holds an Int and a String:
<br />
<haskell><br />
-- Taken from 'cd-fit-2-1.hs'<br />
data Dir = Dir Int String deriving Show<br />
</haskell><br />
<br />
In order to construct such records, we must use the ''data [[constructor]]''
Dir:
<br />
 *Main> :t Dir 1 "foo"
 Dir 1 "foo" :: Dir
<br />
In order to reduce confusion for newbies, we could have written:<br />
<haskell><br />
data Dir = D Int String deriving Show<br />
</haskell><br />
<br />
, which would define the ''data[[type]]'' "Dir" with the ''data [[constructor]]'' "D".
However, traditionally the name of the data[[type]] and its [[constructor]] are
chosen to be the same.
<br />
The clause "[[deriving]] Show" instructs the compiler to generate enough code "behind
the curtains" to make this ''datatype'' conform to the interface of
the ''type [[class]]'' Show. We will explain ''type [[class]]es'' later; for
now let's just say that this allows us to "print" values of
type "Dir".
<br />
'''Exercises:''' <br />
* examine the types of "digit", "anyChar", "many", "many1" and "manyTill" to see how they are used to build more complex parsers from simple ones.

* compare the types of "manyTill", "manyTill anyChar" and "manyTill anyChar newline". Note that "anyChar `manyTill` newline" is just alternative syntax (backticks let us apply a function infix). Note also that when a function is supplied with fewer arguments than it actually needs, we get not a value, but a new function; this is called ''partial application''.
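Partial application is easy to see in a plain toy example of our own (nothing to do with Parsec; "add3" and friends are made-up names):

<haskell>
-- A function of three arguments
add3 :: Int -> Int -> Int -> Int
add3 x y z = x + y + z

-- Supplying only one argument yields a new function of two arguments
addTen :: Int -> Int -> Int
addTen = add3 10

-- Supplying one more yields a function of a single argument
addFifteen :: Int -> Int
addFifteen = addTen 5

main :: IO ()
main = print (addFifteen 7)   -- 10 + 5 + 7
</haskell>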
<br />
<br />
OK. So, we combined a lot of primitive parsing actions to get ourselves a
parser for the output of "du -sb". How can we actually parse something? The [[Parsec]] library supplies us with the function "parse":
<br />
 *Main> :t parse
 parse :: GenParser tok () a
          -> SourceName
          -> [tok]
          -> Either ParseError a
 *Main> :t parse parseInput
 parse parseInput :: SourceName -> [Char] -> Either ParseError [Dir]
 *Main>
<br />
At first the [[type]] might be a bit cryptic, but once we supply "parse" with the parser we made, the compiler gets more information and presents us with a more concise [[type]].<br />
<br />
Stop and consider this for a moment. The compiler figured out type of the function without a single type annotation supplied by us! Imagine if a Java compiler deduced types for you, and you wouldn't have to specify types of arguments and return values of methods, ever.<br />
<br />
OK, back to the code. We can observe that "parse" is a function which,
given a parser, a name of the source file or channel (e.g. "stdin"), and
source data (String, which is a list of "Char"s, written "[Char]"),
will either produce a parse error, or parse us a list of "Dir".
<br />
The datatype "Either" is an example of a datatype whose constructors have names
different from the name of the datatype itself. In fact, "Either" has two constructors:
<br />
<haskell><br />
data Either a b = Left a | Right b<br />
</haskell><br />
<br />
In order to better understand what this means, consider the following
example:
<br />
 *Main> :t Left 'a'
 Left 'a' :: Either Char b
 *Main> :t Right "aaa"
 Right "aaa" :: Either a [Char]
 *Main>
<br />
You see that "Either" is a ''union'' (much like the C/C++ "union") which can
hold a value of one of two distinct types. However, unlike a C/C++ "union",
when presented with a value of type "Either Int Char" we can immediately see
whether it is an Int or a Char - by looking at the constructor which was used to
produce the value. Such datatypes are called "tagged unions", and they are
another [[:Category:Idioms|power tool]] in the Haskell toolset.
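Here is a sketch of how such a tagged union is consumed - by pattern matching on the constructors (the function "describe" below is our own invention, not part of cd-fit.hs):

<haskell>
-- Pattern matching on the constructors tells us which "branch" of the
-- union we are holding, and gives us access to the value inside
describe :: Either Int Char -> String
describe (Left n)  = "an Int: " ++ show n
describe (Right c) = "a Char: " ++ [c]

main :: IO ()
main = do putStrLn (describe (Left 42))
          putStrLn (describe (Right 'x'))
</haskell>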
<br />
Did you also notice that we provide "parse" with a parser, which is a monadic
value, but receive not a new monadic value, but a parsing result? That is
because "parse" is an evaluator for the "Parser" monad, much like the [[GHC]] or [[Hugs]] runtime is an evaluator for the IO monad. The function "parse" implements all the monadic machinery: it tracks errors and positions in the input, implements backtracking and lookahead, etc.
<br />
Let's extend our "main" function to use "parse" to actually parse the input
and show us the parsed data structures:
<br />
<haskell><br />
-- Taken from 'cd-fit-2-1.hs'<br />
main = do input <- getContents
          putStrLn ("DEBUG: got input " ++ input)
          let dirs = case parse parseInput "stdin" input of
                          Left err -> error $ "Input:\n" ++ show input ++
                                              "\nError:\n" ++ show err
                          Right result -> result
          putStrLn "DEBUG: parsed:"; print dirs
</haskell><br />
<br />
'''Exercise:'''<br />
<br />
* In order to understand this snippet of code better, examine (with ghci or hugs) the difference between 'drop 1 ( drop 1 ( drop 1 ( drop 1 ( drop 1 "foobar" ))))' and 'drop 1 $ drop 1 $ drop 1 $ drop 1 $ drop 1 "foobar"'. Examine type of ($).<br />
* Try putStrLn "aaa" and print "aaa" and see the difference, examine their types.<br />
* Try print (Dir 1 "foo") and putStrLn (Dir 1 "foo"). Examine types of print and putStrLn to understand the behavior in both cases.<br />
<br />
Let's try to run what we have so far:<br />
<br />
 $ du -sb * | runhaskell ./cd-fit.hs
 
 DEBUG: got input 22325   Article.txt
 18928   Article.txt~
 1706    cd-fit.hs
 964     cd-fit.hs~
 61609   _darcs
 
 DEBUG: parsed:
 [Dir 22325 "Article.txt",Dir 18928 "Article.txt~",
  Dir 1706 "cd-fit.hs",Dir 964 "cd-fit.hs~",Dir 61609 "_darcs"]
<br />
Seems to be doing exactly as planned. Now let's try some erroneous<br />
input:<br />
<br />
 $ echo "foo" | runhaskell cd-fit.hs
 DEBUG: got input foo
 
 DEBUG: parsed:
 *** Exception: Input:
 "foo\n"
 Error:
 "stdin" (line 1, column 1):
 unexpected "f"
 expecting digit or end of input
<br />
Seems to be doing fine. <br />
<br />
If you followed the advice to put your code under version control, you
can now use "darcs whatsnew" or "darcs diff -u" to examine your
changes since the previous version. Use "darcs record" to commit them. As
an exercise, first record the changes "outside" of the function "main" and
then record the changes in "main". Do "darcs changes" to examine the
list of changes you've recorded so far.
<br />
== Chapter 3: Packing the knapsack and testing it with class, too (and don't forget your towel!) ==<br />
<br />
Enough preliminaries already. Let's go pack some CDs.
<br />
As you might already have recognized, our problem is a classical one. It is
called the "knapsack problem" ([http://www.google.com/search?q=knapsack+problem google it], if you don't already know what it
is. There are more than 100000 links).
<br />
Let's start with the greedy solution, but first let's slightly modify our "Dir"
datatype to allow easy extraction of its components:
<br />
<haskell><br />
-- Taken from 'cd-fit-3-1.hs'<br />
data Dir = Dir {dir_size::Int, dir_name::String} deriving Show<br />
</haskell><br />
<br />
----<br />
'''Exercise:''' examine types of "Dir", "dir_size" and "dir_name"<br />
----<br />
<br />
From now on, we can use "dir_size d" to get the size of a directory, and
"dir_name d" to get its name, provided that "d" is of type "Dir".
<br />
The greedy algorithm sorts directories from the biggest down and tries to put
them on the CD one by one, until there is no room for more. We will need to track
which directories we have added to the CD, so let's add another datatype and code this
simple packing algorithm:
<br />
<haskell><br />
-- Taken from 'cd-fit-3-1.hs'<br />
import Data.List (sortBy)<br />
<br />
-- DirPack holds a set of directories which are to be stored on single CD.<br />
-- 'pack_size' could be calculated, but we will store it separately to reduce<br />
-- amount of calculation<br />
data DirPack = DirPack {pack_size::Int, dirs::[Dir]} deriving Show<br />
<br />
-- For simplicity, let's assume that we deal with standard 700 Mb CDs for now<br />
media_size = 700*1024*1024<br />
<br />
-- Greedy packer tries to add directories one by one to initially empty 'DirPack'<br />
greedy_pack dirs = foldl maybe_add_dir (DirPack 0 []) $ sortBy cmpSize dirs
    where
    -- sort biggest-first, so that the packer considers large
    -- directories before small ones, as described above
    cmpSize d1 d2 = compare (dir_size d2) (dir_size d1)
<br />
-- Helper function, which only adds directory "d" to the pack "p" when new<br />
-- total size does not exceed media_size<br />
maybe_add_dir p d =
   let new_size = pack_size p + dir_size d
       new_dirs = d:(dirs p)
   in if new_size > media_size then p else DirPack new_size new_dirs
</haskell><br />
<br />
----<br />
I'll highlight the areas which you could explore on your own (using other nice<br />
tutorials out there, of which I especially recommend "Yet Another Haskell<br />
Tutorial" by Hal Daume):<br />
* We choose to import a single function "sortBy" from the module [[Data.List]], not the whole thing.
* Instead of coding a case-by-case recursive definition of "greedy_pack", we go with a higher-order approach, choosing "foldl" as a vehicle for list traversal. Examine its type. Other useful functions from the same category are "map", "foldr", "scanl" and "scanr". Look them up!
* To sort a list of "Dir" by size only, we use a custom comparison function and the parameterized sort "sortBy". This sort of setup, where the user may provide a custom "modifier" for a generic library function, is quite common: look up "deleteBy", "deleteFirstsBy", "groupBy", "insertBy", "intersectBy", "maximumBy", "minimumBy", "sortBy", "unionBy".
* To code the quite complex function "maybe_add_dir", we introduced several '''local definitions''' in the "let" clause, which we can reuse within the function body. We used a "where" clause in the "greedy_pack" function to achieve the same effect. Read about "let" and "where" clauses and the differences between them.
* Note that in order to construct a new value of type "DirPack" (in the function "maybe_add_dir") we haven't used the helper accessor functions "pack_size" and "dirs"
----<br />
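To see "foldl" and "sortBy" in isolation, here is a toy sketch (plain Ints instead of "Dir"s; the names "sumAll" and "descending" are made up for this example):

<haskell>
import Data.List (sortBy)

-- foldl threads an accumulator through the list from the left:
-- foldl f z [a,b,c] is f (f (f z a) b) c
sumAll :: [Int] -> Int
sumAll = foldl (+) 0

-- sortBy takes a custom comparison function; swapping the arguments
-- of compare gives us descending order
descending :: [Int] -> [Int]
descending = sortBy (\x y -> compare y x)

main :: IO ()
main = do print (sumAll [1,2,3,4])        -- 10
          print (descending [3,1,4,1,5])  -- [5,4,3,1,1]
</haskell>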
<br />
In order to actually use our greedy packer we must call it from our "main"
function, so let's add these lines:
<br />
<haskell><br />
-- Taken from 'cd-fit-3-1.hs'<br />
main = do ...
          -- compute solution and print it
          putStrLn "Solution:" ; print (greedy_pack dirs)
</haskell><br />
<br />
Verify the integrity of our definitions by (re)loading our code in ghci. Compiles?
Thought so :) Now do "darcs record" and add a sensible commit message.
<br />
Now it is time to test our creation. We could do it by actually running it in<br />
the wild like this:<br />
<br />
 $ du -sb ~/DOWNLOADS/* | runhaskell ./cd-fit.hs
<br />
This proves that our code seems to be working. At least, this once. How
about establishing with a reasonable degree of certainty that our code, in parts
and as a whole, works properly, and doing so in a re-usable manner? In other
words, how about writing some tests?
<br />
Java programmers used to JUnit probably thought about screens of boiler-plate<br />
code and hand-coded method invocations. Never fear, we will not do anything as<br />
silly :)<br />
<br />
Enter '''[[QuickCheck]]'''.<br />
<br />
[[QuickCheck]] is a tool for automated testing of your functions using
(semi-)random input data. In the spirit of "100 bytes of code examples are worth 1kb of
praise", let's show the code for testing the following ''property'': an attempt to pack the directories returned by "greedy_pack" should return a "DirPack" of exactly the same pack:
<br />
<haskell><br />
-- Taken from 'cd-fit-3-2.hs'<br />
import Test.QuickCheck<br />
import Control.Monad (liftM2, replicateM)<br />
<br />
-- We must teach QuickCheck how to generate arbitrary "Dir"s<br />
instance Arbitrary Dir where
    -- Let's just skip "coarbitrary" for now, ok?
    -- I promise, we will get back to it later :)
    coarbitrary = undefined
    -- We generate an arbitrary "Dir" by generating a random size and a random name
    -- and stuffing them inside "Dir"
    arbitrary = liftM2 Dir gen_size gen_name
        -- Generate a random size between 10 and 1400 Mb
        where gen_size = do s <- choose (10,1400)
                            return (s*1024*1024)
              -- Generate a random name 1 to 300 chars long, consisting of the symbols "fubar/"
              gen_name = do n <- choose (1,300)
                            replicateM n (elements "fubar/")

-- For convenience and by tradition, all QuickCheck tests begin with the prefix "prop_".
-- Assume that "ds" will be a random list of "Dir"s and code your test.
prop_greedy_pack_is_fixpoint ds =
    let pack = greedy_pack ds
    in pack_size pack == pack_size (greedy_pack (dirs pack))
</haskell><br />
<br />
Let's run the test, after which I'll explain how it all works:
<br />
 Prelude> :r
 Compiling Main             ( ./cd-fit.hs, interpreted )
 Ok, modules loaded: Main.
 *Main> quickCheck prop_greedy_pack_is_fixpoint
 [numbers spinning]
 OK, passed 100 tests.
 *Main>
<br />
We've just seen our "greedy_pack" run on 100 completely (well, almost
completely) random lists of "Dir"s, and it seems that the property indeed holds.
<br />
Let's dissect the code. The most intriguing part is "instance Arbitrary Dir
where", which declares that "Dir" is an '''[[instance]]''' of the '''type[[class]]''' "Arbitrary". Whoa, that's a whole lot of unknown words! :) Let's slow down a
bit.
<br />
What is a '''type[[class]]'''? A typeclass is the Haskell way of dealing with the
following situation: suppose that you are writing a library of useful
functions and you don't know in advance how exactly they will be used, so you
want to make them generic. Now, on one hand you don't want to restrict your
users to a certain type (e.g. String). On the other hand, you want to enforce
the convention that arguments to your functions must satisfy a certain set of
constraints. That is where a '''typeclass''' comes in handy.
<br />
Think of a typeclass as a '''contract''' (or "interface", in Java terms) that
your type must fulfill in order to be admitted as an argument to certain
functions.
<br />
Let's examine the typeclass "Arbitrary":<br />
<br />
 *Main> :i Arbitrary
 class Arbitrary a where
   arbitrary :: Gen a
   coarbitrary :: a -> Gen b -> Gen b
         -- Imported from Test.QuickCheck
 instance Arbitrary Dir
         -- Defined at ./cd-fit.hs:61:0
 instance Arbitrary Bool         -- Imported from Test.QuickCheck
 instance Arbitrary Double       -- Imported from Test.QuickCheck
 instance Arbitrary Float        -- Imported from Test.QuickCheck
 instance Arbitrary Int          -- Imported from Test.QuickCheck
 instance Arbitrary Integer      -- Imported from Test.QuickCheck
 -- rest skipped --
<br />
It can be read this way: "Any [[type]] (let's name it 'a') can be a member of the [[class]] Arbitrary as soon as we define two functions for it - "arbitrary" and "coarbitrary" - with the signatures shown. For the types Dir, Bool, Double, Float, Int and Integer such definitions were provided, so all those types are instances of the class Arbitrary".
<br />
Now, if you write a function which operates on its arguments solely by means<br />
of "arbitrary" and "coarbitrary", you can be sure that this function will work<br />
on any type which is an instance of "Arbitrary"!<br />
<br />
Let's say it again. Someone (maybe even you) writes the code (API or library)
which requires that input values implement a certain ''interface'',
described in terms of functions. Once you show how your type implements this
''interface'', you are free to use the API or library.
<br />
Consider the function "sort" from the standard library:
<br />
 *Main> :t Data.List.sort
 Data.List.sort :: (Ord a) => [a] -> [a]
<br />
We see that it sorts lists of any values which are instance of typeclass<br />
"Ord". Let's examine that class:<br />
<br />
 *Main> :i Ord
 class Eq a => Ord a where
   compare :: a -> a -> Ordering
   (<) :: a -> a -> Bool
   (>=) :: a -> a -> Bool
   (>) :: a -> a -> Bool
   (<=) :: a -> a -> Bool
   max :: a -> a -> a
   min :: a -> a -> a
 -- skip
 instance Ord Double     -- Imported from GHC.Float
 instance Ord Float      -- Imported from GHC.Float
 instance Ord Bool       -- Imported from GHC.Base
 instance Ord Char       -- Imported from GHC.Base
 instance Ord Integer    -- Imported from GHC.Num
 instance Ord Int        -- Imported from GHC.Base
 -- skip
 *Main>
<br />
We see a couple of interesting things: first, there is an additional
requirement listed: in order to be an instance of "Ord", a type must first be an
instance of the typeclass "Eq". Then, we see that there is an awful lot of
functions to define in order to be an instance of "Ord". Wait a second, isn't
it silly to define both (<) and (>) when one can be expressed via the other?
<br />
Right you are! Usually, a typeclass contains several "default" implementations
for its functions, when it is possible to express them through each other (as
it is with "Ord"). In this case it is possible to supply only a minimal
definition (which in the case of "Ord" consists of any single function) and the
others will be derived automatically. If you supply fewer functions than are
required for the minimal implementation, the compiler/interpreter will say so
and explain which functions you still have to define.
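Here is a sketch of such a minimal definition (with a made-up type "Size", not our "Dir"): we define only "compare", and the other methods of "Ord" come from the default implementations:

<haskell>
data Size = Size Int deriving (Show, Eq)

-- A minimal Ord instance: compare alone is enough;
-- (<), (<=), max, min etc. are derived from it automatically
instance Ord Size where
    compare (Size a) (Size b) = compare a b

main :: IO ()
main = do print (Size 3 < Size 5)        -- uses the default (<)
          print (max (Size 3) (Size 5))  -- uses the default max
</haskell>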
<br />
Once again, we see that a lot of [[type]]s are already instances of typeclass Ord, and thus we are able to sort them.<br />
<br />
Now, let's take a look back to the definition of "Dir":<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-2.hs'<br />
data Dir = Dir {dir_size::Int, dir_name::String} deriving Show<br />
</haskell><br />
<br />
See that "[[deriving]]" clause? It instructs the compiler to automatically derive code to make "Dir" an instance of the typeclass Show. The compiler knows about a bunch of standard typeclasses (Eq, Ord, Show, Enum, Bounded, Typeable, to name a few) and knows how to make a type into a "suitably good" instance of any of them. If you want to derive instances of more than one typeclass, say it this way: "deriving (Eq,Ord,Show)". Voila! Now we can compare, sort and print data of
that type!
<br />
Side note for Java programmers: just imagine a Java compiler which derives the code
for "implements Serializable" for you...
<br />
Side note for C++ programmers: just imagine that deep copy constructors are<br />
being written for you by compiler....<br />
<br />
----<br />
'''Exercises:'''<br />
* Examine the typeclasses Eq and Show
* Examine the types of (==) and "print"
* Try to make "Dir" an instance of "Eq"
----<br />
<br />
OK, back to our tests. So, what did we have to do in order to make "Dir" an
instance of "Arbitrary"? The minimal definition consists of "arbitrary". Let's
examine it up close:
<br />
 *Main> :t arbitrary
 arbitrary :: (Arbitrary a) => Gen a
<br />
See that "Gen a"? Reminds you of something? Right! Think of "IO a" and "Parser
a", which we've seen already. This is yet another example of an action-returning
function which can be used inside "do"-notation. (You might ask yourself,
wouldn't it be useful to generalize that convenient concept of actions and
"do"? Of course! It is already done; the concept is called a "[[Monad]]" and we will talk about it in Chapter 400 :) )
<br />
Since 'a' here is a [[type variable]] which is an instance of "Arbitrary", we can substitute "Dir" for it. So, how can we make and return an action of type "Gen Dir"?
<br />
Let's look at the code:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-3-2.hs'<br />
arbitrary = liftM2 Dir gen_size gen_name
    -- Generate a random size between 10 and 1400 Mb
    where gen_size = do s <- choose (10,1400)
                        return (s*1024*1024)
          -- Generate a random name 1 to 300 chars long, consisting of the symbols "fubar/"
          gen_name = do n <- choose (1,300)
                        replicateM n (elements "fubar/")
</haskell><br />
<br />
We have used the library-provided functions "choose" and "elements" to build up
"gen_size :: Gen Int" and "gen_name :: Gen String" (exercise: don't take my
word for it. Find a way to check the types of "gen_name" and "gen_size"). Since
"Int" and "String" are the components of "Dir", we surely must be able to use "Gen
Int" and "Gen String" to build "Gen Dir". But where is the "do" block for
that? There is none - there is only a single call to "liftM2".
<br />
Let's examine it:<br />
<br />
 *Main> :t liftM2
 liftM2 :: (Monad m) => (a1 -> a2 -> r) -> m a1 -> m a2 -> m r
<br />
Kind of scary, right? Let's provide the typechecker with more context:
<br />
 *Main> :t liftM2 Dir
 liftM2 Dir :: (Monad m) => m Int -> m String -> m Dir
<br />
Since you have already heard that "Gen" is a "Monad", you can substitute "Gen" for "m" here, obtaining "liftM2 Dir :: Gen Int -> Gen String ->
Gen Dir". Exactly what we wanted!
<br />
Consider "liftM2" to be "advanced topic" of this chapter (which we will cover<br />
later) and just note for now that:<br />
* "2" is a number of arguments for data constructor "Dir" and we have used "liftM2" to construct "Gen Dir" out of "Dir"<br />
* There are also "liftM", "liftM3", "liftM4", "liftM5"<br />
* "liftM2" is defined as "liftM2 f a1 a2 = do x<-a1; y<-a2; return (f x y)"<br />
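To see what "liftM2" actually buys us, here is a small sketch in the familiar "Maybe" monad (the names <tt>pairUp</tt> and <tt>pairUp'</tt> are made up for illustration): the first definition uses "liftM2" just like our "arbitrary" does, and the second is its hand-desugared equivalent, following the definition quoted above:<br />
<br />
<haskell><br />
import Control.Monad (liftM2)<br />
<br />
-- Combine two 'Maybe' values with a two-argument function,<br />
-- just like 'liftM2 Dir' combines two 'Gen' values:<br />
pairUp :: Maybe Int -> Maybe String -> Maybe (Int, String)<br />
pairUp = liftM2 (,)<br />
<br />
-- The same thing, desugared by hand:<br />
pairUp' :: Maybe Int -> Maybe String -> Maybe (Int, String)<br />
pairUp' a1 a2 = do x <- a1<br />
                   y <- a2<br />
                   return (x, y)<br />
</haskell><br />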
<br />
Hopefully, this will all make sense after you read it for the third<br />
time ;)<br />
<br />
Oh, by the way - don't forget to "darcs record" your changes!<br />
<br />
== Chapter 4: REALLY packing the knapsack this time == <br />
<br />
In this chapter we are going to write another not-so-trivial packing<br />
method, compare the efficiency of the packing methods, and learn something new<br />
about debugging and profiling of Haskell programs along the way.<br />
<br />
It might not be immediately obvious whether our packing algorithm is<br />
effective, and if so - in which particular way? Are its runtime,<br />
memory consumption and quality of results satisfactory? Are there any<br />
alternative algorithms, and how do they compare to each other?<br />
<br />
Let's code another solution to the knapsack packing problem, called the "dynamic programming method" and put both variants to the test.<br />
<br />
This time, I'll not dissect the listing and explain it bit by bit. Instead, comments are provided in the code:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-1.hs'<br />
----------------------------------------------------------------------------------<br />
-- Dynamic programming solution to the knapsack (or, rather, disk) packing problem<br />
--<br />
-- Let the `bestDisk x' be the "most tightly packed" disk of total <br />
-- size no more than `x'.<br />
precomputeDisksFor :: [Dir] -> [DirPack]<br />
precomputeDisksFor dirs =<br />
  -- By calculating `bestDisk' for all possible disk sizes, we can<br />
  -- obtain the solution for a particular case by a simple lookup in our list of<br />
  -- solutions :)<br />
  let precomp = map bestDisk [0..]<br />
<br />
      -- How to calculate `bestDisk'? Let's opt for a recursive definition:<br />
      -- Recursion base: the best-packed disk of size 0 is empty<br />
      bestDisk 0 = DirPack 0 []<br />
      -- Recursion step: for size `limit', bigger than 0, the best-packed disk is<br />
      -- computed as follows:<br />
      bestDisk limit =<br />
        -- 1. Take all non-empty dirs that could possibly fit on that disk by themselves.<br />
        -- Consider them one by one. Let the size of a particular dir be `dir_size d'.<br />
        -- Let's add it to the best-packed disk of size <= (limit - dir_size d), thus<br />
        -- producing a disk of size <= limit. Let's do that for all "candidate"<br />
        -- dirs that are not yet on our disk:<br />
        case [ DirPack (dir_size d + s) (d:ds)<br />
             | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
             , dir_size d > 0<br />
             , let (DirPack s ds)=precomp!!(limit - dir_size d)<br />
             , d `notElem` ds<br />
             ] of<br />
          -- We either fail to add any dirs (probably because all of them are too big).<br />
          -- Well, just report that the disk must be left empty:<br />
          [] -> DirPack 0 []<br />
          -- Or we produce some alternative packings. Let's choose the best of them all:<br />
          packs -> maximumBy cmpSize packs<br />
<br />
      cmpSize a b = compare (pack_size a) (pack_size b)<br />
<br />
  in precomp<br />
<br />
-- Once we have precomputed disks of all possible sizes for the given set of dirs, the solution to<br />
-- a particular problem is simple: just take the solution for the required 'media_size', and<br />
-- that's it!<br />
dynamic_pack dirs = (precomputeDisksFor dirs)!!media_size<br />
</haskell><br />
<br />
Notice that it took almost the same amount of text to describe the algorithm and to write an implementation of it. Nice, eh?<br />
<br />
----<br />
<br />
'''Exercises:'''<br />
* Make all necessary amendments to the previously written code to make this example compile. Hints: browse modules Data.List and Data.Ix for functions that are "missing" - maybe you will find them there (use ":browse Module.Name" at ghci prompt). Have you had to define some new instances of some classes? How did you do that?<br />
* <tt>[ other_function local_binding | x <- some_list, x > 0, let local_binding = some_function x ]</tt> is called a "list comprehension". This is another example of "syntactic sugar", which can lead to nicely readable code but, when abused, can lead to syntactic caries :) Do you understand what this sample does: <tt>let solve x = [ y | x <- [0..], y<-[0..], y == x * x ]</tt>? Could you (with the help of a decent tutorial) write a de-sugared version of it? (Yes, I know that finding a square root does not require list traversals, but for the sake of self-education, try and do it)<br />
* Notice that in order to code the quite complex implementation of <tt>precomputeDisksFor</tt> we split it up into several smaller pieces and put them as '''local bindings''' inside a '''let''' clause.<br />
* Notice that we use '''pattern matching''' both to define <tt>bestDisk</tt> on a case-by-case basis and to "peer into" ('''de-construct''') <tt>DirPack</tt> in the <tt>let (DirPack s ds)=precomp!!(limit - dir_size d)</tt> line<br />
* Notice how we use function composition to build the complex condition used to filter the list of dirs<br />
<br />
---- <br />
<br />
Before we move any further, let's make a small cosmetic change to our<br />
code. Right now our solution uses 'Int' to store directory size. In<br />
Haskell, 'Int' is a platform-dependent integer, which imposes certain<br />
limitations on the values of this type. An attempt to compute a value<br />
of type 'Int' that exceeds these bounds will result in an overflow error.<br />
The standard Haskell libraries have a special typeclass<br />
<hask>Bounded</hask>, which allows one to define and examine such bounds:<br />
<br />
Prelude> :i Bounded <br />
class Bounded a where<br />
minBound :: a<br />
maxBound :: a<br />
-- skip --<br />
instance Bounded Int -- Imported from GHC.Enum<br />
<br />
We see that 'Int' is indeed bounded. Let's examine the bounds:<br />
<br />
Prelude> minBound :: Int <br />
-2147483648<br />
Prelude> maxBound :: Int<br />
2147483647<br />
Prelude> <br />
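The same bounds can be queried from compiled code as well; a minimal sketch (on a 64-bit GHC the 'Int' numbers will be bigger than the ones shown above):<br />
<br />
<haskell><br />
-- 'minBound' and 'maxBound' are polymorphic constants; a type<br />
-- annotation selects which 'Bounded' instance we are asking about.<br />
intRange :: (Int, Int)<br />
intRange = (minBound, maxBound)<br />
<br />
charTop :: Char<br />
charTop = maxBound   -- 'Char' is an instance of 'Bounded', too<br />
</haskell><br />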
<br />
Those of you who are C-literate will spot at once that in this case<br />
'Int' is a so-called "signed 32-bit integer", which means that we<br />
would run into errors trying to operate on directories/directory packs<br />
which are bigger than 2 GB.<br />
<br />
Luckily for us, Haskell has integers of arbitrary precision (limited<br />
only by the amount of available memory). The appropriate type is<br />
called 'Integer':<br />
<br />
Prelude> (2^50) :: Int<br />
0 -- overflow<br />
Prelude> (2^50) :: Integer<br />
1125899906842624 -- no overflow<br />
Prelude><br />
<br />
Let's change the definitions of 'Dir' and 'DirPack' to allow for bigger<br />
directory sizes:<br />
<haskell><br />
-- Taken from 'cd-fit-4-2.hs'<br />
data Dir = Dir {dir_size::Integer, dir_name::String} deriving (Eq,Show)<br />
data DirPack = DirPack {pack_size::Integer, dirs::[Dir]} deriving Show<br />
</haskell><br />
<br />
Try to compile the code or load it into ghci. You will get the<br />
following errors:<br />
<br />
cd-fit-4-2.hs:73:79:<br />
Couldn't match `Int' against `Integer'<br />
Expected type: Int<br />
Inferred type: Integer<br />
In the expression: limit - (dir_size d)<br />
In the second argument of `(!!)', namely `(limit - (dir_size d))'<br />
<br />
cd-fit-4-2.hs:89:47:<br />
Couldn't match `Int' against `Integer'<br />
Expected type: Int<br />
Inferred type: Integer<br />
In the second argument of `(!!)', namely `media_size'<br />
In the definition of `dynamic_pack':<br />
dynamic_pack dirs = (precomputeDisksFor dirs) !! media_size<br />
<br />
<br />
It seems like Haskell has some trouble using 'Integer' with '(!!)'.<br />
Let's see why:<br />
<br />
Prelude> :t (!!)<br />
(!!) :: [a] -> Int -> a<br />
<br />
It seems that the definition of '(!!)' demands that the index be an 'Int', not<br />
an 'Integer'. Haskell never converts any type to another type<br />
automatically - the programmer has to ask for that explicitly.<br />
<br />
I will not repeat the section "Standard Haskell Classes" from<br />
[http://haskell.org/onlinereport/basic.html the Haskell Report] and<br />
explain why the typeclasses for various kinds of numbers are organized<br />
the way they are. I will just say that the standard typeclass<br />
<hask>Num</hask> demands that numeric types implement the method<br />
<hask>fromInteger</hask>:<br />
<br />
Prelude> :i Num<br />
class (Eq a, Show a) => Num a where<br />
(+) :: a -> a -> a<br />
(*) :: a -> a -> a<br />
(-) :: a -> a -> a<br />
negate :: a -> a<br />
abs :: a -> a<br />
signum :: a -> a<br />
fromInteger :: Integer -> a<br />
-- Imported from GHC.Num<br />
instance Num Float -- Imported from GHC.Float<br />
instance Num Double -- Imported from GHC.Float<br />
instance Num Integer -- Imported from GHC.Num<br />
instance Num Int -- Imported from GHC.Num<br />
<br />
We see that <hask>Int</hask> is a member of the typeclass<br />
<hask>Num</hask>, so we can use <hask>fromInteger</hask> to convert an<br />
<hask>Integer</hask> index to an <hask>Int</hask> and make the type errors go away:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-2.hs'<br />
-- snip<br />
case [ DirPack (dir_size d + s) (d:ds)<br />
     | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
     , dir_size d > 0<br />
     , let (DirPack s ds)=precomp!!(fromInteger (limit - dir_size d))<br />
     , d `notElem` ds<br />
     ] of<br />
-- snip<br />
dynamic_pack dirs = (precomputeDisksFor dirs)!!(fromInteger media_size)<br />
-- snip<br />
</haskell><br />
<br />
The type errors went away, but a careful reader will spot at once that when<br />
the expression <hask>(limit - dir_size d)</hask> exceeds the bounds<br />
of <hask>Int</hask>, an overflow will occur, and we will not access the<br />
correct list element. Don't worry, we will deal with this in a short while.<br />
<br />
Now, let's code a QuickCheck test for this function along the lines of the test for <tt>greedy_pack</tt>:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-2.hs'<br />
prop_dynamic_pack_is_fixpoint ds =<br />
  let pack = dynamic_pack ds<br />
  in pack_size pack == pack_size (dynamic_pack (dirs pack))<br />
</haskell><br />
<br />
Now, let's try to run it (DON'T PANIC, and save all your work in other applications first!):<br />
<br />
*Main> quickCheck prop_dynamic_pack_is_fixpoint<br />
<br />
Now, you took my advice seriously, didn't you? And you did have your '''Ctrl-C''' handy, didn't you? Most probably, the attempt to run the test resulted in all your memory being taken by the <tt>ghci</tt> process, which you hopefully interrupted soon enough by pressing '''Ctrl-C'''.<br />
<br />
What happened? Who ate all the memory? How do we debug this problem? GHC comes with profiling abilities, but we cannot use them here - they produce a report after the program terminates, and ours doesn't seem to do so without consuming several terabytes of memory first. Still, there is a lot of room for maneuver.<br />
<br />
Let's see. Since we called <tt>dynamic_pack</tt> and it ate all the memory, let's not do that again. Instead, let's see what this function does and tweak it a bit to explore its behavior.<br />
<br />
Since we already know that the random lists of "Dir"s generated for our QuickCheck tests are of modest size (after all, <tt>greedy_pack</tt> munches them without significant memory consumption), the size of the input most probably is not the issue. However, <tt>dynamic_pack</tt> builds quite a huge list internally (via <tt>precomputeDisksFor</tt>). Could this be the problem?<br />
<br />
Let's turn the timing/memory stats on (":set +s" at the ghci prompt) and try to peek at various elements of the list returned by <tt>precomputeDisksFor</tt>:<br />
<br />
Prelude> :l cd-fit.hs<br />
Compiling Main ( cd-fit.hs, interpreted )<br />
Ok, modules loaded: Main.<br />
*Main> :set +s<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 0<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.06 secs, 1277972 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 10<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.00 secs, 0 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 100<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.01 secs, 1519064 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 1000<br />
DirPack {pack_size = 0, dirs = []}<br />
(0.03 secs, 1081808 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 10000<br />
DirPack {pack_size = 0, dirs = []}<br />
(1.39 secs, 12714088 bytes)<br />
*Main> (precomputeDisksFor [Dir 1 "aaa"]) !! 100000<br />
Interrupted.<br />
<br />
Aha! This seems to be the problem: computation of the 100000th element fails to terminate in "reasonable" time - and to think that we have tried to compute the <tt>700*1024*1024</tt>th element...<br />
<br />
Let's modify our code a bit to allow the disk size to be tweaked:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-3.hs'<br />
dynamic_pack limit dirs = (precomputeDisksFor dirs)!!(fromInteger limit)<br />
<br />
prop_dynamic_pack_is_fixpoint ds =<br />
  let pack = dynamic_pack media_size ds<br />
  in pack_size pack == pack_size (dynamic_pack media_size (dirs pack))<br />
<br />
prop_dynamic_pack_small_disk ds =<br />
  let pack = dynamic_pack 50000 ds<br />
  in pack_size pack == pack_size (dynamic_pack 50000 (dirs pack))<br />
<br />
-- rename "old" main to "moin"<br />
main = quickCheck prop_dynamic_pack_small_disk<br />
</haskell><br />
<br />
Compile a profiling version of your code with <tt>ghc -O --make -prof -auto-all -o cd-fit cd-fit.hs</tt> and run it like this: <br />
<br />
$ ./cd-fit +RTS -p<br />
OK, passed 100 tests.<br />
<br />
First things first: note that our code satisfies at least one simple property. Good. Now let's examine the profile. Look into the file "cd-fit.prof", which was produced in your current directory. <br />
<br />
Most probably, you'll see something like this:<br />
<br />
cd-fit +RTS -p -RTS<br />
<br />
total time = 2.18 secs (109 ticks @ 20 ms)<br />
total alloc = 721,433,008 bytes (excludes profiling overheads)<br />
<br />
COST CENTRE MODULE %time %alloc<br />
<br />
precomputeDisksFor Main 88.1 99.8<br />
dynamic_pack Main 11.0 0.0<br />
<br />
individual inherited<br />
COST CENTRE MODULE no. entries %time %alloc %time %alloc<br />
<br />
MAIN MAIN 1 0 0.0 0.0 100.0 100.0<br />
CAF Main 174 11 0.9 0.2 100.0 100.0<br />
prop_dynamic_pack_small_disk Main 181 100 0.0 0.0 99.1 99.8<br />
dynamic_pack Main 182 200 11.0 0.0 99.1 99.8<br />
precomputeDisksFor Main 183 200 88.1 99.8 88.1 99.8<br />
main Main 180 1 0.0 0.0 0.0 0.0<br />
<br />
Examine the column "individual %alloc". As we thought, all memory was<br />
allocated within <tt>precomputeDisksFor</tt>. However, the amount of<br />
memory allocated (more than 700 MB, according to the line "total<br />
alloc") seems to be a little too much for our simple task. We will dig<br />
deeper and find out where we are wasting it.<br />
<br />
Let's examine memory consumption a little closer via so-called "heap<br />
profiles". Run <tt>./cd-fit +RTS -hb</tt>. This produces a "biographical<br />
heap profile", which tells us how various parts of the memory were<br />
used during the program's run time. The heap profile is saved to<br />
"cd-fit.hp". It is next to impossible to read and comprehend as is,<br />
so use "hp2ps cd-fit.hp" to produce a nice PostScript picture which<br />
is worth a thousand words. View it with "gv" or "ghostview" or "full<br />
Adobe Acrobat (not Reader)". (This and subsequent pictures are<br />
'''not''' attached here).<br />
<br />
Notice that most of the graph is taken up by the region marked "VOID". <br />
This means that the memory allocated was never used. Notice that there are<br />
'''no''' areas marked "USE", "LAG" or "DRAG". It seems our<br />
program hardly uses '''any''' of the allocated memory at all. Wait a<br />
minute! How could that be? Surely it must use something when it packs<br />
those randomly-generated directories, which are 10 to 1400 Mb in size,<br />
onto imaginary disks of 50000 bytes... Oops. Severe size<br />
mismatch. We should have spotted it earlier, when we were timing<br />
<tt>precomputeDisksFor</tt>. Scroll back and observe how each run<br />
returned the very same result - an empty directory set.<br />
<br />
Our random directories are too big, but the code nevertheless spends time<br />
and memory trying to "pack" them. Obviously,<br />
<tt>precomputeDisksFor</tt> (which is responsible for 90% of the total<br />
memory consumption and run time) is flawed in some way.<br />
<br />
Let's take a closer look at what takes up so much memory. Run<br />
<tt>./cd-fit +RTS -h -hbvoid</tt> and produce a PostScript picture for<br />
this memory profile. This will give us a detailed breakdown of all<br />
memory whose "biography" shows that it's been "VOID" (unused). My<br />
picture (and, I presume, yours as well) shows that the VOID memory<br />
consists of "thunks" labeled "precomputeDisksFor/pre...". We can<br />
safely assume that the second word would be "precomp" (You wonder why?<br />
Look again at the code and try to find a function named "pre.*" which is<br />
called from inside <tt>precomputeDisksFor</tt>)<br />
<br />
This means that the memory has been taken up by the list generated inside<br />
"precomp". Rumor has it that memory leaks in Haskell are caused by<br />
either too little laziness or too much laziness. It seems we have<br />
too little laziness here: we evaluate more elements of the list than<br />
we actually need and keep them from being garbage-collected. <br />
<br />
Note how we look up an element of "precomp" in this piece of code:<br />
<br />
<haskell><br />
case [ DirPack (dir_size d + s) (d:ds)<br />
     | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
     , dir_size d > 0<br />
     , let (DirPack s ds)=precomp!!(fromInteger (limit - dir_size d))<br />
     , d `notElem` ds<br />
</haskell><br />
<br />
<br />
Obviously, the whole list generated by "precomp" must be kept in<br />
memory for such lookups: since we can't be sure that any particular<br />
element will not be needed again, none of them can be garbage-collected.<br />
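The restructuring we are about to do can be illustrated with a toy example (the names <tt>bestMemo</tt> and <tt>bestDirect</tt> are made up; this is not our packing code): both functions compute the same result, but the first keeps the whole memo list reachable across lookups, while the second retains nothing between recursive calls:<br />
<br />
<haskell><br />
-- List-based memoization: each lookup via (!!) walks the spine of<br />
-- 'memo' and keeps it reachable, so its elements cannot be<br />
-- garbage-collected between lookups.<br />
bestMemo :: Int -> Int<br />
bestMemo n = memo !! n<br />
  where memo = map best [0..]<br />
        best 0 = 0<br />
        best k = 1 + memo !! (k - 1)<br />
<br />
-- Plain recursion: recomputes on demand and holds on to nothing.<br />
bestDirect :: Int -> Int<br />
bestDirect 0 = 0<br />
bestDirect k = 1 + bestDirect (k - 1)<br />
</haskell><br />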
<br />
Let's rewrite the code to eliminate the list (incidentally, this will also deal with the possible Int overflow when accessing "precomp" via the (!!) operator):<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-4.hs'<br />
-- Let the `bestDisk x' be the "most tightly packed" disk of total <br />
-- size no more than `x'.<br />
-- How to calculate `bestDisk'? Lets opt for a recursive definition:<br />
-- Recursion base: best packed disk of size 0 is empty and best-packed<br />
-- disk for empty list of directories on it is also empty.<br />
bestDisk 0 _ = DirPack 0 []<br />
bestDisk _ [] = DirPack 0 []<br />
-- Recursion step: for size `limit`, bigger than 0, best packed disk is<br />
-- computed as follows:<br />
bestDisk limit dirs =<br />
   -- Take all non-empty dirs that could possibly fit on that disk by themselves.<br />
   -- Consider them one by one. Let the size of a particular dir be `dir_size d'.<br />
   -- Let's add it to the best-packed disk of size <= (limit - dir_size d), thus<br />
   -- producing a disk of size <= limit. Let's do that for all "candidate"<br />
   -- dirs that are not yet on our disk:<br />
   case [ DirPack (dir_size d + s) (d:ds)<br />
        | d <- filter ( (inRange (1,limit)).dir_size ) dirs<br />
        , dir_size d > 0<br />
        , let (DirPack s ds)= bestDisk (limit - dir_size d) dirs<br />
        , d `notElem` ds<br />
        ] of<br />
     -- We either fail to add any dirs (probably because all of them are too big).<br />
     -- Well, just report that the disk must be left empty:<br />
     [] -> DirPack 0 []<br />
     -- Or we produce some alternative packings. Let's choose the best of them all:<br />
     packs -> maximumBy cmpSize packs<br />
<br />
cmpSize a b = compare (pack_size a) (pack_size b)<br />
<br />
dynamic_pack limit dirs = bestDisk limit dirs<br />
</haskell><br />
<br />
<br />
Compile the profiling version of this code and obtain the overall<br />
execution profile (with "+RTS -p"). You'll get something like this:<br />
<br />
cd-fit +RTS -p -RTS<br />
<br />
total time = 0.00 secs (0 ticks @ 20 ms)<br />
total alloc = 1,129,520 bytes (excludes profiling overheads)<br />
<br />
COST CENTRE MODULE %time %alloc<br />
<br />
CAF GHC.Float 0.0 4.4<br />
main Main 0.0 93.9<br />
<br />
individual inherited<br />
COST CENTRE MODULE no. entries %time %alloc %time %alloc<br />
MAIN MAIN 1 0 0.0 0.0 0.0 100.0<br />
main Main 180 1 0.0 93.9 0.0 94.2<br />
prop_dynamic_pack_small_disk Main 181 100 0.0 0.0 0.0 0.3<br />
dynamic_pack Main 182 200 0.0 0.2 0.0 0.3<br />
bestDisk Main 183 200 0.0 0.1 0.0 0.1<br />
<br />
We have achieved a major improvement: memory consumption is reduced by a factor<br />
of 700! Now we can test the code on the "real task" - change the<br />
code to run the test for packing the full-sized disk:<br />
<br />
<haskell><br />
main = quickCheck prop_dynamic_pack_is_fixpoint<br />
</haskell><br />
<br />
Compile with profiling and run (with "+RTS -p"). If you are not lucky<br />
and a considerably big test set is randomly generated for your<br />
run, you'll have to wait. And wait even more. And more.<br />
<br />
Go make some tea. Drink it. Read some Tolstoy (Do you have "War and<br />
Peace" handy?). Chances are that by the time you are done with<br />
Tolstoy, the program will still be running (just take my word for it, don't<br />
check).<br />
<br />
If you are lucky, your program will finish fast enough and leave you<br />
with a profile. According to the profile, the program spends 99% of its time<br />
inside <tt>bestDisk</tt>. Could we speed up <tt>bestDisk</tt> somehow?<br />
<br />
Note that <tt>bestDisk</tt> performs several simple calculations for<br />
which it must call itself. However, this is done rather inefficiently -<br />
each time we pass <tt>bestDisk</tt> the exact same set of<br />
directories it was called with, even if we have already "packed"<br />
some of them. Let's amend this:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-5.hs'<br />
case [ DirPack (dir_size d + s) (d:ds)<br />
     | let small_enough = filter ( (inRange (0,limit)).dir_size ) dirs<br />
     , d <- small_enough<br />
     , dir_size d > 0<br />
     , let (DirPack s ds)= bestDisk (limit - dir_size d) (delete d small_enough)<br />
     ] of<br />
</haskell><br />
<br />
Recompile and run again. Runtimes may be lengthy, but bearable, and the<br />
number of times <tt>bestDisk</tt> is called (according to the profile)<br />
should decrease significantly. <br />
<br />
Finally, let's compare both packing algorithms. Intuitively, we feel<br />
that the greedy algorithm should produce worse results, don't we? Let's put<br />
this feeling to the test:<br />
<br />
<haskell><br />
-- Taken from 'cd-fit-4-5.hs'<br />
prop_greedy_pack_is_no_better_than_dynamic_pack ds =<br />
  pack_size (greedy_pack ds) <= pack_size (dynamic_pack media_size ds)<br />
</haskell><br />
<br />
Verify that it is indeed so by running <tt>quickCheck</tt> on this<br />
test several times. I feel that this concludes our knapsacking<br />
exercises. <br />
<br />
Adventurous readers could continue by implementing so-called<br />
"scaling" for <tt>dynamic_pack</tt>, where we divide all directory<br />
sizes and the media size by the size of the smallest directory, so as to proceed<br />
with smaller numbers (which promises faster runtimes).<br />
<br />
== Chapter 5: (Ab)using monads and destructing constructors for fun and profit ==<br />
<br />
We have already mentioned monads quite a few times. They are described in<br />
numerous articles and tutorials (see Chapter 400). It's hard to read a<br />
daily dose of any Haskell mailing list and not come across the word<br />
"monad" a dozen times.<br />
<br />
Since we have already made quite some progress with Haskell, it's time we<br />
revisited monads once again. I will let other sources teach you the<br />
theory behind monads, the overall usefulness of the concept, etc.<br />
Instead, I will focus on providing you with examples.<br />
<br />
Let's take a part of a real-world program which involves XML<br />
processing. We will work with XML tag attributes, which are<br />
essentially named values:<br />
<haskell><br />
-- Taken from 'chapter5-1.hs'<br />
type Attribute = (Name, AttValue)<br />
</haskell><br />
<br />
'Name' is a plain string, and the value can be '''either''' a string or<br />
a list of references (also strings) to other attributes which hold the actual<br />
values (now, this is not a valid XML thing, but for the sake of<br />
providing a nice example, let's accept it). The word "either" suggests<br />
that we use the 'Either' datatype:<br />
<haskell><br />
type AttValue = Either Value [Reference]<br />
type Name = String<br />
type Value = String<br />
type Reference = String<br />
<br />
-- Sample list of simple attributes:<br />
simple_attrs = [ ( "xml:lang", Left "en" )<br />
, ( "xmlns", Left "jabber:client" )<br />
, ( "xmlns:stream", Left "http://etherx.jabber.org/streams" ) ]<br />
<br />
-- Sample list of attributes with references:<br />
complex_attrs = [ ( "xml:lang", Right ["lang"] )<br />
, ( "lang", Left "en" )<br />
, ( "xmlns", Right ["ns","subns"] )<br />
, ( "ns", Left "jabber" )<br />
, ( "subns", Left "client" )<br />
, ( "xmlns:stream", Left "http://etherx.jabber.org/streams" ) ]<br />
</haskell><br />
<br />
'''Our task is:''' to write a function that will look up the value of an<br />
attribute by its name in the given list of attributes. When an<br />
attribute contains reference(s), we resolve them (looking up the<br />
referenced attributes in the same list) and concatenate their values,<br />
separated by colons. Thus, lookup of the attribute "xmlns" from both<br />
sample sets of attributes should return the same value.<br />
<br />
Following the example set by the <hask>Data.List.lookup</hask> from<br />
the standard libraries, we will call our function<br />
<hask>lookupAttr</hask> and it will return <hask>Maybe Value</hask>,<br />
allowing for lookup errors:<br />
<br />
<haskell><br />
-- Taken from 'chapter5-1.hs'<br />
lookupAttr :: Name -> [Attribute] -> Maybe Value<br />
-- Since we don't have the code for 'lookupAttr' yet, but want<br />
-- to compile the code already, we use the function 'undefined' to<br />
-- provide a default, "always-fail-with-runtime-error" function body.<br />
lookupAttr = undefined<br />
</haskell><br />
<br />
Let's try to code <hask>lookupAttr</hask> using <hask>lookup</hask> in<br />
a very straightforward way:<br />
<br />
<haskell><br />
-- Taken from 'chapter5-1.hs'<br />
import Data.List<br />
<br />
lookupAttr :: Name -> [Attribute] -> Maybe Value<br />
lookupAttr nm attrs =<br />
  -- First, we look up 'Maybe AttValue' by name and<br />
  -- check whether we are successful:<br />
  case (lookup nm attrs) of<br />
    -- Pass the lookup error through.<br />
    Nothing -> Nothing<br />
    -- If the given name exists, see whether it holds a value or references:<br />
    Just attv -> case attv of<br />
      -- It's a value. Return it!<br />
      Left val -> Just val<br />
      -- It's a list of references :(<br />
      -- We have to look them up, accounting for<br />
      -- possible failures.<br />
      -- First, we will perform a lookup of all references ...<br />
      Right refs -><br />
        let vals = [ lookupAttr ref attrs | ref <- refs ]<br />
            -- .. then, we will exclude lookup failures<br />
            wo_failures = filter (/=Nothing) vals<br />
            -- ... find a way to remove the annoying 'Just' wrapper<br />
            stripJust (Just v) = v<br />
            -- ... use it to extract all lookup results as strings<br />
            strings = map stripJust wo_failures<br />
        in<br />
          -- ... finally, combine them into a single String. <br />
          -- If all lookups failed, we should pass the failure to the caller.<br />
          case null strings of<br />
            True -> Nothing<br />
            False -> Just (concat (intersperse ":" strings))<br />
</haskell><br />
<br />
Testing:<br />
<br />
*Main> lookupAttr "xmlns" complex_attrs<br />
Just "jabber:client"<br />
*Main> lookupAttr "xmlns" simple_attrs<br />
Just "jabber:client"<br />
*Main><br />
<br />
It works, but ... it seems strange that such a boatload of code is<br />
required for quite a simple task. If you examine the code closely,<br />
you'll see that the code bloat is caused by:<br />
<br />
* the fact that after each step we check whether an error occurred<br />
<br />
* unwrapping Strings from the <hask>Maybe</hask> and <hask>Either</hask> data constructors and wrapping them back.<br />
<br />
At this point C++/Java programmers would say that since we just pass<br />
errors upstream, all those cases could be replaced by a single "try<br />
... catch ..." block, and they would be right. Does this mean that<br />
Haskell programmers are reduced to using "case"s, which were already<br />
obsolete 10 years ago?<br />
<br />
Monads to the rescue! As you can read elsewhere (see section 400),<br />
monads are used in advanced ways to construct computations from other<br />
computations. Just what we need - we want to combine several simple<br />
steps (look up a value, look up a reference, ...) into the function<br />
<hask>lookupAttr</hask> in a way that takes possible failures<br />
into account.<br />
<br />
Let's start with the code and dissect it afterwards:<br />
<haskell><br />
-- Taken from 'chapter5-2.hs'<br />
import Control.Monad<br />
<br />
lookupAttr' nm attrs = do<br />
  -- First, we look up 'AttValue' by name<br />
  attv <- lookup nm attrs<br />
  -- See whether it holds a value or references:<br />
  case attv of<br />
    -- It's a value. Return it!<br />
    Left val -> Just val<br />
    -- It's a list of references :(<br />
    -- We have to look them up, accounting for<br />
    -- possible failures.<br />
    -- First, we will perform a lookup of all references ...<br />
    Right refs -> do vals <- sequence $ map (flip lookupAttr' attrs) refs<br />
                     -- ... since all failures are already excluded by "monad magic",<br />
                     -- ... and all 'Just's have been removed likewise,<br />
                     -- ... we just combine the values into a single String,<br />
                     -- ... and return failure if it is empty. <br />
                     guard (not (null vals))<br />
                     return (concat (intersperse ":" vals))<br />
</haskell><br />
<br />
'''Exercise''': compile the code and test that <hask>lookupAttr</hask><br />
and <hask>lookupAttr'</hask> really behave in the same way. Try to<br />
write a QuickCheck test for that, defining the<br />
<hask>instance Arbitrary Name</hask> such that arbitrary names are taken from<br />
the names available in <hask>simple_attrs</hask>.<br />
<br />
Well, back to the story. Noticed the drastic reduction in code size?<br />
If you drop the comments, the code occupies a mere 7 lines instead of 13<br />
- an almost two-fold reduction. How did we achieve this?<br />
<br />
First, notice that we never ever check whether some computation<br />
returned <hask>Nothing</hask> anymore. Yet, try to look up some<br />
non-existing attribute name, and <hask>lookupAttr'</hask> will return<br />
<hask>Nothing</hask>. How does this happen? The secret lies in the fact<br />
that the type constructor <hask>Maybe</hask> is a "monad".<br />
<br />
We use the keyword <hask>do</hask> to indicate that the following block of<br />
code is a sequence of '''monadic actions''', where '''monadic magic'''<br />
happens when we use '<-', 'return' or move from one action to<br />
another.<br />
<br />
Different monads have different '''magic'''. Library code says that<br />
the type constructor <hask>Maybe</hask> is such a monad that we can use<br />
<hask><-</hask> to "extract" values from the wrapper <hask>Just</hask> and<br />
use <hask>return</hask> to put them back in the form of<br />
<hask>Just some_value</hask>. When we move from one action in the "do" block to<br />
another, a check happens. If the action returned <hask>Nothing</hask>,<br />
all subsequent computations are skipped and the whole "do" block<br />
returns <hask>Nothing</hask>.<br />
<br />
Try this to understand it all better:<br />
<haskell><br />
*Main> let foo x = do v <- x; return (v+1) in foo (Just 5)<br />
Just 6<br />
*Main> let foo x = do v <- x; return (v+1) in foo Nothing <br />
Nothing<br />
*Main> let foo x = do v <- x; return (Data.Char.ord v) in foo (Just 'a')<br />
Just 97<br />
*Main> let foo x = do v <- x; return (Data.Char.ord v) in foo Nothing <br />
Nothing<br />
*Main> <br />
</haskell><br />
<br />
Do not mind <hask>sequence</hask> and <hask>guard</hask> for now<br />
- we will get to them in a little while.<br />
<br />
Since we have already removed one reason for code bloat, it is time to deal<br />
with the other one. Notice that we have to use <hask>case</hask> to<br />
'''deconstruct''' the value of type <hask>Either Value<br />
[Reference]</hask>. Surely we are not the first to do this, and such<br />
a use case must be quite a common one. <br />
<br />
Indeed, there is a simple remedy for our case, and it is called<br />
<hask>either</hask>:<br />
<br />
*Main> :t either<br />
either :: (a -> c) -> (b -> c) -> Either a b -> c<br />
<br />
Scary type signature, but here are examples to help you grok it:<br />
<br />
*Main> :t either (+1) (length) <br />
either (+1) (length) :: Either Int [a] -> Int<br />
*Main> either (+1) (length) (Left 5)<br />
6<br />
*Main> either (+1) (length) (Right "foo")<br />
3<br />
*Main> <br />
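(As an aside, <hask>either</hask> is easy to define yourself. Here is a sketch matching the Prelude's behavior; it is named <hask>myEither</hask> only to avoid a name clash with the Prelude.)<br />

```haskell
-- A possible definition of either, matching the Prelude's behavior:
-- apply the first function to a Left value, the second to a Right value.
myEither :: (a -> c) -> (b -> c) -> Either a b -> c
myEither f _ (Left  x) = f x
myEither _ g (Right y) = g y
```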
<br />
It seems like this is exactly our case. Let's replace the<br />
<hask>case</hask> with an invocation of <hask>either</hask>:<br />
<br />
<haskell><br />
-- Taken from 'chapter5-3.hs'<br />
lookupAttr'' nm attrs = do<br />
attv <- lookup nm attrs<br />
either Just (dereference attrs) attv<br />
where<br />
dereference attrs refs = do <br />
vals <- sequence $ map (flip lookupAttr'' attrs) refs<br />
guard (not (null vals))<br />
return (concat (intersperse ":" vals))<br />
</haskell><br />
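If you want to experiment with this pattern in isolation, here is a self-contained sketch. The table <hask>sampleAttrs</hask>, the type synonyms, and the name <hask>lookupAttr2</hask> are made up for illustration; only the structure mirrors the code above.<br />

```haskell
import Data.List (intersperse)
import Control.Monad (guard)

type Name  = String
type Value = String
-- An attribute maps a name either to a direct value
-- or to a list of references to other attributes:
type Attrs = [(Name, Either Value [Name])]

lookupAttr2 :: Name -> Attrs -> Maybe Value
lookupAttr2 nm attrs = do
  attv <- lookup nm attrs
  either Just (dereference attrs) attv
  where
    dereference attrs refs = do
      vals <- sequence (map (flip lookupAttr2 attrs) refs)
      guard (not (null vals))
      return (concat (intersperse ":" vals))

-- A made-up table: "theme" refers to "color" and "size"
sampleAttrs :: Attrs
sampleAttrs = [ ("color", Left "red")
              , ("size",  Left "10")
              , ("theme", Right ["color", "size"]) ]
```

Here <hask>lookupAttr2 "theme" sampleAttrs</hask> yields <hask>Just "red:10"</hask>, while looking up a missing name yields <hask>Nothing</hask> - with no explicit <hask>Nothing</hask> checks in sight.<br />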
<br />
It keeps getting better and better :)<br />
<br />
Now, as a semi-exercise, try to understand the meaning of "sequence",<br />
"guard" and "flip" by looking at the following ghci sessions:<br />
<br />
*Main> :t sequence<br />
sequence :: (Monad m) => [m a] -> m [a]<br />
*Main> :t [Just 'a', Just 'b', Nothing, Just 'c']<br />
[Just 'a', Just 'b', Nothing, Just 'c'] :: [Maybe Char]<br />
*Main> :t sequence [Just 'a', Just 'b', Nothing, Just 'c']<br />
sequence [Just 'a', Just 'b', Nothing, Just 'c'] :: Maybe [Char]<br />
<br />
*Main> sequence [Just 'a', Just 'b', Nothing, Just 'c']<br />
Nothing<br />
*Main> sequence [Just 'a', Just 'b', Nothing]<br />
Nothing<br />
*Main> sequence [Just 'a', Just 'b']<br />
Just "ab"<br />
<br />
*Main> :t [putStrLn "a", putStrLn "b"]<br />
[putStrLn "a", putStrLn "b"] :: [IO ()]<br />
*Main> :t sequence [putStrLn "a", putStrLn "b"]<br />
sequence [putStrLn "a", putStrLn "b"] :: IO [()]<br />
*Main> sequence [putStrLn "a", putStrLn "b"]<br />
a<br />
b<br />
<br />
*Main> :t [putStrLn "a", fail "stop here", putStrLn "b"]<br />
[putStrLn "a", fail "stop here", putStrLn "b"] :: [IO ()]<br />
*Main> :t sequence [putStrLn "a", fail "stop here", putStrLn "b"]<br />
sequence [putStrLn "a", fail "stop here", putStrLn "b"] :: IO [()]<br />
*Main> sequence [putStrLn "a", fail "stop here", putStrLn "b"]<br />
a<br />
*** Exception: user error (stop here)<br />
<br />
Notice that for the <hask>Maybe</hask> monad, <hask>sequence</hask> continues<br />
execution until the first <hask>Nothing</hask>. The same behavior can be<br />
observed for the IO monad. Note, however, that these different behaviors are<br />
not hardcoded into the definition of <hask>sequence</hask>!<br />
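Indeed, here is a possible definition of <hask>sequence</hask> (essentially the standard one, written with explicit recursion). It mentions neither <hask>Maybe</hask> nor IO; the per-monad behavior comes entirely from each monad's own "do" magic.<br />

```haskell
-- A generic definition of sequence: run the actions left to right,
-- collecting their results. Nothing here is Maybe- or IO-specific.
mySequence :: Monad m => [m a] -> m [a]
mySequence []     = return []
mySequence (m:ms) = do x  <- m
                       xs <- mySequence ms
                       return (x:xs)
```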
<br />
Now, let's examine <hask>guard</hask>:<br />
<br />
*Main> let foo x = do v <- x; guard (v/=5); return (v+1) in map foo [Just 4, Just 5, Just 6] <br />
[Just 5,Nothing,Just 7]<br />
<br />
As you can see, it's just a simple way to "stop" execution on some<br />
condition.<br />
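For instance, <hask>guard</hask> gives a one-line "safe division". The function <hask>safeDiv</hask> below is our own example, not a library function.<br />

```haskell
import Control.Monad (guard)

-- Division that "stops" with Nothing instead of crashing on zero:
safeDiv :: Int -> Int -> Maybe Int
safeDiv x y = do guard (y /= 0)
                 return (x `div` y)
```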
<br />
If you have been hooked on monads, I urge you to read "All About<br />
Monads" right now (link in Chapter 400).<br />
<br />
== Chapter 6: Where do you want to go tomorrow? ==<br />
<br />
As the name implies, the author is open to proposals - where should<br />
we go next? I had networking + xml/xmpp in mind, but it might be too<br />
heavy and too narrow for most of the readers.<br />
<br />
What do you think? Drop me a line.<br />
<br />
== Chapter 400: Monads up close ==<br />
<br />
Read [http://en.wikibooks.org/wiki/Haskell/Understanding_monads this wikibook chapter]. <br />
Then, read [http://horna.org.ua/books/All_About_Monads.pdf "All about monads"] (PDF).<br />
'Nuff said :)<br />
<br />
== Chapter 500: IO up close ==<br />
<br />
Shows that:<br />
<br />
<haskell><br />
c = do a <- someAction<br />
b <- someOtherAction<br />
print (bar b)<br />
print (foo a)<br />
print "done"<br />
</haskell><br />
<br />
really is just syntactic sugar for:<br />
<br />
<haskell><br />
c = someAction >>= \a -><br />
someOtherAction >>= \b -><br />
print (bar b) >><br />
print (foo a) >><br />
print "done"<br />
</haskell><br />
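The same desugaring works in any monad. Here is a small, checkable instance of it in the <hask>Maybe</hask> monad (our own example); both definitions evaluate to <hask>Just 3</hask>.<br />

```haskell
-- do-notation version:
sugared :: Maybe Int
sugared = do a <- Just 1
             b <- Just 2
             return (a + b)

-- the same computation, desugared into >>= and lambdas:
desugared :: Maybe Int
desugared = Just 1 >>= \a ->
            Just 2 >>= \b ->
            return (a + b)
```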
<br />
and explains about ">>=" and ">>". Oh wait. This was already explained<br />
in Chapter 400 :)<br />
<br />
== Chapter 9999: Installing Haskell Compiler/Interpreter and all necessary software ==<br />
<br />
Plenty of material on this on the web and this wiki. Just go get<br />
yourself an installation of [[GHC]] (6.4 or above) or [[Hugs]] (v200311 or<br />
above) and "[[darcs]]", which we will use for version control.<br />
<br />
== Chapter 10000: Thanks! ==<br />
<br />
Thanks for comments, proofreading, good advice and kind words go to:<br />
Helge, alt, dottedmag, Paul Moore, Ben Rudiak-Gould, Jim Wilkinson,<br />
Andrew Zhdanov (avalez), Martin Percossi, SpellingNazi, Davor<br />
Cubranic, Brett Giles, Stdrange, Brian Chrisman, Nathan Collins,<br />
Anastasia Gornostaeva (ermine), Remi, Ptolomy, Zimbatm,<br />
HenkJanVanTuyl, Miguel, Mforbes, Kartik Agaram, Jake Luck, Ketil<br />
Malde, Mike Mimic, Jens Kubieziel.<br />
<br />
If I should have mentioned YOU and forgot - tell me so.<br />
<br />
Without you I would have stopped after Chapter 1 :)<br />
<br />
Languages: [[Haskellへのヒッチハイカーガイド|jp]]</div>Imzhttps://wiki.haskell.org/index.php?title=HaskellWiki_talk:Community&diff=39118HaskellWiki talk:Community2011-03-22T13:05:01Z<p>Imz: /* captcha not visible in (emacs-)w3m :( */ a solution example from emacswiki.org</p>
<hr />
<div>= Page renaming =<br />
<br />
I thought that [[HaskellWiki:Community]] is more concise than [[HaskellWiki:Community portal]]. In addition, the '''Haskell Performance Resource''' is also just called [[Performance]]. -- [[User:Wolfgang Jeltsch|Wolfgang Jeltsch]] 23:27, 25 February 2006 (UTC)<br />
<br />
= Page position =<br />
<br />
I argue this page should be removed from its prominent position at the<br />
top of the front page. This page is getting 10,000 hits, making it one<br />
of the most popular pages, but with very little content of wide appeal. <br />
<br />
I think the name is misleading: viewers think 'Community' will point<br />
them to something like the content of the 'The Haskell<br />
community' column on the frontpage, when it's just a guide to editing the<br />
wiki (which only 1% of visitors want to do). <br />
<br />
So, my proposal: [[HaskellWiki:Community_Portal]] should be removed from<br />
the top of the front page altogether. Its second placement, at the bottom of<br />
the page near the editing facilities, seems more appropriate.<br />
<br />
While we're here, I think 'All pages' should also disappear from the top<br />
of the front page. Site maps aren't generally the first thing you see on<br />
a web site, right? And that the remaining elements of the bar:<br />
<br />
All Pages - Categories - Community Portal<br />
Language | Packages | Standard libraries | Idioms | Tools | Proposals<br />
<br />
should appear on the left hand column of the front page. Currently we<br />
have to scroll over this stuff to get to the most popular content.<br />
<br />
Opinions? -- [[User:DonStewart]]<br />
<br />
:Makes sense. &mdash;[[User:Ashley Y|Ashley Y]] 00:57, 11 April 2006 (UTC)<br />
<br />
= Wiki dump =<br />
<br />
Is it easily possible to get a dump of the current state of all wiki pages?<br />
I mean the Wiki markup, not the generated HTML.<br />
I also do not need images.<br />
For HTML I could use wget starting at Special:AllPages.<br />
<br />
= captcha not visible in (emacs-)w3m :( =<br />
<br />
I wanted to [http://www.haskell.org/haskellwiki/index.php?title=Special:Userlogin&type=signup create an account] here, and I was using [http://www.emacswiki.org/emacs/emacs-w3m emacs-w3m]-1.4.259-[http://prometheus.altlinux.org/en/5.1/srpms/emacs-w3m alt0.2.20080303] with [http://en.wikipedia.org/wiki/W3m w3m]-0.5.2-[http://prometheus.altlinux.org/en/5.1/srpms/w3m alt2.1].<br />
<br />
The captcha wasn't shown in emacs-w3m, so I couldn't register without switching to another, bloated browser for a while.<br />
<br />
Perhaps, there are ways to mitigate this problem--either on the side of the wiki (by modifying the markup for the captcha so that it makes some sense in emacs-w3m) or finding what to improve in emacs-w3m for it to show such captchas, aren't there?--[[User:Imz|Imz]] 00:30, 14 March 2011 (UTC)<br />
<br />
Oh, I wasn't able to post this comment for the same reason: edits with links require that I solve a captcha, but, using emacs-w3m, I wasn't able to see it! (So, now I have pasted my comment into another, bloated browser.) :( --[[User:Imz|Imz]] 00:30, 14 March 2011 (UTC)<br />
<br />
:They came up with a solution for such a problem at the Emacs Wiki: they have a special URL to set up cookies for w3m, see the link under [http://www.emacswiki.org/emacs/emacs-w3m#toc2 ''Editing Emacs Wiki''] section. Can a solution be devised for the Haskell Wiki?--[[User:Imz|Imz]] 13:05, 22 March 2011 (UTC)</div>Imzhttps://wiki.haskell.org/index.php?title=Non-empty_list&diff=39000Non-empty list2011-03-14T00:44:35Z<p>Imz: Links for the mentioned languages.</p>
<hr />
<div>Errors such as taking <hask>head</hask> or <hask>tail</hask> of the<br />
empty list in Haskell are equivalent to dereferencing a null<br />
pointer in C/C++ or a <code>NullPointerException</code> in Java. These<br />
errors occur because the true domain of the function is smaller than<br />
the function's type suggests. For example, the type of<br />
<hask>head</hask> says that the function applies to any list. In<br />
reality, it can be meaningfully applied only to non-empty<br />
lists. One can eliminate such errors by giving the functions<br />
<hask>head</hask> and <hask>tail</hask> a more precise type, such as<br />
<hask>FullList a</hask>. Languages like [http://en.wikipedia.org/wiki/Cyclone_programming_language Cyclone] and [http://en.wikipedia.org/wiki/C%CF%89 Cw] do exactly<br />
that.<br />
<br />
It must be emphasized that we can eliminate head-of-empty-list errors<br />
'''now''', without any modification to the Haskell type system, without<br />
developing any new tool. In fact, it is possible in Haskell98! The<br />
same technique applies to OCaml and even Java and C++. The ''only''<br />
required advancement is in our thinking and programming style.<br />
<br />
Maybe, you are also interested in<br />
[http://www.haskell.org/pipermail/haskell-cafe/2006-November/019644.html advocacy] of this style.<br />
<br />
<br />
== Safe list functions ==<br />
<br />
Here's the 0th approximation of the advocated approach:<br />
<br />
<haskell><br />
{-# Haskell98! #-}<br />
-- Safe list functions<br />
<br />
module NList (FullList,<br />
fromFL,<br />
indeedFL,<br />
decon,<br />
head,<br />
tail,<br />
Listable (..)<br />
) where<br />
<br />
import Prelude hiding (head, tail)<br />
<br />
newtype FullList a = FullList [a] -- data constructor is not exported!<br />
<br />
fromFL (FullList x) = x -- Injection into general lists<br />
<br />
-- The following is an analogue of `maybe'<br />
indeedFL :: [a] -> w -> (FullList a -> w) -> w<br />
indeedFL x on_empty on_full <br />
| null x = on_empty<br />
| otherwise = on_full $ FullList x<br />
<br />
-- A possible alternative, with an extra Maybe tagging<br />
-- indeedFL :: [a] -> Maybe (FullList a)<br />
<br />
-- A more direct analogue of `maybe', for lists<br />
decon :: [a] -> w -> (a -> [a] -> w) -> w<br />
decon [] on_empty on_full = on_empty<br />
decon (h:t) on_empty on_full = on_full h t<br />
<br />
<br />
-- The following are _total_ functions<br />
-- They are guaranteed to be safe, and so we could have used<br />
-- unsafeHead# and unsafeTail# if GHC provided them...<br />
<br />
head :: FullList a -> a<br />
head (FullList (x:_)) = x<br />
<br />
tail :: FullList a -> [a]<br />
tail (FullList (_:x)) = x<br />
<br />
-- Mapping over a non-empty list gives a non-empty list<br />
instance Functor FullList where<br />
fmap f (FullList x) = FullList $ map f x<br />
<br />
<br />
-- Adding something to a general list surely gives a non-empty list<br />
infixr 5 !:<br />
<br />
class Listable l where<br />
(!:) :: a -> l a -> FullList a<br />
<br />
instance Listable [] where<br />
(!:) h t = FullList (h:t)<br />
<br />
instance Listable FullList where<br />
(!:) h (FullList t) = FullList (h:t)<br />
</haskell><br />
<br />
<br />
Now we can write<br />
<haskell><br />
import NList<br />
import Prelude hiding (head, tail)<br />
safe_reverse l = loop l [] <br />
where<br />
loop l accum = indeedFL l accum $<br />
(\l -> loop (tail l) (head l : accum))<br />
<br />
test1 = safe_reverse [1,2,3]<br />
</haskell><br />
<br />
As we can see, the null test is algorithmic. After we've done it, head<br />
and tail no longer need to check for null list. Those head and tail<br />
functions are total. Thus we achieve both safety and performance.<br />
<br />
We can also write<br />
<haskell><br />
-- Again, we are statically assured of no head [] error!<br />
test2 = head $ 1 !: 2 !: 3 !: []<br />
</haskell><br />
<br />
I should point to<br />
[http://pobox.com/~oleg/ftp/Computation/lightweight-dependent-typing.html Lightweight dependent typing] for justification and formalization, as<br />
well as for further, more complex examples. <br />
We can also use the approach to<br />
ensure various control properties, e.g., the yield property: a thread may<br />
not invoke `yield' while holding a lock. We can assure this property<br />
both for recursive and non-recursive locks.<br />
<br />
If there is a surprise in this, it is in the triviality of the<br />
approach. One can't help but wonder why we don't program in this<br />
style.<br />
<br />
== Integrating with the existing list-processing functions ==<br />
<br />
Jan-Willem Maessen wrote:<br />
<blockquote><br />
In addition, we have this rather nice assembly of functions which <br />
work on ordinary lists. Sadly, rewriting them all to also work on <br />
NonEmptyList or MySpecialInvariantList is a nontrivial task.<br />
</blockquote><br />
<br />
That's an excellent question. Indeed, let us assume we have a function<br />
<haskell><br />
foo:: [a] -> [a]<br />
</haskell><br />
(whose code, if available, we'd rather not change) and we want to<br />
write something like<br />
<haskell><br />
\l -> [head l, head (foo l)]<br />
</haskell><br />
To use the safe <hask>head</hask> from NList.hs , we should write<br />
<haskell><br />
\l -> indeedFL l onempty (\l -> [head l, head (foo l)])<br />
</haskell><br />
But that doesn't type: first of all, <hask>foo</hask> applies to <br />
<hask>[a]</hask> rather than<br />
<hask>FullList a</hask>, and second, the result of <br />
<hask>foo</hask> is not <hask>FullList a</hask>, required<br />
by our <hask>head</hask>. The first problem is easy to solve: we can always<br />
inject <hask>FullList a</hask> into the general list: <br />
<hask>fromFL</hask>. We insist on writing<br />
the latter function explicitly, which keeps the type system simple,<br />
free of subtyping and implicit coercions. One may regard<br />
<hask>fromFL</hask> as an<br />
analogue of <hask>fromIntegral</hask> -- which, too, we have to <br />
write explicitly, in any code with more than one sort of integral <br />
numbers (e.g., Int and Integer, or Int and CInt).<br />
<br />
If we are not sure if our function foo maps non-empty lists<br />
to non-empty lists, we really should handle the empty list case:<br />
<haskell><br />
\l -> indeedFL l onempty $<br />
\l -> [head l, indeedFL (foo $ fromFL l) onempty' head]<br />
</haskell><br />
If we have a hunch that foo maps non-empty lists to non-empty lists,<br />
but we are too busy to verify it, we can write<br />
<haskell><br />
\l -> indeedFL l onempty $<br />
\l -> [head l, indeedFL (foo $ fromFL l) <br />
(error msg)<br />
head]<br />
where msg = "I'm quite sure foo maps non-empty lists to " ++<br />
"non-empty lists. I'll be darned if it doesn't."<br />
</haskell><br />
That would get the code running. Possibly at some future date (during<br />
the code review?) I'll be called to justify my hunch, to whatever<br />
degree of formality (informal argument, formal proof) required by the<br />
policies in effect. If I fail at this justification, I'd better think<br />
what to do if the result of foo is really the empty list. If I<br />
succeed, I'd be given permission to update the module NList with the<br />
following definition<br />
<haskell><br />
nfoo (FullList x) = FullList $ foo x<br />
</haskell><br />
after which I could write<br />
<haskell><br />
\l -> indeedFL l onempty (\l -> [head l, head (nfoo l)])<br />
</haskell><br />
with no extra empty list checks.<br />
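To make the last fragment runnable on its own, here is a sketch that inlines just enough of the NList module and picks a concrete <hask>foo</hask>. We chose <hask>foo = reverse</hask> (which happens to preserve non-emptiness), and renamed <hask>head</hask> to <hask>hd</hask> to avoid hiding the Prelude; both choices are ours, not part of the original library.<br />

```haskell
newtype FullList a = FullList [a]

fromFL :: FullList a -> [a]          -- injection into general lists
fromFL (FullList x) = x

indeedFL :: [a] -> w -> (FullList a -> w) -> w
indeedFL x on_empty on_full
  | null x    = on_empty
  | otherwise = on_full (FullList x)

hd :: FullList a -> a                -- total: a FullList is never empty
hd (FullList (x:_)) = x

foo :: [a] -> [a]
foo = reverse                        -- stand-in for the unknown library function

-- the pattern from the text: one null check, then no more
demo :: [Int] -> [Int]
demo l = indeedFL l [] $
         \l -> [hd l, indeedFL (foo (fromFL l)) (error "foo broke its promise") hd]
```

For example, <hask>demo [1,2,3]</hask> gives <hask>[1,3]</hask>, and <hask>demo []</hask> gives <hask>[]</hask> without ever touching the <hask>error</hask> branch.<br />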
<br />
Excerpted from the discussion on Haskell-Cafe, November 2006.<br />
<br />
<br />
[[Category:Idioms]]</div>Imz