HaskellWiki - User contributions [en]

Haskell Quiz/The Solitaire Cipher/Solution Thiago Arrais

2007-10-30T18:02:38Z

ThiagoArrais:

[[Category:Haskell Quiz solutions|Solitaire Cipher]]

<haskell>
module Main where

import Char(chr, isAlpha, ord, toUpper)
import List(intersperse)
import System.Environment(getArgs, getProgName)

toUpperCase = map toUpper
toLetter n = chr $ ord 'A' + (n - 1) `mod` 26
toNumber l = ord l - ord 'A' + 1

split5 cs
| length cs > 5 = (take 5 cs) : split5 (drop 5 cs)
| otherwise = [cs]

fill cs = cs ++ replicate (5 - length cs) 'X'

-- Filters alpha characters and splits them into groups of five
sanitize:: String -> [String]
sanitize cs = reverse $ (fill.head) rchunks : tail rchunks
where rchunks = reverse.split5.filter isAlpha.toUpperCase $ cs

unkeyeddeck :: [Int]
unkeyeddeck = [1..54]

jokerA = 53
jokerB = 54
isJoker = (flip elem) [jokerA, jokerB]

-- Pushes a card (j) down once
push' j xs = if length right > 0
then left ++ head right : j : tail right
else head left : j : tail left
where (left,_:right) = break (== j) xs

-- Pushes a card (j) down a given number (n) of times
push j n = (!! n) . iterate (push' j)

pushJokerA = push jokerA 1
pushJokerB = push jokerB 2

-- Performs a triplecut around the first two cards that satisfy a predicate (p)
tripleCut p xs = bottom ++ j1 : (middle ++ j2 : top)
where (top,j1:b1) = break p xs
(middle,j2:bottom) = break p b1

countCut n xs = (reverse.tail $ rbottom) ++ top ++ [head rbottom]
where top = take n xs
rbottom = reverse.drop n $ xs

-- Performs a count cut by the number written on the bottom card
deckCut xs = countCut (last xs) xs

valueFor 54 = 53 -- B joker's value is 53
valueFor n = n

-- Shuffles the deck once
nextDeck = deckCut.tripleCut isJoker.pushJokerB.pushJokerA

-- Shuffles the deck once and extracts the resulting letter
stepStream :: (String, [Int]) -> (String, [Int])
stepStream (_, oldDeck) = (letter $ number newDeck, newDeck)
where newDeck = nextDeck oldDeck
number deck@(n:_) = deck !! valueFor n
letter n = if isJoker n then "" else toLetter n : []

-- The keystream generated by an unkeyed deck
keystream = concat [c | (c,_) <- tail $ iterate stepStream ([], unkeyeddeck)]

join = concat.intersperse " "

-- Combines an input string (xs) and the default keystream by applying the
-- given operation (f). This is the function that does the encoding/decoding
codeWith f xs = join.sanitize.letterize $
zipWith f (numberize letters) (numberize keyletters)
where keyletters = take (length letters) keystream
numberize = map toNumber
letterize = map toLetter
letters = concat $ sanitize xs

encode, decode :: String -> String
encode = codeWith (+)
decode = codeWith (-)

-- An action that applies the coding function (f) to a set of words
-- and prints the resulting code
printCode f = putStrLn . f . join

main = do args <- getArgs
case args of
("d":ws@(_:_)) -> printCode decode ws
("e":ws@(_:_)) -> printCode encode ws
_ -> getProgName >>=
\n -> putStrLn $ "Usage: " ++ n ++ " <d/e> <phrase>"
</haskell>

Haskell Quiz/The Solitaire Cipher

2007-10-30T18:01:23Z

ThiagoArrais:

The first puzzle of the rubyquiz series was to implement the Solitaire cipher [http://en.wikipedia.org/wiki/Solitaire_(cipher)] Bruce Schneier made for Neil Stephenson's Cryptonomicon [http://en.wikipedia.org/wiki/Cryptonomicon]. The twist is that it's designed to be done by a Spy in a containment camp with no other tools than a deck of bridge cards.

When creating a page, be sure to categorise it as code, with a <nowiki>[[Category:Haskell Quiz]]</nowiki> tag.

==The problem==

* http://www.rubyquiz.com/quiz1.html
* http://www.schneier.com/solitaire.html

==Solutions==

* [[Haskell Quiz/The Solitaire Cipher/Solution Dolio|Dan Doel]]
* [[Haskell Quiz/The Solitaire Cipher/Solution Matthias|Matthias]] (incomplete, but stream generation works)
* [[Haskell Quiz/The Solitaire Cipher/Solution Paul|Paul]] ([http://mult.ifario.us/articles/2006/10/25/solitaire-cipher-in-haskell accompanying narrative])
* [[Haskell Quiz/The Solitaire Cipher/Solution Burton|Jim]]
* [[Haskell Quiz/The Solitaire Cipher/Solution Igloo|Igloo]]
* [[Haskell Quiz/The Solitaire Cipher/Solution JFoutz|JFoutz]]
* [[Haskell Quiz/The Solitaire Cipher/Solution Thiago Arrais|Thiago Arrais]]

[[Category:Haskell Quiz|Solitaire Cipher]]

The Monadic Way

2006-08-26T13:02:52Z

ThiagoArrais:

==An evaluation of Philip Wadler's "Monads for functional programming"==

This tutorial is a "translation" of Philip Wadler's "Monads for
functional programming".
(avail. from [http://homepages.inf.ed.ac.uk/wadler/topics/monads.html here])

I'm a Haskell newbie trying to grasp such a difficult concept as the
ones of Monad and monadic computation.
While [http://www.cs.utah.edu/~hal/htut/ "Yet Another Haskell Tutorial"]
gave me a good understanding of the type system when it
comes to monads I find it almost unreadable.

But I had also Wadler's paper, and started reading it. Well, just
wonderful! It explains how to ''create'' a monad!

So I decided to "translate it", in order to clarify to myself the
topic. And I'm now sharing this traslation (not completed yet) with
the hope it will be useful to someone else.

Moreover, that's a wiki, so please improve it. And, specifically,
correct my poor English. I'm Italian, afterall.

==A Simple Evaluator==

Let's start with something simple: suppose we want to implement a new
programming language. We just finished with
[http://swiss.csail.mit.edu/classes/6.001/abelson-sussman-lectures/ Abelson and Sussman's Structure and Interpretation of ComputerPrograms]
and we want to test what we have learned.

Our programming language will be very simple: it will just compute the
sum operation.

So we have just one primitive operation (Add) that takes two constants
and calculates their sum

For instance, something like:

(Add (Con 5) (Con 6))

should yeld:

11

We will implement our language with the help of a data type
constructor such as:

<haskell>

module MyMonads where
data Term = Con Int
| Add Term Term
deriving (Show)

</haskell>

After that we build our interpreter:

<haskell>

eval :: Term -> Int
eval (Con a) = a
eval (Add a b) = eval a + eval b

</haskell>

That's it. Just an example:

*MyMonads> eval (Add (Con 5) (Con 6))
11
*MyMonads>

Very very simple. The evaluator checks if its argument is a Cons: if
it is it just returns it.

If it's not a Con, but it is a Term, it evaluates the right one and
sums the result with the result of the evaluation of the second term.

== Some Output, Please!==

Now, that's fine, but we'd like to add some features, like providing
some output, to show how the computation was carried out.
Well, but Haskell is a pure functional language, with no side effects,
we were told.

Now we seem to be wanting to create a side effect of the computation,
its output, and be able to stare at it...
If we had some global variable to store the out that would be
simple...

But we can create the output and carry it along the computation,
concatenating it with the old one, and present it at the end of the
evaluation together with the evaluation of the expression!

Simple and neat!

<haskell>

type MOut a = (a, Output)
type Output = String

formatLine :: Term -> Int -> Output
formatLine t a = "eval (" ++ show t ++ ") <= " ++ show a ++ " - "

evalO :: Term -> MOut Int
evalO (Con a) = (a, formatLine (Con a) a)
evalO (Add t u) = ((a + b),(x ++ y ++ formatLine (Add t u) (a + b)))
where (a, x) = evalO t
(b, y) = evalO u

</haskell>

Now we have what we want. But we had to change our evaluator quite a
bit. First we added a function, that takes a Term (of the expression
to be evaluated), an Int (the result of the evaluation) and gives back
an output of type Output (that is a synonymous of String).

The evaluator changed quite a lot! Now it has a different type: it
takes a Term data type and produces a new type, we called MOut, that
is actually a pair of a variable type a (an Int in our evaluator) and
a type Output, a string.

So our evaluator, now, will take a Term (the type of the expressions
in our new programming language) and will produce a pair, composed of
the result of the evaluation (an Int) and the Output, a string.

So far so good. But what's happening inside the evaluator?

The first part will just return a pair with the number evaluated and
the output formatted by formatLine.

The second part does something more complicated: it returns a pair
composed by
1. the result of the evaluation of the right Term summed to the result
of the evaluation of the second Term
2. the output: the concatenation of the output produced by the
evaluation of the right Term, the output produced by the evaluation of
the left Term (each this evaluation returns a pair with the number and
the output), and the formatted output of the evaluation.

Let's try it:
*MyMonads> evalO (Add (Con 5) (Con 6))
(11,"eval (Con 5) <= 5 - eval (Con 6) <= 6 - eval (Add (Con 5) (Con 6)) <= 11 - ")
*MyMonads>

It works! Let's put the output this way:
eval (Con 5) <= 5 -
eval (Con 6) <= 6 -
eval (Add (Con 5) (Con 6)) <= 11 -

Great! We are able to produce a side effect of our evaluation and
present it at the end of the computation, after all.

Let's have a closer look at this expression:
<haskell>

evalO (Add t u) = ((a + b),(x ++ y ++ formatLine (Add t u) (a + b)))
where (a, x) = evalO t
(b, y) = evalO u

</haskell>

Why all that? The problem is that we need "a" and "b" to calculate their
sum, together with the output coming from their calculation (to be
concatenated by the expression x ++ y ++ formatLine ...).

So we need to separate the pairs produced by "evalO t" and "eval u"
(remember: eval now produces a value of type M Int, i.e. a pair of an
Int and a String!).

== Let's Go Monadic==

Is there a more general way of doing so?

Let's analyze the evaluator from another perspective. From the type
perspective.

We solved our problem by creating a new type, a pair of an Int (the
result of the evaluation) and a String (the output of the process of
evaluation).

The first part of the evaluator does nothing else but creating, from
a value of type Int, an object of type M Int (Int,Output). It does so
by creating a pair with that Int and some text.

The second part evaluates the two Term(s) and "stores" the values thus
produced in some variables to be use later to compute the output.

Let's focus on the "stores" action. The correct term should be
"binds".

Take a function:
<haskell>
f x = x + x
</haskell>
"x" appears on both sides of the expression. We say that on the right
side "x" is bound to the value of x given on the left side.

So
<haskell>
f 3
</haskell>
binds x to 3 for the evaluation of the expression "x + x".

Our evaluator binds "a" and "x" / "b" and "y" with the evaluation of
"eval t" and "eval u" respectively. "a","b","x" and "y" will be then
used in the evaluation of ((a+)(x ++ ...).

We know that there is an ad hoc operator for binding variables to a
value: lambda, or \.

Indeed f x = x + x is syntactic sugar for:
<haskell>
f = \x -> x + x
</haskell>
When we write f 3 we are actually binding "x" to 3 within what's next
"->", that will be used (substituted) for evaluating f 3.

So we can try to abstract this phenomenon.

What we need is a function that takes our composed type MOut Int and a
function in order to produce a new MOut Int, concatenating the
output of the computation of the first with the output of the
computation of the second.

This is what bindM does:

<haskell>

bindM :: MOut a -> (a -> MOut b) -> MOut b
bindM m f = (b, x ++ y)
where (a, x) = m
(b, y) = f a

</haskell>

It takes:
* "m": the compound type MOut Int carrying the result of an "eval Term",
* a function "f". This function will take the Int ("a") extracted by the evaluation of "m" ((a,x)=m). This function will produce anew pair: a new Int produced by a new evaluation; some new output.

bindM will return the new Int in pair with the concatenated outputs
resulting from the evaluation of "m" and "f a".

So let's write the new version of the evaluator:

<haskell>

evalM_1 :: Term -> MOut Int
evalM_1 (Con a) = (a, formatLine (Con a) a)
evalM_1 (Add t u) = bindM (evalM_1 t) (\a ->
bindM (evalM_1 u) (\b ->
((a + b), formatLine (Add t u) (a + b))
)
)

</haskell>

Ugly, isn't it?

Let's start from the outside:

<haskell>
bindM (evalM_1 u) (\b -> ((a + b), formatLine (Add t u) (a + b)))
</haskell>

bindM takes the result of the evaluation "evalM_1 u", a type Mout Int,
and a function. It will extract the Int from that type and use it to
bind "b".

So in bindM (evalM_1 u)... "b" will be bound to a value.

Then the outer part (bindM (evalM_1 t) (\a...) will bind "a" to the
value needed to evaluate "((a+b), formatLine...) and produce our final
MOut Int.

We can write the evaluator in a more convinient way, now that we know
what it does:

<haskell>

evalM_2 :: Term -> MOut Int
evalM_2 (Con a) = (a, formatLine (Con a) a)
evalM_2 (Add t u) = evalM_2 t `bindM` \a ->
evalM_2 u `bindM` \b ->
((a + b), (formatLine (Add t u) (a + b)))

</haskell>

Now, look at the first part:

<haskell>
evalM_2 (Con a) = (a, formatLine (Con a) a)
</haskell>

We could use a more general way of creating some output.

First we need a method for creating someting of type M a, starting from
something of type a. This is what <hask>evalM_2 (Con a)</hask> is doing, after all.
Very simply:

<haskell>

mkM :: a -> MOut a
mkM a = (a, "")

</haskell>

We then need to "insert" some text (Output) in our type M:

<haskell>

outPut :: Output -> MOut ()
outPut x = ((), x)

</haskell>

Very simple: we have a string "x" (Output) and create a pair with a ()
instead of an Int, and the output.

This way we will be able to define also this firts part in terms of
bindM, that will take care of concatenating outputs.

So we have now a new evaluator:

<haskell>

evalM_3 :: Term -> MOut Int
evalM_3 (Con a) = outPut (formatLine (Con a) a) `bindM` \_ -> mkM a
evalM_3 (Add t u) = evalM_3 t `bindM` \a ->
evalM_3 u `bindM` \b ->
outPut (formatLine (Add t u) (a + b)) `bindM` \_ -> mkM (a + b)

</haskell>

Well, this is fine, definetly better then before, anyway.

Still we use `bindM` \_ -> that binds something we do not use (_). We
could write something for this case, when we concatenate computations
without the need of binding variables. Let's call it `combineM`:

<haskell>

combineM :: MOut a -> MOut b -> MOut b
combineM m f = m `bindM` \_ -> f

</haskell>

So the new evaluator:

<haskell>

evalM :: Term -> MOut Int
evalM (Con a) = outPut (formatLine (Con a) a) `combineM`
mkM a
evalM (Add t u) = evalM t `bindM` \a ->
evalM u `bindM` \b ->
outPut (formatLine (Add t u) (a + b)) `combineM`
mkM (a + b)

</haskell>

Let's put everything together (and change some names):

<haskell>

type MO a = (a, Out)
type Out = String

mkMO :: a -> MO a
mkMO a = (a, "")

bindMO :: MO a -> (a -> MO b) -> MO b
bindMO m f = (b, x ++ y)
where (a, x) = m
(b, y) = f a

combineMO :: MO a -> MO b -> MO b
combineMO m f = m `bindM` \_ -> f

outMO :: Out -> MO ()
outMO x = ((), x)

evalMO :: Term -> MO Int
evalMO (Con a) = outMO (formatLine (Con a) a) `combineMO`
mkMO a
evalMO (Add t u) = evalMO t `bindMO` \a ->
evalMO u `bindMO` \b ->
outMO (formatLine (Add t u) (a + b)) `combineMO`
mkMO (a + b)

</haskell>

== Some Sugar, Please!==
Now our evaluator has been completely transformed into a monadic
evaluator. That's what it is: a monad.

We have a function that constructs an object of type MO Int, formed by
a pair: the result of the evaluation and the accumulated
(concatenated) output.

The process of accumulation and the act of parting the MO Int into its
component is buried into bindM, that can also preserve some value for
later uses.

So we have:
* MO a type constructor for a type carrying a pair composed by an Int and a String;
* bindMO, that gives a direction to the process of evaluation: it concatenates computations and captures some side effects we created.
* mkOM lets us create an object of type MO Int starting from an Int.

As you see this is all we need to create a monad. In other words
monads arise from the type system. Everything else is just syntactic
sugar.

So, let's have a look to that sugar: the famous do-notation!

We will now rewrite our basic evaluator to use it with the
do-notation.

Now we have to crate a new type: this is necessary in order to use
specific monadic notation and have at our disposal the more practical
do-notation.

<haskell>

newtype Eval a = Eval a
deriving (Show)

</haskell>

So, our type will be an instance of the monad class. We will have to
define the methods of this class (>>= and return), but that will be
easy since we already done that while defining bindMO and mkMO.

<haskell>

instance Monad Eval where
return a = Eval a
Eval m >>= f = f m

</haskell>

And then we will take the old version of our evaluator and substitute
`bindMO` with >>= and `mkMO` with return:

<haskell>

evalM_4 :: Term -> Eval Int
evalM_4 (Con a) = return a
evalM_4 (Add t u) = evalM_4 t >>= \a ->
evalM_4 u >>= \b ->
return (a + b)

</haskell>

which is, in the do-notation:

<haskell>

evalM_5 :: Term -> Eval Int
evalM_5 (Con a) = return a
evalM_5 (Add t u) = do a <- evalM_5 t
b <- evalM_5 u
return (a + b)

</haskell>

Simple: do binds the result of "eval_M5 t" to a, binds the result of
"eval_M5 u" to b and then returns the sum. In a very imperative style.

We can now have an image of our monad: it is out type (Eval) that is
made up of a pair: during evaluation the first member of the pair (the
Int) will get the results of our computation (i.e.: the procedures to
calculate the final result). The second part, the String called
Output, will get filled up with the concatenated output of the
computation.

The sequencing done by bindMO (now >>=) will take care of passing to
the next evaluation the needed Int and will do some more side
calculation to produce the output (concatenating outputs resulting
from computation of the new Int, for instance).

So we can grasp the basic concept of a monad: it is like a label which
we attach to each step of the evaluation (the String attached to the
Int).
This label is persistent within the process of computation and at each
step bindMO can do some manipulation of it.
We are creating side-effects and propagating them within our monads.

Ok. Let's translate our output-producing evaluator in monadic
notation:

<haskell>

newtype Eval_IO a = Eval_IO (a, O)
deriving (Show)
type O = String

instance Monad Eval_IO where
return a = Eval_IO (a, "")
(>>=) m f = Eval_IO (b, x ++ y)
where Eval_IO (a, x) = m
Eval_IO (b, y) = f a
print_IO :: O -> Eval_IO ()
print_IO x = Eval_IO ((), x)

eval_IO :: Term -> Eval_IO Int
eval_IO (Con a) = do print_IO (formatLine (Con a) a)
return a
eval_IO (Add t u) = do a <- eval_IO t
b <- eval_IO u
print_IO (formatLine (Add t u) (a + b))
return (a + b)

</haskell>
Let's see the evaluator with output in action:
*MyMonads> eval_IO (Add (Con 6) (Add (Con 16) (Add (Con 20) (Con 12))))
Eval_IO (54,"eval (Con 6) <= 6 - eval (Con 16) <= 16 - eval (Con 20) <= 20 - eval (Con 12) <= 12 - \
eval (Add (Con 20) (Con 12)) <= 32 - eval (Add (Con 16) (Add (Con 20) (Con 12))) <= 48 - \
eval (Add (Con 6) (Add (Con 16) (Add (Con 20) (Con 12)))) <= 54 - ")
*MyMonads>

Let's format the output part:
eval (Con 6) <= 6
eval (Con 16) <= 16
eval (Con 20) <= 20
eval (Con 12) <= 12
eval (Add (Con 20) (Con 12)) <= 32
eval (Add (Con 16) (Add (Con 20) (Con 12))) <= 48
eval (Add (Con 6) (Add (Con 16) (Add (Con 20) (Con 12)))) <= 54

That's it. For today...

(TO BE CONTINUED)

Andrea Rossato
arossato AT istitutocolli.org

GHC/GHCi debugger

2006-08-17T17:27:44Z

ThiagoArrais:

This page is a dump of the designs, ideas, and results of the GHCi debugger SoC project. Please contribute with your suggestions and comments.

The project builds on top of Lemmih's work: a breakpoint function that when hit while evaluating something in GHCi, launchs a new interactive environment with the variables in the local scope of the breakpoint, allowing you to interact with them.

'''NEW:''' There is also on-going work in the documentation for these features in the ghc user guide. Here is a [http://ender4.dsic.upv.es:81/ghcdocs/ghci.html snapshot] of the ghci documentation page extended with this project.

== Intermediate Closure Viewer ==
The closure viewer is intended to permit working with polymorphic values in breakpoints, as well as to explore intermediate computations without altering the evaluation order.

This feature is now (more or less) complete. Currently it provides two new commands under ghci, ''':print''' and ''':sprint''', both used in the same way as <tt>:type</tt> or <tt>:info</tt>. The latter prints a semievaluated closure using underscores to represent suspended computations (pretty much as [[Hood]] does). The former one in addition binds these thunks to variable names, so that you can do things with them.

Example:
<pre>
Prelude> let li = map Just [1..5]
Prelude> length li
5
Prelude> :sp li
li - [_,_,_,_,_,]

Prelude> head li
Just 1

Prelude> :sp li
li - [Just 1,_,_,_,_]

Prelude> last li
Just 5

Prelude> :sp li
li - [Just 1,_,_,_Just 5]

Prelude> :p li
li - [Just 1, (_t1::Maybe Integer),(_t2::Maybe Integer),(_t3::Maybe Integer),Just 5]

Prelude> _t1 `seq` ()

Prelude> :p li
li - [Just 1, Just 2,(_t3::Maybe Integer),(_t4::Maybe Integer),Just 5]

Prelude> _t2
Just 3
</pre>

Its best feature is that it can work without type information, so you can display polymorphic objects the type of which you don't know. However if there is type information available, it is used. Thanks to this it can work with opaque or coerced types. For instance:

<haskell>
data Opaque = forall a. O a
</haskell>

<pre>
*Test2> let li = map Just [1..5]
*Test2> let o = O li
*Test2> head li `seq` ()
*Test2> length li `seq` ()
*Test2> :p o
o - [O Just 1,(_t1::Integer),(_t2::Integer),(_t3::Integer),(_t4::Integer)]
</pre>

In the example above the <tt>li</tt> inside <tt>o</tt> has an opaque existential type. However, the closure viewer makes it possible to recover its type when it gets evaluated.

Other currently proposed extensions are a <tt>safeCoerce</tt> function (not so useful, it depends on ghc-api) and an <tt>unsafeDeepSeq</tt> (this one is decoupled from ghc-api). There is also a generally useful (for compiler/tool developers) <tt>isFullyEvaluated</tt> query function. The signatures being:

<pre>
isFullyEvaluated :: a -> IO Bool
unsafeDeepSeq :: a -> b -> b
safeCoerce :: GHC.Session -> a -> Maybe b
</pre>

Finally, note that there are some inconveniences with the current implementation, such as <tt>:p</tt> binding the same closure to different names when used twice on the same closure, but they are minor and temporary (hopefully).

== Usability ==
There is plenty of work to be done in this area before the debugger can be shipped with ghc.

If you have tried the patches maybe you want to add your comments here. Please add feature requests here too.

== Dynamic Breakpoints ==

See the user details of the current implementation at the GHC User Guide. Here is a [http://ender4.dsic.upv.es:81/ghcdocs/ghci.html snapshot] of the ghci documentation page extended with this project.

=== Event sites and events ===
We define 'event sites' as points in the code where you can want to set a breakpoint. Current candidates for sites are:
* On the entrance to a function / lambda abstraction
* <strike>Prior to function applications</strike> ''(this one does not make sense unless it forces the application using <tt>$!</tt>)
* Local bindings in lets and wheres
* Entrance to statements in monadic-do code

Overlapping or unnecesary events should be coalesced into a single one.
The rationale for what is an event and what is not is trying to find a middle point between the user interests and the overhead introduced:
* We want to keep the overhead manageable, thus we want to keep the number of breakpoints low.
* The user wants to introduce breakpoints at will.

Credit goes to both A. Tolmach's ML debugger and the OCaml time-travel debugger for providing the inspiration.

=== Proposals ===
There are currently the following proposals:

* Instrument the code with a conditional breakpoint at every event site. Sites are numbered, and the condition uses a site-indexed array to check if there is a breakpoint enabled. The array is maintained inside ghci. Hopefully not much magic is required for this one.

* In the style of the previous one, but no array is maintained. All the breakpoint conditions are set to False, so almost no overhead is incurred. When the user demands a breakpoint, its BCO in the heap is rewritten to enable the breakpoint. Feasibility of this?

* Don't use instrumentation. Have a new header for BCOs with breakpoints, say BCO_BREAK, and change headers in execution time on user demand (as in the previous proposal). The problem I see with this one is how to extract the local bindings. I don't fully grok the scheme Lemmih uses to do that yet.

During this project we have explored the first one, under the lemma of ``do the simplest thing that could possibly work``.
I'm sure there are many other designs. Please add your proposal or just throw an idea in.

== Call Traces ==
We want to have ''strict'' call traces, not the lazy ones.

=== Proposals ===

* It has been suggested that stealing ideas from Cost-Centre Stacks may be useful. I need more pointers on this.
<strike>
* Based on Tolmach's debugger, we can instrument the source code to build a timeline of events (either lazily or not). The events contain a pointer to its lexical parent event. With that it should be possible to extract a call trace:
# CASE 1: We are in a Function definition (FN):
## Go back one step in the timeline: it necessarily is an application (APP)
## Go back to its 'binding', i.e. its lexical parent. Keep doing this until it is a FN, then start again from case 1.
## Once you reach the top, i.e. the 0 event, you are done. Display all theAPPs you encountered in the way
# CASE 2: We are in a site other than a FN:
## Go back [[lexically]] until you hit a FN and continue with case 1.

This is just a wild, untested idea. It's possible that it would not work. Also even if it worked, it's possible that the overhead was unadmissible.
</strike> WON'T WORK WITH LAZINESS

== Integration ==
Allowing other tools to integrate with the debugger is an important goal. It should not be taken lightly though.

* It has been suggested to create a client/server protocol so that the debugger can be used by other tools.

* On the other hand, arguably it would be much easier to provide integration to clients of the ghc-api via some form of debugger api.

* Finally, it should be possible to derive the client/server architecture as an afterthought provided there is a debugger api in the ghc-api.

== Further pointers ==

# [http://www.tekno.chalmers.se/~murk/rectus Rectus], '' Oleg Mürk and Lennart Kolmodin''
# [http://caml.inria.fr/pub/docs/manual-ocaml/manual030.html The Ocaml Debugger], ''The OCaml Team''
# [http://web.cecs.pdx.edu/~apt/jfp95.ps A debugger for Standard ML], ''A.Tolmach, A. Appel''
# [http://www.haskell.org//pipermail/cvs-ghc/2006-April/029040.html The original discussion in the ghc-cvs mailing list]

== How to get the patches ==

The patches are available at the SoC ghc.debugger [http://darcs.haskell.org/SoC/ghc.debugger/ darcs repo]:
<pre>
darcs get --partial http://darcs.haskell.org/SoC/ghc.debugger
</pre>
and build it following the instructions at the [http://hackage.haskell.org/trac/ghc GHC developers wiki].

Or simply pull them into your local ghc-6.5 repo and rebuild stage2 and base libraries.

Have fun! (and feel free to spam [mailto:mnislaih@gmail.com me] with bugs, suggestions or requests!)

Talk:GHC/GHCi debugger

2006-07-20T14:46:25Z

ThiagoArrais: cannot use debugger for functions with inferred type

How do I use the debugger on implicitly typed and/or class typed functions? Here is what I am trying:

<code>
qsort [] = []
qsort (a:as) = breakpoint $ (qsort left) ++ [a] ++ (qsort right)
where left = filter (<=a) as
right = filter (>a) as
</code>

when I execute this into the patched ghci, here is what I get:

<pre>
*Main> qsort [8, 4, 0, 3, 1, 23, 11, 18]
Local bindings in scope:
src/Main.hs:7> left

<interactive>:1:0: Not in scope: `left'
src/Main.hs:7> right

<interactive>:1:0: Not in scope: `right'
src/Main.hs:7> a

<interactive>:1:0: Not in scope: `a'
</pre>

If I try specifying the inferred type (<code>qsort :: Ord a => [a] -> [a]</code>), I still get the same result. But if I specify a concrete type for the qsort function (<code>qsort :: [Int] -> [Int]</code>), the bindings in scope are shown and I can inspect the values.

[[User:ThiagoArrais|ThiagoArrais]] 14:46, 20 July 2006 (UTC)

--------------------

Is there a mailing list or any kind of user group for this?

[[User:ThiagoArrais|ThiagoArrais]] 14:46, 20 July 2006 (UTC)

Applications and libraries/Program development

2006-07-04T02:27:21Z

ThiagoArrais: Updated the EclipseFP link

{{unknown copyright}}
{{LibrariesPage}}

== Tools for program development ==

A list of tools that are helpful when developing Haskell code.
See also the [[Libraries and tools/Compiler tools|compiler tools]] and [[Libraries and tools/Theorem provers|theorem provers]].

=== Preprocesors ===

;[http://www.cs.york.ac.uk/fp/cpphs/ cpphs]
:Cpphs is a re-implementation (in Haskell) of the C pre-processor.

;[http://repetae.net/john/computer/haskell/DrIFT DrIFT]
:DrIFT is a tool which allows derivation of instances for classes that aren't supported by the standard compilers. In addition, instances can be produced in seperate modules to that containing the type declaration. This allows instances to be derived for a type after the original module has been compiled. As a bonus, simple utility functions can also be produced from a type.

;[http://www.cs.vu.nl/Strafunski/ Strafunski]
:Strafunski is a Haskell bundle that provides support for generic programming in Haskell, based on the concept of a functional strategy. It consists of a combinator library (StrategyLib) and a precompiler (DrIFT-Strafunski).

;[http://darcs.haskell.org/~lemmih/zerothHead/ Zeroth]
:A program using Template Haskell must link with the TH library even if it contains no references to TH after it has been compiled. Zeroth is a preprocessor which allows modules to use TH without linking with the TH library. To do this, Zeroth evaluates the top level splices from a module and saves the resulting code.

=== Build systems ===

;[http://www.haskell.org/cabal Cabal]
:The Haskell Cabal is a Common Architecture for Building Applications and Libraries. It is an API distributed with GHC, NHC98, and Hugs which allows a developer to easily group together a set of modules into a package. It is the standard build system for new Haskell libraries and applications

;[http://www.cs.york.ac.uk/fp/hmake/ hmake], a Haskell-aware replacement for make
:Automatically keeps track of module dependencies (i.e. no need to write any Makefiles!). Can be used with any of the usual Haskell compilers (ghc, hbc, nhc98).

=== Tags ===

;[http://www.cl.cam.ac.uk/users/rje33/software.html HaskTags]
:Hasktags is a simple program that generates TAGS files for Haskell code. Together with a supporting editor (e.g. NEdit, XEmacs, or Vim) TAGS files can be used to quickly find the places where functions, data constructors etc. are defined.

;[http://www.dtek.chalmers.se/~d99josve/tagsh.tar.gz tagsh]
:A version of the tags program for Haskell. It uses the standardised hssource and posix library, works with GHC 5.02.1. tags file has been checked to work with vim and nedit.

=== Program Transformation ===

;[http://www.cs.kent.ac.uk/projects/refactor-fp/hare.html HaRe -- The Haskell Refactorer]
:Mechanical refactoring of Haskell code (across module boundaries). HaRe now supports many refactorings such as renaming identifiers, moving/introducing/inlining definitions, and so on. Those refactorings are not limited to a single module. HaRe can be accessed from either Vim or Emacs

;[http://www.isi.edu/~hdaume/HAllInOne/ Haskell All-In-One]
:This Haskell utility takes a program implemented in multiple modules and converts it to a single module.

;[http://wiki.di.uminho.pt/wiki/bin/view/Alcino/DrHylo DrHylo]
:Tool for deriving hylomorphisms from a restricted Haskell syntax. It is based on the algorithm first presented in the paper "Deriving Structural Hylomorphisms From Recursive Definitions" at ICFP'96 by Hu, Iwasaki, and Takeichi.

=== Integrated Development Environments ===

;[http://eclipsefp.sourceforge.net/ Haskell support for Eclipse]
:Eclipse is an open, extensible IDE platform for "everything and nothing in particular". It is implemented in Java and runs on several platforms. The Java IDE built on top of it has already become very popular among Java developers. The Haskell tools extend it to support editing (syntax coloring, code assist), compiling, and running Haskell programs from within the IDE. More features like source code navigation, module browsing etc. will be added in the future.

;[http://www.dtek.chalmers.se/~d99josve/hide/ hIDE]
:hIDE is a GUI-based Haskell IDE written using gtk+hs. It does not include an editor but instead interfaces with NEdit, vim or GNU emacs.

;[http://haskell.org/hide hIDE-2]
:Through the dark ages many a programmer has longed for the ultimate tool. In response to this most unnerving craving, of which we ourselves have had maybe more than our fair share, the dynamic trio of #Haskellaniacs (dons, dcoutts and Lemmih) hereby announce, to the relief of the community, that a fetus has been conceived: ''hIDE - the Haskell Integrated Development Environment''. So far the unborn integrates source code recognition and a chameleon editor, resenting these in a snappy gtk2 environment. Although no seer has yet predicted the date of birth of our hIDEous creature, we hope that the mere knowledge of its existence will spread peace of mind throughout the community as oil on troubled waters. See also: [[Screenshots of HIDE]] and [[HIDE]]

;[http://www.students.cs.uu.nl/people/rjchaaft/JCreator JCreator with Haskell support]
:JCreator is a highly customizable Java IDE for Windows. Features include extensive project support, fully customizable toolbars (including the images of user tools) and menus, increase/decrease indent for a selected block of text (tab/shift+tab respectively). The Haskell support module adds syntax highlighting for haskell files and winhugs, hugs, a static checker (if you double click on the error message, JCreator will jump to the right file and line and highlight it yellow) and the Haskell 98 Report as tools. Platforms: Win95, Win98, WinNT and Win2000 (only Win95 not tested yet). Size: 6MB. JCreator is a trademark of Xinox Software; Copyright © 2000 Xinox Software The Haskell support module is made by [http://www.students.cs.uu.nl/people/rjchaaft/ Rijk-Jan van Haaften].

;[http://www.haskell.org/visualhaskell Visual Haskell]
:Visual Haskell is a complete development environment for Haskell software, based on Microsoft's [http://msdn.microsoft.com/vstudio/productinfo/ Microsoft Visual Studio] platform. Visual Haskell integrates with the Visual Studio editor to provide interactive features to aid Haskell development, and it enables the construction of projects consisting of multiple Haskell modules, using the Cabal building/packaging infrastructure.

;[http://www.kdevelop.org/ KDevelop]
:This IDE supports many languages. For Haskell it [http://www.kdevelop.org/HEAD/doc/api/html/LangSupportStatus.html currently supports] project management, syntax highlighting, building (with GHC) & executing within the IDE.

;[http://haste.dyndns.org:8080/ haste - Haskell TurboEdit]
:haste - Haskell TurboEdit - is an IDE for the functional programming language Haskell, written in Haskell.

;[http://www.cs.kent.ac.uk/projects/vital/ Vital]
:Vital is a visual programming environment. It is particularly intended for supporting the open-ended, incremental style of development often preferred by end users (engineers, scientists, analysts, etc.).

;[http://www.cs.kent.ac.uk/projects/pivotal/ Pivotal]
:Pivotal 0.025 is an early prototype of a Vital-like environment for Haskell. Unlike Vital, however, Pivotal is implemented entirely in Haskell. The implementation is based on the use of the hs-plugins library to allow dynamic compilation and evaluation of Haskell expressions together with the gtk2hs library for implementing the GUI.

=== Editor modes for syntax highlighting ===

====Kate====

; Syntax highlighting files for KDE's Kate
:
* [http://www.informatik.uni-bonn.de/~ralf/software.html#syntax Files] by Ralf Hinze.
* [hs.xml hs.xml] and [lhs.xml lhs.xml] by Brian Huffman.

====NEdit====

;[http://www.nedit.org/ftp/contrib/highlighting/haskell.pats NEdit] syntax highlighting and block comment support.

====Vim====

;[http://www.vim.org vim] syntax highlighting
:
* [ftp://ftp.cse.unsw.edu.au/pub/users/dons/vim/ by Don Stewart]: for TeX and cpp style Haskell files.
* [http://urchin.earth.li/~ian/vim/ by Ian Lynagh]: distinguishes different literal Haskell styles.
* by John Williams: Both regular Haskell [haskell.vim .hs] and [lhaskell.vim .lhs] files that uncomment lines using '>' are supported.

====Textpad====

;[http://www.haskell.org/libraries/Haskell98.syn Syntax highlighting file] for [http://www.textpad.com textpad]
:by Jeroen van Wolffelaar and Arjan van IJzerdoorn, which inludes all prelude functions, datatype, constructors, etc, all in seperate groups.

====Emacs====

;[http://www.haskell.org/haskell-mode/ Haskell Mode for Emacs]
:Supports font locking, declaration scanning, documentation of types, indentation and interaction with Hugs.

;Alternative [http://www.haskell.org/libraries/hugs-mode.el Hugs Mode for Emacs] by Chris Van Humbeeck
:Provides fontification and cooperation with Hugs. Updated for emacs 20.* by Adam P. Jenkins.

====Jed====

;[http://www.astercity.net/~khaliff/haskell/haskellmode.tgz Haskell mode] {{dead link}}
:for [http://www.jedsoft.org/jed/ jed] by Marcin 'Qrczak' Kowalczyk.

====Subethaedit====

;[http://www.codingmonkeys.de/subethaedit/modes.html Haskell mode For SubEthaEdit]
: SubEthaEdit is a Mac OS X editor.

====Other====

Some other, mostly obsolete, modes are available in [http://cvs.haskell.org/cgi-bin/cvsweb.cgi/fptools/CONTRIB/haskell-modes/ CVS].

=== Typesetting Haskell ===

;[http://www.jantar.org/lambdaTeX/ lambdaTeX]
:A TeX package for typesetting literate scripts in TeX. The output looks much like the code from Chris Okasaki's book "Purely Functional Data Structures", doing syntax highlighting and converting ASCII art such as <code>-></code> or <code>alpha</code> to proper mathematical symbols. It should work with both LaTeX and plain TeX, and it does its magic without any annotations, directly on the source code (lambdaTeX uses an almost-complete Haskell lexical analyzer written entirely in plain TeX). You only have to add <code>\input lambdaTeX</code> at the top of your source file, and manually typeset your literate comments so they look as good as the source code.

;[http://www.cse.unsw.edu.au/~chak/haskell/haskell-style.html Haskell Style for LaTeX2e]
:by Manuel Chakravarty provides environments and macros that simplify setting Haskell programs in LaTeX.

;[http://www.iai.uni-bonn.de/~loeh/lhs2tex/ lhs2tex]
:A preprocessor for typesetting Haskell programs that combines some of the good features of pphs and smugweb. It generates LaTeX code from literate Haskell sources.

;[http://www.cs.uu.nl/wiki/Ehc/Shuffle Shuffle]
:another tool helping literate programming in Haskell. It helps to maintain ''views'' in a literate programming project. For example, it is among the tools used for developing a compiler in an iterative way with manuals didactically reflecting these evolving series of versions deriving from the literal code (see [http://www.cs.uu.nl/wiki/Ehc/WebHome Essential Haskell Compiler] project). Thus, Shuffle gives us the possibility for making didactically the evolution of versions visible in the documentation, when this is needed. More generally, Shuffle gives us tangling and weaving possibilities of literate programming. I think it gives a way to think of literal program development in a more abstract way by supporting the concept of views (maybe a too far analogy: version control management -- e.g. [http://abridgegame.org/darcs/ darcs] -- helps thinking of program development in a more abstract way, too). Shuffle works well together with lhs2tex.

;[http://www.acooke.org/jara/pancito/haskell.sty haskell.sty]
:A Latex style file by Andrew Cooke that makes literal programming in Haskell simple.

;[http://web.comlab.ox.ac.uk/oucl/work/ian.lynagh/Haskell2LaTeX/ Haskell2Latex]
:Ian Lynagh's Haskell2LaTeX takes a literate Haskell program, or any LaTeX document with embedded Haskell, and pretty-prints the Haskell sections within it. The most significant difference between Haskell2LaTeX and other programs with similar goals is is that Haskell2LaTeX parses the input rather than merely lexing it.

;[ftp://ftp.cs.york.ac.uk/pub/haskell/contrib/hscolour-1.1.tar.gz HsColour]
:Colourise Haskell source code in HTML or ANSI terminal screen codes.

=== Source documentation and browsing ===

;[[Haddock]] A Haskell Documentation Tool
:A tool for automatically generating documentation from annotated Haskell source code. It is primarily intended for documenting libraries, but it should be useful for any kind of Haskell code. Haddock lets you write documentation annotations next to the definitions of functions and types in the source code, in a syntax that is easy on the eye when writing the source code (no heavyweight mark-up). The documentation generated by Haddock is fully hyperlinked - click on a type name in a type signature to go straight to the definition, and documentation, for that type.

;[http://www.cse.unsw.edu.au/~chak/haskell/idoc/ IDoc] A No Frills Haskell Interface Documentation System
:IDoc extracts interface documentation and declarations from Haskell modules based on standard Haskell layout rules and a small number of clues that the programmer embeds in interface comments. These clues have been designed to be visually non-imposing when displaying the source in a text editor. Interface documentation is rendered in standard markup languages (currently, only HTML is supported). IDoc has been designed to be simple to use and install.

;[http://www.fmi.uni-passau.de/~groessli/hdoc/ HDoc]
:HDoc generates documentation in HTML format for Haskell modules. The generated documents are cross linked and include summaries and detailed descriptions for the documented functions, data types, type classes and instance declarations.

;[http://www.ida.liu.se/~jakax/haskell.html HaskellDoc]
:This program generates an HTML document showing the module interfaces of a Haskell project. Convenient links are placed for easy browsing of the different modules of the project, and for quick access to the source code.

;[http://home.conceptsfa.nl/~jwit/HaSpell.html HaSpell]
:HaSpell is a spelling and style checker for Haskell programs. It can detect spelling errors in comments in the program text, and optionally in the code itself. There is an option to detect metasyntactic variables (such as 'foo') and 'bad function prefixes' such as 'compute' and 'doThe' - these make the program less readable and generally indicate bad programming style.

=== Testing ===

;[http://hunit.sourceforge.net HUnit]
:A unit testing framework for Haskell, similar to JUnit for Java. With HUnit, the programmer can easily create tests, name them, group them into suites, and execute them, with the framework checking the results automatically. Test specification is concise, flexible, and convenient.

;[http://www.cs.chalmers.se/~rjmh/QuickCheck/ QuickCheck]
:A tool for testing Haskell programs automatically. The programmer provides a specification of the program, in the form of properties which functions should satisfy, and QuickCheck then tests that the properties hold in a large number of randomly generated cases. Specifications are expressed in Haskell, using combinators defined in the QuickCheck library. QuickCheck provides combinators to define properties, observe the distribution of test data, and define test data generators.

;[http://www.informatik.uni-freiburg.de/~wehr/haskell/ HTF - The Haskell Test Framework]
:The HTF lets you write HUnit tests and QuickCheck properties in an easy and convenient way. Additionally, the HTF provides a facility for testing programs by running them and comparing the actual output with the expected output (so called "file-based tests"). The HTF uses Template Haskell to collect all tests and properties, so you do not need to write boilerplate code for that purpose. Preprocessor macros provide you with file name and line number information for tests and properties that failed.

=== Tracing & debugging ===

Tracing gives access to otherwise invisible information about a computation. Conventional debuggers allow the user to step through the program computation, stop at given points and examine variable contents. This tracing method is quite unsuitable for Haskell, because its evaluation order is complex, function arguments are usually unwieldy large unevaluated expressions and generally
computation details do not match the user's high-level view of functions mapping values to values.

;[http://www.cs.mu.oz.au/~bjpop/buddha/ Buddha]
:Buddha is a declarative debugger for Haskell 98 programs. It presents the evaluation of a Haskell program as a series of function applications. A typical debugging session involves a series of questions and answers. The questions are posed by the debugger, and the answers are provided by the user. The implementation of Buddha is based on program transformation.

;[http://www.ida.liu.se/~henni Freja]
:A compiler for a subset of Haskell. Running a compiled program creates an evaluation dependency tree as trace, a structure based on the idea of declarative debugging from the logic programming community. A debugging session consists of the user answering a sequence of yes/no questions.

;[http://www.cs.york.ac.uk/fp/hat Hat]
:A Haskell program is first transformed by hat-trans and then compiled with nhc98 or ghc. At runtime the program writes a trace file. There are tools for viewing the trace in various ways: Hat-stack shows a virtual stack of redexes. Hat-observe shows top-level functions in the style of Hood. Hat-trail enables exploring a computation backwards, starting from (part of) a faulty output or an error message. Hat-detect provides algorithmic debugging in the style of Freja. Hat-explore allows free navigation through a computation similar to traditional debuggers and algorithmic debugging and slicing.

;[http://www.haskell.org/hood Hood]
:A library that permits to observe data structures at given program points. It can basically be used like print statements in imperative languages, but the lazy evaluation order is not affected and functions can be observed as well.

;[http://www.cs.ukc.ac.uk/people/staff/cr3/toolbox/haskell/ GHood]
:"Graphical Hood" - a Java-based graphical observation event viewer, building on Hood.

;[http://www.cs.mu.oz.au/~bjpop/code.html highWaterMark]
:A library for determining the amount of memory allocated at any point by a GHC program.

;[http://www.cs.mu.oz.au/~bjpop/code.html GHC Internals library]
:A GHC library for polymorphically deconstructing heap objects from within Haskell code.

;[http://www.cs.mu.oz.au/~bjpop/code.html GHC Heap and Stable Table Printing library]
:Two libraries for GHC. The first is for printing heap objects from within Haskell or C code. The second is for dumping the contents of the Stable Table which is used for Stable Pointers and Stable Names.

=== Bug tracking ===

;[http://urchin.earth.li/darcs/ian/bts/ Bark]
:Bark is a bug tracking system written in Haskell

=== Miscellaneous ===

;[http://www.haskell.org/hoogle/ Hoogle]
:Hoogle is a Haskell API search engine. It allows you to search for a function in the standard libraries by either name, or by approximate type signature.

;[[Lambdabot]]
:Once an IRC bot, this has grown to be a large, ad-hoc collection of Haskell development tools available for offline use. In particular, automatic point-free refactoring is available via a vim interface, as well as access to [[Hoogle]], djinn, ghci, and much much more.

;[http://www.cse.unsw.edu.au/~dons/darcs-graph.html darcs-graph]
:a tool for generating graphs of commit activity for darcs repositories.

;[http://www.cse.unsw.edu.au/~chak/haskell/VersionTool/ VersionTool]
:a small utility that:
* extracts version information from Cabal files,
* maintains version tags in darcs,
* computes patch levels by querying darcs,
* extracts the current context from darcs, and
* adds all this information to a source file

;[http://www.haskell.org/pipermail/haskell/2006-June/018043.html Kamiariduki]
:a system to judge your derivative work's purpose and license is valid with Creative Commons License Works.

=== Formal methods ===

See
* [[Analysis and design|analysis and design methods]]
* [[Libraries and tools/Theorem provers|theorem provers]].

=== Dead ===

;[http://web.archive.org/web/*/http://www.numeric-quest.com/haskell/explorer/browser.html The Haskell Module Browser]<em>(since 10/06/2003 via Internet Archive)</em>
:A browser similar to Smaltalk and Eiffel class browsers.

User:ThiagoArrais

2006-06-30T17:14:57Z

ThiagoArrais:

I am a programmer from Recife, Brazil. My main interests are on developing tools for software development. [http://thiagoarrais.blogspot.com Mergulhando no Caos] is my blog (in Portuguese) about sotware development and technology in general.

I hereby license all my contributions to this wiki under the simple permissive license on [[HaskellWiki:Copyrights]].

== Haskell related projects ==

I am currently developing [[EclipseFP]], a development environment for Haskell based on the [http://www.eclipse.org Eclipse] platform.

EclipseFP

2006-06-30T17:12:29Z

ThiagoArrais:

''EclipseFP'' is an integrated environment for development in Haskell. Our goal is to provide a full-featured IDE that mimics the Eclipse JDT (the Java IDE which is flagship of the Eclipse project). Some planned and implemented features are support for code assistance, auto-building, refactoring and structural search.

The homepage for details and download is [http://eclipsefp.sourceforge.net/ http://eclipsefp.sourceforge.net/].

We welcome any suggestions from the community. You can add your feature request to the following list, if you'd like.

* ...

EclipseFP

2006-06-30T17:11:32Z

ThiagoArrais:

EclipseFP is an integrated environment for development in Haskell. Our goal is to provide a full-featured IDE that mimics the Eclipse JDT (the Java IDE which is flagship of the Eclipse project). Some planned and implemented features are support for code assistance, auto-building, refactoring and structural search.

The homepage for details and download is [http://eclipsefp.sourceforge.net/ http://eclipsefp.sourceforge.net/].

We welcome any suggestions from the community. You can add your feature request to the following list, if you'd like.

* ...

User:ThiagoArrais

2006-06-30T16:48:39Z

ThiagoArrais:

I am a programmer from Recife, Brazil. My main interests are on developing tools for software development.

I hereby license all my contributions to this wiki under the simple permissive license on [[HaskellWiki:Copyrights]].

== Haskell related projects ==

I am currently developing [[EclipseFP]], a development environment for Haskell based on the [http://www.eclipse.org Eclipse] platform.

IO inside

2006-06-30T16:36:00Z

ThiagoArrais:

Haskell I/O always was a source of confusion and surprises for new
Haskellers. While simple I/O code in Haskell looks very similar to
it's equivalents in imperative languages, our attempts to write
somewhat more complex often ended with total head mess. That is
because Haskell I/O is really very different internally. Haskell is a
pure language and even I/O system don't break this law.

The following text is an attempt to explain details of Haskell I/O
implementation that should help you eventually master all the smart
I/O tricks. Moreover, i added detailed explanation of various traps
you can encounter on this way. After reading this text, you will get a
degree "A Master of Haskell I/O" that is equal to Bachelor in CS and
Mathematics, simultaneously :)

If you are new to Haskell I/O you may prefer to start with reading [[Introduction to IO]] page

== Haskell is pure language ==

Haskell is pure language, which means that result of any function call
is fully determined by its arguments. Pseudo-functions like rand() or
getchar() in C which returns different results on each call, are just
impossible and prohibited by language rules. Moreover, Haskell
functions can't have side effects, i.e. they cannot make any changes in the "real
world" - change files, write to the screen, print, send data over the network,
and so on. These two restrictions together mean that any function
call can be omitted, repeated, or replaced by the result of a previous call with the
same parameters, and the language _guarantees_ that all
these rearrangements will not change program result!

Let's compare this to C - compilers for this language just try to guess
that function don't have side effects and it's result don't depends on
some global variables. If this guess is wrong - the whole optimization
becomes incorrect! As a consequence, C optimizers are enough
conservative in their guesses and/or require from programmer to give
them hints about usage (not meaning!) of functions and variables

Comparing to them, Haskell compiler is a set of pure mathematical
transformations that can't be wrong by definition - they just
translate one abstract data processing algorithm (i.e. some complex
function) to another equivalent algorithm, just with better
performance. This results in much better high-level optimization
facilities comparing to C compilers

But this purity creates it's own problems. How we can do I/O, work
with stateful algorithms and side effects in pure language? This
question had many different solutions probed in 18 years of Haskell
existence and finally one based on using monads was widely accepted

== What is the monad? ==

What is the monad? It's something from mathematical category theory, i
don't know anymore :) In order to understand how monads are used to
solve problem of I/O and side effects, you don't need to know it. It's
enough to just know elementary mathematics, like I do :)

Let's imagine that we want to implement in Haskell well-known
'getchar' function. What the type it should have? Let's try:

<haskell>
getchar :: Char

get2chars = [getchar,getchar]
</haskell>

What we will got with 'getchar' having just 'Char' type? You can see
all the possible problems in 'get2chars' definition:

1) because Haskell compiler treats all functions as pure and not
having side effects, it can avoid "excessive" call to 'getchar' and
use one returned value two times

2) even if it will make two calls, there is no any clue to determine
which call should be performed first. Do you want to return chars in
the order they read, or in opposite order? Nothing in 'get2chars'
definition answers this question.

How these problems can be solved, from plain programmer's viewpoint?
Let's introduce fake parameter of 'getchar' to make each call
"different" from the compiler's point of view:

<haskell>
getchar :: Int -> Char

get2chars = [getchar 1, getchar 2]
</haskell>

This right away solved the first problem mentioned above - now
compiler will make two calls because it sees them as having different
parameters. The whole 'get2chars' function should also had such
fake parameter, otherwise we will have the same problem calling it:

<haskell>
getchar :: Int -> Char
get2chars :: Int -> String

get2chars _ = [getchar 1, getchar 2]
</haskell>

Now, we need to give the compiler some clue to determine which function it
should call first. The Haskell language doesn't provide any way to express
order of evaluation... except for data dependencies! How about adding
artificial data dependency which prevents evaluation of second
'getchar' before the first one? In order to achieve this, we will
return from 'getchar' additional fake result that will be used as
parameter for next 'getchar' call:

<haskell>
getchar :: Int -> (Char, Int)

get2chars _ = [a,b] where (a,i) = getchar 1
(b,_) = getchar i
</haskell>

So bad so good - now we can guarantee that 'a' is read before 'b'
because 'b' reading need value (i) that is returned by 'a' reading!

We've added fake parameter to 'get2chars' but the problem is that the
Haskell compiler is too smart! It can believe that the external 'getchar'
function is really dependent on it's parameter but for 'get2chars' it
will see that we just cheating and throw it away! Problem? How about
passing this fake parameter to 'getchar' function?! In this case
compiler can't guess that it is really unused :)

<haskell>
get2chars i0 = [a,b] where (a,i1) = getchar i0
(b,i2) = getchar i1
</haskell>

And more - 'get2chars' has all the same purity problems as 'getchar'
function. If one need to call it two times, he need a way to describe
order of these calls. Look at:

<haskell>
get4chars = [get2chars 1, get2chars 2] -- order of `get2chars` calls isn't defined
</haskell>

We already know how to fight with such problem - 'get2chars' should
also return some fake value that can be used to order calls:

<haskell>
get2chars :: Int -> (String, Int)

get4chars i0 = (a++b) where (a,i1) = get2chars i0
(b,i2) = get2chars i1
</haskell>

But what the fake value it would return? If we will use some integer
constant, too smart Haskell compiler will guess we are cheating, again :)
What about returning the value returned by 'getchar'? See:

<haskell>
get2chars :: Int -> (String, Int)
get2chars i0 = ([a,b], i2) where (a,i1) = getchar i0
(b,i2) = getchar i1
</haskell>

Believe you or not, but we just constructed the whole "monadic"
Haskell I/O system.

== Welcome to RealWorld, baby :) ==

The 'main' Haskell function has the type:

<haskell>
main :: RealWorld -> ((), RealWorld)
</haskell>

where 'RealWorld' is faking type used instead of our Int. It is something
like baton passed in relay-race. When 'main' calls some IO function,
it pass the "RealWorld" it received as parameter. All IO functions have
similar types involving RealWorld as parameter and result. To be
exact, "IO" is a type synonym defined in the following way:

<haskell>
type IO a = RealWorld -> (a, RealWorld)
</haskell>

so. 'main' just has type "IO ()", 'getChar' has type "IO Char" and so
on. Let's look at 'main' calling 'getChar' two times:

<haskell>
getChar :: RealWorld -> (Char, RealWorld)

main :: RealWorld -> ((), RealWorld)
main world0 = let (a, world1) = getChar world0
(b, world2) = getChar world1
in ((), world2)
</haskell>

Look at this closely: 'main' passes to first 'getChar' the "world" it
received. This 'getChar' returns some new value of type RealWorld,
that is used in next call. Finally, 'main' returns the "world" it got
from the second 'getChar':

1) Is it possible here to omit any call of 'getChar' if the char
it read is not used? No, because we should return the "world" that is
result of second 'getChar' and in turn requires "world" from first 'getChar'.

2) Is it possible to reorder 'getChar' calls? No, second 'getChar'
can't be called before first one because it uses "world" it returns.

3) Is it possible to duplicate calls? In Haskell semantics - yes, but
real compilers never duplicate work in such simple cases (otherwise,
the programs generated will not have any speed guarantees)

As we already said, RealWorld values are used like baton, passing them
between all routines called by 'main' in strict order. Inside each
routine called, RealWorld values used in the same way. In whole, in
order to "compute" world to be returned from 'main', we should perform
each IO procedure that is called from 'main', directly or indirectly.
This means that each procedure inserted in the chain will be performed
just at the moment (relative to other IO actions) when we planned it
to be called. Let consider the following program:

<haskell>
main = do a <- ask "What is your name?"
b <- ask "How old are you?"
return ()

ask s = do putStr s
readLn
</haskell>

Now you have enough knowledge to rewrite it in low-level way and
check that each operation what should be performed, will be really
performed with arguments it should have and in order we expecting

But what about conditional execution? No problem. Let's define the
well-known 'when' operation:

<haskell>
when :: Bool -> IO () -> IO ()
when condition action world =
if condition
then action world
else ((), world)
</haskell>

As you can see, we can easily include or exclude from execution chain
IO procedures (actions) depending on the data values. If 'condition'
will be False on call of 'when', 'action' will never be called because
real Haskell compilers, again, never calls functions whose results
don't required to calculate final result (i.e., here, final "world" value
of 'main')

Loops and any more complex control structures can be implemented in
the same way. Try it as an exercise!

Finally you may want to know how much costs this passing of RealWorld
values all around. It's free! These fake values exist for compiler
only while it analyze and optimize code, but when it goes to assembler
code generation, it "suddenly" realize that this type is like "()", so
all these parameters and result values can be omitted from generated code.
Is it not really beautiful? :)

== '>>=' and 'do' notation ==

All beginners (including me :) start by thinking that 'do' is some
magic statement that executes IO actions. It's wrong - 'do' is just a
syntax sugar that simplifies writing of IO procedures. 'do' notation
is finally translated to the statements passing "world" values like
we manually written above and need only to simplify gluing of several
IO actions together. You don't require to use 'do' for just one statement:

<haskell>
main = do putStr "Hello!"
</haskell>

is desugared just to:

<haskell>
main = putStr "Hello!"
</haskell>

But nevertheless it's a Good Style to use 'do' even for one statement
because it simplifies adding new statements in the future.

Let's examine how desugared 'do' with multiple statements on the
following example:

<haskell>
main = do putStr "What is your name?"
putStr "How old are you?"
putStr "Nice day!"
</haskell>

'do' statement here just joins several IO actions that should be
performed sequentially. It's translated to sequential applications
of so named "binding operator", namely '>>':

<haskell>
main = (putStr "What is your name?")
>> ( (putStr "How old are you?")
>> (putStr "Nice day!")
)
</haskell>

This binding operator just combines two IO actions, executing them
sequentially by passing the "world" between them:

<haskell>
(>>) :: IO a -> IO b -> IO b
(action1 >> action2) world0 =
let (a, world1) = action1 world0
(b, world2) = action2 world1
in (b, world2)
</haskell>

If such way to define operator looks strange for you, read this
definition as the following:

<haskell>
action1 >> action2 = action
where
action world0 = let (a, world1) = action1 world0
(b, world2) = action2 world1
in (b, world2)
</haskell>

Now you can substitute definition of '>>' at the places of it's usage
and check that program constructed by 'do' desugaring is actually the
same as we can write by manually manipulating "world" values.

More complex example involves binding of variable using "<-":

<haskell>
main = do a <- readLn
print a
</haskell>

This code is desugared into:

<haskell>
main = readLn
>>= (\a -> print a)
</haskell>

As you should remember, '>>' binding operator silently ignores
value of it's first action and returns as an overall result just
result of second action. On the other side, '>>=' allows to use value
of it's first action - it's passed as additional parameter to the second one!
Look at the definition:

<haskell>
(>>=) :: IO a -> (a->IO b) -> IO b
(action1 >>= action2) world0 =
let (a, world1) = action1 world0
(b, world2) = action2 a world1
in (b, world2)
</haskell>

First, what means type of second action, namely "a->IO b"? By
substituting the "IO" definition, we get "a -> RealWorld -> (b, RealWorld)".
This means that second action actually has two parameters
- of type 'a' actually used inside it, and of type RealWorld used for
sequencing of IO actions. That's a destiny - any IO procedure has one
more parameter comparing to that you see in it's type signature. This
parameter is hidden inside the definition of type alias "IO".

Second, you can use these '>>' and '>>=' operations to simplify your
program. For example, in the code above we don't need to introduce the
variable, because 'readLn' result can be send directly to 'print':

<haskell>
main = readLn >>= print
</haskell>

And third - as you see, the notation:

<haskell>
do x <- action1
action2
</haskell>

where 'action1' has type "IO a" and 'action2' has type "IO b",
translated into:

<haskell>
action1 >>= (\x -> action2)
</haskell>

where second argument of '>>=' has the type "a->IO b". It's the way
how the "<-" binding processed - it just becomes parameter of
subsequent operations represented as one large IO action. Look at the
next example:

<haskell>
main = do putStr "What is your name?"
a <- readLn
putStr "How old are you?"
b <- readLn
print (a,b)
</haskell>

This code is desugared into:

<haskell>
main = putStr "What is your name?"
>> readLn
>>= \a -> putStr "How old are you?"
>> readLn
>>= \b -> print (a,b)
</haskell>

I omitted parentheses here, both '>>' and '>>=' operations are
left-associative that leads to that 'a' and 'b' bindings introduced
here is valid for all remaining actions. As an exercise, add the
parentheses yourself and translate this procedure into the low-level
code passing "world" values. I think it should be enough to finally
realize how 'do' translation and binding operators work.

Oh, no. I forgot third monadic operator - 'return'. It just
combines it's two parameters - value passed and "world":

<haskell>
return :: a -> IO a
return a world0 = (a, world0)
</haskell>

How about translating some simple example of 'return' usage? Say,

<haskell>
main = do a <- readLn
return (a*2)
</haskell>

Programmers with imperative languages background often thinks that
'return' in Haskell, like in other languages, immediately returns from
the IO procedure. As you can see in its definition (and even just
type!), such assumption is totally wrong. The only purpose of using
'return' is to "lift" some value (of type 'a') into the result of
whole action (of type "IO a") and therefore it should be used only as
last executed statements of some IO sequence. For example try to
translate the following procedure into the low-level code:

<haskell>
main = do a <- readLn
when (a>=0) $ do
return ()
print "a is negative"
</haskell>

and you will realize that 'print' statement is executed anyway. If you
need to escape from middle of IO procedure, you can use the 'if'
statement:

<haskell>
main = do a <- readLn
if (a>=0)
then return ()
else print "a is negative"
</haskell>

Moreover, Haskell layout rules allow us to use the following layout:

<haskell>
main = do a <- readLn
if (a>=0) then return ()
else do
print "a is negative"
...
</haskell>

that may be very useful for escaping from middle of longish 'do' statement.

Last exercise: implement function 'liftM' that lifts operations on
plain values to the operations on monadic ones. It's type signature:

<haskell>
liftM :: (a->b) -> (IO a -> IO b)
</haskell>

If it's too hard for you, start with the following high-level
definition and rewrite it in low-level fashion:

<haskell>
liftM f action = do x <- action
return (f x)
</haskell>

== Mutable data (references, arrays, hash tables...) ==

As you should know, all names in Haskell are bind to one fixed value.
This greatly simplify understanding of algorithms and optimization of
code, but inappropriate for some cases. Yes, there a plenty of
algorithms that is simpler to implement in terms of updatable
variables, arrays and so on. This means that the value associated with
variable, for example, can be different at different execution points,
so reading it's value can't be considered as pure function. Imagine,
for example the following code:

<haskell>
main = do let a0 = readVariable varA
_ = writeVariable varA 1
a1 = readVariable varA
print (a0,a1)
</haskell>

Looks strange? First, two calls to 'readVariable' looks the same, so
compiler can just reuse the value returned by first call. Second,
result of 'writeVariable' call isn't used so compiler can (and will!)
omit this call completely. To finish the picture, these 3 calls may be
rearranged to any order because they looking independent on each
other. What is the solution? You know - using of IO actions! IO
actions guarantees us that:

# execution order will be retained
# each action will be mandatory executed
# result of the "same" action (such as "readVariable varA") will not be reused

So, the code above really should be written as:

<haskell>
main = do varA <- newIORef 0 -- Create and initialize new variable
a0 <- readIORef varA
writeIORef varA 1
a1 <- readIORef varA
print (a0,a1)
</haskell>

Here, 'varA' got type "IORef Int" which means "variable (reference) in
IO monad holding value of type Int". newIORef creates new variable
(reference) and returns it, and then read/write actions use this
reference. Value returned by "readIORef varA" action may depend not
only on variable involved but also on the moment of performing this
operation so it can return different values on each call.

Arrays, hash tables and any other _mutable_ data structures are
defined in the same way - there is operation that creates new "mutable
value" and returns reference to it. Then special read and write
operations in IO monad are used. The following example shows example
of using mutable array:

<haskell>
import Data.Array.IO
main = do arr <- newArray (1,10) 37 :: IO (IOArray Int Int)
a <- readArray arr 1
writeArray arr 1 64
b <- readArray arr 1
print (a,b)
</haskell>

Here, array of 10 elements with 37 as initial values is created. After
reading value of first element to 'a' this element's value is changed
to 64 and then read again, to 'b'. As you can see by executing this
code, 'a' will be set to 37 and 'b' to 64.

Other state-dependent operations are also often implemented as IO
actions. For example, random numbers generator should return different
values on each call. It looks natural to give it IO-involving type:

<haskell>
rand :: IO Int
</haskell>

Moreover, when you import C routines you should be careful - if this
routine is impure, i.e. it's result depends on something in "real
world" (file system, memory contents...), internal state and so on,
you should give it IO-involving type. Otherwise, compiler can
"optimize" repetitive calls of this procedure with the same parameters! :)

For example:

<haskell>
foreign import ccall
sin :: Double -> Double
</haskell>

because 'sin' result depends only on it's argument, but

<haskell>
foreign import ccall
tell :: Int -> IO Int
</haskell>

If you will declare 'tell' as pure function (without IO) then you may
got the same position on each call! :)

== IO actions as values ==

Now you should precisely understand why it's impossible to use IO
actions inside non-IO (pure) procedures. Such procedures just don't
get a "baton", don't know any "world" value to pass to IO action.
RealWorld is abstract datatype, so they also can't construct it's
values by himself, and it's a strict type, so 'undefined' also can't
be used. So, prohibition of using IO actions inside pure procedures is
just a type trick as it is usual in Haskell :)

But while pure code can't _execute_ IO actions, it can work with them
as with any other functional values - they can be stored in data
structures, passed as parameters and returned as results, collected in
lists, and partially applied. But anyway IO action will remain
functional value because we can't apply it to the last argument - of
type RealWorld.

In order to _execute_ the IO action we need to apply it to some
RealWorld value that can be done only inside some IO procedure,
in it's "actions chain". And real execution of this action will take
place only when this procedure is called as part of process of
"calculating final value of world" for 'main'. Look at this example:

<haskell>
main = let get2chars = getChar >> getChar
((), world1) = putStr "Press two keys" world0
(answer, world2) = get2chars world1
in ((), world2)
</haskell>

Here we first bind value to 'get2chars' and then write binding
involving 'putStr'. But what is an execution order? It is not defined
by order of writing bindings, it is defined by order of processing
"world" values! You can arbitrarily reorder binding statements - in
any case execution order will be defined by dependence on passing
"world" values. Let's see how this 'main' looks in the 'do' notation:

<haskell>
main = do let get2chars = getChar >> getChar
putStr "Press two keys"
get2chars
return ()
</haskell>

As you can see, the 'let' binding that is not included in IO chain, is
translated just to 'let' statement inside the 'do' sequence. And as
you now should understand, placement of this 'let' don't has any
impact on the evaluation order, which is defined by order of passing
"world" values that is, in turn, defined by order of ordinal (non-let)
statements inside 'do'!

Moreover, IO actions like this 'get2chars' can't be executed just
because they are functions with RealWorld parameter. To execute them,
we should supply the RealWorld parameter, i.e. insert them in 'main'
chain, placing them in some 'do' sequence executed from 'main'. Until
that is done, they will be keep as any function, in partially
evaluated form. And we can work with IO actions as with any other
functions - bind them to names (like above), save them to data
structures, pass as function parameters and return as results - and
they will not be performed until you give them this magic RealWorld
parameter!

Let's try. How about defining list of IO actions?

<haskell>
ioActions :: [IO ()]
ioActions = [(print "Hello!"),
(putStr "just kidding"),
(getChar >> return ())
]
</haskell>

I used additional parentheses around each action, although they are
not really required. If you still can't belive that these actions will
not be executed until your command, just uncover this list type:

<haskell>
ioActions :: [RealWorld -> ((), RealWorld)]
</haskell>

Well, now we want to execute some of these actions. No problem, just
insert them into the 'main' chain:

<haskell>
main = do head ioActions
ioActions !! 1
last ioActions
</haskell>

Looks strange, yeah? :) Really, any IO action you write in the 'do'
statement (or use as parameter for '>>'/'>>=') is an expression
returning result of type "IO a". Typically, you use some function that
has type "x -> y -> ... -> IO a" and provide all these x, y and
so on parameters. But you are not limited to this standard scenario -
don't forget that Haskell is functional language and you are free to
compute the functional value required (recall - "IO a" is a function
type) in any possible way. Here we just extracted several functions
from the list - no problem. This functional value can also be
constructed on-the-fly, as we've done in previous example - it's also
ok. Want to see this functional value passed as the parameter - heh,
just look at the 'when' definition. Hey, we can sell, buy and rent
these IO actions as any other functional values! For example, let's
define function that executes all IO actions in the list:

<haskell>
sequence_ :: [IO a] -> IO ()
sequence_ [] = return ()
sequence_ (x:xs) = do x
sequence_ xs
</haskell>

No black magic - we just extracts IO actions from the list and inserts
them into chain of IO operations that should be performed to "compute
final world value" of entire 'sequence_' call.

With help of 'sequence_', we can rewrite our last 'main' as:

<haskell>
main = sequence ioActions
</haskell>

Haskell's ability to work with IO actions as with any other
(functional or non-functional) value allows us to define control
structures of any complexity. Try, for example, to define control
structure that repeats the action until it returns the 'False' result:

<haskell>
while :: IO Bool -> IO ()
while action = ???
</haskell>

How about returning IO action as the function result? Well, we done
this each time we defined IO procedure - they all return IO action
that need RealWorld value to be performed. While we most times just
executed them in chain of higher-level IO procedure, it's also
possible to just collect them without actual execution:

<haskell>
main = do let a = sequence ioActions
b = when True getChar
c = getChar >> getChar
putStr "'let' statements are not executed!"
</haskell>

These assigned IO procedures can be used as parameters to other
procedures, or written to global variables, or processed in some other
way, or just executed later, as we done in example with 'get2chars'.

But how about returning from IO procedure a parameterized IO action?
Let's define a procedure that returns i'th byte from file represented
as Handle:

<haskell>
readi h i = do hSeek h i AbsoluteSeek
hGetChar h
</haskell>

So bad so good. But how about procedure that returns i'th byte of file
with given name without reopening it each time?

<haskell>
readfilei :: String -> IO (Integer -> IO Char)
readfilei name = do h <- openFile name ReadMode
return (readi h)
</haskell>

As you can see, it's an IO procedure that opens file and returns...
another IO procedure that will read byte specified. But we can go
further and include 'readi' body into 'readfilei':

<haskell>
readfilei name = do h <- openFile name ReadMode
let readi h i = do hSeek h i AbsoluteSeek
hGetChar h
return (readi h)
</haskell>

Good? May be better. Why we add 'h' as 'readi' parameter if it can be
got from the environment where 'readi' now defined? Shorter will be:

<haskell>
readfilei name = do h <- openFile name ReadMode
let readi i = do hSeek h i AbsoluteSeek
hGetChar h
return readi
</haskell>

What we've done here? We've build parameterized IO action involving local
names inside 'readfilei' and returned it as the result. Now it can be
used in following way:

<haskell>
main = do myfile <- readfilei "test"
a <- myfile 0
b <- myfile 1
print (a,b)
</haskell>

Such usage of IO actions is very typical for Haskell programs - you
just construct one or more (using tuple) IO actions that your need,
with and/or without parameters, involving the parameters that your
"constructor" received, and return them to caller. Then these IO actions
can be used in rest of program without any knowledge about your
internal implementation strategies. Actually, this is used to
partially emulate OOP (to be exact, ADT) programming ideology.

For example, one of my program's modules is the memory suballocator. It
receives address and size of large memory block and returns two
procedures - one to allocate subblock of given size and second to
return allocated block back:

<haskell>
memoryAllocator :: Ptr a -> Int -> IO (Int -> IO (Ptr b),
Ptr c -> IO ())

memoryAllocator buf size = do ......
let alloc size = do ...
...
free ptr = do ...
...
return (alloc, free)
</haskell>

How this is implemented? 'alloc' and 'free' works with references
created inside this procedure. Because creation of these references is
a part of 'memoryAllocator' IO actions chain, new independent set of
references will be created for each memory block for which
'memoryAllocator' is called:

<haskell>
memoryAllocator buf size = do start <- newIORef buf
end <- newIORef (buf `plusPtr` size)
...
</haskell>

These two references (we will implement very simple memory allocator) are
read and written in 'alloc' and 'free' definitions:

<haskell>
let alloc size = do addr <- readIORef start
writeIORef start (addr `plusPtr` size)
return addr

let free ptr = do writeIORef start ptr
</haskell>

What we've defined here is just a pair of closures that is using state
available on the moment of their definition. As you can see, it's as
easy as in any other functional language, despite the Haskell's lack
of direct support for non-pure functions.

== unsafePerformIO and unsafeInterleaveIO ==

Programmers with imperative background often still looks for a ways to
execute IO actions inside the pure procedures. But that this means?
Imagine that you try to write procedure that reads contents of file
with given name:

<haskell>
readContents :: Filename -> String
</haskell>

Defining it as pure function will simplify the code that use it, i
agree. But this creates troubles for the compiler:

- first, this call is not inserted in sequence of "world
transformations", so compiler don't get a hint - at what exact moment
you want to execute this action. For example, if file contents is one
at the program start and another at the end - what contents you want
to see? Moment of "consumption" of this value don't make strong
guarantees for execution order, because Haskell see all the functions
as pure and fell free to reorder their execution as needed.

- second, attempts to read contents of file with the same name can be
factorized despite the fact that file (or current directory) can be
changed between calls. Again, Haskell looks at all the functions as
pure ones and feel free to omit excessive calls with the same
parameters.

So, implementing functions that interacts with Real World as pure ones
considered as a Bad Behavior. Good boys never do it ;)

Nevertheless, there are (semi-official) ways to use IO actions inside
of pure functions. As you should remember this is prohibited by
requiring "baton" to call IO action. Pure function don't have the baton,
but there is special procedure, that procures this baton from nowhere,
uses it to call IO action and then throws resulting "world" away!
A little low-level magic :) This very special procedure is:

<haskell>
unsafePerformIO :: IO a -> a
</haskell>

Let's look at it's (possible) definition:

<haskell>
unsafePerformIO :: (RealWorld -> (a,RealWorld)) -> a
unsafePerformIO action = let (a,world1) = action createNewWorld
in a
</haskell>

where 'createNewWorld' is internal function producing new value of
RealWorld type.

Using unsafePerformIO, you can easily write pure functions that does
I/O inside. But don't do this without real need, and remember to
follow this rule: compiler don't know that you are cheating, it still
consider each non-IO function as pure one. Therefore, all the usual
optimization rules can (and will!) be applied to it's execution. So
you must ensure that:

1) Result of each call depends only on it's arguments

2) You don't rely on side-effects of this function, which may be not
executed if it's results are not used

Let's investigate this problem deeper. Function evaluation in Haskell
are ruled by value's necessity - computed only the values that really
required to calculate final result. But that this means according to
'main' function? To "calculate final world's" value, it's required to
perform all the intermediate IO actions that included in 'main' chain.
By using 'unsafePerformIO' we call IO actions outside of this chain.
What can guarantee that they will be run? Nothing. The only case when
they will be run is if that is required to compute overall function
result (that in turn should be required to perform some action in
'main' chain). Here we return to the Haskell-natural
evaluation-on-value-need. Now you should clearly see the difference:

- IO action inside IO procedure guaranteed to execute as long as
it is inside 'main' chain - even when it's result is not used.
You directly specify order of action's execution inside IO procedure.
Data dependencies are simulated via "world" values.

- IO action inside 'unsafePerformIO' will be performed only if
result of this operation is really used. Evaluation order is not
guaranteed and you should not rely on it (except when you sure about
data dependency).

I should also say that inside 'unsafePerformIO' call you can organize
small internal chain of IO actions with help of the same binding
operators and/or 'do' sugar:

<haskell>
one = unsafePerformIO $ do var <- newIORef 0
writeIORef var 1
readIORef var
</haskell>

and in this case ALL the operations in this chain will be performed as
long as 'unsafePerformIO' result will be demanded. To ensure this,
the actual 'unsafePerformIO' implementation evaluates "world" returned
by the 'action':

<haskell>
unsafePerformIO action = let (a,world1) = action createNewWorld
in (world1 `seq` a)
</haskell>

('seq' operation strictly evaluates it's first argument before
returning the value of second one)

But there is even more strange operation - 'unsafeInterleaveIO' that
gets "official baton", makes it's piratical copy, and then run's
"illegal" relay-race in parallel with main one! I can't further say
about it's behavior without grief and indignation, it's not surprise
that this operation is widely used in such software-piratical
countries as Russia and China! ;) Don't even ask me - i will say
nothing about this dirty trick i using permanently ;)

== fixIO and 'mdo' ==

== ST monad ==

== Q monad ==

== Welcome to machine: actual [[GHC]] implementation ==

A little disclaimer: after all, i should say that i don't described
here what is a monad (i even don't know it myself) and what my
explanations shows only the one _possible_ way to implement them in
Haskell. For example, hbc Haskell compiler implements monads via
continuations. I also don't said anything about exception handling
that is natural part of "monad" concept. You can read "All about
monads" guide to learn more on these topics.

But there are a good news: first, monad understanding you've build
will work with any implementation. You just can't work with RealWorld
values directly.

Second, IO monad implementation described here is really used in GHC,
Hugs (nhc/jhc, too?) compilers. It is the really real IO definition
from GHC sources:

<haskell>
newtype IO a = IO (State# RealWorld -> (# State# RealWorld, a #))
</haskell>

It uses "State# RealWorld" type instead of our RealWorld, it uses "(# #)"
strict tuple for optimization, and it adds IO data constructor
around the type. Nevertheless, there are no principal changes. Knowing
the principle of "chaining" IO actions via fake "state of world"
values, now you can easily understand and write low-level
implementations of GHC I/O operations.

=== The [[Yhc]]/nhc98 implementation ===

<haskell>
data World = World
newtype IO a = IO (World -> Either IOError a)
</haskell>

This implementation makes the "World" disappear somewhat, and returns Either a
result "a", or if an error occurs then "IOError". The lack of the World on the
right hand side of the function can only be done because the compiler knows
special things about the IO type, and will not over optimise it.

== Further reading ==

Look at the [[Books and tutorials#Using_Monads]] page

Are you have more questions? Ask in the haskell-cafe.

IO inside

2006-06-30T16:24:28Z

ThiagoArrais:

Haskell I/O always was a source of confusion and surprises for new
Haskellers. While simple I/O code in Haskell looks very similar to
it's equivalents in imperative languages, our attempts to write
somewhat more complex often ended with total head mess. That is
because Haskell I/O is really very different internally. Haskell is a
pure language and even I/O system don't break this law.

The following text is an attempt to explain details of Haskell I/O
implementation that should help you eventually master all the smart
I/O tricks. Moreover, i added detailed explanation of various traps
you can encounter on this way. After reading this text, you will get a
degree "A Master of Haskell I/O" that is equal to Bachelor in CS and
Mathematics, simultaneously :)

If you are new to Haskell I/O you may prefer to start with reading [[Introduction to IO]] page

== Haskell is pure language ==

Haskell is pure language, which means that result of any function call
is fully determined by its arguments. Pseudo-functions like rand() or
getchar() in C which returns different results on each call, are just
impossible and prohibited by language rules. Moreover, Haskell
functions can't have side effects, i.e. they cannot make any changes in the "real
world" - change files, write to the screen, print, send data over the network,
and so on. These two restrictions together mean that any function
call can be omitted, repeated, or replaced by the result of a previous call with the
same parameters, and the language _guarantees_ that all
these rearrangements will not change program result!

Let's compare this to C - compilers for this language just try to guess
that function don't have side effects and it's result don't depends on
some global variables. If this guess is wrong - the whole optimization
becomes incorrect! As a consequence, C optimizers are enough
conservative in their guesses and/or require from programmer to give
them hints about usage (not meaning!) of functions and variables

Comparing to them, Haskell compiler is a set of pure mathematical
transformations that can't be wrong by definition - they just
translate one abstract data processing algorithm (i.e. some complex
function) to another equivalent algorithm, just with better
performance. This results in much better high-level optimization
facilities comparing to C compilers

But this purity creates it's own problems. How we can do I/O, work
with stateful algorithms and side effects in pure language? This
question had many different solutions probed in 18 years of Haskell
existence and finally one based on using monads was widely accepted

== What is the monad? ==

What is the monad? It's something from mathematical category theory, i
don't know anymore :) In order to understand how monads are used to
solve problem of I/O and side effects, you don't need to know it. It's
enough to just know elementary mathematics, like I do :)

Let's imagine that we want to implement in Haskell well-known
'getchar' function. What the type it should have? Let's try:

<haskell>
getchar :: Char

get2chars = [getchar,getchar]
</haskell>

What we will got with 'getchar' having just 'Char' type? You can see
all the possible problems in 'get2chars' definition:

1) because Haskell compiler treats all functions as pure and not
having side effects, it can avoid "excessive" call to 'getchar' and
use one returned value two times

2) even if it will make two calls, there is no any clue to determine
which call should be performed first. Are you want to return chars in
the order they read, or in opposite order? Nothing in 'get2chars'
definition answers this question.

How these problems can be solved, from plain programmer's viewpoint?
Let's introduce fake parameter of 'getchar' to make each call
"different" from compiler's POV:

<haskell>
getchar :: Int -> Char

get2chars = [getchar 1, getchar 2]
</haskell>

This right away solved the first problem mentioned above - now
compiler will make two calls because it sees them as having different
parameters. The whole 'get2chars' function should also had such
fake parameter, otherwise we will have the same problem calling it:

<haskell>
getchar :: Int -> Char
get2chars :: Int -> String

get2chars _ = [getchar 1, getchar 2]
</haskell>

Now, we need to give compiler some clue to determine which function it
should call first. Haskell language don't provide any ways to express
order of evaluation... except for data dependencies! How about adding
artificial data dependency which prevents evaluation of second
'getchar' before the first one? In order to achieve this, we will
return from 'getchar' additional fake result that will be used as
parameter for next 'getchar' call:

<haskell>
getchar :: Int -> (Char, Int)

get2chars _ = [a,b] where (a,i) = getchar 1
(b,_) = getchar i
</haskell>

So bad so good - now we can guarantee that 'a' is read before 'b'
because 'b' reading need value (i) that is returned by 'a' reading!

We've added fake parameter to 'get2chars' but the problem is what
Haskell compiler is too smart! It can believe that external 'getchar'
function is really dependent on it's parameter but for 'get2chars' it
will see that we just cheating and throw it away! Problem? How about
passing this fake parameter to 'getchar' function?! In this case
compiler can't guess that it really unused :)

<haskell>
get2chars i0 = [a,b] where (a,i1) = getchar i0
(b,i2) = getchar i1
</haskell>

And more - 'get2chars' has all the same purity problems as 'getchar'
function. If one need to call it two times, he need a way to describe
order of these calls. Look at:

<haskell>
get4chars = [get2chars 1, get2chars 2] -- order of `get2chars` calls isn't defined
</haskell>

We already know how to fight with such problem - 'get2chars' should
also return some fake value that can be used to order calls:

<haskell>
get2chars :: Int -> (String, Int)

get4chars i0 = (a++b) where (a,i1) = get2chars i0
(b,i2) = get2chars i1
</haskell>

But what the fake value it would return? If we will use some integer
constant, too smart Haskell compiler will guess we are cheating, again :)
What about returning the value returned by 'getchar'? See:

<haskell>
get2chars :: Int -> (String, Int)
get2chars i0 = ([a,b], i2) where (a,i1) = getchar i0
(b,i2) = getchar i1
</haskell>

Believe you or not, but we just constructed the whole "monadic"
Haskell I/O system.

== Welcome to RealWorld, baby :) ==

The 'main' Haskell function has the type:

<haskell>
main :: RealWorld -> ((), RealWorld)
</haskell>

where 'RealWorld' is faking type used instead of our Int. It is something
like baton passed in relay-race. When 'main' calls some IO function,
it pass the "RealWorld" it received as parameter. All IO functions has
similar types involving RealWorld as parameter and result. To be
exact, "IO" is a type synonym defined in the following way:

<haskell>
type IO a = RealWorld -> (a, RealWorld)
</haskell>

so. 'main' just has type "IO ()", 'getChar' has type "IO Char" and so
on. Let's look at 'main' calling 'getChar' two times:

<haskell>
getChar :: RealWorld -> (Char, RealWorld)

main :: RealWorld -> ((), RealWorld)
main world0 = let (a, world1) = getChar world0
(b, world2) = getChar world1
in ((), world2)
</haskell>

Look at this closely: 'main' passes to first 'getChar' the "world" it
received. This 'getChar' returns some new value of type RealWorld,
that is used in next call. Finally, 'main' returns the "world" it got
from the second 'getChar':

1) Is it possible here to omit any call of 'getChar' if the char
it read is not used? No, because we should return the "world" that is
result of second 'getChar' and in turn requires "world" from first 'getChar'.

2) Is it possible to reorder 'getChar' calls? No, second 'getChar'
can't be called before first one because it uses "world" it returns.

3) Is it possible to duplicate calls? In Haskell semantics - yes, but
real compilers never duplicate work in such simple cases (otherwise,
the programs generated will not have any speed guarantees)

As we already said, RealWorld values used like baton, passing them
between all routines called by 'main' in strict order. Inside each
routine called, RealWorld values used in the same way. In whole, in
order to "compute" world to be returned from 'main', we should perform
each IO procedure that is called from 'main', directly or indirectly.
This means that each procedure inserted in the chain will be performed
just at the moment (relative to other IO actions) when we planned it
to be called. Let consider the following program:

<haskell>
main = do a <- ask "What is your name?"
b <- ask "How old are you?"
return ()

ask s = do putStr s
readLn
</haskell>

Now you have enough knowledge to rewrite it in low-level way and
check that each operation what should be performed, will be really
performed with arguments it should have and in order we expecting

But what about conditional execution? No problem. Let's define the
well-known 'when' operation:

<haskell>
when :: Bool -> IO () -> IO ()
when condition action world =
if condition
then action world
else ((), world)
</haskell>

As you can see, we can easily include or exclude from execution chain
IO procedures (actions) depending on the data values. If 'condition'
will be False on call of 'when', 'action' will never be called because
real Haskell compilers, again, never calls functions whose results
don't required to calculate final result (i.e., here, final "world" value
of 'main')

Loops and any more complex control structures can be implemented in
the same way. Try it as an exercise!

Finally you may want to know how much costs this passing of RealWorld
values all around. It's free! These fake values exist for compiler
only while it analyze and optimize code, but when it goes to assembler
code generation, it "suddenly" realize that this type is like "()", so
all these parameters and result values can be omitted from generated code.
Is it not really beautiful? :)

== '>>=' and 'do' notation ==

All beginners (including me :) start by thinking that 'do' is some
magic statement that executes IO actions. It's wrong - 'do' is just a
syntax sugar that simplifies writing of IO procedures. 'do' notation
is finally translated to the statements passing "world" values like
we manually written above and need only to simplify gluing of several
IO actions together. You don't require to use 'do' for just one statement:

<haskell>
main = do putStr "Hello!"
</haskell>

is desugared just to:

<haskell>
main = putStr "Hello!"
</haskell>

But nevertheless it's a Good Style to use 'do' even for one statement
because it simplifies adding new statements in the future.

Let's examine how desugared 'do' with multiple statements on the
following example:

<haskell>
main = do putStr "What is your name?"
putStr "How old are you?"
putStr "Nice day!"
</haskell>

'do' statement here just joins several IO actions that should be
performed sequentially. It's translated to sequential applications
of so named "binding operator", namely '>>':

<haskell>
main = (putStr "What is your name?")
>> ( (putStr "How old are you?")
>> (putStr "Nice day!")
)
</haskell>

This binding operator just combines two IO actions, executing them
sequentially by passing the "world" between them:

<haskell>
(>>) :: IO a -> IO b -> IO b
(action1 >> action2) world0 =
let (a, world1) = action1 world0
(b, world2) = action2 world1
in (b, world2)
</haskell>

If such way to define operator looks strange for you, read this
definition as the following:

<haskell>
action1 >> action2 = action
where
action world0 = let (a, world1) = action1 world0
(b, world2) = action2 world1
in (b, world2)
</haskell>

Now you can substitute definition of '>>' at the places of it's usage
and check that program constructed by 'do' desugaring is actually the
same as we can write by manually manipulating "world" values.

More complex example involves binding of variable using "<-":

<haskell>
main = do a <- readLn
print a
</haskell>

This code is desugared into:

<haskell>
main = readLn
>>= (\a -> print a)
</haskell>

As you should remember, '>>' binding operator silently ignores
value of it's first action and returns as an overall result just
result of second action. On the other side, '>>=' allows to use value
of it's first action - it's passed as additional parameter to the second one!
Look at the definition:

<haskell>
(>>=) :: IO a -> (a->IO b) -> IO b
(action1 >>= action2) world0 =
let (a, world1) = action1 world0
(b, world2) = action2 a world1
in (b, world2)
</haskell>

First, what means type of second action, namely "a->IO b"? By
substituting the "IO" definition, we get "a -> RealWorld -> (b, RealWorld)".
This means that second action actually has two parameters
- of type 'a' actually used inside it, and of type RealWorld used for
sequencing of IO actions. That's a destiny - any IO procedure has one
more parameter comparing to that you see in it's type signature. This
parameter is hidden inside the definition of type alias "IO".

Second, you can use these '>>' and '>>=' operations to simplify your
program. For example, in the code above we don't need to introduce the
variable, because 'readLn' result can be send directly to 'print':

<haskell>
main = readLn >>= print
</haskell>

And third - as you see, the notation:

<haskell>
do x <- action1
action2
</haskell>

where 'action1' has type "IO a" and 'action2' has type "IO b",
translated into:

<haskell>
action1 >>= (\x -> action2)
</haskell>

where second argument of '>>=' has the type "a->IO b". It's the way
how the "<-" binding processed - it just becomes parameter of
subsequent operations represented as one large IO action. Look at the
next example:

<haskell>
main = do putStr "What is your name?"
a <- readLn
putStr "How old are you?"
b <- readLn
print (a,b)
</haskell>

This code is desugared into:

<haskell>
main = putStr "What is your name?"
>> readLn
>>= \a -> putStr "How old are you?"
>> readLn
>>= \b -> print (a,b)
</haskell>

I omitted parentheses here, both '>>' and '>>=' operations are
left-associative that leads to that 'a' and 'b' bindings introduced
here is valid for all remaining actions. As an exercise, add the
parentheses yourself and translate this procedure into the low-level
code passing "world" values. I think it should be enough to finally
realize how 'do' translation and binding operators work.

Oh, no. I forgot third monadic operator - 'return'. It just
combines it's two parameters - value passed and "world":

<haskell>
return :: a -> IO a
return a world0 = (a, world0)
</haskell>

How about translating some simple example of 'return' usage? Say,

<haskell>
main = do a <- readLn
return (a*2)
</haskell>

Programmers with imperative languages background often thinks that
'return' in Haskell, like in other languages, immediately returns from
the IO procedure. As you can see in its definition (and even just
type!), such assumption is totally wrong. The only purpose of using
'return' is to "lift" some value (of type 'a') into the result of
whole action (of type "IO a") and therefore it should be used only as
last executed statements of some IO sequence. For example try to
translate the following procedure into the low-level code:

<haskell>
main = do a <- readLn
when (a>=0) $ do
return ()
print "a is negative"
</haskell>

and you will realize that 'print' statement is executed anyway. If you
need to escape from middle of IO procedure, you can use the 'if'
statement:

<haskell>
main = do a <- readLn
if (a>=0)
then return ()
else print "a is negative"
</haskell>

Moreover, Haskell layout rules allow us to use the following layout:

<haskell>
main = do a <- readLn
if (a>=0) then return ()
else do
print "a is negative"
...
</haskell>

that may be very useful for escaping from middle of longish 'do' statement.

Last exercise: implement function 'liftM' that lifts operations on
plain values to the operations on monadic ones. It's type signature:

<haskell>
liftM :: (a->b) -> (IO a -> IO b)
</haskell>

If it's too hard for you, start with the following high-level
definition and rewrite it in low-level fashion:

<haskell>
liftM f action = do x <- action
return (f x)
</haskell>

== Mutable data (references, arrays, hash tables...) ==

As you should know, all names in Haskell are bind to one fixed value.
This greatly simplify understanding of algorithms and optimization of
code, but inappropriate for some cases. Yes, there a plenty of
algorithms that is simpler to implement in terms of updatable
variables, arrays and so on. This means that the value associated with
variable, for example, can be different at different execution points,
so reading it's value can't be considered as pure function. Imagine,
for example the following code:

<haskell>
main = do let a0 = readVariable varA
_ = writeVariable varA 1
a1 = readVariable varA
print (a0,a1)
</haskell>

Looks strange? First, two calls to 'readVariable' looks the same, so
compiler can just reuse the value returned by first call. Second,
result of 'writeVariable' call isn't used so compiler can (and will!)
omit this call completely. To finish the picture, these 3 calls may be
rearranged to any order because they looking independent on each
other. What is the solution? You know - using of IO actions! IO
actions guarantees us that:

# execution order will be retained
# each action will be mandatory executed
# result of the "same" action (such as "readVariable varA") will not be reused

So, the code above really should be written as:

<haskell>
main = do varA <- newIORef 0 -- Create and initialize new variable
a0 <- readIORef varA
writeIORef varA 1
a1 <- readIORef varA
print (a0,a1)
</haskell>

Here, 'varA' got type "IORef Int" which means "variable (reference) in
IO monad holding value of type Int". newIORef creates new variable
(reference) and returns it, and then read/write actions use this
reference. Value returned by "readIORef varA" action may depend not
only on variable involved but also on the moment of performing this
operation so it can return different values on each call.

Arrays, hash tables and any other _mutable_ data structures are
defined in the same way - there is operation that creates new "mutable
value" and returns reference to it. Then special read and write
operations in IO monad are used. The following example shows example
of using mutable array:

<haskell>
import Data.Array.IO
main = do arr <- newArray (1,10) 37 :: IO (IOArray Int Int)
a <- readArray arr 1
writeArray arr 1 64
b <- readArray arr 1
print (a,b)
</haskell>

Here, array of 10 elements with 37 as initial values is created. After
reading value of first element to 'a' this element's value is changed
to 64 and then read again, to 'b'. As you can see by executing this
code, 'a' will be set to 37 and 'b' to 64.

Other state-dependent operations are also often implemented as IO
actions. For example, random numbers generator should return different
values on each call. It looks natural to give it IO-involving type:

<haskell>
rand :: IO Int
</haskell>

Moreover, when you import C routines you should be careful - if this
routine is impure, i.e. it's result depends on something in "real
world" (file system, memory contents...), internal state and so on,
you should give it IO-involving type. Otherwise, compiler can
"optimize" repetitive calls of this procedure with the same parameters! :)

For example:

<haskell>
foreign import ccall
sin :: Double -> Double
</haskell>

because 'sin' result depends only on it's argument, but

<haskell>
foreign import ccall
tell :: Int -> IO Int
</haskell>

If you will declare 'tell' as pure function (without IO) then you may
got the same position on each call! :)

== IO actions as values ==

Now you should precisely understand why it's impossible to use IO
actions inside non-IO (pure) procedures. Such procedures just don't
get a "baton", don't know any "world" value to pass to IO action.
RealWorld is abstract datatype, so they also can't construct it's
values by himself, and it's a strict type, so 'undefined' also can't
be used. So, prohibition of using IO actions inside pure procedures is
just a type trick as it is usual in Haskell :)

But while pure code can't _execute_ IO actions, it can work with them
as with any other functional values - they can be stored in data
structures, passed as parameters and returned as results, collected in
lists, and partially applied. But anyway IO action will remain
functional value because we can't apply it to the last argument - of
type RealWorld.

In order to _execute_ the IO action we need to apply it to some
RealWorld value that can be done only inside some IO procedure,
in it's "actions chain". And real execution of this action will take
place only when this procedure is called as part of process of
"calculating final value of world" for 'main'. Look at this example:

<haskell>
main = let get2chars = getChar >> getChar
((), world1) = putStr "Press two keys" world0
(answer, world2) = get2chars world1
in ((), world2)
</haskell>

Here we first bind value to 'get2chars' and then write binding
involving 'putStr'. But what is an execution order? It is not defined
by order of writing bindings, it is defined by order of processing
"world" values! You can arbitrarily reorder binding statements - in
any case execution order will be defined by dependence on passing
"world" values. Let's see how this 'main' looks in the 'do' notation:

<haskell>
main = do let get2chars = getChar >> getChar
putStr "Press two keys"
get2chars
return ()
</haskell>

As you can see, the 'let' binding that is not included in IO chain, is
translated just to 'let' statement inside the 'do' sequence. And as
you now should understand, placement of this 'let' don't has any
impact on the evaluation order, which is defined by order of passing
"world" values that is, in turn, defined by order of ordinal (non-let)
statements inside 'do'!

Moreover, IO actions like this 'get2chars' can't be executed just
because they are functions with RealWorld parameter. To execute them,
we should supply the RealWorld parameter, i.e. insert them in 'main'
chain, placing them in some 'do' sequence executed from 'main'. Until
that is done, they will be keep as any function, in partially
evaluated form. And we can work with IO actions as with any other
functions - bind them to names (like above), save them to data
structures, pass as function parameters and return as results - and
they will not be performed until you give them this magic RealWorld
parameter!

Let's try. How about defining list of IO actions?

<haskell>
ioActions :: [IO ()]
ioActions = [(print "Hello!"),
(putStr "just kidding"),
(getChar >> return ())
]
</haskell>

I used additional parentheses around each action, although they are
not really required. If you still can't belive that these actions will
not be executed until your command, just uncover this list type:

<haskell>
ioActions :: [RealWorld -> ((), RealWorld)]
</haskell>

Well, now we want to execute some of these actions. No problem, just
insert them into the 'main' chain:

<haskell>
main = do head ioActions
ioActions !! 1
last ioActions
</haskell>

Looks strange, yeah? :) Really, any IO action you write in the 'do'
statement (or use as parameter for '>>'/'>>=') is an expression
returning result of type "IO a". Typically, you use some function that
has type "x -> y -> ... -> IO a" and provide all these x, y and
so on parameters. But you are not limited to this standard scenario -
don't forget that Haskell is functional language and you are free to
compute the functional value required (recall - "IO a" is a function
type) in any possible way. Here we just extracted several functions
from the list - no problem. This functional value can also be
constructed on-the-fly, as we've done in previous example - it's also
ok. Want to see this functional value passed as the parameter - heh,
just look at the 'when' definition. Hey, we can sell, buy and rent
these IO actions as any other functional values! For example, let's
define function that executes all IO actions in the list:

<haskell>
sequence_ :: [IO a] -> IO ()
sequence_ [] = return ()
sequence_ (x:xs) = do x
sequence_ xs
</haskell>

No black magic - we just extracts IO actions from the list and inserts
them into chain of IO operations that should be performed to "compute
final world value" of entire 'sequence_' call.

With help of 'sequence_', we can rewrite our last 'main' as:

<haskell>
main = sequence ioActions
</haskell>

Haskell's ability to work with IO actions as with any other
(functional or non-functional) value allows us to define control
structures of any complexity. Try, for example, to define control
structure that repeats the action until it returns the 'False' result:

<haskell>
while :: IO Bool -> IO ()
while action = ???
</haskell>

How about returning IO action as the function result? Well, we done
this each time we defined IO procedure - they all return IO action
that need RealWorld value to be performed. While we most times just
executed them in chain of higher-level IO procedure, it's also
possible to just collect them without actual execution:

<haskell>
main = do let a = sequence ioActions
b = when True getChar
c = getChar >> getChar
putStr "'let' statements are not executed!"
</haskell>

These assigned IO procedures can be used as parameters to other
procedures, or written to global variables, or processed in some other
way, or just executed later, as we done in example with 'get2chars'.

But how about returning from IO procedure a parameterized IO action?
Let's define a procedure that returns i'th byte from file represented
as Handle:

<haskell>
readi h i = do hSeek h i AbsoluteSeek
hGetChar h
</haskell>

So bad so good. But how about procedure that returns i'th byte of file
with given name without reopening it each time?

<haskell>
readfilei :: String -> IO (Integer -> IO Char)
readfilei name = do h <- openFile name ReadMode
return (readi h)
</haskell>

As you can see, it's an IO procedure that opens file and returns...
another IO procedure that will read byte specified. But we can go
further and include 'readi' body into 'readfilei':

<haskell>
readfilei name = do h <- openFile name ReadMode
let readi h i = do hSeek h i AbsoluteSeek
hGetChar h
return (readi h)
</haskell>

Good? May be better. Why we add 'h' as 'readi' parameter if it can be
got from the environment where 'readi' now defined? Shorter will be:

<haskell>
readfilei name = do h <- openFile name ReadMode
let readi i = do hSeek h i AbsoluteSeek
hGetChar h
return readi
</haskell>

What we've done here? We've build parameterized IO action involving local
names inside 'readfilei' and returned it as the result. Now it can be
used in following way:

<haskell>
main = do myfile <- readfilei "test"
a <- myfile 0
b <- myfile 1
print (a,b)
</haskell>

Such usage of IO actions is very typical for Haskell programs - you
just construct one or more (using tuple) IO actions that your need,
with and/or without parameters, involving the parameters that your
"constructor" received, and return them to caller. Then these IO actions
can be used in rest of program without any knowledge about your
internal implementation strategies. Actually, this is used to
partially emulate OOP (to be exact, ADT) programming ideology.

For example, one of my program's modules is the memory suballocator. It
receives address and size of large memory block and returns two
procedures - one to allocate subblock of given size and second to
return allocated block back:

<haskell>
memoryAllocator :: Ptr a -> Int -> IO (Int -> IO (Ptr b),
Ptr c -> IO ())

memoryAllocator buf size = do ......
let alloc size = do ...
...
free ptr = do ...
...
return (alloc, free)
</haskell>

How this is implemented? 'alloc' and 'free' works with references
created inside this procedure. Because creation of these references is
a part of 'memoryAllocator' IO actions chain, new independent set of
references will be created for each memory block for which
'memoryAllocator' is called:

<haskell>
memoryAllocator buf size = do start <- newIORef buf
end <- newIORef (buf `plusPtr` size)
...
</haskell>

These two references (we will implement very simple memory allocator) are
read and written in 'alloc' and 'free' definitions:

<haskell>
let alloc size = do addr <- readIORef start
writeIORef start (addr `plusPtr` size)
return addr

let free ptr = do writeIORef start ptr
</haskell>

What we've defined here is just a pair of closures that is using state
available on the moment of their definition. As you can see, it's as
easy as in any other functional language, despite the Haskell's lack
of direct support for non-pure functions.

== unsafePerformIO and unsafeInterleaveIO ==

Programmers with imperative background often still looks for a ways to
execute IO actions inside the pure procedures. But that this means?
Imagine that you try to write procedure that reads contents of file
with given name:

<haskell>
readContents :: Filename -> String
</haskell>

Defining it as pure function will simplify the code that use it, i
agree. But this creates troubles for the compiler:

- first, this call is not inserted in sequence of "world
transformations", so compiler don't get a hint - at what exact moment
you want to execute this action. For example, if file contents is one
at the program start and another at the end - what contents you want
to see? Moment of "consumption" of this value don't make strong
guarantees for execution order, because Haskell see all the functions
as pure and fell free to reorder their execution as needed.

- second, attempts to read contents of file with the same name can be
factorized despite the fact that file (or current directory) can be
changed between calls. Again, Haskell looks at all the functions as
pure ones and feel free to omit excessive calls with the same
parameters.

So, implementing functions that interacts with Real World as pure ones
considered as a Bad Behavior. Good boys never do it ;)

Nevertheless, there are (semi-official) ways to use IO actions inside
of pure functions. As you should remember this is prohibited by
requiring "baton" to call IO action. Pure function don't have the baton,
but there is special procedure, that procures this baton from nowhere,
uses it to call IO action and then throws resulting "world" away!
A little low-level magic :) This very special procedure is:

<haskell>
unsafePerformIO :: IO a -> a
</haskell>

Let's look at it's (possible) definition:

<haskell>
unsafePerformIO :: (RealWorld -> (a,RealWorld)) -> a
unsafePerformIO action = let (a,world1) = action createNewWorld
in a
</haskell>

where 'createNewWorld' is internal function producing new value of
RealWorld type.

Using unsafePerformIO, you can easily write pure functions that does
I/O inside. But don't do this without real need, and remember to
follow this rule: compiler don't know that you are cheating, it still
consider each non-IO function as pure one. Therefore, all the usual
optimization rules can (and will!) be applied to it's execution. So
you must ensure that:

1) Result of each call depends only on it's arguments

2) You don't rely on side-effects of this function, which may be not
executed if it's results are not used

Let's investigate this problem deeper. Function evaluation in Haskell
are ruled by value's necessity - computed only the values that really
required to calculate final result. But that this means according to
'main' function? To "calculate final world's" value, it's required to
perform all the intermediate IO actions that included in 'main' chain.
By using 'unsafePerformIO' we call IO actions outside of this chain.
What can guarantee that they will be run? Nothing. The only case when
they will be run is if that is required to compute overall function
result (that in turn should be required to perform some action in
'main' chain). Here we return to the Haskell-natural
evaluation-on-value-need. Now you should clearly see the difference:

- IO action inside IO procedure guaranteed to execute as long as
it is inside 'main' chain - even when it's result is not used.
You directly specify order of action's execution inside IO procedure.
Data dependencies are simulated via "world" values.

- IO action inside 'unsafePerformIO' will be performed only if
result of this operation is really used. Evaluation order is not
guaranteed and you should not rely on it (except when you sure about
data dependency).

I should also say that inside 'unsafePerformIO' call you can organize
small internal chain of IO actions with help of the same binding
operators and/or 'do' sugar:

<haskell>
one = unsafePerformIO $ do var <- newIORef 0
writeIORef var 1
readIORef var
</haskell>

and in this case ALL the operations in this chain will be performed as
long as 'unsafePerformIO' result will be demanded. To ensure this,
the actual 'unsafePerformIO' implementation evaluates "world" returned
by the 'action':

<haskell>
unsafePerformIO action = let (a,world1) = action createNewWorld
in (world1 `seq` a)
</haskell>

('seq' operation strictly evaluates it's first argument before
returning the value of second one)

But there is even more strange operation - 'unsafeInterleaveIO' that
gets "official baton", makes it's piratical copy, and then run's
"illegal" relay-race in parallel with main one! I can't further say
about it's behavior without grief and indignation, it's not surprise
that this operation is widely used in such software-piratical
countries as Russia and China! ;) Don't even ask me - i will say
nothing about this dirty trick i using permanently ;)

== fixIO and 'mdo' ==

== ST monad ==

== Q monad ==

== Welcome to machine: actual [[GHC]] implementation ==

A little disclaimer: after all, i should say that i don't described
here what is a monad (i even don't know it myself) and what my
explanations shows only the one _possible_ way to implement them in
Haskell. For example, hbc Haskell compiler implements monads via
continuations. I also don't said anything about exception handling
that is natural part of "monad" concept. You can read "All about
monads" guide to learn more on these topics.

But there are a good news: first, monad understanding you've build
will work with any implementation. You just can't work with RealWorld
values directly.

Second, IO monad implementation described here is really used in GHC,
Hugs (nhc/jhc, too?) compilers. It is the really real IO definition
from GHC sources:

<haskell>
newtype IO a = IO (State# RealWorld -> (# State# RealWorld, a #))
</haskell>

It uses "State# RealWorld" type instead of our RealWorld, it uses "(# #)"
strict tuple for optimization, and it adds IO data constructor
around the type. Nevertheless, there are no principal changes. Knowing
the principle of "chaining" IO actions via fake "state of world"
values, now you can easily understand and write low-level
implementations of GHC I/O operations.

=== The [[Yhc]]/nhc98 implementation ===

<haskell>
data World = World
newtype IO a = IO (World -> Either IOError a)
</haskell>

This implementation makes the "World" disappear somewhat, and returns Either a
result "a", or if an error occurs then "IOError". The lack of the World on the
right hand side of the function can only be done because the compiler knows
special things about the IO type, and will not over optimise it.

== Further reading ==

Look at the [[Books and tutorials#Using_Monads]] page

Are you have more questions? Ask in the haskell-cafe.