Difference between revisions of "IO Semantics"

From HaskellWiki
Jump to navigation Jump to search
m
m
 
(6 intermediate revisions by the same user not shown)
Line 1: Line 1:
 
[[Category:Theoretical_foundations]]
 
[[Category:Theoretical_foundations]]
== Semantics of <code>IO</code>: A Free Approach ==
 
   
  +
== An adapted example ==
The following is inspired by [https://lukepalmer.wordpress.com/2008/03/29/io-monad-the-continuation-presentation Luke Palmer's post] on the topic. This only describes one possible semantics of <code>IO a</code>; your actual implementation may vary.
 
  +
  +
Based on page 42 of 76 in [http://www.people.cs.uchicago.edu/~soare/Turing/shagrir.pdf Turing-Post Relativized Computability and Interactive Computing]:
  +
  +
<blockquote>
  +
<b>The Limit Computable or Approximation Model</b>
  +
  +
There exists a sequence of Turing programs {<i>P<sub>t</sub></i> : <i>t</i> ∈ <i>T</i> } so that
  +
<i>P<sub>t</sub></i> computes function <i>g<sub>t</sub></i> at time <i>t</i>. There is not
  +
necessarily any connection between different programs and computing may have to
  +
start all over again as the time changes from <i>t
  +
</i> to <i>t</i> + 1.
  +
  +
Suppose a meteorologist receives data every second <i>t</i> ∈ <i>T</i> from weather
  +
stations scattered across the country. The configuration at the meteorologist's
  +
desk may be described using the Shoenfield Limit Lemma by a computable function
  +
where <i>g<sub>t</sub></i> is the computable characteristic function of
  +
<i>B<sub>t</sub></i>, the configuration of the meteorological computation at the end
  +
of time <i>t</i>. The computable function <i>g<sub>t</sub></i> gives an algorithm to
  +
compute the condition <i>B<sub>t</sub></i> at time <i>t</i> but it gives no relationship
  +
between <i>B<sub>t</sub></i> and <i>B</i><sub><i>t</i>+1</sub>. It will not be possible for the
  +
meteorologist to run, let alone write a new program every second. How will the
  +
meteorologist write a program to uniformly compute the index <i>g<sub>t</sub></i>
  +
for <i>t</i> ∈ <i>T</i> ?
  +
  +
<b>The Online Model With an Oracle Machine</b>
  +
  +
By the Shoenfield Limit Lemma there is a computably enumerable set <i>A</i> (or even
  +
a ∆<sup>0</sup><sub>2</sub> set) and oracle machine Φ<sub><i>e</i></sub> such that
  +
<i>B</i> = Φ<sub><i>e</i></sub><sup><i>A</i></sup>. Now the meteorologist can program the
  +
algorithm Φ<sub><i>e</i></sub> into a computer once and for all at the start of the
  +
day. Every second <i>t</i> ∈ <i>T</i> the meteorologist receives from the weather stations
  +
the latest readings <i>A<sub>t</sub></i> which enter directly into that computer by
  +
an network connection. The meteorologist does not (and cannot) change the
  +
program Φ<sub><i>e</i></sub> every second. The algorithm simply receives the
  +
“oracle” information <i>A</i> from the weather-station network as it is continually
  +
updated, and computes the approximation <i>B<sub>t</sub></i>(<i>x</i>) =
  +
Φ<sub><i>e</i></sub><sup><i>A<sub>t</sub></i></sup>(<i>x</i>) . The meteorologist's program
  +
then produces the next scheduled weather forecast for the day from the
  +
algorithm's result. It is difficult to see how this meteorologist could have
  +
carried out that activity using a batch processing, automatic machine
  +
model, instead of an online model.
  +
</blockquote>
  +
  +
== More is needed ==
  +
  +
But the meteorologist's program has to do more than download weather-station
  +
data: it also has to upload the resulting weather forecasts, for example,
  +
to the computers of media outlets to be broadcast to listeners or viewers.
  +
Another machine model is needed, not just for interactions with other
  +
computers but also:
  +
  +
* a screen, to view a document,
  +
* a printer, to have a hard copy of a document,
  +
* a speaker, to produce some sound,
  +
* a keyboard, mouse, microphone or controller, to obtain some input,
  +
* [https://www.cs.virginia.edu/~robins/Turing_Paper_1936.pdf an operator], if an "arbitrary choice” is needed,
  +
* [https://pure.mpg.de/rest/items/item_2403325_2/component/file_2403324/content an oracle], which simply “cannot be a machine”.
  +
  +
<blockquote>
  +
An “automatic” machine becomes a “choice” machine as soon as we allow the machine’s tape
  +
to be modified by external entities: the tape itself becomes a means of communication. This is
  +
essentially what happens in “real” computers (memory-mapped I/O); for example, we can write
  +
to the computer’s screen by modifying one particular area of memory, or find out which key was
  +
pressed on the computer’s keyboard by reading another.
  +
  +
<small>[https://web.archive.org/web/20160527210838/https://www.edsko.net/pubs/thesis.pdf Making uniqueness types less unique] (page 23 of 264).</small>
  +
</blockquote>
  +
  +
So a choice machine could merely be an automatic machine given an extra ability
  +
to make requests to external entities while it is running:
  +
  +
* requests for <i>output</i> to a screen, printer or speaker.
  +
* or for <i>input</i> from a keyboard, mouse, microphone, controller, an operator or an oracle.
  +
  +
with an oracle machine then being a specialised choice machine. Therefore if
  +
an automatic machine computes the result of an ordinary expression (like
  +
<code>56 + 24</code>), a choice machine computes the result of an I/O action (like
  +
<code>getChar</code>) when provided with an external-entity request. So how can
  +
this concept be adapted for use in a <i>functional</i> language like Haskell?
  +
  +
== Two different approaches ==
  +
<i>
  +
Note:
  +
* For simplicity, the examples here only gives semantics for teletype I/O.
  +
* These are only some of the various ways to describe the semantics of </i><code>IO a</code><i>; your actual implementation may vary.
  +
</i>
  +
  +
=== A free type ===
  +
  +
(Inspired by [https://lukepalmer.wordpress.com/2008/03/29/io-monad-the-continuation-presentation Luke Palmer's post].)
   
 
The idea is to define <code>IO</code> as
 
The idea is to define <code>IO</code> as
Line 11: Line 100:
 
</haskell>
 
</haskell>
   
For simplicity, this is an example of <code>IO</code> that only gives semantics for teletype I/O.
+
Think of <code>IO a</code> as a tree:
 
Think of <code>IO a</code> as a tree whose leaves are <code>Done a</code> that holds the result of the program:
 
 
* <code>PutChar</code> is a node that has one child tree and the node holds one character of data.
 
* <code>PutChar</code> is a node that has one child tree and the node holds one character of data.
 
* <code>GetChar</code> is a node that has many children; it has one child for every character, but <code>GetChar</code> holds no data itself.
 
* <code>GetChar</code> is a node that has many children; it has one child for every character, but <code>GetChar</code> holds no data itself.
  +
* <code>Done a</code> (a leaf) is a node that holds the result of the program.
   
This tree contains all the information needed to execute teletype interactions. One interprets (or executes) an <code>IO a</code> by tracing a route from root of the tree to a leaf:
+
This tree contains all the information needed to execute basic interactions. One interprets (or executes) an <code>IO a</code> by tracing a route from root of the tree to a leaf:
 
* If a <code>PutChar</code> node is encountered, the character data contained at that node is output to the terminal and then its subtree is executed. It is at this point that Haskell code is evaluated in order to determine what character should be displayed before continuing.
 
* If a <code>PutChar</code> node is encountered, the character data contained at that node is output to the terminal and then its subtree is executed. It is at this point that Haskell code is evaluated in order to determine what character should be displayed before continuing.
 
* If a <code>GetChar</code> node is encountered, a character is read from the terminal (blocking if necessary) and the subtree corresponding to the character received is executed.
 
* If a <code>GetChar</code> node is encountered, a character is read from the terminal (blocking if necessary) and the subtree corresponding to the character received is executed.
* If <code>Done</code> is encountered, the program ends.
+
* If a <code>Done</code> node is encountered, the program ends.
   
<code>Done</code> holds the result of the computation, but in the case of <code>Main.main :: IO ()</code> the data is of type <code>()</code> and thus contains no information and is ignored.
+
<code>Done</code> holds the result of the computation, but in the case of <code>Main.main :: IO ()</code> the data is of type <code>()</code> and thus ignored as it contains no information.
   
 
This execution is not done anywhere in a Haskell program, rather it is done by the run-time system.
 
This execution is not done anywhere in a Haskell program, rather it is done by the run-time system.
Line 49: Line 137:
   
 
* The function <code>putChar</code> builds a small <code>IO ()</code> tree that contains one <code>PutChar</code> node holding the character data followed by <code>Done</code>.
 
* The function <code>putChar</code> builds a small <code>IO ()</code> tree that contains one <code>PutChar</code> node holding the character data followed by <code>Done</code>.
 
 
* The function <code>getChar</code> builds a short <code>IO Char</code> tree that begins with a <code>GetChar</code> node that holds one <code>Done</code> node for every character.
 
* The function <code>getChar</code> builds a short <code>IO Char</code> tree that begins with a <code>GetChar</code> node that holds one <code>Done</code> node for every character.
   
Other teletype commands can be defined in terms of these primitives:
+
Other commands can be defined in terms of these primitives:
 
<haskell>
 
<haskell>
 
putStr :: String -> IO ()
 
putStr :: String -> IO ()
Line 62: Line 149:
 
| SysCallName p1 p2 ... pn (r -> IO a)
 
| SysCallName p1 p2 ... pn (r -> IO a)
 
</haskell>
 
</haskell>
  +
where:
where <code>p1</code> ... <code>pn</code> are the parameters for the system call, and <code>r</code> is the result of the system call. (Thus <code>PutChar</code> and <code>GetChar</code> will not occur as constructors for I/O trees if they don't correspond to system calls).
 
  +
* <code>p1</code> ... <code>pn</code> are the parameters for the system call,
  +
* and <code>r</code> is the result of the system call.
   
  +
(Thus <code>PutChar</code> and <code>GetChar</code> will not occur as constructors for I/O trees if they don't correspond to system calls).
== Further reading ==
 
  +
  +
=== A more direct style ===
  +
  +
<blockquote>
  +
Here is the key idea:
  +
  +
:A value of type <code>IO a</code> is an “action” that, when performed, may do some input/output, before delivering a value of type <code>a</code>.
  +
  +
:<small>[[#readmore|Tackling the Awkward Squad]] (page 5 of 60).</small>
  +
</blockquote>
  +
  +
It also has a [https://www.interaction-design.org/literature/topics/keep-it-simple-stupid simple] translation:
  +
<pre>
  +
type IO a = (->) OI a
  +
. . . .
  +
an action that __| | | |
  +
| | |
  +
when performed __________| | |
  +
| |
  +
may do some input/output ___| |
  +
|
  +
before delivering a value _____|
  +
</pre>
  +
  +
Think of an <code>IO a</code> action as an entity which:
  +
* can be used like other Haskell functions,
  +
* but may also have effects like procedures.
  +
  +
This combination of [[Denotative|denotative]] and imperative features is enough to provide basic interactions. An <code>IO a</code> action works by calling it when applied to an <code>OI</code> argument, subject to certain conditions:
  +
* Each <code>OI</code> value should be distinct from all others, to help preserve [[Referential transparency|referential transparency]].
  +
* Each <code>OI</code> value should be used at most once, to ensure actions can be used like other functions.
  +
  +
As for the <code>OI</code> type itself:
  +
<haskell>
  +
data OI -- no constructors are visible to the user
  +
</haskell>
  +
  +
Most <code>OI</code> values would then be provided by one or more specific actions:
  +
<haskell>
  +
foreign import partOI :: OI -> (OI, OI)
  +
  +
partsOI :: OI -> [OI]
  +
partsOI u = let !(u1, u2) = partOI u in u1 : partsOI u2
  +
</haskell>
  +
  +
but in the case of <code>Main.main :: IO ()</code> the initial argument is provided directly by the implementation. It is from this initial argument that all other <code>OI</code> values in the program are obtained.
  +
  +
Not being able to define <code>OI</code> values directly in Haskell means actions in a Haskell program cannot work until the program, applied to its initial <code>OI</code> argument, is called by the run-time system.
  +
  +
The monadic operations are defined as follows:
  +
<haskell>
  +
instance Monad ((->) OI)
  +
where
  +
return x = \ u -> let !_ = partOI u in x
  +
  +
m >>= k = \ u -> let !(u1, u2) = partOI u
  +
!x = m u1
  +
!y = k x u2 in
  +
in y
  +
</haskell>
  +
  +
Note that both methods use their <code>OI</code> parameters, with <code>return</code> simply returning its other parameter <code>x</code> to the caller. The bind operation <code>(>>=)</code> takes an action <code>m</code> and a function <code>k</code> and uses new <code>OI</code> values to call <code>m</code> to obtain <code>x</code>, then call <code>k x</code> to obtain the result <code>y</code>.
  +
  +
Other actions can be declared:
  +
  +
<haskell>
  +
foreign import getChar :: OI -> Char
  +
foreign import putChar :: Char -> OI -> ()
  +
</haskell>
  +
  +
or defined:
  +
  +
<haskell>
  +
getLine :: OI -> [Char]
  +
getLine = do c <- getChar
  +
if c == ’\n’ then return ""
  +
else do cs <- getLine
  +
return (c:cs)
  +
  +
putStr :: [Char] -> OI -> ()
  +
putStr cs = foldr (\ !(_) -> id) () . zipWith putChar cs . partsOI
  +
</haskell>
  +
  +
In more fully-featured implementations, each system call would have its own declaration:
  +
  +
<haskell>
  +
primitive primSysCallName :: T1 -> T2 -> ... -> OI -> Tr
  +
</haskell>
  +
  +
<haskell>
  +
foreign import ... extnSysCallName :: T1 -> T2 -> ... -> OI -> Tr
  +
</haskell>
  +
  +
where:
  +
* <code>T1</code>, <code>T2</code> ... are the types of the parameters for the system call,
  +
* and <code>Tr</code> is the type of the system call's result.
  +
  +
== <span id="readmore"></span> Further reading ==
  +
  +
* [https://www.haskell.org/definition/haskell2010.pdf The Haskell 2010 Report]
  +
::ed. Simon Marlow Marlow, 2010.
   
 
* [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.722.8440&rep=rep1&type=pdf A Functional Specification of Effects]
 
* [http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.722.8440&rep=rep1&type=pdf A Functional Specification of Effects]
Line 71: Line 261:
 
* [https://www.cs.nott.ac.uk/~psztxa/publ/beast.pdf Beauty in the Beast: A Functional Semantics for the Awkward Squad]
 
* [https://www.cs.nott.ac.uk/~psztxa/publ/beast.pdf Beauty in the Beast: A Functional Semantics for the Awkward Squad]
 
::Wouter Swierstra, Thorsten Altenkirch. In: Proceedings of the ACM SIGPLAN Workshop on Haskell, Haskell ’07, ACM, New York, NY, USA, pages 25–36 (2007).
 
::Wouter Swierstra, Thorsten Altenkirch. In: Proceedings of the ACM SIGPLAN Workshop on Haskell, Haskell ’07, ACM, New York, NY, USA, pages 25–36 (2007).
  +
  +
* [https://www.scss.tcd.ie/publications/tech-reports/reports.06/TCD-CS-2006-19.pdf A Semantic Framework for Deterministic Functional Input/Output]
  +
::Malcolm Dowse. PhD dissertation, University of Dublin, Trinity College (2006).
   
 
* [https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.15.736&rep=rep1&type=pdf Semantics of ''fixIO'']
 
* [https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.15.736&rep=rep1&type=pdf Semantics of ''fixIO'']
 
::Levent Erkök, John Launchbury, Andrew Moran. In Fixed Points in Computer Science Workshop, FICS'01 (2001).
 
::Levent Erkök, John Launchbury, Andrew Moran. In Fixed Points in Computer Science Workshop, FICS'01 (2001).
   
* [https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.13.9123&rep=rep1&type=pdf Tackling the awkward squad: monadic input/output, concurrency, exceptions, and foreign-language calls in Haskell]
+
* [https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.13.9123&rep=rep1&type=pdf Tackling the Awkward Squad: monadic input/output, concurrency, exceptions, and foreign-language calls in Haskell]
 
::Simon Peyton Jones. In "Engineering theories of software construction", ed. Tony Hoare, Manfred Broy, Ralf Steinbruggen, IOS Press, ISBN 1 58603 1724, 2001, pages 47-96.
 
::Simon Peyton Jones. In "Engineering theories of software construction", ed. Tony Hoare, Manfred Broy, Ralf Steinbruggen, IOS Press, ISBN 1 58603 1724, 2001, pages 47-96.
   
 
* [https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.52.6409&rep=rep1&type=pdf Relating operational and denotational semantics for input/output effects]
 
* [https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.52.6409&rep=rep1&type=pdf Relating operational and denotational semantics for input/output effects]
 
::Roy L. Crole, Andrew D. Gordon. Mathematical Structures in Computer Science 9(2): 125-158 (1999).
 
::Roy L. Crole, Andrew D. Gordon. Mathematical Structures in Computer Science 9(2): 125-158 (1999).
  +
  +
* [https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.91.3579&rep=rep1&type=pdf How to Declare an Imperative]
  +
::Philip Wadler. ACM Computing Surveys, 29(3): 240-263, September 1997.
  +
  +
* [https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.52.6922&rep=rep1&type=pdf Monadic I/O in Haskell 1.3]
  +
::Andrew D. Gordon and Kevin Hammond. In: Proceedings of the Haskell Workshop, La Jolla, California, June 1995.
   
 
* [https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.16.5894&rep=rep1&type=pdf A Sound Metalogical Semantics for Input/Output Effects]
 
* [https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.16.5894&rep=rep1&type=pdf A Sound Metalogical Semantics for Input/Output Effects]
Line 92: Line 291:
 
* [https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-160.pdf PFL+: A Kernel Scheme for Functional I/O]
 
* [https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-160.pdf PFL+: A Kernel Scheme for Functional I/O]
 
::Andrew Gordon. Computer Laboratory Technical Report Number 160, University of Cambridge (1989).
 
::Andrew Gordon. Computer Laboratory Technical Report Number 160, University of Cambridge (1989).
  +
  +
* [https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.45.8497&rep=rep1&type=pdf Interactive Functional Programs: A Method and a Formal Semantics]
  +
::Simon Thompson. Technical Report 48, Computing Laboratory, University of Kent, Canterbury, UK, November 1987.

Latest revision as of 04:04, 10 August 2024


An adapted example

Based on page 42 of 76 in Turing-Post Relativized Computability and Interactive Computing:

The Limit Computable or Approximation Model

There exists a sequence of Turing programs {Pt : tT } so that Pt computes function gt at time t. There is not necessarily any connection between different programs and computing may have to start all over again as the time changes from t to t + 1.

Suppose a meteorologist receives data every second tT from weather stations scattered across the country. The configuration at the meteorologist's desk may be described using the Shoenfield Limit Lemma by a computable function where gt is the computable characteristic function of Bt, the configuration of the meteorological computation at the end of time t. The computable function gt gives an algorithm to compute the condition Bt at time t but it gives no relationship between Bt and Bt+1. It will not be possible for the meteorologist to run, let alone write a new program every second. How will the meteorologist write a program to uniformly compute the index gt for tT ?

The Online Model With an Oracle Machine

By the Shoenfield Limit Lemma there is a computably enumerable set A (or even a ∆02 set) and oracle machine Φe such that B = ΦeA. Now the meteorologist can program the algorithm Φe into a computer once and for all at the start of the day. Every second tT the meteorologist receives from the weather stations the latest readings At which enter directly into that computer by an network connection. The meteorologist does not (and cannot) change the program Φe every second. The algorithm simply receives the “oracle” information A from the weather-station network as it is continually updated, and computes the approximation Bt(x) = ΦeAt(x) . The meteorologist's program then produces the next scheduled weather forecast for the day from the algorithm's result. It is difficult to see how this meteorologist could have carried out that activity using a batch processing, automatic machine model, instead of an online model.

More is needed

But the meteorologist's program has to do more than download weather-station data: it also has to upload the resulting weather forecasts, for example, to the computers of media outlets to be broadcast to listeners or viewers. Another machine model is needed, not just for interactions with other computers but also:

  • a screen, to view a document,
  • a printer, to have a hard copy of a document,
  • a speaker, to produce some sound,
  • a keyboard, mouse, microphone or controller, to obtain some input,
  • an operator, if an "arbitrary choice” is needed,
  • an oracle, which simply “cannot be a machine”.

An “automatic” machine becomes a “choice” machine as soon as we allow the machine’s tape to be modified by external entities: the tape itself becomes a means of communication. This is essentially what happens in “real” computers (memory-mapped I/O); for example, we can write to the computer’s screen by modifying one particular area of memory, or find out which key was pressed on the computer’s keyboard by reading another.

Making uniqueness types less unique (page 23 of 264).

So a choice machine could merely be an automatic machine given an extra ability to make requests to external entities while it is running:

  • requests for output to a screen, printer or speaker.
  • or for input from a keyboard, mouse, microphone, controller, an operator or an oracle.

with an oracle machine then being a specialised choice machine. Therefore if an automatic machine computes the result of an ordinary expression (like 56 + 24), a choice machine computes the result of an I/O action (like getChar) when provided with an external-entity request. So how can this concept be adapted for use in a functional language like Haskell?

Two different approaches

Note:

  • For simplicity, the examples here only gives semantics for teletype I/O.
  • These are only some of the various ways to describe the semantics of IO a; your actual implementation may vary.

A free type

(Inspired by Luke Palmer's post.)

The idea is to define IO as

data IO a = Done a
          | PutChar Char (IO a)
          | GetChar (Char -> IO a)

Think of IO a as a tree:

  • PutChar is a node that has one child tree and the node holds one character of data.
  • GetChar is a node that has many children; it has one child for every character, but GetChar holds no data itself.
  • Done a (a leaf) is a node that holds the result of the program.

This tree contains all the information needed to execute basic interactions. One interprets (or executes) an IO a by tracing a route from root of the tree to a leaf:

  • If a PutChar node is encountered, the character data contained at that node is output to the terminal and then its subtree is executed. It is at this point that Haskell code is evaluated in order to determine what character should be displayed before continuing.
  • If a GetChar node is encountered, a character is read from the terminal (blocking if necessary) and the subtree corresponding to the character received is executed.
  • If a Done node is encountered, the program ends.

Done holds the result of the computation, but in the case of Main.main :: IO () the data is of type () and thus ignored as it contains no information.

This execution is not done anywhere in a Haskell program, rather it is done by the run-time system.

The monadic operations are defined as follows:

return :: a -> IO a
return x = Done x

(>>=)  :: IO a -> (a -> IO b) -> IO b
Done x      >>= f = f x
PutChar c x >>= f = PutChar c (x >>= f)
GetChar g   >>= f = GetChar (\c -> g c >>= f)

As you can see return is just another name for Done. The bind operation (>>=) takes a tree x and a function f and replaces the Done nodes (the leaves) of x by a new tree produced by applying f to the data held in the Done nodes.

The primitive I/O commands are defined using these constructors.

putChar :: Char -> IO ()
putChar x = PutChar x (Done ())

getChar :: IO Char
getChar = GetChar (\c -> Done c)
  • The function putChar builds a small IO () tree that contains one PutChar node holding the character data followed by Done.
  • The function getChar builds a short IO Char tree that begins with a GetChar node that holds one Done node for every character.

Other commands can be defined in terms of these primitives:

putStr :: String -> IO ()
putStr = mapM_ putChar

More generally speaking, IO a will represent the desired interaction with the operating system. For every system call there will be a corresponding I/O-tree constructor of the form:

	| SysCallName p1 p2 ... pn (r -> IO a)

where:

  • p1 ... pn are the parameters for the system call,
  • and r is the result of the system call.

(Thus PutChar and GetChar will not occur as constructors for I/O trees if they don't correspond to system calls).

A more direct style

Here is the key idea:

A value of type IO a is an “action” that, when performed, may do some input/output, before delivering a value of type a.
Tackling the Awkward Squad (page 5 of 60).

It also has a simple translation:

            type IO a = (->) OI a
                  .       .  .  .
 an action that __|       |  |  |
                          |  |  |
 when performed __________|  |  |
                             |  |
 may do some input/output ___|  |
                                |
 before delivering a value _____|

Think of an IO a action as an entity which:

  • can be used like other Haskell functions,
  • but may also have effects like procedures.

This combination of denotative and imperative features is enough to provide basic interactions. An IO a action works by calling it when applied to an OI argument, subject to certain conditions:

  • Each OI value should be distinct from all others, to help preserve referential transparency.
  • Each OI value should be used at most once, to ensure actions can be used like other functions.

As for the OI type itself:

data OI  -- no constructors are visible to the user

Most OI values would then be provided by one or more specific actions:

foreign import partOI  :: OI -> (OI, OI)

partsOI                :: OI -> [OI]
partsOI u              = let !(u1, u2) = partOI u in u1 : partsOI u2

but in the case of Main.main :: IO () the initial argument is provided directly by the implementation. It is from this initial argument that all other OI values in the program are obtained.

Not being able to define OI values directly in Haskell means actions in a Haskell program cannot work until the program, applied to its initial OI argument, is called by the run-time system.

The monadic operations are defined as follows:

instance Monad ((->) OI)
  where
        return x =  \ u -> let !_ = partOI u in x 

        m >>= k  =  \ u -> let !(u1, u2) = partOI u
                               !x = m u1
                               !y = k x u2 in
                           in y

Note that both methods use their OI parameters, with return simply returning its other parameter x to the caller. The bind operation (>>=) takes an action m and a function k and uses new OI values to call m to obtain x, then call k x to obtain the result y.

Other actions can be declared:

foreign import getChar :: OI -> Char
foreign import putChar :: Char -> OI -> ()

or defined:

getLine                :: OI -> [Char]
getLine                = do c <- getChar
                            if c == \n then return ""
                                         else do cs <- getLine
                                              return (c:cs)

putStr                 :: [Char] -> OI -> ()
putStr cs              = foldr (\ !(_) -> id) () . zipWith putChar cs . partsOI

In more fully-featured implementations, each system call would have its own declaration:

primitive primSysCallName :: T1 -> T2 -> ... -> OI -> Tr
foreign import ... extnSysCallName :: T1 -> T2 -> ... -> OI -> Tr

where:

  • T1, T2 ... are the types of the parameters for the system call,
  • and Tr is the type of the system call's result.

Further reading

ed. Simon Marlow Marlow, 2010.
Wouter Swierstra. Ph.D. thesis, University of Nottingham. (2009).
Wouter Swierstra, Thorsten Altenkirch. In: Proceedings of the ACM SIGPLAN Workshop on Haskell, Haskell ’07, ACM, New York, NY, USA, pages 25–36 (2007).
Malcolm Dowse. PhD dissertation, University of Dublin, Trinity College (2006).
Levent Erkök, John Launchbury, Andrew Moran. In Fixed Points in Computer Science Workshop, FICS'01 (2001).
Simon Peyton Jones. In "Engineering theories of software construction", ed. Tony Hoare, Manfred Broy, Ralf Steinbruggen, IOS Press, ISBN 1 58603 1724, 2001, pages 47-96.
Roy L. Crole, Andrew D. Gordon. Mathematical Structures in Computer Science 9(2): 125-158 (1999).
Philip Wadler. ACM Computing Surveys, 29(3): 240-263, September 1997.
Andrew D. Gordon and Kevin Hammond. In: Proceedings of the Haskell Workshop, La Jolla, California, June 1995.
Andrew Gordon. In International Workshop on Computer Science Logic, January 1995. Springer Berlin Heidelberg.
Andrew Gordon. In FPCA '93: Conference on Functional Programming Languages and Computer Architecture, Copenhagen, June 1993. ACM Press.
Andrew Gordon. Cambridge University Press. Revision of 1992 PhD dissertation.
Andrew Gordon. Computer Laboratory Technical Report Number 160, University of Cambridge (1989).
Simon Thompson. Technical Report 48, Computing Laboratory, University of Kent, Canterbury, UK, November 1987.