Revision as of 17:05, 8 December 2006

Concurrent programming in GHC

This page contains notes and information about how to write concurrent programs in GHC.

Please feel free to add stuff here (Edit page link at the bottom).

Starting points

Basic concurrency: forkIO and MVars. Read Tackling the awkward squad: monadic input/output, concurrency, exceptions, and foreign-language calls in Haskell.
The original paper about Concurrent Haskell contains quite a few examples about how to write concurrent programs. A larger example is Writing High-Performance Server Applications in Haskell, Case Study: A Haskell Web Server

Software Transactional Memory (STM) is a new way to coordinate concurrent threads. There's a separate Wiki page devoted to STM.

STM was added to GHC 6.4, and is described in the paper Composable memory transactions. The paper Lock-free data structures using Software Transactional Memory in Haskell gives further examples of concurrent programming using STM.

Foreign function interface. If you are calling foreign functions in a concurrent program, you need to know about bound threads. They are described in a Haskell workshop paper, Extending the Haskell Foreign Function Interface with Concurrency. The GHC Commentary Supporting multi-threaded interoperation contains more detailed explanation of cooperation between FFI calls and multi-threaded runtime.

Nested Data Parallelism. For an approach to exploiting the implicit parallelism in array programs for multiprocessors, see Data Parallel Haskell (work in progress).

Using concurrency in GHC

You get access to concurrency operations by importing the library Control.Concurrent.

The GHC manual gives a few useful flags that control scheduling (not usually necessary) RTS options.

Multiprocessor GHC

As of version 6.5, GHC supports running programs in parallel on an SMP or multi-core machine. How to do it:

You'll need to get a version of GHC that supports SMP. Either download a nightly snapshot distribution, or get the sources from darcs and build it yourself.

You need to link your program using the -threaded switch. (NOTE: previously it was necessary to compile all code, including libraries, with the -smp switch, this is no longer the case. The -smp flag is now a synonym for -threaded).

Run the program with +RTS -N2 to use 2 threads, for example. You should use a -N value equal to the number of CPU cores on your machine (not including Hyper-threading cores).

Concurrent threads (forkIO and forkOS) will run in parallel, and you can also use the par combinator and Strategies from the Control.Parallel.Strategies module to create parallelism.

Use +RTS -sstderr for timing stats.

Links to related work on parallel and distributed Haskell (many based on GHC)

Problems with GHC implementation

There are ciritical differences between the description in the paper "Asynchronous exceptions in Haskell by Simon Marlow, Simon Peyton Jones, Andy Moran and John Reppy, PLDI'01." the implementation in GHC 6.4 and GHC 6.6 today.

Some of the bad effects are described here under throwTo & block statements considered harmful.

Difference between revisions of "Concurrency"