# Difference between revisions of "Common Misunderstandings"

(→If / Then / Else) |
(→If / Then / Else) |
||

Line 26: | Line 26: | ||

</code> |
</code> |
||

− | The above is valid C code, a [ |
+ | The above is valid C code, a [http://en.wikipedia.org/wiki/Ternary_operation ternary operation], that's used quite commonly as a shortcut to typing out a full if-then-else block. It states that if <code>b</code> is true then <code>x = y</code> otherwise <code>x = z</code>. Notice how this makes no sense without <code>z</code>. Similarly, in Haskell an <hask>if</hask>/<hask>then</hask> makes no sense without an <hask>else</hask>. |

<haskell> |
<haskell> |

## Revision as of 21:37, 24 August 2011

## Contents

# Common Mistakes and Incorrect Beliefs By Haskell Beginners

People going from zero to Haskell are likely to gain a misunderstanding or miss a point that isn't stressed enough. Here are some mistakes that have been observed from multiple sources.

## Indentation

Perhaps the first trip-up - you might understand that indentation defines where a code block starts and the lack of an equal amount of indentation indicates the previous code block ended. What some miss is that `then`

and `else`

must be indented deeper than the `if`

statement:

```
if boolean
then expr1
else expr2
```

Or they can be on the same line as the if:

```
if boolean then expr1 else expr2
```

## If / Then / Else

if-then statements must always include an 'else' portion. It might be best not to think of if-then-else as flow control, as in most imperative languages, but think of it as construction of a value using a well formed expression.

```
x = b ? y : z;
```

The above is valid C code, a ternary operation, that's used quite commonly as a shortcut to typing out a full if-then-else block. It states that if `b`

is true then `x = y`

otherwise `x = z`

. Notice how this makes no sense without `z`

. Similarly, in Haskell an `if`

/`then`

makes no sense without an `else`

.

```
let x = if b then y -- compare to x = b ? y
```

What is `x`

when `b`

is false? One should also recognize that the types returned by the `then`

and `else`

branches must match due to Haskells strong and static type system.

When `if`

is used for sequencing IO it is not uncommon to see an `else`

that returns a null value:

```
main = do
startNetwork <- askUser "Network? "
if startNetwork
then do iface <- initNetworkInterface
handlePackets iface
else return ()
```

Such uses can be more succinct if they use the `when`

function (from the Control.Monad module):

```
main = do
startNetwork <- askUser "Network? "
when startNetwork $ do
iface <- initNetworkInterface
handlePackets iface
```

## Binding of Type Signatures

Due to the typeclass overloading of numbers, lists of numbers are often typed as:

```
ghci> [1,2,3,4] :: [Float]
```

This works fine when the list stands alone, but when applying a function take note the function application binds stronger than the type signature. This means the below signature is invalid:

```
ghci> map floor [1,2,3,4] :: [Float]
```

and should instead be:

```
ghci> map floor ([1,2,3,4] :: [Float])
```

**do** Notation

If the do notation page ever exists I'll advise you to check it out. Until then, understand that a missing `do`

from the top of a function or code block can result in your compiler giving an error message citing a much later line number. Also, any new blocks (ex: from an `if`

or `case`

) must have their own `do`

, even if the higher level code block already had one.

Sorry this isn't the full picture - for an inverse point of view see do notation considered harmful.

## Iterating Over a List

Some beginners make the mistake of mistaking a single-element list pattern (such as `[x]`

) for a pattern that iterates over every element in the list.

One example that recently (in April, 2008) appeared on the Haskell-Cafe mailing list (see the reply post Re: Embedding newlines into a string?) was the following. Here, one coder attempted to write a function `hanoi`

to solve the Towers of Hanoi problem, but to code it so that each tower could be named polymorphically, using, for example, either Chars or Ints. The problematic code segment was the following:

```
hanoi_shower :: Show a => [(a, a)] -> String
hanoi_shower [(a, b)] = "Move " ++ show a ++ " to " ++ show b ++ "."
```

in the following program:

```
hanoi :: a -> a -> a -> Int -> [(a, a)]
hanoi source using dest n
| n == 1 = [(source, dest)]
| otherwise = hanoi source dest using (n-1)
++ hanoi source using dest 1
++ hanoi using source dest (n-1)
hanoi_shower :: Show a => [(a, a)] -> String
hanoi_shower [(a, b)] = "Move " ++ show a ++ " to " ++ show b ++ "."
```

The coder tried to run the code in WinHugs as follows:

`Main> putStr (hanoi_shower (hanoi 'a' 'b' 'c' 2))`

However, this was the result:

`Program error: pattern match failure: hanoi_shower [('a','b'),('a','c')] ++ ([] ++ hanoi 'b' 'a' 'c' (2 - 1))`

The problem was that the parameter `[(a, b)]`

to `hanoi_shower`

only matched the first element of the list, but didn't iterate over the list as intended.

Here is a corrected version of the code above:

```
hanoi_shower :: Show a => [(a, a)] -> String
hanoi_shower moves = unlines ["Move " ++ show a ++ " to "++ show b ++ "." | (a, b) <- moves]
```

Here, `moves`

is pattern-matched to type `[(a, a)]`

(a list of pairs). The problem is how to iterate over the elements (pairs) of the list while separating the first `a`

of each pair from the second `a`

.

The solution above uses list comprehension: The generator `(a, b) <- moves`

feeds each pair in turn to the left-hand expression `(a, b)`

, and this pair is mapped to the left expression, `"Move " ++ show a ++ " to "++ show b ++ "."`

, building a new list of sentences representing moves. Then, the function `unlines`

breaks this list into a sequence of lines.

Here is the result of executing the above code in WinHugs:

`Main> putStr (hanoi_shower (hanoi 'a' 'b' 'c' 2)) Move 'a' to 'b'. Move 'a' to 'c'. Move 'b' to 'c'. Main> putStr (hanoi_shower (hanoi 1 2 3 2)) Move 1 to 2. Move 1 to 3. Move 2 to 3.`

Notice that since `a`

and `b`

in `(a, b)`

are polymorphic types, they can range over both `Chars`

and `Ints`

.

Another way of writing `hanoi_shower`

, using `map`

, is as follows:

```
hanoi_shower :: Show a => [(a, a)] -> String
hanoi_shower moves = unlines (map move moves)
where move (a, b) = "Move " ++ show a ++ " to "++ show b ++ "."
```

Here, `move`

is mapped over `moves`

, and each pair `(a, b)`

of `moves`

is pattern-matched against `"Move " ++ show a ++ " to "++ show b ++ "."`

Another way to map over a list is to use recursion, although this method is considered non-idiomatic Haskell (Haskellers generally prefer using higher-order functions over recursion when possible):

```
hanoi :: a -> a -> a -> Int -> [(a, a)]
hanoi source using dest n
| n == 0 = []
| n == 1 = [(source, dest)]
| otherwise = hanoi source dest using (n-1)
++ hanoi source using dest 1
++ hanoi using source dest (n-1)
hanoi_shower :: Show a => [(a, a)] -> String
hanoi_shower [] = ""
hanoi_shower ((a, b):moves) = unlines ["Move " ++ show a ++ " to "++ show b ++ "."] ++ hanoi_shower moves
```

Here, in `hanoi_shower`

, the base case is simply an empty list `[]`

. At each recursive step, a list of type `[(a, a)]`

(a list of pairs) is mapped against the parameter `(a, b):moves`

of `hanoi_shower`

. This separates the head of the list `(a, b)`

from the tail of the list `moves`

, which then is further matched against `((a, b):moves)`

on the next recursive call.

There are other ways of iterating over lists as well. One advantage of Haskell is that there are often many ways of performing the same action, including iterating over lists.

## Guards

(The following two tips on guards were contributed by the user 7stud in the thread "Top beginner mistakes" (see http://article.gmane.org/gmane.comp.lang.haskell.beginners/1121) on the Haskell-Beginners mailing list on Wed, 4 Mar 2009 21:54:05 +0000 (UTC).)

Some beginners make the mistake of putting an equals sign after a function name when using guards; *viz.*:

```
myfunction x y =
| x < 2 = "a"
| y > 20 = "b"
| otherwise = "c"
```

This causes a cryptic error message similar to the following to be displayed:

`dhask.hs:2:4: parse error on input `|' Failed, modules loaded: none.`

Another common mistake that some beginners make is writing "if" in front of the guard conditions; *viz.*:

```
myfunction x y
| if x < 2 = "a"
| if y > 20 = "b"
| otherwise = "c"
```

This causes a mysterious error message similar to the following to be shown:

`dhask.hs:2:25: parse error on input `=' Failed, modules loaded: none.`

In both cases, the error messages don't help to identify the problem.

(Entry added by Benjamin L. Russell.)

## Parentheses

(The following tip on parentheses was contributed by the user 7stud in the thread "Top beginner mistakes" (see http://article.gmane.org/gmane.comp.lang.haskell.beginners/1121) on the Haskell-Beginners mailing list on Wed, 4 Mar 2009 21:54:05 +0000 (UTC).)

Some beginners make the mistake of not putting parentheses around arguments of the form x:xs; *viz.*:

```
dosomething x:xs = head xs
```

This causes an ambiguous error message similar to the following to be shown:

`dhask.hs:1:0: Parse error in pattern Failed, modules loaded: none.`

Here, the error message doesn't help to recognize the problem.

(Entry added by Benjamin L. Russell.)