lj_chaource: Free functors, free applicatives, free monads IV

Free functors, free applicatives, free monads IV
Part I
Part II
Part III

After extensive preparation and practice with free functors and free applicatives, we will now find it easy to tackle the free monad.

There are two equivalent definitions of a monadic functor: one with "pure" and "join", another with "pure" and "bind".

pure :: t -> M t
join :: M (M t) -> M t
bind :: M a -> (a -> M t) -> M t

All of these properties have the form of a representable functor, that is,

"...something..." -> M t

For this reason, defining the free F-monad is straightforward. We have a choice: either use "pure" and "join", or use "pure" and "bind" as primitive constructors.

The corresponding definitions of a "free F-monad" look like this in the short notation:

M₁ t = t + F t + M₁ ( M₁ t )

M₂ t = t + F t + ∃a. M₂ a * (a -> M₂ t)

Both definitions are recursive and use the recursive instance of "M" at two places.

In Haskell's "long form", the definitions are

data M₁ t where
  Pure :: t -> M₁ t
  Wrap :: F t -> M₁ t
  Join :: M₁ (M₁ t) -> M₁ t

data M₂ t where
  Pure :: t -> M₂ t
  Wrap :: F t -> M₂ t
  Bind :: forall a. M₂ a -> (a -> M₂ t) -> M₂ t

These definitions will certainly yield us all the monadic properties "for free".

At this point, we would naturally come up with the following questions about these definitions:

- Are these two definitions equivalent?

- What are the possible "optimizations" for these definitions? How do they affect the performance of the corresponding "universal runners"?

- Is there a definition of M(F) that does not assume that F is a functor?

Comparing the two definitions of the free F-monad, we notice that we can get M₂ out of M₁ if we substitute M₁t by ∃a. M₂ a * (a -> t) in the recursive case.

Now, this looks suspiciously like the free functor construction. Note that if F is already a functor then the free F-functor,

∃a. F a * (a -> t),

is actually equivalent to F t. (This is one form of Yoneda's lemma, but that's the only time we used it so far, so I don't feel that it helps to emphasize the name "Yoneda".)

So, it appears that M₂ could be a candidate for a free monad that doesn't assume F to be a functor. In its present form, however, M₂ does require F to be a functor because F t is one of the variants of M₂t, and so F t needs to be a functor by itself.

To remedy this, we can try implementing fmap for M₂ in a different way, so that F t after an "fmap" with a function of type "t->s" is converted into ∃a. M₂ a * (a -> M₂ s) with a = t. To produce a value of (t -> M₂ s), we use the given function t -> s and regard s as a "pure" variant of M₂ s, so that the given function is viewed as a morphism t -> M₂ s. However, this implementation does not satisfy one of the monadic laws: fmap of identity is not equal to identity (because it transforms F t into another variant of M₂ t).

Let us therefore consider possible optimizations of the definitions of the free monad.

Our approach to "optimization" consists of trying to replace "M t by "F a" at certain places in the definitions of the free monad, and trying to remove the "wrap" constructor. Let us see how well this works.

It turns out that we can replace "M" by "F" only in the first recursive use of "M". The second recursive use of "M" cannot be replaced by "F", because attempted definitions such as

M t = t + F t + M₁ ( F₁ t )

M t = t + F t + ∃a. M₂ a * (a -> F₂ t)

do not work (we can't define the monadic functions "bind" and "join" any more).

Replacing the first recursive use of "M" works and brings the expected results. It is straightforward to check that

M₃ t = t + F ( M₃ t )

M₄ t = t + ∃a. F a * (a -> M₄ t)

are valid definitions of the free monad that allow us to define all the required morphisms and satisfy the required laws.

The "wrap" and "bind" operations have to be implemented since they are no longer simply equal to type constructors. Let us show briefly how the "wrap" and "bind" operations are defined for M₄. (The corresponding definitions for M₃ can be seen in an earlier post.)

To define "wrap", we need to inject a value of F t into M₄ t. We can set a = t and use F t * (t -> M t) as the value, where t -> M t is the identity morphism t->t composed with "pure" :: t -> M t to obtain a morphism t -> M t.

To define "bind", we need to transform M t * (t -> M s) into M s. We proceed by case analysis on the first M t. If we have a pure value t, we can evaluate the function t -> M s and obtain a value of M s. If we have a value of ∃a. F a * (a -> M₄ t), we can build a function a -> M s by evaluating "bind" recursively on M₄ t * (t -> M s). Note that the result is F a * (a -> M s), where the function of type a -> M s contains a call to "bind" but is not yet evaluated. Thus, applying "bind" does not actually run the recursive call to "bind", and so takes O(1) operations. The recursive layers are accumulated in the recursive instance of "M s", while "F a" stays unchanged.

The difference between M₃ and M₄ can be seen clearly now: M₄ uses the free F-functor instead of F. Thus, M₄ does not require F to be a functor, while M₃ does. When F is a functor, the definitions 3 and 4 are equivalent, due to the fact that the free F-functor is equivalent to F when F is itself a functor (Yoneda's lemma).

The performance differences between M₁ and M₃ is similar to the difference we saw when studying the free F-functor. The original definition M₁ performs no computations at all until the "universal runner" is applied; the "optimized" definitions will perform some computations (composing some functions) while applying "fmap" or "bind" to the free F-monad, so that the "universal runner" will have fewer computations to do.

Note that the runner will have to descend into several layers of "bind", while applying "bind" to the monad itself takes O(1) operations.

We can restore the "F t" constructor in these definitions. However, some of the identity laws will not hold any more, as we already noted. If the violation of these laws is not important, but the performance gain is significant, we can use the versions of the free F-monad with the "F t" constructor. But, strictly speaking, only M₃ is the free F-monad in the categorical sense (where F is assumed to be a functor). Replacing F with a free F-functor, we obtain M₄ out of M₃.

We can also see why it is impossible to replace the second recursive instance of M by F in the definitions of the free F-monad. The free F-monad must, in one way or another, accumulate several layers of "bind" without actually performing those computations. Therefore, it is required that we have some recursive instance of M in the definition. However, if we wanted to have the recursive instance on the first use of "M", we would have to transfer the "bind" or "join" from the second instance of M to the first, passing through a layer of the functor F. This, however, is impossible, since the types F ( M t) and M ( F t) are not equivalent, while we need to define the free F-monad such that it works for every functor F. For this reason, the definitions M₃ and M₄ are the simplest ones possible.
/lj-cut>