google · djmitche · May 3, 2024
diff --git a/src/SUMMARY.md b/src/SUMMARY.md
@@ -382,6 +382,8 @@
 - [Async Basics](concurrency/async.md)
   - [`async`/`await`](concurrency/async/async-await.md)
   - [Futures](concurrency/async/futures.md)
+  - [State Machine](concurrency/async/state-machine.md)
+    - [Recursion](concurrency/async/state-machine/recursion.md)
   - [Runtimes](concurrency/async/runtimes.md)
     - [Tokio](concurrency/async/runtimes/tokio.md)
   - [Tasks](concurrency/async/tasks.md)

diff --git a/src/concurrency/async-pitfalls/pin.md b/src/concurrency/async-pitfalls/pin.md
@@ -4,13 +4,10 @@ minutes: 20
 
 # `Pin`
 
-Async blocks and functions return types implementing the `Future` trait. The
-type returned is the result of a compiler transformation which turns local
-variables into data stored inside the future.
-
-Some of those variables can hold pointers to other local variables. Because of
-that, the future should never be moved to a different memory location, as it
-would invalidate those pointers.
+Recall that an async function or block creates a type implementing `Future`
+that contains all of its local variables. Some of those variables can hold
+references (pointers) to other local variables. To ensure those remain valid,
+the future can never be moved to a different memory location.
 
 To prevent moving the future type in memory, it can only be polled through a
 pinned pointer. `Pin` is a wrapper around a reference that disallows all

diff --git a/src/concurrency/async/state-machine.md b/src/concurrency/async/state-machine.md
@@ -0,0 +1,97 @@
+---
+minutes: 7
+---
+
+# State Machine
+
+Rust transforms an async function or block to a hidden type that implements
+`Future`, using a state machine to track the function's progress. The details of
+this transform are complex, but it helps to have a schematic understanding of
+what is happening.
+
+```rust,editable,compile_fail
+use futures::executor::block_on;
+use std::future::Future;
+use std::pin::Pin;
+use std::task::{Context, Poll};
+
+async fn send(s: &str) {
+    println!("{s}");
+}
+
+/*
+async fn count_to(count: i32) {
+    for i in 1..=count {
+        send("tick").await;
+    }
+}
+*/
+
+fn count_to(count: i32) -> CountToFuture {
+    CountToFuture { state: CountToState::Init, count, i: 0 }
+}
+
+struct CountToFuture {
+    state: CountToState,
+    count: i32,
+    i: i32,
+}
+
+enum CountToState {
+    Init,
+    Sending(Pin<Box<dyn Future<Output = ()>>>),
+}
+
+impl Future for CountToFuture {
+    type Output = ();
+    fn poll(mut self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Self::Output> {
+        loop {
+            match &mut self.state {
+                CountToState::Init => {
+                    self.i = 1;
+                    self.state = CountToState::Sending(Box::pin(send("tick")));
+                }
+                CountToState::Sending(send_future) => {
+                    match send_future.as_mut().poll(cx) {
+                        Poll::Pending => return Poll::Pending,
+                        Poll::Ready(_) => {
+                            self.i += 1;
+                            if self.i > self.count {
+                                return Poll::Ready(());
+                            } else {
+                                self.state =
+                                    CountToState::Sending(Box::pin(send("tick")));
+                            }
+                        }
+                    }
+                }
+            }
+        }
+    }
+}
+
+fn main() {
+    block_on(count_to(5));
+}
+```
+
+<details>
+
+While this code will run, it is a simplification of the state machine the
+compiler actually generates. The important things to notice here are:
+
+- Calling an async function does nothing but construct a value, ready to start
+  on the first call to `poll`.
+- All local variables are stored in the function's future struct, including an
+  enum to identify where execution is currently suspended. The real generated
+  state machine would not initialize `i` to 0.
+- An `.await` in the async function is translated into a call to the awaited
+  function (here `send`), then polling the future it returns until it is
+  `Poll::Ready`. The real generated state machine would contain the future type
+  defined by `send`, but that type cannot be named in Rust syntax.
+- Execution continues eagerly until there's some reason to block. Try returning
+  `Poll::Pending` in the `CountToState::Init` branch of the match, in hopes that
+  `poll` will be called again with state `CountToState::Sending`. `block_on`
+  will not do so, because nothing has woken the task through `cx.waker()`!
+
+</details>
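
The last speaker note above (a future that returns `Poll::Pending` is only polled again after a wake-up) can be sketched with the standard library alone. This is an illustration and not part of the patch: `YieldOnce`, `Forgetful`, `CountingWaker` and `run` are hypothetical names, and `run` is a toy executor that returns `None` at the point where a real `block_on` would sleep forever.

```rust
use std::future::Future;
use std::pin::{pin, Pin};
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

/// Hypothetical future that suspends once and asks to be polled again.
struct YieldOnce {
    yielded: bool,
}

impl Future for YieldOnce {
    type Output = ();
    fn poll(mut self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<()> {
        if self.yielded {
            return Poll::Ready(());
        }
        self.yielded = true;
        // Schedule another poll before suspending.
        cx.waker().wake_by_ref();
        Poll::Pending
    }
}

/// Hypothetical future that suspends without arranging a wake-up.
struct Forgetful;

impl Future for Forgetful {
    type Output = ();
    fn poll(self: Pin<&mut Self>, _cx: &mut Context<'_>) -> Poll<()> {
        Poll::Pending
    }
}

/// Waker that only counts how often it was invoked.
struct CountingWaker {
    wakes: AtomicUsize,
}

impl Wake for CountingWaker {
    fn wake(self: Arc<Self>) {
        self.wakes.fetch_add(1, Ordering::SeqCst);
    }
}

/// Toy executor (an assumption for illustration): re-polls only after a
/// wake-up, and gives up with `None` where a real `block_on` would hang.
/// Returns the output, if any, and the number of `poll` calls made.
fn run<F: Future>(fut: F) -> (Option<F::Output>, usize) {
    let counter = Arc::new(CountingWaker { wakes: AtomicUsize::new(0) });
    let waker = Waker::from(counter.clone());
    let mut cx = Context::from_waker(&waker);
    let mut fut = pin!(fut);
    let mut polls = 0;
    loop {
        let wakes_before = counter.wakes.load(Ordering::SeqCst);
        polls += 1;
        match fut.as_mut().poll(&mut cx) {
            Poll::Ready(out) => return (Some(out), polls),
            Poll::Pending => {
                // No wake-up was requested during this poll: nobody will
                // ever ask for another one, so stop here.
                if counter.wakes.load(Ordering::SeqCst) == wakes_before {
                    return (None, polls);
                }
            }
        }
    }
}

fn main() {
    println!("{:?}", run(YieldOnce { yielded: false }));
    println!("{:?}", run(Forgetful));
}
```

`YieldOnce` completes on its second poll because it called `wake_by_ref` before suspending; `Forgetful` is polled once and then abandoned, which mirrors what happens when the `Init` branch returns `Poll::Pending` on its own.
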
diff --git a/src/concurrency/async/state-machine/recursion.md b/src/concurrency/async/state-machine/recursion.md
@@ -0,0 +1,34 @@
+---
+minutes: 3
+---
+
+# Recursion
+
+An async function's future type _contains_ the futures for all functions it
+calls. This means that directly recursive async functions are not allowed.
+
+```rust,editable,compile_fail
+use futures::executor::block_on;
+
+async fn count_to(n: u32) {
+    if n > 0 {
+        count_to(n - 1).await;
+        println!("{n}");
+    }
+}
+
+fn main() {
+    block_on(count_to(5));
+}
+```
+
+<details>
+
+This is a quick illustration of how understanding the state machine helps to
+understand errors. Recursion would require `CountToFuture` to contain a field of
+type `CountToFuture`, which is impossible: the type would have infinite size.
+
+Fix this with `Box::pin(count_to(n - 1)).await;`, which stores the future
+returned from `count_to` on the heap behind a fixed-size pointer.
+
+</details>
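
The boxed fix from the speaker notes can be sketched as a complete program. Several things here are assumptions for illustration and not part of the patch: the `out` parameter replaces `println!` so the order of results can be inspected, `block_on` is a minimal standard-library stand-in for `futures::executor::block_on`, and the toolchain is assumed to be Rust 1.77 or later, where recursion in `async fn` is accepted once the recursive call goes through an indirection such as `Box::pin`.

```rust
use std::future::Future;
use std::pin::pin;
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

// Boxing the recursive call keeps the future's size finite: the outer future
// stores only a pointer to the inner one.
// The `out` parameter is an addition for this sketch.
async fn count_to(n: u32, out: &mut Vec<u32>) {
    if n > 0 {
        Box::pin(count_to(n - 1, out)).await;
        out.push(n);
    }
}

/// Waker that does nothing; enough for futures that never return `Pending`.
struct NoopWaker;

impl Wake for NoopWaker {
    fn wake(self: Arc<Self>) {}
}

/// Minimal stand-in for `futures::executor::block_on` (an assumption made to
/// avoid the external crate): polls in a loop until the future completes.
fn block_on<F: Future>(fut: F) -> F::Output {
    let waker = Waker::from(Arc::new(NoopWaker));
    let mut cx = Context::from_waker(&waker);
    let mut fut = pin!(fut);
    loop {
        if let Poll::Ready(out) = fut.as_mut().poll(&mut cx) {
            return out;
        }
    }
}

fn main() {
    let mut out = Vec::new();
    block_on(count_to(5, &mut out));
    println!("{out:?}");
}
```

Each level of recursion allocates one box, so the nesting depth is bounded by available heap memory instead of by the size of a single future type.
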