nipype
diff --git a/‎notebooks/3_intro_functiontask_state.md
Lines changed: 50 additions & 49 deletions b/‎notebooks/3_intro_functiontask_state.md
Lines changed: 50 additions & 49 deletions
@@ -4,9 +4,9 @@ jupytext:
     extension: .md
     format_name: myst
     format_version: 0.13
-    jupytext_version: 1.14.0
+    jupytext_version: 1.15.0
 kernelspec:
-  display_name: Python 3
+  display_name: Python 3 (ipykernel)
   language: python
   name: python3
 ---
@@ -17,7 +17,7 @@ Task might be run for a single set of input values or we can generate multiple s
 
 Let's start from a simple `FunctionTask` that takes a list as an input:
 
-```{code-cell}
+```{code-cell} ipython3
 ---
 jupyter:
   outputs_hidden: false
@@ -31,7 +31,7 @@ import nest_asyncio
 nest_asyncio.apply()
 ```
 
-```{code-cell}
+```{code-cell} ipython3
 import pydra
 
 
@@ -45,52 +45,52 @@ task1 = add_two(x=[1, 2, 3])
 
 Before we set any splitter, the task's `state` should be `None`
 
-```{code-cell}
+```{code-cell} ipython3
 task1.state is None
 ```
 
-Now, we can set the `splitter` by using the `split` method. Since our task has only one input, there is only one option to create a set of inputs, i.e. `splitter="x"`:
+Now, we can set the `splitter` using the `split` method. Since our task has only one input, there is only one option to create a set of inputs, i.e. `split(splitter='x', x=[1, 2, 3])`; make sure you define the value of `x` in the splitter as you did in `task`:
 
-```{code-cell}
-task1.split('x')
+```{code-cell} ipython3
+task1.split('x', x=[1, 2, 3])
 ```
 
 Now, we can check that our task has a `state`:
 
-```{code-cell}
+```{code-cell} ipython3
 task1.state
 ```
 
 And we can print information about the state:
 
-```{code-cell}
+```{code-cell} ipython3
 print(task1.state)
 ```
 
 within the `state` information about the splitter has been stored:
 
-```{code-cell}
+```{code-cell} ipython3
 task1.state.splitter
 ```
 
 Note, that *pydra* adds name of the function to the name of the input.
 
 Now, we can run the task and check results:
 
-```{code-cell}
+```{code-cell} ipython3
 task1()
 task1.result()
 ```
 
 We can also return results together with values of the input, we just have to set an additional argument `return_inputs` to `True` (or `val`)
 
-```{code-cell}
+```{code-cell} ipython3
 task1.result(return_inputs=True)
 ```
 
 If we want to return indices instead of values, we can set `return_inputs` to `ind`
 
-```{code-cell}
+```{code-cell} ipython3
 task1.result(return_inputs='ind')
 ```
 
@@ -106,16 +106,16 @@ For tasks with a state *pydra* prepare all sets of inputs and run the task for e
 
 We can also use `State` for functions with multiple inputs:
 
-```{code-cell}
+```{code-cell} ipython3
 @pydra.mark.task
 def add_var(a, b):
     return a + b
 ```
 
 Now we have more options to define `splitter`, it depends on the type of inputs and on our application. For example, we could have `a` that is a list, `b` that is a single value, and split over `a` values:
 
-```{code-cell}
-task2 = add_var(a=[1, 2, 3], b=10).split('a')
+```{code-cell} ipython3
+task2 = add_var(a=[1, 2, 3], b=10).split('a', a=[1, 2, 3])
 task2()
 task2.result()
 ```
@@ -130,7 +130,7 @@ Now we have three results for each element from the `a` list and the value of `b
 
 But we can have lists for both inputs, and use both inputs in the splitter. Let's assume that `a` and `b` are two elements lists.
 
-```{code-cell}
+```{code-cell} ipython3
 task3 = add_var(a=[1, 2], b=[10, 100])
 ```
 
@@ -144,8 +144,8 @@ Now, we have two options to map the input values, we might want to run the task
 
 Let's start from the scalar splitter, that uses parentheses in the syntax:
 
-```{code-cell}
-task3.split(('a', 'b'))
+```{code-cell} ipython3
+task3.split(('a', 'b'), a=[1, 2], b=[10, 100])
 task3()
 task3.result()
 ```
@@ -164,9 +164,9 @@ We can represent the execution by the graph:
 
 For the outer splitter we will use brackets:
 
-```{code-cell}
+```{code-cell} ipython3
 task4 = add_var(a=[1, 2], b=[10, 100])
-task4.split(['a', 'b'])
+task4.split(['a', 'b'], a=[1, 2], b=[10, 100])
 task4()
 task4.result()
 ```
@@ -181,17 +181,17 @@ Now, we have results for all of the combinations of values from `a` and `b`.
 
 Note, that once you set the splitter, you will get error when you try to set the splitter again. However, you can always set `overwrite` to `True` if you really intend to change the splitter.
 
-```{code-cell}
+```{code-cell} ipython3
 :tags: [raises-exception]
 
-task4.split(('a', 'b'))
+task4.split(('a', 'b'), a=[1, 2], b=[10, 100])
 ```
 
 For more inputs we can create more complex splitter, and use scalar and outer splitters together. **Note, that the scalar splitter can only work for lists that have the same length, but the outer splitter doesn't have this limitation.**
 
 Let's run one more example that takes four inputs, `x` and `y` components of two vectors, and calculates all possible sums of vectors. `x` components should be kept together with corresponding `y` components (i.e. scalar splitters: `("x1", "y1")` and `("x2", "y2")`), but we should use outer splitter for two vectors to get all combinations.
 
-```{code-cell}
+```{code-cell} ipython3
 @pydra.mark.task
 def add_vector(x1, y1, x2, y2):
     return (x1 + x2, y1 + y2)
@@ -205,7 +205,8 @@ task5 = add_vector(
     x2=[10, 20, 30],
     y2=[10, 20, 30],
 )
-task5.split(splitter=[('x1', 'y1'), ('x2', 'y2')])
+task5.split(splitter=[('x1', 'y1'), ('x2', 'y2')],
+            x1=[10, 20], y1=[1, 2], x2=[10, 20, 30], y2=[10, 20, 30])
 task5()
 task5.result()
 ```
@@ -220,9 +221,9 @@ When we use `splitter`, we can also define `combiner`, if we want to combine tog
 
 If we take the `task4` as an example and combine all results for each element of the input `b`, we can modify the task as follows:
 
-```{code-cell}
+```{code-cell} ipython3
 task5 = add_var(a=[1, 2], b=[10, 100])
-task5.split(['a', 'b'])
+task5.split(['a', 'b'], a=[1, 2], b=[10, 100])
 # adding combiner
 task5.combine('b')
 task5()
@@ -231,7 +232,7 @@ task5.result()
 
 Now our result contains two elements, each one is a list. The first one contains results for `a=1` and both values of `b`, and the second contains results for `a=2` and both values of `b`. Let's print the result again using `return_inputs`:
 
-```{code-cell}
+```{code-cell} ipython3
 all_results = task5.result(return_inputs=True)
 print(f'first list, a=1: {all_results[0]}')
 print(f'\n second list, a=2: {all_results[1]}')
@@ -243,9 +244,9 @@ print(f'\n second list, a=2: {all_results[1]}')
 
 But we could also group all elements from the input `a` and have a different combined output:
 
-```{code-cell}
+```{code-cell} ipython3
 task6 = add_var(a=[1, 2], b=[10, 100])
-task6.split(['a', 'b'])
+task6.split(['a', 'b'], a=[1, 2], b=[10, 100])
 # changing the combiner
 task6.combine('a')
 task6()
@@ -254,7 +255,7 @@ task6.result()
 
 We still have two elements in our results, but this time the first element contains results for `b=10` and both values of `a`, and the second contains results for `b=100` and both values of `a`.
 
-```{code-cell}
+```{code-cell} ipython3
 all_results = task6.result(return_inputs=True)
 print(f'first list, b=10: {all_results[0]}')
 print(f'\n second list, b=100: {all_results[1]}')
@@ -266,9 +267,9 @@ print(f'\n second list, b=100: {all_results[1]}')
 
 We can also combine all elements by providing a list of all inputs to the `combiner`:
 
-```{code-cell}
+```{code-cell} ipython3
 task7 = add_var(a=[1, 2], b=[10, 100])
-task7.split(['a', 'b'])
+task7.split(['a', 'b'], a=[1, 2], b=[10, 100])
 # combining all inputs
 task7.combine(['a', 'b'])
 task7()
@@ -287,7 +288,7 @@ This time the output contains one element that is a list of all outputs:
 
 Note that list can be used as an input even without using any splitter, there are functions that take a list as a single input value:
 
-```{code-cell}
+```{code-cell} ipython3
 @pydra.mark.task
 def moment(lst, n):
     return sum([i**n for i in lst]) / len(lst)
@@ -307,7 +308,7 @@ Let's say we want to calculate squares and cubes of integers from 2 to 5, and co
 
 First we will define a function that returns powers:
 
-```{code-cell}
+```{code-cell} ipython3
 :tags: [hide-cell]
 
 @pydra.mark.task
@@ -317,17 +318,17 @@ def power(x, n):
 
 Now we can create a task that takes two lists as its input, outer splitter for `x` and `n`, and combine all `x`:
 
-```{code-cell}
+```{code-cell} ipython3
 :tags: [hide-cell]
 
-task_ex1 = power(x=[2, 3, 4, 5], n=[2, 3]).split(['x', 'n']).combine('x')
+task_ex1 = power(x=[2, 3, 4, 5], n=[2, 3]).split(['x', 'n'], x=[2, 3, 4, 5], n=[2, 3]).combine('x')
 task_ex1()
 task_ex1.result()
 ```
 
 The result should contain two list, the first one is for squares, the second for cubes.
 
-```{code-cell}
+```{code-cell} ipython3
 :tags: [hide-cell]
 
 squares_list = [el.output.out for el in task_ex1.result()[0]]
@@ -340,7 +341,7 @@ print(f'cubes: {cubes_list}')
 
 We run task multiple times for multiple sets of input, but we didn't talk about the execution time. Let's create a function that sleeps for a second and run for four values:
 
-```{code-cell}
+```{code-cell} ipython3
 import time
 
 
@@ -350,7 +351,7 @@ def add_two_sleep(x):
     return x + 2
 
 
-task9 = add_two_sleep(x=[1, 2, 3, 4]).split('x')
+task9 = add_two_sleep(x=[1, 2, 3, 4]).split('x', x=[1, 2, 3, 4])
 t0 = time.time()
 task9()
 print(f'total time: {time.time() - t0}')
@@ -363,8 +364,8 @@ If we run `Task` that has a `State`, pydra will automatically create a `Submitte
 
 We could also create a `Submitter` first, and than use it to run the task:
 
-```{code-cell}
-task10 = add_two_sleep(x=[1, 2, 3, 4]).split('x')
+```{code-cell} ipython3
+task10 = add_two_sleep(x=[1, 2, 3, 4]).split('x', x=[1, 2, 3, 4])
 
 t0 = time.time()
 with pydra.Submitter(plugin='cf') as sub:
@@ -375,8 +376,8 @@ print(f'results: {task10.result()}')
 
 or we can provide the name of the plugin:
 
-```{code-cell}
-task11 = add_two_sleep(x=[1, 2, 3, 4]).split('x')
+```{code-cell} ipython3
+task11 = add_two_sleep(x=[1, 2, 3, 4]).split('x', x=[1, 2, 3, 4])
 
 t0 = time.time()
 task11(plugin='cf')
@@ -386,8 +387,8 @@ print(f'results: {task11.result()}')
 
 The last option for running the task is to create a `Submitter` first and run the submitter (`Submitter` is also a callable object) with the task as  a `runnable`:
 
-```{code-cell}
-task12 = add_two_sleep(x=[1, 2, 3, 4]).split('x')
+```{code-cell} ipython3
+task12 = add_two_sleep(x=[1, 2, 3, 4]).split('x', x=[1, 2, 3, 4])
 
 t0 = time.time()
 with pydra.Submitter(plugin='cf') as sub:
@@ -398,8 +399,8 @@ print(f'results: {task12.result()}')
 
 All of the execution time should be similar, since all tasks are run by *pydra* in the same way, i.e. *pydra* creates a submitter with `ConcurrentFutures` worker, if a number of processors is not provided, `ConcurrentFutures` takes all available processors as `max_workers`. However, if we want to set a specific number of processors, we can set it using `n_procs` when creating a `Submitter`. Let's see how the execution time changes when we use `n_procs=2`.
 
-```{code-cell}
-task13 = add_two_sleep(x=[1, 2, 3, 4]).split('x')
+```{code-cell} ipython3
+task13 = add_two_sleep(x=[1, 2, 3, 4]).split('x', x=[1, 2, 3, 4])
 
 t0 = time.time()
 with pydra.Submitter(plugin='cf', n_procs=2) as sub: