Generic SM introduction
In this document is an introduction of the basic components of a generic state machine.
Unlike the mFibonacci state machine, which is an implementation of one specific computation, we now describe a generic state machine that can be instantiated with various computations of the user’s choice.
The idea here is to create a state machine that behaves like a processor of sorts. In as much as a processor has registries and a clock, so is our generic state machine. It receives instructions in the form of programs written in Assembly, and makes state transitions at each clock in accordance with these instructions.
See Figure below, for such a state machine with registries
The aim with this document is to explain how the machinery used in the mFibonacci SM; to execute computations, produce proofs of correctness of execution, and verify these proofs; can extend to a generic state machine.
Think of our state machine as being composed of two parts; the part that has to do with generating the execution trace, while the other part is focused on verifying that the executions were correctly executed.
-
The former part is more like the “software” of the state machine, as it is concerned with interpreting program instructions and correctly generating the execution trace. A novel language dubbed the Zero-knowledge Assembly Language (zkASM) is used in this part.
-
But the latter part is more like the “hardware” as it consists of a set of arithmetic constraints (or their equivalent, polynomial identities) that every correctly generated execution trace must satisfy. Since these arithmetic constraints are transformed into polynomial identities (via an interpolation process), they are described in a novel language called the Polynomial Identity Language (PIL).
Generic SM executor¶
As seen with the mFibonacci SM, the SM executor takes certain inputs together with the description of the SM, in order to produce the execution trace specifically corresponding to these inputs.
The main difference, in the Generic State Machine case, is the inclusion of a program which stipulates computations to be carried out by the SM executor. These computations could range from a simple addition of two registry values, or moving the value in registry
So then, instead of programming the SM executor ourselves with a specific set of instructions as we did with the mFibonacci SM, the executor of a Generic SM is programmed to read arbitrary instructions encapsulated in some program (depending on the capacity of the SM or the SM’s context of application). As mentioned above, each of these programs is initially written, not in a language like Javascript, but in the zkASM language.
State machine instructions¶
We continue with the state machine model shown in Figure 1 above; a state machine with two registries
Here is an example of a program containing four instructions, expressed in the zkASM language,
Suppose the state machine starts with the initial state
- Firstly, “
” is a request to execute the function . That is, the executor must get a free input value and move it into register . Note that “free input” simply means the input can be any numeric value. - Secondly, “
” means the executor must move the constant value into register . Since the value is part of an instruction, it is referred to as the constant of the execution. Also, it is called a constant because, for a given program, it cannot be freely chosen. - Thirdly, “
” instructs the executor to compute the sum of registry value in with the one in , and save the output into register . - Lastly, “
” tells the executor to reset the registries and to their initial values in the next state, and thus achieving the cyclic behaviour.
Execution trace¶
In addition to carrying out computations as per instructions in programs, the executor must also generate the trace of all state transitions, called the Execution Trace.
Consider, as an example, the execution trace the executor produces for the above program of four instructions. Suppose the free input value used is
This execution trace utilises a total of six columns. Perhaps the use of the columns corresponding to the two registries
The reason there are two extra columns, instead of only four, is the need to capture each state transition in full, and per instruction. The column labelled
The execution trace is therefore read row-by-row as follows,
- The first row, which is per instruction “
”, reads thus: The function was executed, by getting the free input value of , which is to be moved into the registry as the next registry value. - The second row, which is per instruction “
”, reads as follows: A constant value is to be moved into the registry as the next registry value, and the registry now reflects the value as expected per previous instruction. - The third row, which is per instruction “
”, indicates that: The executor executed the sum of registry values in and and the output is to be moved into register , as the next value of the registry. - The last row, which is per instruction “
”, tells us that: The executor has updated the registry value in according to the previous instruction, and will reset the registries and to zeros (their initial values) as their next registry values.
Note that, for this specific program, a change in the free input from