Instruction optimizations are partially (but not fully) implemented. (2025-07-04)

The Basic Idea

Suppose that polonius-editor is told to execute the following two instructions:

REMOVE 0 2
INSERT 0 abc

These instructions are redundant -- we're deleting 3 characters, and then inserting 3 characters to the same position. You can reduce those two instructions to a single REPLACE 0 abc

At first, that might not sound like much. But if we're editing a 2TB file, for example, removing/inserting at the beginning could take a pretty long time, whereas replaces happen nearly-instantaneously. Needless to say, there are many more cases like this one where the instructions could be optimized.

So, what we would like the program to be able to do is optimize (or simplify) the instructions given to it.

The basic idea of the rest of this document is that we want to try to represent a sequence of instructions as a math expression, which we can then simplify by applying some basic theorems.

So here, we'll define some "objects" to represent files and parts of files, and some operations for them (addition, subtraction, and multiplication) to represent the instruction types in Polonius.

The Math: Blocks and their Operations

These objects are called "blocks," and can represent parts of the file we're editing. For example:

$$\displaystyle \begin{bmatrix} 0 & 1 & 2 \\a & b & c \end{bmatrix}$$

This block can represent a file with only 3 characters: a, b, and c (in that order)

The top-row should be thought of as a position, and its corresponding element in the bottom-row is the data which is stored at that position.

Picture a file that looks like this:

$$\displaystyle \begin{bmatrix} 0 & 1 & 2 & 3 & 4 & 5 \\ a & b & c & d & e & f \end{bmatrix}$$

If you opened up this file on your computer and looked at it, it would say: abcdef

If we want to make it just say def, we can subtract the abc block from it:

$$\displaystyle \begin{bmatrix} 0 & 1 & 2 & 3 & 4 & 5 \\ a & b & c & d & e & f \end{bmatrix} - \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix} \ = \ \begin{bmatrix} 0 & 1 & 2 \\ d & e & f \end{bmatrix}$$

If instead, you wanted to make it say abcdefxyz, you could add an xyz block to it:

$$\displaystyle \begin{bmatrix} 0 & 1 & 2 & 3 & 4 & 5 \\ a & b & c & d & e & f \end{bmatrix} + \begin{bmatrix} 6 & 7 & 8 \\ x & y & z \end{bmatrix} \ = \ \begin{bmatrix} 0 & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 \\ a & b & c & d & e & f & x & y & z \end{bmatrix}$$

Let's say instead we wanted to change the abc into hij. We would multiply it by an hij block:

$$\displaystyle \begin{bmatrix} 0 & 1 & 2 & 3 & 4 & 5 \\ a & b & c & d & e & f \end{bmatrix} \bullet \begin{bmatrix} 0 & 1 & 2 \\ h & i & j \end{bmatrix} \ = \ \begin{bmatrix} 0 & 1 & 2 & 3 & 4 & 5 \\ h & i & j & d & e & f \end{bmatrix}$$

It's necessary to define a few arbitrary rules before we continue:

All of these objects have exactly 2 rows -- the top row (the keys) and the bottom row (the values).
For each key, there must be a corresponding value.
The keys must be unique and in order (lowest-to-highest)
The keys do not necessarily have to be sequential (you can have "1 - 3 - 5")
The keys do not necessarily have to start from zero (you can start at anywhere $>= 0$)
The lowest possible key is $0$

This block makes up a single mathematical object. Call it $x$

Let's also define a function $mag(x)$ (meaning the "magnitude" of $x$), which gives the number of keys (or the width) of the object. In Polonius, this is called the Block Size.

Addition (INSERT Operations)

Addition is defined as follows:

Let

$$\displaystyle x = \begin{bmatrix} 0 & 1 & 2 & 3 \\ a & b & c & d \end{bmatrix}$$

Let

$$\displaystyle y = \begin{bmatrix} 1 \\ X \end{bmatrix}$$

Then

$$\displaystyle x + y = \begin{bmatrix} 0 & 1 & 2 & 3 & 4 \\ a & X & b & c & d \end{bmatrix}$$

The values are inserted from $y$ into $x$, in left-to-right order. Here, we insert the key-value pair

$$\displaystyle \begin{bmatrix} 1 \\ X \end{bmatrix}$$

Into position #1, and then shift everything after it over to the right (the old #1 becomes now #2, etc).

Here's another example:

Let

$$\displaystyle x = \begin{bmatrix} 0 & 1 & 2 & 3 \\ a & b & c & d \end{bmatrix}$$

Let

$$\displaystyle y = \begin{bmatrix} 1 & 3 \\ X & Y \end{bmatrix}$$

Then

$$\displaystyle x + y = \begin{bmatrix} 0 & 1 & 2 & 3 & 4 & 5 \\ a & X & b & Y & c & d \end{bmatrix}$$

Here you can see that we're actually inserting the values left-to-right -- first, we insert the "X" into position #1, and only afterwards we insert the "Y" into position #3.¹

Note that:

Addition is NOT commutative ($x + y \ne y + x$)
Addition is NOT associative ($x + (y + z) \ne (x + y) + z$)
$mag(x + y) = mag(x) + mag(y)$ -- the length of the file, when all is said and done, is the old length of the file plus the length of what we just added.

Subtraction (REMOVE Operations)

Subtraction is defined as follows:

Let

$$\displaystyle x = \begin{bmatrix} 0 & 1 & 2 & 3 \\ a & b & c & d \end{bmatrix}$$

Let

$$\displaystyle y = \begin{bmatrix} 1 \\ ? \end{bmatrix}$$

(Where $?$ signals that we don't have to care what the value is for this)

Then

$$\displaystyle x - y = \begin{bmatrix} 0 & 1 & 2 \\ a & c & d \end{bmatrix}$$

First, we check which keys are shared in common between the two blocks, then we remove those key/value pairs from the left-hand object, and shift the remaining values leftward. If a key appears in both blocks, it gets removed.

In this case, both of them have key #1. So, key #1 is removed from $x$, and all values paired with keys $> 1$ are left-shifted (#2 becomes the new #1, etc)

Note that:

$x - x = 0$ (subtracting a block from itself yields an empty block)
Subtraction is NOT commutative ($x - y \ne y - x$)
Subtraction is NOT associative ($x - (y - z) \ne (x - y) - z$)
$0 \le mag(x - y) \le mag(x)$

Multiplication (REPLACE Operations)

Multiplication is defined as follows:

Let

$$\displaystyle x = \begin{bmatrix} 0 & 1 & 2 & 3 \\ a & b & c & d \end{bmatrix}$$

Let

$$\displaystyle y = \begin{bmatrix} 1 \\ X \end{bmatrix}$$

Then

$$\displaystyle x \bullet y = \begin{bmatrix} 0 & 1 & 2 & 3 \\ a & X & c & d \end{bmatrix}$$

First, we look for keys that the two blocks share in common. In this case, they both have a key #1

Then, we replace the value in the left-hand block with the value in the right-hand block. In this case, the value under $1$ (which originally was $b$) was replaced with the corresponding one in $y$, and became $X$.

Note that:

Multiplication is NOT commutative ($x \bullet y \ne y \bullet x$)
Multiplication is NOT associative ($x \bullet (y \bullet z) \ne (x \bullet y) \bullet z$)
Multiplication is NOT distributive ($x \bullet (y + z) \ne xy + xz$)
Multiplication is idempotent ($x \bullet x = x$)
$mag(x \bullet y) = mag(x)$

An Analogy

Imagine a neighborhood of houses all on a conveyer belt. Whenever a new family moves in, or an old family moves out, the conveyer belt moves left or right and shifts the entire neighborhood. The only operation that does not move the houses is if one family moves out & another family moves in to the same house at the same time.

That's what this process is like. The file is the neighborhood, the "positions" or "places" where the characters go are the houses, and the characters themselves are the occupants of the houses. A single block, written like this:

$$\begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix}$$

Is only a snapshot of the file at one moment. The instant that we apply an operation (addition or subtraction, an insert or a remove), the conveyer belt shifts, and many of the old houses have new addresses.


A visual representation of an INSERT operation

The Logic: Theorems and Uses

It's important to remember that, although we're calling these operations addition, subtraction and multiplication, they really just represent the INSERT, REMOVE, and REPLACE operations from Polonius. Defining them as a kind of arithmetic just helps us to work some things out logically.

Theorem #0

$$\displaystyle x + \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix} + \begin{bmatrix} 3 & 4 & 5 \\ d & e & f \end{bmatrix} \ = \ x + \begin{bmatrix} 0 & 1 & 2 & 3 & 4 & 5 \\ a & b & c & d & e & f \end{bmatrix}$$

$$\displaystyle x + \begin{bmatrix} 0 & 1 & 2 \\ d & e & f \end{bmatrix} + \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix} = x + \begin{bmatrix} 0 & 1 & 2 & 3 & 4 & 5 \\ a & b & c & d & e & f \end{bmatrix}$$

$$\displaystyle x + \begin{bmatrix} 3 & 4 & 5 \\ d & e & f \end{bmatrix} + \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix} = x + \begin{bmatrix} 0 & 1 & 2 & 6 & 7 & 8 \\ a & b & c & d & e & f \end{bmatrix}$$

'Insert' instructions can be combined.

Theorem #1

$$\displaystyle x - \begin{bmatrix} 0 & 1 & 2 \\ ? & ? & ? \end{bmatrix} - \begin{bmatrix} 3 & 4 & 5 \\ ? & ? & ? \end{bmatrix} \ = \ x - \begin{bmatrix} 0 & 1 & 2 & 6 & 7 & 8 \\ ? & ? & ? & ? & ? & ? \end{bmatrix}$$

$$\displaystyle x - \begin{bmatrix} 0 & 1 & 2 \\ ? & ? & ? \end{bmatrix} - \begin{bmatrix} 0 & 1 & 2 \\ ? & ? & ? \end{bmatrix} \ = \ x - \begin{bmatrix} 0 & 1 & 2 & 3 & 4 & 5 \\ ? & ? & ? & ? & ? & ? \end{bmatrix}$$

$$\displaystyle x - \begin{bmatrix} 3 & 4 & 5 \\ ? & ? & ? \end{bmatrix} - \begin{bmatrix} 0 & 1 & 2 \\ ? & ? & ? \end{bmatrix} \ = \ x - \begin{bmatrix} 0 & 1 & 2 & 3 & 4 & 5 \\ ? & ? & ? & ? & ? & ? \end{bmatrix}$$

'Remove' instructions can be combined.

Theorem #2

$$\displaystyle x \bullet \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix} \bullet \begin{bmatrix} 3 & 4 & 5 \\ d & e & f \end{bmatrix} \ = \ x \bullet \begin{bmatrix} 0 & 1 & 2 & 3 & 4 & 5 \\ a & b & c & d & e & f \end{bmatrix}$$

$$\displaystyle x \bullet \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix} \bullet \begin{bmatrix} 0 & 1 & 2 \\ d & e & f \end{bmatrix} \ = \ x \bullet \begin{bmatrix} 0 & 1 & 2 \\ d & e & f \end{bmatrix}$$

'Replace' instructions can be combined.

Of a sequence of 'replace' instructions to the same position, only the last one is significant.

Theorem #3

$$\displaystyle x - \begin{bmatrix} 0 & 1 & 2 \\ ? & ? & ? \end{bmatrix} + \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix} \ = \ x \bullet \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix}$$

$$\displaystyle x - \begin{bmatrix} 0 & 1 & 2 \\ ? & ? & ? \end{bmatrix} + \begin{bmatrix} 0 & 1 \\ a & b \end{bmatrix} \ = \ x \bullet \begin{bmatrix} 0 & 1 \\ a & b \end{bmatrix} - \begin{bmatrix} 2 \\ ? \end{bmatrix}$$

$$\displaystyle x - \begin{bmatrix} 0 & 1 \\ ? & ? \end{bmatrix} + \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix} \ = \ x \bullet \begin{bmatrix} 0 & 1 \\ a & b \end{bmatrix} + \begin{bmatrix} 2 \\ c \end{bmatrix}$$

Removing some characters, followed by inserting new characters into the same position, is exactly equivalent to a single replace operation at that position.

Theorem #4

$$\displaystyle x + \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix} - \begin{bmatrix} 0 & 1 & 2 \\ ? & ? & ? \end{bmatrix} \ = \ x$$

Inserting some characters, followed by removing those same characters, does not change the input.

Theorem #5

$$\displaystyle x + \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix} \bullet \begin{bmatrix} 0 & 1 & 2 \\ d & e & f \end{bmatrix} \ = \ x + \begin{bmatrix} 0 & 1 & 2 \\ d & e & f \end{bmatrix}$$

Inserting some characters, and then replacing some or all of what we just inserted, can be simplified into a single insert operation.

Theorem #6

$$\displaystyle x \bullet \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix} - \begin{bmatrix} 0 & 1 & 2 \\ ? & ? & ? \end{bmatrix} \ = \ x - \begin{bmatrix} 0 & 1 & 2 \\ ? & ? & ? \end{bmatrix}$$

Replacing some characters, and then removing those same characters, is equivalent to simply removing those characters.

Uses

We don't need to have any idea about the contents of the file we're editing in order to apply any of these theorems. They're all written in terms of $x$ -- any file will do, as long as the instruction sequence is valid for that file. But if the original sequence is valid for our file, our simplified version will also be valid for it

In application of any of these theorems, we need to watch for left- and right- shifts in the data. For example, consider the following sequence of instructions:

A Simple Optimization Example

$$\displaystyle + \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix} + \begin{bmatrix} 17 & 18 & 19 \\ d & e & f \end{bmatrix} - \begin{bmatrix} 5 & 6 & 7 \\ ? & ? & ? \end{bmatrix} \bullet \begin{bmatrix} 14 & 15 \\ g & h \end{bmatrix} - \begin{bmatrix} 0 & 1 & 2 \\ ? & ? & ? \end{bmatrix}$$

We might notice the complementary pair of instructions at the beginning and end:

$$\displaystyle + \begin{bmatrix} 0 & 1 & 2 \\ a & b & c \end{bmatrix} \ \cdot \cdot \cdot \ - \begin{bmatrix} 0 & 1 & 2 \\ ? & ? & ? \end{bmatrix}$$

And we might try to simplify the expression like this, using theorem #4:

$$\displaystyle + \begin{bmatrix} 17 & 18 & 19 \\ d & e & f \end{bmatrix} - \begin{bmatrix} 5 & 6 & 7 \\ ? & ? & ? \end{bmatrix} \bullet \begin{bmatrix} 14 & 15 \\ g & h \end{bmatrix}$$

But this would be wrong! After we inserted the 3 characters "abc" to position 0, the rest of the file was right-shifted by 3 places (the length of the insert). So, if we're deleting that instruction, we have to subtract 3 from every position given between the redundant insert/remove pair.²

$$\displaystyle + \begin{bmatrix} 14 & 15 & 16 \\ d & e & f \end{bmatrix} - \begin{bmatrix} 2 & 3 & 4 \\ ? & ? & ? \end{bmatrix} \bullet \begin{bmatrix} 11 & 12 \\ g & h \end{bmatrix}$$

It would also be easy to miss the fact that we can apply theorem #5 here as well -- the addition and the multiplication are also a redundant pair, but it's less obvious because there is a subtraction operation in-between them. This subtraction removes 3 characters and left-shifts all of the data to the right of it. Therefore, when we're replacing at position 11, that's really the same position that was earlier called 14.

$$\displaystyle + \begin{bmatrix} 14 & 15 & 16 \\ g & h & f \end{bmatrix} - \begin{bmatrix} 2 & 3 & 4 \\ ? & ? & ? \end{bmatrix}$$

This is now the furthest-possible simplification of the instruction sequence, and is exactly equivalent to the original.

In Polonius's terms, this means that the following two sets of instructions will always do exactly the same thing to a given file:

Original	Optimized
INSERT 0 abc INSERT 17 def REMOVE 5 7 REPLACE 14 gh REMOVE 0 2	INSERT 14 ghf REMOVE 2 4

A More Involved Optimization Example

This example is only here to intimidate you -- to show that we can go, even from a really long and complicated sequence of instructions to a short and simple one.

Consider the following expression:

$$\displaystyle x - \begin{bmatrix} 2 & 3 & 4 \\ ? & ? & ? \end{bmatrix} - \begin{bmatrix} 2 & 3 & 4 \\ ? & ? & ? \end{bmatrix} + \begin{bmatrix} 2 & 3 & 4 \\ d & e & f \end{bmatrix} + \begin{bmatrix} 2 & 3 & 4 \\ a & b & c \end{bmatrix} \bullet \begin{bmatrix} 8 & 9 & 10 \\ g & h & f \end{bmatrix} \bullet \begin{bmatrix} 8 & 9 & 10 \\ h & i & j \end{bmatrix} \bullet \begin{bmatrix} 5 & 6 & 7 \\ k & l & m \end{bmatrix} - \begin{bmatrix} 2 & 3 & 4 \\ ? & ? & ? \end{bmatrix} \bullet \begin{bmatrix} 11 & 12 & 13 \\ x & y & z \end{bmatrix} - \begin{bmatrix} 11 & 12 & 13 \\ ? & ? & ? \end{bmatrix}$$

First we can apply Theorem #1 to combine the first two subtractions:

$$\displaystyle x - \begin{bmatrix} 2 & 3 & 4 & 5 & 6 & 7 \\ ? & ? & ? & ? & ? & ? \end{bmatrix} + \begin{bmatrix} 2 & 3 & 4 \\ d & e & f \end{bmatrix} + \begin{bmatrix} 2 & 3 & 4 \\ a & b & c \end{bmatrix} \bullet \begin{bmatrix} 8 & 9 & 10 \\ g & h & f \end{bmatrix} \bullet \begin{bmatrix} 8 & 9 & 10 \\ h & i & j \end{bmatrix} \bullet \begin{bmatrix} 5 & 6 & 7 \\ k & l & m \end{bmatrix} - \begin{bmatrix} 2 & 3 & 4 \\ ? & ? & ? \end{bmatrix} \bullet \begin{bmatrix} 11 & 12 & 13 \\ x & y & z \end{bmatrix} - \begin{bmatrix} 11 & 12 & 13 \\ ? & ? & ? \end{bmatrix}$$

Next, we can apply Theorem #0 to combine those two additions:

$$\displaystyle x - \begin{bmatrix} 2 & 3 & 4 & 5 & 6 & 7 \\ ? & ? & ? & ? & ? & ? \end{bmatrix} + \begin{bmatrix} 2 & 3 & 4 & 5 & 6 & 7 \\ a & b & c & d & e & f \end{bmatrix} \bullet \begin{bmatrix} 8 & 9 & 10 \\ g & h & f \end{bmatrix} \bullet \begin{bmatrix} 8 & 9 & 10 \\ h & i & j \end{bmatrix} \bullet \begin{bmatrix} 5 & 6 & 7 \\ k & l & m \end{bmatrix} - \begin{bmatrix} 2 & 3 & 4 \\ ? & ? & ? \end{bmatrix} \bullet \begin{bmatrix} 11 & 12 & 13 \\ x & y & z \end{bmatrix} - \begin{bmatrix} 11 & 12 & 13 \\ ? & ? & ? \end{bmatrix}$$

We also have a redundant insert/replace pair:

$$\displaystyle \cdot \cdot \cdot + \begin{bmatrix} 2 & 3 & 4 & 5 & 6 & 7 \\ a & b & c & d & e & f \end{bmatrix} \cdot \cdot \cdot \cdot \bullet \begin{bmatrix} 5 & 6 & 7 \\ k & l & m \end{bmatrix}$$

We can remove using Theorem #5, giving us:

$$\displaystyle x - \begin{bmatrix} 2 & 3 & 4 & 5 & 6 & 7 \\ ? & ? & ? & ? & ? & ? \end{bmatrix} + \begin{bmatrix} 2 & 3 & 4 & 5 & 6 & 7 \\ a & b & c & k & l & m \end{bmatrix} \bullet \begin{bmatrix} 8 & 9 & 10 \\ g & h & f \end{bmatrix} \bullet \begin{bmatrix} 8 & 9 & 10 \\ h & i & j \end{bmatrix} - \begin{bmatrix} 2 & 3 & 4 \\ ? & ? & ? \end{bmatrix} \bullet \begin{bmatrix} 11 & 12 & 13 \\ x & y & z \end{bmatrix} - \begin{bmatrix} 11 & 12 & 13 \\ ? & ? & ? \end{bmatrix}$$

While we're at it, we can apply Theorem #2 to simplify those two multiplications:

$$\displaystyle x - \begin{bmatrix} 2 & 3 & 4 & 5 & 6 & 7 \\ ? & ? & ? & ? & ? & ? \end{bmatrix} + \begin{bmatrix} 2 & 3 & 4 & 5 & 6 & 7 \\ a & b & c & k & l & m \end{bmatrix} \bullet \begin{bmatrix} 8 & 9 & 10 \\ h & i & j \end{bmatrix} - \begin{bmatrix} 2 & 3 & 4 \\ ? & ? & ? \end{bmatrix} \bullet \begin{bmatrix} 11 & 12 & 13 \\ x & y & z \end{bmatrix} - \begin{bmatrix} 11 & 12 & 13 \\ ? & ? & ? \end{bmatrix}$$

Let's not forget to apply the (arguably) most useful theorem, Theorem #3, to simplify the subtraction/addition pair at the beginning:

$$\displaystyle x \bullet \begin{bmatrix} 2 & 3 & 4 & 5 & 6 & 7 \\ a & b & c & k & l & m \end{bmatrix} \bullet \begin{bmatrix} 8 & 9 & 10 \\ h & i & j \end{bmatrix} - \begin{bmatrix} 2 & 3 & 4 \\ ? & ? & ? \end{bmatrix} \bullet \begin{bmatrix} 11 & 12 & 13 \\ x & y & z \end{bmatrix} - \begin{bmatrix} 11 & 12 & 13 \\ ? & ? & ? \end{bmatrix}$$

We can apply Theorem #2 here as well to combine those two multiplications in the beginning:

$$\displaystyle x \bullet \begin{bmatrix} 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 & 10 \\ a & b & c & k & l & m & h & i & j \end{bmatrix} - \begin{bmatrix} 2 & 3 & 4 \\ ? & ? & ? \end{bmatrix} \bullet \begin{bmatrix} 11 & 12 & 13 \\ x & y & z \end{bmatrix} - \begin{bmatrix} 11 & 12 & 13 \\ ? & ? & ? \end{bmatrix}$$

Now, we can apply Theorem #6:

$$\displaystyle x \bullet \begin{bmatrix} 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 & 10 \\ a & b & c & k & l & m & h & i & j \end{bmatrix} - \begin{bmatrix} 2 & 3 & 4 \\ ? & ? & ? \end{bmatrix} - \begin{bmatrix} 11 & 12 & 13 \\ ? & ? & ? \end{bmatrix}$$

Apply it again, and combine the subtractions:

$$\displaystyle x \bullet \begin{bmatrix} 5 & 6 & 7 & 8 & 9 & 10 \\ k & l & m & h & i & j \end{bmatrix} - \begin{bmatrix} 2 & 3 & 4 & 11 & 12 & 13 \\ ? & ? & ? & ? & ? & ? \end{bmatrix}$$

Here finally is the simplest possible form of this instruction sequence. In Polonius's terms, this means that the following two sets of instructions will always do exactly the same thing to a given file:

Original	Optimized
REMOVE 2 4 REMOVE 2 4 INSERT 2 def INSERT 2 abc REPLACE 8 ghf REPLACE 8 hij REPLACE 5 klm REMOVE 2 4 REPLACE 11 xyz REMOVE 11 13	REPLACE 5 klmhij REMOVE 2 4; 11 13

The Upshot: Benefits and Drawbacks

It might seem crazy (and I certainly hope it does!) that we can algorithmically go from 10 instructions all the way down to only 2 instructions and have them be exactly equivalent for all files, but this is a natural consequence of the theorems listed above.

In fact, as a direct consequence of Theorems #0, #1, and #2, we know that it's possible to reduce any sequence of instructions, no matter how long (100 instructions, 1 million instructions, etc) down to a maximum of 3 instructions. At absolute most: one INSERT, one REMOVE, and one REPLACE.

In terms of time complexity, that means we're going from $O(S \cdot I)$ complexity to only $O(S)$ complexity (with $I$ referring to the number of inputted instructions, and $S$ referring to the size of the file). To explain:

Each time Polonius executes an INSERT or REMOVE instruction, it must traverse the file piece-by-piece. At its core, Polonius can only really do inserts/removes directly at the end of a file. To insert to the beginning of a file, it has to add some blank space to the end, and chunk-by-chunk move pieces over to the new end until it hits the location of the insert. A similar process is necessary for removes. Replace instructions, for the purpose of this argument, we'll say take 0 time (really only 1 or 2 microseconds generally).

Let's call the time that it takes to traverse a given file $T$. Let's say the file is pretty big, and so traversing it takes a full second. $T = 1$ second.

If we have 100 different INSERT/REMOVE instructions to execute, this will take $100T$ -- or 100 seconds, almost two full minutes before our changes are saved.

But after we simplify that expression to be only 2 instructions (possibly plus one REPLACE instruction, which takes 0 time), it will take only $2T$ -- just 2 seconds.

Further, while 1,000 different such instructions would take $1000T$ and a billion would take $1000000000T$, their respective simplified versions will always take only $2T$. That's a pretty big deal.

Of course, in reality, it takes some time to apply the theorems as well; it takes a little bit of time to simplify the instruction sequence. But for very large files (the kind that Polonius is designed to work with), it's almost always worth it. Besides this, the idea is to have the interactive UI optimize instruction sequences while it's building them, that is, while you're still typing into the file, before you ever hit "save." This way, by the time the UI sends its instructions to polonius-editor, they're already optimized.

Even in the extremely unlikely case that the delay we add in trying to optimize the instructions is somehow equal to the time we save from having those optimized instructions, it's still a benefit in that it reduces the amount of time which is spent actually making changes to the file. Because Polonius edits files in-place, if for some reason it's interrupted before it can finish its job, the files will be corrupted (kind of half-edited). Reducing that window is obviously a good idea.

Implications Beyond Polonius

You might notice that the theorems and the logic behind them are not specific to Polonius or even to file editing. They can be applied to any kind of data structure that has a linear order (like a list or an array) and where the operations are defined in terms of inserting, removing, and replacing elements.

So long as we're able to know what operations will be performed in advance, we can apply these theorems to optimize those operations. This brings the time complexity of the operations down to $O(S)$, where $S$ is the size of the data structure, rather than $O(S \cdot I)$, where $I$ is the number of operations.

It's also however necessary to make sure that it's worth it -- by which I mean: if the time it takes to optimize the instructions is greater than the time it would take to execute them, then we shouldn't bother. Polonius is designed to work with very large files, so the time it takes to optimize the instructions is generally negligible compared to the time it would take to execute them without optimization. If, similarly, you're writing a program that processes large datasets or handles massive contiguous arrays, then the theorems here can be applied to optimize the operations on those datasets.

The gains are substantial, and the logic is sound. The time it takes to perform those operations is no longer a function of the number of operations, rather only of the size of the data structure itself. This is a significant improvement in efficiency and performance if applied in an appropriate context.

In Practice

What Does It Mean to Combine Instructions of Like Type?

Theorems 0, 1 and 2 demonstrate that we can combine any arbitrarily large number of instructions of the same type to a single instruction -- at least conceptually. In practice, however, how can we execute them as a single instruction?

Let's take the case of INSERT instructions. First, we should think about how Polonius actually executes an INSERT. For example, INSERT 3 X:

insert-execution

First, we make room at the end of the file, then we shift all the characters to the right until we reach the position where we want to insert. Finally we can insert the new character(s) at that position.

If we have a sequence of INSERT instructions, we can execute them all in a single pass through the file, so long as the following conditions are met:

The instructions are sorted by their position in the file
Instructions that overlap have been merged (e.g, INSERT 5 abc and INSERT 6 X must be merged into INSERT 5 aXbc)
The instructions are not interrupted by any other operations (i.e, no REMOVE or REPLACE instructions in between them)

These conditions can be met simply by careful application of theorems #0, #1 and #2. If these conditions are met, we can execute the INSERT instructions in a single pass through the file.

For example:

INSERT 3 X
INSERT 8 Y

insert-execution-multiple

We can follow a similar procedure for REMOVE instructions given the same conditions.

Therefore, it's not strictly necessary for the optimizer to entirely combine all the instructions of the same type into a single instruction. All that it needs to do (the bare minimum) is make sure that instructions of the same type are paired together, and that the conditions above are met.

Having met that bare minimum, of course, we might still like to apply further optimizations (such as theorems #3, #4, #5, and #6) to reduce the number of instructions even further. But the above-stated minimum is sufficient to reduce the time complexity of the edit from $O(S \cdot I)$ to $O(S)$.

Footnotes

1: You can see that we would get a different result if we went the other way (inserting right-to-left). The "left-to-right" business matters because after we've inserted the first key, the rest have shifted over. So, the position which we now call #3 isn't the same as what "#3" was before.

2: In fact it's slightly worse than that: we only subtract from the positions between the redundant insert/remove pair that are greater than the position we inserted to. If, for example, we had:

INSERT 10 a
...
REMOVE 10 10

Then we would only subtract from some of the positions in-between them -- that is, the ones $\gt 10$. The original example was specially chosen to be simple, since inserting to position #0 affects the entire file, so that we wouldn't have to worry about this little detail.

Instruction Optimization - rail5/polonius GitHub Wiki

The Basic Idea

The Math: Blocks and their Operations

Addition (INSERT Operations)

Subtraction (REMOVE Operations)

Multiplication (REPLACE Operations)

An Analogy

The Logic: Theorems and Uses

Theorem #0

Theorem #1

Theorem #2

Theorem #3

Theorem #4

Theorem #5

Theorem #6

Uses

A Simple Optimization Example

A More Involved Optimization Example

The Upshot: Benefits and Drawbacks

Implications Beyond Polonius

In Practice

What Does It Mean to Combine Instructions of Like Type?

Footnotes

⚠️ GitHub.com Fallback ⚠️

Instruction Optimization - rail5/polonius GitHub Wiki

The Basic Idea

The Math: Blocks and their Operations

Addition (INSERT Operations)

Subtraction (REMOVE Operations)

Multiplication (REPLACE Operations)

An Analogy

The Logic: Theorems and Uses

Theorem #0

Theorem #1

Theorem #2

Theorem #3

Theorem #4

Theorem #5

Theorem #6

Uses

A Simple Optimization Example

A More Involved Optimization Example

The Upshot: Benefits and Drawbacks

Implications Beyond Polonius

In Practice

What Does It Mean to Combine Instructions of Like Type?

Footnotes

⚠️ **GitHub.com Fallback** ⚠️

⚠️ GitHub.com Fallback ⚠️