Data structures - Splay tree

A splay tree is a self-adjusting binary search tree with one defining behavior: every time you access a node — whether you are searching, inserting, or deleting — that node is moved to the root. This movement is called splaying, and it is what makes the data structure both elegant and practically useful.

Splay trees were introduced by Daniel Sleator and Robert Tarjan in 1985. Unlike AVL trees or red-black trees, they require no extra bookkeeping per node — no height, no color, no balance factor. The structure stays efficient through the splay operation alone.

Definition

A splay tree is a binary search tree (BST): for every node, all keys in the left subtree are smaller than the node's key and all keys in the right subtree are larger. What sets it apart is an operational rule rather than a structural invariant: every access moves the accessed node to the root via a sequence of rotations.

There is no explicit balance guarantee. At any point, a splay tree might be completely unbalanced, so a single operation can take O(n) time in the worst case. But splaying keeps recently accessed nodes near the root, while nodes that have not been accessed for a while drift toward the leaves. Over a sequence of operations, this self-adjusting behavior yields amortized O(log n) time per operation.

Key properties

No stored balance information: nodes contain only a key, a value, and left/right child pointers (some implementations also keep a parent pointer to simplify splaying)
Self-adjusting: the structure reorganizes itself on every access
Amortized efficient: any sequence of m operations on a tree with n nodes costs O((m + n) log n) time in total
Working-set property: recently accessed items are fast to access again
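
To make the first property concrete, here is a minimal sketch of a node type. The class name Node and the field names are assumptions chosen for illustration; the point is only that there is no height, color, or balance field.

```python
from dataclasses import dataclass, fields
from typing import Any, Optional


@dataclass
class Node:
    """A splay tree node: key, value, and two child pointers. Nothing else."""
    key: Any
    value: Any = None
    left: Optional["Node"] = None
    right: Optional["Node"] = None


leaf = Node(key=7, value="seven")
field_names = [f.name for f in fields(Node)]   # the complete per-node state
```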

The Splay Operation

Splaying moves a target node x to the root through repeated rotations. The operation applies one of three cases at each step, chosen based on the relationship between x, its parent p, and its grandparent g.

Case 1: Zig (x’s parent is the root)

When p is the root, perform a single rotation: rotate x over p. This is the terminal step — it happens at most once per splay.

    p              x
   / \            / \
  x   C    →    A   p
 / \                / \
A   B              B   C
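
The zig step is an ordinary BST rotation. The sketch below reproduces the diagram above with integer keys standing in for A, x, B, p, and C. The names Node, rotate_right, rotate_left, and inorder are hypothetical helpers, and the nodes carry no value field to keep the example short.

```python
class Node:
    def __init__(self, key, left=None, right=None):
        self.key, self.left, self.right = key, left, right


def rotate_right(p):
    """Rotate p's left child x over p and return x, the new subtree root."""
    x = p.left
    p.left = x.right      # subtree B moves from x to p
    x.right = p
    return x


def rotate_left(p):
    """Mirror image: rotate p's right child x over p and return x."""
    x = p.right
    p.right = x.left
    x.left = p
    return x


def inorder(n):
    """Keys in sorted order; used to check that the BST property survives."""
    return [] if n is None else inorder(n.left) + [n.key] + inorder(n.right)


# Keys chosen so that A < x < B < p < C, as in the diagram.
A, B, C = Node(1), Node(3), Node(5)
x = Node(2, A, B)
p = Node(4, x, C)

root = rotate_right(p)    # zig: x is the left child of the root p
```

After the rotation x is the root, p is its right child, and B has moved under p. When x is a right child, rotate_left plays the same role.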

Case 2: Zig-zig (x and p are both left children, or both right children)

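When x and p are on the same side, rotate p over g first, then rotate x over p. The order is the key distinction from a naive “rotate x up one level at a time” strategy: the naive order also brings x to the root but can leave the rest of the access path almost as deep as before, whereas the zig-zig order roughly halves the depth of the nodes along that path.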

      g              x
     / \            / \
    p   D          A   p
   / \        →       / \
  x   C              B   g
 / \                     / \
A   B                   C   D
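
The rotation order can be checked on the left-left diagram above. This is a sketch under assumptions: Node, rotate_right, build, zig_zig, and naive are illustrative names, and only the left-left case is shown (the right-right case mirrors it with left rotations).

```python
class Node:
    def __init__(self, key, left=None, right=None):
        self.key, self.left, self.right = key, left, right


def rotate_right(top):
    """Rotate top's left child over top and return the new subtree root."""
    child = top.left
    top.left = child.right
    child.right = top
    return child


def build():
    # Left-left shape from the diagram; keys satisfy A < x < B < p < C < g < D.
    x = Node(2, Node(1), Node(3))     # x with subtrees A and B
    p = Node(4, x, Node(5))           # p with children x and C
    return Node(6, p, Node(7))        # g with children p and D


def zig_zig(g):
    p = rotate_right(g)               # first rotate p over g
    return rotate_right(p)            # then rotate x over p


def naive(g):
    g.left = rotate_right(g.left)     # rotate x over p
    return rotate_right(g)            # then rotate x over g


a = zig_zig(build())   # x(A, p(B, g(C, D))), the shape in the diagram
b = naive(build())     # x(A, g(p(B, C), D)), a different shape
```

Both orders put x at the root, but they arrange p and g differently beneath it. On a three-node path the difference looks cosmetic; it is on long access paths that the zig-zig order pays off.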

Case 3: Zig-zag (x is a left child and p is a right child, or vice versa)

When x and p are on opposite sides, rotate x over p, then rotate x over g. This is equivalent to a double rotation (same as in AVL trees).

    g              x
   / \            / \
  p   D          p   g
 / \        →   / \ / \
A   x          A  B C  D
   / \
  B   C
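
The zig-zag step for the diagram above (p is a left child, x is a right child) can be sketched as two rotations. The helper names are assumptions for illustration; the mirrored case swaps the two rotation directions.

```python
class Node:
    def __init__(self, key, left=None, right=None):
        self.key, self.left, self.right = key, left, right


def rotate_right(top):
    """Rotate top's left child over top and return the new subtree root."""
    child = top.left
    top.left = child.right
    child.right = top
    return child


def rotate_left(top):
    """Rotate top's right child over top and return the new subtree root."""
    child = top.right
    top.right = child.left
    child.left = top
    return child


def zig_zag(g):
    g.left = rotate_left(g.left)   # rotate x over p
    return rotate_right(g)         # then rotate x over g


# Keys satisfy A < p < B < x < C < g < D, as in the diagram.
x = Node(4, Node(3), Node(5))      # x with subtrees B and C
p = Node(2, Node(1), x)            # p with children A and x
g = Node(6, p, Node(7))            # g with children p and D

root = zig_zag(g)                  # x(p(A, B), g(C, D))
```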

The splay terminates when x reaches the root.
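
Putting the three cases together, a bottom-up splay can be sketched as a loop. This version assumes each node also stores a parent pointer, which is a convenience of the sketch rather than something the structure requires; Node, rotate_up, and splay are illustrative names.

```python
class Node:
    def __init__(self, key):
        self.key = key
        self.left = self.right = self.parent = None   # parent: assumption of this sketch


def rotate_up(x):
    """Rotate x over its parent p, keeping all parent pointers consistent."""
    p, g = x.parent, x.parent.parent
    if p.left is x:                  # x is a left child: right rotation
        p.left = x.right
        if x.right is not None:
            x.right.parent = p
        x.right = p
    else:                            # x is a right child: left rotation
        p.right = x.left
        if x.left is not None:
            x.left.parent = p
        x.left = p
    p.parent, x.parent = x, g
    if g is not None:                # reattach x where p used to hang
        if g.left is p:
            g.left = x
        else:
            g.right = x


def splay(x):
    """Move x to the root and return it."""
    while x.parent is not None:
        p, g = x.parent, x.parent.parent
        if g is None:
            rotate_up(x)                          # zig
        elif (g.left is p) == (p.left is x):
            rotate_up(p)                          # zig-zig: p over g first,
            rotate_up(x)                          # then x over p
        else:
            rotate_up(x)                          # zig-zag: x over p,
            rotate_up(x)                          # then x over g
    return x


# A degenerate left chain 5 -> 4 -> 3 -> 2 -> 1; splay the deepest node.
nodes = [Node(k) for k in range(1, 6)]
for child, parent in zip(nodes, nodes[1:]):
    parent.left, child.parent = child, parent

root = splay(nodes[0])   # two zig-zig steps bring key 1 to the root
```

In this example the node with key 1 ends up at the root and the remaining nodes are no longer a single chain.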

Common Operations

Search

To search for key k:

1. Walk the tree from the root following BST rules: go left if k is smaller, right if larger.
2. If k is found, splay that node to the root.
3. If k is not found, splay the last node visited (the node where the search terminated) to the root.

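The splay on a failed search is not wasted work. The last node visited holds the predecessor or successor of k among the keys in the tree, so splaying it improves locality for future accesses near k. It also shortens the path that was just walked, which is what lets the amortized bound cover unsuccessful searches too.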

def search(tree, k):
    if tree.root is None:                # empty tree: nothing to find or splay
        return None
    node = bst_find(tree.root, k)        # BST walk: the match, or the last node visited
    tree.root = splay(node)              # move that node to the root
    return tree.root if tree.root.key == k else None
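
The search routine relies on a helper that returns the last node visited when k is absent. One way bst_find might look is sketched below. Its exact contract is an assumption inferred from the steps above, and the Node class here is a stand-in for whatever node type the tree uses.

```python
class Node:
    def __init__(self, key, left=None, right=None):
        self.key, self.left, self.right = key, left, right


def bst_find(root, k):
    """Return the node holding k, or the last node visited if k is absent.

    Returns None only when the tree is empty.
    """
    node, last = root, None
    while node is not None:
        last = node
        if k == node.key:
            return node
        node = node.left if k < node.key else node.right
    return last


#       4
#      / \
#     2   6
#    / \
#   1   3
root = Node(4, Node(2, Node(1), Node(3)), Node(6))

hit = bst_find(root, 3)     # key present: the matching node
miss = bst_find(root, 5)    # key absent: the walk ends at node 6
```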