Hash Table

Definition

Hash Table

A hash table $T$ is an array of length $m$ where each element $T[i]$, $0 \leq i < m$, is a singly linked list.
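As a minimal sketch of this definition, the following Python class stores keys in an array of $m$ singly linked lists. The names `Node` and `ChainedHashTable`, and the hash function `key % m` for integer keys, are illustrative assumptions and not part of the notes.

```python
from typing import List, Optional


class Node:
    """One element of a singly linked list."""

    def __init__(self, key: int, next: "Optional[Node]" = None) -> None:
        self.key = key
        self.next = next


class ChainedHashTable:
    def __init__(self, m: int) -> None:
        self.m = m
        # T[i] is the head of the list for slot i (None means an empty list).
        self.T: List[Optional[Node]] = [None] * m

    def _h(self, key: int) -> int:
        # Assumed hash function for integer keys.
        return key % self.m

    def insert(self, key: int) -> None:
        i = self._h(key)
        # Prepend the new key to the list stored at slot i.
        self.T[i] = Node(key, self.T[i])

    def contains(self, key: int) -> bool:
        node = self.T[self._h(key)]
        while node is not None:
            if node.key == key:
                return True
            node = node.next
        return False
```

With $m = 5$, the keys 3, 8 and 13 all hash to slot 3 and end up in the same list, so a lookup walks that list instead of failing on the collision.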

Example:

Probing

There are multiple ways to deal with collisions.

Linear Probing

todo

Problem: Primary clustering todo
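This section is still a stub in the notes, so the following is only a sketch under assumptions: it uses the common textbook form $h(k, i) = (h'(k) + i) \bmod m$ with the hypothetical hash function $h'(k) = k \bmod m$, and the helper name `linear_probe_insert` is made up for illustration.

```python
from typing import List, Optional


def linear_probe_insert(T: List[Optional[int]], k: int) -> int:
    """Insert k and return the slot used; raise if the table is full."""
    m = len(T)
    for i in range(m):
        # Assumed probing function: h(k, i) = (h'(k) + i) mod m, h'(k) = k mod m.
        j = (k % m + i) % m
        if T[j] is None:
            T[j] = k
            return j
    raise RuntimeError("hash table is full")


T: List[Optional[int]] = [None] * 7
# All four keys have h'(k) = 3, so each one lands directly behind the previous one.
slots = [linear_probe_insert(T, k) for k in (10, 17, 24, 3)]
```

The keys occupy the contiguous run of slots 3, 4, 5, 6. Any later key that hashes into this run has to walk to its end and then extends it, which is the effect usually called primary clustering.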

Quadratic Probing

Let $h' : K \to \{0, 1, \dots, m - 1\}$ be a normal hash function. The probing function is then:

$$h(k, i) = (h'(k) + c_1 i + c_2 i^2) \bmod m$$

where $c_{1}, c_{2}$ are properly chosen constants.

Problem: Primary clustering is mitigated, but secondary clustering can still occur: keys with the same initial slot $h'(k)$ follow exactly the same probing sequence.
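To make secondary clustering concrete, here is a small sketch that lists the slots probed for a key. The function name `quadratic_probe_sequence`, the hash function $h'(k) = k \bmod m$, and the constants $c_1 = c_2 = 1$ are assumptions chosen only for illustration.

```python
from typing import List


def quadratic_probe_sequence(k: int, m: int, c1: int, c2: int) -> List[int]:
    """Slots h(k, 0), ..., h(k, m - 1) for quadratic probing."""
    h_prime = k % m  # assumed h'(k) = k mod m
    return [(h_prime + c1 * i + c2 * i * i) % m for i in range(m)]


# Keys 3 and 10 share h'(k) = 3 for m = 7.
seq_a = quadratic_probe_sequence(3, 7, 1, 1)
seq_b = quadratic_probe_sequence(10, 7, 1, 1)
```

Both calls return `[3, 5, 2, 1, 2, 5, 3]`: the two keys compete for the same slots in the same order. The sequence also visits only four distinct slots, which shows why the constants have to be chosen with care.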

Double Hashing

Let $h_1, h_2 : K \to \{0, 1, \dots, m - 1\}$ be two hash functions. The probing function is then:

$$h(k, i) = (h_1(k) + i \, h_2(k)) \bmod m$$

with $0 \leq i < m$.

Choice of $h_2(k)$: For all keys $k$, the probing sequence must reach all slots $0, \dots, m - 1$. This means that $h_2(k) \neq 0$ must hold and that $h_2(k)$ must share no common factor with $m$ (merely not dividing $m$ is not enough). If $m$ is a prime number, every $h_2(k)$ with $1 \leq h_2(k) < m$ satisfies this, so $m$ should be prime. $h_2$ should be chosen independently of $h_1$.
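The condition on $h_2(k)$ can be checked by brute force. In the sketch below, `start` stands in for $h_1(k)$ and `step` for $h_2(k)$; both helper names are hypothetical.

```python
from math import gcd
from typing import List


def probe_sequence(start: int, step: int, m: int) -> List[int]:
    """Slots visited by h(k, i) = (start + i * step) mod m for i = 0..m-1."""
    return [(start + i * step) % m for i in range(m)]


def covers_all_slots(step: int, m: int) -> bool:
    """True if the probing sequence reaches every slot of a table of size m."""
    return len(set(probe_sequence(0, step, m))) == m


# For m = 12, a step covers the table exactly when gcd(step, m) == 1.
matches_gcd = all(covers_all_slots(s, 12) == (gcd(s, 12) == 1) for s in range(1, 12))
# For the prime m = 7, every step 1..6 covers the table.
prime_ok = all(covers_all_slots(s, 7) for s in range(1, 7))
```

For example, with $m = 12$ the step 8 does not divide $m$, yet the sequence only visits the slots 0, 8 and 4, because 8 and 12 share the factor 4.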

Brent’s Improvement

Idea: Whenever a key $k$ is to be inserted at a probed slot $j$ that is already occupied by a key $k' = T[j].\mathrm{key}$, set:

$$j_1 = (j + h_2(k)) \bmod m, \qquad j_2 = (j + h_2(k')) \bmod m$$

If $j_1$ is occupied but $j_2$ is free, then move $k'$ to $j_2$ to free the position $j$ for $k$. Otherwise, continue probing for $k$ at $j_1$.

Example:

Benefit: The average-case runtime of a successful search is in $\Theta(1)$, even in the worst case of a load factor $\alpha = 1$.

Algorithm:

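def insert_brent(T: HashTable, k: Key) -> None:
	# Assumes T has at least one free slot, supports len(T), and that
	# h1 and h2 are the two hash functions from the double hashing section.
	m = len(T)
	j = h1(k)
	while T[j].status == "used":
		k_prime = T[j].key
		# Next slot on the probing sequence of k and of k_prime, respectively.
		j1 = (j + h2(k)) % m
		j2 = (j + h2(k_prime)) % m
		if T[j1].status != "used" or T[j2].status == "used":
			# Keep probing for k.
			j = j1
		else:
			# j1 is occupied but j2 is free: k takes slot j, k_prime moves to j2.
			T[j].key = k
			k = k_prime
			j = j2
	T[j].key = k
	T[j].status = "used"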
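The algorithm can be tried out with a small self-contained sketch. Everything beyond the insertion loop is an assumption made for illustration: the `Slot` dataclass, the table size `M = 7`, the hash functions $h_1(k) = k \bmod 7$ and $h_2(k) = 1 + (k \bmod 5)$, and the helper `search`, which counts probes.

```python
from dataclasses import dataclass
from typing import List, Optional

M = 7  # assumed table size (prime)


@dataclass
class Slot:
    key: Optional[int] = None
    status: str = "free"


def h1(k: int) -> int:
    return k % M


def h2(k: int) -> int:
    return 1 + k % (M - 2)  # always in 1..5, never 0


def insert_brent(T: List[Slot], k: int) -> None:
    """Insert k with Brent's variant of double hashing (table must not be full)."""
    m = len(T)
    j = h1(k)
    while T[j].status == "used":
        k_prime = T[j].key
        j1 = (j + h2(k)) % m        # next slot for k
        j2 = (j + h2(k_prime)) % m  # next slot for the key already stored at j
        if T[j1].status != "used" or T[j2].status == "used":
            j = j1                  # keep probing for k
        else:
            T[j].key = k            # k takes over slot j
            k = k_prime             # the displaced key goes to the free slot j2
            j = j2
    T[j].key = k
    T[j].status = "used"


def search(T: List[Slot], k: int) -> int:
    """Return the number of probes needed to find k, or -1 if k is absent."""
    j = h1(k)
    for probes in range(1, len(T) + 1):
        if T[j].status != "used":
            return -1
        if T[j].key == k:
            return probes
        j = (j + h2(k)) % len(T)
    return -1


T = [Slot() for _ in range(M)]
for key in (12, 6, 5):
    insert_brent(T, key)
```

Key 5 collides with 12 at slot 5. Its next slot $j_1 = 6$ is occupied by 6, while $j_2 = 1$ is free, so 12 moves to slot 1 and 5 takes slot 5. Afterwards 5 and 6 are found with one probe each and 12 with two; under plain double hashing with the same functions, 5 would have needed three probes (slots 5, 6, 0).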

Lukas' Notes
