Basic graph theory: trees (CSCI 2824, Spring 2015)

In this lecture, we will study trees:

  • Basic Definition

  • Properties of trees with some proofs.

Trees

You have most probably studied rooted trees and binary search trees as a data-structure for organizing lists of numbers and efficiently performing insertions and deletions. However a tree is a useful structure in many other parts of CS:

  • Game trees in AI (how can we make computers play games like chess, go,…).

  • Decision trees in ML.

  • Spanning trees in network routing.

  • Parse trees for compilers.

  • The list goes on and on

Let us start with the simplest kind of trees: unrooted and undirected trees.

Definition

A (unrooted) tree G is an undirected graph G:(V,E) such that

  • G is fully connected (the entire graph is a maximally connected component),

  • G is acyclic (there are no cycles in G).

A rooted tree G is a fully connected, acyclic graph with a special node v in V that is called the root of the tree. You may have studied rooted trees in your data structures class. With a root, it is possible to define a parent and children for each node. But without a root, we will regard the tree simply as a connected, acyclic graph.

Examples

Here are some examples of unrooted trees:

100 
300 
100 

Non-Examples

The following graph is not a tree. It has a cycle:

100 

Here is another example that has more than one maximal connected components and is not a tree:

100 

Properties of Trees

Leaves of a Tree

A leaf of an unrooted tree is a node that has a degree 1. Let us write down the leaves of the following tree examples:

100 

Leaves are {1,6,7,8,9}.

300 

Leaves are {1,3,4,5,6,7,8,9}.

100 

Leaves are {3,4,5}.

Claim Every tree has a leaf.

Proof

If the tree just has one node, then it is trivially a leaf. If the tree has two nodes connected by an edge, both nodes are leaves. Let us focus on trees with three or more nodes.

Let us assume that a tree T that does not have a leaf exists. Therefore for every node v in this tree mathsf{degree}(v) geq 2. We are going to show that T has a cycle, yielding a contradiction with the assumption that it is a tree.

Start from any node v_0 in the tree and do a walk as follows:

  • Take any edge out for v_0 to reach v_1.

  • For any node v_i, take an edge other than the v_{i-1} --- v_i that we took to enter v_i. Since mathsf{degree}(v_i) geq 2, such a vertex is always available.

Note that the walk above can be continued for arbitrarily many steps. However, since number of vertices is finite, the walk repeats a vertex. Let for some i geq 0, v_i = v_{i+k}. We can now conclude the existence of a cycle with v_i in it. Therefore T is not a tree, yielding a contradiction.

Thus, we have concluded that every tree has a leaf.

Number of Edges

Claim A tree with n nodes has n-1 edges.

Proof

Proof is by weak induction on the number of nodes n.

Base Case: Take any tree with 1 node. There is just one such tree and it has 1-1=0 edges.

Inductive Hypothesis: Let us assume that all trees with n nodes have n-1 edges. We will show that all trees with n+1 nodes have n edges.

Take some tree T_{n+1} with n+1 nodes. It must have a leaf v. Removing the leaf gives us a tree with n nodes that must have n-1 edges in it. The leaf itself was connected to the rest of the tree by one edge. Therefore T_{n+1} has n edges.

Number of Leaves

Claim Any tree has at least two leaves.

We will prove this in class by the following argument. We already know that any tree has at least one leaf.

Proof

Let us assume that there is a tree with n > 1 nodes and exactly one leaf v.

  • Therefore all nodes other than v have degree geq 2.

  • Sum of degrees of all nodes geq  1 + 2 (n-1) = 2 n -1.

  • However, we know that sum of degrees of all nodes = 2 * number of edges.

  • number of edges geq lceil{frac{2n -1}{2}} rceil geq n.

  • However a tree has precisely n-1 edges.

  • This leads to a contradiction.

Paths in Trees

Claim Let u and v be two nodes in a tree. There is precisely one path from u to v in the tree.

Argument in class Since the tree is connected component itself, there has to be at least one path from u to v.

We will argue that having two distinct paths will necessarily imply that the tree has a cycle which will lead to a contradiction.