Capturing the Origin with Random Points: Generalizations of a Putnam Problem

College Mathematics Journal, 27 (1996) no. 3, 186-192.

Ralph Howard
Department of Mathematics
University of South Carolina
Columbia SC 29208

and

Paul Sisson
Department of Mathematics
LSU - Shreveport
Shreveport LA 71115

1 Introduction

Problem A-6 of the 53^rd Putnam Competition read as follows:

Four points are chosen at random on the surface of a sphere. What is the probability that the center of the sphere lies inside the tetrahedron whose vertices are at the four points? (It is understood that each point is independently chosen relative to a uniform distribution on the sphere.)

The problem has a geometric immediacy that makes it tantalizing: the tetrahedron so formed is readily visualized and no great mathematical background is necessary to understand the question being asked. Further, it is almost impossible to resist the urge to generalize the problem. Some of the variants that spring to mind quickly are:

(1): Suppose n+1 points are chosen at random from the surface of a ball in Rⁿ. What is the probability that the center of the ball lies inside the simplex in Rⁿ whose vertices are the n+1 points (i.e. the convex hull of the n+1 points)?
(2): Four points are chosen at random from within a ball in R³ (or n+1 points from an n-ball in Rⁿ). What is the probability that the center of the ball lies within the convex hull of the points?
(3): Four points are chosen at random from the surface of some other object in R³ (or n+1 points from the surface of some object in Rⁿ). What is the probability that a fixed interior point of the object lies inside the convex hull of the four (respectively, n+1) points?
(4): More vaguely, assume the action is centered about the origin in Rⁿ, and that n+1 points are chosen ``at random'' in Rⁿ. What is the probability that the convex hull of the n+1 points contains the origin?

The list can easily be extended, but as question (4) demonstrates we have already reached the point where the questions need to be more carefully posed.

Despite the fact that the original Putnam question is so easily understood, the solution is (not surprisingly) not arrived at with equal ease. This sentiment is supported by the fact that 123 of the top 203 scorers on the Putnam exam submitted no solution at all to problem A-6, and a relatively low number of 9 of the top scorers received a full 10 points for the problem. This difficulty in answering such an easily grasped problem just makes it more intriguing, of course, and suggests that the problem and its generalizations are worth investigating. In this paper we will develop a surprisingly simple answer to questions (1) and (2). In addition, our result answers rather general forms of questions (3) and (4).

In [3], Klosinski, Alexanderson and Larson offer the following solution to A-6. Assume the sphere is centered at the origin, and that the first point P₀ is located at the north pole of the sphere, with the three remaining points then located at random locations on the sphere. We can assume that these remaining points are chosen in a two-step process: first a diameter P_i1P_i2 (i Î {1,2,3}) is fixed and then one of the two end-points {P_i1,P_i2} is selected as a vertex of the tetrahedron. Figure 1 below illustrates a typical orientation of the choices. The eight possible tetrahedra P₀P_1j₁P_2j₂P_3j₃ (with each j_i being 1 or 2) are equally likely. Further, we can assume that the result is an honest tetrahedron and that the origin does not lie on any face. (Recall that the plane through three noncollinear points P₁, P₂ and P₃ consists of all affine combinations

a₁

Ž
P

+ a₂

Ž
P

+ a₃

Ž
P

where a₁+a₂+a₃ = 1. With probability one, neither the fourth vertex nor the origin lies in the plane through any three vertices.)

Figure 1: Typical choice of vertices.

In particular, the four vertex vectors

Ž
P

and

Ž
P

must be linearly dependent, so there exists a 4-tuple (w,x,y,z) for which

Ž
0

= w

Ž
P

+ x

Ž
P

+ y

Ž
P

+ z

Ž
P

and for which w,x,y and z are all non-zero. Then since

Ž
P

= -

Ž
P

the eight equations

Ž
0

= w

Ž
P

+ x

Ž
P

1j₁

+ y

Ž
P

2j₂

+ z

Ž
P

3j₃

have the solutions

(w,x,y,z),(w,x,y,-z),(w,x,-y,z),(w,-x,y,z),

(w,x,-y,-z),(w,-x,-y,z),(w,-x,y,-z),(w,-x,-y,-z).

Each point in the tetrahedron with vertices P₀, P_1j₁, P_2j₂,P_3j₃ can be uniquely represented as a convex combination

b₀

Ž
P

+ b₁

Ž
P

1j₁

+ b₂

Ž
P

2j₂

+ b₃

Ž
P

3j₃

(where each b_i ł 0 and b₀ + b₁ + b₂ +b₃ = 1), so the origin is contained in the tetrahedron P₀P_1j₁P_2j₂P_3j₃ if and only if the 4-tuple solving the associated vector equation

Ž
0

= w

Ž
P

+ x

Ž
P

1j₁

+ y

Ž
P

2j₂

+ z

Ž
P

3j₃

consists of four coordinates of the same sign. Since only one of the above eight solutions has this property, only one of the eight equally likely tetrahedra contains the origin, and hence the probability that the origin is contained in the randomly chosen tetrahedron is 1/8.

2 First Generalization

So far, so good. This solution generalizes in the obvious way and gives us the answer of 1/2ⁿ to question (1) in the above list. But what of question (2)? The above approach seems inadequate in this case, since points can now be chosen anywhere along the randomly chosen diameters.

Let us employ one of the standard procedures when faced with a difficult problem: that of changing the problem to something easier. We will attempt first to answer question (2) in R². Specifically, if three points are chosen at random from the unit disk B², what is the probability that the triangle thus formed contains the origin? Let us further simplify the problem by assuming that we are choosing three points at random with respect to a probability measure P on B² which is rotationally invariant ; that is, measures of subsets of B² are unchanged under rotational translations. We will also continue to assume the appropriate degree of non-degeneracy of the measure (more on this in the next section).

Since we are assuming rotational invariance, we can assume that the first point P₁ is fixed between 0 and 1 on the positive x-axis. With probability one, the second point P₂ of the triangle is not located at the origin, and we can form the ray starting at the origin and passing through P₂. Let q be the angle between the positive x-axis and this ray. The question can now be posed as a conditional probability problem: given q, what is the probability that the third point P₃ defines a triangle which contains the origin? Integrating this probability over all possible q's will then give us the answer we seek.

In order to simplify our work, let us agree upon some notation. Given a point P in B² -{(0,0)}, let Q(P) denote the angle from the positive x-axis to the ray beginning at the origin and passing through P (see Figure 2). Thus, q₁ Ł Q(P) Ł q₂ will indicate that P lies in the sector of B² defined by the angles q₁ and q₂. Let P(capture) denote the probability that the origin is captured within the triangle formed by the three points P₁, P₂ and P₃. Thus, the first task is to calculate P(capture | Q(P₂) = q), for each q Î [0,2p].

Figure 2: Illustration of Q(P₂) for a typical P₂.

Suppose first that 0 Ł q Ł p. It is not difficult to see that a necessary and sufficient condition for capture is that p Ł Q(P₃) Ł p+ q. That is, the ray from the origin to P₃ must pass through S¹ (the boundary of B²) at a point between p units and p+ q units, as measured from the positive x-axis. Since the length of this arc is q, this conditional probability is q/2p, i.e. P(capture | Q(P₂) = q) = q/2p. Similarly, if p Ł q Ł 2p, P(capture | Q(P₂) = q) = 1 - q/2p.

We can now approximate our solution with an appropriate Riemann sum. Let { q₀, ź, q_n } be a partition of [0,2p]. Then

P(capture)

n-1
ĺ
i = 0

P(capture | q_i Ł Q(P₂) Ł q_i+1) P(q_i Ł Q(P₂) Ł q_i+1)

n-1
ĺ
i = 0

P(capture | q_i Ł Q(P₂) Ł q_i+1)

Dq_i

In the limit of finer and finer partitions, we obtain

P(capture)

ó
ő

2p

0

P(capture | Q(P₂) = q)

ó
ő

p

0

ó
ő

2p

p

ć
ç
č

1 -

ö
÷
ř

= 1/4.

Examination of this argument shows that we have answered more than we set out to, since the fact that P is a probability measure on B² is really irrelevant. As long as P is a probability measure on R² which is rotationally invariant and suitably non-degenerate, the result is the same. We are already aware of one consequence of this: if P is a uniformly distributed probability measure on S¹, the method of Klosinski, Alexanderson and Larson tells us that with probability 1/4 the origin will be contained in a randomly chosen triangle. We can also begin to make sense of question (4) by noting that if P is the usual Gaussian probability measure on all of R², the probability that three randomly chosen points captures the origin is again 1/4.

A related problem in geometric probability, whose many variants are dealt with in [1], [2], [4], [5] and [6], is to find the probability that three points chosen at random from a region in the plane will form an acute triangle. One version can be easily answered here. Since the origin is captured by three points chosen at random from the unit circle if and only if the three points form an acute triangle, the probability that an acute triangle is formed by three points chosen at random from S¹ is also 1/4.

The results above suggest that under rather general circumstances n+1 points chosen randomly from a region in Rⁿ which is symmetric with respect to the origin will capture the origin with probability 1/2ⁿ. Our main result gives conditions which guarantee the validity of this conclusion, and thus provides answers to questions (2), (3) and (4).

3 Second Generalization

We begin with a theorem, in which we finally describe the amount of non-degeneracy of the measures that we require. We also discard rotational-invariance for a weaker condition.

Theorem 1 Let Rⁿ be endowed with a probability measure m which is symmetric with respect to the origin and such that when n+1 points are chosen independently with respect to m, with probability one their convex hull is a simplex. Then the probability that the origin is contained in the simplex generated by n+1 such random points is 1/2ⁿ.

Recall that a measure m is symmetric if m(-S) = m(S) for any measurable set S. Also, n+1 points x₁,x₂,ź,x_n+1 from Rⁿ are vertices of a simplex if and only if none of these points lies on a hyperplane containing the other n points. Thus simplexes in R² are triangles and simplexes in R³ are tetrahedra.

As examples of how the theorem can be applied, we could let m be a uniformly distributed probability measure on a rectangle centered at the origin in R², or on the boundary of such a rectangle. In either case, if three points are chosen at random with respect to the measure, the probability that the origin is contained in the triangle thus formed is 1/4. In R³, the original Putnam answer of 1/8 applies to four points chosen at random from a cube, or from the surface of a cube, as well as from the sphere. More generally, let D be any domain in Rⁿ with finite volume and which is symmetric with respect to the origin. Then the probability that the origin is in the convex hull of n+1 points chosen uniformly and independently from D is 1/2ⁿ. Note that, as in Figure 3, it is not necessary for D to contain the origin, or even for D to be connected.

Figure 3: A disconnected domain D to which the theorem applies.

We will now proceed with the proof of the theorem.

Proof: Begin by defining a new probability space W = Rⁿ ×ź×Rⁿ (n+1 factors) with the product measure m×ź×m. Let

A = {(x₁, ź, x_n+1) Î W | the origin ofRⁿ is in the convex hull of x₁, ź, x_n+1}

For each set of indices I Ě {1, ź, n+1} let

A_I = {w Î W | w_I Î A},

where for w = (x₁,x₂,ź,x_n+1) we define w_I = (y₁,y₂,ź,y_n+1) with

y_i =

ě
í
î

-x_i

if i Î I

x_i

if i Î I^c.

Thus A = A_Ć, where Ć is the empty set of indices.

Our proof now rests on four observations.

(i): A_I = A_I^c where I^c is the complement of I in {1,ź, n+1}
(ii): A_I and A_J are essentially disjoint if J is neither I nor I^c
(iii): W = Č{A_I | I Ě {1, ź, n+1}}
(iv): The sets A_I all have equal measure.

To prove (i), suppose w Î A_I, with w = (x₁, x₂, ź, x_n+1). Then w_I Î A, i.e.

b_iy_i =

Ž
0

with 0 Ł b_i Ł 1 for each i and ĺb_i = 1. But then

b_i(-y_i) =

Ž
0

and since

-y_i =

ě
í
î

-x_i

if i Î I^c

x_i

if i Î I

it follows that w_I^c Î A, so w Î A_I^c. Of course, the same argument shows that A_I^c Ě A_I as well, so A_I = A_I^c.

Next, let x₁, ź, x_n+1 be chosen independently with respect to m from Rⁿ, and let w = (x₁, ź, x_n+1). Because of the hypothesis that with probability one none of the random points x_i is in the hyperplane spanned by the other n points, the origin has a representation as an affine combination of x₁,x₂,ź,x_n+1 that is almost surely unique, say

Ž
0

a_ix_i,

where ĺa_i = 1. Thus, except possibly for w in a set of measure zero in W, the set I = {i | a_i Ł 0} for the above representation is uniquely determined by w, and picks out the set A_I = A_I^c of which w is a member. Thus observation (ii) is proved.

Observation (iii) follows from noting that for any n+1 randomly chosen points x₁,x₂,ź,x_n+1 from Rⁿ, the zero vector can be expressed as an affine combination of these points, and the set A_I = A_I^c which contains w = (x₁,x₂,ź,x_n+1) is essentially uniquely determined by this expression. Finally, (iv) follows from the hypothesis that m is invariant under reflection through the origin, so that for any index set I the mapping wŽ w_I on W that changes the signs of the coordinates in I preserves the product measure. This mapping sends A_I onto A, so m(A_I) = m(A) for all I.

The index set Á = {I | I Ě {1,ź,n+1}} has 2ⁿ⁺¹ elements, but A_I = A_I^c so our claims show that W can be written as an essentially disjoint union of 2ⁿ subsets each with the same measure. Thus each of these subsets has measure 1/2ⁿ. In particular A has measure 1/2ⁿ, completing the proof of the theorem. ¨

After discovering the above proof, we learned that our result for points chosen from the surface of the unit sphere in Rⁿ could be deduced from a result due to J. G. Wendel and found in [8]. Wendel showed that if N points are chosen at random from the surface of the unit sphere in Rⁿ, the probability that all N points lie in the same hemisphere is given by

p_n,N = 2^-N+1

n-1
ĺ
k = 0

ć
ç
č

N-1

ö
÷
ř

If we let N = n+1, then the probability that the origin is contained in the convex hull of n+1 points chosen at random from the unit sphere in Rⁿ is 1 - p_n,n+1, which as the reader can verify is 1/2ⁿ. A referee pointed out another interestingly odd result related to our investigation that can be found on page 124 in [7]: If five points are chosen at random from a ball in R³, the probability that one of them is contained in the tetrahedron generated by the other four is 9/143.

The authors are grateful to the editor and referees for many useful suggestions.

References

[1]: G. R. Hall, Acute Triangles in the n-Ball, J. Appl. Prob., 19 (1982) pp 712-715.
[2]: D. G. Kendall and W. S. Kendall, Alignments in Two-Dimensional Random Sets of Points, Adv. Appl. Prob., 12 (1980) pp 380-424.
[3]: L.F. Klosinski, G.L. Alexanderson and L.C. Larson, The Fifty-Third William Lowell Putnam Mathematical Competition, Am. Math. Monthly, 100 (1993) pp 755-767.
[4]: E. Langford, The Probability that a Random Triangle is Obtuse, Biometrika, 56 (1969) pp 689-690.
[5]: E. Langford, A problem in Geometrical Probability, Math Mag., 43 (1970) pp 237-244.
[6]: L. A. Santaló, Integral Geometry and Geometric Probability, Addison-Wesley, Reading, Massachusetts (1976).
[7]: H. Solomon Geometric Probability, SIAM, Philadelphia, Pennsylvania (1978).
[8]: J. G. Wendel, A Problem in Geometric Probability, Math. Scand., 11 (1962) pp 109-111.

File translated from T_EX by T_TH, version 2.32.
On 10 Oct 1999, 13:00.