Hunter Liu's Website

7. Week 6: Integrals and Meshes

≪ 6. Week 5: Integrals and Limits | Table of Contents | 8. Week 7: Metrising the Space of Continuous Functions ≫

I suppose I should allot some time for midterm review things…

One problem from the practise midterm I’d like to highlight is the following:

Problem 1.

Show that if f:[0,1][0,1]2f : [0, 1]\to [0, 1]^2 is continuous and onto, then it cannot be one-to-one.

In other words, there is no continuous bijection f:[0,1][0,1]2f: [0, 1]\to \left[ 0,1 \right]^2. A highlight of this problem is the following fact: if ff is continuous and bijective on a compact domain, its inverse is also continuous. This is not true of non-compact domains — make sure you know some counterexamples.

Since continuous functions preserve topological features in one direction or another, continuous bijections with continuous inverses (known as homeomorphsims) identify topological features of the domain and range: open sets and closed sets are “paired up”, connectedness for one implies connectedness of the other, etc. With this in mind, one may search for topological features of [0,1][0, 1] that are not shared by [0,1]2[0, 1]^2 to get a contradiction.

Solution 1
If f:[0,1][0,1]2f : [0, 1]\to \left[ 0,1 \right]^2 is a continuous bijection, its inverse f1:[0,1]2[0,1]f ^{-1} : \left[ 0,1 \right]^2 \to \left[ 0,1 \right] is continuous too. Restricting this to the smaller domain f1:[0,1]2{f1(12)}[0,1]{12}f ^{-1} : \left[ 0,1 \right]^2\setminus \left\lbrace f ^{-1}\left( \frac{1}{2} \right) \right\rbrace \to \left[ 0,1 \right]\setminus \left\lbrace \frac{1}{2} \right\rbrace gives a continuous surjection from a connected domain to a disconnected image, a contradiction. \square

This solution argues that the unit square and unit interval are topologically distinguishable, as the latter becomes disconnected upon removing a point while the former remains connected.

Here’s another cute solution that shows that the unit square and unit cube are topologically distinguishable by size:

Solution 2
Suppose f:[0,1][0,1]2f : [0, 1]\to \left[ 0,1 \right]^2 is a continuous bijection; its inverse is also continuous. Then, if ιx:(0,1)[0,1]2\iota _x : (0, 1)\to [ 0, 1]^2 is given by y(x,y)y\mapsto (x, y), we have the composite (0,1)ιx[0,1]2f1[0,1](0, 1) \xrightarrow{\iota _x} \left[ 0,1 \right]^2 \xrightarrow{f ^{-1} } [0, 1] is continuous and injective. Hence, by the intermediate value theorem, f1({x}×(0,1))f ^{-1} \left( \left\lbrace x \right\rbrace\times (0, 1) \right) is a nonempty open interval, and these are pairwise disjoint. But [0,1][0, 1] does not contain uncountably many nonempty disjoint open intervals. \square

Question 2.

Does there exist a continuous bijection f:[0,1]2[0,1]3f : [0, 1]^2\to [0, 1]^3?

The Mesh Size Characterisation of the Integral

Let α:[a,b]R\alpha : [a, b]\to \mathbb{R} be a nondecreasing function. Recall the following criterion for Riemann-Stieltjes integrability with respect to α\alpha :

Theorem 3.

Let f:[a,b]Rf : [a, b]\to \mathbb{R} be a bounded function. The following are equivalent:

  1. fR(α)f\in \mathscr{R} (\alpha ), i.e. infU(f,P,α)=supL(f,P,α),\inf U\left( f, \mathcal{P}, \alpha \right) = \sup L \left( f, \mathcal{P}, \alpha \right), where the sup\sup and inf\inf are taken over all partitions P\mathcal{P} of [a,b][a, b].
  2. For every ϵ>0\epsilon > 0, there exists a partition P\mathcal{P} such that U(f,P,α)L(f,P,α)<ϵ.U\left( f, \mathcal{P}, \alpha \right) - L \left( f, \mathcal{P}, \alpha \right) < \epsilon .

Let f:[a,b]Rf : [a, b]\to \mathbb{R} be continuous. Recall the following proof that fR(α)f \in \mathscr{R}(\alpha ):

  1. Since ff is continuous on a compact domain, it’s uniformly continuous. Thus, given ϵ>0\epsilon > 0, let δ>0\delta > 0 such that xy<δ\left\lvert x-y \right\rvert < \delta implies f(x)f(y)<ϵα(b)α(a) \left\lvert f(x) - f(y) \right\rvert < \frac{\epsilon }{ \alpha (b) - \alpha (a)}.
  2. Let P={a=x0<<xn=b}\mathcal{P} = \left\lbrace a = x_0 < \cdots < x_n =b \right\rbrace be any partition of [a,b][a, b] such that xixi1<δx_i - x _{i-1} < \delta for all ii. On each subinterval [xi1,xi]\left[ x _{i-1}, x_i \right], we have Mimi<ϵα(b)α(a)M_i - m_i < \frac{\epsilon }{\alpha (b) - \alpha (a)} by the extreme value theorem and by construction of δ\delta .
  3. Compute that U(f,P,α)L(f,P,α)=i=1n(Mimi)Δαi<ϵα(b)α(a)i=1nΔαi=ϵ.U(f, \mathcal{P}, \alpha ) - L (f, \mathcal{P}, \alpha ) = \sum _{i=1}^{n} \left( M_i - m_i \right) \Delta \alpha _i < \frac{\epsilon }{ \alpha (b) - \alpha (a)} \sum _{i=1}^{n} \Delta \alpha _i = \epsilon .

Note that this proof indicates a bit more than integrability: it doesn’t matter what the partition P\mathcal{P} is, as long as it’s “fine enough”, the upper and lower sums will be good approximations of the integral. This is in line with how we learned integration in calculus. This can be formalised as follows:

Definition 4.

Given a partition P={a=x0<<xn=b}\mathcal{P} = \left\lbrace a = x_0 < \cdots < x_n = b \right\rbrace of [a,b][a, b], we define the mesh size of P\mathcal{P} as μ(P):=max{xixi1:i=1,,n}.\mu (\mathcal{P}) := \max \left\lbrace x_i - x _{i-1} : i = 1,\ldots, n \right\rbrace. (This is sometimes denoted P\left\lVert \mathcal{P} \right\rVert, but I don’t like using the “norm” bars on something that’s not a vector space.)

Theorem 5.

If f:[a,b]Rf : [a, b]\to \mathbb{R} is continuous, then fR(α)f\in \mathscr{R}(\alpha ), and abf dα=limμ(P)0U(f,P,α)=limμ(P)0L(f,P,α).\int _{a}^{b} f\ d \alpha = \lim _{\mu (\mathcal{P}) \to 0} U \left( f, \mathcal{P}, \alpha \right) = \lim _{\mu (\mathcal{P}) \to 0} L \left( f, \mathcal{P}, \alpha \right).

Explicitly, this means that for all ϵ>0\epsilon > 0, there exists a δ>0\delta > 0 such that μ(P)<δ\mu (\mathcal{P}) < \delta implies abf dαU(f,P,α)<ϵ,\left\lvert \int _{a}^{b} f\ d \alpha - U \left( f, \mathcal{P}, \alpha \right)\right\rvert < \epsilon , and likewise for the lower sums.

This leads to the following alternative definition of integrability: if f:[a,b]Rf: [a, b]\to \mathbb{R} is bounded, fR(α)f\in \mathscr{R} (\alpha ) if the limits limμ(P)0U(f,P,α)=limμ(P)0L(f,P,α)\lim _{\mu (\mathcal{P}) \to 0} U \left( f, \mathcal{P}, \alpha \right) = \lim _{\mu \left( \mathcal{P}\right)\to 0 } L \left( f, \mathcal{P}, \alpha \right) and both exist. Unfortunately, this is not equivalent to the definition of integrability we stated at the beginning.

Exercise 6.

Give an example of f,αf, \alpha such that fR(α)f\in \mathscr{R} (\alpha ) (in the original sense) and limμ(P)0U(f,P,α)\lim _{\mu (\mathcal{P}) \to 0} U \left( f, \mathcal{P}, \alpha \right) does not even exist.

As your condition hopefully demonstrates, it’s when the discontinuities of α\alpha and ff overlap and “conspire together” that the mesh size formulation of integrability fails. When ff is continuous, this pathology doesn’t happen. When α\alpha is continuous, we again expect these pathologies not to happen either, but proving the equivalence of these two characterisations is more difficult.

Proposition 7.

Let α:[a,b]R\alpha : [a, b]\to \mathbb{R} be nondecreasing and continuous. Let f:[a,b]Rf : [a, b]\to \mathbb{R} be bounded. Then fR(α)f\in \mathscr{R}(\alpha ) if and only if limμ(P)0U(f,P,α)=limμ(P)0L(f,P,α)\lim _{\mu (\mathcal{P})\to 0} U(f, \mathcal{P}, \alpha ) = \lim _{\mu \left( \mathcal{P} \right) \to 0} L\left( f, \mathcal{P}, \alpha \right) exist and coincide. Additionally, both limits coincide with abf dα\int _{a}^{b}f\ d \alpha .

Note, by the way, that the limits as μ(P)0\mu (\mathcal{P}) \to 0 not existing or not coinciding does not imply that both ff and α\alpha share a point of discontinuity. Any fR(α)f\notin \mathscr{R}(\alpha ) exhibit this problem, so this strategy of proof is doomed.

To prove the proposition, we’ll first prove a lemma that states that upper and lower sums are “continuous” with respect to P\mathcal{P} when α\alpha is continuous. Specifically, when you add a point to a partition, the upper and lower sums do not change by much.

Lemma 8.

Let f:[a,b]Rf:[a,b ]\to \mathbb{R} be a bounded function, α:[a,b]R\alpha : [a, b]\to \mathbb{R} nondecreasing (not necessarily continuous), and P={a=t0<<tn=b}\mathcal{P} = \left\lbrace a=t_0 < \cdots < t_n = b \right\rbrace.

Let ti1<t<tit _{i-1} < t_* < t_i, and define P=P{t}\mathcal{P}_* = \mathcal{P} \cup \left\lbrace t_* \right\rbrace, i.e. the partition formed by adding a point tt_* to P\mathcal{P}. Then U(f,P,α)U(f,P,α)(sup[xi1,xi]finf[xi1,xi]f)(α(xi)α(xi1)).\left\lvert U\left( f, \mathcal{P}, \alpha \right) - U \left( f, \mathcal{P}_*, \alpha \right) \right\rvert \leq \left( \sup _{\left[ x _{i-1}, x_i \right]} f - \inf _{\left[ x _{i-1}, x_i \right]} f \right) \left( \alpha \left( x_i \right) - \alpha \left( x _{i-1} \right) \right). An identical inequality holds when UU is replaced by LL.

To prove this key lemma, expand the two sums and compare.

Proof of Proposition

The “if” direction is clear: take any partition of sufficiently small mesh size, and the upper and lower sums will be close to their common limits, hence also each other.

We will focus on the “only if” part and prove that if fR(α)f\in \mathscr{R}(\alpha ), then limμ(P)0U(f,P,α)=abf dα.\lim _{\mu (\mathcal{P})\to 0} U\left( f, \mathcal{P}, \alpha \right) = \int _{a}^{b} f\ d \alpha. An identical argument applies to the limit of the lower sums.

When ff is a constant function, both claims are trivially true, so we’ll assume ff is not constant.

Fix any ϵ0>0\epsilon _0 > 0. There exists a partition P\mathcal{P} such that U(f,P,α)<abf dα+ϵ0U \left( f, \mathcal{P}, \alpha \right) < \int _{a}^{b} f\ d \alpha + \epsilon _0. Our goal is to determine a δ>0\delta > 0 such that whenever μ(P)<δ\mu \left( \mathcal{P}’ \right) < \delta , U(f,P,α)U \left( f, \mathcal{P}’, \alpha \right) is within ϵ0\epsilon _0 of the integral (maybe with an extra constant factor).

To do so, we’ll leverage the lemma and argue that U(f,P,α)U \left( f,\mathcal{P}’, \alpha \right) should be close to U(f,PP,α)U(f,P,α)U \left( f, \mathcal{P}’\cup \mathcal{P}, \alpha \right) \leq U \left( f, \mathcal{P}, \alpha \right). We can do this iteratively, adding one point at a time to P\mathcal{P}’, leveraging the continuity of α\alpha to make the difference at each step very small.

Let ϵ1>0\epsilon _1 > 0, to be chosen later. Since α\alpha is continuous on a compact domain, it’s uniformly continuous. Thus, there exists δ>0\delta > 0 such that xy<δ\left\lvert x-y \right\rvert < \delta implies α(x)α(y)<ϵ1\left\lvert \alpha (x) - \alpha (y) \right\rvert < \epsilon _1. Then, whenever P\mathcal{P}’ is any partition with μ(P)<δ\mu \left( \mathcal{P}’ \right) < \delta , one has U(f,P,α)U(f,P{t},α)(supfinff)(α(xi)α(xi1))<(supfinff)ϵ1.\begin{align*} U\left( f, \mathcal{P}’, \alpha \right) - U \left( f, \mathcal{P}’ \cup \left\lbrace t_* \right\rbrace , \alpha \right) &\leq \left( \sup f - \inf f \right) \cdot \left( \alpha \left( x_i \right)- \alpha \left( x _{i-1} \right)\right) \\ &< \left( \sup f - \inf f \right) \epsilon _1 . \end{align*} Proceeding by induction, we have that U(f,P,α)U(f,PP,α)<(supfinff)ϵ1P,U \left( f, \mathcal{P}’, \alpha \right) - U \left( f, \mathcal{P}’ \cup \mathcal{P}, \alpha \right) < \left( \sup f - \inf f \right) \epsilon _1 \left\lvert \mathcal{P} \right\rvert, where P\left\lvert \mathcal{P} \right\rvert is the number of points in the partition P\mathcal{P} .

Setting ϵ1=ϵ0P(supfinff)\epsilon _1 = \frac{\epsilon _0}{ \left\lvert \mathcal{P} \right\rvert \left( \sup f - \inf f \right)}, we find that U(f,P,α)<U(f,PP,α)+ϵ0U(f,P,α)+ϵ0<abf dα+2ϵ0.\begin{align*} U \left( f, \mathcal{P}’, \alpha \right) &< U \left( f, \mathcal{P}’\cup \mathcal{P} , \alpha \right) + \epsilon_0 \\ &\leq U \left( f, \mathcal{P}, \alpha \right) + \epsilon _0 \\ &< \int _{a}^{b} f\ d \alpha + 2\epsilon _0. \end{align*} This concludes the proof. \square

Think carefully about the order in which everything was constructed and chosen:

  1. ϵ0>0\epsilon_0 > 0 was taken to be arbitrary.
  2. P\mathcal{P} was determined based on ϵ0\epsilon _0.
  3. ϵ1>0\epsilon _1 > 0 was determined based on P\mathcal{P}, ff, and ϵ0\epsilon _0.
  4. Finally, δ>0\delta > 0 was determined based on ϵ1\epsilon _1 and α\alpha .

The lemma is suggestive of why we need either ff or α\alpha to be continuous in order to make the above statement: when we relate fine partitions to “coarse” partitions, we need to make sure the two are close. When both ff and α\alpha are simultaneously discontinuous, it’s possible that this doesn’t happen.

Question 9.

Let f:[a,b]Rf: [a, b]\to \mathbb{R} and α:[a,b]R\alpha : [a, b]\to \mathbb{R} nondecreasing. Suppose at every x[a,b]x\in [a, b] either ff or α\alpha is continuous. Is it true that if fR(α)f\in \mathscr{R}(\alpha ), limμ(P)0U(f,P,α)=limμ(P)0L(f,P,α)=abf dα?\lim _{\mu (\mathcal{P})\to 0} U \left( f, \mathcal{P}, \alpha \right) = \lim _{ \mu \left( \mathcal{P} \right) \to 0} L \left( f, \mathcal{P}, \alpha \right) = \int _{a}^{b} f\ d \alpha ?