7. Week 6: Integrals and Meshes
≪ 6. Week 5: Integrals and Limits | Table of Contents | 8. Week 7: Metrising the Space of Continuous Functions ≫I suppose I should allot some time for midterm review things…
One problem from the practise midterm I’d like to highlight is the following:
Problem 1.
Show that if \(f : [0, 1]\to [0, 1]^2\) is continuous and onto, then it cannot be one-to-one.
In other words, there is no continuous bijection \(f: [0, 1]\to \left[ 0,1 \right]^2\). A highlight of this problem is the following fact: if \(f\) is continuous and bijective on a compact domain, its inverse is also continuous. This is not true of non-compact domains — make sure you know some counterexamples.
Since continuous functions preserve topological features in one direction or another, continuous bijections with continuous inverses (known as homeomorphsims) identify topological features of the domain and range: open sets and closed sets are “paired up”, connectedness for one implies connectedness of the other, etc. With this in mind, one may search for topological features of \([0, 1]\) that are not shared by \([0, 1]^2\) to get a contradiction.
Solution 1
This solution argues that the unit square and unit interval are topologically distinguishable, as the latter becomes disconnected upon removing a point while the former remains connected.
Here’s another cute solution that shows that the unit square and unit cube are topologically distinguishable by size:
Solution 2
Question 2.
Does there exist a continuous bijection \(f : [0, 1]^2\to [0, 1]^3\)?
The Mesh Size Characterisation of the Integral
Let \(\alpha : [a, b]\to \mathbb{R}\) be a nondecreasing function. Recall the following criterion for Riemann-Stieltjes integrability with respect to \(\alpha \):
Theorem 3.
Let \(f : [a, b]\to \mathbb{R}\) be a bounded function. The following are equivalent:
- \(f\in \mathscr{R} (\alpha )\), i.e. \[\inf U\left( f, \mathcal{P}, \alpha \right) = \sup L \left( f, \mathcal{P}, \alpha \right),\] where the \(\sup\) and \(\inf\) are taken over all partitions \(\mathcal{P}\) of \([a, b]\).
- For every \(\epsilon > 0\), there exists a partition \(\mathcal{P}\) such that \[U\left( f, \mathcal{P}, \alpha \right) - L \left( f, \mathcal{P}, \alpha \right) < \epsilon .\]
Let \(f : [a, b]\to \mathbb{R}\) be continuous. Recall the following proof that \(f \in \mathscr{R}(\alpha )\):
- Since \(f\) is continuous on a compact domain, it’s uniformly continuous. Thus, given \(\epsilon > 0\), let \(\delta > 0\) such that \(\left\lvert x-y \right\rvert < \delta \) implies \( \left\lvert f(x) - f(y) \right\rvert < \frac{\epsilon }{ \alpha (b) - \alpha (a)}\).
- Let \(\mathcal{P} = \left\lbrace a = x_0 < \cdots < x_n =b \right\rbrace\) be any partition of \([a, b]\) such that \(x_i - x _{i-1} < \delta \) for all \(i\). On each subinterval \(\left[ x _{i-1}, x_i \right]\), we have \(M_i - m_i < \frac{\epsilon }{\alpha (b) - \alpha (a)}\) by the extreme value theorem and by construction of \(\delta \).
- Compute that \[U(f, \mathcal{P}, \alpha ) - L (f, \mathcal{P}, \alpha ) = \sum _{i=1}^{n} \left( M_i - m_i \right) \Delta \alpha _i < \frac{\epsilon }{ \alpha (b) - \alpha (a)} \sum _{i=1}^{n} \Delta \alpha _i = \epsilon . \]
Note that this proof indicates a bit more than integrability: it doesn’t matter what the partition \(\mathcal{P}\) is, as long as it’s “fine enough”, the upper and lower sums will be good approximations of the integral. This is in line with how we learned integration in calculus. This can be formalised as follows:
Definition 4.
Given a partition \(\mathcal{P} = \left\lbrace a = x_0 < \cdots < x_n = b \right\rbrace\) of \([a, b]\), we define the mesh size of \(\mathcal{P}\) as \[\mu (\mathcal{P}) := \max \left\lbrace x_i - x _{i-1} : i = 1,\ldots, n \right\rbrace.\] (This is sometimes denoted \(\left\lVert \mathcal{P} \right\rVert\), but I don’t like using the “norm” bars on something that’s not a vector space.)
Theorem 5.
If \(f : [a, b]\to \mathbb{R}\) is continuous, then \(f\in \mathscr{R}(\alpha )\), and \[\int _{a}^{b} f\ d \alpha = \lim _{\mu (\mathcal{P}) \to 0} U \left( f, \mathcal{P}, \alpha \right) = \lim _{\mu (\mathcal{P}) \to 0} L \left( f, \mathcal{P}, \alpha \right). \]
Explicitly, this means that for all \(\epsilon > 0\), there exists a \(\delta > 0\) such that \(\mu (\mathcal{P}) < \delta \) implies \[\left\lvert \int _{a}^{b} f\ d \alpha - U \left( f, \mathcal{P}, \alpha \right)\right\rvert < \epsilon ,\] and likewise for the lower sums.
This leads to the following alternative definition of integrability: if \(f: [a, b]\to \mathbb{R}\) is bounded, \(f\in \mathscr{R} (\alpha )\) if the limits \[\lim _{\mu (\mathcal{P}) \to 0} U \left( f, \mathcal{P}, \alpha \right) = \lim _{\mu \left( \mathcal{P}\right)\to 0 } L \left( f, \mathcal{P}, \alpha \right)\] and both exist. Unfortunately, this is not equivalent to the definition of integrability we stated at the beginning.
Exercise 6.
Give an example of \(f, \alpha\) such that \(f\in \mathscr{R} (\alpha )\) (in the original sense) and \[\lim _{\mu (\mathcal{P}) \to 0} U \left( f, \mathcal{P}, \alpha \right) \] does not even exist.
As your condition hopefully demonstrates, it’s when the discontinuities of \(\alpha \) and \(f\) overlap and “conspire together” that the mesh size formulation of integrability fails. When \(f\) is continuous, this pathology doesn’t happen. When \(\alpha \) is continuous, we again expect these pathologies not to happen either, but proving the equivalence of these two characterisations is more difficult.
Proposition 7.
Let \(\alpha : [a, b]\to \mathbb{R}\) be nondecreasing and continuous. Let \(f : [a, b]\to \mathbb{R}\) be bounded. Then \(f\in \mathscr{R}(\alpha )\) if and only if \(\lim _{\mu (\mathcal{P})\to 0} U(f, \mathcal{P}, \alpha ) = \lim _{\mu \left( \mathcal{P} \right) \to 0} L\left( f, \mathcal{P}, \alpha \right)\) exist and coincide. Additionally, both limits coincide with \(\int _{a}^{b}f\ d \alpha \).
Note, by the way, that the limits as \(\mu (\mathcal{P}) \to 0\) not existing or not coinciding does not imply that both \(f\) and \(\alpha \) share a point of discontinuity. Any \(f\notin \mathscr{R}(\alpha )\) exhibit this problem, so this strategy of proof is doomed.
To prove the proposition, we’ll first prove a lemma that states that upper and lower sums are “continuous” with respect to \(\mathcal{P}\) when \(\alpha \) is continuous. Specifically, when you add a point to a partition, the upper and lower sums do not change by much.
Lemma 8.
Let \(f:[a,b ]\to \mathbb{R}\) be a bounded function, \(\alpha : [a, b]\to \mathbb{R}\) nondecreasing (not necessarily continuous), and \(\mathcal{P} = \left\lbrace a=t_0 < \cdots < t_n = b \right\rbrace\).
Let \(t _{i-1} < t_* < t_i\), and define \(\mathcal{P}_* = \mathcal{P} \cup \left\lbrace t_* \right\rbrace\), i.e. the partition formed by adding a point \(t_*\) to \(\mathcal{P}\). Then \[\left\lvert U\left( f, \mathcal{P}, \alpha \right) - U \left( f, \mathcal{P}_*, \alpha \right) \right\rvert \leq \left( \sup _{\left[ x _{i-1}, x_i \right]} f - \inf _{\left[ x _{i-1}, x_i \right]} f \right) \left( \alpha \left( x_i \right) - \alpha \left( x _{i-1} \right) \right).\] An identical inequality holds when \(U\) is replaced by \(L\).
To prove this key lemma, expand the two sums and compare.
Proof of Proposition
The “if” direction is clear: take any partition of sufficiently small mesh size, and the upper and lower sums will be close to their common limits, hence also each other.
We will focus on the “only if” part and prove that if \(f\in \mathscr{R}(\alpha )\), then \[\lim _{\mu (\mathcal{P})\to 0} U\left( f, \mathcal{P}, \alpha \right) = \int _{a}^{b} f\ d \alpha.\] An identical argument applies to the limit of the lower sums.
When \(f\) is a constant function, both claims are trivially true, so we’ll assume \(f\) is not constant.
Fix any \(\epsilon _0 > 0\). There exists a partition \(\mathcal{P}\) such that \(U \left( f, \mathcal{P}, \alpha \right) < \int _{a}^{b} f\ d \alpha + \epsilon _0\). Our goal is to determine a \(\delta > 0\) such that whenever \(\mu \left( \mathcal{P}’ \right) < \delta \), \(U \left( f, \mathcal{P}’, \alpha \right)\) is within \(\epsilon _0\) of the integral (maybe with an extra constant factor).
To do so, we’ll leverage the lemma and argue that \(U \left( f,\mathcal{P}’, \alpha \right)\) should be close to \(U \left( f, \mathcal{P}’\cup \mathcal{P}, \alpha \right) \leq U \left( f, \mathcal{P}, \alpha \right)\). We can do this iteratively, adding one point at a time to \(\mathcal{P}’\), leveraging the continuity of \(\alpha \) to make the difference at each step very small.
Let \(\epsilon _1 > 0\), to be chosen later. Since \(\alpha \) is continuous on a compact domain, it’s uniformly continuous. Thus, there exists \(\delta > 0\) such that \(\left\lvert x-y \right\rvert < \delta \) implies \(\left\lvert \alpha (x) - \alpha (y) \right\rvert < \epsilon _1\). Then, whenever \(\mathcal{P}’\) is any partition with \(\mu \left( \mathcal{P}’ \right) < \delta \), one has \[\begin{align*} U\left( f, \mathcal{P}’, \alpha \right) - U \left( f, \mathcal{P}’ \cup \left\lbrace t_* \right\rbrace , \alpha \right) &\leq \left( \sup f - \inf f \right) \cdot \left( \alpha \left( x_i \right)- \alpha \left( x _{i-1} \right)\right) \\ &< \left( \sup f - \inf f \right) \epsilon _1 . \end{align*}\] Proceeding by induction, we have that \[U \left( f, \mathcal{P}’, \alpha \right) - U \left( f, \mathcal{P}’ \cup \mathcal{P}, \alpha \right) < \left( \sup f - \inf f \right) \epsilon _1 \left\lvert \mathcal{P} \right\rvert,\] where \(\left\lvert \mathcal{P} \right\rvert\) is the number of points in the partition \(\mathcal{P} \).
Setting \(\epsilon _1 = \frac{\epsilon _0}{ \left\lvert \mathcal{P} \right\rvert \left( \sup f - \inf f \right)}\), we find that \[\begin{align*} U \left( f, \mathcal{P}’, \alpha \right) &< U \left( f, \mathcal{P}’\cup \mathcal{P} , \alpha \right) + \epsilon_0 \\ &\leq U \left( f, \mathcal{P}, \alpha \right) + \epsilon _0 \\ &< \int _{a}^{b} f\ d \alpha + 2\epsilon _0. \end{align*}\] This concludes the proof. \(\square\)
Think carefully about the order in which everything was constructed and chosen:
- \(\epsilon_0 > 0\) was taken to be arbitrary.
- \(\mathcal{P}\) was determined based on \(\epsilon _0\).
- \(\epsilon _1 > 0\) was determined based on \(\mathcal{P}\), \(f\), and \(\epsilon _0\).
- Finally, \(\delta > 0\) was determined based on \(\epsilon _1\) and \(\alpha \).
The lemma is suggestive of why we need either \(f\) or \(\alpha \) to be continuous in order to make the above statement: when we relate fine partitions to “coarse” partitions, we need to make sure the two are close. When both \(f\) and \(\alpha \) are simultaneously discontinuous, it’s possible that this doesn’t happen.
Question 9.
Let \(f: [a, b]\to \mathbb{R}\) and \(\alpha : [a, b]\to \mathbb{R}\) nondecreasing. Suppose at every \(x\in [a, b]\) either \(f\) or \(\alpha \) is continuous. Is it true that if \(f\in \mathscr{R}(\alpha )\), \[\lim _{\mu (\mathcal{P})\to 0} U \left( f, \mathcal{P}, \alpha \right) = \lim _{ \mu \left( \mathcal{P} \right) \to 0} L \left( f, \mathcal{P}, \alpha \right) = \int _{a}^{b} f\ d \alpha ?\]