100 Posts

Meaningless numerical milestones are a good a reason as any for an indulgent post. Today, I will discuss some facts from this blog which you might not otherwise know about. It will be in the form of an (mercifully short) interview with myself.

When did you start this blog?

I originally started it when I went to the IAS for a special year in 2010-2011, but I never ended up making the blog public at that time. The irreverence has been toned down for the current version. Sample post from the IAS: “Who wears short shorts? Deligne wears short shorts!”

What topics would you like to blog about in the future?

No promises, but here are some thoughts:

  • How does an NSF panel work?
  • What are letters of recommendation really like?
  • Who wore it better: piano v. orchestral arrangements.
  • Langlands versus the world.
  • Book reviews: Frenkel, Ellenberg, Harris.
  • India’s greatest mathematician: Harish-Chandra.
  • The top 1%: class and privilege in academia.

Does that mean you are planning to have less math in the future?

No, the math posts are not really planned in advance, they are just what I happen to be thinking about at the time. The math posts are really the main (if not exclusive) focus of this blog. Although, as one of my graduate students once remarked: “I though your post on swans was your best post ever.” Yeah, thanks for that, I’m working hard to bring you occasional insights into the vast edifice of algebraic number theory, and you like the guy who can wobble around to Saint-Saëns.

What can your readers do for you?

More audience participation! A lot of what I write is speculative, so please don’t refrain from giving your partially formed thoughts in the comments. As the readership of this blog went up, the number of comments has gone down. I think I understand this phenomenon, especially when it comes to math posts. The worst thing that can happen, however, is that you say something completely ridiculous in front of a bunch of senior number theorists. But, if you are not occasionally saying stupid things in front of smart people, then you are doing it wrong.

Does the audience have any questions?

I’m going to take audience questions in the form of random search terms which led to this blog. Perhaps those who came here were disappointed with their search results at the time, but perhaps if they search again this post will provide some answers.

  • 아리조나 윈터스쿨: I recommend going here.
  • review my paper: No thanks!
  • how can i tell how many pages my paper is: Form a bijection with one of the sets defined in Part II of this volume.
  • paskunas conference 2013: Sounds good! Alas, I was not invited.
  • maximal unramified abelian extension of a local field: It’s procyclic, and is generated by roots of unity of order prime to the residue characteristic. Assuming, of course, your local field is of mixed characteristic, which all the interesting ones are.
  • bush is the messiah: This seems to me to be an unverifiable claim.
  • galois reprsentations matrix: Unfortunately, no one can be told what the Matrix is. You have to see it for yourself.
  • honorarium + editor + elsevier: $60 for any processed paper. It is taxable income, however.
  • galoisrepresentations+blog+who?: It’s me!
  • bach mit pedal schiff: Bach without pedal, surely?
  • how do i find out how my paper is being reviewed: Oh, I can tell you that. If you are lucky, the reviewer has completely forgotten about it. Otherwise, the reviewer is currently cursing you for generally ruining his or her life.
  • compute the average rate of change from x = -10 to x = 10. enter your answer as a fraction in simplest terms using a slash ( / ). do not include spaces in your answer: (f(10)-f(-10))/20.
  • local even galois representations: I presume you are asking about representations of a local Galois group which are even. For this to make sense, you should probably talk about Galois representations at the infinite prime, that is, representations of \mathrm{Gal}(\mathbf{C}/\mathbf{R}) on which complex conjugation acts trivially. Let me classify those for you: they are all trivial!
  • peter scholze gowers: I don’t think they wrote any joint papers.
  • ila varma grothendieck: Same answer as above.
  • ila varma galois: Same answer as above.
  • danny calegari brilliant; jacob lurie genius: self-googling, I imagine.
  • joel specters math thesis: I shall link to it on this blog after it has been written.
  • representation galois change of characteristic: I assume you are asking about the p-q-switch? It is now ubiquitous, but there are plently of good expositions available online.
  • affirmative motives: I guess these are motives which just need a little support. For only a $5 donation to this blog, I will help turn a poor motive into a bold and effective one, simply by twisting.
  • kevin buzzard chess: I’d be surprised if he had the time. I fancy my chances.
  • xxx agol in school: Personally, I rate way Agol schooled 3-manifolds as [T18+], suitable for topologists of ages 18 and above.
  • the math behind a waffle: Aah, sorry about that. I’m more an expert in the waffle behind the math. On the other hand, you can learn about the chemistry of waffles here.
Posted in Waffle | Tagged , , , , , , , , , , , , | 2 Comments

The decline and fall of Publications Mathématiques de l’lHÉS

I want to discuss the decline of a once great journal.

How did IHES go from this:

and this:

to this:

It is a sad state of affairs. To be clear, I am talking about the typesetting here. The old IHES typeface was a delicious rich dark Baskerville (according to this source). The math displays were always a little wonky, but that was more than compensated by the charm of the text. Now? Still the same wonky formulas, and an inferior electronic typeface that is the equivalent of replacing some find fine hand-made Belgian chocolate with a Cadbury flake bar that’s been left out in the sun too long. This is very sad. How good was the typeface? It was so good, some people submitted their papers to IHES precisely for the beauty of the typesetting, even though it doesn’t look like that anymore. I think I even prefer \usepackage{baskervald} to the SMF version.

Posted in Mathematics, Waffle | Tagged , , , , , | 2 Comments

Is Serre’s conjecture still open?

The conjecture in this paper has indeed been proven. But that isn’t the entire story. Serre was fully aware of Katz modular forms of weight one. However, Serre was too timid was prudently conservative and made his conjecture only for weights k(\rho) \ge 2.

Well, perhaps I am overstating the case; we may as well quote Serre himself here:

Au lieu de définir les formes paraboliques à coefficients dans \mathbf{F}_p par réduction à partir de la caractéristique 0, comme nous l’avons fait, nous aurions pu utiliser la définition de Katz [23], qui conduit à un espace a priori plus grand … Il serait également intéressant d’étudier de ce point de vue le cas k =1, que nous avons exclu jusqu’ici ; peut-être la définition de Katz donne-t-elle alors beaucoup plus de représentations \rho_f?

Instead of defining the cusp forms with coefficients in \mathbf{F}_p by reduction from characteristic 0, as we did, we could have used the definition of Katz [23], which leads, a priori, to a larger space … It would also be interesting to study from this viewpoint the case k = 1 we have ruled out so far; Perhaps Katz’s definition gives more representations \rho_f?

In his Inventiones paper on the weight in Serre’s conjecture, Edixhoven does give the correct formulation where one allows k(\rho) = 1 and correspondingly also Katz modular forms. The bridge between the two conjectures essentially consists of two further conjectures: first, that Galois representations associated to residual weight one forms are unramified, and second, unramified modular representations come from weight one.

The first progress on this problem was actually pre-Edixhoven, namely, Gross’ companion form paper in Duke. (I have four copies of that paper on my laptop — two called GrossDuke.pdf, one called GrossCompanion.pdf, and one simply called Gross.pdf — does anyone else have scatterbrained naming systems for downloaded pdf files?) Gross deals with both directions in the case when \rho(\mathrm{Frob}_p) has distinct eigenvalues (I guess the assumption in the direction weight one \Rightarrow unramified is that the eigenvalues of X^2 - a_p X + \chi(p) are distinct). Of course, there was the famous matter of the “unchecked compatibilities,” (I’m not one for checking compatibilities myself, to be honest) which have certainly been resolved at this point (does Bryden Cais do this in his thesis? I think he does) The next step was the work of Coleman-Voloch, who deal with the remaining case under the additional assumption that p is odd. So this leaves the case p = 2. Somewhat more recently, Gabor Wiese showed that weight one Katz modular forms do give rise to unramified representations without any assumptions. So this leaves:

Serre’s Conjecture [Edixhoven formulation]: Let \rho: G_{\mathbf{Q}} \rightarrow \mathrm{GL}_2(\mathbf{F}_q) be an absolutely irreducible modular representation of characteristic 2. Assume that \rho is unramified at 2 and that the semi-simplification of \rho(\mathrm{Frob}_2) is scalar. Then \rho is modular of weight one.

Now Wiese also explicitly dealt with the case when \rho was (projectively) dihedral, so we can assume that \rho is absolutely irreducible with non-dihedral image. Suppose that the Serre level is N. Let \mathfrak{m} denote the maximal ideal of the weight two Hecke algebra which does not include the Hecke operator T_2. Let’s imagine we are working with Hecke algebras over some sufficiently large extension \mathcal{O}_E of \mathbf{Z}_2 with residue field k so to include enough Frobenius eigenvalues. It suffices to prove that

\dim_{\mathbf{T}/\mathfrak{m}} H^0(X_1(N)/k,\omega^{\otimes 2})[\mathfrak{m}] \ge 2,

because then we will have found two modular forms f and g which are Hecke eigenvalues for all Hecke operators away from p, and by the q-expansion principle, some linear combination of f and g will have to be the square of the desired weight one form.

Let R_{\mathrm{loc}} denote the Kisin deformation ring at two for \rho | D_2 for the decomposition group D_2 at 2, (this is just the ordinary deformation ring, in the sense of Geraghty). Let R^{\dagger}_{\mathrm{loc}} denote the augmented deformation which also includes the crystalline Frobenius eigenvalue T_2 (or, to put it differently, the eigenvalue of Frobenius on the “unramified quotient” U_2, where the former is meant in a sense that can and does make sense integrally. By Hensel’s lemma, both pieces of added data are equivalent.) Now one uses the modularity machine, which is OK by Khare-Wintenberger for p =2 because we are in the non-dihedral setting. Let’s patch the Betti cohomology of modular curves following KW, except now working with the modified global Kisin deformation ring R^{\dagger} which remembers crystalline Frobenius, and the full Hecke algebra \mathbf{T}^{\dagger} which includes T_2. Now R^{\dagger}_{\mathrm{loc}} is a domain with formally smooth generic fibre (this is proved in Snowden’s paper — the ring in question is denoted \widetilde{R}_3 in ibid.). Hence, by Kisin-Khare-Wintenberger method, we obtain an isomorphism R^{\dagger}[1/\varpi] = \mathbf{T}^{\dagger}[1/\varpi]. However, because R^{\dagger}_{\mathrm{loc}} is in addition Cohen-Macaulay, this can be upgraded to an R^{\dagger} = \mathbf{T}^{\dagger} theorem. (It might be cleaner to instead patch coherent cohomology — multiplicity one [which always holds with T_2 included] implies that the patched module is free of rank one, which makes it easy to deduce the integral R^{\dagger} = \mathbf{T}^{\dagger} theorem.) By considering the action of \mathbf{T}^{\dagger} on coherent cohomology, however, our multiplicity one assumption allows us to deduce by Nakayama that \mathbf{T} = \mathbf{T}^{\dagger} (more trivially: the space of modular forms with coefficients in E/\mathcal{O}_E with \mathcal{O}_E/\varpi = k is co-free of rank one over both of these rings) and so R \rightarrow R^{\dagger} = \mathbf{T}^{\dagger} = \mathbf{T} is surjective. However, there cannot be a surjection R \rightarrow R^{\dagger}, because there is a map R^{\dagger} \rightarrow k[\epsilon]/\epsilon^2 which is trivial as a Galois deformation but is non-trivial for (the Galois avatar of) T_2. For example, in the trivial case, this just amounts to saying that the trivial representation of G_{\mathbf{Q}_2} to \mathrm{GL}_2(k[\epsilon]/\epsilon^2) can be thought of as “ordinary with eigenvalue 1 + \epsilon.” It follows that multiplicity one without T_2 cannot hold.

Thus Serre’s conjecture is true!

Posted in Mathematics | Tagged , , , , , , , , , , , , , , | 2 Comments

A flat white, please

Thanks to the New York Times, I can now order a flat white at my local cafe instead of an 8 ounce latte. Culture is coming to America, my friends!

Posted in Food, Waffle | Tagged , , , , | Leave a comment

Horizontal Vanishing Conjectures.

Let F be a number field, and let \mathbf{G} be a reductive group over F, and let \Gamma be a congruence subgroup of \mathbf{G}(\mathcal{O}_F). I can hear BC objecting that this doesn’t make sense without extra choices; if you have such an objection, please make such choices. Matt and I have made various conjectures concerning the vanishing of the completed cohomology groups \widetilde{H}^{n} in the range n > q_0, where q_0 has been defined for all time by Borel and Wallach. (And what is q, you ask? Well, having just consulted [BW] by downloading a pirated djvu copy, I can tell you that 2q = \mathrm{dim}(G/K) [4.3, p.67]. What’s that, you say — q isn’t even always an integer? Nope!) Several cases of this conjecture were proved by Peter (in particular, in the Shimura variety context), but the general conjecture seems quite hard (not that the Shimura variety case was a cakewalk!). For example, when G = \mathrm{GL}(1), then q_0 = 0 and the conjecture is equivalent to Leopoldt’s conjecture. To remind you, one way of stating Leopoldt’s conjecture is that the profinite topology on \mathcal{O}^{\times}_F \times \mathbf{Z}_p coincides with the topology coming from the p-adic topology — that is, units are close if they are close modulo powers of p. In contrast, one can ask for the weaker statement that that the profinite topology on \mathcal{O}^{\times}_F \times \mathbf{Z}_p coincides with the congruence topology, namely, the topology coming from looking at units modulo N for any ideal N. This turns out to be unconditionally true and not too difficult, although it is not quite as obvious as it may seem (the same can be said of LC). It motivates, however, the following conjecture:

Conjecture (Horizontal Vanishing) Let n > q_0. Then the following direct limit vanishes

\displaystyle{\lim_{K} H^n(X(K),\mathbf{F}_p) = 0}

as K ranges over all compact open subgroups of \mathbf{G}(\mathbf{A}^f_F).

There is an equivalent formulation of this conjecture in terms of group cohomology for arithmetic lattices. Because the conjecture is known for \mathrm{GL}(1), one can also pass easily enough between \mathrm{SL} and \mathrm{GL}. For example, for \mathrm{SL}_N(\mathbf{Z}) and n > 2 it has the following formulation: Any cohomology class in H^n(\mathrm{SL}_N(\mathbf{Z}),M) for a finite discrete module M capitulates in some congruence subgroup, providing that

\displaystyle{n > \left\lfloor \frac{N^2}{4} \right\rfloor}.

This vanishing is related to the concept of virtual cohomological dimension. The virtual cohomological dimension of a group G is the smallest integer m such that there exists a finite index subgroup H \subset G such that every cohomology class in degree > m capitulates in H. The notion being considered here is what one gets by reversing the quantifiers — one only insists that the classes capitulate in smaller and smaller groups (in addition, we insist that H is a congruence subgroup, although that is not too restrictive when the rank is \ge 2). There is a trivial bound m \ge q_0, but this bound is not at all sharp. Since this seems an a priori interesting notion, let’s define it:

Definition: pro-virtual cohomological dimension: Let G be a group, and let p be a prime. Say that \mathrm{pvcd}_p(G) < m if, for every discrete G-module M annihilated by p, and every cohomology class [c] \in H^n(G,M) for some n \ge m, there exists a finite subgroup H so that the restriction of [c] to H^n(H,M) vanishes. Say that G has pro-virtual cohomological dimension < m if \mathrm{pvcd}_p(G) < m for all p.

I have nothing profound to say about whether this concept is relevant beyond the example at hand. As you can see for \mathrm{SL}_N(\mathbf{Z}), the virtual cohomological dimension and pro-virtual cohomological dimension are conjecturally quite different, the latter being given conjecturally by the formula above (at least when N > 2), and the former by \displaystyle{\binom{N}{2}}.

I wanted to remark in this post that the Horizontal Vanishing conjecture is, at least after localization at a non-Eisenstein maximal ideal \mathfrak{m}, a consequence of modularity lifting results (in the spirit of [CG]). Namely, the entire point of that method is that the patched complex has cohomology concentrated in a single degree (q_0), which amounts to saying that cohomology classes in H^{q_0 + i}(X(K),\mathbf{F}_p) can be annihilated after passing to some auxiliary level coming from some choice of Taylor-Wiles primes. Now many aspects of this argument are still conditional (note that to annihilate classes of deep p-power level, one would need corresponding local-global compatibility relating Galois representations associated to torsion classes to quotients of Kisin deformation rings, at the very least), but perhaps it is a less hopeless task than trying to prove Leopoldt’s conjecture.

It’s instructive to consider what is possibly the simplest case of this conjecture beyond Shimura varieties, namely, \mathrm{GL}(2) over an imaginary quadratic field (here q_0 = 1, so the claim is that one can kill classes in H^2). Here at least one doesn’t have to worry about vanishing of cohomology outside the indicated range. Local-global compatibility is still a problem, but one possibly way to get around this is to work at all p-power levels at once, namely, to patch the completed cohomology groups. (Matt, Toby, and I chatted over roast duck at Sun Wah what patching completed cohomology for general groups should look like.) Since one certainly has Galois representations, one gets “for free” the fact that the patched modules are modules over the appropriate power series ring of the local deformation ring. On the other hand, as Matt cautioned at the ‘Pig, it is no longer so easy to do naive arguments with codimensions, because the patched objects are not finitely generated over the ring of diamond operators, but only over a non-commutative group algebra, which leads into questions relating to the size of the corresponding p-adic representations, which leads back to questions concerning local-global compatibility in p-adic local Langlands.

I wonder, however, if there are any softer arguments in any special cases.

Posted in Mathematics | Tagged , , , , , , , , , | 2 Comments

The Abelian House is not closed

Today I will talk about \displaystyle{\frac{97 + 26 \sqrt{13}}{27} = 7.064604\ldots}

For an algebraic integer \alpha, the house \overline{|\alpha|} is the absolute value of the largest conjugate of \alpha. Kronecker proved the following:

  1. If \overline{|\alpha|} \le 1, then either \alpha = 0 or a root of unity.
  2. If \overline{|\alpha|} \le 2 and \alpha lives in a CM field, then \overline{|\alpha|} = 2 \cos \pi/N.

The first claim is well known. The second claim follows from the first: the CM condition implies that the conjugates of the squares of the absolute value are the squares of the absolute values of the conjugates. Hence, if \zeta^2 + \zeta^{-2} = \overline{|\alpha|}^2 - 2, then \zeta must be a root of unity by part one. On the other hand, beyond these two results, the respective values of \overline{|\alpha|} are dense in [1,\infty) (general case) and [2,\infty) (CM case). There are a number of ways to modify this problem. One way is to replace the largest conjugate \overline{|\alpha|} by the d-power mean of the absolute values:

\displaystyle{M_n(\alpha) = \left(\frac{1}{[{\mathbf{Q}}(\alpha):{\mathbf{Q}}]} \sum |\sigma \alpha|^n \right)^{1/n}}.

For such a construction, it makes the most sense to assume either that \alpha is totally real or lives in a CM field, so that |\sigma \alpha| = \sigma |\alpha|. For example, if one lets

\mathfrak{M}_n = \{x \in (1,\infty) | x = M_n(\alpha), \  \sigma c \alpha = c \sigma \alpha\},

then Chris Smyth shows (MR0736460) that, for all n > 0, the smallest elements of {\mathfrak{M}}_n are isolated, whereas {\mathfrak{M}}_n is dense for sufficiently large x. In this post, we shall be interested in what happens when one restricts to the class of cyclotomic integers. Namely, let

{\mathfrak{M}}^{{\mathrm{ab}}}_n = \{x \in (1,\infty) | x = M_n(\alpha) \ \alpha \in \mathbf{Q}^{\mathrm{ab}}\}.

In particular, when n = \infty, we obtain the set {\mathfrak{M}}^{{\mathrm{ab}}}_{\infty} consisting of the values \overline{|\alpha|} for cyclotomic integers \alpha. We call {\mathfrak{M}}^{{\mathrm{ab}}}_{\infty} the Abelian House. As already noted, the values of {\mathfrak{M}}^{{\mathrm{ab}}}_{\infty} \cap [1,2] consist of elements of the form 2 \cos(\pi/N), which includes 2 as a limit point. However, the spectrum of {\mathfrak{M}}^{{\mathrm{ab}}}_{\infty} for a short while beyond 2 is once again discrete. For example, the main theorem of MR3119783 (previously discussed here) completely computes {\mathfrak{M}}^{{\mathrm{ab}}}_{\infty} in the interval [0,(5.04)^{1/2}] — it has a second limit point at \sqrt{5} = 2.2360679\ldots and is once again discrete beyond this point. The case n =2 was studied in Cassels (MR0246852) and in MR2786219. In particular, Theorem 9.1.1 of MR2786219 is eqivalent to:

Proposition: The set {\mathfrak{M}}^{{\mathrm{ab}}}_2 = \overline{{\mathfrak{M}}^{{\mathrm{ab}}}_2} \subset {\mathbf{R}} is closed.

Note that M_2(\alpha)^2 =: \mathcal{M}(\alpha) \in {\mathbf{Q}} (the notation \mathcal{M} being used in ibid, so this closed subset is countable and is thus very far from being dense. Moreover M_{2n}(\alpha)^n = M_2(\alpha^n), so the theorem above implies that the closure \overline{{\mathfrak{M}}^{{\mathrm{ab}}}_{2n}} is also countable and lives inside {\mathbf{Q}}^{1/n} \cap {\mathbf{R}} \subset {\overline{\mathbf{Q}}} \cap {\mathbf{R}}. The main goal of the current post is to generalize this result to the abelian house.

Theorem: The closure of the abelian house \overline{{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}} is a subset of {\overline{\mathbf{Q}}} \cap {\mathbf{R}}. If S \subset {{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}} is bounded, then \liminf S \in {{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}}. However, \limsup S is not necessarily in {{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}}, that is, the abelian house itself is not closed.

One application of this is to the possible index of subfactors (see here and here for an overview of the problem):

Corollary A: Let \alpha \in {\mathbf{R}} \setminus {\mathbf{R}} \cap {\overline{\mathbf{Q}}} be a real transcendental number. Then there does not exist a finite depth subfactor A < B of index in the range (\alpha - \epsilon, \alpha + \epsilon) for some \epsilon > 0.

Corollary B: Let \alpha \in {\mathbf{R}} \cap {\overline{\mathbf{Q}}} be an algebraic number. Then there does not exist a finite depth subfactor A < B of index in the range (\alpha, \alpha + \epsilon) for some \epsilon > 0.

Corollary C: The set of indices of finite depth subfactors is a well-ordered subset of \mathbf{R} of ordinal type \omega^{\omega}. (Random aside: Just like volumes of 3-manifolds according to Thurston and Jørgensen.) I’m assuming here that it is easy enough to construct subfactors of index

\prod_{i=1}^{n} 4 \cos^2(\pi/p_i)

for distinct odd prime numbers p_i.

Since the main context here is that such indices arise as the spectral eigenvalue of graphs, it might be helpful (for contrast) to note that this latter spectrum is dense in [\sqrt{2+\sqrt{5}},\infty) (MR986863).

This theorem came from my bag of thesis problems. I actually expected it to be the case that {{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}} was closed, but this turns out to be completely false. On the other hand, the argument I had in mind to prove this theorem was roughly correct. On the third hand, it turns out that the solution to this problem was almost entirely included in a paper of A.Jones from the ’70s (MR0437466) (using the method I roughly had in mind).

Rational Linear subspace of ({\mathbf{R}}/{\mathbf{Z}})^k : Consider the standard torus {\mathbf{T}}:=({\mathbf{R}}/{\mathbf{Z}})^k with coordinates (x_1,\ldots,x_k). We define a rational linear subspace V of {\mathbf{T}} to be the subspace cut out by any number of equations of the form:

\sum a_{i,j} x_i = c_j

for integers a_{i,j} and elements c_j \in {\mathbf{Q}}/{\mathbf{Z}}. Topologically, V is finite disjoint union of tori. Any connected component of V is also a rational linear subspace. If all the c_j = 0, then we call V a rational linear subgroup. Call a point {\underline{x}} \in V rational if {\underline{x}} = (x_1,\ldots,x_k) where x_i \in {\mathbf{Q}}/{\mathbf{Z}}. Given V, the rational numbers c_j have a common denominator; let M denote some integer divisible by this common denominator. The map [m]: {\mathbf{T}} \rightarrow {\mathbf{T}} given by multiplication by m preserves V whenever m \equiv 1 \mod M.

Definition: For any rational point {\underline{x}} on V and an admissible integer M, let L({\underline{x}}) = L_M({\underline{x}}) denote the (finite) set of rational points of the form [m] {\underline{x}} \in V for all m satisfying the following two conditions:

  1. m \equiv 1 \mod M,
  2. m is prime to N, where {\underline{x}} = (x_1,\ldots,x_k) are elements of {\mathbf{Z}}[1/N]/{\mathbf{Z}}.

Of course, this definition comes from looking at the exponent of the conjugates of root of unity which fix an Mth root of unity. We call L({\underline{x}}) the line through {\underline{x}}. The notion of line depends on a choice of integer M, although replacing M by a multiple only (at worst) decreases the size of L({\underline{x}}). Our main technical lemma is the following:

Lemma A: Let S \subset V be any set of rational points, and let M be admissible for V. Then the closure of W = \bigcup L({\underline{x}}) of all lines L({\underline{x}}) = L_M({\underline{x}}) for {\underline{x}}\in S is a union of connected rational subspaces W^0 of V.

We shall apply this theorem to V = {\mathbf{T}} with M =1. However, in order to prove the result (by induction), it is easier to prove this more general statement.

Example: Suppose that n = 2, V = {\mathbf{T}}, and M = 2. If {\underline{x}} = (1/2,1/q), then the line L({\underline{x}}) consists of points of the form (1/2,p/q) with p odd and prime to q. The closure of all such points is the rational subspace x_1 = 1/2.

We proceed by induction on the dimension of V. We may first claim that we can assume V = V^0 is connected. The connected components of a rational subspace are obtained by replacing the linear equations by their saturation. However, this requires introducing numerators into the constants c_j, and so for this step (as well as several others) we must allow the auxiliary integer M to increase. If V is connected, it suffices to show that if the closure is not dense, then the points all lie on a (finite union of) co-dimension \ge 1 rational subspaces W of V, and then apply the inductive hypothesis.

Choose a rational base point {\underline{v}} \in V. After increasing M again if necessary, we may assume that M {\underline{v}} = 0. Under this assumption, translation by {\underline{v}} preserves lines and sends V to a connected rational linear subgroup of {\mathbf{T}}. After an integral change of basis, any connected rational linear subgroup is linearly equivalent to one of the form a_i x_i = 0 for a_i either zero or one, and thus, again without loss of generality, we may assume that V = {\mathbf{T}}. Now suppose that {\underline{v}} \in V is a point which is not in the closure of the set of lines. Because the complement of the closure is open, we may assume that {\underline{v}} is rational. Hence, once more translating by {\underline{v}} and increasing M if necessary, it suffices to show that either 0 is in the closure of the set of lines, or the points are all contained in a subvariety defined by a linear equation. Let {\underline{x}} = (x_1,\ldots,x_k) \in S, where one may think of the x_i as being lifted to {\mathbf{Q}}. The problem is to construct an integer n with n \equiv 1 \mod M and (n,N) = 1 such that if \|x\| denotes the nearest integer to x, then \|n x_i\| < \epsilon for all i, or to show that all the x_i satisfy some linear relation in {\mathbf{Q}}/{\mathbf{Z}}. Without the congruence condition on M, this is exactly a lemma proved by Davenport and Schinzel in MR0205926. Their proof does not obviously extend to this case, however. I had an idea to replace this analytic argument by using an idea of Cassels using the geometry of numbers. Write x_i = a_i/N where N is the smallest common denominator (so the greatest common divisor of the a_i is one). Let \Lambda \subset {\mathbf{Z}}^k denote the lattice

\Lambda := \{\lambda \equiv m (a_1,a_2, \ldots, a_k) \mod N, \quad m \in {\mathbf{Z}}\}.

The basic idea is to break up the problem into two steps: first, find an element of \Lambda of small length. If this element reduces under the natural map to \mathbf{Z}/N\mathbf{Z} to a multiple m of (a_1,\ldots,a_k) which is prime to N and 1 \mod M, then one wins. If not, deform the element both other small vectors, and use the fact that (in an arithmetic progression) one doesn’t have to go very far to find elements prime to N (by Iwaniec, \log(N) or so will suffice). In the end, it turns out that this improved version is essentially proved by Jones in MR0437466. (For me, it is easiest to modify his proof of Theorem 1 than read the notation in some of the latter theorems, but all of the required content is here.) In fact, the application Jones had in mind was almost identical to the topic of this post, namely, to study the higher derived sets of {\mathfrak{M}}^{{\mathrm{ab}}}_n. For some reason, however, he did not seem to notice the implication that \overline{{\mathfrak{M}}^{{\mathrm{ab}}}_n} was a subset of {\overline{\mathbf{Q}}}, possibly because his formalism was less algebraic than what we consider below.

Consider an infinite set S of roots of k-tuples of roots of unity (x_1, \ldots, x_k) which is closed under the action of \mathrm{Gal}({\overline{\mathbf{Q}}}/{\mathbf{Q}}), and view it as a subset of {\mathbf{G}}^k_m. Say that a set of k-tuples of units are constrained by a k-tuple of integers h = (h_1,\ldots,h_k) if, for all such tuples,

(x_1)^{h_1} (x_2)^{h_2} \ldots (x_k)^{h_k} \in \zeta^{{\mathbf{Z}}},

for some fixed root of unity \zeta. Since this property is preserved under taking dth roots for any fixed d \in {\mathbf{Z}}, we also insist that each constraining k-tuple consists of co-prime integers. A constraint cuts out a subvariety Z_h of {\mathbf{G}}^{k}_{m}, which in general is not geometrically connected. The intersection of any finite number of subvarieties Z_{h_i} is determined by the saturation of the subgroup of {\mathbf{Z}}^k generated by the h_i. In particular, there exists a maximal finite set of h_i such that Z:= \cap Z_{h_i}.

Theorem: The supremum of the elements

|y_1 + y_2 + \ldots + y_k|^2

in S is equal to the supremum of the quantity

|z_1 + z_2 + \ldots + z_k|^2,

over (z_1,\ldots,z_k) \in (S^1)^k \cap Z^0, where Z^0 \subset Z is some geometrically connected component of Z^0, and Z is a variety cut out by constraints for finitely many k-tuples. The infimum of the houses

\overline{|y_1+ y_2 + \ldots + y_k|}

is realized by an element of S.

Pulling back under the isomorphism \exp: {\mathbf{T}} \rightarrow (S^1)^k \subset {\mathbf{G}}^k_m, the pre-image of any geometrically connected component Z^0 is a connected rational linear subspace of {\mathbf{T}}, and conversely any connected rational linear subspace gives rise to such a Z^0. Write the pre-image of {\underline{y}} \in S as {\underline{x}} = (x_1, \ldots, x_k). Suppose that {\underline{y}} = (y_1,\ldots,y_k) where each y_i is a roots of unity in {\mathbf{Q}}(\zeta_N) (with N divisible by M), so that the denominators of the x_i divide N. The action of G:=\mathrm{Gal}({\mathbf{Q}}(\zeta_N)/{\mathbf{Q}}(\zeta_M)) on {\underline{x}} via \exp^* sends {\underline{x}} to m x for some m with (n,m) = 1. In particular, the conjugates on V precisely cut out the line L_M({\underline{x}}) of {\mathbf{T}} (with M = 1). It follows from Lemma A that the closure of \exp^*(S) consists of a finite union of connected rational subspaces W = \coprod W^0, and hence the closure of S is the finite union of the sets (S^1)^k \cap Z^0 for a finite number of geometrically connected Z^0. This proves the claim concerning the supremum. For the infimum, we argue as follows. There exists a component Z^0 such that the infimum of the largest conjugate of y_1 + \ldots y_k on Z^0 is equal to the infimum of the houses of elements of S. Let the supremum of |z_1 + z_2 + \ldots + z_k|^2 on this space be \beta. If the desired infimum is equal to \beta, then all elements must the same house, and the result follows immediately. If not, there exists a subset of S whose largest conjugates are bounded by \beta - \epsilon. But such a set can no longer be dense in (S^1)^k \cap Z^0. Hence, replacing S by this smallest set, we may reduce the dimension of Z^0. Continuing this process, we reduce to the case when either Z^0 is a point or all the houses of elements are the same, and in either case the result follows. Note that the supremum of an algebraic function on (S^1)^k \cap Z will automatically be algebraic — essentially by a rigidity argument. Alternatively, one can write down the equations required for a point to be a local minima, and observe that they are algebraic. To finish the proof of the main claim (except for the claim that {{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}} is not itself closed) it suffices to note, following a result of Loxton, that any cyclotomic integer of absolute value at most B can be written as a sum of (at most) L(B) roots of unity, so, when dealing with the closure of {{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}}, it suffices to consider sums of k roots of unity for a fixed k.

Returning to what Jones does, his main result is to consider the sets {{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}}(k) of cyclotomic integers which are the sum of k roots of unity, and then prove that the k-1st derived set consists precisely of the element \{k\}. In our context, it may seem as though the n-th derived set should consist precisely of the maxima of the natural function on the sets (S^1)^k \cap Z where Z has codimension n. However, there is an extra degeneracy coming from the fact that multiplication by a root of unity doesn’t change the house — so we may insist from the start that 1 is always one of the roots of unity of S, imposing the condition x_1 = 1.

The abelian house is not closed

We now prove that \overline{{{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}}} \ne {{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}} by constructing an explicit element of \overline{{{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}}} not in {{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}}. Indeed, the corresponding element will neither be cyclotomic nor an algebraic integer (although it will be algebraic). Consider the set of cyclotomic algebraic integers:

\beta = \zeta^2 + \zeta - \zeta^{-1}

\gamma = \zeta^2 + \zeta + \omega \zeta^{-1}

where \omega is a cube root of unity, and \zeta is (say) and pth root of unity for prime p. For large p, the Galois conjugates of \zeta become dense in the unit circle. It follows that the supremum of \overline{|\beta|}^2 is the square of the maximum of the quantity

|X^2 + X - X^{-1}|

over |X|=1, and similarly the supremum of \overline{|\gamma|}^2 is the maximum of the two quantities

|X^2 + X + \omega X^{-1}|, \quad |X^2 + X - \omega^{-1} X^{-1}|,

over the same region. One can compute this maximum, and it turns out, perhaps surprisingly, that it is equal to the value

\displaystyle{\frac{97 + 26 \sqrt{13}}{27} = 7.064604\ldots}

in the first case, which is not an algebraic integer and so not in {{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}}, and is equal in the second case to

\displaystyle{\frac{1}{27} \cdot \theta = 8.096242\ldots}

where

\theta^5 - 446 \theta^4 + 62377 \theta^3 - 3023244 \theta^2 + 57168180\theta - 351065988 = 0,

and K = {\mathbf{Q}}(\theta) has discriminant 2^2 \cdot 3^5 \cdot 15619 and Galois closure S_5. These are, perhaps, surprisingly ugly numbers for fairly simple looking maximization problems. It is clear, of course, that neither of these numbers lies in {{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}}, so this proves \overline{{{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}}} \ne {{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}}. Moreover, I think it quite likely (and quite provable, perhaps with a certain amount of computational effort) that \displaystyle{\frac{97 + 26 \sqrt{13}}{27} = 7.064604\ldots} is the smallest number in \overline{{{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}}} \setminus {{\mathfrak{M}}^{{\mathrm{ab}}}_{\infty}}.

Posted in Mathematics | Tagged , , , , , , , , , , , | Leave a comment

The distribution of Hecke eigenvalues, part II

Here are some numbers from KB promised in my last post.

“For the first 61595 newforms of squarefree level coprime to 15 here’s
the field extension of Z/3Z generated by the a_5 field extensions:”

[\mathbf{F}_3(a_5):\mathbf{F}_3] Total Number Number of Galois conjugacy classes Density of forms Density of conjugacy classes
Totals: 61595 10740 1 1
1 4623 4623 0.07505 0.4304
2 2492 1246 0.04046 0.1160
3 2397 799 0.03892 0.07439
4 2476 619 0.04020 0.05764
5 2600 520 0.04221 0.04842
6 2142 357 0.03478 0.03324
7 2289 327 0.03716 0.03045
8 2008 251 0.03260 0.02337
9 1962 218 0.03185 0.02030
10 1530 153 0.02484 0.01425
11 1837 167 0.02982 0.01555
12 1656 138 0.02689 0.01285
13 1612 124 0.02617 0.01155
14 1638 117 0.02659 0.01089
15 1455 97 0.02362 0.009032
16 1088 68 0.01766 0.006331
17 1292 76 0.02098 0.007076
18 1008 56 0.01636 0.005214
19 1159 61 0.01882 0.005680
20 1120 56 0.01818 0.005214
21 987 47 0.01602 0.004376
22 990 45 0.01607 0.004190
23 966 42 0.01568 0.003911
24 1056 44 0.01714 0.004097
25 1100 44 0.01786 0.004097
26 650 25 0.01055 0.002328
27 783 29 0.01271 0.002700
28 868 31 0.01409 0.002886
29 551 19 0.008946 0.001769
30 420 14 0.006819 0.001304
31 775 25 0.01258 0.002328
32 800 25 0.01299 0.002328
33 759 23 0.01232 0.002142
34 374 11 0.006072 0.001024
35 490 14 0.007955 0.001304
36 576 16 0.009351 0.001490
37 592 16 0.009611 0.001490
38 380 10 0.006169 0.0009311
39 429 11 0.006965 0.001024
40 680 17 0.01104 0.001583
41 492 12 0.007988 0.001117
42 294 7 0.004773 0.0006518
43 258 6 0.004189 0.0005587
44 308 7 0.005000 0.0006518
45 180 4 0.002922 0.0003724
46 322 7 0.005228 0.0006518
47 282 6 0.004578 0.0005587
48 144 3 0.002338 0.0002793
49 147 3 0.002387 0.0002793
50 350 7 0.005682 0.0006518
51 561 11 0.009108 0.001024
52 260 5 0.004221 0.0004655
53 106 2 0.001721 0.0001862
54 378 7 0.006137 0.0006518
55 0 0 0 0
56 112 2 0.001818 0.0001862
57 171 3 0.002776 0.0002793
58 406 7 0.006591 0.0006518
59 236 4 0.003831 0.0003724
60 120 2 0.001948 0.0001862
61 183 3 0.002971 0.0002793
62 62 1 0.001007 0.00009311
63 378 6 0.006137 0.0005587
64 320 5 0.005195 0.0004655
65 130 2 0.002111 0.0001862
66 132 2 0.002143 0.0001862
67 201 3 0.003263 0.0002793
68 68 1 0.001104 0.00009311
69 276 4 0.004481 0.0003724
70 140 2 0.002273 0.0001862
71 284 4 0.004611 0.0003724
72 144 2 0.002338 0.0001862
73 292 4 0.004741 0.0003724
74 74 1 0.001201 0.00009311
75 0 0 0 0
76 152 2 0.002468 0.0001862
77 0 0 0 0
78 78 1 0.001266 0.00009311
79 79 1 0.001283 0.00009311
80 160 2 0.002598 0.0001862
81 81 1 0.001315 0.00009311
82 0 0 0 0
83 83 1 0.001348 0.00009311
84 168 2 0.002727 0.0001862
85 85 1 0.001380 0.00009311
86 0 0 0 0
87 0 0 0 0
88 0 0 0 0
89 89 1 0.001445 0.00009311
90 0 0 0 0
91 0 0 0 0
92 0 0 0 0
93 0 0 0 0
94 0 0 0 0
95 95 1 0.001542 0.00009311
96 0 0 0 0
97 0 0 0 0
98 0 0 0 0
99 0 0 0 0
100 0 0 0 0
101 0 0 0 0
102 0 0 0 0
103 0 0 0 0
104 104 1 0.001688 0.00009311

I’ve presented the numbers KB send me in various ways. The first column simply counts the field generated by a_5. The second column normalizes by the order of the field. This is a little like counting two representations which differ by an automorphism of the coefficient field as being “the same.” The final two comments are then the proportion of the first two columns overall.

I’m really not quite sure what to make of this data. It does suggest that A is false, which is perhaps not surprising. It’s not terribly overwhelming evidence for B, but then, law of smaller numbers and all.

AV’s suggestion in the comments that the constants C_q should be independent of q must refer to the constants in the second last column, I believe. Of course, it might be the case that \mathbf{F}_3(a_5) is smaller than \mathbf{F}_3(a_2,a_5,a_7,a_{11},\ldots), so these numbers aren’t exactly the same as the fields generated by the mod-p reductions of the eigenforms. If you squint, the numbers in this column do look somewhat constant for n < 10 or so. One can even argue that n = 1 might be artificially inflated exactly because the phenomenon of "slipping into a subfield" mentioned above. So I'm giving the points to AV. (Yes, that's right, there were points available and you missed out because you didn’t play the game.)

Posted in Mathematics | Tagged , , , , , | Leave a comment