Do OPERA’s tachyonic neutrinos make sense?
September 27, 2011
Having been suitably humbled on the statistics of the OPERA experiment in recent days, I’ve been having a bit of play with the numbers to see whether their measurements would make sense, at even an elementary level. (I agree that the systematic errors look precarious — but they have had hundreds of sensible people looking at them for months. Any important discovery will always be on that precarious edge. Most are looking into that already, so I’ll leave it to them for now.)
So let’s assume, for the sake of this blog entry, that the OPERA measurement is sound. What would that mean for physics?
For starters, it would not mean that Einstein would be turning in his grave. I’ve been sorely disappointed in the past few days by physicists, who should know better, claiming that tachyonic neutrinos (i.e. travelling faster than the speed of light) would utterly destroy the special and general theories of relativity; it indicates that there are some in the current generation of physicists who haven’t been given a thorough enough grounding in the basics. For sure, if someone had managed to accelerate a regular (bradyonic) particle through the “light barrier”, then relativity would be in trouble: that’s simply not possible in relativity as we know it, with a finite amount of energy. But Lorentz kinematics has nothing to say about particles that always go faster than the speed of light. Again, such tachyons would allow signals to be sent backwards in time, and so causality would take a battering; but there is nothing in Einstein’s kinematics that prevents it.
After all, we already know that antiparticles are effectively regular particles travelling backwards in time, which is something that was not in the realm of physics at the time that Einstein did his seminal work. (My colleagues and I wrote up a short paper for the pedagogical American Journal of Physics, explaining how this can be understood and made sense of, even at the classical level, some time ago.) Just as relativistic kinematics handles antiparticles with relative ease (once you figure which equations extend seamlessly to that domain), so too can it handle tachyons.
There’s almost a century of work in tachyons available for us to contemplate, but there are some simple calculations that we can do with little more than Einstein’s original equations. Consider this one:
m^2 = E^2 ( 1 – v^2 ),
where (like all theoretical particle physicists) I’m using “natural” units in which the speed of light, c, has the numerical value of 1. This equation isn’t generally used much in particle physics — it’s something you only see in an elementary textbook on special relativity — but it is useful for considering the OPERA experiment. Factorising the parentheses, we have
m^2 = E^2 ( 1 – v ) ( 1 + v ).
For the case of particles travelling at approximately the speed of light (as we have in OPERA), namely v being approximately 1, the last factor 1 + v is well approximated by 2, and so we have
m^2 \approx -2 E^2 ( v – 1 ).
Note that tachyonic particles simply have a negative mass-squared. This might sound strange — and indeed this property leads some to describe a tachyon, misleadingly, as having an “imaginary mass” — but think about what “mass” really is: Go back to elementary special relativity, where it is generally called “rest mass” (until you get familiar with it): the “mass” of a particle is indeed just its energy in a frame in which it is at rest. But just as a bradyon can never be accelerated through the light barrier, nor can a tachyon be decelerated through the light barrier; it can never travel slower than the speed of light, and most certainly can never be at rest. So saying that a tachyon has an “imaginary (rest) mass” is not really useful.
So what does m^2 mean for a tachyon? Well, for any free tachyon there is a frame of reference (up to a spatial rotation, as always) in which its velocity v is infinite; in other words, the tachyon travels from its creation point in space to its destruction point in space (and all points in space on a straight line between those points) instantaneously. Let’s call this the “instantaneous frame” for a given free tachyon, by analogy with the “rest frame” for a free bradyon. By the above, if m^2 is a physical constant for the tachyonic particle, then the energy E must be zero in such a frame. A particle with zero energy? Sounds strange, but possible: the tachyon would still have a finite (three-)momentum. And in fact, the magnitude of its momentum in its instantaneous frame is just the square-root of -m^2. So you can think of \sqrt{-m^2} as its “instantaneous momentum” (for want of a better term); the magnitude of its three-momentum in any other frame is always greater than this value (just as, for a regular bradyonic particle, the magnitude of its energy in any frame other than its rest frame is always greater than its energy in its rest frame, namely, its mass).
OK, so let’s go back to the equation above. Now, although the OPERA neutrinos have a distribution of energies E, this equation can at least give us an order-of-magnitude estimate for -m^2. Overall, the average neutrino energy in their experiment is 17 GeV, and v – 1 = ( 2.48 +/- 0.28 +/- 0.30 ) x 10^{-5}. Pretending, for the sake of this order-of-magnitude calculation, that all of the neutrinos had an energy of exactly 17 GeV, then the above equation would give
-m^2 = ( 14300 +/- 1600 +/- 1700 ) MeV^2.
To make things simple, let’s denote the “instantaneous momentum” by w:
w \id \sqrt{ -m^2 }.
Then for this approximate calculation, we would have
w = ( 120 +/- 7 +/- 7 ) MeV.
These uncertainties aren’t correct, because I have approximated the energy distribution by a delta function at E = 17 GeV. But it at least tells us that, if correct, the OPERA experiment is pointing to some sort of tachyonic particle with a w value around 100 MeV.
We can do one better than this, even without having access to the entire OPERA data set. They divided the charged current subset of their data into two nearly equal halves, by energy, by cutting at 20 GeV. The mean energy of the lower-energy half was 13.9 GeV, and the mean energy of the upper-energy half was 42.9 GeV. Converting their nanosecond results into v – 1 values, and again making the (wrong!) approximation that the energy distribution of each half is a delta function at the mean energy value, I get
w = ( 92 +/- 17 +/- 7 ) MeV around 13.9 GeV,
w = ( 320 +/- 50 +/- 20 ) MeV around 42.9 GeV.
As noted by many people already, the lack of any significant energy dependence in the OPERA time shift results seems to suggest that there is a systematic error at play here. For the above calculation, this is reflected in the fact that the estimated w value increases as the energy increases. (Of course, these uncertainties are also not correct, because we have not convolved in the energy distribution, so it is not proof of a problem.)
The fundamental problem is that, even if we were to believe that one or more of the neutrino mass eigenstates were tachyonic (i.e. that one or more of the m^2 eigenvalues were negative), and even if only those particular mass eigenstates were in play for the OPERA experiment (more on this below), then there should be an energy dependence in the time shift: lower-energy neutrinos should get to Italy faster than the higher-energy neutrinos. (This sounds like it’s the wrong way around, but that’s the way it works for tachyons; remember, zero-energy tachyons have infinite speed!) This should have caused dispersion in the arrival time PDF, and the resultant widening of the PDF (in particular, the two “end regions”) should have been measurable at the precision of the experiment. For example, if we take the central estimate of w = 120 MeV, then a 13.9 GeV particle should get to Italy 91 ns early, a 17 GeV particle should get there 61 ns early, and a 42.9 GeV particle should get there just 10 ns early.
On top of this, one would expect, from the raft of experimental evidence already available, that there is at least one neutrino mass eigenstate whose m^2 value is within eV^2 of zero, not MeV^2. (The Review of Particle Physics would tend to indicate establishment of all of the mass eigenstates to satisfy this bound, but it’s not clear to me that the “physical boundary” mentioned on page 556 — presumably that m^2 >= 0 — might not allow some wiggle room; in any case, I doubt that many of the analyses would have been taking into account the possibility of tachyonic mass eigenstates with w on the order of 100 MeV.) With the mixing angles in the neutrino sector being so large, and with (postulated) m^2 values as large as these, it’s difficult to see how the small m^2 eigenstate(s) wouldn’t be coupled in to the OPERA measurements. In that case, not only would you expect to see the dispersion in arrival times described above, you would also expect to see another “copy” of the source PDF with an anomalous shift of zero (i.e. travelling at exactly the speed of light, to within the precision of this experiment). Again, that was not seen by OPERA.
The catch-all response to all this, of course, is that, if the OPERA result is correct, then there must be “new physics” in the neutrino sector. But what sort of “new physics” would yield a shift in the arrival times that is so energy-independent?
There’s only one simple explanation that comes to my mind: Leave the neutrinos with the m^2 values that we believe them to have (on the order of or less than eV^2), but have them not created directly (as with the Standard Model) but rather via some sort of intermediate tachyonic particle (that gets 60 ns or so ahead of the light-cone in the OPERA experiment) before decaying into neutrinos. That would explain the clean shape of the OPERA distribution, and would be consistent with SN1987A and all the other experimental evidence for small m^2.
This sort of model may also explain why so many neutrino experiments have tended to give an estimate of m^2 that is negative, albeit small (including Fermilab’s MINOS). If there is some sort of intermediate tachyon that comes into play wherever the Standard Model would currently assume to be a neutrino creation vertex, which decays soon after into a neutrino, then every calculation will be skewed that little bit towards a negative m^2; but without explicitly including the intermediate tachyon, each calculation would come up with a different skewed estimate of this negative m^2 (which will be smaller, the longer the “baseline”, i.e. it depends on how much of the measurement is of the neutrino, and how much is of the intermediate tachyon).
While unconventional, this sort of model would also leave us with a Universe that is only a “little bit acausal”, just as it is only a “little bit CP-violating”. It wouldn’t make it impossible to make a tachyonic anti-telephone using this intermediate tachyon, but it would certainly make it a difficult engineering problem.
All this sounds far-fetched, of course, and by far the best guess by all at the moment is that the OPERA result has an unfound systematic error. But it’s important to do the “what-if” analysis, just in case.
You just never know.
September 30, 2011 at 11:33 am
If a neutrino (of some flavor) takes a shortcut through higher dimensional space for a short while before emerging into our space-time, it could appear to be a tachyon for the initial part of its journey. What we need is more and better experiments that are specifically designed to measure neutrino velocities.
October 1, 2011 at 12:19 am
Nice to see someone looking at the issue without beoing blinkered by a phobia about causation violation.
I would point out that the speed estimate from OPERA is for the average neutrino, not for every neutrino. Even the relatively sharp cutoffs at the beginning and end of the pulses take hundreds of nanoseconds to ramp up or down; the OPERA folks are able to get the standard deviation down to 7ns only because they have hundreds of relevant data points – in addition to the thousands of irrelevant ones, as you’ve pointed out. Individual neutrinos could vary from c or below to well above the average, and we would not see it in the data, because the pulse lasts so much longer than variation in neutrino transit times.
Where we would expect to get multiple pulses would be in cases like supernova 1987a.
I do like the idea in your later post better, but I don’t think the current data differentiate between the hypotheses here. What we need is for CERN or Fermilab to shoot neutrinos off to Kamiokande, or at least to each other, so we can get measurements with a different baseline.
October 1, 2011 at 4:39 am
I agree completely that the current data doesn’t tie down anything: it’s one (and a bit) data points — and the “bit” (MINOS) is over exactly the same baseline distance. I’m sure all the neutrino collaborations are prioritising the tests they can run, now that this issue is “hot”. (Testing my later post should be a lot easier — you can do it at Fermilab or CERN without needing to shoot them cross-country at all.)
I don’t agree, though, that the OPERA data is consistent with a large variation in advance times. I don’t have a quantitative calculation done on this (and you’d be right to point out my recent bad track record analysing this data by eye; feel free to crunch the numbers and prove me wrong yet again!), but any such variation would be convolved in with the bunch PDF. OPERA has shown us the leading and trailing edges in their Fig. 12, and it’s difficult to see either the width at half-height (say) of the whole 10.5 us bunch or the slope of the ramp-up and ramp-down being different from what they show in the red proton PDF lines — either or both of which should be altered by such a convolution.
Even if there were some wiggle room there (and I’m sure that OPERA will come out with the ML calculations of those things in due course), it’s even more difficult to reconcile when you look at their low energy / high energy cut (for charged current interactions, anyway, but they were consistent with the whole data set). If it’s not correlated with energy, then with what? Granted, there’s scope for unlimited “new physics” speculation until there are more experimental numbers, but even the Lorentz-violating extensions of spacetime tend to have an energy dependence or one sort or another, and contorting them to remove it looks like a big stretch. (There’s a good paper out by John Ellis and colleagues on this: http://arxiv.org/abs/1109.6296 )
I’m not saying that there isn’t going to be some distribution on these advance times — it’s certainly not going to be a delta function at 60-something nanoseconds (assuming it’s not a systematic error) — as whatever is causing it will have some distribution, but at the moment it’s looking much narrower than the effect itself.
October 1, 2011 at 3:09 pm
[...] new and there is some previous experimental evidence to suggest that it is the case. Moreover, as many including a comment in my previous post on this subject have noted, Einstein’s theories do [...]
November 16, 2011 at 11:29 pm
All those still following this post: Rumour on the wires is that OPERA has confirmed the time of flight with pulses only 2 or 3 nanoseconds long:
http://motls.blogspot.com/2011/11/opera-neutrinos-ftl-even-at-3-ns.html
Press conference expected within two days.
November 20, 2011 at 9:27 pm
Hi John;
Your simple extrapolation of the special (and by default, the general) formulation of relativity to supraluminal states (which everybody seems to do) I think needs re-examination. What is needed is a re-build from the ground up; starting from first principles (Bruce has a copy of my work in this area if you are interested, and Ray did have a copy of the thesis with the full treatment, but I don’t know if he still has it). Several different ingredients are needed but the really crucial one is a more complete treatment of the nature of time; we don’t have a realistic formuation of time in any of our existing theoretical constructions. (If you want to get some real insight into the nature of time, speak to a traditional Australian Aborigine – neither the past nor the future have any physical existence for these people; naieve ? no, profound, yes). For neutrinos, I began by building a co-ordinate system synchronised with assumed massless neutrinos and then go on to show, in a qualitative and quantitative way, what the differences are between such a co-ordinate system and one built on light co-ordinated clocks. Oscillatory phenomenology can be built into such a scheme by mixing non-local tachyonic fields that propagate from the unseen future and subluminal fields that that propagate from the past. (Producing a generational structure however requires a lot of new machinery). A physical state that is observed then corresponds to the temporal crossing of states; but that crossing still contains an element of tachyonic non-locality. Qualitatively this phenomenology can encompass both the Opera and MiniBoonE
results. I’m still struggling with them quantitatively however (the numbers are much larger than expected) but hope to have a revised paper to send to Bruce in a few months time.
Regards,
November 20, 2011 at 10:17 pm
Hi Greg,
No worries — absolutely, I have my own prejudices when it comes to extending relativistic mechanics to tachyons, and I tend to lean on Occam’s Razor as much as possible.
But I agree that if there is something tachyonic going on here, then anything at all is possible in principle, and it will ultimately be experimental results that determine what extension of current physics is correct.
John