How to Measure What We Don't Know
September 10, 2009 By Lisa Zyga(PhysOrg.com) -- How do we discover new things? For scientists, observation and measurement are the main ways to extract information from Nature. Based on observations, scientists build models that, in turn, are used to make predictions about the future or the past. To the extent that the predictions are successful, scientists conclude that their models capture Nature’s organization. However, Nature does not reveal secrets easily - there is no way for observers to learn everything about a process, so some information always remains hidden from view; other kinds of information are present, but difficult to extract. In a recent study, researchers have investigated how to measure the degree of hidden information in a process (its “crypticity”) and, along the way, solved several puzzles involved in extracting, storing, and communicating information.
In their study, James Crutchfield, Physics Professor at the University of California at Davis, and graduate students Christopher Ellison and John Mahoney, have developed the analogy of scientists as cryptologists who are trying to glean hidden information from Nature. As they explain, “Nature speaks for herself only through the data she willingly gives up.” To build good models, scientists must use the correct “codebook” in order to decrypt the information hidden in observations and so decode the structure embedded in Nature’s processes.
In their recent work, the researchers adopt a thorough-going informational view: All of Nature is a communication channel that transmits the past to the future by storing information in the present. The information that the past and future share can be quantified using the “excess entropy” - the mutual information between the past and the future.
Since the present mediates between the past and future, it is natural to think that the excess entropy must somehow be stored in the present, the researchers explain. And while this is true, the researchers showed that, somewhat surprisingly, the present typically contains much more information than just the excess entropy. The information stored in the present is known as the “statistical complexity.” The more information Nature must store to turn her noble gears, the more structured her behavior.
The information that manages to go unaccounted for - the difference between the stored information (statistical complexity) and the observed information (excess entropy) - is the “crypticity”. It captures a new and under-appreciated complexity of a process, something that goes above and beyond what is directly measured in observations. At a more general level, the researchers provide an explicit way to understand the difference between simply making predictions from data versus modeling the process’s underlying structure.
“The results are at the crossroads of several research threads, from causal inference to new forms of computing,” Crutchfield told PhysOrg.com. “But here are a couple of things we highlight: One can look at all of nature as a communication channel: Nature communicates the past to the future, by storing information in the present. In addition, information about how a system is structured can be available in observations, but very hard to extract. Crypticity measures the degree of that difficulty. Even in equilibrium there are temporal asymmetries.”
Although excess entropy, statistical complexity and crypticity are straightforward to define, their direct calculation has been a long-standing puzzle. Crutchfield, Ellison, and Mahoney developed a novel approach to its solution. The process, interpreted as a communication channel, is scanned in both the forward and reverse time directions to create models for prediction and retrodiction. By analyzing the relationship between predicting and retrodicting, they were able to uncover not only the external, time-symmetric information (excess entropy), but also the internal, asymmetric information (statistical complexity and crypticity). By looking inside Nature's communication channel, they discovered a rather non-intuitive asymmetry: Even processes in equilibrium commonly harbor temporally asymmetric structures.
“The basic idea is that a process can appear to not transmit much information from its past to its future, but still require a large amount of hardware to keep the internal machine going,” Crutchfield said. “For example, imagine that you have two coins: Coin A is a fair coin and Coin B is slightly biased. Now the output of this process is a series of heads and tails. That's all the observer gets to see. The observer doesn't know when A is used or B is used. To an observer this process is very close to a fair coin - the heads and tails from B just don't differ much in their statistics from the heads and tails from A. So, the observed process has little mutual information (the heads and tails are pretty much independent of the past). That is, the process has very low excess entropy. Nonetheless, there is one bit of internal stored information: Which coin, A or B, is flipped at each step? You can take this example to an extreme where you have hundreds of internal coins, all slightly biased, all slightly different in their bias, and therefore distinct coins. The large number of coins gives you an arbitrarily large statistical complexity. But the small biases mean the excess entropy is as close to zero as you like.”
These fundamental results should impact research across a wide range of disciplines, from statistical modeling to novel forms of computing. As the researchers explain, when a process contains hidden information, the process cannot be directly represented using only raw measurement data. Rather, a model must be build to account for the degree of hidden information that is encrypted within the process’s observed behavior. Otherwise, analyzing a process only in terms of observed information overlooks the process’s structure, making it appear more random than it actually is.
“In statistical modeling, if you ignore a process's crypticity, you will conclude that nature is more random and less structured than she really is,” Crutchfield said. “We suspect that this general principle will be seen (or is even operating) in many scientific domains, from biosequence analysis to dark energy modeling.”
Intel, which partially funded this research, also has an interest in using the results to improve network performance. For many years, Intel has funded research on complex systems through the Network Dynamics Program, which Crutchfield runs.
“Intel's original interest, 10-plus years ago, was to stimulate research on the structure and dynamics of networks,” Crutchfield said. “This was an extremely successful program, in fact stimulating much progress in the early years that has now blossomed into the field of network science. The present work adds to that growing body of understanding of how complex systems are organized and how we should model and characterize them. In particular, crypticity and causal irreversibility suggest new metrics for network performance.”
More information: James P. Crutchfield, Christopher J. Ellison, and John R. Mahoney. “Time's Barbed Arrow: Irreversibility, Crypticity, and Stored Information.” Physical Review Letters 103, 094101 (2009).
Copyright 2009 PhysOrg.com.
All rights reserved. This material may not be published, broadcast, rewritten or redistributed in whole or part without the express written permission of PhysOrg.com.
-
Physicists investigate how time moves forward
Sep 05, 2008 |
not rated yet |
0
-
Measuring the unseeable: Researchers probe proteins' 'dark energy'
Jul 19, 2007 |
not rated yet |
0
-
"Colony" Computer to Look for a Theory of Theories
Aug 23, 2005 |
not rated yet |
0
-
Physicist Proposes Solution to Arrow-of-Time Paradox
Aug 27, 2009 |
not rated yet |
0
-
Bluffing could be common in prediction markets, study shows
Jul 10, 2008 |
not rated yet |
0
-
Engineers build first sub-10-nm carbon nanotube transistor
Feb 01, 2012 |
4.9 / 5 (30) |
30
-
Something old, something new: Evolution and the structural divergence of duplicate genes
Jan 31, 2012 |
4.6 / 5 (7) |
1
-
The hidden nanoworld of ice crystals: Revealing the dynamic behavior of quasi-liquid layers
Jan 30, 2012 |
5 / 5 (3) |
1
-
Stock market network reveals investor clustering
Jan 27, 2012 |
3.9 / 5 (23) |
8
-
Of microchemistry and molecules: Electronic microfluidic device synthesizes biocompatible probes
Jan 26, 2012 |
5 / 5 (1) |
0
-
perturbance in a model
1 hour ago
-
Combustion: where does the heat come from?
2 hours ago
-
How does dynamic pressure affect static pressure?
2 hours ago
-
Sonication energy
3 hours ago
-
How physicists handle the idea of Free Will?
4 hours ago
-
Orbital Decay Question
5 hours ago
- More from Physics Forums - General Physics
More news stories
Borexino Collaboration succeeds in spotting pep neutrinos emitted from the sun
(PhysOrg.com) -- To learn more about how the sun works, scientists study particles that are emitted from it into space due to thermonuclear reactions that occur inside; by applying known physics principles, ...
Physics research suggests new pathways for cancer progression
Observing that certain cancer cells may exhibit greater flexibility than normal cells, some scientists believe that this capability promotes rapid tumor growth. Now computer simulations developed by Boston University Biomedical ...
5 hours ago |
5 / 5 (1) |
0
Explained: Sigma
It's a question that arises with virtually every major new finding in science or medicine: What makes a result reliable enough to be taken seriously? The answer has to do with statistical significance -- but ...
7 hours ago |
5 / 5 (6) |
7
Quantum physicist explains $100K offer for proof scaled-up quantum computing is impossible
(PhysOrg.com) -- MIT researcher Scott Aaronson has certainly riled the physics community with his offer this past Friday, of $100,000 to anyone who can prove that scaled-up quantum computing is impossible. ...
Physicists build highly efficient 'no-waste' laser
A team of University of California, San Diego researchers has built the smallest room-temperature nanolaser to date, as well as an even more startling device: a highly efficient, "thresholdless" laser that ...
Feb 08, 2012 |
5 / 5 (17) |
5
|
Hydrogen from acidic water: Researchers develop potential low cost alternative to platinum for splitting water
A technique for creating a new molecule that structurally and chemically replicates the active part of the widely used industrial catalyst molybdenite has been developed by researchers with the Lawrence Berkeley ...
To perform with less effort, practice beyond perfection
Whether you are an athlete, a musician or a stroke patient learning to walk again, practice can make perfect, but more practice may make you more efficient, according to a surprising new University of Colorado Boulder study.
Scientists identify most lethal known species of prion protein
Scientists from the Florida campus of The Scripps Research Institute have identified a single prion protein that causes neuronal death similar to that seen in "mad cow" disease, but is at least 10 times more ...
Researchers weigh in on ethics of H5N1 research
(Medical Xpress) -- In a commentary on the biosecurity controversy surrounding publication of bird flu research details, a bioethicist and a vaccine expert at Johns Hopkins reaffirm that "all scientists have an affirmativ ...
US workers are 'giving away the store,' costing firms billions
Nearly 70 percent of the nation's service employees give away free goods and services from hamburgers to cable TV costing companies billions of dollars a year, according to a groundbreaking study.
New prenatal genetic test is much more powerful at detecting fetal abnormalities
A nationwide, federally funded study has found that testing a developing fetus' DNA through chromosomal microarray (CMA) provides more information about potential disorders than does the standard method of prenatal testing, ...
Sep 11, 2009
Rank: 5 / 5 (2)
Sep 11, 2009
Rank: not rated yet
Remember... science does not discover reality, only helps us model our interaction with it. It is easy to fall into the same trap many religions fall into... we begin to believe we comprehend the incomprehensible because we can "make an icon" that represents it for us.
The model/icon is useful in how it helps us navigate this thing called life... it is not life.
Sep 11, 2009
Rank: not rated yet
Though the article oversells the paper. There is nothing new about effective theories, or using noise distributions for unknown effects, or having unobservable nodes in a system analysis of IC. But experimenting as per above is a mean for uncovering this lacking information to desireable degree, and when we have a complete QM description it promises us "the truth, the whole truth and nothing but the truth" (no local hidden variables).
I think it may be true in this classical setting as well. IIRC there is a theorem in category theory which shows that one can map mathematical objects fully. As I understand it, by throwing all possible data at it one can extract its function. Here it would be to do experiments, data and parameters, not merely passively observe what the system does.
Sep 11, 2009
Rank: 4 / 5 (1)
We don't have to be satisfied with not "learn[ing] everything".
Btw, perhaps evolution figured this out early. IIRC there was this description of neuroscience on rats, where they have figured out that rats (and presumably other animals) simulate the situation forwards but also backwards when they do things. I.e. they figure out "how will I get out of this mess" but also "how did I get into this mess" meanwhile it happens, precisely as the paper describes how to extract maximum information of passive observation. (Of course, animals tend to act as well, do choices and all that jazz that the paper doesn't cover.)
Sep 11, 2009
Rank: not rated yet
It is never as complex as the phenomenon, for if it were, it would not be amenable to the sort of mathematical and computational analysis that science lends.
I don't think that follows. Nature seems to be algorithmic, as our theories are.
Reasonably a phenomena is emergent when our theories of the same shows emergence, and so on. That takes care of spatial computing resources, in principle.
Similarly if a phenomena would be NP, say, it wouldn't converge on a result in reasonable time. But we can observe that it does. That takes care of temporal computing resources, in principle.
Then anything nature can do, in principle we can do as well.
[Conversely, if we can't say that an algorithm is undecidable, nature can't either. When you walk into anthropic principle land, where you can claim that _all_ of observed physics is stable because you need it to be.]
Sep 12, 2009
Rank: 5 / 5 (25)
Actually Tjorn, I think smoke&mirrors is quit correct here. Our models of reality are not the Reality itself, because for starters the form is different. We encode reality within pre-existing conceptual forms, and therefore delimit phenomenal reality, which includes subject dependent paradigmns,.. and Reality proper, that is, reality as it is apart from being conceptualized. In other words there can never be a complete one-to-one correlation between models and Reality. It appears that the authors presume that there can be in principal, at the start, but I could be missing the point.
Sep 12, 2009
Rank: 1 / 5 (1)
Sep 12, 2009
Rank: 5 / 5 (1)
Sep 13, 2009
Rank: not rated yet
Computer science seems to be the future of science in general, what with guys like the above and Wolfram's ideas. We are no longer limited by our natural senses, but by how much data we can compute and how fast. Pure reason is becoming our primary mean of interacting with reality.
But pure reason has been thoroughly proven insufficient for some time. It's just a matter of time till a computer scientist comes up with a theorem, if they haven't already, that sets an asymptotic limit to computer modelling. To compute all the data in the universe, wouldn't you need another universe as memory?
Sep 13, 2009
Rank: not rated yet
Sep 15, 2009
Rank: not rated yet
This is a central tenant of the Information model of the Universe. It's an interesting read but very subjective in some cases.