Data Quotes (40 quotes)
Dogbert (advice to Boss): Every credible scientist on earth says your products harm the environment. I recommend paying weasels to write articles casting doubt on the data. Then eat the wrong kind of foods and hope you die before the earth does.
Il faut bien s'arrêter quelque part, et pour que la science soit possible, il faut s'arrêter quand on a trouvé la simplicité.
Analyse data just so far as to obtain simplicity and no further.
Analyse data just so far as to obtain simplicity and no further.
Trimming consists of clipping off little bits here and there from those observations which differ most in excess from the mean, and in sticking them onto those which are too small; a species of 'equitable adjustment,' as a radical would term it, which cannot be admitted in science.
A poet is, after all, a sort of scientist, but engaged in a qualitative science in which nothing is measurable. He lives with data that cannot be numbered, and his experiments can be done only once. The information in a poem is, by definition, not reproducible. ... He becomes an equivalent of scientist, in the act of examining and sorting the things popping in [to his head], finding the marks of remote similarity, points of distant relationship, tiny irregularities that indicate that this one is really the same as that one over there only more important. Gauging the fit, he can meticulously place pieces of the universe together, in geometric configurations that are as beautiful and balanced as crystals.
A procession of the damned. By the damned, I mean the excluded. We shall have a procession of data that Science has excluded.
Although we often hear that data speak for themselves, their voices can be soft and sly.
Beware of the problem of testing too many hypotheses; the more you torture the data, the more likely they are to confess, but confessions obtained under duress may not be admissible in the court of scientific opinion.
Data is not information, Information is not knowledge, Knowledge is not understanding, Understanding is not wisdom.
Errors using inadequate data are much less than those using no data at all.[?] Simplified restatement of another quotation by Babbage.
Facts and theories are different things, not rungs in a hierarchy of increasing certainty. Facts are the world's data. Theories are structures of ideas that explain and interpret facts. Facts do not go away while scientists debate rival theories for explaining them. Einstein's theory of gravitation replaced Newton's, but apples did not suspend themselves in mid-air pending the outcome.
Firm support has been found for the assertion that electricity occurs at thousands of points where we at most conjectured that it was present. Innumerable electrical particles oscillate in every flame and light source. We can in fact assume that every heat source is filled with electrons which will continue to oscillate ceaselessly and indefinitely. All these electrons leave their impression on the emitted rays. We can hope that experimental study of the radiation phenomena, which are exposed to various influences, but in particular to the effect of magnetism, will provide us with useful data concerning a new field, that of atomistic astronomy, as Lodge called it, populated with atoms and electrons instead of planets and worlds.
Given a large mass of data, we can by judicious selection construct perfectly plausible unassailable theories—all of which, some of which, or none of which may be right.
How then did we come to the “standard model”? And how has it supplanted other theories, like the steady state model? It is a tribute to the essential objectivity of modern astrophysics that this consensus has been brought about, not by shifts in philosophical preference or by the influence of astrophysical mandarins, but by the pressure of empirical data.
I am particularly concerned to determine the probability of causes and results, as exhibited in events that occur in large numbers, and to investigate the laws according to which that probability approaches a limit in proportion to the repetition of events. That investigation deserves the attention of mathematicians because of the analysis required. It is primarily there that the approximation of formulas that are functions of large numbers has its most important applications. The investigation will benefit observers in identifying the mean to be chosen among the results of their observations and the probability of the errors still to be apprehended. Lastly, the investigation is one that deserves the attention of philosophers in showing how in the final analysis there is a regularity underlying the very things that seem to us to pertain entirely to chance, and in unveiling the hidden but constant causes on which that regularity depends. It is on the regularity of the main outcomes of events taken in large numbers that various institutions depend, such as annuities, tontines, and insurance policies. Questions about those subjects, as well as about inoculation with vaccine and decisions of electoral assemblies, present no further difficulty in the light of my theory. I limit myself here to resolving the most general of them, but the importance of these concerns in civil life, the moral considerations that complicate them, and the voluminous data that they presuppose require a separate work.
If this “critical openminded attitude” ... is wanted, the question at once arises, Is it science that should be studied in order to achieve it? Why not study law? A judge has to do everything that a scientist is exhorted to do in the way of withholding judgment until all the facts are in, and then judging impartially on the merits of the case as well as he can.... Why not a course in Sherlock Holmes? The detectives, or at least the detective-story writers, join with the scientists in excoriating "dogmatic prejudice, lying, falsification of facts, and data, and willful fallacious reasoning."
It is a capital mistake to theorize before one has data. Insensibly one begins to twist facts to suit theories, instead of theories to suit facts.
It's tempting to go to the throat of the volcano to get the data, because if you do you're a hero ... It's a battle between your mind and your emotions. If your emotions win out, you can get yourself in a lot of trouble.
Mathematics may be compared to a mill of exquisite workmanship, which grinds you stuff of any degree of fineness; but, nevertheless, what you get out depends upon what you put in; and as the grandest mill in the world will not extract wheat-flour from peascods, so pages of formulae will not get a definite result out of loose data.
Measurement has too often been the leitmotif of many investigations rather than the experimental examination of hypotheses. Mounds of data are collected, which are statistically decorous and methodologically unimpeachable, but conclusions are often trivial and rarely useful in decision making. This results from an overly rigorous control of an insignificant variable and a widespread deficiency in the framing of pertinent questions. Investigators seem to have settled for what is measurable instead of measuring what they would really like to know.
Morphological information has provided the greatest single source of data in the formulation and development of the theory of evolution and that even now, when the preponderance of work is experimental, the basis for interpretation in many areas of study remains the form and relationships of structures.
No branches of historical inquiry have suffered more from fanciful speculation than those which relate to the origin and attributes of the races of mankind. The differentiation of these races began in prehistoric darkness, and the more obscure a subject is, so much the more fascinating. Hypotheses are tempting, because though it may be impossible to verify them, it is, in the paucity of data, almost equally impossible to refute them.
No generalizing beyond the data, no theory. No theory, no insight. And if no insight, why do research.
No one who has experienced the intense involvement of computer modeling would deny that the temptation exists to use any data input that will enable one to continue playing what is perhaps the ultimate game of solitaire.
Nothing is more detestable to the physical anthropologist than... [the] wretched habit of cremating the dead. It involves not only a prodigal waste of costly fuel and excellent fertilizer, but also the complete destruction of physical historical data. On the other hand, the custom of embalming and mummification is most praiseworthy and highly to be recommended.
Once the data are in, the theory has to follow along meekly.
Pavlov's data on the two fundamental antagonistic nervous processes—stimulation and inhibition—and his profound generalizations regarding them, in particular, that these processes are parts of a united whole, that they are in a state of constant conflict and constant transition of the one to the other, and his views on the dominant role they play in the formation of the higher nervous activity—all those belong to the most established natural—scientific validation of the Marxist dialectal method. They are in complete accord with the Leninist concepts on the role of the struggle between opposites in the evolution, the motion of matter.
Perhaps... some day the precision of the data will be brought so far that the mathematician will be able to calculate at his desk the outcome of any chemical combination, in the same way, so to speak, as he calculates the motions of celestial bodies.
Politicians, real-estate agents, used-car salesmen, and advertising copy-writers are expected to stretch facts in self-serving directions, but scientists who falsify their results are regarded by their peers as committing an inexcusable crime. Yet the sad fact is that the history of science swarms with cases of outright fakery and instances of scientists who unconsciously distorted their work by seeing it through lenses of passionately held beliefs.
Professor Brown: 'Since this slide was made,' he opined, 'My students have re-examined the errant points and I am happy to report that all fall close to the [straight] line.' Questioner: 'Professor Brown, I am delighted that the points which fell off the line proved, on reinvestigation, to be in compliance. I wonder, however, if you have had your students reinvestigate all these points that previously fell on the line to find out how many no longer do so?'
Random search for data on ... off-chance is hardly scientific. A questionnaire on 'Intellectual Immoralities' was circulated by a well-known institution. 'Intellectual Immorality No. 4' read: 'Generalizing beyond one's data'. [Wilder Dwight] Bancroft asked whether it would not be more correct to word question no. 4 'Not generalizing beyond one's data.'
Students using astrophysical textbooks remain essentially ignorant of even the existence of plasma concepts, despite the fact that some of them have been known for half a century. The conclusion is that astrophysics is too important to be left in the hands of astrophysicists who have gotten their main knowledge from these textbooks. Earthbound and space telescope data must be treated by scientists who are familiar with laboratory and magnetospheric physics and circuit theory, and of course with modern plasma theory.
[Lamenting the traditional neglect of plasma physics]
[Lamenting the traditional neglect of plasma physics]
The faith of scientists in the power and truth of mathematics is so implicit that their work has gradually become less and less observation, and more and more calculation. The promiscuous collection and tabulation of data have given way to a process of assigning possible meanings, merely supposed real entities, to mathematical terms, working out the logical results, and then staging certain crucial experiments to check the hypothesis against the actual empirical results. But the facts which are accepted by virtue of these tests are not actually observed at all. With the advance of mathematical technique in physics, the tangible results of experiment have become less and less spectacular; on the other hand, their significance has grown in inverse proportion. The men in the laboratory have departed so far from the old forms of experimentation—typified by Galileo's weights and Franklin's kite—that they cannot be said to observe the actual objects of their curiosity at all; instead, they are watching index needles, revolving drums, and sensitive plates. No psychology of 'association' of sense-experiences can relate these data to the objects they signify, for in most cases the objects have never been experienced. Observation has become almost entirely indirect; and readings take the place of genuine witness.
The purpose of models is not to fit the data but to sharpen the questions.
The quantum entered physics with a jolt. It didn’t fit anywhere; it made no sense; it contradicted everything we thought we knew about nature. Yet the data seemed to demand it. ... The story of Werner Heisenberg and his science is the story of the desperate failures and ultimate triumphs of the small band of brilliant physicists who—during an incredibly intense period of struggle with the data, the theories, and each other during the 1920s—brought about a revolutionary new understanding of the atomic world known as quantum mechanics.
The teacher can seldom afford to miss the questions: What is the unknown? What are the data? What is the condition? The student should consider the principal parts of the problem attentively, repeatedly, and from from various sides.
Though the world does not change with a change of paradigm, the scientist afterward works in a different world... I am convinced that we must learn to make sense of statements that at least resemble these. What occurs during a scientific revolution is not fully reducible to a re-interpretation of individual and stable data. In the first place, the data are not unequivocally stable.
We are living in an age of awesome agricultural enterprise that needs to be interpreted. We find our simple faith in science dominated by the Religion of PhDeism under the reign of Data; so narrow in people and often so meaningless in context as to be worthless to the scientific farmer.
We are … led to a somewhat vague distinction between what we may call 'hard' data and 'soft' data. This distinction is a matter of degree, and must not be pressed; but if not taken too seriously it may help to make the situation clear. I mean by 'hard' data those which resist the solvent influence of critical reflection, and by 'soft' data those which, under the operation of this process, become to our minds more or less doubtful.
We ought then to consider the present state of the universe as the effect of its previous state and as the cause of that which is to follow. An intelligence that, at a given instant, could comprehend all the forces by which nature is animated and the respective situation of the beings that make it up, if moreover it were vast enough to submit these data to analysis, would encompass in the same formula the movements of the greatest bodies of the universe and those of the lightest atoms. For such an intelligence nothing would be uncertain, and the future, like the past, would be open to its eyes.
[John] Dalton was a man of regular habits. For fifty-seven years he walked out of Manchester every day; he measured the rainfall, the temperature—a singularly monotonous enterprise in this climate. Of all that mass of data, nothing whatever came. But of the one searching, almost childlike question about the weights that enter the construction of these simple molecules—out of that came modern atomic theory. That is the essence of science: ask an impertinent question, and you are on the way to the pertinent answer.