| advertise add site services publishers database health videos | ![]() | about toolbar stats live show health store more stuff JOIN/LOGIN |
Love & Relationships : Relationship Advice : Discovery Health health.discovery.com | Health 24 - Flings and relationships - Long-term relationships health24.com | Hypnosis for Relationships - Hypnotherapy for Relationships richardmackenzie.co.uk |
In statistics, a spurious relationship (or, sometimes, spurious correlation or spurious regression) is a mathematical relationship in which two occurrences have no causal connection, yet it may be inferred that they do, due to a certain third, unseen factor (referred to as a "confounding factor" or "lurking variable"). The spurious relationship gives an impression of a worthy link between two groups that is invalid when objectively examined. The misleading correlation between two variables is produced through the operation of a third causal variable. In other words we find a correlation between A and B. So we have three possible relationships:
The last is a spurious correlation. In a regression model, where A is regressed on B, but C is found to be the true causal factor for B; this is called specification error. It is therefore often said that "Correlation does not imply causation". The true causal chain may be C => A => B or even A => C => B or as illiterated above, C => A and C => B
[edit] General exampleAn example of a spurious relationship can be illuminated examining a city's ice cream sales. These sales are highest when the rate of drownings in city swimming pools is highest. To allege that ice cream sales cause drowning, or vice-versa, would be to imply a spurious relationship between the two. In reality, a heat wave may have caused both. The heat wave is an example of a hidden or unseen variable. Another popular example is a series of Dutch statistics showing a positive correlation between the number of storks nesting in a series of springs and the number of human babies born at that time. Of course there was no causal connection; they were correlated with each other only because they were correlated with the weather nine months before the observations.[1] [edit] ExperimentsThe term is commonly used in statistics and in particular in experimental research techniques. Experimental research attempts to understand and predict causal relationships (X → Y). A non-causal correlation can be spuriously created by an antecedent which causes both (W → X & Y). Intervening variables (X → W → Y), if undetected, may make indirect causation look direct. Because of this, experimentally identified correlations do not represent causal relationships unless spurious relationships can be ruled out. In practice, three conditions must be met in order to conclude that X causes Y, directly or indirectly:
Spurious relationships can often be identified by considering whether any of these three conditions have been violated. The final condition may be relaxed in the case of indirect causation. For example, consider a pistol duel. Two men face off and fire at each other. If one man dies as a result of the other man's shot, we can rightly conclude that the other man caused his death. However, if a doctor saves the wounded man's life (thus violating the third premise), this does not undermine causation, only direct causation. The biological damage (W) sustained from the shot (X) causes death (Y), not the shot itself, allowing medical intervention. [edit] See also
[edit] External links
[edit] Footnotes
[edit] References
|
| ↑ top of page ↑ | about thumbshots |