Misplaced Pages

Negative probability: Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editContent deleted Content addedVisualWikitext
Revision as of 15:36, 20 November 2018 editYellowsubmarine321 (talk | contribs)172 editsNo edit summaryTags: Mobile edit Mobile web edit← Previous edit Latest revision as of 11:36, 26 December 2024 edit undo109.53.42.187 (talk) Machine learning and signal processing: Fixed typoTags: Mobile edit Mobile web edit 
(31 intermediate revisions by 23 users not shown)
Line 1: Line 1:
{{Short description|Concept in science}}
The ] of the outcome of an experiment is never negative, although a ] allows a '''negative probability''', or '''quasiprobability''' for some events. These distributions may apply to unobservable events or conditional probabilities. The ] of the outcome of an experiment is never negative, although a ] allows a '''negative probability''', or '''quasiprobability''' for some events. These distributions may apply to unobservable events or conditional probabilities.


==Physics and mathematics== ==Physics and mathematics==
In 1942, ] wrote a paper "The Physical Interpretation of Quantum Mechanics"<ref>{{Cite journal|doi=10.1098/rspa.1942.0023|pages=1–39|jstor=97777|title=Bakerian Lecture. The Physical Interpretation of Quantum Mechanics|journal=Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences|volume=180|issue=980|year=1942|last1=Dirac|first1=P. A. M.|bibcode=1942RSPSA.180....1D}}</ref> where he introduced the concept of ] and negative ]: In 1942, ] wrote a paper "The Physical Interpretation of Quantum Mechanics"<ref>{{Cite journal|doi=10.1098/rspa.1942.0023|pages=1–39|jstor=97777|title=Bakerian Lecture. The Physical Interpretation of Quantum Mechanics|journal=Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences|volume=180|issue=980|year=1942|last1=Dirac|first1=P. A. M.|bibcode=1942RSPSA.180....1D|doi-access=free}}</ref> where he introduced the concept of ] and negative ]:


: "Negative energies and probabilities should not be considered as nonsense. They are well-defined concepts mathematically, like a negative of money." {{blockquote|Negative energies and probabilities should not be considered as nonsense. They are well-defined concepts mathematically, like a negative of money.}}


The idea of negative probabilities later received increased attention in physics and particularly in ]. ] argued<ref>{{cite book |last=Feynman |first=Richard P. |date=1987 |editor1-last=Peat |editor1-first=F. David |editor2-last=Hiley |editor2-first=Basil |title=Quantum Implications: Essays in Honour of David Bohm |publisher=Routledge & Kegan Paul Ltd |pages=235–248 |chapter=Negative Probability |isbn=978-0415069601 |chapter-url=http://cds.cern.ch/record/154856/files/pre-27827.pdf }}</ref> that no one objects to using negative numbers in calculations: although "minus three apples" is not a valid concept in real life, negative money is valid. Similarly he argued how negative probabilities as well as probabilities above ] possibly could be useful in probability ]. The idea of negative probabilities later received increased attention in physics and particularly in ]. ] argued<ref>{{cite book |last=Feynman |first=Richard P. |date=1987 |editor1-last=Peat |editor1-first=F. David |editor2-last=Hiley |editor2-first=Basil |title=Quantum Implications: Essays in Honour of David Bohm |publisher=Routledge & Kegan Paul Ltd |pages=235–248 |chapter=Negative Probability |isbn=978-0415069601 |chapter-url=https://cds.cern.ch/record/154856/files/pre-27827.pdf }}</ref> that no one objects to using negative numbers in calculations: although "minus three apples" is not a valid concept in real life, negative money is valid. Similarly he argued how negative probabilities as well as probabilities above ] possibly could be useful in probability ].


Negative probabilities have later been suggested to solve several problems and ].<ref>{{cite book|last=Khrennikov|first=Andrei Y.|title=Non-Archimedean Analysis: Quantum Paradoxes, Dynamical Systems and Biological Models|url=https://books.google.com/books?id=HdzqCAAAQBAJ|date=March 7, 2013|publisher=Springer Science & Business Media|isbn=978-94-009-1483-4}}</ref> ''Half-coins'' provide simple examples for negative probabilities. These strange coins were introduced in 2005 by ].<ref>{{cite journal |last=Székely |first=G.J. |date=July 2005 |url=http://www.wilmott.com/pdfs/100609_gjs.pdf |title=Half of a Coin: Negative Probabilities |archive-url=https://web.archive.org/web/20131108010759/http://www.wilmott.com/pdfs/100609_gjs.pdf |archive-date=2013-11-08 |journal=Wilmott Magazine |pages=66–68}}</ref> Half-coins have infinitely many sides numbered with 0,1,2,... and the positive even numbers are taken with negative probabilities. Two half-coins make a complete coin in the sense that if we flip two half-coins then the sum of the outcomes is 0 or 1 with probability 1/2 as if we simply flipped a fair coin.
Mark Burgin gives another example:
{{quote|"Let us consider the situation when an attentive person A with the high knowledge of English writes some text T. We may ask what the probability is for the word “texxt” or “wrod” to appear in his text T. Conventional probability theory gives 0 as the answer. However, we all know that there are usually misprints. So, due to such a misprint this word may appear but then it would be corrected. In terms of extended probability, a negative value (say, −0.1) of the probability for the word “texxt” to appear in his text T means that this word may appear due to a misprint but then it’ll be corrected and will not be present in the text T."|Mark Burgin|{{cite arXiv |authorlink= |eprint=1008.1287 |title= Interpretations of Negative Probabilities|class= physics.data-an|year= 2010|version= |last1= Burgin|first1= Mark}} }}


In '']s of nonnegative definite functions''<ref>{{cite journal|pages= 235–239|doi=10.1007/BF01352002|title=Convolution quotients of nonnegative functions|journal=Monatshefte für Mathematik|volume=95|issue=3|year=1983|last1=Ruzsa|first1=Imre Z.|last2=SzéKely|first2=Gábor J.|s2cid=122858460}}</ref> and ''Algebraic Probability Theory'' <ref>{{cite book |last1=Ruzsa |first1=I.Z. |last2=Székely |first2=G.J. |year=1988 |title=Algebraic Probability Theory |publisher=Wiley |location=New York |isbn=0-471-91803-2}}</ref> ] and ] proved that if a ] X has a signed or quasi distribution where some of the probabilities are negative then one can always find two random variables, Y and Z, with ordinary (not signed / not quasi) distributions such that X, Y are independent and X + Y = Z in distribution. Thus X can always be interpreted as the "difference" of two ordinary random variables, Z and Y. If Y is interpreted as a measurement error of X and the observed value is Z then the negative regions of the distribution of X are masked / shielded by the error Y.
Negative probabilities have later been suggested to solve several problems and ].<ref>Khrennikov, A. Y. (1997): ''Non-Archimedean Analysis: Quantum Paradoxes, Dynamical Systems and Biological Models''. Kluwer Academic Publishers. {{ISBN|0-7923-4800-1}}</ref> ''Half-coins'' provide simple examples for negative probabilities. These strange coins were introduced in 2005 by ].<ref>Székely, G.J. (2005) {{webarchive|url=https://web.archive.org/web/20131108010759/http://www.wilmott.com/pdfs/100609_gjs.pdf |date=2013-11-08 }}, Wilmott Magazine July, pp 66–68.</ref> Half-coins have infinitely many sides numbered with 0,1,2,... and the positive even numbers are taken with negative probabilities. Two half-coins make a complete coin in the sense that if we flip two half-coins then the sum of the outcomes is 0 or 1 with probability 1/2 as if we simply flipped a fair coin.


Another example known as the Wigner distribution in ], introduced by ] in 1932 to study quantum corrections, often leads to negative probabilities.<ref>{{Cite journal|pages=749–759|doi=10.1103/PhysRev.40.749|title=On the Quantum Correction for Thermodynamic Equilibrium|journal=Physical Review|volume=40|issue=5|year=1932|last1=Wigner|first1=E.|bibcode=1932PhRv...40..749W|hdl=10338.dmlcz/141466|hdl-access=free}}</ref> For this reason, it has later been better known as the ]. In 1945, ] worked out the mathematical and logical consistency of such negative valuedness.<ref>{{cite journal | last=Bartlett | first= M. S.| year = 1945| title= Negative Probability | journal=Mathematical Proceedings of the Cambridge Philosophical Society | volume = 41| issue= 1| pages= 71–73 | doi = 10.1017/S0305004100022398|bibcode = 1945PCPS...41...71B | s2cid= 12149669}}</ref> The Wigner distribution function is routinely used in ] nowadays, and provides the cornerstone of ]. Its negative features are an asset to the formalism, and often indicate quantum interference. The negative regions of the distribution are shielded from direct observation by the quantum ]: typically, the moments of such a non-positive-semidefinite quasi] are highly constrained, and prevent ''direct measurability'' of the negative regions of the distribution. Nevertheless, these regions contribute negatively and crucially to the ]s of observable quantities computed through such distributions.
In '']s of nonnegative definite functions''<ref>{{cite journal|pages= 235–239|doi=10.1007/BF01352002|title=Convolution quotients of nonnegative functions|journal=Monatshefte für Mathematik|volume=95|issue=3|year=1983|last1=Ruzsa|first1=Imre Z.|last2=SzéKely|first2=Gábor J.}}</ref> and ''Algebraic Probability Theory'' <ref>Ruzsa, I.Z. and Székely, G.J. (1988): ''Algebraic Probability Theory'', Wiley, New York {{ISBN|0-471-91803-2}}</ref> ] and ] proved that if a ] X has a signed or quasi distribution where some of the probabilities are negative then one can always find two random variables, Y and Z, with ordinary (not signed / not quasi) distributions such that X, Y are independent and X + Y = Z in distribution. Thus X can always be interpreted as the "difference" of two ordinary random variables, Z and Y. If Y is interpreted as a measurement error of X and the observed value is Z then the negative regions of the distribution of X are masked / shielded by the error Y.


== Engineering ==
Another example known as the Wigner distribution in ], introduced by ] in 1932 to study quantum corrections, often leads to negative probabilities.<ref>{{Cite journal|pages=749–759|doi=10.1103/PhysRev.40.749|title=On the Quantum Correction for Thermodynamic Equilibrium|journal=Physical Review|volume=40|issue=5|year=1932|last1=Wigner|first1=E.|bibcode=1932PhRv...40..749W}}</ref> For this reason, it has later been better known as the ]. In 1945, ] worked out the mathematical and logical consistency of such negative valuedness.<ref>{{cite journal | last=Bartlett | first= M. S.| year = 1945| title= Negative Probability | journal=Mathematical Proceedings of the Cambridge Philosophical Society | volume = 41| pages= 71–73 | doi = 10.1017/S0305004100022398|bibcode = 1945PCPS...41...71B }}</ref> The Wigner distribution function is routinely used in ] nowadays, and provides the cornerstone of ]. Its negative features are an asset to the formalism, and often indicate quantum interference. The negative regions of the distribution are shielded from direct observation by the quantum ]: typically, the moments of such a non-positive-semidefinite quasiprobability distribution are highly constrained, and prevent ''direct measurability'' of the negative regions of the distribution. But these regions contribute negatively and crucially to the ]s of observable quantities computed through such distributions, nevertheless.


The concept of negative probabilities has also been proposed for reliable facility location models where facilities are subject to negatively correlated disruption risks when facility locations, customer allocation, and backup service plans are determined simultaneously.<ref>{{Cite journal|last1=Snyder|first1=L.V.|last2=Daskin|first2=M.S.|date=2005|title=Reliability Models for Facility Location: The Expected Failure Cost Case|journal=Transportation Science|volume=39|issue=3|pages=400–416|doi=10.1287/trsc.1040.0107|citeseerx=10.1.1.1.7162}}</ref><ref>{{Cite journal|last1=Cui|first1=T.|last2=Ouyang|first2=Y.|last3=Shen|first3=Z-J. M.|date=2010|title=Reliable Facility Location Design Under the Risk of Disruptions|journal=Operations Research|volume=58|issue=4|pages=998–1011|doi=10.1287/opre.1090.0801|citeseerx=10.1.1.367.3741|s2cid=6236098 }}</ref> Li et al.<ref>{{Cite journal|last1=Li|first1=X.|last2=Ouyang|first2=Y.|last3=Peng|first3=F.|date=2013|title=A supporting station model for reliable infrastructure location design under interdependent disruptions|journal=Transportation Research Part E|volume=60|pages=80–93|doi=10.1016/j.tre.2013.06.005}}</ref> proposed a virtual station structure that transforms a facility network with positively correlated disruptions into an equivalent one with added virtual supporting stations, and these virtual stations were subject to independent disruptions. This approach reduces a problem from one with correlated disruptions to one without. Xie et al.<ref name=":0">{{Cite journal|last1=Xie|first1=S.|last2=Li|first2=X.|last3=Ouyang|first3=Y.|year=2015|title=Decomposition of general facility disruption correlations via augmentation of virtual supporting stations|journal=Transportation Research Part B|volume=80|pages=64–81|doi=10.1016/j.trb.2015.06.006}}</ref> later showed how negatively correlated disruptions can also be addressed by the same modeling framework, except that a virtual supporting station now may be disrupted with a “failure propensity” which
=== An example: the double slit experiment ===


{{blockquote|... inherits all mathematical characteristics and properties of a failure probability except that we allow it to be larger than 1... }}
Consider a double slit experiment with photons. The two waves exiting each slit can be written as:


This finding paves ways for using compact mixed-integer mathematical programs to optimally design reliable location of service facilities under site-dependent and positive/negative/mixed facility disruption correlations.<ref name=":1">{{cite journal | last1=Xie | first1=Siyang | last2=An | first2=Kun | last3=Ouyang | first3=Yanfeng | title=Planning facility location under generally correlated facility disruptions: Use of supporting stations and quasi-probabilities | journal=Transportation Research Part B: Methodological | publisher=Elsevier BV | volume=122 | year=2019 | issn=0191-2615 | doi=10.1016/j.trb.2019.02.001 | pages=115–139| doi-access=free }}</ref>
<math>f_1(x) = \sqrt{\frac{dN/dt}{2\pi/d}}\frac{1}{\sqrt{d^2+(x+a/2)^2}}\exp\left,</math>


The proposed “propensity” concept in Xie et al.<ref name=":0" /> turns out to be what Feynman and others referred to as “quasi-probability.” Note that when a quasi-probability is larger than 1, then 1 minus this value gives a negative probability. In the reliable facility location context, the truly physically verifiable observation is the facility disruption states (whose probabilities are ensured to be within the conventional range ), but there is no direct information on the station disruption states or their corresponding probabilities. Hence the disruption "probabilities" of the stations, interpreted as “probabilities of imagined intermediary states,” could exceed unity, and thus are referred to as quasi-probabilities.
and

<math>f_2(x) = \sqrt{\frac{dN/dt}{2\pi/d}}\frac{1}{\sqrt{d^2+(x-a/2)^2}}\exp\left,</math>

where ''d'' is the distance to the detection screen, ''a'' is the separation between the two slits, ''x'' the distance to the center of the screen, ''λ'' the wavelength and ''dN/dt'' is the number of photons emitted per unit time at the source. The amplitude of measuring a photon at distance ''x'' from the center of the screen is the sum of these two amplitudes coming out of each hole, and therefore the probability that a photon is detected at position ''x'' will be given by the square of this sum:

<math>I(x) = \left\vert f_1(x)+f_2(x) \right\vert^2 = \left\vert f_1(x) \right\vert^2 + \left\vert f_2(x) \right\vert^2 + \left</math>,

This should strike you as the well-known probability rule:

<math>\begin{align} P(\mathtt{photon}\,\,\mathtt{reaches}\,\,\mathtt{x}\,\,\mathtt{going}\,\,\mathtt{through}\,\,\mathtt{either}\,\,\mathtt{slit})
= \,&P(\mathtt{photon}\,\,\mathtt{reaches}\,\,\mathtt{x}\,\,\mathtt{going}\,\,\mathtt{through}\,\,\mathtt{slit}\,\,\mathtt{1}) \\
& + P(\mathtt{photon}\,\,\mathtt{reaches}\,\,\mathtt{x}\,\,\mathtt{going}\,\,\mathtt{through}\,\,\mathtt{slit}\,\,\mathtt{2}) \\
& - P(\mathtt{photon}\,\,\mathtt{reaches}\,\,\mathtt{x}\,\,\mathtt{going}\,\,\mathtt{through}\,\,\mathtt{both}\,\,\mathtt{slits}) \\ \\
=\,&P(\mathtt{photon}\,\,\mathtt{reaches}\,\,\mathtt{x}\,|\,\mathtt{went}\,\,\mathtt{through}\,\,\mathtt{slit}\,\,\mathtt{1})\,P(\mathtt{going}\,\,\mathtt{through}\,\,\mathtt{slit}\,\,\mathtt{1}) \\
& + P(\mathtt{photon}\,\,\mathtt{reaches}\,\,\mathtt{x}\,|\,\mathtt{went}\,\,\mathtt{through}\,\,\mathtt{slit}\,\,\mathtt{2})\,P(\mathtt{going}\,\,\mathtt{through}\,\,\mathtt{slit}\,\,\mathtt{2}) \\
& - P(\mathtt{photon}\,\,\mathtt{reaches}\,\,\mathtt{x}\,\,\mathtt{going}\,\,\mathtt{through}\,\,\mathtt{both}\,\,\mathtt{slits}) \\ \\
=\,&P(\mathtt{photon}\,\,\mathtt{reaches}\,\,\mathtt{x}\,|\,\mathtt{went}\,\,\mathtt{through}\,\,\mathtt{slit}\,\,\mathtt{1})\,\frac{1}{2} \\
& + P(\mathtt{photon}\,\,\mathtt{reaches}\,\,\mathtt{x}\,|\,\mathtt{went}\,\,\mathtt{through}\,\,\mathtt{slit}\,\,\mathtt{2})\,\frac{1}{2} \\
& - P(\mathtt{photon}\,\,\mathtt{reaches}\,\,\mathtt{x}\,\,\mathtt{going}\,\,\mathtt{through}\,\,\mathtt{both}\,\,\mathtt{slits})
\end{align}</math>

]whatever the last term means. Indeed, if one closes either one of the holes forcing the photon to go through the other slit, the two corresponding intensities are

<math>I_1(x) = \left\vert f_1(x) \right\vert^2 = \frac{1}{2}\frac{dN}{dt}\frac{d/\pi}{d^2+(x+a/2)^2}</math> and <math>I_2(x) = \left\vert f_2(x) \right\vert^2 = \frac{1}{2}\frac{dN}{dt}\frac{d/\pi}{d^2+(x-a/2)^2}</math>.

But now, if one does interpret each of these terms in this way, the joint probability takes negative values roughly every <math>\lambda\frac{d}{a}</math> !<math>\begin{align} I_{12}(x) & = \left \\ & = \frac{1}{2}\frac{dN}{dt}\frac{d/\pi}{\sqrt{d^2+(x-a/2)^2}\sqrt{d^2+(x+a/2)^2}}\sin\left \\ \end{align}</math>

However, these negative probabilities are never observed as one can't isolate the cases in which the photon "goes through both slits", but can hint at the existence of anti-particles.


== Finance == == Finance ==


Negative probabilities have more recently been applied to ]. In quantitative finance most probabilities are not real probabilities but pseudo probabilities, often what is known as ] probabilities.{{clarify|date=March 2015}} These are not real probabilities, but theoretical "probabilities" under a series of assumptions that helps simplify calculations by allowing such pseudo probabilities to be negative in certain cases as first pointed out by Espen Gaarder Haug in 2004.<ref>Haug, E. G. (2004): , Wilmott Magazine, Re-printed in the book (2007); Derivatives Models on Models, John Wiley & Sons, New York</ref> Negative probabilities have more recently been applied to ]. In quantitative finance most probabilities are not real probabilities but pseudo probabilities, often what is known as ] probabilities.<ref name=":2" >{{cite journal | last1=Meissner | first1=Gunter A. | last2=Burgin | first2=Dr. Mark | title=Negative Probabilities in Financial Modeling | journal=SSRN Electronic Journal | publisher=Elsevier BV | year=2011 | issn=1556-5068 | doi=10.2139/ssrn.1773077 | s2cid=197765776 }}</ref> These are not real probabilities, but theoretical "probabilities" under a series of assumptions that help simplify calculations by allowing such pseudo probabilities to be negative in certain cases as first pointed out by Espen Gaarder Haug in 2004.<ref>{{cite journal |last=Haug |first=E. G. |year=2004 |url=http://www.espenhaug.com/NegativeProbabilitiesHaug.pdf |title=Why so Negative to Negative Probabilities? |journal=Wilmott Magazine |pages=34–38}}</ref>


A rigorous mathematical definition of negative probabilities and their properties was recently derived by Mark Burgin and Gunter Meissner (2011). The authors also show how negative probabilities can be applied to financial ].<ref> Wilmott Magazine March 2012</ref> A rigorous mathematical definition of negative probabilities and their properties was recently derived by Mark Burgin and Gunter Meissner (2011). The authors also show how negative probabilities can be applied to financial ].<ref name=":2" />

== Engineering ==


== ] and ]==
The concept of negative probabilities have also been proposed for reliable facility location models where facilities are subject to negatively correlated disruption risks when facility locations, customer allocation, and backup service plans are determined simultaneously.<ref>{{Cite journal|last=Snyder|first=L.V.|last2=Daskin|first2=M.S.|date=2005|title=Reliability Models for Facility Location: The Expected Failure Cost Case|url=http://pubsonline.informs.org/doi/abs/10.1287/trsc.1040.0107|journal=Transportation Science|volume=39|issue=3|pages=400–416|doi=10.1287/trsc.1040.0107}}</ref><ref>{{Cite journal|last=Cui|first=T.|last2=Ouyang|first2=Y.|last3=Shen|first3=Z-J. M.|date=2010|title=Reliable Facility Location Design Under the Risk of Disruptions|url=http://pubsonline.informs.org/doi/abs/10.1287/opre.1090.0801?journalCode=opre|journal=Operations Research|volume=58|issue=4|pages=998–1011|doi=10.1287/opre.1090.0801}}</ref> Li et al.<ref>{{Cite journal|last=Li|first=X.|last2=Ouyang|first2=Y.|last3=Peng|first3=F.|date=2013|title=A supporting station model for reliable infrastructure location design under interdependent disruptions|url=http://www.sciencedirect.com/science/article/pii/S1366554513001221|journal=Transportation Research Part E|volume=60|pages=80–93|doi=10.1016/j.tre.2013.06.005}}</ref> proposed a virtual station structure that transforms a facility network with positively correlated disruptions into an equivalent one with added virtual supporting stations, and these virtual stations were subject to independent disruptions. This approach reduces a problem from one with correlated disruptions to one without. Xie et al.<ref name=":0">{{Cite journal|last=Xie|first=S.|last2=Li|first2=X.|last3=Ouyang|first3=Y.|date=|year=2015|title=Decomposition of general facility disruption correlations via augmentation of virtual supporting stations|url=http://www.sciencedirect.com/science/article/pii/S0191261515001290|journal=Transportation Research Part B|volume=80|pages=64–81|doi=10.1016/j.trb.2015.06.006}}</ref> later showed how negatively correlated disruptions can also be addressed by the same modeling framework, except that a supporting station now may be disrupted with a “failure propensity” which


Some problems in machine learning use ]- or ]-based formulations having edges assigned with weights, most commonly positive. A positive weight from one vertex to another can be interpreted in a ] as a probability of getting from the former vertex to the latter. In a ] that is the probability of each event depending only on the state attained in the previous event.
... inherits all mathematical characteristics and properties of a failure probability except that we allow it to be larger than 1...


Some problems in machine learning, e.g., ], naturally often deal with a ] where the edge weight indicates whether two nodes are similar (correlated with a positive edge weight) or dissimilar (anticorrelated with a negative edge weight). Treating a graph weight as a probability of the two vertices to be related is being replaced here with a correlation that of course can be negative or positive equally legitimately. Positive and negative graph weights are uncontroversial if interpreted as correlations rather than probabilities but raise similar issues, e.g., challenges for normalization in ] and explainability of ] for signed ]; e.g.,<ref>
This finding paves ways for using compact mixed-integer mathematical programs to optimally design reliable location of service facilities under site-dependent and positive/negative/mixed disruption correlations.<ref name=":1">{{Cite journal|last=Xie|first=S.|last2=An|first2=K.|last3=Ouyang|first3=Y.|date=2017|title=Planning of Facility Location under Correlated Facility Disruptions|url=|journal=Under Review|volume=|pages=}}</ref>
{{cite conference
| first1=Andrew |last1=Knyazev
| year = 2018
| title = On spectral partitioning of signed graphs
| conference = Eighth SIAM Workshop on Combinatorial Scientific Computing, CSC 2018, Bergen, Norway, June 6–8
| doi = 10.1137/1.9781611975215.2
| doi-access = free
| arxiv = 1701.01394
}}</ref>


Similarly, in ], the eigenvalues of the ] represent ] and eigenvectors form what is known as a graph ] substituting the ] in the graph-based ]. In applications to imaging, the ] is formulated analogous to the ] operator where a Gaussian smoothed image is interpreted as a single time slice of the solution to the heat equation, that has the original image as its initial conditions. If the graph weight was negative, that would correspond to a negative conductivity in the ], stimulating the heat concentration at the ] connected by the graph edge, rather than the normal heat ]. While negative ] is not-physical, this effect is useful for ], e.g., resulting in sharpening corners of one-dimensional signals, when used in graph-based ].<ref>{{Cite conference | first1 = A. | last1 =Knyazev | title = Edge-enhancing Filters with Negative Weights | conference = IEEE Global Conference on Signal and Information Processing (GlobalSIP), Orlando, FL, 14-16 Dec.2015| year = 2015 | pages = 260–264 | doi = 10.1109/GlobalSIP.2015.7418197| arxiv = 1509.02491 }}</ref>
The proposed “propensity” concept in Xie et al.<ref name=":0" /> turns out to be what Feynman and others referred to as “quasi-probability.” Note that when a quasi-probability is larger than 1, then 1 minus this value gives a negative probability. The truly physically verifiable observation is the facility disruption states, and there is no direct information on the station states or their corresponding probabilities. Hence the failure probability of the stations, interpreted as “probabilities of imagined intermediary states,” could exceed unity.


== See also == == See also ==
Line 80: Line 59:
] ]
] ]
]

Latest revision as of 11:36, 26 December 2024

Concept in science

The probability of the outcome of an experiment is never negative, although a quasiprobability distribution allows a negative probability, or quasiprobability for some events. These distributions may apply to unobservable events or conditional probabilities.

Physics and mathematics

In 1942, Paul Dirac wrote a paper "The Physical Interpretation of Quantum Mechanics" where he introduced the concept of negative energies and negative probabilities:

Negative energies and probabilities should not be considered as nonsense. They are well-defined concepts mathematically, like a negative of money.

The idea of negative probabilities later received increased attention in physics and particularly in quantum mechanics. Richard Feynman argued that no one objects to using negative numbers in calculations: although "minus three apples" is not a valid concept in real life, negative money is valid. Similarly he argued how negative probabilities as well as probabilities above unity possibly could be useful in probability calculations.

Negative probabilities have later been suggested to solve several problems and paradoxes. Half-coins provide simple examples for negative probabilities. These strange coins were introduced in 2005 by Gábor J. Székely. Half-coins have infinitely many sides numbered with 0,1,2,... and the positive even numbers are taken with negative probabilities. Two half-coins make a complete coin in the sense that if we flip two half-coins then the sum of the outcomes is 0 or 1 with probability 1/2 as if we simply flipped a fair coin.

In Convolution quotients of nonnegative definite functions and Algebraic Probability Theory Imre Z. Ruzsa and Gábor J. Székely proved that if a random variable X has a signed or quasi distribution where some of the probabilities are negative then one can always find two random variables, Y and Z, with ordinary (not signed / not quasi) distributions such that X, Y are independent and X + Y = Z in distribution. Thus X can always be interpreted as the "difference" of two ordinary random variables, Z and Y. If Y is interpreted as a measurement error of X and the observed value is Z then the negative regions of the distribution of X are masked / shielded by the error Y.

Another example known as the Wigner distribution in phase space, introduced by Eugene Wigner in 1932 to study quantum corrections, often leads to negative probabilities. For this reason, it has later been better known as the Wigner quasiprobability distribution. In 1945, M. S. Bartlett worked out the mathematical and logical consistency of such negative valuedness. The Wigner distribution function is routinely used in physics nowadays, and provides the cornerstone of phase-space quantization. Its negative features are an asset to the formalism, and often indicate quantum interference. The negative regions of the distribution are shielded from direct observation by the quantum uncertainty principle: typically, the moments of such a non-positive-semidefinite quasiprobability distribution are highly constrained, and prevent direct measurability of the negative regions of the distribution. Nevertheless, these regions contribute negatively and crucially to the expected values of observable quantities computed through such distributions.

Engineering

The concept of negative probabilities has also been proposed for reliable facility location models where facilities are subject to negatively correlated disruption risks when facility locations, customer allocation, and backup service plans are determined simultaneously. Li et al. proposed a virtual station structure that transforms a facility network with positively correlated disruptions into an equivalent one with added virtual supporting stations, and these virtual stations were subject to independent disruptions. This approach reduces a problem from one with correlated disruptions to one without. Xie et al. later showed how negatively correlated disruptions can also be addressed by the same modeling framework, except that a virtual supporting station now may be disrupted with a “failure propensity” which

... inherits all mathematical characteristics and properties of a failure probability except that we allow it to be larger than 1...

This finding paves ways for using compact mixed-integer mathematical programs to optimally design reliable location of service facilities under site-dependent and positive/negative/mixed facility disruption correlations.

The proposed “propensity” concept in Xie et al. turns out to be what Feynman and others referred to as “quasi-probability.” Note that when a quasi-probability is larger than 1, then 1 minus this value gives a negative probability. In the reliable facility location context, the truly physically verifiable observation is the facility disruption states (whose probabilities are ensured to be within the conventional range ), but there is no direct information on the station disruption states or their corresponding probabilities. Hence the disruption "probabilities" of the stations, interpreted as “probabilities of imagined intermediary states,” could exceed unity, and thus are referred to as quasi-probabilities.

Finance

Negative probabilities have more recently been applied to mathematical finance. In quantitative finance most probabilities are not real probabilities but pseudo probabilities, often what is known as risk neutral probabilities. These are not real probabilities, but theoretical "probabilities" under a series of assumptions that help simplify calculations by allowing such pseudo probabilities to be negative in certain cases as first pointed out by Espen Gaarder Haug in 2004.

A rigorous mathematical definition of negative probabilities and their properties was recently derived by Mark Burgin and Gunter Meissner (2011). The authors also show how negative probabilities can be applied to financial option pricing.

Machine learning and signal processing

Some problems in machine learning use graph- or hypergraph-based formulations having edges assigned with weights, most commonly positive. A positive weight from one vertex to another can be interpreted in a random walk as a probability of getting from the former vertex to the latter. In a Markov chain that is the probability of each event depending only on the state attained in the previous event.

Some problems in machine learning, e.g., correlation clustering, naturally often deal with a signed graph where the edge weight indicates whether two nodes are similar (correlated with a positive edge weight) or dissimilar (anticorrelated with a negative edge weight). Treating a graph weight as a probability of the two vertices to be related is being replaced here with a correlation that of course can be negative or positive equally legitimately. Positive and negative graph weights are uncontroversial if interpreted as correlations rather than probabilities but raise similar issues, e.g., challenges for normalization in graph Laplacian and explainability of spectral clustering for signed graph partitioning; e.g.,

Similarly, in spectral graph theory, the eigenvalues of the Laplacian matrix represent frequencies and eigenvectors form what is known as a graph Fourier basis substituting the classical Fourier transform in the graph-based signal processing. In applications to imaging, the graph Laplacian is formulated analogous to the anisotropic diffusion operator where a Gaussian smoothed image is interpreted as a single time slice of the solution to the heat equation, that has the original image as its initial conditions. If the graph weight was negative, that would correspond to a negative conductivity in the heat equation, stimulating the heat concentration at the graph vertices connected by the graph edge, rather than the normal heat dissipation. While negative heat conductivity is not-physical, this effect is useful for edge-enhancing image smoothing, e.g., resulting in sharpening corners of one-dimensional signals, when used in graph-based edge-preserving smoothing.

See also

References

  1. Dirac, P. A. M. (1942). "Bakerian Lecture. The Physical Interpretation of Quantum Mechanics". Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences. 180 (980): 1–39. Bibcode:1942RSPSA.180....1D. doi:10.1098/rspa.1942.0023. JSTOR 97777.
  2. Feynman, Richard P. (1987). "Negative Probability" (PDF). In Peat, F. David; Hiley, Basil (eds.). Quantum Implications: Essays in Honour of David Bohm. Routledge & Kegan Paul Ltd. pp. 235–248. ISBN 978-0415069601.
  3. Khrennikov, Andrei Y. (March 7, 2013). Non-Archimedean Analysis: Quantum Paradoxes, Dynamical Systems and Biological Models. Springer Science & Business Media. ISBN 978-94-009-1483-4.
  4. Székely, G.J. (July 2005). "Half of a Coin: Negative Probabilities" (PDF). Wilmott Magazine: 66–68. Archived from the original (PDF) on 2013-11-08.
  5. Ruzsa, Imre Z.; SzéKely, Gábor J. (1983). "Convolution quotients of nonnegative functions". Monatshefte für Mathematik. 95 (3): 235–239. doi:10.1007/BF01352002. S2CID 122858460.
  6. Ruzsa, I.Z.; Székely, G.J. (1988). Algebraic Probability Theory. New York: Wiley. ISBN 0-471-91803-2.
  7. Wigner, E. (1932). "On the Quantum Correction for Thermodynamic Equilibrium". Physical Review. 40 (5): 749–759. Bibcode:1932PhRv...40..749W. doi:10.1103/PhysRev.40.749. hdl:10338.dmlcz/141466.
  8. Bartlett, M. S. (1945). "Negative Probability". Mathematical Proceedings of the Cambridge Philosophical Society. 41 (1): 71–73. Bibcode:1945PCPS...41...71B. doi:10.1017/S0305004100022398. S2CID 12149669.
  9. Snyder, L.V.; Daskin, M.S. (2005). "Reliability Models for Facility Location: The Expected Failure Cost Case". Transportation Science. 39 (3): 400–416. CiteSeerX 10.1.1.1.7162. doi:10.1287/trsc.1040.0107.
  10. Cui, T.; Ouyang, Y.; Shen, Z-J. M. (2010). "Reliable Facility Location Design Under the Risk of Disruptions". Operations Research. 58 (4): 998–1011. CiteSeerX 10.1.1.367.3741. doi:10.1287/opre.1090.0801. S2CID 6236098.
  11. Li, X.; Ouyang, Y.; Peng, F. (2013). "A supporting station model for reliable infrastructure location design under interdependent disruptions". Transportation Research Part E. 60: 80–93. doi:10.1016/j.tre.2013.06.005.
  12. ^ Xie, S.; Li, X.; Ouyang, Y. (2015). "Decomposition of general facility disruption correlations via augmentation of virtual supporting stations". Transportation Research Part B. 80: 64–81. doi:10.1016/j.trb.2015.06.006.
  13. Xie, Siyang; An, Kun; Ouyang, Yanfeng (2019). "Planning facility location under generally correlated facility disruptions: Use of supporting stations and quasi-probabilities". Transportation Research Part B: Methodological. 122. Elsevier BV: 115–139. doi:10.1016/j.trb.2019.02.001. ISSN 0191-2615.
  14. ^ Meissner, Gunter A.; Burgin, Dr. Mark (2011). "Negative Probabilities in Financial Modeling". SSRN Electronic Journal. Elsevier BV. doi:10.2139/ssrn.1773077. ISSN 1556-5068. S2CID 197765776.
  15. Haug, E. G. (2004). "Why so Negative to Negative Probabilities?" (PDF). Wilmott Magazine: 34–38.
  16. Knyazev, Andrew (2018). On spectral partitioning of signed graphs. Eighth SIAM Workshop on Combinatorial Scientific Computing, CSC 2018, Bergen, Norway, June 6–8. arXiv:1701.01394. doi:10.1137/1.9781611975215.2.
  17. Knyazev, A. (2015). Edge-enhancing Filters with Negative Weights. IEEE Global Conference on Signal and Information Processing (GlobalSIP), Orlando, FL, 14-16 Dec.2015. pp. 260–264. arXiv:1509.02491. doi:10.1109/GlobalSIP.2015.7418197.
Categories:
Negative probability: Difference between revisions Add topic