<?xml version="1.0" encoding="utf-8" standalone="no"?>
<!DOCTYPE article SYSTEM "http://www.nonlin-processes-geophys.net/inc/npg/copernicus.dtd">
<article language="en">
	<journal>
		<journal_title>Nonlinear Processes  in Geophysics</journal_title>
		<journal_url>www.nonlin-processes-geophys.net</journal_url>
		<issn>1023-5809</issn>
		<eissn>1607-7946</eissn>
		<volume_number>16</volume_number>
		<issue_number>1</issue_number>
		<publication_year>2009</publication_year>
	</journal>
	<doi>10.5194/npg-16-57-2009</doi>
	<article_url>http://www.nonlin-processes-geophys.net/16/57/2009/</article_url>
	<abstract_html>http://www.nonlin-processes-geophys.net/16/57/2009/npg-16-57-2009.html</abstract_html>
	<fulltext_pdf>http://www.nonlin-processes-geophys.net/16/57/2009/npg-16-57-2009.pdf</fulltext_pdf>
	<start_page>57</start_page>
	<end_page>64</end_page>
	<publication_date>2009-02-06</publication_date>
	<article_title content_type="html">Information theoretic measures of  dependence,   compactness, and non-gaussianity for multivariate probability distributions</article_title>
	<authors>
		<author numeration="1" affiliations="1">
			<name>A. H. Monahan</name>
			<email>monahana@uvic.ca</email>
		</author>
		<author numeration="2" affiliations="2">
			<name>T. DelSole</name>
		</author>
	</authors>
	<affiliations>
		<affiliation numeration="1" content_type="html">School of Earth and Ocean Sciences, University of   Victoria, Victoria, BC, Canada  and Canadian Institute for   Advanced Research Earth System Evolution Program, Canada</affiliation>
		<affiliation numeration="2" content_type="html">Department of Atmospheric, Oceanic, and Earth Sciences, George Mason University, Fairfax, Virginia, and Center for Ocean-Land-Atmosphere Studies, Calverton, Maryland, USA</affiliation>
	</affiliations>
	<abstract content_type="html">A basic task of exploratory data analysis is the characterisation of
&quot;structure&quot; in multivariate datasets. For bivariate Gaussian distributions,
natural measures of dependence (the predictive relationship between
individual variables) and compactness (the degree of concentration of the
probability density function (pdf) around a low-dimensional axis) are
respectively provided by ordinary least-squares regression and Principal
Component Analysis. This study considers general measures of structure for
non-Gaussian distributions and demonstrates that these can be defined in
terms of the information theoretic &quot;distance&quot; (as measured by relative
entropy) between the given pdf and an appropriate &quot;unstructured&quot; pdf. The
measure of dependence, mutual information, is well-known; it is shown that
this is not a useful measure of compactness because it is not invariant under
an orthogonal rotation of the variables. An appropriate rotationally
invariant compactness measure is defined and shown to reduce to the
equivalent PCA measure for bivariate Gaussian distributions. This compactness
measure is shown to be naturally related to a standard information theoretic
measure of non-Gaussianity. Finally, straightforward geometric
interpretations of each of these measures in terms of &quot;effective volume&quot; of
the pdf are presented.</abstract>
	<references>
		<reference numeration="1" content_type="text"> Cover, T M. and Thomas, J A.: Elements of Information Theory, John Wiley &amp; Sons, Inc., New York, 1991. </reference>
		<reference numeration="2" content_type="text"> DelSole, T.: Predictability and Information Theory. Part I: Measures of predictability, J. Atmos. Sci., 61, 2425–2440, 2004. </reference>
		<reference numeration="3" content_type="text"> DelSole, T. and Tippett, M K.: Predictability: Recent Insights from Information Theory, Rev. Geophys., 45, RG4002, doi:10.1029/2006RG000202, 2007. </reference>
		<reference numeration="4" content_type="text"> Haven, K., Majda, A., and Abramov, R.: Quantifying predictability through information theory: Small sample estimation in a non-Gaussian framework, J. Comp. Phys., 206, 334–362, 2005. </reference>
		<reference numeration="5" content_type="text"> Kleeman, R. and Majda, A.: Predictability in a model of geophysical turbulence, J. Atmos. Sci., 62, 2864–2879, 2005. </reference>
		<reference numeration="6" content_type="text"> Lee, T.-W., Girolami, M., Bell, A., and Sejnowski, T.: A unifying information-theoretic framework for independent component analysis, Comp. Math. App., 39, 1–21, 2000. </reference>
		<reference numeration="7" content_type="text"> Majda, A., Kleeman, R., and Cai, D.: A Mathematical Framework for Quantifying Predictability Through Relative Entropy, Meth. Appl. Anal., 9, 425–444, 2002. </reference>
		<reference numeration="8" content_type="text"> Majda, A J., Abramov, R V., and Grote, M J.: Information Theory and Stochastics for Multiscale Nonlinear Systems, American Mathematical Society, Providence, RI, USA, 2005. </reference>
		<reference numeration="9" content_type="text"> Monahan, A H., Fyfe, J C., and Pandolfo, L.: The vertical structure of wintertime climate regimes of the Northern Hemisphere extratropical atmosphere, J. Climate, 16, 2005–2021, 2003. </reference>
		<reference numeration="10" content_type="text"> Pardo, L.: Statistical Inference Based on Divergence Measures, Chapman and Hall, 2006. </reference>
		<reference numeration="11" content_type="text"> Peña, D. and van~der Linde, A.: Dimensionless measures of variability and dependence for multivariate continuous distributions, Comm. Stat. Theory Meth., 36, 1845–1854 doi:10.1080/03610920601126449, 2007. </reference>
		<reference numeration="12" content_type="text"> Schneidman, E., Still, S., Berry, M J., and Bialek, W.: Network information and connected correlations, Phys. Rev. Lett., 91, 238701-1–238701-4, 2003. </reference>
	</references>
</article>

