﻿{"id":5563,"date":"2018-07-02T05:32:13","date_gmt":"2018-07-02T03:32:13","guid":{"rendered":"http:\/\/www.sigterritoires.fr\/?p=5563"},"modified":"2018-10-17T13:01:41","modified_gmt":"2018-10-17T11:01:41","slug":"exploratory-analysis-of-data-for-geostatistics-the-qq-plot","status":"publish","type":"post","link":"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/","title":{"rendered":"Exploratory analysis of data for geostatistics: the QQ-plot"},"content":{"rendered":"<p>Following the article\u00a0\u00a0\u00a0<a href=\"https:\/\/translate.google.com\/translate?hl=en&amp;prev=_t&amp;sl=fr&amp;tl=en&amp;u=http:\/\/wp.me\/p6XU0A-Y9\">Introduction to exploratory data analysis for geostatistics<\/a>\u00a0\u00a0\u00a0we\u00a0\u00a0\u00a0will discuss each of the available tools to carry out the exploratory analysis of spatial data.\u00a0We have already discussed \u00a0<a href=\"https:\/\/translate.google.com\/translate?hl=en&amp;prev=_t&amp;sl=fr&amp;tl=en&amp;u=http:\/\/www.sigterritoires.fr\/index.php\/analyse-exploratoire-des-donnees-pour-la-geostatistiqueles-histogrammes\">the histograms<\/a>\u00a0, and now we will address the QQ-Plots.<\/p>\n<p>QQ-Plots (or Quantile-Quantile Diagrams) are graphs in which the quantiles of two distributions are plotted against each other.<!--more--><\/p>\n<p><strong>Building a normal QQ-Plot<\/strong><\/p>\n<p>A\u00a0<strong><em>QQ-Normal Plot<\/em><\/strong>\u00a0is\u00a0the diagram that makes it possible to compare the distribution of the data of a batch with the so-called <strong><em>normal<\/em><\/strong>\u00a0<strong><em>\u00a0<\/em><\/strong>\u00a0or\u00a0\u00a0\u00a0<strong><em>Gaussian<\/em><\/strong>\u00a0distribution.\u00a0Here is an example.<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5564\" 
data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/ex1-4\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex1.png?fit=756%2C411&amp;ssl=1\" data-orig-size=\"756,411\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ex1\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex1.png?fit=640%2C348&amp;ssl=1\" class=\"alignnone size-medium wp-image-5564\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex1-300x163.png?resize=300%2C163\" alt=\"\" width=\"300\" height=\"163\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex1.png?resize=300%2C163&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex1.png?w=756&amp;ssl=1 756w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>How to build it?<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5565\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/ex2-4\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex2.png?fit=485%2C391&amp;ssl=1\" data-orig-size=\"485,391\" data-comments-opened=\"1\" 
data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ex2\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex2.png?fit=485%2C391&amp;ssl=1\" class=\"alignnone size-medium wp-image-5565\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex2-300x242.png?resize=300%2C242\" alt=\"\" width=\"300\" height=\"242\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex2.png?resize=300%2C242&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex2.png?w=485&amp;ssl=1 485w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p><strong>A &#8211;<\/strong> The batch of data to be processed is sorted by value, from smallest to largest, and for each value the percentage of lower values is calculated. We plot the values of the batch on the abscissa and the percentages on the ordinate. In this example, the value 2 corresponds to 21% (0.21) of lower values in the batch (and thus 79% of values greater than 2).<\/p>\n<p><strong>B &#8211;<\/strong> The cumulative Gaussian function is plotted with the number of standard deviations on the abscissa and, on the ordinate, the percentage of values below that point.
For a frequency of 21% (0.21), the standard deviation value is -0.85.<\/p>\n<p><strong>C &#8211;<\/strong> We create the QQ-Plot:<\/p>\n<ul>\n<li>for each data point, we take its value (DV),<\/li>\n<li>we look up the corresponding percentage in graph A,<\/li>\n<li>with this percentage, we move to graph B and read off the corresponding standard deviation value (NV),<\/li>\n<li>we draw the point using NV on the abscissa and DV on the ordinate.<\/li>\n<\/ul>\n<p>The straight line on the QQ-Plot indicates where the points would lie if they matched the normal distribution exactly.<\/p>\n<p><strong>How to build a general QQ-Plot<\/strong><\/p>\n<p>The <strong><em>general QQ-Plot<\/em><\/strong> is used to evaluate the similarity between the distributions of two sets of data.<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5566\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/ex3-3\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex3.png?fit=758%2C414&amp;ssl=1\" data-orig-size=\"758,414\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ex3\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex3.png?fit=640%2C350&amp;ssl=1\" class=\"alignnone size-medium wp-image-5566\"
src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex3-300x164.png?resize=300%2C164\" alt=\"\" width=\"300\" height=\"164\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex3.png?resize=300%2C164&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex3.png?w=758&amp;ssl=1 758w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>Here we have two variables: Depth and Distance<\/p>\n<p>How to build it?<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5567\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/ex4-4\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex4.png?fit=467%2C384&amp;ssl=1\" data-orig-size=\"467,384\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ex4\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex4.png?fit=467%2C384&amp;ssl=1\" class=\"alignnone size-medium wp-image-5567\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex4-300x247.png?resize=300%2C247\" alt=\"\" width=\"300\" height=\"247\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex4.png?resize=300%2C247&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex4.png?w=467&amp;ssl=1 467w\" sizes=\"auto, 
(max-width: 300px) 100vw, 300px\" \/><\/p>\n<p><strong>A &#8211;<\/strong> As for the normal QQ-Plot, the first batch of data to be processed is sorted by value, from smallest to largest, and for each value the percentage of lower values is calculated. We plot the values of the batch on the ordinate and the percentages on the abscissa. In this example, the value 2 in our data has 21% (0.21) of lower values in the batch.<\/p>\n<p><strong>B &#8211;<\/strong> The second batch of data is treated in the same way. In this example, for the value considered in our data, 37% (0.37) of the values in the batch are lower. You will notice that no value in this batch has a frequency of 0.21, as was the case in the first batch of data.<\/p>\n<p><strong>C &#8211;<\/strong> We create the QQ-Plot:<\/p>\n<ul>\n<li>for each data point of batch A, we take its value (DV1),<\/li>\n<li>we look up the corresponding percentage in graph A,<\/li>\n<li>with this percentage, we move to graph B and obtain the corresponding value of batch B (DV2), either by reading it directly (whenever possible) or by interpolating between the two values that bracket it, as in the example above,<\/li>\n<li>we draw the point using DV2 on the abscissa and DV1 on the ordinate,<\/li>\n<li>for each data point of batch B, we take its value (DV2),<\/li>\n<li>we look up the corresponding percentage in graph B,<\/li>\n<li>with this percentage, we move to graph A and obtain the corresponding value of batch A (DV1), either by reading it directly (whenever possible) or by interpolating between the two values that bracket it, as in the example above,<\/li>\n<li>we draw the point using DV2 on the abscissa and DV1 on the ordinate.<\/li>\n<\/ul>\n<p>Unlike the normal QQ-Plot, we cannot draw a theoretical line because we do not know the distribution function of
batches A and B. However, if the two distributions are exactly the same, the points will be aligned on a straight line. In the above example (Depth-Distance) this is not the case.<br \/>\nThe following example is a perfect match (since it&rsquo;s the same variable):<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5568\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/ex5-4\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex5.png?fit=754%2C413&amp;ssl=1\" data-orig-size=\"754,413\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ex5\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex5.png?fit=640%2C351&amp;ssl=1\" class=\"alignnone size-medium wp-image-5568\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex5-300x164.png?resize=300%2C164\" alt=\"\" width=\"300\" height=\"164\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex5.png?resize=300%2C164&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex5.png?w=754&amp;ssl=1 754w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p><strong>Interpretation of QQ-Plots<\/strong><\/p>\n<p>We will repeat here what we have already said for the histograms:<\/p>\n<p><em>\u00ab\u00a0Some kriging
methods work best if the data are approximately normally distributed (the bell-shaped curve).<\/em><br \/>\n<em>In particular, quantile and probability maps using ordinary, simple and universal kriging assume that the data come from a normal distribution.<\/em><br \/>\n<em>As we saw in the previous article, kriging is also based on the hypothesis of stationarity. This assumption requires, in part, that all data values come from distributions that have the same variability. In nature, we often observe that as values increase, their variability also increases. <\/em><strong><em>Transformations<\/em><\/strong><em> of the source data can be used to bring your data closer to a normal distribution and satisfy the assumption of equal variability for the whole set.\u00a0\u00bb<\/em><\/p>\n<p>Therefore, we will look for the same things as with the histograms, but the QQ-Plot makes this easier.<\/p>\n<p>If we take the variable <strong><em>Depth<\/em><\/strong>, used for the histogram, and draw its normal QQ-Plot, we get:<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5569\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/ex6-4\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex6.png?fit=756%2C411&amp;ssl=1\" data-orig-size=\"756,411\" data-comments-opened=\"1\"
data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ex6\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex6.png?fit=640%2C348&amp;ssl=1\" class=\"alignnone size-medium wp-image-5569\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex6-300x163.png?resize=300%2C163\" alt=\"\" width=\"300\" height=\"163\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex6.png?resize=300%2C163&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex6.png?w=756&amp;ssl=1 756w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>We have three different areas:<\/p>\n<ul>\n<li>A- Points to the left of the theoretical line, very far from it<\/li>\n<li>B- Points to the right of the theoretical line, and<\/li>\n<li>C- Points to the left, again<\/li>\n<\/ul>\n<p>The general shape resembles an S.<\/p>\n<p>The information that we can draw from the general shape of the point curve relates mainly to the shape coefficients: skewness and kurtosis. Moreover, we can immediately see whether our data follow a unimodal or bimodal curve.<\/p>\n<p><strong>Observation of the skewness<\/strong><\/p>\n<p>First, a few words on asymmetry (<strong><em>skewness<\/em><\/strong>).<\/p>\n<p>We have three main types of distribution: symmetric (normal), shifted to the left (towards the small values of our data), and shifted to the right (towards the large values of our data).<\/p>\n<p>In order to
quickly find out which type of distribution we have, look at the area of the QQ-Plot corresponding to the centre of the distribution (value 0 of the standard deviation):<\/p>\n<p><strong>UNBIASED DISTRIBUTION (NORMAL):\u00a0<\/strong><\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5570\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/ex7-3\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex7.png?fit=520%2C818&amp;ssl=1\" data-orig-size=\"520,818\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ex7\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex7.png?fit=520%2C818&amp;ssl=1\" class=\"alignnone size-medium wp-image-5570\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex7-191x300.png?resize=191%2C300\" alt=\"\" width=\"191\" height=\"300\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex7.png?resize=191%2C300&amp;ssl=1 191w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex7.png?w=520&amp;ssl=1 520w\" sizes=\"auto, (max-width: 191px) 100vw, 191px\" \/><\/p>\n<p>The data points corresponding to the centre of the distribution lie on (or very close to) the theoretical line.<\/p>\n<p><strong>DISTRIBUTION BIASED TO THE LEFT:\u00a0<\/strong><\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\"
decoding=\"async\" data-attachment-id=\"5571\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/ex8\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex8.png?fit=536%2C796&amp;ssl=1\" data-orig-size=\"536,796\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ex8\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex8.png?fit=536%2C796&amp;ssl=1\" class=\"alignnone size-medium wp-image-5571\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex8-202x300.png?resize=202%2C300\" alt=\"\" width=\"202\" height=\"300\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex8.png?resize=202%2C300&amp;ssl=1 202w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex8.png?w=536&amp;ssl=1 536w\" sizes=\"auto, (max-width: 202px) 100vw, 202px\" \/><\/p>\n<p>The points around 0 standard deviations lie substantially below the theoretical line.<\/p>\n<p><strong>DISTRIBUTION BIASED TO THE RIGHT:\u00a0<\/strong><\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5572\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/ex9\/\"
data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex9.png?fit=510%2C814&amp;ssl=1\" data-orig-size=\"510,814\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ex9\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex9.png?fit=510%2C814&amp;ssl=1\" class=\"alignnone size-medium wp-image-5572\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex9-188x300.png?resize=188%2C300\" alt=\"\" width=\"188\" height=\"300\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex9.png?resize=188%2C300&amp;ssl=1 188w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex9.png?w=510&amp;ssl=1 510w\" sizes=\"auto, (max-width: 188px) 100vw, 188px\" \/><\/p>\n<p>The points around 0 standard deviations lie substantially above the theoretical line.<\/p>\n<p><strong>Observation of the flattening<\/strong><\/p>\n<p>The other possible observation concerns the flattening coefficient (kurtosis).<\/p>\n<p><strong>KURTOSIS LESS THAN 3<\/strong><\/p>\n<p>Distributions with relatively thin tails (called platykurtic), which have a kurtosis value lower than 3, have a general S shape, with the negative part of the standard deviations concave and the positive part convex:<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5573\"
data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/ex10\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex10.png?fit=840%2C591&amp;ssl=1\" data-orig-size=\"840,591\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ex10\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex10.png?fit=640%2C450&amp;ssl=1\" class=\"alignnone size-medium wp-image-5573\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex10-300x211.png?resize=300%2C211\" alt=\"\" width=\"300\" height=\"211\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex10.png?resize=300%2C211&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex10.png?resize=768%2C540&amp;ssl=1 768w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex10.png?w=840&amp;ssl=1 840w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p><strong>KURTOSIS GREATER THAN 3<\/strong><\/p>\n<p>Distributions with relatively thick tails (called leptokurtic), which have a kurtosis value greater than 3, have a general inverted S shape, with the negative section of the standard deviations convex and the positive section concave:<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5574\"
data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/ex11\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex11.png?fit=840%2C583&amp;ssl=1\" data-orig-size=\"840,583\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ex11\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex11.png?fit=640%2C444&amp;ssl=1\" class=\"alignnone size-medium wp-image-5574\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex11-300x208.png?resize=300%2C208\" alt=\"\" width=\"300\" height=\"208\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex11.png?resize=300%2C208&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex11.png?resize=768%2C533&amp;ssl=1 768w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex11.png?w=840&amp;ssl=1 840w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p><strong>What to do?<\/strong><\/p>\n<p>With the normal QQ-Plot there are two things we can do: find a transformation that brings our data back to a normal (or near-normal) distribution, and identify data that may be problematic.<\/p>\n<p>If we consider the first diagram of this article, it is easier to find the power transformation (Box-Cox) with the QQ-Plot than with the histogram:<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\"
decoding=\"async\" data-attachment-id=\"5575\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/ex12\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex12.png?fit=840%2C320&amp;ssl=1\" data-orig-size=\"840,320\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ex12\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex12.png?fit=640%2C244&amp;ssl=1\" class=\"alignnone size-medium wp-image-5575\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex12-300x114.png?resize=300%2C114\" alt=\"\" width=\"300\" height=\"114\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex12.png?resize=300%2C114&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex12.png?resize=768%2C293&amp;ssl=1 768w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex12.png?w=840&amp;ssl=1 840w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>When we modify the transformation parameter, we can see more clearly how well the points fit the theoretical line.<\/p>\n<p>The other interesting aspect of the Geostatistical Analyst is the link between the diagram tools (here the QQ-Plot) and the map display in ArcMap. If you use the selection tool on QQ-Plot points, the selected points are highlighted on the map.<\/p>\n<p>If you select points that deviate
from the normal line at large values:<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5576\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-analysis-of-data-for-geostatistics-the-qq-plot\/ex13\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex13.png?fit=840%2C557&amp;ssl=1\" data-orig-size=\"840,557\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"ex13\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex13.png?fit=640%2C424&amp;ssl=1\" class=\"alignnone size-medium wp-image-5576\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex13-300x199.png?resize=300%2C199\" alt=\"\" width=\"300\" height=\"199\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex13.png?resize=300%2C199&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex13.png?resize=768%2C509&amp;ssl=1 768w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/ex13.png?w=840&amp;ssl=1 840w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>We see that they all lie on the periphery of the area considered. They may therefore reflect a phenomenon external to our area. We will keep this in mind, for example to test the quality of the final interpolation with and without these points.<\/p>\n<p>If we do the same thing for points deviating
at small values (figure 14):<\/p>\n<p>We observe that the distribution of these points belongs to the phenomenon intrinsic to the area considered, or that, in any case, they will have to be taken into account when modelling the interpolation function.<\/p>\n<p>In the next article we will see how to detect outliers with Voronoi polygons.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Following the article\u00a0\u00a0\u00a0Introduction to exploratory data analysis for geostatistics\u00a0\u00a0\u00a0we\u00a0\u00a0\u00a0will discuss each of the available tools to carry out the exploratory analysis of spatial data.\u00a0We have already discussed \u00a0the histograms\u00a0, and now we will address the QQ-Plots.&hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"give_campaign_id":0,"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"sfsi_plus_gutenberg_text_before_share":"","sfsi_plus_gutenberg_show_text_before_share":"","sfsi_plus_gutenberg_icon_type":"","sfsi_plus_gutenberg_icon_alignemt":"","sfsi_plus_gutenburg_max_per_row":"","_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1260],"tags":[],"class_list":["post-5563","post","type-post","status-publish","format-standard","hentry","category-non-classe-en"],"aioseo_notices":[],"jetpack_featured_media_url":"","jetpack_shortlink":"ht
tps:\/\/wp.me\/p6XU0A-1rJ","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/posts\/5563","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/comments?post=5563"}],"version-history":[{"count":0,"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/posts\/5563\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/media?parent=5563"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/categories?post=5563"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/tags?post=5563"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}