﻿{"id":5579,"date":"2018-07-03T02:20:59","date_gmt":"2018-07-03T00:20:59","guid":{"rendered":"http:\/\/www.sigterritoires.fr\/?p=5579"},"modified":"2018-10-17T13:01:30","modified_gmt":"2018-10-17T11:01:30","slug":"exploratory-data-analysis-for-geostatistics-voronoi-diagrams","status":"publish","type":"post","link":"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-data-analysis-for-geostatistics-voronoi-diagrams\/","title":{"rendered":"Exploratory data analysis for geostatistics: Voronoi diagrams"},"content":{"rendered":"<p>Following the article\u00a0\u00a0\u00a0<a href=\"https:\/\/translate.google.com\/translate?hl=en&amp;prev=_t&amp;sl=fr&amp;tl=en&amp;u=http:\/\/wp.me\/p6XU0A-Y9\">Introduction to exploratory data analysis for geostatistics<\/a>\u00a0\u00a0\u00a0we will discuss all the available tools to carry out the exploratory analysis of spatialized data.\u00a0We have discussed\u00a0\u00a0\u00a0<a href=\"https:\/\/translate.google.com\/translate?hl=en&amp;prev=_t&amp;sl=fr&amp;tl=en&amp;u=http:\/\/www.sigterritoires.fr\/index.php\/analyse-exploratoire-des-donnees-pour-la-geostatistiqueles-histogrammes\">the histograms<\/a>\u00a0,\u00a0\u00a0\u00a0\u00a0<a href=\"https:\/\/translate.google.com\/translate?hl=en&amp;prev=_t&amp;sl=fr&amp;tl=en&amp;u=http:\/\/www.sigterritoires.fr\/index.php\/analyse-exploratoire-des-donnees-pour-la-geostatistiqueles-qq-plot\/\">the QQ-Plots<\/a>\u00a0, and now we will address the Voronoi maps.<\/p>\n<p>We must introduce a notion not yet introduced in the previous articles that concerns the extent or influence of a phenomenon.\u00a0In geostatistics we can consider two types of extension for a phenomenon: GLOBAL or LOCAL extent.<!--more--><\/p>\n<p><strong>Global and local phenomena.<\/strong><\/p>\n<p>We refer to a global phenomenon when we use as reference all the available data.\u00a0We refer to a local phenomenon when we use as reference a sampling point and its neighbouring points.<\/p>\n<p>A simple example is when we refer to outliers.\u00a0If we look for global outliers, we will look for values \u200b\u200bthat are outside the logical range of our data.\u00a0For example, if we have a batch of seawater temperature data, with values \u200b\u200bthat are between 2 \u00b0 C and 16 \u00b0 C, a value of -3 \u00b0 C or a value of 35 \u00b0 C, will appear as\u00a0\u00a0\u00a0<strong><em>global aberrant<\/em><\/strong>\u00a0values.\u00a0Suppose that our measurements are spread over a whole year and that we have in winter the following series of values \u200b\u200b2 \u00b0 C, 2.5 \u00b0 C, 11 \u00b0 C, 2.2 \u00b0 C, 2.5 \u00b0 C.\u00a0The value 11 \u00b0 C is not aberrant from a global point of view, because in the course of the year it can appear quite often.\u00a0But since it is found in the middle of much lower and regular temperatures we can deduce that it is a\u00a0<strong><em>local aberrant<\/em><\/strong>\u00a0value.<\/p>\n<p>The histograms and the QQ-plot that we have already discussed are global tools.\u00a0They allow us to work and understand phenomena that affect all our data.\u00a0With the Voronoi maps we will tackle tools that will allow us to\u00a0\u00a0\u00a0visualize and understand local phenomena, ie they will only concern a part of our data.<\/p>\n<p><strong>The Voronoi maps <\/strong><\/p>\n<p>The Voronoi maps are built from a series of polygons formed around the location of each sampling point.<\/p>\n<p>The Voronoi polygons are created so that each location in a polygon is closer to the sampling point present in that polygon rather than to any other sampling point.<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5581\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-data-analysis-for-geostatistics-voronoi-diagrams\/vor1\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor1.png?fit=383%2C379&amp;ssl=1\" data-orig-size=\"383,379\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"vor1\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor1.png?fit=300%2C297&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor1.png?fit=383%2C379&amp;ssl=1\" class=\"alignnone size-medium wp-image-5581\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor1-300x297.png?resize=300%2C297\" alt=\"\" width=\"300\" height=\"297\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor1.png?resize=300%2C297&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor1.png?resize=150%2C150&amp;ssl=1 150w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor1.png?w=383&amp;ssl=1 383w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>For example, in this figure, the yellow dot is surrounded by a polygon, displayed in red.\u00a0Each location in the red polygon is closer to the yellow sampling point than to any other sampling point (dark blue dots).<\/p>\n<p>After creating the polygons, the\u00a0\u00a0\u00a0<strong><em>neighbours<\/em><\/strong>\u00a0of a sampling point are defined as any other sampling point whose polygon shares a border with the chosen sampling point.\u00a0The blue polygons all share a border with the red polygon, so the sampling points in the blue polygons are\u00a0\u00a0\u00a0<strong><em>neighbours<\/em><\/strong>\u00a0of the clear green sampling point.<\/p>\n<p>Using this neighbourhood definition, an array of local statistics can be calculated.\u00a0For example, a local average will be calculated by using the average of the sampling points in the central polygon and the neighbouring polygons (red and blue polygons).\u00a0This average will then be assigned to the red polygon.\u00a0After repeating this task for all the polygons and their neighbours, a colour scale will show the relative values \u200b\u200bof the local averages, which allows to visualize regions of high and low values.<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5582\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-data-analysis-for-geostatistics-voronoi-diagrams\/vor2\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor2.png?fit=561%2C759&amp;ssl=1\" data-orig-size=\"561,759\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"vor2\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor2.png?fit=222%2C300&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor2.png?fit=561%2C759&amp;ssl=1\" class=\"alignnone size-medium wp-image-5582\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor2-222x300.png?resize=222%2C300\" alt=\"\" width=\"222\" height=\"300\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor2.png?resize=222%2C300&amp;ssl=1 222w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor2.png?w=561&amp;ssl=1 561w\" sizes=\"auto, (max-width: 222px) 100vw, 222px\" \/><\/p>\n<p>At the top right, the colour scale indicates the values \u200b\u200bof the calculated averages.\u00a0We can see that the top and right corner have the lowest values \u200b\u200bof the set and the bottom and left corner have the highest values.<\/p>\n<p><strong>The different Geostatistical Analyst Voronoi maps <\/strong><\/p>\n<p>The tool Geostatistical Analyst Voronoi map provides a number of methods for assigning or calculating values \u200b\u200bto the polygons.<\/p>\n<p>Let&rsquo;s first look at the list of possibilities and how they are calculated.\u00a0Then we will then discuss their use.<\/p>\n<p><strong>MAPS TYPES <\/strong><\/p>\n<p><strong>Simple<\/strong>\u00a0: The value assigned to each polygon a cell is the value of the sampling point of that polygon.<\/p>\n<p><strong>Average<\/strong>\u00a0: The value assigned to a polygon is the average calculated from this polygon and its neighbours.<\/p>\n<p><strong>Mode<\/strong>\u00a0: All polygons are classified in\u00a0\u00a0\u00a0five class intervals.\u00a0The value assigned to a polygon is the most current value (mode) between the polygon and its neighbours.<\/p>\n<p><strong>Cluster<\/strong>\u00a0: All polygons are classified into five colour class intervals.\u00a0If the class range of the polygon is different from all its neighbours, the cell is grey (to distinguish it from its neighbours).<\/p>\n<p><strong>Entropy<\/strong>\u00a0: All polygons are classified into five classes using the\u00a0\u00a0\u00a0<strong><em>smart quartiles<\/em><\/strong> method, a variation of the quartile method.\u00a0Entropy is calculated using the following formula<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5583\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-data-analysis-for-geostatistics-voronoi-diagrams\/vor3\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor3.png?fit=351%2C61&amp;ssl=1\" data-orig-size=\"351,61\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"vor3\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor3.png?fit=300%2C52&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor3.png?fit=351%2C61&amp;ssl=1\" class=\"alignnone size-medium wp-image-5583\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor3-300x52.png?resize=300%2C52\" alt=\"\" width=\"300\" height=\"52\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor3.png?resize=300%2C52&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor3.png?w=351&amp;ssl=1 351w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>Where pi is the proportion of polygons, among the central polygon and the neighbouring polygons, for each of the five classes, and Log is the logarithm base 2.<\/p>\n<p>Since this is not simple, let&rsquo;s look at an example.\u00a0We have a polygon with 5 neighbouring \u00a0polygons.\u00a0We apply the smart quartiles method and we obtain 3 class 1 polygons, 1 class 3 polygon and 2 class 5 polygons.<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5584\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-data-analysis-for-geostatistics-voronoi-diagrams\/vor4\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor4.png?fit=395%2C159&amp;ssl=1\" data-orig-size=\"395,159\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"vor4\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor4.png?fit=300%2C121&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor4.png?fit=395%2C159&amp;ssl=1\" class=\"alignnone size-medium wp-image-5584\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor4-300x121.png?resize=300%2C121\" alt=\"\" width=\"300\" height=\"121\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor4.png?resize=300%2C121&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor4.png?w=395&amp;ssl=1 395w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>We will have an entropy<\/p>\n<p>&#8211; [0.6 *\u00a0\u00a0\u00a0-0.736966\u00a0\u00a0\u00a0+ 0.2 * -2.321928\u00a0\u00a0\u00a0+ 0.4 *\u00a0\u00a0\u00a0-1.321928] = 1.4353<\/p>\n<p>In all cases we will have values \u200b\u200branging from 0 to 2.322.<\/p>\n<p>If all polygons (central polygon and neighbours) have the same class, the entropy is zero (1 * log2 (1)).<\/p>\n<p>If we find the five classes, each one will have a proportion 0.2 and the resulting entropy will be 2.322.<\/p>\n<p><strong>Median<\/strong>: The value assigned to a cell is the median value calculated from the frequency distribution of the cell and its neighbours.<\/p>\n<p><strong>Standard Deviation<\/strong>: The value assigned to a cell is the standard deviation calculated from the cell and its neighbours.<\/p>\n<p><strong>Interquartile Deviation<\/strong>: The first and third quartiles are calculated using the frequency distribution of a polygon\u00a0and its neighbours.<br \/>\nThe value assigned to the cell is calculated by subtracting the first quartile value from the third quartile value:<\/p>\n<ul>\n<li>the 1st quartile is the data in the series that separates the bottom 25% of the data;<\/li>\n<li>the 2nd quartile is the data of the series that separates the series into two parts (50%) of the series;<\/li>\n<li>the 3rd quartile is the data in the series that separates the top 25% of the data.<\/li>\n<\/ul>\n<p>The difference between the third quartile and the first quartile is called the interquartile deviation;\u00a0it is a dispersion criterion of the series.\u00a0Dispersion represents the variability or range of different values \u200b\u200ba variable can take.\u00a0The interquartile deviation is the range of the statistical series after elimination of 25% of the lowest values \u200b\u200band 25% of the highest values.\u00a0This measure is more robust than the extent (range =\u00a0\u00a0\u00a0), which is sensitive to extreme values.<\/p>\n<p><strong>USAGE OF DIFFERENT MAP TYPES<\/strong><\/p>\n<p>The different Voronoi statistics are used for different purposes.<br \/>\nStatistics can be grouped into the following general functional categories:<\/p>\n<p><strong>Local smoothing tools:<\/strong><\/p>\n<ul>\n<li>average map<\/li>\n<li>mode map<\/li>\n<li>median map<\/li>\n<\/ul>\n<p>By calculating one of the three statistical variables for each point and its neighbours, the variation between each polygon and its neighbours becomes less abrupt.\u00a0We get a smoother map of our data.\u00a0This is useful when there is too much variation between neighbouring \u00a0points which results in the global map masking or making it more difficult to observe the global phenomena.<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5585\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-data-analysis-for-geostatistics-voronoi-diagrams\/vor5\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor5.png?fit=840%2C427&amp;ssl=1\" data-orig-size=\"840,427\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"vor5\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor5.png?fit=300%2C153&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor5.png?fit=640%2C325&amp;ssl=1\" class=\"alignnone size-medium wp-image-5585\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor5-300x153.png?resize=300%2C153\" alt=\"\" width=\"300\" height=\"153\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor5.png?resize=300%2C153&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor5.png?resize=768%2C390&amp;ssl=1 768w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor5.png?w=840&amp;ssl=1 840w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>The map on the left shows the simple Voronoi polygons, ie each point is represented by its real value. The map on the right shows the average Voronoi polygons.\u00a0Each polygon has as value equal to the average of its value and its neighbours.\u00a0The background noise from the centre of the map is \u00ab\u00a0smoothed\u201d\u00a0\u00a0thanks to the use of the average.<\/p>\n<p><strong>Tools for visualizing local variability<\/strong><\/p>\n<ul>\n<li>standard deviations map<\/li>\n<li>interquartile deviation map<\/li>\n<li>entropy map<\/li>\n<\/ul>\n<p>If the smoothing tools (average, mode, median) are tools that refer to what we can call the\u00a0<strong>central tendency<\/strong>\u00a0of a distribution, these three tools refer to the <strong>dispersion<\/strong>\u00a0of the\u00a0distributions.<\/p>\n<p>If we observe a lot of difference between the neighbouring values, we will state that the values \u200b\u200bhave a strong dispersion and variability.<\/p>\n<p>On the other hand, the notion of\u00a0\u00a0\u00a0\u201cvery much\u201d\u00a0\u00a0\u00a0is a relative notion.\u00a0Let\u2019s look the standard deviations map:<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5586\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-data-analysis-for-geostatistics-voronoi-diagrams\/vor6\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor6.png?fit=551%2C693&amp;ssl=1\" data-orig-size=\"551,693\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"vor6\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor6.png?fit=239%2C300&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor6.png?fit=551%2C693&amp;ssl=1\" class=\"alignnone size-medium wp-image-5586\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor6-239x300.png?resize=239%2C300\" alt=\"\" width=\"239\" height=\"300\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor6.png?resize=239%2C300&amp;ssl=1 239w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor6.png?w=551&amp;ssl=1 551w\" sizes=\"auto, (max-width: 239px) 100vw, 239px\" \/><\/p>\n<p>The colour scale is always the same, whatever the dispersion of the\u00a0values.\u00a0To fully interpret this image, you must know which data is involved and whether a maximum variability\u00a0\u00a0\u00a0from 4 to 13 is logical or not.<\/p>\n<p>On the other hand, what we can immediately understand is that the light areas are the areas where the relative variability is low and the darker areas those where the variability is very strong.<\/p>\n<p>Take the interquartile deviation map:<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5587\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-data-analysis-for-geostatistics-voronoi-diagrams\/vor7\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor7.png?fit=557%2C705&amp;ssl=1\" data-orig-size=\"557,705\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"vor7\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor7.png?fit=237%2C300&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor7.png?fit=557%2C705&amp;ssl=1\" class=\"alignnone size-medium wp-image-5587\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor7-237x300.png?resize=237%2C300\" alt=\"\" width=\"237\" height=\"300\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor7.png?resize=237%2C300&amp;ssl=1 237w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor7.png?w=557&amp;ssl=1 557w\" sizes=\"auto, (max-width: 237px) 100vw, 237px\" \/><\/p>\n<p>This map is another measure of dispersion.\u00a0We use the values \u200b\u200bof the point and its\u00a0neighbours, eliminate the 25% lower and higher, and display the min and max values \u200b\u200bof the remaining points.<\/p>\n<p>In short, we eliminate the extreme values \u200b\u200band display a range of variation.\u00a0Dark areas vary between 4 and 10.<\/p>\n<p>Since we, now, know that these data correspond to depths, we can deduce that the dark areas of the two previous maps correspond to the zones of steeper slope, ie where the values \u200b\u200bchange faster. The light areas correspond to rather flat areas.<\/p>\n<p>The interpretation of these two types of maps depends on the knowledge of the data, because the variability will always be expressed in five classes, with different boundaries depending on the data.\u00a0On the other hand, the entropy map does not look alike.\u00a0It always has 5 classes but class boundaries do not depend on the processed data.\u00a0They are fixed.<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5588\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-data-analysis-for-geostatistics-voronoi-diagrams\/vor8\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor8.png?fit=556%2C674&amp;ssl=1\" data-orig-size=\"556,674\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"vor8\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor8.png?fit=247%2C300&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor8.png?fit=556%2C674&amp;ssl=1\" class=\"alignnone size-medium wp-image-5588\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor8-247x300.png?resize=247%2C300\" alt=\"\" width=\"247\" height=\"300\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor8.png?resize=247%2C300&amp;ssl=1 247w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor8.png?w=556&amp;ssl=1 556w\" sizes=\"auto, (max-width: 247px) 100vw, 247px\" \/><\/p>\n<p>If all the polygons (nearest neighbours) look alike, the value of the entropy will be 0. If all the polygons are different, the value is 2.32.<\/p>\n<p>As its name suggests, the entropy map is a measure of\u00a0\u00a0\u00a0\u201cdisorder\u201d.\u00a0If we find areas with high entropy (this is not the case in our example map) a detour to try to understand the reasons is required.<\/p>\n<p><strong>Search for outliers<\/strong><\/p>\n<p>It is important to identify outliers for two reasons: they may be actual anomalies of the phenomenon, or the value may have been wrongly measured or recorded.<br \/>\nIf an aberration is a real anomaly in the phenomenon, it is perhaps the most important point to study and to understand the phenomenon.\u00a0For example, a sample on the vein of an ore could appear as an outlier, and it is precisely this location that is the most important goal for a mining company.<br \/>\nIf outliers are caused by errors in data entry or by any other clearly incorrect reason, they must be corrected or deleted before creating a surface.\u00a0Outliers can have several detrimental effects on your interpolated surface, with effects on semi-variogram modelling and its influence on neighbouring values.<\/p>\n<p>Voronoi maps created using cluster and entropy methods can be used to help identify possible outliers.<br \/>\nEntropy values \u200b\u200bprovide a measure of dissimilarity between neighbouring polygons.\u00a0In nature, you expect that things close together are more\u00a0\u00a0\u00a0similar than things further away.\u00a0Therefore, local outliers can be identified by high entropy areas.<\/p>\n<p>The cluster method identifies polygons that are dissimilar to their surrounding neighbours.\u00a0You expect the value stored in a particular polygon to be similar to, at least, one of its neighbours. Therefore, this tool can be used to identify possible local outliers:<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5589\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-data-analysis-for-geostatistics-voronoi-diagrams\/vor9\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor9.png?fit=558%2C700&amp;ssl=1\" data-orig-size=\"558,700\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"vor9\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor9.png?fit=239%2C300&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor9.png?fit=558%2C700&amp;ssl=1\" class=\"alignnone size-medium wp-image-5589\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor9-239x300.png?resize=239%2C300\" alt=\"\" width=\"239\" height=\"300\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor9.png?resize=239%2C300&amp;ssl=1 239w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor9.png?w=558&amp;ssl=1 558w\" sizes=\"auto, (max-width: 239px) 100vw, 239px\" \/><\/p>\n<p>The cluster includes all the points and classifies these values \u200b\u200binto five classes.\u00a0For each polygon we display its class, if and only if, at least one neighbouring polygon belongs to the same class.\u00a0If all neighbouring polygons belong to different classes, the polygon is displayed in grey.<\/p>\n<p>On the previous image, it will be useful to click on each grey polygon and observe its value on the map of points:<\/p>\n<p><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" data-attachment-id=\"5590\" data-permalink=\"https:\/\/www.sigterritoires.fr\/index.php\/en\/exploratory-data-analysis-for-geostatistics-voronoi-diagrams\/vor10\/\" data-orig-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor10.png?fit=840%2C503&amp;ssl=1\" data-orig-size=\"840,503\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"vor10\" data-image-description=\"\" data-image-caption=\"\" data-medium-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor10.png?fit=300%2C180&amp;ssl=1\" data-large-file=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor10.png?fit=640%2C383&amp;ssl=1\" class=\"alignnone size-medium wp-image-5590\" src=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor10-300x180.png?resize=300%2C180\" alt=\"\" width=\"300\" height=\"180\" srcset=\"https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor10.png?resize=300%2C180&amp;ssl=1 300w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor10.png?resize=768%2C460&amp;ssl=1 768w, https:\/\/i0.wp.com\/www.sigterritoires.fr\/wp-content\/uploads\/2018\/07\/vor10.png?w=840&amp;ssl=1 840w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>In this example, a value of 3.4 is observed among values \u200b\u200bof the order of 30. It may be a decimal entry error.<\/p>\n<p>In the next article we will see how to analyse the data trends.<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Following the article\u00a0\u00a0\u00a0Introduction to exploratory data analysis for geostatistics\u00a0\u00a0\u00a0we will discuss all the available tools to carry out the exploratory analysis of spatialized data.\u00a0We have discussed\u00a0\u00a0\u00a0the histograms\u00a0,\u00a0\u00a0\u00a0\u00a0the QQ-Plots\u00a0, and now we will address the Voronoi maps.&hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"give_campaign_id":0,"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"sfsi_plus_gutenberg_text_before_share":"","sfsi_plus_gutenberg_show_text_before_share":"","sfsi_plus_gutenberg_icon_type":"","sfsi_plus_gutenberg_icon_alignemt":"","sfsi_plus_gutenburg_max_per_row":"","_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1260],"tags":[],"class_list":["post-5579","post","type-post","status-publish","format-standard","hentry","category-non-classe-en"],"aioseo_notices":[],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/p6XU0A-1rZ","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/posts\/5579","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/comments?post=5579"}],"version-history":[{"count":0,"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/posts\/5579\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/media?parent=5579"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/categories?post=5579"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.sigterritoires.fr\/index.php\/wp-json\/wp\/v2\/tags?post=5579"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}