![Hands-On Mathematics for Deep Learning](https://wfqqreader-1252317822.image.myqcloud.com/cover/81/36698081/b_36698081.jpg)
Variance
We define the variance of X as follows:
![](https://epubservercos.yuewen.com/FF11E0/19470372701459106/epubprivate/OEBPS/Images/Chapter_542.jpg?sign=1738886789-N3Aqx9ONk1dgVecCutysAALRbPLa7FFn-0-420dc45300c373bea1cdb5028c562630)
The standard deviation of X is the square root of the variance:
![](https://epubservercos.yuewen.com/FF11E0/19470372701459106/epubprivate/OEBPS/Images/Chapter_356.jpg?sign=1738886789-9dFNLZTTivLrFzfvaFXirbMCW79irJQO-0-0c8c9100e08fb2bfe2c6acff50d03fc0)
We can think of this as how spread out or close values are from the expected (mean) value. If they are highly dispersed, then they have a high variance, but if they are grouped together, then they have a low variance.
Here are some properties for variance that are important to remember:
.
- If
, then
.
.
.
, given that all the Xi values are independent.
Let's suppose that we now have discrete random variables. Then, they are independent if we take the following:
![](https://epubservercos.yuewen.com/FF11E0/19470372701459106/epubprivate/OEBPS/Images/Chapter_80.jpg?sign=1738886789-uP1Zk4xmS9WQRzMyWG39cABJmEBzIpaG-0-ed8a12e6bb86cc97adce9a3e91efef23)
Now, let our n random variables be independent and identically distributed (iid). We now have the following:
![](https://epubservercos.yuewen.com/FF11E0/19470372701459106/epubprivate/OEBPS/Images/Chapter_622.jpg?sign=1738886789-XHf2oB8Bu8PQuovaw41jS6IJSIEZoYjz-0-9391bd9850c4ed9fc2f2b2ade208d66b)
This concept is very important, especially in statistics. It implies that if we want to reduce the variance in the results of our experiment, then we can repeat the experiment a number of times and the sample average will have a small variance.
For example, let's imagine two pieces of rope that have unknown lengths—a and b, respectively. Because the objects are ropes—and, therefore, are non-rigid—we can measure the lengths of the ropes, but our measurements may not be accurate. Let A be the measured value of rope a and B be the measured value of rope b so that we have the following:
![](https://epubservercos.yuewen.com/FF11E0/19470372701459106/epubprivate/OEBPS/Images/Chapter_559.jpg?sign=1738886789-YpWQk2gaOunx5qm6C1e7Ht5GXF9i4xIE-0-7f3307134faba95e044b6e127db1b2b8)
We can increase the accuracy of our measurements by measuring X = A + B and Y = A – B. Now, we can estimate a and b using the following:
![](https://epubservercos.yuewen.com/FF11E0/19470372701459106/epubprivate/OEBPS/Images/Chapter_257.jpg?sign=1738886789-ZsNP1bQ7Q6QpgVYdk0QVkiWVAgSvlBhq-0-e7b8f8119117b3bc2c507df426fc771d)
Now, and
, which are both unbiased. Additionally, we can see that the variance has decreased in our measurement using the following:
![](https://epubservercos.yuewen.com/FF11E0/19470372701459106/epubprivate/OEBPS/Images/Chapter_777.jpg?sign=1738886789-y1GkTIw7XQiNLFQ48313d1pGthYSNIyT-0-520878c3b8e7edb7dbc4e68efc9e1384)
From this, we can clearly see that measuring the ropes together instead of separately has improved our accuracy significantly.