Counts the number of the specified files which fall within each of a sequence of discrete size-intervals.
Files=86, mean=394459250.942, standard-deviation=304129537.729 Bin-size Frequency ========== ========= 0 44 304129538 29 608259076 6 912388614 4 1216518152 2 1520647690 1
The left-hand column defines an arithmetic sequence of bins, whilst the
right-hand column defines the number of files from those specified which
fall into each. The choice of the increment between each bin has defaulted
to one standard-deviation.
From this data one can conclude that there are 44 files whose size lies
in the semi-closed interval [0, 1) standard-deviations, decaying monotonically
to only one file whose size lies in the semi-closed interval [5, 6) standard-deviations.
Files=86, mean=394459250.942, standard-deviation=304129537.729 Bin-size Probability ========== =========== 100000000 0.209 200000000 0.302 300000000 0.233 400000000 0.035 500000000 0.070 600000000 0.023 800000000 0.035 900000000 0.035 1000000000 0.012 1100000000 0.012 1200000000 0.012 1400000000 0.012 1600000000 0.012
CAVEAT: the total probability may differ from "1", due to round-errors; see "nDecimalDigits".
Files=1735, mean=13846.622, standard-deviation=74846.621 Bin-size Frequency ========== ========= 0 4 #Though "0" isn’t a member of the requested geometric sequence, it’s the integral value beneath all fractional values which are. 1 2 10 100 100 563 1000 794 10000 188 100000 83 1000000 1
From this data one can conclude that there are 4 files whose size is zero, 2 files in the semi-closed interval [1, 10), 100 files in [10, 100), ...
Files=878, mean=78365.943, standard-deviation=297831.014 Bin-size Probability ========== =========== 32 0.023 64 0.017 128 0.008 256 0.015 512 0.034 1024 0.046 2048 0.047 4096 0.096 8192 0.155 16384 0.179 32768 0.155 65536 0.157 131072 0.032 262144 0.017 524288 0.003 1048576 0.010 2097152 0.007
When specifying an arithmetic sequence of bin-sizes, the lack of resolution amongst smaller files makes the distribution appear like the decaying exponential of a geometric distribution, but by using a geometric sequence of bin-sizes, it can be seen more clearly to be a log-normal distribution; see "A Large-Scale Study of File-System Contents" by John R. Douceur and William J. Bolosky.
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see <https://www.gnu.org/licenses/ >.