Counts the number of the specified files which fall within each of a sequence of discrete size-intervals.
Files=86, mean=394459250.942, standard-deviation=304129537.729
Bin-size Frequency
========== =========
0 44
304129538 29
608259076 6
912388614 4
1216518152 2
1520647690 1
The left-hand column defines an arithmetic sequence of bins, whilst the
right-hand column defines the number of files from those specified which
fall into each. The choice of the increment between each bin has defaulted
to one standard-deviation.
From this data one can conclude that there are 44 files whose size lies
in the semi-closed interval [0, 1) standard-deviations, decaying monotonically
to only one file whose size lies in the semi-closed interval [5, 6) standard-deviations.
Files=86, mean=394459250.942, standard-deviation=304129537.729 Bin-size Probability ========== =========== 100000000 0.209 200000000 0.302 300000000 0.233 400000000 0.035 500000000 0.070 600000000 0.023 800000000 0.035 900000000 0.035 1000000000 0.012 1100000000 0.012 1200000000 0.012 1400000000 0.012 1600000000 0.012
CAVEAT: the total probability may differ from "1", due to round-errors; see "nDecimalDigits".
Files=1735, mean=13846.622, standard-deviation=74846.621
Bin-size Frequency
========== =========
0 4 #Though "0" isn’t a member of the requested geometric
sequence, it’s the integral value beneath all fractional values which are.
1 2
10 100
100 563
1000 794
10000 188
100000 83
1000000 1
From this data one can conclude that there are 4 files whose size is zero, 2 files in the semi-closed interval [1, 10), 100 files in [10, 100), ...
Files=878, mean=78365.943, standard-deviation=297831.014
Bin-size Probability
========== ===========
32 0.023
64 0.017
128 0.008
256 0.015
512 0.034
1024 0.046
2048 0.047
4096 0.096
8192 0.155
16384 0.179
32768 0.155
65536 0.157
131072 0.032
262144 0.017
524288 0.003
1048576 0.010
2097152 0.007
When specifying an arithmetic sequence of bin-sizes, the lack of resolution amongst smaller files makes the distribution appear like the decaying exponential of a geometric distribution, but by using a geometric sequence of bin-sizes, it can be seen more clearly to be a log-normal distribution; see "A Large-Scale Study of File-System Contents" by John R. Douceur and William J. Bolosky.
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see <https://www.gnu.org/licenses/ >.