Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 5 |
| Number of observations | 1000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 39.2 KiB |
| Average record size in memory | 40.1 B |
Variable types
| Type | Count |
|---|---|
| NUM | 5 |
Reproduction
| Setting | Value |
|---|---|
| Analysis started | 2020-08-25 13:49:51.681350 |
| Analysis finished | 2020-08-25 13:49:56.523497 |
| Duration | 4.84 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
Variable a
| Statistic | Value |
|---|---|
| Distinct count | 1000 |
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.49603561662410556 |
| Minimum | 0.00011036852237689132 |
| Maximum | 0.9983580831890366 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.8 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0.0001103685224 |
| 5-th percentile | 0.03892341171 |
| Q1 | 0.2410683702 |
| median | 0.5024830366 |
| Q3 | 0.7375589717 |
| 95-th percentile | 0.9471630622 |
| Maximum | 0.9983580832 |
| Range | 0.9982477147 |
| Interquartile range (IQR) | 0.4964906016 |
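The derived rows in this table follow from the others: the range is the maximum minus the minimum, and the IQR is Q3 minus Q1. The sketch below recomputes them on a small made-up sample. The helper `quantile` is hypothetical and assumes linear interpolation between order statistics, which may differ from the method the report used.

```python
def quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring order statistics.

    Assumption: this mirrors a common default, not necessarily the report's method.
    """
    if not sorted_values:
        raise ValueError("empty sample")
    pos = q * (len(sorted_values) - 1)   # fractional index into the sorted data
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] * (1 - frac) + sorted_values[upper] * frac


sample = sorted([0.9, 0.1, 0.7, 0.2, 0.4])   # made-up data, not from the report
q1, median, q3 = (quantile(sample, q) for q in (0.25, 0.5, 0.75))
value_range = sample[-1] - sample[0]         # Range = Maximum - Minimum
iqr = q3 - q1                                # IQR = Q3 - Q1
print(q1, median, q3, value_range, iqr)
```

Applied to the figures in the table, 0.9983580832 - 0.0001103685224 gives the reported range of 0.9982477147 after rounding.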
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.2898457978 |
| Coefficient of variation (CV) | 0.5843245688 |
| Kurtosis | -1.202722056 |
| Mean | 0.4960356166 |
| Median Absolute Deviation (MAD) | 0.2492625755 |
| Skewness | -0.008708593557 |
| Sum | 496.0356166 |
| Variance | 0.0840105865 |
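Several rows here are simple functions of one another: the variance is the squared standard deviation, the CV is the standard deviation divided by the mean, and the sum is the mean times the number of observations. The following is a minimal sketch on made-up data. It assumes the sample (n - 1) standard deviation and an unscaled median absolute deviation around the median; the report may use different conventions.

```python
import statistics

data = [2.0, 4.0, 4.0, 5.0, 10.0]     # made-up data, not from the report

mean = statistics.mean(data)
std = statistics.stdev(data)          # sample standard deviation (n - 1), an assumption
variance = statistics.variance(data)  # equals std ** 2
cv = std / mean                       # coefficient of variation
med = statistics.median(data)
mad = statistics.median(abs(x - med) for x in data)  # median absolute deviation
total = sum(data)                     # equals mean * len(data)

print(mean, std, variance, cv, mad, total)
```

The same relations can be checked against the table: 0.2898457978 / 0.4960356166 is approximately 0.5843, which matches the reported CV.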
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.9385864061 | 1 | 0.1% | |
| 0.114433137 | 1 | 0.1% | |
| 0.4752931621 | 1 | 0.1% | |
| 0.2102463925 | 1 | 0.1% | |
| 0.4603792023 | 1 | 0.1% | |
| 0.2113915275 | 1 | 0.1% | |
| 0.5986569384 | 1 | 0.1% | |
| 0.7086391378 | 1 | 0.1% | |
| 0.4138992015 | 1 | 0.1% | |
| 0.6962643163 | 1 | 0.1% | |
| Other values (990) | 990 | 99.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.0001103685224 | 1 | 0.1% | |
| 0.000167063331 | 1 | 0.1% | |
| 0.001534646043 | 1 | 0.1% | |
| 0.001744181929 | 1 | 0.1% | |
| 0.002829276984 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.9983580832 | 1 | 0.1% | |
| 0.9969940265 | 1 | 0.1% | |
| 0.9962974759 | 1 | 0.1% | |
| 0.9956938526 | 1 | 0.1% | |
| 0.994672249 | 1 | 0.1% |
Variable b
| Statistic | Value |
|---|---|
| Distinct count | 1000 |
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4976896441671239 |
| Minimum | 3.255334367957552e-05 |
| Maximum | 0.9998802914813884 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.8 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 3.255334368e-05 |
| 5-th percentile | 0.05840948171 |
| Q1 | 0.2440723592 |
| median | 0.48119514 |
| Q3 | 0.7571758405 |
| 95-th percentile | 0.9540105859 |
| Maximum | 0.9998802915 |
| Range | 0.9998477381 |
| Interquartile range (IQR) | 0.5131034814 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.2934853901 |
| Coefficient of variation (CV) | 0.5896955936 |
| Kurtosis | -1.247295364 |
| Mean | 0.4976896442 |
| Median Absolute Deviation (MAD) | 0.2556933251 |
| Skewness | 0.04871458042 |
| Sum | 497.6896442 |
| Variance | 0.08613367422 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.104371909 | 1 | 0.1% | |
| 0.805434504 | 1 | 0.1% | |
| 0.2812391539 | 1 | 0.1% | |
| 0.6613548265 | 1 | 0.1% | |
| 0.2604039842 | 1 | 0.1% | |
| 0.962532327 | 1 | 0.1% | |
| 0.46133387 | 1 | 0.1% | |
| 0.8242477535 | 1 | 0.1% | |
| 0.2014581129 | 1 | 0.1% | |
| 0.337153214 | 1 | 0.1% | |
| Other values (990) | 990 | 99.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 3.255334368e-05 | 1 | 0.1% | |
| 0.0009543377432 | 1 | 0.1% | |
| 0.001843832395 | 1 | 0.1% | |
| 0.002320559076 | 1 | 0.1% | |
| 0.002669353878 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.9998802915 | 1 | 0.1% | |
| 0.9986568675 | 1 | 0.1% | |
| 0.9984117679 | 1 | 0.1% | |
| 0.9971561539 | 1 | 0.1% | |
| 0.9963002548 | 1 | 0.1% |
Variable c
| Statistic | Value |
|---|---|
| Distinct count | 1000 |
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5002124154518173 |
| Minimum | 0.00016299075369019533 |
| Maximum | 0.9990704237749324 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.8 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0.0001629907537 |
| 5-th percentile | 0.04327553595 |
| Q1 | 0.2474555422 |
| median | 0.5167601724 |
| Q3 | 0.7461510712 |
| 95-th percentile | 0.956067994 |
| Maximum | 0.9990704238 |
| Range | 0.998907433 |
| Interquartile range (IQR) | 0.498695529 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.2900708012 |
| Coefficient of variation (CV) | 0.579895245 |
| Kurtosis | -1.172289288 |
| Mean | 0.5002124155 |
| Median Absolute Deviation (MAD) | 0.2489490884 |
| Skewness | -0.02498797573 |
| Sum | 500.2124155 |
| Variance | 0.08414106973 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.7673086287 | 1 | 0.1% | |
| 0.2235953947 | 1 | 0.1% | |
| 0.9810037169 | 1 | 0.1% | |
| 0.5828179623 | 1 | 0.1% | |
| 0.2463940261 | 1 | 0.1% | |
| 0.5164020475 | 1 | 0.1% | |
| 0.4128799841 | 1 | 0.1% | |
| 0.6223114139 | 1 | 0.1% | |
| 0.5204983621 | 1 | 0.1% | |
| 0.772549966 | 1 | 0.1% | |
| Other values (990) | 990 | 99.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.0001629907537 | 1 | 0.1% | |
| 0.001207473876 | 1 | 0.1% | |
| 0.001583146709 | 1 | 0.1% | |
| 0.001681737516 | 1 | 0.1% | |
| 0.001760754771 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.9990704238 | 1 | 0.1% | |
| 0.9979050199 | 1 | 0.1% | |
| 0.99683349 | 1 | 0.1% | |
| 0.9963737955 | 1 | 0.1% | |
| 0.9954980932 | 1 | 0.1% |
Variable d
| Statistic | Value |
|---|---|
| Distinct count | 1000 |
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5108146611339072 |
| Minimum | 0.00037029279084022093 |
| Maximum | 0.9995547781577581 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.8 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0.0003702927908 |
| 5-th percentile | 0.05405086738 |
| Q1 | 0.2512262066 |
| median | 0.5231901836 |
| Q3 | 0.7652953195 |
| 95-th percentile | 0.9562950624 |
| Maximum | 0.9995547782 |
| Range | 0.9991844854 |
| Interquartile range (IQR) | 0.5140691129 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.2896394878 |
| Coefficient of variation (CV) | 0.567014829 |
| Kurtosis | -1.188396004 |
| Mean | 0.5108146611 |
| Median Absolute Deviation (MAD) | 0.254591609 |
| Skewness | -0.06505746225 |
| Sum | 510.8146611 |
| Variance | 0.08389103287 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.4462812582 | 1 | 0.1% | |
| 0.9626243634 | 1 | 0.1% | |
| 0.4035546722 | 1 | 0.1% | |
| 0.07846311922 | 1 | 0.1% | |
| 0.7741706145 | 1 | 0.1% | |
| 0.4028344023 | 1 | 0.1% | |
| 0.6812746488 | 1 | 0.1% | |
| 0.5275647013 | 1 | 0.1% | |
| 0.7307168041 | 1 | 0.1% | |
| 0.6819473024 | 1 | 0.1% | |
| Other values (990) | 990 | 99.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.0003702927908 | 1 | 0.1% | |
| 0.001359281988 | 1 | 0.1% | |
| 0.00186156393 | 1 | 0.1% | |
| 0.003046201856 | 1 | 0.1% | |
| 0.003101275089 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.9995547782 | 1 | 0.1% | |
| 0.9965432486 | 1 | 0.1% | |
| 0.9960923487 | 1 | 0.1% | |
| 0.9953510554 | 1 | 0.1% | |
| 0.9925925808 | 1 | 0.1% |
Variable e
| Statistic | Value |
|---|---|
| Distinct count | 1000 |
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5010268385918255 |
| Minimum | 0.00037950967686839476 |
| Maximum | 0.9997855084461895 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 7.8 KiB |
Quantile statistics
| Statistic | Value |
|---|---|
| Minimum | 0.0003795096769 |
| 5-th percentile | 0.04960697893 |
| Q1 | 0.2526117816 |
| median | 0.4910308642 |
| Q3 | 0.7677758696 |
| 95-th percentile | 0.9342226726 |
| Maximum | 0.9997855084 |
| Range | 0.9994059988 |
| Interquartile range (IQR) | 0.5151640881 |
Descriptive statistics
| Statistic | Value |
|---|---|
| Standard deviation | 0.289226879 |
| Coefficient of variation (CV) | 0.5772682353 |
| Kurtosis | -1.251931332 |
| Mean | 0.5010268386 |
| Median Absolute Deviation (MAD) | 0.259204655 |
| Skewness | 0.01192021794 |
| Sum | 501.0268386 |
| Variance | 0.08365218752 |
Histogram with fixed size bins (bins=10)
Common values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.2301025349 | 1 | 0.1% | |
| 0.06605608778 | 1 | 0.1% | |
| 0.193505822 | 1 | 0.1% | |
| 0.5597152983 | 1 | 0.1% | |
| 0.870400768 | 1 | 0.1% | |
| 0.1509152324 | 1 | 0.1% | |
| 0.7581694317 | 1 | 0.1% | |
| 0.882948668 | 1 | 0.1% | |
| 0.7432413958 | 1 | 0.1% | |
| 0.2032994164 | 1 | 0.1% | |
| Other values (990) | 990 | 99.0% |
Minimum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.0003795096769 | 1 | 0.1% | |
| 0.000859878597 | 1 | 0.1% | |
| 0.001314535632 | 1 | 0.1% | |
| 0.004629828063 | 1 | 0.1% | |
| 0.004929238282 | 1 | 0.1% |
Maximum 5 values
| Value | Count | Frequency (%) | |
|---|---|---|---|
| 0.9997855084 | 1 | 0.1% | |
| 0.9969876266 | 1 | 0.1% | |
| 0.996225163 | 1 | 0.1% | |
| 0.9945595067 | 1 | 0.1% | |
| 0.9935423968 | 1 | 0.1% |
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for an exactly linear relationship the steepness of the line does not affect r; only the sign of the slope does. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
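The calculation described above can be sketched in plain Python. The function name `pearson_r` and the sample data are illustrative assumptions, not part of the report.

```python
import math


def pearson_r(xs, ys):
    """Pearson's r: covariance of X and Y over the product of their standard deviations."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n (or 1/(n - 1)) factors cancel between numerator and denominator,
    # so plain sums of deviations are enough.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)  # undefined if either input is constant


xs = [1.0, 2.0, 3.0, 4.0, 5.0]            # made-up data
rising = [2 * x + 1 for x in xs]          # exactly linear, positive slope
falling = [-0.5 * x + 3 for x in xs]      # exactly linear, negative slope
print(pearson_r(xs, rising), pearson_r(xs, falling))
```

Both lines are perfectly linear, so the magnitude of r is 1 in each case regardless of slope; only the sign differs.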
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
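In other words, ρ is Pearson's r applied to ranks. A minimal sketch follows, assuming tied values share the average of their rank positions; the helper names `average_ranks`, `pearson_r` and `spearman_rho` are hypothetical.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # average of 0-based positions i..j, made 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance over the product of standard deviations (normalisation cancels)."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def spearman_rho(xs, ys):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


xs = [1.0, 2.0, 3.0, 4.0, 5.0]   # made-up data
cubes = [x ** 3 for x in xs]     # monotonic but not linear
print(spearman_rho(xs, cubes), pearson_r(xs, cubes))
```

On this monotonic but nonlinear sample, ρ reaches 1 while Pearson's r stays below 1, which illustrates the difference described above.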
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
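The pair-counting definition can be sketched directly. This is the simple form (often called tau-a), in which tied pairs count as neither concordant nor discordant; libraries commonly use a tie-corrected variant, so results may differ on data with ties. The function name `kendall_tau_a` is hypothetical.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:       # both variables move in the same direction
            concordant += 1
        elif product < 0:     # the variables move in opposite directions
            discordant += 1
        # product == 0 means a tie in x or y: counted as neither
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


xs = [1, 2, 3, 4]   # made-up data
ys = [1, 3, 2, 4]   # one swapped pair
print(kendall_tau_a(xs, ys))
```

Here 5 of the 6 pairs are concordant and 1 is discordant, so τ = (5 - 1) / 6.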
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
First rows
| | a | b | c | d | e |
|---|---|---|---|---|---|
| 0 | 0.168605 | 0.088419 | 0.631029 | 0.533096 | 0.917958 |
| 1 | 0.315895 | 0.073848 | 0.829413 | 0.143737 | 0.280338 |
| 2 | 0.137830 | 0.869474 | 0.501873 | 0.808615 | 0.149282 |
| 3 | 0.493717 | 0.034421 | 0.528937 | 0.600067 | 0.932171 |
| 4 | 0.361675 | 0.205724 | 0.742665 | 0.129008 | 0.637564 |
| 5 | 0.769593 | 0.772678 | 0.245068 | 0.148680 | 0.726782 |
| 6 | 0.473384 | 0.456181 | 0.665889 | 0.177821 | 0.019648 |
| 7 | 0.183022 | 0.476833 | 0.001207 | 0.688401 | 0.688314 |
| 8 | 0.532989 | 0.141323 | 0.514437 | 0.855395 | 0.127710 |
| 9 | 0.196139 | 0.836204 | 0.568191 | 0.389468 | 0.274022 |
Last rows
| | a | b | c | d | e |
|---|---|---|---|---|---|
| 990 | 0.732259 | 0.562789 | 0.447696 | 0.804464 | 0.634221 |
| 991 | 0.739007 | 0.753929 | 0.165127 | 0.844081 | 0.801953 |
| 992 | 0.447996 | 0.585246 | 0.954670 | 0.059409 | 0.783304 |
| 993 | 0.295182 | 0.211035 | 0.040836 | 0.169337 | 0.150936 |
| 994 | 0.335973 | 0.252420 | 0.566453 | 0.024700 | 0.012688 |
| 995 | 0.265495 | 0.548424 | 0.958761 | 0.291487 | 0.488196 |
| 996 | 0.917419 | 0.897077 | 0.582288 | 0.037852 | 0.588344 |
| 997 | 0.507682 | 0.816218 | 0.843139 | 0.769346 | 0.287649 |
| 998 | 0.394697 | 0.463418 | 0.188358 | 0.829562 | 0.928909 |
| 999 | 0.909544 | 0.696754 | 0.168389 | 0.948427 | 0.036411 |