Overview
Dataset statistics
| Number of variables | 7 |
|---|---|
| Number of observations | 48,489 |
| Missing cells | 9,234 |
| Missing cells (%) | 2.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.6 MiB |
| Average record size in memory | 56.0 B |
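The two derived figures in this table can be checked from the raw counts. The sketch below assumes the missing-cell percentage is missing cells divided by rows times columns, and that total memory is the average record size times the number of observations; the variable names are illustrative only.

```python
# Counts copied from the Dataset statistics table.
n_variables = 7
n_observations = 48_489
missing_cells = 9_234
avg_record_bytes = 56.0

# Share of missing cells over all cells (rows x columns).
missing_pct = 100 * missing_cells / (n_variables * n_observations)

# Total memory implied by the average record size, in MiB (1 MiB = 2**20 bytes).
total_mib = avg_record_bytes * n_observations / 2**20

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Total size in memory: {total_mib:.1f} MiB")
```

Both values round to the figures reported above (2.7% and 2.6 MiB).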
Variable types
| Text | 2 |
|---|---|
| Categorical | 2 |
| Numeric | 3 |
Alerts
| Alert | Type |
|---|---|
| type_of_the_bulb is highly imbalanced (67.1%) | Imbalance |
| wattage_of_the_bulb has 9234 (19.0%) missing values | Missing |
| wattage_of_the_bulb has 8576 (17.7%) zeros | Zeros |
| no_of_hours_bulb_was_on_during_daytime_last_week has 42744 (88.2%) zeros | Zeros |
| no_of_hours_bulb_was_on_during_night_last_week has 13926 (28.7%) zeros | Zeros |
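The 67.1% imbalance figure is consistent with a normalized-entropy score, 1 - H / log2(k), computed over the nine type_of_the_bulb categories. The sketch below is one definition that reproduces the reported number from the Common Values counts; it is offered as an assumption about how the score is derived, not as the profiler's confirmed implementation, and `imbalance_score` is a hypothetical helper name.

```python
import math

# Category counts copied from the Common Values table of type_of_the_bulb.
counts = [37862, 8139, 1193, 398, 296, 228, 212, 139, 22]

def imbalance_score(counts):
    """Return 1 - H / log2(k): 0 for a uniform split, 1 for a single dominant class."""
    if len(counts) < 2:
        return 0.0  # a single category has no split to measure
    total = sum(counts)
    # Shannon entropy in bits, skipping empty categories.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(len(counts))

score = imbalance_score(counts)
print(f"Imbalance: {score:.1%}")
```

A perfectly even split across the nine bulb types would score 0%, so 67.1% reflects how strongly LED dominates the column.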
Reproduction
| Analysis started | 2024-12-06 05:54:46.469423 |
|---|---|
| Analysis finished | 2024-12-06 05:54:48.195246 |
| Duration | 1.73 seconds |
| Software version | ydata-profiling v4.11.0 |
| Download configuration | config.json |
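A report of this kind can be regenerated along the following lines. This is a minimal sketch using default settings, so it may not match the original config.json, which is not reproduced here; the function name, the report title, and any file paths passed in are hypothetical.

```python
def build_report(csv_path, out_path="report.html"):
    """Profile a CSV file with ydata-profiling and write an HTML report."""
    # Imported lazily so this sketch can be loaded without the libraries installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    # Default configuration; the original run may have used different options.
    report = ProfileReport(df, title="Bulb usage profile")
    report.to_file(out_path)
    return out_path
```

Calling `build_report("bulbs.csv")` (a placeholder file name) would write `report.html` next to the script.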
Variables
household_ID
Text
| Distinct | 4054 |
|---|---|
| Distinct (%) | 8.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 378.9 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 10 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | ID0001 |
|---|---|
| 2nd row | ID0001 |
| 3rd row | ID0001 |
| 4th row | ID0001 |
| 5th row | ID0001 |
| Value | Count | Frequency (%) |
| id0469 | 228 | 0.5% |
| id2033 | 225 | 0.5% |
| id0278 | 209 | 0.4% |
| id0282 | 182 | 0.4% |
| id1589 | 171 | 0.4% |
| id1841 | 165 | 0.3% |
| id0399 | 162 | 0.3% |
| id0069 | 152 | 0.3% |
| id0901 | 150 | 0.3% |
| id2072 | 144 | 0.3% |
| Other values (4044) | 46701 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 48489 | |
| D | 48489 | |
| 0 | 30565 | |
| 1 | 27712 | |
| 2 | 25976 | |
| 3 | 22658 | |
| 4 | 15122 | 5.2% |
| 6 | 14938 | 5.1% |
| 7 | 14483 | 5.0% |
| 8 | 14336 | 4.9% |
| Other values (2) | 28166 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 193956 | |
| Uppercase Letter | 96978 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 30565 | |
| 1 | 27712 | |
| 2 | 25976 | |
| 3 | 22658 | |
| 4 | 15122 | |
| 6 | 14938 | |
| 7 | 14483 | |
| 8 | 14336 | |
| 9 | 14213 | |
| 5 | 13953 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 48489 | |
| D | 48489 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 193956 | |
| Latin | 96978 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 30565 | |
| 1 | 27712 | |
| 2 | 25976 | |
| 3 | 22658 | |
| 4 | 15122 | |
| 6 | 14938 | |
| 7 | 14483 | |
| 8 | 14336 | |
| 9 | 14213 | |
| 5 | 13953 |
Latin
| Value | Count | Frequency (%) |
| I | 48489 | |
| D | 48489 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 290934 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 48489 | |
| D | 48489 | |
| 0 | 30565 | |
| 1 | 27712 | |
| 2 | 25976 | |
| 3 | 22658 | |
| 4 | 15122 | 5.2% |
| 6 | 14938 | 5.1% |
| 7 | 14483 | 5.0% |
| 8 | 14336 | 4.9% |
| Other values (2) | 28166 |
room_ID
Categorical
| Distinct | 32 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 378.9 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 2 |
| Mean length | 2.1524263 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | I1 |
|---|---|
| 2nd row | I1 |
| 3rd row | I1 |
| 4th row | I1 |
| 5th row | I2 |
Common Values
| Value | Count | Frequency (%) |
| I1 | 10095 | 20.8% |
| I2 | 5696 | 11.7% |
| I3 | 5097 | 10.5% |
| I4 | 4918 | 10.1% |
| I5 | 4649 | 9.6% |
| I6 | 4066 | 8.4% |
| I7 | 3443 | 7.1% |
| I8 | 2533 | 5.2% |
| I9 | 2033 | 4.2% |
| I10 | 1347 | 2.8% |
| Other values (22) | 4612 | 9.5% |
Histogram of lengths of the category (plot not shown)
| Value | Count | Frequency (%) |
| i1 | 10095 | |
| i2 | 5696 | |
| i3 | 5097 | |
| i4 | 4918 | |
| i5 | 4649 | |
| i6 | 4066 | |
| i7 | 3443 | 7.1% |
| i8 | 2533 | 5.2% |
| i9 | 2033 | 4.2% |
| i10 | 1347 | 2.8% |
| Other values (22) | 4612 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 48489 | |
| 1 | 16872 | 16.2% |
| 2 | 6582 | 6.3% |
| 3 | 5767 | 5.5% |
| 4 | 5374 | 5.1% |
| 5 | 5009 | 4.8% |
| 6 | 4310 | 4.1% |
| 7 | 3627 | 3.5% |
| 8 | 2666 | 2.6% |
| 9 | 2142 | 2.1% |
| Other values (4) | 3531 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 53750 | |
| Uppercase Letter | 49199 | |
| Lowercase Letter | 1420 | 1.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 16872 | |
| 2 | 6582 | 12.2% |
| 3 | 5767 | 10.7% |
| 4 | 5374 | 10.0% |
| 5 | 5009 | 9.3% |
| 6 | 4310 | 8.0% |
| 7 | 3627 | 6.7% |
| 8 | 2666 | 5.0% |
| 9 | 2142 | 4.0% |
| 0 | 1401 | 2.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 48489 | |
| O | 710 | 1.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 710 | |
| h | 710 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 53750 | |
| Latin | 50619 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 16872 | |
| 2 | 6582 | 12.2% |
| 3 | 5767 | 10.7% |
| 4 | 5374 | 10.0% |
| 5 | 5009 | 9.3% |
| 6 | 4310 | 8.0% |
| 7 | 3627 | 6.7% |
| 8 | 2666 | 5.0% |
| 9 | 2142 | 4.0% |
| 0 | 1401 | 2.6% |
Latin
| Value | Count | Frequency (%) |
| I | 48489 | |
| O | 710 | 1.4% |
| t | 710 | 1.4% |
| h | 710 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104369 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 48489 | |
| 1 | 16872 | 16.2% |
| 2 | 6582 | 6.3% |
| 3 | 5767 | 5.5% |
| 4 | 5374 | 5.1% |
| 5 | 5009 | 4.8% |
| 6 | 4310 | 4.1% |
| 7 | 3627 | 3.5% |
| 8 | 2666 | 2.6% |
| 9 | 2142 | 2.1% |
| Other values (4) | 3531 | 3.4% |
light_ID
Text
| Distinct | 412 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 378.9 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 5.1832581 |
| Min length | 5 |
Unique
| Unique | 82 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | I1_L1 |
|---|---|
| 2nd row | I1_L2 |
| 3rd row | I1_L3 |
| 4th row | I1_L4 |
| 5th row | I2_L1 |
| Value | Count | Frequency (%) |
| i1_l1 | 4017 | 8.3% |
| i2_l1 | 3928 | 8.1% |
| i3_l1 | 3797 | 7.8% |
| i4_l1 | 3683 | 7.6% |
| i5_l1 | 3395 | 7.0% |
| i6_l1 | 2877 | 5.9% |
| i7_l1 | 2200 | 4.5% |
| i1_l2 | 2160 | 4.5% |
| i8_l1 | 1567 | 3.2% |
| i1_l3 | 1237 | 2.6% |
| Other values (402) | 19628 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 48489 | |
| _ | 48489 | |
| L | 48489 | |
| 1 | 48408 | |
| 2 | 14577 | 5.8% |
| 3 | 9281 | 3.7% |
| 4 | 7662 | 3.0% |
| 5 | 6479 | 2.6% |
| 6 | 5337 | 2.1% |
| 7 | 4379 | 1.7% |
| Other values (6) | 9741 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 103734 | |
| Uppercase Letter | 97688 | |
| Connector Punctuation | 48489 | |
| Lowercase Letter | 1420 | 0.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 48408 | |
| 2 | 14577 | 14.1% |
| 3 | 9281 | 8.9% |
| 4 | 7662 | 7.4% |
| 5 | 6479 | 6.2% |
| 6 | 5337 | 5.1% |
| 7 | 4379 | 4.2% |
| 8 | 3272 | 3.2% |
| 9 | 2602 | 2.5% |
| 0 | 1737 | 1.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 48489 | |
| L | 48489 | |
| O | 710 | 0.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 710 | |
| h | 710 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 48489 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 152223 | |
| Latin | 99108 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| _ | 48489 | |
| 1 | 48408 | |
| 2 | 14577 | 9.6% |
| 3 | 9281 | 6.1% |
| 4 | 7662 | 5.0% |
| 5 | 6479 | 4.3% |
| 6 | 5337 | 3.5% |
| 7 | 4379 | 2.9% |
| 8 | 3272 | 2.1% |
| 9 | 2602 | 1.7% |
Latin
| Value | Count | Frequency (%) |
| I | 48489 | |
| L | 48489 | |
| O | 710 | 0.7% |
| t | 710 | 0.7% |
| h | 710 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 251331 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 48489 | |
| _ | 48489 | |
| L | 48489 | |
| 1 | 48408 | |
| 2 | 14577 | 5.8% |
| 3 | 9281 | 3.7% |
| 4 | 7662 | 3.0% |
| 5 | 6479 | 2.6% |
| 6 | 5337 | 2.1% |
| 7 | 4379 | 1.7% |
| Other values (6) | 9741 | 3.9% |
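The category tables above group characters by Unicode general category. As an illustration only, the sketch below classifies the five light_ID sample rows with Python's standard `unicodedata` module; this is not necessarily the library the profiler uses internally.

```python
import unicodedata
from collections import Counter

# The five light_ID sample rows listed under "Sample".
samples = ["I1_L1", "I1_L2", "I1_L3", "I1_L4", "I2_L1"]

# Map each character to its two-letter Unicode general category
# (Lu = Uppercase Letter, Nd = Decimal Number, Pc = Connector Punctuation).
category_counts = Counter(
    unicodedata.category(ch) for value in samples for ch in value
)

print(dict(category_counts))  # {'Lu': 10, 'Nd': 10, 'Pc': 5}
```

Each sample contributes exactly one underscore, which agrees with the Connector Punctuation count equalling the number of observations (48489).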
type_of_the_bulb
Categorical
Imbalance 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 378.9 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 3 |
| Mean length | 3.5187775 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | LED |
|---|---|
| 2nd row | LED |
| 3rd row | LED |
| 4th row | LED |
| 5th row | CFL |
Common Values
| Value | Count | Frequency (%) |
| LED | 37862 | 78.1% |
| CFL | 8139 | 16.8% |
| Incandescent | 1193 | 2.5% |
| Tube Light (conventional) | 398 | 0.8% |
| Halogen | 296 | 0.6% |
| Other | 228 | 0.5% |
| Tube Light (LED) | 212 | 0.4% |
| Flood light | 139 | 0.3% |
| Flashlight | 22 | < 0.1% |
Histogram of lengths of the category (plot not shown)
Common Values (plot not shown)
| Value | Count | Frequency (%) |
| led | 38074 | |
| cfl | 8139 | 16.3% |
| incandescent | 1193 | 2.4% |
| light | 749 | 1.5% |
| tube | 610 | 1.2% |
| conventional | 398 | 0.8% |
| halogen | 296 | 0.6% |
| other | 228 | 0.5% |
| flood | 139 | 0.3% |
| flashlight | 22 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| L | 46823 | |
| E | 38074 | |
| D | 38074 | |
| F | 8300 | 4.9% |
| C | 8139 | 4.8% |
| n | 5069 | 3.0% |
| e | 3918 | 2.3% |
| c | 2784 | 1.6% |
| t | 2590 | 1.5% |
| a | 1909 | 1.1% |
| Other values (18) | 14942 | 8.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 141737 | |
| Lowercase Letter | 26306 | 15.4% |
| Space Separator | 1359 | 0.8% |
| Open Punctuation | 610 | 0.4% |
| Close Punctuation | 610 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 5069 | |
| e | 3918 | |
| c | 2784 | |
| t | 2590 | |
| a | 1909 | 7.3% |
| o | 1370 | 5.2% |
| d | 1332 | 5.1% |
| s | 1215 | 4.6% |
| i | 1169 | 4.4% |
| g | 1067 | 4.1% |
| Other values (6) | 3883 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 46823 | |
| E | 38074 | |
| D | 38074 | |
| F | 8300 | 5.9% |
| C | 8139 | 5.7% |
| I | 1193 | 0.8% |
| T | 610 | 0.4% |
| H | 296 | 0.2% |
| O | 228 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1359 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 610 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 610 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 168043 | |
| Common | 2579 | 1.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 46823 | |
| E | 38074 | |
| D | 38074 | |
| F | 8300 | 4.9% |
| C | 8139 | 4.8% |
| n | 5069 | 3.0% |
| e | 3918 | 2.3% |
| c | 2784 | 1.7% |
| t | 2590 | 1.5% |
| a | 1909 | 1.1% |
| Other values (15) | 12363 | 7.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 1359 | |
| ( | 610 | |
| ) | 610 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 170622 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| L | 46823 | |
| E | 38074 | |
| D | 38074 | |
| F | 8300 | 4.9% |
| C | 8139 | 4.8% |
| n | 5069 | 3.0% |
| e | 3918 | 2.3% |
| c | 2784 | 1.6% |
| t | 2590 | 1.5% |
| a | 1909 | 1.1% |
| Other values (18) | 14942 | 8.8% |
wattage_of_the_bulb
Real number (ℝ)
Missing  Zeros 
| Distinct | 70 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 9234 |
| Missing (%) | 19.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.386983 |
| Minimum | 0 |
|---|---|
| Maximum | 908 |
| Zeros | 8576 |
| Zeros (%) | 17.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 378.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3.75 |
| median | 7 |
| Q3 | 12 |
| 95-th percentile | 60 |
| Maximum | 908 |
| Range | 908 |
| Interquartile range (IQR) | 8.25 |
Descriptive statistics
| Standard deviation | 68.652667 |
|---|---|
| Coefficient of variation (CV) | 3.9485096 |
| Kurtosis | 136.2298 |
| Mean | 17.386983 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 11.276204 |
| Sum | 682526 |
| Variance | 4713.1887 |
| Monotonicity | Not monotonic |
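Several of the wattage statistics are linked by simple identities, which gives a quick consistency check on the table. The sketch below assumes the mean is taken over non-missing rows only, which is what the reported Sum and Mean imply; the variable names are illustrative.

```python
# Figures copied from the wattage_of_the_bulb tables.
n_rows, n_missing = 48_489, 9_234
total, std = 682_526, 68.652667
q1, q3 = 3.75, 12

n_valid = n_rows - n_missing   # rows that actually carry a wattage value
mean = total / n_valid         # Sum / non-missing count
cv = std / mean                # coefficient of variation
variance = std ** 2            # variance is the squared standard deviation
iqr = q3 - q1                  # interquartile range

print(n_valid, round(mean, 6), round(cv, 4), round(variance, 2), iqr)
```

The results agree with the reported Mean, CV, Variance, and IQR to the displayed precision, which confirms that the 9234 missing rows are excluded from the mean rather than counted as zero.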
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 8576 | |
| 5 | 5248 | |
| 12 | 5205 | |
| 7 | 3623 | 7.5% |
| 9 | 2483 | 5.1% |
| 15 | 2136 | 4.4% |
| 10 | 1536 | 3.2% |
| 8 | 1193 | 2.5% |
| 6 | 1059 | 2.2% |
| 18 | 935 | 1.9% |
| Other values (60) | 7261 | |
| (Missing) | 9234 |
| Value | Count | Frequency (%) |
| 0 | 8576 | |
| 1 | 80 | 0.2% |
| 2 | 378 | 0.8% |
| 3 | 479 | 1.0% |
| 3.5 | 301 | 0.6% |
| 4 | 86 | 0.2% |
| 5 | 5248 | |
| 5.5 | 237 | 0.5% |
| 6 | 1059 | 2.2% |
| 7 | 3623 |
| Value | Count | Frequency (%) |
| 908 | 1 | < 0.1% |
| 900 | 179 | |
| 675 | 64 | 0.1% |
| 250 | 1 | < 0.1% |
| 168 | 1 | < 0.1% |
| 165 | 255 | |
| 150 | 4 | < 0.1% |
| 125 | 1 | < 0.1% |
| 123 | 144 | |
| 120 | 57 | 0.1% |
no_of_hours_bulb_was_on_during_daytime_last_week
Real number (ℝ)
Zeros 
| Distinct | 123 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.6227784 |
| Minimum | 0 |
|---|---|
| Maximum | 70 |
| Zeros | 42744 |
| Zeros (%) | 88.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 378.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 10 |
| Maximum | 70 |
| Range | 70 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 7.0569084 |
|---|---|
| Coefficient of variation (CV) | 4.3486581 |
| Kurtosis | 51.180155 |
| Mean | 1.6227784 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.5893157 |
| Sum | 78686.902 |
| Variance | 49.799956 |
| Monotonicity | Not monotonic |
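Because 88.2% of rows are zero, the overall mean of about 1.6 hours says little about bulbs that were actually used in the daytime. A rough sketch, using only the counts and sum reported above, separates the two; the variable names are illustrative.

```python
# Figures copied from the daytime-hours tables.
n_rows, n_zeros = 48_489, 42_744
total_hours = 78_686.902

n_nonzero = n_rows - n_zeros              # bulbs with any daytime use
zero_pct = 100 * n_zeros / n_rows         # share of rows equal to zero
mean_all = total_hours / n_rows           # mean over every row
mean_nonzero = total_hours / n_nonzero    # mean over bulbs with nonzero use

print(n_nonzero, round(zero_pct, 1), round(mean_all, 4), round(mean_nonzero, 2))
```

Restricting to nonzero rows raises the average to roughly 13.7 hours per week, so the column behaves like a zero-inflated distribution rather than a uniformly low one.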
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 42744 | |
| 7 | 1087 | 2.2% |
| 14 | 701 | 1.4% |
| 21 | 539 | 1.1% |
| 1 | 492 | 1.0% |
| 2 | 321 | 0.7% |
| 10 | 264 | 0.5% |
| 70 | 218 | 0.4% |
| 35 | 196 | 0.4% |
| 3 | 173 | 0.4% |
| Other values (113) | 1754 | 3.6% |
| Value | Count | Frequency (%) |
| 0 | 42744 | |
| 0.033 | 1 | < 0.1% |
| 0.05 | 3 | < 0.1% |
| 0.1 | 5 | < 0.1% |
| 0.12 | 1 | < 0.1% |
| 0.125 | 1 | < 0.1% |
| 0.175 | 1 | < 0.1% |
| 0.2 | 2 | < 0.1% |
| 0.21 | 2 | < 0.1% |
| 0.25 | 112 | 0.2% |
| Value | Count | Frequency (%) |
| 70 | 218 | |
| 69 | 1 | < 0.1% |
| 66.5 | 4 | < 0.1% |
| 66 | 1 | < 0.1% |
| 65 | 1 | < 0.1% |
| 63 | 8 | < 0.1% |
| 60 | 18 | < 0.1% |
| 57 | 1 | < 0.1% |
| 56 | 19 | < 0.1% |
| 54 | 2 | < 0.1% |
no_of_hours_bulb_was_on_during_night_last_week
Real number (ℝ)
Zeros 
| Distinct | 294 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.515142 |
| Minimum | 0 |
|---|---|
| Maximum | 98 |
| Zeros | 13926 |
| Zeros (%) | 28.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 378.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 7 |
| Q3 | 28 |
| 95-th percentile | 59.5 |
| Maximum | 98 |
| Range | 98 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 20.584469 |
|---|---|
| Coefficient of variation (CV) | 1.2463997 |
| Kurtosis | 2.9408673 |
| Mean | 16.515142 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 1.678896 |
| Sum | 800802.71 |
| Variance | 423.72035 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 13926 | |
| 28 | 6070 | |
| 7 | 3642 | 7.5% |
| 14 | 3567 | 7.4% |
| 21 | 3407 | 7.0% |
| 35 | 2355 | 4.9% |
| 1 | 1531 | 3.2% |
| 2 | 1006 | 2.1% |
| 3.5 | 923 | 1.9% |
| 42 | 911 | 1.9% |
| Other values (284) | 11151 |
| Value | Count | Frequency (%) |
| 0 | 13926 | |
| 0.00083 | 2 | < 0.1% |
| 0.0023 | 1 | < 0.1% |
| 0.025 | 2 | < 0.1% |
| 0.03 | 2 | < 0.1% |
| 0.033 | 1 | < 0.1% |
| 0.05 | 8 | < 0.1% |
| 0.066 | 1 | < 0.1% |
| 0.075 | 1 | < 0.1% |
| 0.083 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 98 | 396 | |
| 97 | 1 | < 0.1% |
| 96 | 6 | < 0.1% |
| 95 | 7 | < 0.1% |
| 94.5 | 1 | < 0.1% |
| 92 | 1 | < 0.1% |
| 91 | 143 | 0.3% |
| 90 | 31 | 0.1% |
| 88 | 1 | < 0.1% |
| 87.5 | 2 | < 0.1% |
Interactions
Correlations
| | no_of_hours_bulb_was_on_during_daytime_last_week | no_of_hours_bulb_was_on_during_night_last_week | room_ID | type_of_the_bulb | wattage_of_the_bulb |
|---|---|---|---|---|---|
| no_of_hours_bulb_was_on_during_daytime_last_week | 1.000 | 0.149 | 0.028 | 0.063 | 0.067 |
| no_of_hours_bulb_was_on_during_night_last_week | 0.149 | 1.000 | 0.076 | 0.141 | 0.113 |
| room_ID | 0.028 | 0.076 | 1.000 | 0.058 | 0.045 |
| type_of_the_bulb | 0.063 | 0.141 | 0.058 | 1.000 | 0.087 |
| wattage_of_the_bulb | 0.067 | 0.113 | 0.045 | 0.087 | 1.000 |
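To read the matrix at a glance, the off-diagonal entries can be ranked. The sketch below copies the values from the table and makes no assumption about which correlation measure produced them.

```python
from itertools import combinations

names = [
    "no_of_hours_bulb_was_on_during_daytime_last_week",
    "no_of_hours_bulb_was_on_during_night_last_week",
    "room_ID",
    "type_of_the_bulb",
    "wattage_of_the_bulb",
]
matrix = [
    [1.000, 0.149, 0.028, 0.063, 0.067],
    [0.149, 1.000, 0.076, 0.141, 0.113],
    [0.028, 0.076, 1.000, 0.058, 0.045],
    [0.063, 0.141, 0.058, 1.000, 0.087],
    [0.067, 0.113, 0.045, 0.087, 1.000],
]

# Collect each unordered pair once (upper triangle) and sort by strength.
pairs = {
    (names[i], names[j]): matrix[i][j]
    for i, j in combinations(range(len(names)), 2)
}
ranked = sorted(pairs.items(), key=lambda item: item[1], reverse=True)

for (a, b), value in ranked[:3]:
    print(f"{value:.3f}  {a}  ~  {b}")
```

The strongest association is between daytime and night-time hours (0.149); every pair is below 0.15, so no two columns are close to redundant.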
Missing values
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Sample
| | household_ID | room_ID | light_ID | type_of_the_bulb | wattage_of_the_bulb | no_of_hours_bulb_was_on_during_daytime_last_week | no_of_hours_bulb_was_on_during_night_last_week |
|---|---|---|---|---|---|---|---|
| 0 | ID0001 | I1 | I1_L1 | LED | 5.0 | 0.00 | 2.00 |
| 1 | ID0001 | I1 | I1_L2 | LED | 5.0 | 0.00 | 2.00 |
| 2 | ID0001 | I1 | I1_L3 | LED | 5.0 | 0.00 | 2.00 |
| 3 | ID0001 | I1 | I1_L4 | LED | 5.0 | 0.00 | 2.00 |
| 4 | ID0001 | I2 | I2_L1 | CFL | 5.0 | 0.00 | 0.25 |
| 5 | ID0001 | I3 | I3_L1 | CFL | 5.0 | 0.00 | 0.25 |
| 6 | ID0001 | I4 | I4_L1 | CFL | 5.0 | 0.00 | 0.25 |
| 7 | ID0001 | I5 | I5_L1 | CFL | 5.0 | 0.25 | 0.00 |
| 8 | ID0001 | I6 | I6_L1 | CFL | 5.0 | 2.00 | 1.00 |
| 9 | ID0001 | I7 | I7_L1 | CFL | 5.0 | 0.00 | 2.00 |
| | household_ID | room_ID | light_ID | type_of_the_bulb | wattage_of_the_bulb | no_of_hours_bulb_was_on_during_daytime_last_week | no_of_hours_bulb_was_on_during_night_last_week |
|---|---|---|---|---|---|---|---|
| 48479 | ID4062 | I3 | I3_L1 | LED | 7.0 | 0.0 | 0.0 |
| 48480 | ID4062 | I4 | I4_L1 | LED | 7.0 | 28.0 | 28.0 |
| 48481 | ID4062 | I6 | I6_L1 | LED | 7.0 | 0.0 | 7.0 |
| 48482 | ID4062 | I7 | I7_L1 | LED | 7.0 | 0.0 | 0.0 |
| 48483 | ID4062 | I7 | I7_L2 | LED | 7.0 | 0.0 | 0.0 |
| 48484 | ID4063 | I1 | I1_L1 | LED | 7.0 | 0.0 | 35.0 |
| 48485 | ID4063 | I2 | I2_L1 | LED | 7.0 | 0.0 | 2.0 |
| 48486 | ID4063 | I3 | I3_L1 | LED | 7.0 | 0.0 | 4.0 |
| 48487 | ID4063 | I7 | I7_L1 | LED | 7.0 | 0.0 | 2.0 |
| 48488 | ID4063 | I8 | I8_L1 | LED | 7.0 | 4.0 | 20.0 |
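In the sample rows, each light_ID looks like the room_ID followed by `_L` and a running light number. The sketch below checks that pattern on a few of the rows shown; it is an observation about these samples only, may not hold for every value in the column, and `split_light_id` is a hypothetical helper.

```python
# (room_ID, light_ID) pairs copied from the first and last sample rows.
rows = [
    ("I1", "I1_L1"), ("I1", "I1_L4"), ("I2", "I2_L1"),
    ("I7", "I7_L1"), ("I7", "I7_L2"), ("I8", "I8_L1"),
]

def split_light_id(light_id):
    """Split 'I7_L2' into ('I7', 2); raise ValueError if the pattern does not fit."""
    room, sep, index = light_id.partition("_L")
    if not sep or not index.isdigit():
        raise ValueError(f"unexpected light_ID format: {light_id!r}")
    return room, int(index)

# True when every light_ID starts with its own room_ID.
consistent = all(split_light_id(light)[0] == room for room, light in rows)
print(consistent)
```

If the pattern holds across the full dataset, light_ID carries no information beyond room_ID plus a per-room counter, which is worth knowing before using both columns as features.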