Overview
Brought to you by YData
Dataset statistics
Number of variables | 6 |
---|---|
Number of observations | 48489 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 5119 |
Duplicate rows (%) | 10.6% |
Total size in memory | 3.6 MiB |
Average record size in memory | 77.8 B |
Variable types
Categorical | 1 |
---|---|
Text | 1 |
Numeric | 4 |
Dataset has 5119 (10.6%) duplicate rows | Duplicates |
wattage_of_the_bulb has 8576 (17.7%) zeros | Zeros |
no_of_hours_bulbs_was_on_during_daytime_last_week has 42744 (88.2%) zeros | Zeros |
no_of_hours_bulbs_was_on_during_night_last_week has 13926 (28.7%) zeros | Zeros |
Reproduction
Analysis started | 2024-11-18 08:39:21.367361 |
---|---|
Analysis finished | 2024-11-18 08:39:23.814095 |
Duration | 2.45 seconds |
Software version | ydata-profiling vv4.11.0 |
Download configuration | config.json |
Variables
room_ID
Categorical
Distinct | 32 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 MiB |
I_1 | |
---|---|
I_2 | |
I_3 | |
I_4 | |
I_5 | |
Other values (27) |
Common Values
Value | Count | Frequency (%) |
I_1 | 10095 | |
I_2 | 5696 | |
I_3 | 5097 | |
I_4 | 4918 | |
I_5 | 4649 | |
I_6 | 4066 | |
I_7 | 3443 | 7.1% |
I_8 | 2533 | 5.2% |
I_9 | 2033 | 4.2% |
I_10 | 1347 | 2.8% |
Other values (22) | 4612 |
Length
Value | Count | Frequency (%) |
i_1 | 10095 | |
i_2 | 5696 | |
i_3 | 5097 | |
i_4 | 4918 | |
i_5 | 4649 | |
i_6 | 4066 | |
i_7 | 3443 | 7.1% |
i_8 | 2533 | 5.2% |
i_9 | 2033 | 4.2% |
i_10 | 1347 | 2.8% |
Other values (22) | 4612 |
Most occurring characters
Value | Count | Frequency (%) |
_ | 49199 | |
I | 48489 | |
1 | 16872 | 11.0% |
2 | 6582 | 4.3% |
3 | 5767 | 3.8% |
4 | 5374 | 3.5% |
5 | 5009 | 3.3% |
6 | 4310 | 2.8% |
7 | 3627 | 2.4% |
8 | 2666 | 1.7% |
Other values (5) | 5673 | 3.7% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 153568 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
_ | 49199 | |
I | 48489 | |
1 | 16872 | 11.0% |
2 | 6582 | 4.3% |
3 | 5767 | 3.8% |
4 | 5374 | 3.5% |
5 | 5009 | 3.3% |
6 | 4310 | 2.8% |
7 | 3627 | 2.4% |
8 | 2666 | 1.7% |
Other values (5) | 5673 | 3.7% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 153568 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
_ | 49199 | |
I | 48489 | |
1 | 16872 | 11.0% |
2 | 6582 | 4.3% |
3 | 5767 | 3.8% |
4 | 5374 | 3.5% |
5 | 5009 | 3.3% |
6 | 4310 | 2.8% |
7 | 3627 | 2.4% |
8 | 2666 | 1.7% |
Other values (5) | 5673 | 3.7% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 153568 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
_ | 49199 | |
I | 48489 | |
1 | 16872 | 11.0% |
2 | 6582 | 4.3% |
3 | 5767 | 3.8% |
4 | 5374 | 3.5% |
5 | 5009 | 3.3% |
6 | 4310 | 2.8% |
7 | 3627 | 2.4% |
8 | 2666 | 1.7% |
Other values (5) | 5673 | 3.7% |
light_ID
Text
Distinct | 412 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 1.7 MiB |
Value | Count | Frequency (%) |
i_1_l_1 | 4017 | 8.3% |
i_2_l_1 | 3928 | 8.1% |
i_3_l_1 | 3797 | 7.8% |
i_4_l_1 | 3683 | 7.6% |
i_5_l_1 | 3395 | 7.0% |
i_6_l_1 | 2877 | 5.9% |
i_7_l_1 | 2200 | 4.5% |
i_1_l_2 | 2160 | 4.5% |
i_8_l_1 | 1567 | 3.2% |
i_1_l_3 | 1237 | 2.6% |
Other values (402) | 19628 |
Most occurring characters
Value | Count | Frequency (%) |
_ | 146177 | |
I | 48489 | 13.9% |
L | 48489 | 13.9% |
1 | 48408 | 13.9% |
2 | 14577 | 4.2% |
3 | 9281 | 2.7% |
4 | 7662 | 2.2% |
5 | 6479 | 1.9% |
6 | 5337 | 1.5% |
7 | 4379 | 1.3% |
Other values (6) | 9741 | 2.8% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 349019 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
_ | 146177 | |
I | 48489 | 13.9% |
L | 48489 | 13.9% |
1 | 48408 | 13.9% |
2 | 14577 | 4.2% |
3 | 9281 | 2.7% |
4 | 7662 | 2.2% |
5 | 6479 | 1.9% |
6 | 5337 | 1.5% |
7 | 4379 | 1.3% |
Other values (6) | 9741 | 2.8% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 349019 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
_ | 146177 | |
I | 48489 | 13.9% |
L | 48489 | 13.9% |
1 | 48408 | 13.9% |
2 | 14577 | 4.2% |
3 | 9281 | 2.7% |
4 | 7662 | 2.2% |
5 | 6479 | 1.9% |
6 | 5337 | 1.5% |
7 | 4379 | 1.3% |
Other values (6) | 9741 | 2.8% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 349019 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
_ | 146177 | |
I | 48489 | 13.9% |
L | 48489 | 13.9% |
1 | 48408 | 13.9% |
2 | 14577 | 4.2% |
3 | 9281 | 2.7% |
4 | 7662 | 2.2% |
5 | 6479 | 1.9% |
6 | 5337 | 1.5% |
7 | 4379 | 1.3% |
Other values (6) | 9741 | 2.8% |
type_of_the_bulb
Real number (ℝ)
Distinct | 9 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.862938 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.7 MiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 3 |
Q3 | 3 |
95-th percentile | 3 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.75761391 |
---|---|
Coefficient of variation (CV) | 0.26462812 |
Kurtosis | 26.731934 |
Mean | 2.862938 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.373147 |
Sum | 138821 |
Variance | 0.57397884 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 37862 | |
2 | 8139 | 16.8% |
1 | 1193 | 2.5% |
5 | 398 | 0.8% |
4 | 296 | 0.6% |
9 | 228 | 0.5% |
6 | 212 | 0.4% |
8 | 139 | 0.3% |
7 | 22 | < 0.1% |
Value | Count | Frequency (%) |
1 | 1193 | 2.5% |
2 | 8139 | 16.8% |
3 | 37862 | |
4 | 296 | 0.6% |
5 | 398 | 0.8% |
6 | 212 | 0.4% |
7 | 22 | < 0.1% |
8 | 139 | 0.3% |
9 | 228 | 0.5% |
Value | Count | Frequency (%) |
9 | 228 | 0.5% |
8 | 139 | 0.3% |
7 | 22 | < 0.1% |
6 | 212 | 0.4% |
5 | 398 | 0.8% |
4 | 296 | 0.6% |
3 | 37862 | |
2 | 8139 | 16.8% |
1 | 1193 | 2.5% |
wattage_of_the_bulb
Real number (ℝ)
Zeros 
Distinct | 71 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 204.3204 |
Minimum | 0 |
---|---|
Maximum | 999 |
Zeros | 8576 |
Zeros (%) | 17.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 5 |
median | 10 |
Q3 | 40 |
95-th percentile | 999 |
Maximum | 999 |
Range | 999 |
Interquartile range (IQR) | 35 |
Descriptive statistics
Standard deviation | 390.34709 |
---|---|
Coefficient of variation (CV) | 1.9104656 |
Kurtosis | 0.36126863 |
Mean | 204.3204 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 1.5316966 |
Sum | 9907292 |
Variance | 152370.85 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
999 | 9234 | |
0 | 8576 | |
5 | 5248 | |
12 | 5205 | |
7 | 3623 | 7.5% |
9 | 2483 | 5.1% |
15 | 2136 | 4.4% |
10 | 1536 | 3.2% |
8 | 1193 | 2.5% |
6 | 1059 | 2.2% |
Other values (61) | 8196 |
Value | Count | Frequency (%) |
0 | 8576 | |
1 | 80 | 0.2% |
2 | 378 | 0.8% |
3 | 479 | 1.0% |
3.5 | 301 | 0.6% |
4 | 86 | 0.2% |
5 | 5248 | |
5.5 | 237 | 0.5% |
6 | 1059 | 2.2% |
7 | 3623 |
Value | Count | Frequency (%) |
999 | 9234 | |
908 | 1 | < 0.1% |
900 | 179 | 0.4% |
675 | 64 | 0.1% |
250 | 1 | < 0.1% |
168 | 1 | < 0.1% |
165 | 255 | 0.5% |
150 | 4 | < 0.1% |
125 | 1 | < 0.1% |
123 | 144 | 0.3% |
no_of_hours_bulbs_was_on_during_daytime_last_week
Real number (ℝ)
Zeros 
Distinct | 123 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.6227784 |
Minimum | 0 |
---|---|
Maximum | 70 |
Zeros | 42744 |
Zeros (%) | 88.2% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 0 |
95-th percentile | 10 |
Maximum | 70 |
Range | 70 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 7.0569084 |
---|---|
Coefficient of variation (CV) | 4.3486581 |
Kurtosis | 51.180155 |
Mean | 1.6227784 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 6.5893157 |
Sum | 78686.902 |
Variance | 49.799956 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 42744 | |
7 | 1087 | 2.2% |
14 | 701 | 1.4% |
21 | 539 | 1.1% |
1 | 492 | 1.0% |
2 | 321 | 0.7% |
10 | 264 | 0.5% |
70 | 218 | 0.4% |
35 | 196 | 0.4% |
3 | 173 | 0.4% |
Other values (113) | 1754 | 3.6% |
Value | Count | Frequency (%) |
0 | 42744 | |
0.033 | 1 | < 0.1% |
0.05 | 3 | < 0.1% |
0.1 | 5 | < 0.1% |
0.12 | 1 | < 0.1% |
0.125 | 1 | < 0.1% |
0.175 | 1 | < 0.1% |
0.2 | 2 | < 0.1% |
0.21 | 2 | < 0.1% |
0.25 | 112 | 0.2% |
Value | Count | Frequency (%) |
70 | 218 | |
69 | 1 | < 0.1% |
66.5 | 4 | < 0.1% |
66 | 1 | < 0.1% |
65 | 1 | < 0.1% |
63 | 8 | < 0.1% |
60 | 18 | < 0.1% |
57 | 1 | < 0.1% |
56 | 19 | < 0.1% |
54 | 2 | < 0.1% |
no_of_hours_bulbs_was_on_during_night_last_week
Real number (ℝ)
Zeros 
Distinct | 294 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 16.515142 |
Minimum | 0 |
---|---|
Maximum | 98 |
Zeros | 13926 |
Zeros (%) | 28.7% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.7 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 7 |
Q3 | 28 |
95-th percentile | 59.5 |
Maximum | 98 |
Range | 98 |
Interquartile range (IQR) | 28 |
Descriptive statistics
Standard deviation | 20.584469 |
---|---|
Coefficient of variation (CV) | 1.2463997 |
Kurtosis | 2.9408673 |
Mean | 16.515142 |
Median Absolute Deviation (MAD) | 7 |
Skewness | 1.678896 |
Sum | 800802.71 |
Variance | 423.72035 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 13926 | |
28 | 6070 | |
7 | 3642 | 7.5% |
14 | 3567 | 7.4% |
21 | 3407 | 7.0% |
35 | 2355 | 4.9% |
1 | 1531 | 3.2% |
2 | 1006 | 2.1% |
3.5 | 923 | 1.9% |
42 | 911 | 1.9% |
Other values (284) | 11151 |
Value | Count | Frequency (%) |
0 | 13926 | |
0.00083 | 2 | < 0.1% |
0.0023 | 1 | < 0.1% |
0.025 | 2 | < 0.1% |
0.03 | 2 | < 0.1% |
0.033 | 1 | < 0.1% |
0.05 | 8 | < 0.1% |
0.066 | 1 | < 0.1% |
0.075 | 1 | < 0.1% |
0.083 | 1 | < 0.1% |
Value | Count | Frequency (%) |
98 | 396 | |
97 | 1 | < 0.1% |
96 | 6 | < 0.1% |
95 | 7 | < 0.1% |
94.5 | 1 | < 0.1% |
92 | 1 | < 0.1% |
91 | 143 | 0.3% |
90 | 31 | 0.1% |
88 | 1 | < 0.1% |
87.5 | 2 | < 0.1% |
Interactions
Missing values
Sample
room_ID | light_ID | type_of_the_bulb | wattage_of_the_bulb | no_of_hours_bulbs_was_on_during_daytime_last_week | no_of_hours_bulbs_was_on_during_night_last_week | |
---|---|---|---|---|---|---|
household_ID | ||||||
ID0001 | I_1 | I_1_L_1 | 3 | 5.0 | 0.00 | 2.00 |
ID0001 | I_1 | I_1_L_2 | 3 | 5.0 | 0.00 | 2.00 |
ID0001 | I_1 | I_1_L_3 | 3 | 5.0 | 0.00 | 2.00 |
ID0001 | I_1 | I_1_L_4 | 3 | 5.0 | 0.00 | 2.00 |
ID0001 | I_2 | I_2_L_1 | 2 | 5.0 | 0.00 | 0.25 |
ID0001 | I_3 | I_3_L_1 | 2 | 5.0 | 0.00 | 0.25 |
ID0001 | I_4 | I_4_L_1 | 2 | 5.0 | 0.00 | 0.25 |
ID0001 | I_5 | I_5_L_1 | 2 | 5.0 | 0.25 | 0.00 |
ID0001 | I_6 | I_6_L_1 | 2 | 5.0 | 2.00 | 1.00 |
ID0001 | I_7 | I_7_L_1 | 2 | 5.0 | 0.00 | 2.00 |
room_ID | light_ID | type_of_the_bulb | wattage_of_the_bulb | no_of_hours_bulbs_was_on_during_daytime_last_week | no_of_hours_bulbs_was_on_during_night_last_week | |
---|---|---|---|---|---|---|
household_ID | ||||||
ID4062 | I_3 | I_3_L_1 | 3 | 7.0 | 0.0 | 0.0 |
ID4062 | I_4 | I_4_L_1 | 3 | 7.0 | 28.0 | 28.0 |
ID4062 | I_6 | I_6_L_1 | 3 | 7.0 | 0.0 | 7.0 |
ID4062 | I_7 | I_7_L_1 | 3 | 7.0 | 0.0 | 0.0 |
ID4062 | I_7 | I_7_L_2 | 3 | 7.0 | 0.0 | 0.0 |
ID4063 | I_1 | I_1_L_1 | 3 | 7.0 | 0.0 | 35.0 |
ID4063 | I_2 | I_2_L_1 | 3 | 7.0 | 0.0 | 2.0 |
ID4063 | I_3 | I_3_L_1 | 3 | 7.0 | 0.0 | 4.0 |
ID4063 | I_7 | I_7_L_1 | 3 | 7.0 | 0.0 | 2.0 |
ID4063 | I_8 | I_8_L_1 | 3 | 7.0 | 4.0 | 20.0 |
Duplicate rows
Most frequently occurring
room_ID | light_ID | type_of_the_bulb | wattage_of_the_bulb | no_of_hours_bulbs_was_on_during_daytime_last_week | no_of_hours_bulbs_was_on_during_night_last_week | # duplicates | |
---|---|---|---|---|---|---|---|
434 | I_1 | I_1_L_2 | 3 | 0.0 | 0.0 | 0.0 | 287 |
600 | I_1 | I_1_L_3 | 3 | 0.0 | 0.0 | 0.0 | 230 |
305 | I_1 | I_1_L_1 | 3 | 999.0 | 0.0 | 28.0 | 210 |
694 | I_1 | I_1_L_4 | 3 | 0.0 | 0.0 | 0.0 | 193 |
218 | I_1 | I_1_L_1 | 3 | 12.0 | 0.0 | 28.0 | 142 |
750 | I_1 | I_1_L_5 | 3 | 0.0 | 0.0 | 0.0 | 130 |
1955 | I_2 | I_2_L_2 | 3 | 0.0 | 0.0 | 0.0 | 124 |
528 | I_1 | I_1_L_2 | 3 | 12.0 | 0.0 | 0.0 | 115 |
3066 | I_4 | I_4_L_2 | 3 | 0.0 | 0.0 | 0.0 | 110 |
4409 | I_7 | I_7_L_2 | 3 | 0.0 | 0.0 | 0.0 | 109 |