Overview
Dataset statistics
Number of variables | 7 |
---|---|
Number of observations | 48,489 |
Missing cells | 9,234 |
Missing cells (%) | 2.7% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 2.6 MiB |
Average record size in memory | 56.0 B |
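The missing-cell percentage above can be re-derived from the other figures in this table: missing cells divided by the total number of cells (observations × variables). A minimal sketch of that arithmetic follows; the function name is illustrative and not part of ydata-profiling.

```python
def missing_cell_pct(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Return missing cells as a percentage of all cells in the table."""
    return 100.0 * missing_cells / (n_rows * n_cols)


# Figures taken from the dataset statistics above.
pct = missing_cell_pct(missing_cells=9234, n_rows=48489, n_cols=7)
print(f"{pct:.1f}%")  # prints 2.7%
```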
Variable types
Text | 2 |
---|---|
Categorical | 2 |
Numeric | 3 |
type_of_the_bulb is highly imbalanced (67.1%) | Imbalance |
wattage_of_the_bulb has 9234 (19.0%) missing values | Missing |
wattage_of_the_bulb has 8576 (17.7%) zeros | Zeros |
no_of_hours_bulb_was_on_during_daytime_last_week has 42744 (88.2%) zeros | Zeros |
no_of_hours_bulb_was_on_during_night_last_week has 13926 (28.7%) zeros | Zeros |
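The missing and zero percentages in these alerts are plain shares of the 48,489 observations, and can be checked as sketched below. The imbalance function is an assumption: it uses one minus the normalized Shannon entropy of the class counts, which is believed to match how the report scores imbalance, but the per-class counts of type_of_the_bulb are not shown here, so the 67.1% figure cannot be re-derived from this section.

```python
import math
from typing import Sequence

N_ROWS = 48489  # number of observations reported in the overview


def share(count: int, total: int = N_ROWS) -> str:
    """Format a count as a percentage of the total, with one decimal."""
    return f"{100.0 * count / total:.1f}%"


def imbalance_score(class_counts: Sequence[int]) -> float:
    """One minus normalized Shannon entropy: 0 is balanced, 1 is one-class.

    Assumption: this mirrors the definition behind the report's figure.
    """
    n_classes = len(class_counts)
    if n_classes < 2:
        return 0.0
    total = sum(class_counts)
    probs = [c / total for c in class_counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(n_classes)


print(share(9234))   # wattage_of_the_bulb missing -> 19.0%
print(share(8576))   # wattage_of_the_bulb zeros   -> 17.7%
print(share(42744))  # daytime hours zeros         -> 88.2%
print(share(13926))  # night hours zeros           -> 28.7%
```

For intuition, two equally sized classes score 0, and the score rises toward 1 as one class dominates.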
Reproduction
Analysis started | 2024-12-06 05:54:46.469423 |
---|---|
Analysis finished | 2024-12-06 05:54:48.195246 |
Duration | 1.73 seconds |
Software version | ydata-profiling v4.11.0 |
Download configuration | config.json |
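A comparable report could be regenerated along the following lines. This is a sketch under stated assumptions: pandas and ydata-profiling are installed, the data is available as a CSV file, and the report title is a placeholder. The source file is not named in this report, so any path passed in is hypothetical.

```python
def build_report(csv_path: str, html_path: str) -> None:
    """Profile a CSV file and write the HTML report to html_path."""
    # Imported lazily so this module loads even where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    # The title is a placeholder; the original report's title is not known.
    profile = ProfileReport(df, title="Profiling Report")
    profile.to_file(html_path)
```

A call such as `build_report("bulbs.csv", "bulbs_report.html")` would write the report next to the script; both file names are illustrative.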
Variables
household_ID
Text
Distinct | 4054 |
---|---|
Distinct (%) | 8.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 378.9 KiB |
Value | Count | Frequency (%) |
id0469 | 228 | 0.5% |
id2033 | 225 | 0.5% |
id0278 | 209 | 0.4% |
id0282 | 182 | 0.4% |
id1589 | 171 | 0.4% |
id1841 | 165 | 0.3% |
id0399 | 162 | 0.3% |
id0069 | 152 | 0.3% |
id0901 | 150 | 0.3% |
id2072 | 144 | 0.3% |
Other values (4044) | 46701 | 96.3% |
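The table above lists the ten most frequent values and folds the remaining 4044 of the 4054 distinct values into one "Other values" row. A minimal sketch of that kind of summary, using only the standard library, is shown below; the function name and the toy data are illustrative, not taken from ydata-profiling.

```python
from collections import Counter


def top_values(values, k=10):
    """Return (rows, other) for a top-k frequency table.

    rows  : list of (value, count, percent) for the k most common values
    other : (distinct_values_left, their_total_count, their_percent)
    """
    counts = Counter(values)
    total = sum(counts.values())
    if total == 0:
        return [], (0, 0, 0.0)
    top = counts.most_common(k)
    rows = [(value, count, 100.0 * count / total) for value, count in top]
    other_count = total - sum(count for _, count in top)
    other_distinct = len(counts) - len(top)
    return rows, (other_distinct, other_count, 100.0 * other_count / total)


# Toy data only; the real column has 48,489 rows.
sample = ["id0469"] * 3 + ["id2033"] * 2 + ["id0278"]
rows, other = top_values(sample, k=2)
for value, count, pct in rows:
    print(f"{value} | {count} | {pct:.1f}% |")
print(f"Other values ({other[0]}) | {other[1]} | {other[2]:.1f}% |")
```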
Most occurring characters
Value | Count | Frequency (%) |
I | 48489 | 16.7% |
D | 48489 | 16.7% |
0 | 30565 | 10.5% |
1 | 27712 | 9.5% |
2 | 25976 | 8.9% |
3 | 22658 | 7.8% |
4 | 15122 | 5.2% |
6 | 14938 | 5.1% |
7 | 14483 | 5.0% |
8 | 14336 | 4.9% |
Other values (2) | 28166 | 9.7% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 193956 | 66.7% |
Uppercase Letter | 96978 | 33.3% |
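The category counts above are consistent with every household_ID being two uppercase letters followed by four digits: 48,489 × 4 = 193,956 decimal digits and 48,489 × 2 = 96,978 letters. The sketch below shows one way to tally Unicode general categories with the standard library. The helper name is illustrative, and the sample IDs are written in uppercase on the assumption that the character tables reflect the raw values.

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Nd": "Decimal Number",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
}


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not named above.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


sample = ["ID0469", "ID2033"]  # illustrative IDs in the six-character pattern
result = category_counts(sample)
print(result["Decimal Number"], result["Uppercase Letter"])  # prints 8 4
```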
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 30565 | 15.8% |
1 | 27712 | 14.3% |
2 | 25976 | 13.4% |
3 | 22658 | 11.7% |
4 | 15122 | 7.8% |
6 | 14938 | 7.7% |
7 | 14483 | 7.5% |
8 | 14336 | 7.4% |
9 | 14213 | 7.3% |
5 | 13953 | 7.2% |
Uppercase Letter
Value | Count | Frequency (%) |
I | 48489 | 50.0% |
D | 48489 | 50.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 193956 | 66.7% |
Latin | 96978 | 33.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 30565 | 15.8% |
1 | 27712 | 14.3% |
2 | 25976 | 13.4% |
3 | 22658 | 11.7% |
4 | 15122 | 7.8% |
6 | 14938 | 7.7% |
7 | 14483 | 7.5% |
8 | 14336 | 7.4% |
9 | 14213 | 7.3% |
5 | 13953 | 7.2% |
Latin
Value | Count | Frequency (%) |
I | 48489 | 50.0% |
D | 48489 | 50.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 290934 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
I | 48489 | 16.7% |
D | 48489 | 16.7% |
0 | 30565 | 10.5% |
1 | 27712 | 9.5% |
2 | 25976 | 8.9% |
3 | 22658 | 7.8% |
4 | 15122 | 5.2% |
6 | 14938 | 5.1% |
7 | 14483 | 5.0% |
8 | 14336 | 4.9% |
Other values (2) | 28166 | 9.7% |
room_ID
Categorical
Distinct | 32 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 378.9 KiB |
Top categories (bar chart not preserved): I1, I2, I3, I4, I5, Other values (27)
Common Values
Value | Count | Frequency (%) |
I1 | 10095 | 20.8% |
I2 | 5696 | 11.7% |
I3 | 5097 | 10.5% |
I4 | 4918 | 10.1% |
I5 | 4649 | 9.6% |
I6 | 4066 | 8.4% |
I7 | 3443 | 7.1% |
I8 | 2533 | 5.2% |
I9 | 2033 | 4.2% |
I10 | 1347 | 2.8% |
Other values (22) | 4612 | 9.5% |
Length (histogram not preserved)
Words
Value | Count | Frequency (%) |
i1 | 10095 | 20.8% |
i2 | 5696 | 11.7% |
i3 | 5097 | 10.5% |
i4 | 4918 | 10.1% |
i5 | 4649 | 9.6% |
i6 | 4066 | 8.4% |
i7 | 3443 | 7.1% |
i8 | 2533 | 5.2% |
i9 | 2033 | 4.2% |
i10 | 1347 | 2.8% |
Other values (22) | 4612 | 9.5% |
Most occurring characters
Value | Count | Frequency (%) |
I | 48489 | 46.5% |
1 | 16872 | 16.2% |
2 | 6582 | 6.3% |
3 | 5767 | 5.5% |
4 | 5374 | 5.1% |
5 | 5009 | 4.8% |
6 | 4310 | 4.1% |
7 | 3627 | 3.5% |
8 | 2666 | 2.6% |
9 | 2142 | 2.1% |
Other values (4) | 3531 | 3.4% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 53750 | 51.5% |
Uppercase Letter | 49199 | 47.1% |
Lowercase Letter | 1420 | 1.4% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 16872 | 31.4% |
2 | 6582 | 12.2% |
3 | 5767 | 10.7% |
4 | 5374 | 10.0% |
5 | 5009 | 9.3% |
6 | 4310 | 8.0% |
7 | 3627 | 6.7% |
8 | 2666 | 5.0% |
9 | 2142 | 4.0% |
0 | 1401 | 2.6% |
Uppercase Letter
Value | Count | Frequency (%) |
I | 48489 | 98.6% |
O | 710 | 1.4% |
Lowercase Letter
Value | Count | Frequency (%) |
t | 710 | 50.0% |
h | 710 | 50.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 53750 | 51.5% |
Latin | 50619 | 48.5% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 16872 | 31.4% |
2 | 6582 | 12.2% |
3 | 5767 | 10.7% |
4 | 5374 | 10.0% |
5 | 5009 | 9.3% |
6 | 4310 | 8.0% |
7 | 3627 | 6.7% |
8 | 2666 | 5.0% |
9 | 2142 | 4.0% |
0 | 1401 | 2.6% |
Latin
Value | Count | Frequency (%) |
I | 48489 | 95.8% |
O | 710 | 1.4% |
t | 710 | 1.4% |
h | 710 | 1.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 104369 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
I | 48489 | 46.5% |
1 | 16872 | 16.2% |
2 | 6582 | 6.3% |
3 | 5767 | 5.5% |
4 | 5374 | 5.1% |
5 | 5009 | 4.8% |
6 | 4310 | 4.1% |
7 | 3627 | 3.5% |
8 | 2666 | 2.6% |
9 | 2142 | 2.1% |
Other values (4) | 3531 | 3.4% |
light_ID
Text
Distinct | 412 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 378.9 KiB |
Value | Count | Frequency (%) |
i1_l1 | 4017 | 8.3% |
i2_l1 | 3928 | 8.1% |
i3_l1 | 3797 | 7.8% |
i4_l1 | 3683 | 7.6% |
i5_l1 | 3395 | 7.0% |
i6_l1 | 2877 | 5.9% |
i7_l1 | 2200 | 4.5% |
i1_l2 | 2160 | 4.5% |
i8_l1 | 1567 | 3.2% |
i1_l3 | 1237 | 2.6% |
Other values (402) | 19628 | 40.5% |