Overview
Brought to you by YData
Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 1,171 |
| Missing cells | 909 |
| Missing cells (%) | 7.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 91.6 KiB |
| Average record size in memory | 80.1 B |
Variable types
| Text | 2 |
|---|---|
| Categorical | 3 |
| Boolean | 1 |
| Numeric | 4 |
is_room_fully_sealed is highly imbalanced (64.1%) | Imbalance |
wattage_of_the_ac has 356 (30.4%) missing values | Missing |
btu_of_the_ac has 553 (47.2%) missing values | Missing |
wattage_of_the_ac has 610 (52.1%) zeros | Zeros |
btu_of_the_ac has 67 (5.7%) zeros | Zeros |
no_of_hours_ac_was_on_during_daytime_last_week has 993 (84.8%) zeros | Zeros |
no_of_hours_ac_was_on_during_night_last_week has 499 (42.6%) zeros | Zeros |
Reproduction
| Analysis started | 2024-12-06 05:54:54.739615 |
|---|---|
| Analysis finished | 2024-12-06 05:54:56.671088 |
| Duration | 1.93 second |
| Software version | ydata-profiling vv4.11.0 |
| Download configuration | config.json |
Variables
household_ID
Text
| Distinct | 562 |
|---|---|
| Distinct (%) | 48.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.3 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 312 ? |
|---|---|
| Unique (%) | 26.6% |
Sample
| 1st row | ID0012 |
|---|---|
| 2nd row | ID0014 |
| 3rd row | ID0018 |
| 4th row | ID0025 |
| 5th row | ID0039 |
| Value | Count | Frequency (%) |
| id1214 | 28 | 2.4% |
| id2985 | 26 | 2.2% |
| id0282 | 26 | 2.2% |
| id1816 | 24 | 2.0% |
| id2072 | 24 | 2.0% |
| id0278 | 11 | 0.9% |
| id1227 | 9 | 0.8% |
| id0083 | 9 | 0.8% |
| id2660 | 8 | 0.7% |
| id0348 | 7 | 0.6% |
| Other values (552) | 999 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 1171 | |
| D | 1171 | |
| 2 | 758 | |
| 0 | 662 | |
| 1 | 634 | |
| 3 | 590 | |
| 7 | 407 | 5.8% |
| 8 | 371 | 5.3% |
| 4 | 365 | 5.2% |
| 6 | 342 | 4.9% |
| Other values (2) | 555 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4684 | |
| Uppercase Letter | 2342 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 758 | |
| 0 | 662 | |
| 1 | 634 | |
| 3 | 590 | |
| 7 | 407 | |
| 8 | 371 | |
| 4 | 365 | |
| 6 | 342 | |
| 5 | 281 | 6.0% |
| 9 | 274 | 5.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1171 | |
| D | 1171 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4684 | |
| Latin | 2342 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 758 | |
| 0 | 662 | |
| 1 | 634 | |
| 3 | 590 | |
| 7 | 407 | |
| 8 | 371 | |
| 4 | 365 | |
| 6 | 342 | |
| 5 | 281 | 6.0% |
| 9 | 274 | 5.8% |
Latin
| Value | Count | Frequency (%) |
| I | 1171 | |
| D | 1171 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7026 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 1171 | |
| D | 1171 | |
| 2 | 758 | |
| 0 | 662 | |
| 1 | 634 | |
| 3 | 590 | |
| 7 | 407 | 5.8% |
| 8 | 371 | 5.3% |
| 4 | 365 | 5.2% |
| 6 | 342 | 4.9% |
| Other values (2) | 555 |
room_ID
Categorical
| Distinct | 23 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.3 KiB |
| I2 | |
|---|---|
| I3 | |
| I4 | |
| I5 | |
| I6 | |
| Other values (18) |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 2.1229718 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | I3 |
|---|---|
| 2nd row | I3 |
| 3rd row | I5 |
| 4th row | I2 |
| 5th row | I8 |
Common Values
| Value | Count | Frequency (%) |
| I2 | 282 | |
| I3 | 255 | |
| I4 | 176 | |
| I5 | 115 | |
| I6 | 76 | 6.5% |
| I1 | 58 | 5.0% |
| I7 | 38 | 3.2% |
| I8 | 30 | 2.6% |
| I11 | 25 | 2.1% |
| I10 | 22 | 1.9% |
| Other values (13) | 94 | 8.0% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| i2 | 282 | |
| i3 | 255 | |
| i4 | 176 | |
| i5 | 115 | |
| i6 | 76 | 6.5% |
| i1 | 58 | 5.0% |
| i7 | 38 | 3.2% |
| i8 | 30 | 2.6% |
| i11 | 25 | 2.1% |
| i10 | 22 | 1.9% |
| Other values (13) | 94 | 8.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 1171 | |
| 2 | 299 | 12.0% |
| 3 | 277 | 11.1% |
| 1 | 198 | 8.0% |
| 4 | 186 | 7.5% |
| 5 | 124 | 5.0% |
| 6 | 80 | 3.2% |
| 7 | 40 | 1.6% |
| 8 | 34 | 1.4% |
| 0 | 23 | 0.9% |
| Other values (4) | 54 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1282 | |
| Uppercase Letter | 1182 | |
| Lowercase Letter | 22 | 0.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 299 | |
| 3 | 277 | |
| 1 | 198 | |
| 4 | 186 | |
| 5 | 124 | |
| 6 | 80 | 6.2% |
| 7 | 40 | 3.1% |
| 8 | 34 | 2.7% |
| 0 | 23 | 1.8% |
| 9 | 21 | 1.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1171 | |
| O | 11 | 0.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 11 | |
| h | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1282 | |
| Latin | 1204 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 299 | |
| 3 | 277 | |
| 1 | 198 | |
| 4 | 186 | |
| 5 | 124 | |
| 6 | 80 | 6.2% |
| 7 | 40 | 3.1% |
| 8 | 34 | 2.7% |
| 0 | 23 | 1.8% |
| 9 | 21 | 1.6% |
Latin
| Value | Count | Frequency (%) |
| I | 1171 | |
| O | 11 | 0.9% |
| t | 11 | 0.9% |
| h | 11 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2486 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 1171 | |
| 2 | 299 | 12.0% |
| 3 | 277 | 11.1% |
| 1 | 198 | 8.0% |
| 4 | 186 | 7.5% |
| 5 | 124 | 5.0% |
| 6 | 80 | 3.2% |
| 7 | 40 | 1.6% |
| 8 | 34 | 1.4% |
| 0 | 23 | 0.9% |
| Other values (4) | 54 | 2.2% |
ac_ID
Text
| Distinct | 51 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.3 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 6.1229718 |
| Min length | 6 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | I3_AC1 |
|---|---|
| 2nd row | I3_AC1 |
| 3rd row | I5_AC1 |
| 4th row | I2_AC1 |
| 5th row | I8_AC1 |
| Value | Count | Frequency (%) |
| i2_ac1 | 273 | |
| i3_ac1 | 245 | |
| i4_ac1 | 168 | |
| i5_ac1 | 107 | 9.1% |
| i6_ac1 | 68 | 5.8% |
| i1_ac1 | 47 | 4.0% |
| i7_ac1 | 32 | 2.7% |
| i11_ac1 | 22 | 1.9% |
| i8_ac1 | 22 | 1.9% |
| i10_ac1 | 18 | 1.5% |
| Other values (41) | 169 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1284 | |
| I | 1171 | |
| _ | 1171 | |
| C | 1171 | |
| A | 1171 | |
| 2 | 356 | 5.0% |
| 3 | 304 | 4.2% |
| 4 | 187 | 2.6% |
| 5 | 124 | 1.7% |
| 6 | 80 | 1.1% |
| Other values (7) | 151 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3524 | |
| Decimal Number | 2453 | |
| Connector Punctuation | 1171 | 16.3% |
| Lowercase Letter | 22 | 0.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1284 | |
| 2 | 356 | 14.5% |
| 3 | 304 | 12.4% |
| 4 | 187 | 7.6% |
| 5 | 124 | 5.1% |
| 6 | 80 | 3.3% |
| 7 | 40 | 1.6% |
| 8 | 34 | 1.4% |
| 0 | 23 | 0.9% |
| 9 | 21 | 0.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1171 | |
| C | 1171 | |
| A | 1171 | |
| O | 11 | 0.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 11 | |
| h | 11 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1171 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3624 | |
| Latin | 3546 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1284 | |
| _ | 1171 | |
| 2 | 356 | 9.8% |
| 3 | 304 | 8.4% |
| 4 | 187 | 5.2% |
| 5 | 124 | 3.4% |
| 6 | 80 | 2.2% |
| 7 | 40 | 1.1% |
| 8 | 34 | 0.9% |
| 0 | 23 | 0.6% |
Latin
| Value | Count | Frequency (%) |
| I | 1171 | |
| C | 1171 | |
| A | 1171 | |
| O | 11 | 0.3% |
| t | 11 | 0.3% |
| h | 11 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7170 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1284 | |
| I | 1171 | |
| _ | 1171 | |
| C | 1171 | |
| A | 1171 | |
| 2 | 356 | 5.0% |
| 3 | 304 | 4.2% |
| 4 | 187 | 2.6% |
| 5 | 124 | 1.7% |
| 6 | 80 | 1.1% |
| Other values (7) | 151 | 2.1% |
type_of_the_ac
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.3 KiB |
| Individual AC with two components | |
|---|---|
| Central AC (Only to your Household) | |
| Individual AC with one component | |
| Central AC (For the whole building) | 14 |
| Other | 12 |
Length
| Max length | 35 |
|---|---|
| Median length | 33 |
| Mean length | 32.930828 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Individual AC with one component |
|---|---|
| 2nd row | Other |
| 3rd row | Individual AC with two components |
| 4th row | Central AC (Only to your Household) |
| 5th row | Individual AC with two components |
Common Values
| Value | Count | Frequency (%) |
| Individual AC with two components | 596 | |
| Central AC (Only to your Household) | 332 | |
| Individual AC with one component | 207 | 17.7% |
| Central AC (For the whole building) | 14 | 1.2% |
| Other | 12 | 1.0% |
| Air Cooler | 10 | 0.9% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| ac | 1149 | |
| individual | 803 | |
| with | 803 | |
| two | 596 | |
| components | 596 | |
| central | 346 | 5.7% |
| only | 332 | 5.4% |
| to | 332 | 5.4% |
| your | 332 | 5.4% |
| household | 332 | 5.4% |
| Other values (9) | 502 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4952 | 12.8% | |
| o | 3785 | 9.8% |
| n | 3308 | 8.6% |
| t | 2906 | 7.5% |
| i | 2447 | 6.3% |
| d | 1952 | 5.1% |
| l | 1851 | 4.8% |
| e | 1738 | 4.5% |
| C | 1505 | 3.9% |
| u | 1481 | 3.8% |
| Other values (19) | 12637 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28761 | |
| Space Separator | 4952 | 12.8% |
| Uppercase Letter | 4157 | 10.8% |
| Open Punctuation | 346 | 0.9% |
| Close Punctuation | 346 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3785 | |
| n | 3308 | |
| t | 2906 | |
| i | 2447 | 8.5% |
| d | 1952 | 6.8% |
| l | 1851 | 6.4% |
| e | 1738 | 6.0% |
| u | 1481 | 5.1% |
| w | 1413 | 4.9% |
| h | 1175 | 4.1% |
| Other values (10) | 6705 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1505 | |
| A | 1159 | |
| I | 803 | |
| O | 344 | 8.3% |
| H | 332 | 8.0% |
| F | 14 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 4952 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 346 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 346 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32918 | |
| Common | 5644 | 14.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3785 | 11.5% |
| n | 3308 | 10.0% |
| t | 2906 | 8.8% |
| i | 2447 | 7.4% |
| d | 1952 | 5.9% |
| l | 1851 | 5.6% |
| e | 1738 | 5.3% |
| C | 1505 | 4.6% |
| u | 1481 | 4.5% |
| w | 1413 | 4.3% |
| Other values (16) | 10532 |
Common
| Value | Count | Frequency (%) |
| 4952 | ||
| ( | 346 | 6.1% |
| ) | 346 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38562 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4952 | 12.8% | |
| o | 3785 | 9.8% |
| n | 3308 | 8.6% |
| t | 2906 | 7.5% |
| i | 2447 | 6.3% |
| d | 1952 | 5.1% |
| l | 1851 | 4.8% |
| e | 1738 | 4.5% |
| C | 1505 | 3.9% |
| u | 1481 | 3.8% |
| Other values (19) | 12637 |
is_the_ac_inverter_or_not
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.3 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 748 | |
| False | 423 |
is_room_fully_sealed
Categorical
Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.3 KiB |
| The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | |
|---|---|
| The room is fully closed. However, it is not fully sealed. Therefore, when the AC is on, the cool air may leak through the spaces that are not sealed such as the space in-between the door and the door frame. | 96 |
| The room is not fully closed. There are spaces where the cool air can leak out. | 28 |
Length
| Max length | 207 |
|---|---|
| Median length | 142 |
| Mean length | 145.82237 |
| Min length | 79 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. |
|---|---|
| 2nd row | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. |
| 3rd row | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. |
| 4th row | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. |
| 5th row | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. |
Common Values
| Value | Count | Frequency (%) |
| The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | 1047 | |
| The room is fully closed. However, it is not fully sealed. Therefore, when the AC is on, the cool air may leak through the spaces that are not sealed such as the space in-between the door and the door frame. | 96 | 8.2% |
| The room is not fully closed. There are spaces where the cool air can leak out. | 28 | 2.4% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| the | 4916 | 13.8% |
| room | 2218 | 6.2% |
| and | 2190 | 6.1% |
| is | 1363 | 3.8% |
| not | 1267 | 3.5% |
| fully | 1267 | 3.5% |
| sealed | 1239 | 3.5% |
| closed | 1171 | 3.3% |
| cool | 1171 | 3.3% |
| are | 1171 | 3.3% |
| Other values (30) | 17725 |
Most occurring characters
| Value | Count | Frequency (%) |
| 34527 | ||
| e | 19528 | |
| o | 18388 | |
| n | 11198 | 6.6% |
| t | 9708 | 5.7% |
| d | 7933 | 4.6% |
| h | 7642 | 4.5% |
| a | 7574 | 4.4% |
| s | 7450 | 4.4% |
| r | 7382 | 4.3% |
| Other values (21) | 39428 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 127638 | |
| Space Separator | 34527 | 20.2% |
| Uppercase Letter | 4724 | 2.8% |
| Other Punctuation | 3773 | 2.2% |
| Dash Punctuation | 96 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 19528 | |
| o | 18388 | |
| n | 11198 | |
| t | 9708 | 7.6% |
| d | 7933 | 6.2% |
| h | 7642 | 6.0% |
| a | 7574 | 5.9% |
| s | 7450 | 5.8% |
| r | 7382 | 5.8% |
| l | 6239 | 4.9% |
| Other values (12) | 24596 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1295 | |
| C | 1143 | |
| A | 1143 | |
| W | 1047 | |
| H | 96 | 2.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2438 | |
| , | 1335 |
Space Separator
| Value | Count | Frequency (%) |
| 34527 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 96 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 132362 | |
| Common | 38396 | 22.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 19528 | |
| o | 18388 | |
| n | 11198 | 8.5% |
| t | 9708 | 7.3% |
| d | 7933 | 6.0% |
| h | 7642 | 5.8% |
| a | 7574 | 5.7% |
| s | 7450 | 5.6% |
| r | 7382 | 5.6% |
| l | 6239 | 4.7% |
| Other values (17) | 29320 |
Common
| Value | Count | Frequency (%) |
| 34527 | ||
| . | 2438 | 6.3% |
| , | 1335 | 3.5% |
| - | 96 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 170758 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 34527 | ||
| e | 19528 | |
| o | 18388 | |
| n | 11198 | 6.6% |
| t | 9708 | 5.7% |
| d | 7933 | 4.6% |
| h | 7642 | 4.5% |
| a | 7574 | 4.4% |
| s | 7450 | 4.4% |
| r | 7382 | 4.3% |
| Other values (21) | 39428 |
wattage_of_the_ac
Real number (ℝ)
Missing  Zeros 
| Distinct | 45 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 356 |
| Missing (%) | 30.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 571.87117 |
| Minimum | 0 |
|---|---|
| Maximum | 18000 |
| Zeros | 610 |
| Zeros (%) | 52.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2000 |
| Maximum | 18000 |
| Range | 18000 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 2337.6704 |
|---|---|
| Coefficient of variation (CV) | 4.0877571 |
| Kurtosis | 23.630431 |
| Mean | 571.87117 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.8141448 |
| Sum | 466075 |
| Variance | 5464703 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=45)
| Value | Count | Frequency (%) |
| 0 | 610 | |
| 12 | 31 | 2.6% |
| 55 | 23 | 2.0% |
| 12000 | 18 | 1.5% |
| 18 | 16 | 1.4% |
| 9000 | 10 | 0.9% |
| 1000 | 8 | 0.7% |
| 25 | 7 | 0.6% |
| 24 | 7 | 0.6% |
| 1100 | 7 | 0.6% |
| Other values (35) | 78 | 6.7% |
| (Missing) | 356 |
| Value | Count | Frequency (%) |
| 0 | 610 | |
| 1 | 2 | 0.2% |
| 2 | 1 | 0.1% |
| 5 | 2 | 0.2% |
| 9 | 5 | 0.4% |
| 10 | 2 | 0.2% |
| 12 | 31 | 2.6% |
| 13 | 3 | 0.3% |
| 15 | 5 | 0.4% |
| 18 | 16 | 1.4% |
| Value | Count | Frequency (%) |
| 18000 | 3 | 0.3% |
| 12000 | 18 | |
| 9000 | 10 | |
| 8000 | 2 | 0.2% |
| 6000 | 1 | 0.1% |
| 5800 | 1 | 0.1% |
| 5000 | 2 | 0.2% |
| 2500 | 1 | 0.1% |
| 2000 | 4 | 0.3% |
| 1890 | 2 | 0.2% |
btu_of_the_ac
Real number (ℝ)
Missing  Zeros 
| Distinct | 29 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 553 |
| Missing (%) | 47.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9564.0761 |
| Minimum | 0 |
|---|---|
| Maximum | 100000 |
| Zeros | 67 |
| Zeros (%) | 5.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1000 |
| median | 11000 |
| Q3 | 12000 |
| 95-th percentile | 18000 |
| Maximum | 100000 |
| Range | 100000 |
| Interquartile range (IQR) | 11000 |
Descriptive statistics
| Standard deviation | 10532.706 |
|---|---|
| Coefficient of variation (CV) | 1.1012779 |
| Kurtosis | 32.504953 |
| Mean | 9564.0761 |
| Median Absolute Deviation (MAD) | 2000 |
| Skewness | 4.6239866 |
| Sum | 5910599 |
| Variance | 1.109379 × 108 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=29)
| Value | Count | Frequency (%) |
| 12000 | 233 | |
| 1000 | 109 | 9.3% |
| 9000 | 71 | 6.1% |
| 0 | 67 | 5.7% |
| 18000 | 39 | 3.3% |
| 8000 | 18 | 1.5% |
| 10000 | 16 | 1.4% |
| 24000 | 12 | 1.0% |
| 2000 | 9 | 0.8% |
| 5000 | 5 | 0.4% |
| Other values (19) | 39 | 3.3% |
| (Missing) | 553 |
| Value | Count | Frequency (%) |
| 0 | 67 | |
| 1000 | 109 | |
| 1200 | 3 | 0.3% |
| 1600 | 2 | 0.2% |
| 1800 | 2 | 0.2% |
| 2000 | 9 | 0.8% |
| 2200 | 2 | 0.2% |
| 2400 | 2 | 0.2% |
| 3000 | 1 | 0.1% |
| 5000 | 5 | 0.4% |
| Value | Count | Frequency (%) |
| 100000 | 2 | 0.2% |
| 90000 | 1 | 0.1% |
| 80000 | 3 | 0.3% |
| 60000 | 2 | 0.2% |
| 41000 | 1 | 0.1% |
| 40000 | 2 | 0.2% |
| 24000 | 12 | 1.0% |
| 20000 | 3 | 0.3% |
| 18000 | 39 | |
| 16000 | 3 | 0.3% |
no_of_hours_ac_was_on_during_daytime_last_week
Real number (ℝ)
Zeros 
| Distinct | 27 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.2492143 |
| Minimum | 0 |
|---|---|
| Maximum | 70 |
| Zeros | 993 |
| Zeros (%) | 84.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 14 |
| Maximum | 70 |
| Range | 70 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 7.63311 |
|---|---|
| Coefficient of variation (CV) | 3.3936783 |
| Kurtosis | 29.528784 |
| Mean | 2.2492143 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.9085963 |
| Sum | 2633.83 |
| Variance | 58.264368 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=27)
| Value | Count | Frequency (%) |
| 0 | 993 | |
| 7 | 37 | 3.2% |
| 14 | 31 | 2.6% |
| 28 | 16 | 1.4% |
| 21 | 12 | 1.0% |
| 2 | 10 | 0.9% |
| 1 | 8 | 0.7% |
| 4 | 7 | 0.6% |
| 35 | 7 | 0.6% |
| 42 | 7 | 0.6% |
| Other values (17) | 43 | 3.7% |
| Value | Count | Frequency (%) |
| 0 | 993 | |
| 0.33 | 1 | 0.1% |
| 0.5 | 2 | 0.2% |
| 1 | 8 | 0.7% |
| 2 | 10 | 0.9% |
| 3 | 5 | 0.4% |
| 3.5 | 6 | 0.5% |
| 4 | 7 | 0.6% |
| 5 | 2 | 0.2% |
| 6 | 5 | 0.4% |
| Value | Count | Frequency (%) |
| 70 | 4 | 0.3% |
| 49 | 1 | 0.1% |
| 48 | 1 | 0.1% |
| 42 | 7 | |
| 35 | 7 | |
| 28 | 16 | |
| 24 | 1 | 0.1% |
| 21 | 12 | |
| 20 | 1 | 0.1% |
| 18 | 1 | 0.1% |
no_of_hours_ac_was_on_during_night_last_week
Real number (ℝ)
Zeros 
| Distinct | 65 |
|---|---|
| Distinct (%) | 5.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.361085 |
| Minimum | 0 |
|---|---|
| Maximum | 98 |
| Zeros | 499 |
| Zeros (%) | 42.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 9.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 4 |
| Q3 | 30 |
| 95-th percentile | 70 |
| Maximum | 98 |
| Range | 98 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 23.892416 |
|---|---|
| Coefficient of variation (CV) | 1.3762053 |
| Kurtosis | 1.0044618 |
| Mean | 17.361085 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.3788235 |
| Sum | 20329.83 |
| Variance | 570.84753 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 499 | |
| 14 | 87 | 7.4% |
| 42 | 63 | 5.4% |
| 7 | 61 | 5.2% |
| 56 | 60 | 5.1% |
| 21 | 40 | 3.4% |
| 28 | 39 | 3.3% |
| 35 | 27 | 2.3% |
| 70 | 24 | 2.0% |
| 30 | 23 | 2.0% |
| Other values (55) | 248 |
| Value | Count | Frequency (%) |
| 0 | 499 | |
| 0.25 | 3 | 0.3% |
| 0.5 | 4 | 0.3% |
| 0.75 | 1 | 0.1% |
| 0.99 | 1 | 0.1% |
| 1 | 19 | 1.6% |
| 1.5 | 3 | 0.3% |
| 1.66 | 1 | 0.1% |
| 1.75 | 1 | 0.1% |
| 2 | 18 | 1.5% |
| Value | Count | Frequency (%) |
| 98 | 14 | |
| 90 | 1 | 0.1% |
| 86 | 1 | 0.1% |
| 85 | 1 | 0.1% |
| 84 | 10 | |
| 81 | 1 | 0.1% |
| 80 | 3 | 0.3% |
| 77 | 2 | 0.2% |
| 72 | 2 | 0.2% |
| 70.5 | 1 | 0.1% |
Interactions
Correlations
| btu_of_the_ac | is_room_fully_sealed | is_the_ac_inverter_or_not | no_of_hours_ac_was_on_during_daytime_last_week | no_of_hours_ac_was_on_during_night_last_week | room_ID | type_of_the_ac | wattage_of_the_ac | |
|---|---|---|---|---|---|---|---|---|
| btu_of_the_ac | 1.000 | 0.090 | 0.042 | 0.056 | 0.189 | 0.152 | 0.230 | -0.171 |
| is_room_fully_sealed | 0.090 | 1.000 | 0.000 | 0.000 | 0.085 | 0.085 | 0.221 | 0.169 |
| is_the_ac_inverter_or_not | 0.042 | 0.000 | 1.000 | 0.000 | 0.101 | 0.122 | 0.259 | 0.000 |
| no_of_hours_ac_was_on_during_daytime_last_week | 0.056 | 0.000 | 0.000 | 1.000 | 0.233 | 0.084 | 0.040 | 0.039 |
| no_of_hours_ac_was_on_during_night_last_week | 0.189 | 0.085 | 0.101 | 0.233 | 1.000 | 0.000 | 0.071 | 0.200 |
| room_ID | 0.152 | 0.085 | 0.122 | 0.084 | 0.000 | 1.000 | 0.101 | 0.059 |
| type_of_the_ac | 0.230 | 0.221 | 0.259 | 0.040 | 0.071 | 0.101 | 1.000 | 0.114 |
| wattage_of_the_ac | -0.171 | 0.169 | 0.000 | 0.039 | 0.200 | 0.059 | 0.114 | 1.000 |
Missing values
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Sample
| household_ID | room_ID | ac_ID | type_of_the_ac | is_the_ac_inverter_or_not | is_room_fully_sealed | wattage_of_the_ac | btu_of_the_ac | no_of_hours_ac_was_on_during_daytime_last_week | no_of_hours_ac_was_on_during_night_last_week | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | ID0012 | I3 | I3_AC1 | Individual AC with one component | Yes | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | 1500.0 | NaN | 0.0 | 45.0 |
| 1 | ID0014 | I3 | I3_AC1 | Other | No | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | 0.0 | NaN | 0.0 | 0.0 |
| 2 | ID0018 | I5 | I5_AC1 | Individual AC with two components | Yes | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | 2000.0 | NaN | 0.0 | 56.0 |
| 3 | ID0025 | I2 | I2_AC1 | Central AC (Only to your Household) | Yes | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | 0.0 | NaN | 0.0 | 0.0 |
| 4 | ID0039 | I8 | I8_AC1 | Individual AC with two components | Yes | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | 0.0 | NaN | 12.0 | 12.0 |
| 5 | ID0039 | I9 | I9_AC1 | Individual AC with two components | Yes | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | 0.0 | 9000.0 | 12.0 | 12.0 |
| 6 | ID0041 | I8 | I8_AC1 | Air Cooler | Yes | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | 0.0 | NaN | 0.0 | 21.0 |
| 7 | ID0043 | I3 | I3_AC1 | Individual AC with two components | No | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | 0.0 | NaN | 0.0 | 0.0 |
| 8 | ID0043 | I18 | I18_AC1 | Individual AC with two components | No | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | 0.0 | NaN | 42.0 | 21.0 |
| 9 | ID0043 | I19 | I19_AC1 | Individual AC with two components | No | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | 0.0 | NaN | 42.0 | 42.0 |
| household_ID | room_ID | ac_ID | type_of_the_ac | is_the_ac_inverter_or_not | is_room_fully_sealed | wattage_of_the_ac | btu_of_the_ac | no_of_hours_ac_was_on_during_daytime_last_week | no_of_hours_ac_was_on_during_night_last_week | |
|---|---|---|---|---|---|---|---|---|---|---|
| 1161 | ID3910 | I3 | I3_AC1 | Individual AC with two components | Yes | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | NaN | 12000.0 | 0.0 | 56.0 |
| 1162 | ID3910 | I13 | I13_AC1 | Individual AC with two components | No | The room is not fully closed. There are spaces where the cool air can leak out. | NaN | 12000.0 | 0.0 | 4.0 |
| 1163 | ID3910 | I14 | I14_AC1 | Individual AC with two components | Yes | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | NaN | 12000.0 | 0.0 | 21.0 |
| 1164 | ID3910 | I15 | I15_AC1 | Individual AC with two components | Yes | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | NaN | 12000.0 | 0.0 | 0.0 |
| 1165 | ID3943 | I2 | I2_AC1 | Central AC (Only to your Household) | No | The room is fully closed. However, it is not fully sealed. Therefore, when the AC is on, the cool air may leak through the spaces that are not sealed such as the space in-between the door and the door frame. | NaN | NaN | 0.0 | 0.0 |
| 1166 | ID3950 | I1 | I1_AC1 | Individual AC with one component | No | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | NaN | 1000.0 | 0.0 | 2.0 |
| 1167 | ID4015 | I2 | I2_AC1 | Individual AC with one component | Yes | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | NaN | NaN | 0.0 | 0.0 |
| 1168 | ID4026 | I2 | I2_AC1 | Individual AC with one component | Yes | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | NaN | NaN | 0.0 | 2.0 |
| 1169 | ID4053 | I2 | I2_AC1 | Individual AC with one component | No | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | NaN | 1000.0 | 7.0 | 7.0 |
| 1170 | ID4055 | I4 | I4_AC1 | Individual AC with one component | Yes | The room can be fully closed and sealed and there are no outside openings. When the AC is turned on, the cool air does not go out of the room. | NaN | 1000.0 | 0.0 | 0.0 |