Overview
Brought to you by YData
Dataset statistics
| Number of variables | 26 |
|---|---|
| Number of observations | 4,063 |
| Missing cells | 27,643 |
| Missing cells (%) | 26.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 825.4 KiB |
| Average record size in memory | 208.0 B |
Variable types
| Text | 1 |
|---|---|
| Numeric | 6 |
| Categorical | 17 |
| Boolean | 2 |
awareness_of_electricity_consumption_of_renters has constant value "I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc." | Constant |
charging_method_of_renters_for_electricity is highly overall correlated with no_of_storeys and 1 other fields | High correlation |
electricity_provider_csc_area is highly overall correlated with type_of_electricity_meter | High correlation |
floor_which_house_located is highly overall correlated with occupy_renters_boarders | High correlation |
highest_level_of_education_of_the_chief_wage_earner is highly overall correlated with socio_economic_class | High correlation |
is_there_business_carried_out_in_the_household is highly overall correlated with type_of_business | High correlation |
no_of_storeys is highly overall correlated with charging_method_of_renters_for_electricity and 1 other fields | High correlation |
occupation_of_the_chief_wage_earner is highly overall correlated with socio_economic_class | High correlation |
occupy_renters_boarders is highly overall correlated with floor_which_house_located and 1 other fields | High correlation |
own_the_house_or_living_on_rent is highly overall correlated with charging_method_of_renters_for_electricity and 1 other fields | High correlation |
socio_economic_class is highly overall correlated with highest_level_of_education_of_the_chief_wage_earner and 1 other fields | High correlation |
type_of_business is highly overall correlated with is_there_business_carried_out_in_the_household and 1 other fields | High correlation |
type_of_electricity_meter is highly overall correlated with electricity_provider_csc_area | High correlation |
own_the_house_or_living_on_rent is highly imbalanced (68.5%) | Imbalance |
occupy_renters_boarders is highly imbalanced (86.2%) | Imbalance |
type_of_house is highly imbalanced (59.8%) | Imbalance |
charged_method_for_rent_for_electricity is highly imbalanced (79.8%) | Imbalance |
is_there_business_carried_out_in_the_household is highly imbalanced (73.2%) | Imbalance |
main_material_used_for_roof_of_the_house is highly imbalanced (50.4%) | Imbalance |
any_constructions_or_renovations_in_the_household is highly imbalanced (71.3%) | Imbalance |
occupy_renters_boarders has 536 (13.2%) missing values | Missing |
awareness_of_electricity_consumption_of_renters has 3959 (97.4%) missing values | Missing |
floor_which_house_located has 3970 (97.7%) missing values | Missing |
no_of_storeys has 3814 (93.9%) missing values | Missing |
charging_method_of_renters_for_electricity has 3959 (97.4%) missing values | Missing |
charged_method_for_rent_for_electricity has 3527 (86.8%) missing values | Missing |
type_of_business has 3877 (95.4%) missing values | Missing |
whom_or_how_the_house_was_designed has 1280 (31.5%) missing values | Missing |
availability_of_certificate_of_compliance has 1280 (31.5%) missing values | Missing |
main_material_used_for_roof_of_the_house has 1280 (31.5%) missing values | Missing |
total_monthly_expenditure_of_last_month has 135 (3.3%) missing values | Missing |
household_ID has unique values | Unique |
Reproduction
| Analysis started | 2024-12-06 05:54:12.129767 |
|---|---|
| Analysis finished | 2024-12-06 05:54:18.907451 |
| Duration | 6.78 seconds |
| Software version | ydata-profiling vv4.11.0 |
| Download configuration | config.json |
Variables
household_ID
Text
Unique 
| Distinct | 4063 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.9 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 4,063 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | ID0001 |
|---|---|
| 2nd row | ID0002 |
| 3rd row | ID0003 |
| 4th row | ID0004 |
| 5th row | ID0005 |
| Value | Count | Frequency (%) |
| id0039 | 1 | < 0.1% |
| id4063 | 1 | < 0.1% |
| id0001 | 1 | < 0.1% |
| id0002 | 1 | < 0.1% |
| id0003 | 1 | < 0.1% |
| id0004 | 1 | < 0.1% |
| id0005 | 1 | < 0.1% |
| id0006 | 1 | < 0.1% |
| id0007 | 1 | < 0.1% |
| id0008 | 1 | < 0.1% |
| Other values (4053) | 4053 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 4063 | |
| D | 4063 | |
| 0 | 2277 | |
| 3 | 2217 | |
| 2 | 2217 | |
| 1 | 2217 | |
| 4 | 1280 | 5.3% |
| 5 | 1216 | 5.0% |
| 6 | 1210 | 5.0% |
| 7 | 1206 | 4.9% |
| Other values (2) | 2412 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16252 | |
| Uppercase Letter | 8126 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2277 | |
| 3 | 2217 | |
| 2 | 2217 | |
| 1 | 2217 | |
| 4 | 1280 | |
| 5 | 1216 | |
| 6 | 1210 | |
| 7 | 1206 | |
| 8 | 1206 | |
| 9 | 1206 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 4063 | |
| D | 4063 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16252 | |
| Latin | 8126 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2277 | |
| 3 | 2217 | |
| 2 | 2217 | |
| 1 | 2217 | |
| 4 | 1280 | |
| 5 | 1216 | |
| 6 | 1210 | |
| 7 | 1206 | |
| 8 | 1206 | |
| 9 | 1206 |
Latin
| Value | Count | Frequency (%) |
| I | 4063 | |
| D | 4063 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24378 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 4063 | |
| D | 4063 | |
| 0 | 2277 | |
| 3 | 2217 | |
| 2 | 2217 | |
| 1 | 2217 | |
| 4 | 1280 | 5.3% |
| 5 | 1216 | 5.0% |
| 6 | 1210 | 5.0% |
| 7 | 1206 | 4.9% |
| Other values (2) | 2412 |
no_of_electricity_meters
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0762983 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.31628519 |
|---|---|
| Coefficient of variation (CV) | 0.29386387 |
| Kurtosis | 51.158272 |
| Mean | 1.0762983 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.6452849 |
| Sum | 4373 |
| Variance | 0.10003632 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 3799 | |
| 2 | 225 | 5.5% |
| 3 | 36 | 0.9% |
| 5 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 3799 | |
| 2 | 225 | 5.5% |
| 3 | 36 | 0.9% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 3 | 36 | 0.9% |
| 2 | 225 | 5.5% |
| 1 | 3799 |
electricity_provider_csc_area
Categorical
High correlation 
| Distinct | 23 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.9 KiB |
| MORATUWA NORTH | |
|---|---|
| MORATUWA SOUTH | |
| PANADURA | |
| GALLE | 216 |
| KESELWATTA | 206 |
| Other values (18) |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 9.6475511 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | GALLE |
|---|---|
| 2nd row | GALLE |
| 3rd row | GALLE |
| 4th row | BORALASGAMUWA |
| 5th row | KOLONNAWA |
Common Values
| Value | Count | Frequency (%) |
| MORATUWA NORTH | 533 | 13.1% |
| MORATUWA SOUTH | 370 | 9.1% |
| PANADURA | 357 | 8.8% |
| GALLE | 216 | 5.3% |
| KESELWATTA | 206 | 5.1% |
| MAHARAGAMA | 202 | 5.0% |
| PAYAGALA | 196 | 4.8% |
| KALUTARA | 189 | 4.7% |
| HIKKADUWA | 163 | 4.0% |
| ALUTHGAMA | 158 | 3.9% |
| Other values (13) | 1473 |
Length
| Value | Count | Frequency (%) |
| moratuwa | 903 | |
| north | 533 | 10.7% |
| south | 370 | 7.5% |
| panadura | 357 | 7.2% |
| galle | 216 | 4.3% |
| keselwatta | 206 | 4.1% |
| maharagama | 202 | 4.1% |
| payagala | 196 | 3.9% |
| kalutara | 189 | 3.8% |
| hikkaduwa | 163 | 3.3% |
| Other values (14) | 1631 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 10055 | |
| T | 3417 | 8.7% |
| O | 2914 | 7.4% |
| U | 2568 | 6.6% |
| R | 2454 | 6.3% |
| M | 2125 | 5.4% |
| L | 1849 | 4.7% |
| W | 1788 | 4.6% |
| N | 1693 | 4.3% |
| H | 1565 | 4.0% |
| Other values (12) | 8770 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 38065 | |
| Space Separator | 903 | 2.3% |
| Dash Punctuation | 230 | 0.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 10055 | |
| T | 3417 | 9.0% |
| O | 2914 | 7.7% |
| U | 2568 | 6.7% |
| R | 2454 | 6.4% |
| M | 2125 | 5.6% |
| L | 1849 | 4.9% |
| W | 1788 | 4.7% |
| N | 1693 | 4.4% |
| H | 1565 | 4.1% |
| Other values (10) | 7637 |
Space Separator
| Value | Count | Frequency (%) |
| 903 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 230 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38065 | |
| Common | 1133 | 2.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 10055 | |
| T | 3417 | 9.0% |
| O | 2914 | 7.7% |
| U | 2568 | 6.7% |
| R | 2454 | 6.4% |
| M | 2125 | 5.6% |
| L | 1849 | 4.9% |
| W | 1788 | 4.7% |
| N | 1693 | 4.4% |
| H | 1565 | 4.1% |
| Other values (10) | 7637 |
Common
| Value | Count | Frequency (%) |
| 903 | ||
| - | 230 | 20.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39198 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 10055 | |
| T | 3417 | 8.7% |
| O | 2914 | 7.4% |
| U | 2568 | 6.6% |
| R | 2454 | 6.3% |
| M | 2125 | 5.4% |
| L | 1849 | 4.7% |
| W | 1788 | 4.6% |
| N | 1693 | 4.3% |
| H | 1565 | 4.0% |
| Other values (12) | 8770 |
own_the_house_or_living_on_rent
Categorical
High correlation  Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.9 KiB |
| Yes, I or a household member owns it. | |
|---|---|
| No, I am living on rent and the rent is paid by me or a household member. | |
| No, I or any household member does not own or rent this household. We occupy this household without any payment of rent. | 50 |
| No, I am living on rent and the rent is paid by the employer. | 4 |
Length
| Max length | 120 |
|---|---|
| Median length | 37 |
| Mean length | 42.315777 |
| Min length | 37 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Yes, I or a household member owns it. |
|---|---|
| 2nd row | Yes, I or a household member owns it. |
| 3rd row | Yes, I or a household member owns it. |
| 4th row | Yes, I or a household member owns it. |
| 5th row | Yes, I or a household member owns it. |
Common Values
| Value | Count | Frequency (%) |
| Yes, I or a household member owns it. | 3527 | |
| No, I am living on rent and the rent is paid by me or a household member. | 482 | 11.9% |
| No, I or any household member does not own or rent this household. We occupy this household without any payment of rent. | 50 | 1.2% |
| No, I am living on rent and the rent is paid by the employer. | 4 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| household | 4159 | |
| or | 4109 | |
| i | 4063 | |
| member | 4059 | |
| a | 4009 | |
| yes | 3527 | |
| owns | 3527 | |
| it | 3527 | |
| rent | 1072 | 2.9% |
| no | 536 | 1.4% |
| Other values (20) | 4978 |
Most occurring characters
| Value | Count | Frequency (%) |
| 33503 | ||
| e | 18006 | 10.5% |
| o | 17280 | 10.1% |
| s | 11849 | 6.9% |
| r | 9244 | 5.4% |
| m | 9140 | 5.3% |
| h | 8958 | 5.2% |
| n | 6307 | 3.7% |
| i | 5621 | 3.3% |
| a | 5617 | 3.3% |
| Other values (18) | 46404 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 122074 | |
| Space Separator | 33503 | 19.5% |
| Other Punctuation | 8176 | 4.8% |
| Uppercase Letter | 8176 | 4.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 18006 | |
| o | 17280 | |
| s | 11849 | |
| r | 9244 | 7.6% |
| m | 9140 | 7.5% |
| h | 8958 | 7.3% |
| n | 6307 | 5.2% |
| i | 5621 | 4.6% |
| a | 5617 | 4.6% |
| t | 5389 | 4.4% |
| Other values (11) | 24663 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 4063 | |
| Y | 3527 | |
| N | 536 | 6.6% |
| W | 50 | 0.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4113 | |
| , | 4063 |
Space Separator
| Value | Count | Frequency (%) |
| 33503 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 130250 | |
| Common | 41679 | 24.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 18006 | |
| o | 17280 | |
| s | 11849 | 9.1% |
| r | 9244 | 7.1% |
| m | 9140 | 7.0% |
| h | 8958 | 6.9% |
| n | 6307 | 4.8% |
| i | 5621 | 4.3% |
| a | 5617 | 4.3% |
| t | 5389 | 4.1% |
| Other values (15) | 32839 |
Common
| Value | Count | Frequency (%) |
| 33503 | ||
| . | 4113 | 9.9% |
| , | 4063 | 9.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 171929 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 33503 | ||
| e | 18006 | 10.5% |
| o | 17280 | 10.1% |
| s | 11849 | 6.9% |
| r | 9244 | 5.4% |
| m | 9140 | 5.3% |
| h | 8958 | 5.2% |
| n | 6307 | 3.7% |
| i | 5621 | 3.3% |
| a | 5617 | 3.3% |
| Other values (18) | 46404 |
occupy_renters_boarders
Categorical
High correlation  Imbalance  Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 536 |
| Missing (%) | 13.2% |
| Memory size | 31.9 KiB |
| I don't occupy any of the above. | |
|---|---|
| Renters / boarders who are living in your annexe or any other attached place, maintaining separate living conditions but share the same electricity meter. | 72 |
| Boarders who live in your house using a room/s that is attached to your living conditions. | 32 |
Length
| Max length | 154 |
|---|---|
| Median length | 32 |
| Mean length | 35.016728 |
| Min length | 32 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | I don't occupy any of the above. |
|---|---|
| 2nd row | I don't occupy any of the above. |
| 3rd row | I don't occupy any of the above. |
| 4th row | I don't occupy any of the above. |
| 5th row | I don't occupy any of the above. |
Common Values
| Value | Count | Frequency (%) |
| I don't occupy any of the above. | 3423 | |
| Renters / boarders who are living in your annexe or any other attached place, maintaining separate living conditions but share the same electricity meter. | 72 | 1.8% |
| Boarders who live in your house using a room/s that is attached to your living conditions. | 32 | 0.8% |
| (Missing) | 536 | 13.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| any | 3495 | |
| the | 3495 | |
| don't | 3423 | |
| i | 3423 | |
| occupy | 3423 | |
| of | 3423 | |
| above | 3423 | |
| living | 176 | 0.7% |
| your | 136 | 0.5% |
| in | 104 | 0.4% |
| Other values (26) | 1680 |
Most occurring characters
| Value | Count | Frequency (%) |
| 22674 | ||
| o | 14516 | |
| e | 8270 | 6.7% |
| a | 7942 | 6.4% |
| t | 7902 | 6.4% |
| n | 7870 | 6.4% |
| c | 7270 | 5.9% |
| y | 7126 | 5.8% |
| h | 3911 | 3.2% |
| d | 3735 | 3.0% |
| Other values (20) | 32288 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 90177 | |
| Space Separator | 22674 | 18.4% |
| Other Punctuation | 7126 | 5.8% |
| Uppercase Letter | 3527 | 2.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 14516 | |
| e | 8270 | |
| a | 7942 | |
| t | 7902 | |
| n | 7870 | |
| c | 7270 | |
| y | 7126 | 7.9% |
| h | 3911 | 4.3% |
| d | 3735 | 4.1% |
| u | 3695 | 4.1% |
| Other values (12) | 17940 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3527 | |
| ' | 3423 | |
| / | 104 | 1.5% |
| , | 72 | 1.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 3423 | |
| R | 72 | 2.0% |
| B | 32 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 22674 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 93704 | |
| Common | 29800 | 24.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 14516 | |
| e | 8270 | 8.8% |
| a | 7942 | 8.5% |
| t | 7902 | 8.4% |
| n | 7870 | 8.4% |
| c | 7270 | 7.8% |
| y | 7126 | 7.6% |
| h | 3911 | 4.2% |
| d | 3735 | 4.0% |
| u | 3695 | 3.9% |
| Other values (15) | 21467 |
Common
| Value | Count | Frequency (%) |
| 22674 | ||
| . | 3527 | 11.8% |
| ' | 3423 | 11.5% |
| / | 104 | 0.3% |
| , | 72 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 123504 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 22674 | ||
| o | 14516 | |
| e | 8270 | 6.7% |
| a | 7942 | 6.4% |
| t | 7902 | 6.4% |
| n | 7870 | 6.4% |
| c | 7270 | 5.9% |
| y | 7126 | 5.8% |
| h | 3911 | 3.2% |
| d | 3735 | 3.0% |
| Other values (20) | 32288 |
awareness_of_electricity_consumption_of_renters
Categorical
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 3959 |
| Missing (%) | 97.4% |
| Memory size | 31.9 KiB |
| I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. |
|---|
Length
| Max length | 218 |
|---|---|
| Median length | 218 |
| Mean length | 218 |
| Min length | 218 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. |
|---|---|
| 2nd row | I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. |
| 3rd row | I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. |
| 4th row | I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. |
| 5th row | I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. |
Common Values
| Value | Count | Frequency (%) |
| I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. | 104 | 2.6% |
| (Missing) | 3959 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| the | 728 | |
| they | 312 | 7.9% |
| of | 208 | 5.3% |
| use | 208 | 5.3% |
| and | 208 | 5.3% |
| all | 104 | 2.6% |
| details | 104 | 2.6% |
| electricity | 104 | 2.6% |
| know | 104 | 2.6% |
| about | 104 | 2.6% |
| Other values (17) | 1768 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3848 | ||
| e | 2912 | |
| t | 2080 | 9.2% |
| h | 1456 | 6.4% |
| a | 1248 | 5.5% |
| s | 1248 | 5.5% |
| n | 1144 | 5.0% |
| i | 1040 | 4.6% |
| o | 936 | 4.1% |
| c | 832 | 3.7% |
| Other values (17) | 5928 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17992 | |
| Space Separator | 3848 | 17.0% |
| Other Punctuation | 728 | 3.2% |
| Uppercase Letter | 104 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2912 | |
| t | 2080 | |
| h | 1456 | 8.1% |
| a | 1248 | 6.9% |
| s | 1248 | 6.9% |
| n | 1144 | 6.4% |
| i | 1040 | 5.8% |
| o | 936 | 5.2% |
| c | 832 | 4.6% |
| l | 728 | 4.0% |
| Other values (11) | 4368 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 312 | |
| ; | 208 | |
| / | 104 | 14.3% |
| , | 104 | 14.3% |
Space Separator
| Value | Count | Frequency (%) |
| 3848 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 104 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18096 | |
| Common | 4576 | 20.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2912 | |
| t | 2080 | |
| h | 1456 | 8.0% |
| a | 1248 | 6.9% |
| s | 1248 | 6.9% |
| n | 1144 | 6.3% |
| i | 1040 | 5.7% |
| o | 936 | 5.2% |
| c | 832 | 4.6% |
| l | 728 | 4.0% |
| Other values (12) | 4472 |
Common
| Value | Count | Frequency (%) |
| 3848 | ||
| . | 312 | 6.8% |
| ; | 208 | 4.5% |
| / | 104 | 2.3% |
| , | 104 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22672 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3848 | ||
| e | 2912 | |
| t | 2080 | 9.2% |
| h | 1456 | 6.4% |
| a | 1248 | 5.5% |
| s | 1248 | 5.5% |
| n | 1144 | 5.0% |
| i | 1040 | 4.6% |
| o | 936 | 4.1% |
| c | 832 | 3.7% |
| Other values (17) | 5928 |
built_year_of_the_house
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.9 KiB |
| 2000-2009 | |
|---|---|
| 2010-2019 | |
| Before 1980 | |
| 1990-1999 | |
| 1980-1989 | |
| Other values (2) |
Length
| Max length | 21 |
|---|---|
| Median length | 9 |
| Mean length | 10.108787 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2000-2009 |
|---|---|
| 2nd row | Before 1980 |
| 3rd row | 1980-1989 |
| 4th row | 2010-2019 |
| 5th row | 2010-2019 |
Common Values
| Value | Count | Frequency (%) |
| 2000-2009 | 918 | |
| 2010-2019 | 758 | |
| Before 1980 | 740 | |
| 1990-1999 | 615 | |
| 1980-1989 | 482 | |
| Don't know | 325 | 8.0% |
| In 2020 or After 2020 | 225 | 5.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2000-2009 | 918 | |
| 2010-2019 | 758 | |
| before | 740 | |
| 1980 | 740 | |
| 1990-1999 | 615 | |
| 1980-1989 | 482 | |
| 2020 | 450 | |
| don't | 325 | 5.4% |
| know | 325 | 5.4% |
| in | 225 | 3.7% |
| Other values (2) | 450 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9601 | |
| 9 | 6937 | |
| 1 | 4450 | |
| 2 | 4252 | |
| - | 2773 | 6.8% |
| 1965 | 4.8% | |
| e | 1705 | 4.2% |
| 8 | 1704 | 4.1% |
| o | 1615 | 3.9% |
| r | 1190 | 2.9% |
| Other values (10) | 4880 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26944 | |
| Lowercase Letter | 7550 | 18.4% |
| Dash Punctuation | 2773 | 6.8% |
| Space Separator | 1965 | 4.8% |
| Uppercase Letter | 1515 | 3.7% |
| Other Punctuation | 325 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1705 | |
| o | 1615 | |
| r | 1190 | |
| f | 965 | |
| n | 875 | |
| t | 550 | 7.3% |
| k | 325 | 4.3% |
| w | 325 | 4.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9601 | |
| 9 | 6937 | |
| 1 | 4450 | |
| 2 | 4252 | |
| 8 | 1704 | 6.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 740 | |
| D | 325 | |
| I | 225 | 14.9% |
| A | 225 | 14.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2773 |
Space Separator
| Value | Count | Frequency (%) |
| 1965 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 325 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 32007 | |
| Latin | 9065 | 22.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1705 | |
| o | 1615 | |
| r | 1190 | |
| f | 965 | |
| n | 875 | |
| B | 740 | |
| t | 550 | 6.1% |
| D | 325 | 3.6% |
| k | 325 | 3.6% |
| w | 325 | 3.6% |
| Other values (2) | 450 | 5.0% |
Common
| Value | Count | Frequency (%) |
| 0 | 9601 | |
| 9 | 6937 | |
| 1 | 4450 | |
| 2 | 4252 | |
| - | 2773 | 8.7% |
| 1965 | 6.1% | |
| 8 | 1704 | 5.3% |
| ' | 325 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41072 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9601 | |
| 9 | 6937 | |
| 1 | 4450 | |
| 2 | 4252 | |
| - | 2773 | 6.8% |
| 1965 | 4.8% | |
| e | 1705 | 4.2% |
| 8 | 1704 | 4.1% |
| o | 1615 | 3.9% |
| r | 1190 | 2.9% |
| Other values (10) | 4880 |
type_of_house
Categorical
Imbalance 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.9 KiB |
| Single House - Single Floor | |
|---|---|
| Single House - Double Floor | |
| Single House - More than 2 floors | 113 |
| Flat | 80 |
| Condominium/ Luxury apartments | 13 |
| Other values (5) | 43 |
Length
| Max length | 33 |
|---|---|
| Median length | 27 |
| Mean length | 26.605464 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Single House - Double Floor |
|---|---|
| 2nd row | Single House - Single Floor |
| 3rd row | Single House - Single Floor |
| 4th row | Single House - Double Floor |
| 5th row | Flat |
Common Values
| Value | Count | Frequency (%) |
| Single House - Single Floor | 2482 | |
| Single House - Double Floor | 1332 | |
| Single House - More than 2 floors | 113 | 2.8% |
| Flat | 80 | 2.0% |
| Condominium/ Luxury apartments | 13 | 0.3% |
| Slum / Shanty | 11 | 0.3% |
| Line room/row house | 11 | 0.3% |
| Attached house / Annex | 10 | 0.2% |
| Twin houses | 9 | 0.2% |
| Other | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| single | 6409 | |
| house | 3948 | |
| 3948 | ||
| floor | 3814 | |
| double | 1332 | 6.6% |
| more | 113 | 0.6% |
| than | 113 | 0.6% |
| 2 | 113 | 0.6% |
| floors | 113 | 0.6% |
| flat | 80 | 0.4% |
| Other values (12) | 123 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 16043 | ||
| o | 13315 | |
| e | 11857 | |
| l | 11759 | |
| n | 6612 | 6.1% |
| i | 6455 | 6.0% |
| S | 6431 | 5.9% |
| g | 6409 | 5.9% |
| u | 5339 | 4.9% |
| s | 4092 | 3.8% |
| Other values (25) | 19786 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 72205 | |
| Space Separator | 16043 | 14.8% |
| Uppercase Letter | 15765 | 14.6% |
| Dash Punctuation | 3927 | 3.6% |
| Decimal Number | 113 | 0.1% |
| Other Punctuation | 45 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 13315 | |
| e | 11857 | |
| l | 11759 | |
| n | 6612 | |
| i | 6455 | |
| g | 6409 | |
| u | 5339 | |
| s | 4092 | 5.7% |
| r | 4090 | 5.7% |
| b | 1332 | 1.8% |
| Other values (11) | 945 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 6431 | |
| H | 3927 | |
| F | 3894 | |
| D | 1332 | 8.4% |
| M | 113 | 0.7% |
| L | 24 | 0.2% |
| A | 20 | 0.1% |
| C | 13 | 0.1% |
| T | 9 | 0.1% |
| O | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 16043 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3927 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 113 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 45 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 87970 | |
| Common | 20128 | 18.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 13315 | |
| e | 11857 | |
| l | 11759 | |
| n | 6612 | |
| i | 6455 | |
| S | 6431 | |
| g | 6409 | |
| u | 5339 | |
| s | 4092 | 4.7% |
| r | 4090 | 4.6% |
| Other values (21) | 11611 |
Common
| Value | Count | Frequency (%) |
| 16043 | ||
| - | 3927 | 19.5% |
| 2 | 113 | 0.6% |
| / | 45 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 108098 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 16043 | ||
| o | 13315 | |
| e | 11857 | |
| l | 11759 | |
| n | 6612 | 6.1% |
| i | 6455 | 6.0% |
| S | 6431 | 5.9% |
| g | 6409 | 5.9% |
| u | 5339 | 4.9% |
| s | 4092 | 3.8% |
| Other values (25) | 19786 |
floor_which_house_located
Real number (ℝ)
High correlation  Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | 12.9% |
| Missing | 3970 |
| Missing (%) | 97.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7741935 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 17 |
| Zeros (%) | 0.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 9 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.8173725 |
|---|---|
| Coefficient of variation (CV) | 1.0155645 |
| Kurtosis | 0.61342339 |
| Mean | 2.7741935 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.2342572 |
| Sum | 258 |
| Variance | 7.9375877 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 24 | 0.6% |
| 2 | 18 | 0.4% |
| 0 | 17 | 0.4% |
| 3 | 7 | 0.2% |
| 4 | 7 | 0.2% |
| 5 | 4 | 0.1% |
| 8 | 4 | 0.1% |
| 6 | 3 | 0.1% |
| 7 | 3 | 0.1% |
| 9 | 3 | 0.1% |
| Other values (2) | 3 | 0.1% |
| (Missing) | 3970 |
| Value | Count | Frequency (%) |
| 0 | 17 | |
| 1 | 24 | |
| 2 | 18 | |
| 3 | 7 | 0.2% |
| 4 | 7 | 0.2% |
| 5 | 4 | 0.1% |
| 6 | 3 | 0.1% |
| 7 | 3 | 0.1% |
| 8 | 4 | 0.1% |
| 9 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 11 | 1 | < 0.1% |
| 10 | 2 | < 0.1% |
| 9 | 3 | 0.1% |
| 8 | 4 | 0.1% |
| 7 | 3 | 0.1% |
| 6 | 3 | 0.1% |
| 5 | 4 | 0.1% |
| 4 | 7 | 0.2% |
| 3 | 7 | 0.2% |
| 2 | 18 |
no_of_storeys
Real number (ℝ)
High correlation  Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 3814 |
| Missing (%) | 93.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.746988 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 35 |
| Zeros (%) | 0.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.1378039 |
|---|---|
| Coefficient of variation (CV) | 0.65129465 |
| Kurtosis | -1.2583883 |
| Mean | 1.746988 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.013050986 |
| Sum | 435 |
| Variance | 1.2945977 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 93 | 2.3% |
| 1 | 91 | 2.2% |
| 0 | 35 | 0.9% |
| 2 | 28 | 0.7% |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| (Missing) | 3814 |
| Value | Count | Frequency (%) |
| 0 | 35 | 0.9% |
| 1 | 91 | |
| 2 | 28 | 0.7% |
| 3 | 93 | |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 3 | 93 | |
| 2 | 28 | 0.7% |
| 1 | 91 | |
| 0 | 35 | 0.9% |
floor_area
Real number (ℝ)
| Distinct | 386 |
|---|---|
| Distinct (%) | 9.6% |
| Missing | 26 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1356.1812 |
| Minimum | 100 |
|---|---|
| Maximum | 9000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.9 KiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 300 |
| Q1 | 600 |
| median | 1000 |
| Q3 | 2000 |
| 95-th percentile | 3000 |
| Maximum | 9000 |
| Range | 8900 |
| Interquartile range (IQR) | 1400 |
Descriptive statistics
| Standard deviation | 950.2113 |
|---|---|
| Coefficient of variation (CV) | 0.70065219 |
| Kurtosis | 2.1828572 |
| Mean | 1356.1812 |
| Median Absolute Deviation (MAD) | 500 |
| Skewness | 1.2111539 |
| Sum | 5474903.3 |
| Variance | 902901.51 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3000 | 277 | 6.8% |
| 1000 | 264 | 6.5% |
| 1200 | 256 | 6.3% |
| 800 | 203 | 5.0% |
| 600 | 202 | 5.0% |
| 2400 | 180 | 4.4% |
| 1500 | 175 | 4.3% |
| 2000 | 171 | 4.2% |
| 500 | 140 | 3.4% |
| 400 | 111 | 2.7% |
| Other values (376) | 2058 |
| Value | Count | Frequency (%) |
| 100 | 12 | |
| 108 | 1 | < 0.1% |
| 120 | 2 | < 0.1% |
| 125 | 1 | < 0.1% |
| 136.5 | 1 | < 0.1% |
| 140 | 2 | < 0.1% |
| 143 | 1 | < 0.1% |
| 144 | 1 | < 0.1% |
| 150 | 22 | |
| 160 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9000 | 2 | < 0.1% |
| 6000 | 1 | < 0.1% |
| 5000 | 1 | < 0.1% |
| 4700 | 1 | < 0.1% |
| 4600 | 7 | 0.2% |
| 4400 | 8 | 0.2% |
| 4200 | 15 | |
| 4000 | 23 | |
| 3960 | 2 | < 0.1% |
| 3900 | 1 | < 0.1% |
no_of_household_members
Real number (ℝ)
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.0044302 |
| Minimum | 1 |
|---|---|
| Maximum | 13 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 7 |
| Maximum | 13 |
| Range | 12 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.6872622 |
|---|---|
| Coefficient of variation (CV) | 0.42134889 |
| Kurtosis | 1.2549374 |
| Mean | 4.0044302 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.68467186 |
| Sum | 16270 |
| Variance | 2.8468538 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 1014 | |
| 3 | 818 | |
| 5 | 772 | |
| 2 | 602 | |
| 6 | 407 | |
| 1 | 186 | 4.6% |
| 7 | 144 | 3.5% |
| 8 | 65 | 1.6% |
| 9 | 29 | 0.7% |
| 10 | 15 | 0.4% |
| Other values (3) | 11 | 0.3% |
| Value | Count | Frequency (%) |
| 1 | 186 | 4.6% |
| 2 | 602 | |
| 3 | 818 | |
| 4 | 1014 | |
| 5 | 772 | |
| 6 | 407 | |
| 7 | 144 | 3.5% |
| 8 | 65 | 1.6% |
| 9 | 29 | 0.7% |
| 10 | 15 | 0.4% |
| Value | Count | Frequency (%) |
| 13 | 2 | < 0.1% |
| 12 | 4 | 0.1% |
| 11 | 5 | 0.1% |
| 10 | 15 | 0.4% |
| 9 | 29 | 0.7% |
| 8 | 65 | 1.6% |
| 7 | 144 | 3.5% |
| 6 | 407 | |
| 5 | 772 | |
| 4 | 1014 |
charging_method_of_renters_for_electricity
Categorical
High correlation  Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 4.8% |
| Missing | 3959 |
| Missing (%) | 97.4% |
| Memory size | 31.9 KiB |
| You charge a fixed amount every month for electricity. | |
|---|---|
| You don't charge them for electricity consumption. | |
| You charge an amount for electricity depending on the variance of the bill. | |
| You don't charge a specific amount for electricity but charge a fixed amount for all the utilities such as electricity, water etc. | |
| You don't charge a specific amount for electricity but charge a varied amount for all the utilities such as electricity, water etc. The amount charged varied based on the utility bills. |
Length
| Max length | 185 |
|---|---|
| Median length | 130 |
| Mean length | 78.759615 |
| Min length | 50 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | You don't charge them for electricity consumption. |
|---|---|
| 2nd row | You don't charge them for electricity consumption. |
| 3rd row | You don't charge them for electricity consumption. |
| 4th row | You don't charge a specific amount for electricity but charge a fixed amount for all the utilities such as electricity, water etc. |
| 5th row | You don't charge a specific amount for electricity but charge a fixed amount for all the utilities such as electricity, water etc. |
Common Values
| Value | Count | Frequency (%) |
| You charge a fixed amount every month for electricity. | 34 | 0.8% |
| You don't charge them for electricity consumption. | 24 | 0.6% |
| You charge an amount for electricity depending on the variance of the bill. | 21 | 0.5% |
| You don't charge a specific amount for electricity but charge a fixed amount for all the utilities such as electricity, water etc. | 19 | 0.5% |
| You don't charge a specific amount for electricity but charge a varied amount for all the utilities such as electricity, water etc. The amount charged varied based on the utility bills. | 6 | 0.1% |
| (Missing) | 3959 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| charge | 129 | 9.5% |
| for | 129 | 9.5% |
| electricity | 129 | 9.5% |
| amount | 111 | 8.2% |
| you | 104 | 7.7% |
| a | 84 | 6.2% |
| the | 79 | 5.8% |
| fixed | 53 | 3.9% |
| don't | 49 | 3.6% |
| every | 34 | 2.5% |
| Other values (22) | 450 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1247 | ||
| e | 798 | 9.7% |
| t | 710 | 8.7% |
| i | 553 | 6.8% |
| c | 538 | 6.6% |
| o | 523 | 6.4% |
| a | 486 | 5.9% |
| r | 485 | 5.9% |
| n | 353 | 4.3% |
| u | 320 | 3.9% |
| Other values (18) | 2178 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6650 | |
| Space Separator | 1247 | 15.2% |
| Other Punctuation | 184 | 2.2% |
| Uppercase Letter | 110 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 798 | |
| t | 710 | |
| i | 553 | 8.3% |
| c | 538 | 8.1% |
| o | 523 | 7.9% |
| a | 486 | 7.3% |
| r | 485 | 7.3% |
| n | 353 | 5.3% |
| u | 320 | 4.8% |
| h | 297 | 4.5% |
| Other values (12) | 1587 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 110 | |
| ' | 49 | |
| , | 25 | 13.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 104 | |
| T | 6 | 5.5% |
Space Separator
| Value | Count | Frequency (%) |
| 1247 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6760 | |
| Common | 1431 | 17.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 798 | |
| t | 710 | |
| i | 553 | 8.2% |
| c | 538 | 8.0% |
| o | 523 | 7.7% |
| a | 486 | 7.2% |
| r | 485 | 7.2% |
| n | 353 | 5.2% |
| u | 320 | 4.7% |
| h | 297 | 4.4% |
| Other values (14) | 1697 |
Common
| Value | Count | Frequency (%) |
| 1247 | ||
| . | 110 | 7.7% |
| ' | 49 | 3.4% |
| , | 25 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8191 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1247 | ||
| e | 798 | 9.7% |
| t | 710 | 8.7% |
| i | 553 | 6.8% |
| c | 538 | 6.6% |
| o | 523 | 6.4% |
| a | 486 | 5.9% |
| r | 485 | 5.9% |
| n | 353 | 4.3% |
| u | 320 | 3.9% |
| Other values (18) | 2178 |
charged_method_for_rent_for_electricity
Categorical
Imbalance  Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 3527 |
| Missing (%) | 86.8% |
| Memory size | 31.9 KiB |
| You pay the full amount of the electricity bill. | |
|---|---|
| You don't pay the owner for electricity consumption. | 17 |
| You pay a fixed amount to the owner every month for electricity. | 13 |
| You pay a varied amount to the owner every month for electricity. The amount paid varies depending on the variance of the bill. | 6 |
| You don't pay a specific amount for electricity, but pay a fixed amount for all the utilities such as electricity, water etc. | 3 |
Length
| Max length | 197 |
|---|---|
| Median length | 48 |
| Mean length | 50.108209 |
| Min length | 48 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | You pay the full amount of the electricity bill. |
|---|---|
| 2nd row | You pay the full amount of the electricity bill. |
| 3rd row | You don't pay the owner for electricity consumption. |
| 4th row | You pay the full amount of the electricity bill. |
| 5th row | You pay the full amount of the electricity bill. |
Common Values
| Value | Count | Frequency (%) |
| You pay the full amount of the electricity bill. | 496 | 12.2% |
| You don't pay the owner for electricity consumption. | 17 | 0.4% |
| You pay a fixed amount to the owner every month for electricity. | 13 | 0.3% |
| You pay a varied amount to the owner every month for electricity. The amount paid varies depending on the variance of the bill. | 6 | 0.1% |
| You don't pay a specific amount for electricity, but pay a fixed amount for all the utilities such as electricity, water etc. | 3 | 0.1% |
| You don't pay a specific amount for electricity, but pay a varied amount for all the utilities such as electricity, water etc. The amount paid varies depending on the variance of the utility bills. | 1 | < 0.1% |
| (Missing) | 3527 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| the | 1053 | |
| electricity | 540 | |
| pay | 540 | |
| you | 536 | |
| amount | 530 | |
| of | 503 | |
| bill | 502 | |
| full | 496 | |
| for | 44 | 0.9% |
| owner | 36 | 0.7% |
| Other values (23) | 214 | 4.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4458 | ||
| t | 2754 | |
| l | 2551 | 9.5% |
| e | 2274 | 8.5% |
| o | 1749 | 6.5% |
| i | 1673 | 6.2% |
| u | 1592 | 5.9% |
| a | 1144 | 4.3% |
| c | 1120 | 4.2% |
| y | 1100 | 4.1% |
| Other values (18) | 6443 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 21285 | |
| Space Separator | 4458 | 16.6% |
| Other Punctuation | 572 | 2.1% |
| Uppercase Letter | 543 | 2.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2754 | |
| l | 2551 | |
| e | 2274 | |
| o | 1749 | |
| i | 1673 | 7.9% |
| u | 1592 | 7.5% |
| a | 1144 | 5.4% |
| c | 1120 | 5.3% |
| y | 1100 | 5.2% |
| h | 1076 | 5.1% |
| Other values (12) | 4252 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 543 | |
| ' | 21 | 3.7% |
| , | 8 | 1.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 536 | |
| T | 7 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| 4458 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21828 | |
| Common | 5030 | 18.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 2754 | |
| l | 2551 | |
| e | 2274 | |
| o | 1749 | 8.0% |
| i | 1673 | 7.7% |
| u | 1592 | 7.3% |
| a | 1144 | 5.2% |
| c | 1120 | 5.1% |
| y | 1100 | 5.0% |
| h | 1076 | 4.9% |
| Other values (14) | 4795 |
Common
| Value | Count | Frequency (%) |
| 4458 | ||
| . | 543 | 10.8% |
| ' | 21 | 0.4% |
| , | 8 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26858 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4458 | ||
| t | 2754 | |
| l | 2551 | 9.5% |
| e | 2274 | 8.5% |
| o | 1749 | 6.5% |
| i | 1673 | 6.2% |
| u | 1592 | 5.9% |
| a | 1144 | 4.3% |
| c | 1120 | 4.2% |
| y | 1100 | 4.1% |
| Other values (18) | 6443 |
is_there_business_carried_out_in_the_household
Boolean
High correlation  Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| False | |
|---|---|
| True | 186 |
| Value | Count | Frequency (%) |
| False | 3877 | |
| True | 186 | 4.6% |
type_of_business
Categorical
High correlation  Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 3877 |
| Missing (%) | 95.4% |
| Memory size | 31.9 KiB |
| Other | |
|---|---|
| A shop | |
| A communication | 2 |
Length
| Max length | 15 |
|---|---|
| Median length | 5 |
| Mean length | 5.4462366 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Other |
|---|---|
| 2nd row | Other |
| 3rd row | Other |
| 4th row | Other |
| 5th row | Other |
Common Values
| Value | Count | Frequency (%) |
| Other | 121 | 3.0% |
| A shop | 63 | 1.6% |
| A communication | 2 | < 0.1% |
| (Missing) | 3877 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| other | 121 | |
| a | 65 | |
| shop | 63 | |
| communication | 2 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| h | 184 | |
| t | 123 | |
| O | 121 | |
| e | 121 | |
| r | 121 | |
| o | 67 | 6.6% |
| 65 | 6.4% | |
| A | 65 | 6.4% |
| s | 63 | 6.2% |
| p | 63 | 6.2% |
| Other values (6) | 20 | 2.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 762 | |
| Uppercase Letter | 186 | 18.4% |
| Space Separator | 65 | 6.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| h | 184 | |
| t | 123 | |
| e | 121 | |
| r | 121 | |
| o | 67 | 8.8% |
| s | 63 | 8.3% |
| p | 63 | 8.3% |
| c | 4 | 0.5% |
| m | 4 | 0.5% |
| n | 4 | 0.5% |
| Other values (3) | 8 | 1.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 121 | |
| A | 65 |
Space Separator
| Value | Count | Frequency (%) |
| 65 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 948 | |
| Common | 65 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| h | 184 | |
| t | 123 | |
| O | 121 | |
| e | 121 | |
| r | 121 | |
| o | 67 | 7.1% |
| A | 65 | 6.9% |
| s | 63 | 6.6% |
| p | 63 | 6.6% |
| c | 4 | 0.4% |
| Other values (5) | 16 | 1.7% |
Common
| Value | Count | Frequency (%) |
| 65 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1013 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| h | 184 | |
| t | 123 | |
| O | 121 | |
| e | 121 | |
| r | 121 | |
| o | 67 | 6.6% |
| 65 | 6.4% | |
| A | 65 | 6.4% |
| s | 63 | 6.2% |
| p | 63 | 6.2% |
| Other values (6) | 20 | 2.0% |
whom_or_how_the_house_was_designed
Categorical
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1280 |
| Missing (%) | 31.5% |
| Memory size | 31.9 KiB |
| The house is designed by a certified architect. | |
|---|---|
| I am not aware of that. | |
| The house plan is not done by an architect, nor checked by a certified architect or engineer. The house was not designed keeping in mind the legal requirements of the local authorities. The house was designed only to suit our needs. | |
| The house plan is not done by an architect, nor checked by a certified architect / engineer, the house is designed to barely pass the legal requirements of the local authorities. | 125 |
| The house plan was not done by an architect but checked by a certified architect or engineer. | 117 |
Length
| Max length | 232 |
|---|---|
| Median length | 178 |
| Mean length | 72.238951 |
| Min length | 23 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | The house is designed by a certified architect. |
|---|---|
| 2nd row | This is a house provided by the government. |
| 3rd row | The house is designed by a certified architect. |
| 4th row | The house is designed by a certified architect. |
| 5th row | The house is designed by a certified architect. |
Common Values
| Value | Count | Frequency (%) |
| The house is designed by a certified architect. | 1389 | |
| I am not aware of that. | 738 | |
| The house plan is not done by an architect, nor checked by a certified architect or engineer. The house was not designed keeping in mind the legal requirements of the local authorities. The house was designed only to suit our needs. | 359 | 8.8% |
| The house plan is not done by an architect, nor checked by a certified architect / engineer, the house is designed to barely pass the legal requirements of the local authorities. | 125 | 3.1% |
| The house plan was not done by an architect but checked by a certified architect or engineer. | 117 | 2.9% |
| This is a house provided by the government. | 55 | 1.4% |
| (Missing) | 1280 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| the | 3856 | 10.5% |
| house | 2888 | 7.9% |
| by | 2646 | 7.2% |
| architect | 2591 | 7.1% |
| designed | 2232 | 6.1% |
| is | 2053 | 5.6% |
| a | 2045 | 5.6% |
| certified | 1990 | 5.4% |
| not | 1698 | 4.6% |
| of | 1222 | 3.3% |
| Other values (31) | 13342 |
Most occurring characters
| Value | Count | Frequency (%) |
| 33780 | ||
| e | 26269 | |
| i | 14455 | 7.2% |
| t | 13961 | 6.9% |
| a | 11327 | 5.6% |
| h | 11213 | 5.6% |
| s | 9999 | 5.0% |
| n | 9808 | 4.9% |
| o | 9649 | 4.8% |
| r | 8926 | 4.4% |
| Other values (19) | 51654 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 159525 | |
| Space Separator | 33780 | 16.8% |
| Other Punctuation | 4235 | 2.1% |
| Uppercase Letter | 3501 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 26269 | |
| i | 14455 | |
| t | 13961 | 8.8% |
| a | 11327 | 7.1% |
| h | 11213 | 7.0% |
| s | 9999 | 6.3% |
| n | 9808 | 6.1% |
| o | 9649 | 6.0% |
| r | 8926 | 5.6% |
| c | 8858 | 5.6% |
| Other values (13) | 35060 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3501 | |
| , | 609 | 14.4% |
| / | 125 | 3.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 2763 | |
| I | 738 | 21.1% |
Space Separator
| Value | Count | Frequency (%) |
| 33780 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 163026 | |
| Common | 38015 | 18.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 26269 | |
| i | 14455 | 8.9% |
| t | 13961 | 8.6% |
| a | 11327 | 6.9% |
| h | 11213 | 6.9% |
| s | 9999 | 6.1% |
| n | 9808 | 6.0% |
| o | 9649 | 5.9% |
| r | 8926 | 5.5% |
| c | 8858 | 5.4% |
| Other values (15) | 38561 |
Common
| Value | Count | Frequency (%) |
| 33780 | ||
| . | 3501 | 9.2% |
| , | 609 | 1.6% |
| / | 125 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 201041 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 33780 | ||
| e | 26269 | |
| i | 14455 | 7.2% |
| t | 13961 | 6.9% |
| a | 11327 | 5.6% |
| h | 11213 | 5.6% |
| s | 9999 | 5.0% |
| n | 9808 | 4.9% |
| o | 9649 | 4.8% |
| r | 8926 | 4.4% |
| Other values (19) | 51654 |
availability_of_certificate_of_compliance
Categorical
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1280 |
| Missing (%) | 31.5% |
| Memory size | 31.9 KiB |
| Yes | |
|---|---|
| Don't know | |
| No |
Length
| Max length | 10 |
|---|---|
| Median length | 3 |
| Mean length | 4.9019044 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No |
|---|---|
| 2nd row | Yes |
| 3rd row | No |
| 4th row | No |
| 5th row | Yes |
Common Values
| Value | Count | Frequency (%) |
| Yes | 1268 | |
| Don't know | 851 | |
| No | 664 | |
| (Missing) | 1280 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| yes | 1268 | |
| don't | 851 | |
| know | 851 | |
| no | 664 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2366 | |
| n | 1702 | |
| Y | 1268 | |
| e | 1268 | |
| s | 1268 | |
| D | 851 | 6.2% |
| ' | 851 | 6.2% |
| t | 851 | 6.2% |
| 851 | 6.2% | |
| k | 851 | 6.2% |
| Other values (2) | 1515 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9157 | |
| Uppercase Letter | 2783 | 20.4% |
| Other Punctuation | 851 | 6.2% |
| Space Separator | 851 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2366 | |
| n | 1702 | |
| e | 1268 | |
| s | 1268 | |
| t | 851 | 9.3% |
| k | 851 | 9.3% |
| w | 851 | 9.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 1268 | |
| D | 851 | |
| N | 664 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 851 |
Space Separator
| Value | Count | Frequency (%) |
| 851 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11940 | |
| Common | 1702 | 12.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2366 | |
| n | 1702 | |
| Y | 1268 | |
| e | 1268 | |
| s | 1268 | |
| D | 851 | 7.1% |
| t | 851 | 7.1% |
| k | 851 | 7.1% |
| w | 851 | 7.1% |
| N | 664 | 5.6% |
Common
| Value | Count | Frequency (%) |
| ' | 851 | |
| 851 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13642 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 2366 | |
| n | 1702 | |
| Y | 1268 | |
| e | 1268 | |
| s | 1268 | |
| D | 851 | 6.2% |
| ' | 851 | 6.2% |
| t | 851 | 6.2% |
| 851 | 6.2% | |
| k | 851 | 6.2% |
| Other values (2) | 1515 |
main_material_used_for_walls_of_the_house
Categorical
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.9 KiB |
| Cement Block | |
|---|---|
| Brick | |
| I am not aware of that | |
| Cabook | |
| Wood / Takaran / Asbestos | 49 |
| Other values (5) | 66 |
Length
| Max length | 25 |
|---|---|
| Median length | 22 |
| Mean length | 9.8624169 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Brick |
|---|---|
| 2nd row | Brick |
| 3rd row | Brick |
| 4th row | Cement Block |
| 5th row | I am not aware of that |
Common Values
| Value | Count | Frequency (%) |
| Cement Block | 1837 | |
| Brick | 1621 | |
| I am not aware of that | 299 | 7.4% |
| Cabook | 191 | 4.7% |
| Wood / Takaran / Asbestos | 49 | 1.2% |
| Pressed soil blocks | 29 | 0.7% |
| Stones/Cube stones | 19 | 0.5% |
| Other | 9 | 0.2% |
| Mud | 8 | 0.2% |
| Metal Sheet | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cement | 1837 | |
| block | 1837 | |
| brick | 1621 | |
| i | 299 | 3.9% |
| am | 299 | 3.9% |
| not | 299 | 3.9% |
| aware | 299 | 3.9% |
| of | 299 | 3.9% |
| that | 299 | 3.9% |
| cabook | 191 | 2.5% |
| Other values (13) | 389 | 5.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4149 | |
| k | 3727 | 9.3% |
| 3606 | 9.0% | |
| c | 3487 | 8.7% |
| B | 3458 | 8.6% |
| o | 3060 | 7.6% |
| t | 2832 | 7.1% |
| n | 2223 | 5.5% |
| m | 2136 | 5.3% |
| C | 2047 | 5.1% |
| Other values (20) | 9346 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 30330 | |
| Uppercase Letter | 6018 | 15.0% |
| Space Separator | 3606 | 9.0% |
| Other Punctuation | 117 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4149 | |
| k | 3727 | |
| c | 3487 | |
| o | 3060 | |
| t | 2832 | |
| n | 2223 | |
| m | 2136 | |
| r | 2007 | |
| l | 1896 | |
| i | 1650 | 5.4% |
| Other values (8) | 3163 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 3458 | |
| C | 2047 | |
| I | 299 | 5.0% |
| W | 49 | 0.8% |
| T | 49 | 0.8% |
| A | 49 | 0.8% |
| P | 29 | 0.5% |
| S | 20 | 0.3% |
| O | 9 | 0.1% |
| M | 9 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3606 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 117 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 36348 | |
| Common | 3723 | 9.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4149 | |
| k | 3727 | |
| c | 3487 | |
| B | 3458 | |
| o | 3060 | |
| t | 2832 | 7.8% |
| n | 2223 | 6.1% |
| m | 2136 | 5.9% |
| C | 2047 | 5.6% |
| r | 2007 | 5.5% |
| Other values (18) | 7222 |
Common
| Value | Count | Frequency (%) |
| 3606 | ||
| / | 117 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40071 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4149 | |
| k | 3727 | 9.3% |
| 3606 | 9.0% | |
| c | 3487 | 8.7% |
| B | 3458 | 8.6% |
| o | 3060 | 7.6% |
| t | 2832 | 7.1% |
| n | 2223 | 5.5% |
| m | 2136 | 5.3% |
| C | 2047 | 5.1% |
| Other values (20) | 9346 |
main_material_used_for_roof_of_the_house
Categorical
Imbalance  Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 1280 |
| Missing (%) | 31.5% |
| Memory size | 31.9 KiB |
| Asbestos | |
|---|---|
| Concrete | |
| Tile | |
| Takaran | 19 |
| Metal Sheet | 14 |
| Other values (3) | 14 |
Length
| Max length | 14 |
|---|---|
| Median length | 8 |
| Mean length | 7.298958 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Asbestos |
|---|---|
| 2nd row | Asbestos |
| 3rd row | Tile |
| 4th row | Concrete |
| 5th row | Concrete |
Common Values
| Value | Count | Frequency (%) |
| Asbestos | 1665 | |
| Concrete | 577 | 14.2% |
| Tile | 494 | 12.2% |
| Takaran | 19 | 0.5% |
| Metal Sheet | 14 | 0.3% |
| Other | 8 | 0.2% |
| Plastic sheets | 5 | 0.1% |
| Tent | 1 | < 0.1% |
| (Missing) | 1280 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| asbestos | 1665 | |
| concrete | 577 | 20.6% |
| tile | 494 | 17.6% |
| takaran | 19 | 0.7% |
| metal | 14 | 0.5% |
| sheet | 14 | 0.5% |
| other | 8 | 0.3% |
| plastic | 5 | 0.2% |
| sheets | 5 | 0.2% |
| tent | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 5010 | |
| e | 3374 | |
| t | 2289 | |
| o | 2242 | |
| A | 1665 | 8.2% |
| b | 1665 | 8.2% |
| r | 604 | 3.0% |
| n | 597 | 2.9% |
| c | 582 | 2.9% |
| C | 577 | 2.8% |
| Other values (11) | 1708 | 8.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17497 | |
| Uppercase Letter | 2797 | 13.8% |
| Space Separator | 19 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 5010 | |
| e | 3374 | |
| t | 2289 | |
| o | 2242 | |
| b | 1665 | 9.5% |
| r | 604 | 3.5% |
| n | 597 | 3.4% |
| c | 582 | 3.3% |
| l | 513 | 2.9% |
| i | 499 | 2.9% |
| Other values (3) | 122 | 0.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1665 | |
| C | 577 | 20.6% |
| T | 514 | 18.4% |
| M | 14 | 0.5% |
| S | 14 | 0.5% |
| O | 8 | 0.3% |
| P | 5 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 19 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20294 | |
| Common | 19 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 5010 | |
| e | 3374 | |
| t | 2289 | |
| o | 2242 | |
| A | 1665 | 8.2% |
| b | 1665 | 8.2% |
| r | 604 | 3.0% |
| n | 597 | 2.9% |
| c | 582 | 2.9% |
| C | 577 | 2.8% |
| Other values (10) | 1689 | 8.3% |
Common
| Value | Count | Frequency (%) |
| 19 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20313 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 5010 | |
| e | 3374 | |
| t | 2289 | |
| o | 2242 | |
| A | 1665 | 8.2% |
| b | 1665 | 8.2% |
| r | 604 | 3.0% |
| n | 597 | 2.9% |
| c | 582 | 2.9% |
| C | 577 | 2.8% |
| Other values (11) | 1708 | 8.4% |
any_constructions_or_renovations_in_the_household
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.1 KiB |
| False | |
|---|---|
| True | 204 |
| Value | Count | Frequency (%) |
| False | 3859 | |
| True | 204 | 5.0% |
highest_level_of_education_of_the_chief_wage_earner
Categorical
High correlation 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.9 KiB |
| O/L or A/L pending / Passed | |
|---|---|
| Schooling up to Grade 6 - 9 | |
| Diploma with O/L or A/L (Non graduate) | |
| Graduate / Post-Grads / Degree level professional qualification | |
| Other professional certificates with O/L or A/L / Part qualification (Non graduate) | |
| Other values (2) | 149 |
Length
| Max length | 83 |
|---|---|
| Median length | 27 |
| Mean length | 36.543441 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | O/L or A/L pending / Passed |
|---|---|
| 2nd row | Diploma with O/L or A/L (Non graduate) |
| 3rd row | Graduate / Post-Grads / Degree level professional qualification |
| 4th row | Other professional certificates with O/L or A/L / Part qualification (Non graduate) |
| 5th row | Schooling up to Grade 6 - 9 |
Common Values
| Value | Count | Frequency (%) |
| O/L or A/L pending / Passed | 1879 | |
| Schooling up to Grade 6 - 9 | 672 | 16.5% |
| Diploma with O/L or A/L (Non graduate) | 563 | 13.9% |
| Graduate / Post-Grads / Degree level professional qualification | 528 | 13.0% |
| Other professional certificates with O/L or A/L / Part qualification (Non graduate) | 272 | 6.7% |
| Primary Education | 125 | 3.1% |
| Illiterate | 24 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3879 | ||
| or | 2714 | 9.8% |
| o/l | 2714 | 9.8% |
| a/l | 2714 | 9.8% |
| pending | 1879 | 6.8% |
| passed | 1879 | 6.8% |
| graduate | 1363 | 4.9% |
| non | 835 | 3.0% |
| with | 835 | 3.0% |
| professional | 800 | 2.9% |
| Other values (17) | 8069 |
Most occurring characters
| Value | Count | Frequency (%) |
| 23618 | ||
| e | 10097 | 6.8% |
| a | 9586 | 6.5% |
| o | 9181 | 6.2% |
| / | 8635 | 5.8% |
| i | 7967 | 5.4% |
| r | 7695 | 5.2% |
| n | 6990 | 4.7% |
| s | 6686 | 4.5% |
| d | 6446 | 4.3% |
| Other values (28) | 51575 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 93602 | |
| Space Separator | 23618 | 15.9% |
| Uppercase Letter | 18407 | 12.4% |
| Other Punctuation | 8635 | 5.8% |
| Decimal Number | 1344 | 0.9% |
| Dash Punctuation | 1200 | 0.8% |
| Open Punctuation | 835 | 0.6% |
| Close Punctuation | 835 | 0.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 10097 | |
| a | 9586 | |
| o | 9181 | |
| i | 7967 | |
| r | 7695 | 8.2% |
| n | 6990 | 7.5% |
| s | 6686 | 7.1% |
| d | 6446 | 6.9% |
| t | 5459 | 5.8% |
| l | 3939 | 4.2% |
| Other values (11) | 19556 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 5428 | |
| O | 2986 | |
| P | 2804 | |
| A | 2714 | |
| G | 1728 | 9.4% |
| D | 1091 | 5.9% |
| N | 835 | 4.5% |
| S | 672 | 3.7% |
| E | 125 | 0.7% |
| I | 24 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 672 | |
| 9 | 672 |
Space Separator
| Value | Count | Frequency (%) |
| 23618 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 8635 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1200 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 835 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 835 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 112009 | |
| Common | 36467 | 24.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 10097 | 9.0% |
| a | 9586 | 8.6% |
| o | 9181 | 8.2% |
| i | 7967 | 7.1% |
| r | 7695 | 6.9% |
| n | 6990 | 6.2% |
| s | 6686 | 6.0% |
| d | 6446 | 5.8% |
| t | 5459 | 4.9% |
| L | 5428 | 4.8% |
| Other values (21) | 36474 |
Common
| Value | Count | Frequency (%) |
| 23618 | ||
| / | 8635 | 23.7% |
| - | 1200 | 3.3% |
| ( | 835 | 2.3% |
| ) | 835 | 2.3% |
| 6 | 672 | 1.8% |
| 9 | 672 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 148476 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 23618 | ||
| e | 10097 | 6.8% |
| a | 9586 | 6.5% |
| o | 9181 | 6.2% |
| / | 8635 | 5.8% |
| i | 7967 | 5.4% |
| r | 7695 | 5.2% |
| n | 6990 | 4.7% |
| s | 6686 | 4.5% |
| d | 6446 | 4.3% |
| Other values (28) | 51575 |
occupation_of_the_chief_wage_earner
Categorical
High correlation 
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.9 KiB |
| Skilled Worker | |
|---|---|
| Small Businessman / Self employed (Non professional) | |
| Manager / Professional | |
| Clerk / Salesman grades | |
| Unskilled Worker | |
| Other values (14) |
Length
| Max length | 52 |
|---|---|
| Median length | 43 |
| Mean length | 24.075068 |
| Min length | 12 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Skilled Worker |
|---|---|
| 2nd row | Unskilled Worker |
| 3rd row | Middle and Senior executive |
| 4th row | 1-9 Employed |
| 5th row | Skilled Worker |
Common Values
| Value | Count | Frequency (%) |
| Skilled Worker | 1239 | |
| Small Businessman / Self employed (Non professional) | 483 | 11.9% |
| Manager / Professional | 452 | 11.1% |
| Clerk / Salesman grades | 398 | 9.8% |
| Unskilled Worker | 334 | 8.2% |
| Self employed (Professional) - No employees | 262 | 6.4% |
| Junior executive / Executive | 251 | 6.2% |
| Middle and Senior executive | 237 | 5.8% |
| 1-9 Employed | 160 | 3.9% |
| Supervisor grades | 153 | 3.8% |
| Other values (9) | 94 | 2.3% |
Length
| Value | Count | Frequency (%) |
| 1879 | ||
| worker | 1589 | 11.4% |
| skilled | 1239 | 8.9% |
| professional | 1197 | 8.6% |
| employed | 921 | 6.6% |
| self | 745 | 5.4% |
| executive | 739 | 5.3% |
| grades | 551 | 4.0% |
| non | 483 | 3.5% |
| small | 483 | 3.5% |
| Other values (34) | 4068 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 12577 | |
| 9831 | 10.1% | |
| l | 8319 | 8.5% |
| r | 6720 | 6.9% |
| o | 6687 | 6.8% |
| s | 5545 | 5.7% |
| i | 4950 | 5.1% |
| a | 4698 | 4.8% |
| n | 4629 | 4.7% |
| d | 3760 | 3.8% |
| Other values (42) | 30101 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 75099 | |
| Space Separator | 9831 | 10.1% |
| Uppercase Letter | 8974 | 9.2% |
| Other Punctuation | 1607 | 1.6% |
| Close Punctuation | 745 | 0.8% |
| Open Punctuation | 745 | 0.8% |
| Dash Punctuation | 435 | 0.4% |
| Decimal Number | 362 | 0.4% |
| Math Symbol | 16 | < 0.1% |
| Other Number | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 12577 | |
| l | 8319 | |
| r | 6720 | |
| o | 6687 | |
| s | 5545 | |
| i | 4950 | 6.6% |
| a | 4698 | 6.3% |
| n | 4629 | 6.2% |
| d | 3760 | 5.0% |
| k | 3560 | 4.7% |
| Other values (14) | 13654 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 3255 | |
| W | 1589 | |
| N | 745 | 8.3% |
| P | 714 | 8.0% |
| M | 689 | 7.7% |
| B | 536 | 6.0% |
| E | 427 | 4.8% |
| C | 398 | 4.4% |
| U | 334 | 3.7% |
| J | 251 | 2.8% |
| Other values (5) | 36 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 180 | |
| 9 | 160 | |
| 0 | 16 | 4.4% |
| 2 | 3 | 0.8% |
| 5 | 3 | 0.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 432 | |
| – | 3 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 9831 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1607 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 745 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 745 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 16 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 84073 | |
| Common | 13744 | 14.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 12577 | |
| l | 8319 | 9.9% |
| r | 6720 | 8.0% |
| o | 6687 | 8.0% |
| s | 5545 | 6.6% |
| i | 4950 | 5.9% |
| a | 4698 | 5.6% |
| n | 4629 | 5.5% |
| d | 3760 | 4.5% |
| k | 3560 | 4.2% |
| Other values (29) | 22628 |
Common
| Value | Count | Frequency (%) |
| 9831 | ||
| / | 1607 | 11.7% |
| ) | 745 | 5.4% |
| ( | 745 | 5.4% |
| - | 432 | 3.1% |
| 1 | 180 | 1.3% |
| 9 | 160 | 1.2% |
| 0 | 16 | 0.1% |
| + | 16 | 0.1% |
| – | 3 | < 0.1% |
| Other values (3) | 9 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 97811 | |
| Punctuation | 3 | < 0.1% |
| None | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 12577 | |
| 9831 | 10.1% | |
| l | 8319 | 8.5% |
| r | 6720 | 6.9% |
| o | 6687 | 6.8% |
| s | 5545 | 5.7% |
| i | 4950 | 5.1% |
| a | 4698 | 4.8% |
| n | 4629 | 4.7% |
| d | 3760 | 3.8% |
| Other values (40) | 30095 |
Punctuation
| Value | Count | Frequency (%) |
| – | 3 |
None
| Value | Count | Frequency (%) |
| ½ | 3 |
socio_economic_class
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.9 KiB |
| SEC C | |
|---|---|
| SEC B | |
| SEC A | |
| SEC D | |
| SEC E |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SEC C |
|---|---|
| 2nd row | SEC D |
| 3rd row | SEC A |
| 4th row | SEC A |
| 5th row | SEC D |
Common Values
| Value | Count | Frequency (%) |
| SEC C | 1485 | |
| SEC B | 868 | |
| SEC A | 786 | |
| SEC D | 669 | |
| SEC E | 255 | 6.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| sec | 4063 | |
| c | 1485 | 18.3% |
| b | 868 | 10.7% |
| a | 786 | 9.7% |
| d | 669 | 8.2% |
| e | 255 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 5548 | |
| E | 4318 | |
| S | 4063 | |
| 4063 | ||
| B | 868 | 4.3% |
| A | 786 | 3.9% |
| D | 669 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 16252 | |
| Space Separator | 4063 | 20.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 5548 | |
| E | 4318 | |
| S | 4063 | |
| B | 868 | 5.3% |
| A | 786 | 4.8% |
| D | 669 | 4.1% |
Space Separator
| Value | Count | Frequency (%) |
| 4063 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16252 | |
| Common | 4063 | 20.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 5548 | |
| E | 4318 | |
| S | 4063 | |
| B | 868 | 5.3% |
| A | 786 | 4.8% |
| D | 669 | 4.1% |
Common
| Value | Count | Frequency (%) |
| 4063 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20315 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 5548 | |
| E | 4318 | |
| S | 4063 | |
| 4063 | ||
| B | 868 | 4.3% |
| A | 786 | 3.9% |
| D | 669 | 3.3% |
total_monthly_expenditure_of_last_month
Real number (ℝ)
Missing 
| Distinct | 85 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 135 |
| Missing (%) | 3.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 71327.253 |
| Minimum | 5000 |
|---|---|
| Maximum | 275000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.9 KiB |
Quantile statistics
| Minimum | 5000 |
|---|---|
| 5-th percentile | 20000 |
| Q1 | 40000 |
| median | 60000 |
| Q3 | 100000 |
| 95-th percentile | 150000 |
| Maximum | 275000 |
| Range | 270000 |
| Interquartile range (IQR) | 60000 |
Descriptive statistics
| Standard deviation | 44311.381 |
|---|---|
| Coefficient of variation (CV) | 0.62124054 |
| Kurtosis | 2.7624431 |
| Mean | 71327.253 |
| Median Absolute Deviation (MAD) | 20000 |
| Skewness | 1.5241384 |
| Sum | 2.8017345 × 108 |
| Variance | 1.9634985 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50000 | 535 | |
| 100000 | 498 | |
| 60000 | 417 | |
| 40000 | 294 | 7.2% |
| 30000 | 249 | 6.1% |
| 80000 | 223 | 5.5% |
| 70000 | 218 | 5.4% |
| 150000 | 191 | 4.7% |
| 75000 | 158 | 3.9% |
| 35000 | 127 | 3.1% |
| Other values (75) | 1018 | |
| (Missing) | 135 | 3.3% |
| Value | Count | Frequency (%) |
| 5000 | 6 | 0.1% |
| 6000 | 2 | < 0.1% |
| 7000 | 1 | < 0.1% |
| 7500 | 2 | < 0.1% |
| 8000 | 3 | 0.1% |
| 10000 | 36 | |
| 11000 | 1 | < 0.1% |
| 12000 | 10 | 0.2% |
| 13000 | 1 | < 0.1% |
| 15000 | 64 |
| Value | Count | Frequency (%) |
| 275000 | 2 | < 0.1% |
| 270000 | 1 | < 0.1% |
| 250000 | 31 | 0.8% |
| 230000 | 3 | 0.1% |
| 225000 | 2 | < 0.1% |
| 220000 | 1 | < 0.1% |
| 215000 | 1 | < 0.1% |
| 200000 | 113 | |
| 180000 | 6 | 0.1% |
| 175000 | 6 | 0.1% |
type_of_electricity_meter
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.9 KiB |
| Smart meter | |
|---|---|
| Non smart meter |
Length
| Max length | 15 |
|---|---|
| Median length | 11 |
| Mean length | 12.847896 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Non smart meter |
|---|---|
| 2nd row | Non smart meter |
| 3rd row | Smart meter |
| 4th row | Smart meter |
| 5th row | Smart meter |
Common Values
| Value | Count | Frequency (%) |
| Smart meter | 2186 | |
| Non smart meter | 1877 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| smart | 4063 | |
| meter | 4063 | |
| non | 1877 |
Most occurring characters
| Value | Count | Frequency (%) |
| m | 8126 | |
| t | 8126 | |
| r | 8126 | |
| e | 8126 | |
| 5940 | ||
| a | 4063 | |
| S | 2186 | 4.2% |
| N | 1877 | 3.6% |
| o | 1877 | 3.6% |
| n | 1877 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 42198 | |
| Space Separator | 5940 | 11.4% |
| Uppercase Letter | 4063 | 7.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 8126 | |
| t | 8126 | |
| r | 8126 | |
| e | 8126 | |
| a | 4063 | |
| o | 1877 | 4.4% |
| n | 1877 | 4.4% |
| s | 1877 | 4.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2186 | |
| N | 1877 |
Space Separator
| Value | Count | Frequency (%) |
| 5940 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 46261 | |
| Common | 5940 | 11.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 8126 | |
| t | 8126 | |
| r | 8126 | |
| e | 8126 | |
| a | 4063 | |
| S | 2186 | 4.7% |
| N | 1877 | 4.1% |
| o | 1877 | 4.1% |
| n | 1877 | 4.1% |
| s | 1877 | 4.1% |
Common
| Value | Count | Frequency (%) |
| 5940 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 52201 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| m | 8126 | |
| t | 8126 | |
| r | 8126 | |
| e | 8126 | |
| 5940 | ||
| a | 4063 | |
| S | 2186 | 4.2% |
| N | 1877 | 3.6% |
| o | 1877 | 3.6% |
| n | 1877 | 3.6% |
Interactions
Correlations
| any_constructions_or_renovations_in_the_household | availability_of_certificate_of_compliance | built_year_of_the_house | charged_method_for_rent_for_electricity | charging_method_of_renters_for_electricity | electricity_provider_csc_area | floor_area | floor_which_house_located | highest_level_of_education_of_the_chief_wage_earner | is_there_business_carried_out_in_the_household | main_material_used_for_roof_of_the_house | main_material_used_for_walls_of_the_house | no_of_electricity_meters | no_of_household_members | no_of_storeys | occupation_of_the_chief_wage_earner | occupy_renters_boarders | own_the_house_or_living_on_rent | socio_economic_class | total_monthly_expenditure_of_last_month | type_of_business | type_of_electricity_meter | type_of_house | whom_or_how_the_house_was_designed | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| any_constructions_or_renovations_in_the_household | 1.000 | 0.064 | 0.150 | 0.000 | 0.150 | 0.062 | 0.058 | 0.000 | 0.000 | 0.041 | 0.071 | 0.063 | 0.000 | 0.000 | 0.000 | 0.036 | 0.009 | 0.078 | 0.000 | 0.035 | 0.000 | 0.000 | 0.047 | 0.101 |
| availability_of_certificate_of_compliance | 0.064 | 1.000 | 0.285 | 0.182 | 0.000 | 0.240 | 0.213 | 0.279 | 0.213 | 0.005 | 0.117 | 0.268 | 0.078 | 0.022 | 0.238 | 0.197 | 0.041 | 0.278 | 0.217 | 0.154 | 0.000 | 0.125 | 0.228 | 0.435 |
| built_year_of_the_house | 0.150 | 0.285 | 1.000 | 0.000 | 0.020 | 0.079 | 0.037 | 0.068 | 0.036 | 0.039 | 0.139 | 0.290 | 0.000 | 0.048 | 0.172 | 0.034 | 0.012 | 0.442 | 0.023 | 0.036 | 0.093 | 0.075 | 0.146 | 0.221 |
| charged_method_for_rent_for_electricity | 0.000 | 0.182 | 0.000 | 1.000 | 0.000 | 0.076 | 0.032 | 0.000 | 0.000 | 0.000 | 0.000 | 0.129 | 0.000 | 0.115 | 0.107 | 0.000 | 0.000 | 0.267 | 0.030 | 0.000 | 0.000 | 0.047 | 0.000 | 0.000 |
| charging_method_of_renters_for_electricity | 0.150 | 0.000 | 0.020 | 0.000 | 1.000 | 0.171 | 0.000 | 0.000 | 0.156 | 0.075 | 0.000 | 0.131 | 0.000 | 0.163 | 0.577 | 0.013 | 0.361 | 1.000 | 0.103 | 0.043 | 0.000 | 0.000 | 0.200 | 0.000 |
| electricity_provider_csc_area | 0.062 | 0.240 | 0.079 | 0.076 | 0.171 | 1.000 | 0.144 | 0.000 | 0.177 | 0.028 | 0.158 | 0.125 | 0.057 | 0.058 | 0.193 | 0.105 | 0.075 | 0.108 | 0.177 | 0.089 | 0.172 | 0.502 | 0.124 | 0.181 |
| floor_area | 0.058 | 0.213 | 0.037 | 0.032 | 0.000 | 0.144 | 1.000 | 0.114 | 0.140 | 0.000 | 0.027 | 0.097 | 0.087 | 0.026 | 0.403 | 0.123 | 0.000 | 0.030 | 0.176 | 0.278 | 0.030 | 0.155 | 0.141 | 0.150 |
| floor_which_house_located | 0.000 | 0.279 | 0.068 | 0.000 | 0.000 | 0.000 | 0.114 | 1.000 | 0.154 | 0.000 | 0.000 | 0.000 | 0.014 | 0.039 | 0.015 | 0.137 | 1.000 | 0.000 | 0.116 | 0.073 | NaN | 0.118 | 0.373 | 0.000 |
| highest_level_of_education_of_the_chief_wage_earner | 0.000 | 0.213 | 0.036 | 0.000 | 0.156 | 0.177 | 0.140 | 0.154 | 1.000 | 0.042 | 0.049 | 0.101 | 0.021 | 0.048 | 0.159 | 0.306 | 0.000 | 0.036 | 0.650 | 0.155 | 0.159 | 0.176 | 0.138 | 0.156 |
| is_there_business_carried_out_in_the_household | 0.041 | 0.005 | 0.039 | 0.000 | 0.075 | 0.028 | 0.000 | 0.000 | 0.042 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.174 | 0.000 | 0.000 | 0.052 | 0.000 | 1.000 | 0.008 | 0.000 | 0.000 |
| main_material_used_for_roof_of_the_house | 0.071 | 0.117 | 0.139 | 0.000 | 0.000 | 0.158 | 0.027 | 0.000 | 0.049 | 0.000 | 1.000 | 0.168 | 0.103 | 0.000 | 0.181 | 0.028 | 0.106 | 0.000 | 0.057 | 0.000 | 0.000 | 0.160 | 0.151 | 0.066 |
| main_material_used_for_walls_of_the_house | 0.063 | 0.268 | 0.290 | 0.129 | 0.131 | 0.125 | 0.097 | 0.000 | 0.101 | 0.000 | 0.168 | 1.000 | 0.000 | 0.012 | 0.262 | 0.085 | 0.000 | 0.272 | 0.134 | 0.053 | 0.000 | 0.086 | 0.151 | 0.222 |
| no_of_electricity_meters | 0.000 | 0.078 | 0.000 | 0.000 | 0.000 | 0.057 | 0.087 | 0.014 | 0.021 | 0.000 | 0.103 | 0.000 | 1.000 | -0.035 | 0.100 | 0.070 | 0.000 | 0.000 | 0.048 | 0.048 | 0.051 | 0.101 | 0.158 | 0.063 |
| no_of_household_members | 0.000 | 0.022 | 0.048 | 0.115 | 0.163 | 0.058 | 0.026 | 0.039 | 0.048 | 0.000 | 0.000 | 0.012 | -0.035 | 1.000 | 0.018 | 0.029 | 0.098 | 0.021 | 0.063 | 0.275 | 0.000 | 0.000 | 0.045 | 0.029 |
| no_of_storeys | 0.000 | 0.238 | 0.172 | 0.107 | 0.577 | 0.193 | 0.403 | 0.015 | 0.159 | 0.000 | 0.181 | 0.262 | 0.100 | 0.018 | 1.000 | 0.168 | 0.000 | 0.114 | 0.157 | 0.388 | 1.000 | 0.138 | 0.352 | 0.327 |
| occupation_of_the_chief_wage_earner | 0.036 | 0.197 | 0.034 | 0.000 | 0.013 | 0.105 | 0.123 | 0.137 | 0.306 | 0.174 | 0.028 | 0.085 | 0.070 | 0.029 | 0.168 | 1.000 | 0.000 | 0.033 | 0.625 | 0.139 | 0.242 | 0.167 | 0.115 | 0.124 |
| occupy_renters_boarders | 0.009 | 0.041 | 0.012 | 0.000 | 0.361 | 0.075 | 0.000 | 1.000 | 0.000 | 0.000 | 0.106 | 0.000 | 0.000 | 0.098 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| own_the_house_or_living_on_rent | 0.078 | 0.278 | 0.442 | 0.267 | 1.000 | 0.108 | 0.030 | 0.000 | 0.036 | 0.000 | 0.000 | 0.272 | 0.000 | 0.021 | 0.114 | 0.033 | 1.000 | 1.000 | 0.039 | 0.003 | 0.000 | 0.033 | 0.063 | 0.261 |
| socio_economic_class | 0.000 | 0.217 | 0.023 | 0.030 | 0.103 | 0.177 | 0.176 | 0.116 | 0.650 | 0.052 | 0.057 | 0.134 | 0.048 | 0.063 | 0.157 | 0.625 | 0.000 | 0.039 | 1.000 | 0.218 | 0.128 | 0.192 | 0.184 | 0.168 |
| total_monthly_expenditure_of_last_month | 0.035 | 0.154 | 0.036 | 0.000 | 0.043 | 0.089 | 0.278 | 0.073 | 0.155 | 0.000 | 0.000 | 0.053 | 0.048 | 0.275 | 0.388 | 0.139 | 0.000 | 0.003 | 0.218 | 1.000 | 0.000 | 0.148 | 0.086 | 0.103 |
| type_of_business | 0.000 | 0.000 | 0.093 | 0.000 | 0.000 | 0.172 | 0.030 | NaN | 0.159 | 1.000 | 0.000 | 0.000 | 0.051 | 0.000 | 1.000 | 0.242 | 0.000 | 0.000 | 0.128 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
| type_of_electricity_meter | 0.000 | 0.125 | 0.075 | 0.047 | 0.000 | 0.502 | 0.155 | 0.118 | 0.176 | 0.008 | 0.160 | 0.086 | 0.101 | 0.000 | 0.138 | 0.167 | 0.000 | 0.033 | 0.192 | 0.148 | 0.000 | 1.000 | 0.220 | 0.101 |
| type_of_house | 0.047 | 0.228 | 0.146 | 0.000 | 0.200 | 0.124 | 0.141 | 0.373 | 0.138 | 0.000 | 0.151 | 0.151 | 0.158 | 0.045 | 0.352 | 0.115 | 0.000 | 0.063 | 0.184 | 0.086 | 0.000 | 0.220 | 1.000 | 0.211 |
| whom_or_how_the_house_was_designed | 0.101 | 0.435 | 0.221 | 0.000 | 0.000 | 0.181 | 0.150 | 0.000 | 0.156 | 0.000 | 0.066 | 0.222 | 0.063 | 0.029 | 0.327 | 0.124 | 0.000 | 0.261 | 0.168 | 0.103 | 0.000 | 0.101 | 0.211 | 1.000 |
Missing values
Sample
| household_ID | no_of_electricity_meters | electricity_provider_csc_area | own_the_house_or_living_on_rent | occupy_renters_boarders | awareness_of_electricity_consumption_of_renters | built_year_of_the_house | type_of_house | floor_which_house_located | no_of_storeys | floor_area | no_of_household_members | charging_method_of_renters_for_electricity | charged_method_for_rent_for_electricity | is_there_business_carried_out_in_the_household | type_of_business | whom_or_how_the_house_was_designed | availability_of_certificate_of_compliance | main_material_used_for_walls_of_the_house | main_material_used_for_roof_of_the_house | any_constructions_or_renovations_in_the_household | highest_level_of_education_of_the_chief_wage_earner | occupation_of_the_chief_wage_earner | socio_economic_class | total_monthly_expenditure_of_last_month | type_of_electricity_meter | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | ID0001 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2000-2009 | Single House - Double Floor | NaN | NaN | 1500.0 | 4 | NaN | NaN | No | NaN | The house is designed by a certified architect. | No | Brick | Asbestos | No | O/L or A/L pending / Passed | Skilled Worker | SEC C | 35000.0 | Non smart meter |
| 1 | ID0002 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | Before 1980 | Single House - Single Floor | NaN | NaN | 440.0 | 3 | NaN | NaN | No | NaN | This is a house provided by the government. | Yes | Brick | Asbestos | No | Diploma with O/L or A/L (Non graduate) | Unskilled Worker | SEC D | 40000.0 | Non smart meter |
| 2 | ID0003 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 1980-1989 | Single House - Single Floor | NaN | NaN | 2500.0 | 4 | NaN | NaN | No | NaN | The house is designed by a certified architect. | No | Brick | Tile | No | Graduate / Post-Grads / Degree level professional qualification | Middle and Senior executive | SEC A | 250000.0 | Smart meter |
| 3 | ID0004 | 1 | BORALASGAMUWA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2010-2019 | Single House - Double Floor | NaN | NaN | 2600.0 | 4 | NaN | NaN | No | NaN | The house is designed by a certified architect. | No | Cement Block | Concrete | No | Other professional certificates with O/L or A/L / Part qualification (Non graduate) | 1-9 Employed | SEC A | 100000.0 | Smart meter |
| 4 | ID0005 | 1 | KOLONNAWA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2010-2019 | Flat | 10.0 | 1.0 | 480.0 | 2 | NaN | NaN | No | NaN | The house is designed by a certified architect. | Yes | I am not aware of that | Concrete | No | Schooling up to Grade 6 - 9 | Skilled Worker | SEC D | 60000.0 | Smart meter |
| 5 | ID0006 | 1 | KOLONNAWA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2010-2019 | Flat | 1.0 | 2.0 | 440.0 | 6 | NaN | NaN | No | NaN | This is a house provided by the government. | Yes | Cement Block | Concrete | No | Schooling up to Grade 6 - 9 | Unskilled Worker | SEC E | 100000.0 | Smart meter |
| 6 | ID0007 | 1 | KOLONNAWA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2010-2019 | Flat | 2.0 | 1.0 | 480.0 | 4 | NaN | NaN | No | NaN | This is a house provided by the government. | Yes | Cement Block | Concrete | No | O/L or A/L pending / Passed | Small Businessman / Self employed (Non professional) | SEC C | 60000.0 | Smart meter |
| 7 | ID0008 | 2 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2010-2019 | Single House - Double Floor | NaN | NaN | 1400.0 | 5 | NaN | NaN | No | NaN | The house is designed by a certified architect. | Yes | Cement Block | Asbestos | No | Other professional certificates with O/L or A/L / Part qualification (Non graduate) | Clerk / Salesman grades | SEC B | 150000.0 | Smart meter |
| 8 | ID0009 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2000-2009 | Single House - Single Floor | NaN | NaN | 350.0 | 2 | NaN | NaN | No | NaN | The house is designed by a certified architect. | No | Brick | Asbestos | No | O/L or A/L pending / Passed | Skilled Worker | SEC C | 15000.0 | Non smart meter |
| 9 | ID0010 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 1990-1999 | Single House - Single Floor | NaN | NaN | 1000.0 | 7 | NaN | NaN | No | NaN | The house is designed by a certified architect. | No | Brick | Tile | No | Schooling up to Grade 6 - 9 | Unskilled Worker | SEC E | 50000.0 | Non smart meter |
| household_ID | no_of_electricity_meters | electricity_provider_csc_area | own_the_house_or_living_on_rent | occupy_renters_boarders | awareness_of_electricity_consumption_of_renters | built_year_of_the_house | type_of_house | floor_which_house_located | no_of_storeys | floor_area | no_of_household_members | charging_method_of_renters_for_electricity | charged_method_for_rent_for_electricity | is_there_business_carried_out_in_the_household | type_of_business | whom_or_how_the_house_was_designed | availability_of_certificate_of_compliance | main_material_used_for_walls_of_the_house | main_material_used_for_roof_of_the_house | any_constructions_or_renovations_in_the_household | highest_level_of_education_of_the_chief_wage_earner | occupation_of_the_chief_wage_earner | socio_economic_class | total_monthly_expenditure_of_last_month | type_of_electricity_meter | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4053 | ID4054 | 1 | NEGOMBO | No, I am living on rent and the rent is paid by me or a household member. | NaN | NaN | 1980-1989 | Single House - Single Floor | NaN | NaN | 1250.0 | 6 | NaN | You pay the full amount of the electricity bill. | No | NaN | NaN | NaN | Brick | NaN | No | O/L or A/L pending / Passed | Skilled Worker | SEC C | 70000.0 | Smart meter |
| 4054 | ID4055 | 1 | NEGOMBO | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2000-2009 | Single House - Double Floor | NaN | NaN | 1500.0 | 3 | NaN | NaN | No | NaN | NaN | NaN | Brick | NaN | No | O/L or A/L pending / Passed | Middle and Senior executive | SEC B | 60000.0 | Smart meter |
| 4055 | ID4056 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2000-2009 | Single House - Single Floor | NaN | NaN | 2400.0 | 3 | NaN | NaN | No | NaN | NaN | NaN | Cement Block | NaN | No | Schooling up to Grade 6 - 9 | Unskilled Worker | SEC E | NaN | Non smart meter |
| 4056 | ID4057 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | Before 1980 | Single House - Single Floor | NaN | NaN | 2400.0 | 2 | NaN | NaN | No | NaN | NaN | NaN | Cabook | NaN | No | O/L or A/L pending / Passed | Skilled Worker | SEC C | 25000.0 | Non smart meter |
| 4057 | ID4058 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 1990-1999 | Single House - Single Floor | NaN | NaN | 3000.0 | 6 | NaN | NaN | No | NaN | NaN | NaN | Cement Block | NaN | No | Diploma with O/L or A/L (Non graduate) | Skilled Worker | SEC C | 8000.0 | Non smart meter |
| 4058 | ID4059 | 1 | NUGEGODA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | In 2020 or After 2020 | Single House - Double Floor | NaN | NaN | 400.0 | 4 | NaN | NaN | No | NaN | NaN | NaN | Brick | NaN | No | Graduate / Post-Grads / Degree level professional qualification | Clerk / Salesman grades | SEC B | 150000.0 | Smart meter |
| 4059 | ID4060 | 1 | HIKKADUWA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | Before 1980 | Single House - Single Floor | NaN | NaN | 3000.0 | 1 | NaN | NaN | No | NaN | NaN | NaN | Pressed soil blocks | NaN | No | O/L or A/L pending / Passed | Boutique owner | SEC B | 50000.0 | Non smart meter |
| 4060 | ID4061 | 1 | WATTALA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2000-2009 | Single House - Single Floor | NaN | NaN | 680.0 | 2 | NaN | NaN | No | NaN | NaN | NaN | Cement Block | NaN | No | O/L or A/L pending / Passed | Unskilled Worker | SEC D | 20000.0 | Non smart meter |
| 4061 | ID4062 | 1 | ALUTHGAMA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 1980-1989 | Single House - Single Floor | NaN | NaN | 700.0 | 2 | NaN | NaN | No | NaN | NaN | NaN | Cabook | NaN | No | O/L or A/L pending / Passed | Self employed (Professional) - No employees | SEC B | 30000.0 | Non smart meter |
| 4062 | ID4063 | 1 | ALUTHGAMA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2010-2019 | Single House - Double Floor | NaN | NaN | 2400.0 | 5 | NaN | NaN | No | NaN | NaN | NaN | Cement Block | NaN | No | Schooling up to Grade 6 - 9 | Skilled Worker | SEC D | 100000.0 | Non smart meter |