Overview
Brought to you by YData
Dataset statistics
Number of variables | 26 |
---|---|
Number of observations | 4,063 |
Missing cells | 27,643 |
Missing cells (%) | 26.2% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 825.4 KiB |
Average record size in memory | 208.0 B |
Variable types
Text | 1 |
---|---|
Numeric | 6 |
Categorical | 17 |
Boolean | 2 |
awareness_of_electricity_consumption_of_renters has constant value "I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc." | Constant |
charging_method_of_renters_for_electricity is highly overall correlated with no_of_storeys and 1 other fields | High correlation |
electricity_provider_csc_area is highly overall correlated with type_of_electricity_meter | High correlation |
floor_which_house_located is highly overall correlated with occupy_renters_boarders | High correlation |
highest_level_of_education_of_the_chief_wage_earner is highly overall correlated with socio_economic_class | High correlation |
is_there_business_carried_out_in_the_household is highly overall correlated with type_of_business | High correlation |
no_of_storeys is highly overall correlated with charging_method_of_renters_for_electricity and 1 other fields | High correlation |
occupation_of_the_chief_wage_earner is highly overall correlated with socio_economic_class | High correlation |
occupy_renters_boarders is highly overall correlated with floor_which_house_located and 1 other fields | High correlation |
own_the_house_or_living_on_rent is highly overall correlated with charging_method_of_renters_for_electricity and 1 other fields | High correlation |
socio_economic_class is highly overall correlated with highest_level_of_education_of_the_chief_wage_earner and 1 other fields | High correlation |
type_of_business is highly overall correlated with is_there_business_carried_out_in_the_household and 1 other fields | High correlation |
type_of_electricity_meter is highly overall correlated with electricity_provider_csc_area | High correlation |
own_the_house_or_living_on_rent is highly imbalanced (68.5%) | Imbalance |
occupy_renters_boarders is highly imbalanced (86.2%) | Imbalance |
type_of_house is highly imbalanced (59.8%) | Imbalance |
charged_method_for_rent_for_electricity is highly imbalanced (79.8%) | Imbalance |
is_there_business_carried_out_in_the_household is highly imbalanced (73.2%) | Imbalance |
main_material_used_for_roof_of_the_house is highly imbalanced (50.4%) | Imbalance |
any_constructions_or_renovations_in_the_household is highly imbalanced (71.3%) | Imbalance |
occupy_renters_boarders has 536 (13.2%) missing values | Missing |
awareness_of_electricity_consumption_of_renters has 3959 (97.4%) missing values | Missing |
floor_which_house_located has 3970 (97.7%) missing values | Missing |
no_of_storeys has 3814 (93.9%) missing values | Missing |
charging_method_of_renters_for_electricity has 3959 (97.4%) missing values | Missing |
charged_method_for_rent_for_electricity has 3527 (86.8%) missing values | Missing |
type_of_business has 3877 (95.4%) missing values | Missing |
whom_or_how_the_house_was_designed has 1280 (31.5%) missing values | Missing |
availability_of_certificate_of_compliance has 1280 (31.5%) missing values | Missing |
main_material_used_for_roof_of_the_house has 1280 (31.5%) missing values | Missing |
total_monthly_expenditure_of_last_month has 135 (3.3%) missing values | Missing |
household_ID has unique values | Unique |
Reproduction
Analysis started | 2024-12-06 05:54:12.129767 |
---|---|
Analysis finished | 2024-12-06 05:54:18.907451 |
Duration | 6.78 seconds |
Software version | ydata-profiling vv4.11.0 |
Download configuration | config.json |
Variables
household_ID
Text
Unique 
Distinct | 4063 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.9 KiB |
Value | Count | Frequency (%) |
id0039 | 1 | < 0.1% |
id4063 | 1 | < 0.1% |
id0001 | 1 | < 0.1% |
id0002 | 1 | < 0.1% |
id0003 | 1 | < 0.1% |
id0004 | 1 | < 0.1% |
id0005 | 1 | < 0.1% |
id0006 | 1 | < 0.1% |
id0007 | 1 | < 0.1% |
id0008 | 1 | < 0.1% |
Other values (4053) | 4053 |
Most occurring characters
Value | Count | Frequency (%) |
I | 4063 | |
D | 4063 | |
0 | 2277 | |
3 | 2217 | |
2 | 2217 | |
1 | 2217 | |
4 | 1280 | 5.3% |
5 | 1216 | 5.0% |
6 | 1210 | 5.0% |
7 | 1206 | 4.9% |
Other values (2) | 2412 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 16252 | |
Uppercase Letter | 8126 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 2277 | |
3 | 2217 | |
2 | 2217 | |
1 | 2217 | |
4 | 1280 | |
5 | 1216 | |
6 | 1210 | |
7 | 1206 | |
8 | 1206 | |
9 | 1206 |
Uppercase Letter
Value | Count | Frequency (%) |
I | 4063 | |
D | 4063 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 16252 | |
Latin | 8126 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 2277 | |
3 | 2217 | |
2 | 2217 | |
1 | 2217 | |
4 | 1280 | |
5 | 1216 | |
6 | 1210 | |
7 | 1206 | |
8 | 1206 | |
9 | 1206 |
Latin
Value | Count | Frequency (%) |
I | 4063 | |
D | 4063 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 24378 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
I | 4063 | |
D | 4063 | |
0 | 2277 | |
3 | 2217 | |
2 | 2217 | |
1 | 2217 | |
4 | 1280 | 5.3% |
5 | 1216 | 5.0% |
6 | 1210 | 5.0% |
7 | 1206 | 4.9% |
Other values (2) | 2412 |
no_of_electricity_meters
Real number (ℝ)
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.0762983 |
Minimum | 1 |
---|---|
Maximum | 7 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 31.9 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 1 |
95-th percentile | 2 |
Maximum | 7 |
Range | 6 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 0.31628519 |
---|---|
Coefficient of variation (CV) | 0.29386387 |
Kurtosis | 51.158272 |
Mean | 1.0762983 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.6452849 |
Sum | 4373 |
Variance | 0.10003632 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 3799 | |
2 | 225 | 5.5% |
3 | 36 | 0.9% |
5 | 1 | < 0.1% |
7 | 1 | < 0.1% |
4 | 1 | < 0.1% |
Value | Count | Frequency (%) |
1 | 3799 | |
2 | 225 | 5.5% |
3 | 36 | 0.9% |
4 | 1 | < 0.1% |
5 | 1 | < 0.1% |
7 | 1 | < 0.1% |
Value | Count | Frequency (%) |
7 | 1 | < 0.1% |
5 | 1 | < 0.1% |
4 | 1 | < 0.1% |
3 | 36 | 0.9% |
2 | 225 | 5.5% |
1 | 3799 |
electricity_provider_csc_area
Categorical
High correlation 
Distinct | 23 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.9 KiB |
MORATUWA NORTH | |
---|---|
MORATUWA SOUTH | |
PANADURA | |
GALLE | 216 |
KESELWATTA | 206 |
Other values (18) |
Common Values
Value | Count | Frequency (%) |
MORATUWA NORTH | 533 | 13.1% |
MORATUWA SOUTH | 370 | 9.1% |
PANADURA | 357 | 8.8% |
GALLE | 216 | 5.3% |
KESELWATTA | 206 | 5.1% |
MAHARAGAMA | 202 | 5.0% |
PAYAGALA | 196 | 4.8% |
KALUTARA | 189 | 4.7% |
HIKKADUWA | 163 | 4.0% |
ALUTHGAMA | 158 | 3.9% |
Other values (13) | 1473 |
Length
Value | Count | Frequency (%) |
moratuwa | 903 | |
north | 533 | 10.7% |
south | 370 | 7.5% |
panadura | 357 | 7.2% |
galle | 216 | 4.3% |
keselwatta | 206 | 4.1% |
maharagama | 202 | 4.1% |
payagala | 196 | 3.9% |
kalutara | 189 | 3.8% |
hikkaduwa | 163 | 3.3% |
Other values (14) | 1631 |
Most occurring characters
Value | Count | Frequency (%) |
A | 10055 | |
T | 3417 | 8.7% |
O | 2914 | 7.4% |
U | 2568 | 6.6% |
R | 2454 | 6.3% |
M | 2125 | 5.4% |
L | 1849 | 4.7% |
W | 1788 | 4.6% |
N | 1693 | 4.3% |
H | 1565 | 4.0% |
Other values (12) | 8770 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 38065 | |
Space Separator | 903 | 2.3% |
Dash Punctuation | 230 | 0.6% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 10055 | |
T | 3417 | 9.0% |
O | 2914 | 7.7% |
U | 2568 | 6.7% |
R | 2454 | 6.4% |
M | 2125 | 5.6% |
L | 1849 | 4.9% |
W | 1788 | 4.7% |
N | 1693 | 4.4% |
H | 1565 | 4.1% |
Other values (10) | 7637 |
Space Separator
Value | Count | Frequency (%) |
903 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 230 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 38065 | |
Common | 1133 | 2.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 10055 | |
T | 3417 | 9.0% |
O | 2914 | 7.7% |
U | 2568 | 6.7% |
R | 2454 | 6.4% |
M | 2125 | 5.6% |
L | 1849 | 4.9% |
W | 1788 | 4.7% |
N | 1693 | 4.4% |
H | 1565 | 4.1% |
Other values (10) | 7637 |
Common
Value | Count | Frequency (%) |
903 | ||
- | 230 | 20.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 39198 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
A | 10055 | |
T | 3417 | 8.7% |
O | 2914 | 7.4% |
U | 2568 | 6.6% |
R | 2454 | 6.3% |
M | 2125 | 5.4% |
L | 1849 | 4.7% |
W | 1788 | 4.6% |
N | 1693 | 4.3% |
H | 1565 | 4.0% |
Other values (12) | 8770 |
own_the_house_or_living_on_rent
Categorical
High correlation  Imbalance 
Distinct | 4 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.9 KiB |
Yes, I or a household member owns it. | |
---|---|
No, I am living on rent and the rent is paid by me or a household member. | |
No, I or any household member does not own or rent this household. We occupy this household without any payment of rent. | 50 |
No, I am living on rent and the rent is paid by the employer. | 4 |
Length
Max length | 120 |
---|---|
Median length | 37 |
Mean length | 42.315777 |
Min length | 37 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Yes, I or a household member owns it. |
---|---|
2nd row | Yes, I or a household member owns it. |
3rd row | Yes, I or a household member owns it. |
4th row | Yes, I or a household member owns it. |
5th row | Yes, I or a household member owns it. |
Common Values
Value | Count | Frequency (%) |
Yes, I or a household member owns it. | 3527 | |
No, I am living on rent and the rent is paid by me or a household member. | 482 | 11.9% |
No, I or any household member does not own or rent this household. We occupy this household without any payment of rent. | 50 | 1.2% |
No, I am living on rent and the rent is paid by the employer. | 4 | 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
household | 4159 | |
or | 4109 | |
i | 4063 | |
member | 4059 | |
a | 4009 | |
yes | 3527 | |
owns | 3527 | |
it | 3527 | |
rent | 1072 | 2.9% |
no | 536 | 1.4% |
Other values (20) | 4978 |
Most occurring characters
Value | Count | Frequency (%) |
33503 | ||
e | 18006 | 10.5% |
o | 17280 | 10.1% |
s | 11849 | 6.9% |
r | 9244 | 5.4% |
m | 9140 | 5.3% |
h | 8958 | 5.2% |
n | 6307 | 3.7% |
i | 5621 | 3.3% |
a | 5617 | 3.3% |
Other values (18) | 46404 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 122074 | |
Space Separator | 33503 | 19.5% |
Other Punctuation | 8176 | 4.8% |
Uppercase Letter | 8176 | 4.8% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 18006 | |
o | 17280 | |
s | 11849 | |
r | 9244 | 7.6% |
m | 9140 | 7.5% |
h | 8958 | 7.3% |
n | 6307 | 5.2% |
i | 5621 | 4.6% |
a | 5617 | 4.6% |
t | 5389 | 4.4% |
Other values (11) | 24663 |
Uppercase Letter
Value | Count | Frequency (%) |
I | 4063 | |
Y | 3527 | |
N | 536 | 6.6% |
W | 50 | 0.6% |
Other Punctuation
Value | Count | Frequency (%) |
. | 4113 | |
, | 4063 |
Space Separator
Value | Count | Frequency (%) |
33503 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 130250 | |
Common | 41679 | 24.2% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 18006 | |
o | 17280 | |
s | 11849 | 9.1% |
r | 9244 | 7.1% |
m | 9140 | 7.0% |
h | 8958 | 6.9% |
n | 6307 | 4.8% |
i | 5621 | 4.3% |
a | 5617 | 4.3% |
t | 5389 | 4.1% |
Other values (15) | 32839 |
Common
Value | Count | Frequency (%) |
33503 | ||
. | 4113 | 9.9% |
, | 4063 | 9.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 171929 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
33503 | ||
e | 18006 | 10.5% |
o | 17280 | 10.1% |
s | 11849 | 6.9% |
r | 9244 | 5.4% |
m | 9140 | 5.3% |
h | 8958 | 5.2% |
n | 6307 | 3.7% |
i | 5621 | 3.3% |
a | 5617 | 3.3% |
Other values (18) | 46404 |
occupy_renters_boarders
Categorical
High correlation  Imbalance  Missing 
Distinct | 3 |
---|---|
Distinct (%) | 0.1% |
Missing | 536 |
Missing (%) | 13.2% |
Memory size | 31.9 KiB |
I don't occupy any of the above. | |
---|---|
Renters / boarders who are living in your annexe or any other attached place, maintaining separate living conditions but share the same electricity meter. | 72 |
Boarders who live in your house using a room/s that is attached to your living conditions. | 32 |
Length
Max length | 154 |
---|---|
Median length | 32 |
Mean length | 35.016728 |
Min length | 32 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | I don't occupy any of the above. |
---|---|
2nd row | I don't occupy any of the above. |
3rd row | I don't occupy any of the above. |
4th row | I don't occupy any of the above. |
5th row | I don't occupy any of the above. |
Common Values
Value | Count | Frequency (%) |
I don't occupy any of the above. | 3423 | |
Renters / boarders who are living in your annexe or any other attached place, maintaining separate living conditions but share the same electricity meter. | 72 | 1.8% |
Boarders who live in your house using a room/s that is attached to your living conditions. | 32 | 0.8% |
(Missing) | 536 | 13.2% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
any | 3495 | |
the | 3495 | |
don't | 3423 | |
i | 3423 | |
occupy | 3423 | |
of | 3423 | |
above | 3423 | |
living | 176 | 0.7% |
your | 136 | 0.5% |
in | 104 | 0.4% |
Other values (26) | 1680 |
Most occurring characters
Value | Count | Frequency (%) |
22674 | ||
o | 14516 | |
e | 8270 | 6.7% |
a | 7942 | 6.4% |
t | 7902 | 6.4% |
n | 7870 | 6.4% |
c | 7270 | 5.9% |
y | 7126 | 5.8% |
h | 3911 | 3.2% |
d | 3735 | 3.0% |
Other values (20) | 32288 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 90177 | |
Space Separator | 22674 | 18.4% |
Other Punctuation | 7126 | 5.8% |
Uppercase Letter | 3527 | 2.9% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 14516 | |
e | 8270 | |
a | 7942 | |
t | 7902 | |
n | 7870 | |
c | 7270 | |
y | 7126 | 7.9% |
h | 3911 | 4.3% |
d | 3735 | 4.1% |
u | 3695 | 4.1% |
Other values (12) | 17940 |
Other Punctuation
Value | Count | Frequency (%) |
. | 3527 | |
' | 3423 | |
/ | 104 | 1.5% |
, | 72 | 1.0% |
Uppercase Letter
Value | Count | Frequency (%) |
I | 3423 | |
R | 72 | 2.0% |
B | 32 | 0.9% |
Space Separator
Value | Count | Frequency (%) |
22674 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 93704 | |
Common | 29800 | 24.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 14516 | |
e | 8270 | 8.8% |
a | 7942 | 8.5% |
t | 7902 | 8.4% |
n | 7870 | 8.4% |
c | 7270 | 7.8% |
y | 7126 | 7.6% |
h | 3911 | 4.2% |
d | 3735 | 4.0% |
u | 3695 | 3.9% |
Other values (15) | 21467 |
Common
Value | Count | Frequency (%) |
22674 | ||
. | 3527 | 11.8% |
' | 3423 | 11.5% |
/ | 104 | 0.3% |
, | 72 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 123504 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
22674 | ||
o | 14516 | |
e | 8270 | 6.7% |
a | 7942 | 6.4% |
t | 7902 | 6.4% |
n | 7870 | 6.4% |
c | 7270 | 5.9% |
y | 7126 | 5.8% |
h | 3911 | 3.2% |
d | 3735 | 3.0% |
Other values (20) | 32288 |
awareness_of_electricity_consumption_of_renters
Categorical
Constant  Missing 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 3959 |
Missing (%) | 97.4% |
Memory size | 31.9 KiB |
I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. |
---|
Length
Max length | 218 |
---|---|
Median length | 218 |
Mean length | 218 |
Min length | 218 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. |
---|---|
2nd row | I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. |
3rd row | I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. |
4th row | I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. |
5th row | I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. |
Common Values
Value | Count | Frequency (%) |
I know all the details about the electricity consumption of the renters/ boarders; i.e.; the appliances they use and the number of hours they use each appliance, the times they keep the lights and fans switched on etc. | 104 | 2.6% |
(Missing) | 3959 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
the | 728 | |
they | 312 | 7.9% |
of | 208 | 5.3% |
use | 208 | 5.3% |
and | 208 | 5.3% |
all | 104 | 2.6% |
details | 104 | 2.6% |
electricity | 104 | 2.6% |
know | 104 | 2.6% |
about | 104 | 2.6% |
Other values (17) | 1768 |
Most occurring characters
Value | Count | Frequency (%) |
3848 | ||
e | 2912 | |
t | 2080 | 9.2% |
h | 1456 | 6.4% |
a | 1248 | 5.5% |
s | 1248 | 5.5% |
n | 1144 | 5.0% |
i | 1040 | 4.6% |
o | 936 | 4.1% |
c | 832 | 3.7% |
Other values (17) | 5928 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 17992 | |
Space Separator | 3848 | 17.0% |
Other Punctuation | 728 | 3.2% |
Uppercase Letter | 104 | 0.5% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 2912 | |
t | 2080 | |
h | 1456 | 8.1% |
a | 1248 | 6.9% |
s | 1248 | 6.9% |
n | 1144 | 6.4% |
i | 1040 | 5.8% |
o | 936 | 5.2% |
c | 832 | 4.6% |
l | 728 | 4.0% |
Other values (11) | 4368 |
Other Punctuation
Value | Count | Frequency (%) |
. | 312 | |
; | 208 | |
/ | 104 | 14.3% |
, | 104 | 14.3% |
Space Separator
Value | Count | Frequency (%) |
3848 |
Uppercase Letter
Value | Count | Frequency (%) |
I | 104 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 18096 | |
Common | 4576 | 20.2% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 2912 | |
t | 2080 | |
h | 1456 | 8.0% |
a | 1248 | 6.9% |
s | 1248 | 6.9% |
n | 1144 | 6.3% |
i | 1040 | 5.7% |
o | 936 | 5.2% |
c | 832 | 4.6% |
l | 728 | 4.0% |
Other values (12) | 4472 |
Common
Value | Count | Frequency (%) |
3848 | ||
. | 312 | 6.8% |
; | 208 | 4.5% |
/ | 104 | 2.3% |
, | 104 | 2.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 22672 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
3848 | ||
e | 2912 | |
t | 2080 | 9.2% |
h | 1456 | 6.4% |
a | 1248 | 5.5% |
s | 1248 | 5.5% |
n | 1144 | 5.0% |
i | 1040 | 4.6% |
o | 936 | 4.1% |
c | 832 | 3.7% |
Other values (17) | 5928 |
built_year_of_the_house
Categorical
Distinct | 7 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.9 KiB |
2000-2009 | |
---|---|
2010-2019 | |
Before 1980 | |
1990-1999 | |
1980-1989 | |
Other values (2) |
Length
Max length | 21 |
---|---|
Median length | 9 |
Mean length | 10.108787 |
Min length | 9 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2000-2009 |
---|---|
2nd row | Before 1980 |
3rd row | 1980-1989 |
4th row | 2010-2019 |
5th row | 2010-2019 |
Common Values
Value | Count | Frequency (%) |
2000-2009 | 918 | |
2010-2019 | 758 | |
Before 1980 | 740 | |
1990-1999 | 615 | |
1980-1989 | 482 | |
Don't know | 325 | 8.0% |
In 2020 or After 2020 | 225 | 5.5% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2000-2009 | 918 | |
2010-2019 | 758 | |
before | 740 | |
1980 | 740 | |
1990-1999 | 615 | |
1980-1989 | 482 | |
2020 | 450 | |
don't | 325 | 5.4% |
know | 325 | 5.4% |
in | 225 | 3.7% |
Other values (2) | 450 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 9601 | |
9 | 6937 | |
1 | 4450 | |
2 | 4252 | |
- | 2773 | 6.8% |
1965 | 4.8% | |
e | 1705 | 4.2% |
8 | 1704 | 4.1% |
o | 1615 | 3.9% |
r | 1190 | 2.9% |
Other values (10) | 4880 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 26944 | |
Lowercase Letter | 7550 | 18.4% |
Dash Punctuation | 2773 | 6.8% |
Space Separator | 1965 | 4.8% |
Uppercase Letter | 1515 | 3.7% |
Other Punctuation | 325 | 0.8% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 1705 | |
o | 1615 | |
r | 1190 | |
f | 965 | |
n | 875 | |
t | 550 | 7.3% |
k | 325 | 4.3% |
w | 325 | 4.3% |
Decimal Number
Value | Count | Frequency (%) |
0 | 9601 | |
9 | 6937 | |
1 | 4450 | |
2 | 4252 | |
8 | 1704 | 6.3% |
Uppercase Letter
Value | Count | Frequency (%) |
B | 740 | |
D | 325 | |
I | 225 | 14.9% |
A | 225 | 14.9% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2773 |
Space Separator
Value | Count | Frequency (%) |
1965 |
Other Punctuation
Value | Count | Frequency (%) |
' | 325 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 32007 | |
Latin | 9065 | 22.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 1705 | |
o | 1615 | |
r | 1190 | |
f | 965 | |
n | 875 | |
B | 740 | |
t | 550 | 6.1% |
D | 325 | 3.6% |
k | 325 | 3.6% |
w | 325 | 3.6% |
Other values (2) | 450 | 5.0% |
Common
Value | Count | Frequency (%) |
0 | 9601 | |
9 | 6937 | |
1 | 4450 | |
2 | 4252 | |
- | 2773 | 8.7% |
1965 | 6.1% | |
8 | 1704 | 5.3% |
' | 325 | 1.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 41072 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 9601 | |
9 | 6937 | |
1 | 4450 | |
2 | 4252 | |
- | 2773 | 6.8% |
1965 | 4.8% | |
e | 1705 | 4.2% |
8 | 1704 | 4.1% |
o | 1615 | 3.9% |
r | 1190 | 2.9% |
Other values (10) | 4880 |
type_of_house
Categorical
Imbalance 
Distinct | 10 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.9 KiB |
Single House - Single Floor | |
---|---|
Single House - Double Floor | |
Single House - More than 2 floors | 113 |
Flat | 80 |
Condominium/ Luxury apartments | 13 |
Other values (5) | 43 |
Length
Max length | 33 |
---|---|
Median length | 27 |
Mean length | 26.605464 |
Min length | 4 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Single House - Double Floor |
---|---|
2nd row | Single House - Single Floor |
3rd row | Single House - Single Floor |
4th row | Single House - Double Floor |
5th row | Flat |
Common Values
Value | Count | Frequency (%) |
Single House - Single Floor | 2482 | |
Single House - Double Floor | 1332 | |
Single House - More than 2 floors | 113 | 2.8% |
Flat | 80 | 2.0% |
Condominium/ Luxury apartments | 13 | 0.3% |
Slum / Shanty | 11 | 0.3% |
Line room/row house | 11 | 0.3% |
Attached house / Annex | 10 | 0.2% |
Twin houses | 9 | 0.2% |
Other | 2 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
single | 6409 | |
house | 3948 | |
3948 | ||
floor | 3814 | |
double | 1332 | 6.6% |
more | 113 | 0.6% |
than | 113 | 0.6% |
2 | 113 | 0.6% |
floors | 113 | 0.6% |
flat | 80 | 0.4% |
Other values (12) | 123 | 0.6% |
Most occurring characters
Value | Count | Frequency (%) |
16043 | ||
o | 13315 | |
e | 11857 | |
l | 11759 | |
n | 6612 | 6.1% |
i | 6455 | 6.0% |
S | 6431 | 5.9% |
g | 6409 | 5.9% |
u | 5339 | 4.9% |
s | 4092 | 3.8% |
Other values (25) | 19786 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 72205 | |
Space Separator | 16043 | 14.8% |
Uppercase Letter | 15765 | 14.6% |
Dash Punctuation | 3927 | 3.6% |
Decimal Number | 113 | 0.1% |
Other Punctuation | 45 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 13315 | |
e | 11857 | |
l | 11759 | |
n | 6612 | |
i | 6455 | |
g | 6409 | |
u | 5339 | |
s | 4092 | 5.7% |
r | 4090 | 5.7% |
b | 1332 | 1.8% |
Other values (11) | 945 | 1.3% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 6431 | |
H | 3927 | |
F | 3894 | |
D | 1332 | 8.4% |
M | 113 | 0.7% |
L | 24 | 0.2% |
A | 20 | 0.1% |
C | 13 | 0.1% |
T | 9 | 0.1% |
O | 2 | < 0.1% |
Space Separator
Value | Count | Frequency (%) |
16043 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3927 |
Decimal Number
Value | Count | Frequency (%) |
2 | 113 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 45 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 87970 | |
Common | 20128 | 18.6% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 13315 | |
e | 11857 | |
l | 11759 | |
n | 6612 | |
i | 6455 | |
S | 6431 | |
g | 6409 | |
u | 5339 | |
s | 4092 | 4.7% |
r | 4090 | 4.6% |
Other values (21) | 11611 |
Common
Value | Count | Frequency (%) |
16043 | ||
- | 3927 | 19.5% |
2 | 113 | 0.6% |
/ | 45 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 108098 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
16043 | ||
o | 13315 | |
e | 11857 | |
l | 11759 | |
n | 6612 | 6.1% |
i | 6455 | 6.0% |
S | 6431 | 5.9% |
g | 6409 | 5.9% |
u | 5339 | 4.9% |
s | 4092 | 3.8% |
Other values (25) | 19786 |
floor_which_house_located
Real number (ℝ)
High correlation  Missing 
Distinct | 12 |
---|---|
Distinct (%) | 12.9% |
Missing | 3970 |
Missing (%) | 97.7% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 2.7741935 |
Minimum | 0 |
---|---|
Maximum | 11 |
Zeros | 17 |
Zeros (%) | 0.4% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 31.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 2 |
Q3 | 4 |
95-th percentile | 9 |
Maximum | 11 |
Range | 11 |
Interquartile range (IQR) | 3 |
Descriptive statistics
Standard deviation | 2.8173725 |
---|---|
Coefficient of variation (CV) | 1.0155645 |
Kurtosis | 0.61342339 |
Mean | 2.7741935 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 1.2342572 |
Sum | 258 |
Variance | 7.9375877 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 24 | 0.6% |
2 | 18 | 0.4% |
0 | 17 | 0.4% |
3 | 7 | 0.2% |
4 | 7 | 0.2% |
5 | 4 | 0.1% |
8 | 4 | 0.1% |
6 | 3 | 0.1% |
7 | 3 | 0.1% |
9 | 3 | 0.1% |
Other values (2) | 3 | 0.1% |
(Missing) | 3970 |
Value | Count | Frequency (%) |
0 | 17 | |
1 | 24 | |
2 | 18 | |
3 | 7 | 0.2% |
4 | 7 | 0.2% |
5 | 4 | 0.1% |
6 | 3 | 0.1% |
7 | 3 | 0.1% |
8 | 4 | 0.1% |
9 | 3 | 0.1% |
Value | Count | Frequency (%) |
11 | 1 | < 0.1% |
10 | 2 | < 0.1% |
9 | 3 | 0.1% |
8 | 4 | 0.1% |
7 | 3 | 0.1% |
6 | 3 | 0.1% |
5 | 4 | 0.1% |
4 | 7 | 0.2% |
3 | 7 | 0.2% |
2 | 18 |
no_of_storeys
Real number (ℝ)
High correlation  Missing 
Distinct | 6 |
---|---|
Distinct (%) | 2.4% |
Missing | 3814 |
Missing (%) | 93.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.746988 |
Minimum | 0 |
---|---|
Maximum | 5 |
Zeros | 35 |
Zeros (%) | 0.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 31.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 1 |
median | 1 |
Q3 | 3 |
95-th percentile | 3 |
Maximum | 5 |
Range | 5 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.1378039 |
---|---|
Coefficient of variation (CV) | 0.65129465 |
Kurtosis | -1.2583883 |
Mean | 1.746988 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.013050986 |
Sum | 435 |
Variance | 1.2945977 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 93 | 2.3% |
1 | 91 | 2.2% |
0 | 35 | 0.9% |
2 | 28 | 0.7% |
4 | 1 | < 0.1% |
5 | 1 | < 0.1% |
(Missing) | 3814 |
Value | Count | Frequency (%) |
0 | 35 | 0.9% |
1 | 91 | |
2 | 28 | 0.7% |
3 | 93 | |
4 | 1 | < 0.1% |
5 | 1 | < 0.1% |
Value | Count | Frequency (%) |
5 | 1 | < 0.1% |
4 | 1 | < 0.1% |
3 | 93 | |
2 | 28 | 0.7% |
1 | 91 | |
0 | 35 | 0.9% |
floor_area
Real number (ℝ)
Distinct | 386 |
---|---|
Distinct (%) | 9.6% |
Missing | 26 |
Missing (%) | 0.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1356.1812 |
Minimum | 100 |
---|---|
Maximum | 9000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 31.9 KiB |
Quantile statistics
Minimum | 100 |
---|---|
5-th percentile | 300 |
Q1 | 600 |
median | 1000 |
Q3 | 2000 |
95-th percentile | 3000 |
Maximum | 9000 |
Range | 8900 |
Interquartile range (IQR) | 1400 |
Descriptive statistics
Standard deviation | 950.2113 |
---|---|
Coefficient of variation (CV) | 0.70065219 |
Kurtosis | 2.1828572 |
Mean | 1356.1812 |
Median Absolute Deviation (MAD) | 500 |
Skewness | 1.2111539 |
Sum | 5474903.3 |
Variance | 902901.51 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3000 | 277 | 6.8% |
1000 | 264 | 6.5% |
1200 | 256 | 6.3% |
800 | 203 | 5.0% |
600 | 202 | 5.0% |
2400 | 180 | 4.4% |
1500 | 175 | 4.3% |
2000 | 171 | 4.2% |
500 | 140 | 3.4% |
400 | 111 | 2.7% |
Other values (376) | 2058 |
Value | Count | Frequency (%) |
100 | 12 | |
108 | 1 | < 0.1% |
120 | 2 | < 0.1% |
125 | 1 | < 0.1% |
136.5 | 1 | < 0.1% |
140 | 2 | < 0.1% |
143 | 1 | < 0.1% |
144 | 1 | < 0.1% |
150 | 22 | |
160 | 1 | < 0.1% |
Value | Count | Frequency (%) |
9000 | 2 | < 0.1% |
6000 | 1 | < 0.1% |
5000 | 1 | < 0.1% |
4700 | 1 | < 0.1% |
4600 | 7 | 0.2% |
4400 | 8 | 0.2% |
4200 | 15 | |
4000 | 23 | |
3960 | 2 | < 0.1% |
3900 | 1 | < 0.1% |
no_of_household_members
Real number (ℝ)
Distinct | 13 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.0044302 |
Minimum | 1 |
---|---|
Maximum | 13 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 31.9 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 3 |
median | 4 |
Q3 | 5 |
95-th percentile | 7 |
Maximum | 13 |
Range | 12 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.6872622 |
---|---|
Coefficient of variation (CV) | 0.42134889 |
Kurtosis | 1.2549374 |
Mean | 4.0044302 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 0.68467186 |
Sum | 16270 |
Variance | 2.8468538 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4 | 1014 | |
3 | 818 | |
5 | 772 | |
2 | 602 | |
6 | 407 | |
1 | 186 | 4.6% |
7 | 144 | 3.5% |
8 | 65 | 1.6% |
9 | 29 | 0.7% |
10 | 15 | 0.4% |
Other values (3) | 11 | 0.3% |
Value | Count | Frequency (%) |
1 | 186 | 4.6% |
2 | 602 | |
3 | 818 | |
4 | 1014 | |
5 | 772 | |
6 | 407 | |
7 | 144 | 3.5% |
8 | 65 | 1.6% |
9 | 29 | 0.7% |
10 | 15 | 0.4% |
Value | Count | Frequency (%) |
13 | 2 | < 0.1% |
12 | 4 | 0.1% |
11 | 5 | 0.1% |
10 | 15 | 0.4% |
9 | 29 | 0.7% |
8 | 65 | 1.6% |
7 | 144 | 3.5% |
6 | 407 | |
5 | 772 | |
4 | 1014 |
charging_method_of_renters_for_electricity
Categorical
High correlation  Missing 
Distinct | 5 |
---|---|
Distinct (%) | 4.8% |
Missing | 3959 |
Missing (%) | 97.4% |
Memory size | 31.9 KiB |
You charge a fixed amount every month for electricity. | |
---|---|
You don't charge them for electricity consumption. | |
You charge an amount for electricity depending on the variance of the bill. | |
You don't charge a specific amount for electricity but charge a fixed amount for all the utilities such as electricity, water etc. | |
You don't charge a specific amount for electricity but charge a varied amount for all the utilities such as electricity, water etc. The amount charged varied based on the utility bills. |
Length
Max length | 185 |
---|---|
Median length | 130 |
Mean length | 78.759615 |
Min length | 50 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | You don't charge them for electricity consumption. |
---|---|
2nd row | You don't charge them for electricity consumption. |
3rd row | You don't charge them for electricity consumption. |
4th row | You don't charge a specific amount for electricity but charge a fixed amount for all the utilities such as electricity, water etc. |
5th row | You don't charge a specific amount for electricity but charge a fixed amount for all the utilities such as electricity, water etc. |
Common Values
Value | Count | Frequency (%) |
You charge a fixed amount every month for electricity. | 34 | 0.8% |
You don't charge them for electricity consumption. | 24 | 0.6% |
You charge an amount for electricity depending on the variance of the bill. | 21 | 0.5% |
You don't charge a specific amount for electricity but charge a fixed amount for all the utilities such as electricity, water etc. | 19 | 0.5% |
You don't charge a specific amount for electricity but charge a varied amount for all the utilities such as electricity, water etc. The amount charged varied based on the utility bills. | 6 | 0.1% |
(Missing) | 3959 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
charge | 129 | 9.5% |
for | 129 | 9.5% |
electricity | 129 | 9.5% |
amount | 111 | 8.2% |
you | 104 | 7.7% |
a | 84 | 6.2% |
the | 79 | 5.8% |
fixed | 53 | 3.9% |
don't | 49 | 3.6% |
every | 34 | 2.5% |
Other values (22) | 450 |
Most occurring characters
Value | Count | Frequency (%) |
1247 | ||
e | 798 | 9.7% |
t | 710 | 8.7% |
i | 553 | 6.8% |
c | 538 | 6.6% |
o | 523 | 6.4% |
a | 486 | 5.9% |
r | 485 | 5.9% |
n | 353 | 4.3% |
u | 320 | 3.9% |
Other values (18) | 2178 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 6650 | |
Space Separator | 1247 | 15.2% |
Other Punctuation | 184 | 2.2% |
Uppercase Letter | 110 | 1.3% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 798 | |
t | 710 | |
i | 553 | 8.3% |
c | 538 | 8.1% |
o | 523 | 7.9% |
a | 486 | 7.3% |
r | 485 | 7.3% |
n | 353 | 5.3% |
u | 320 | 4.8% |
h | 297 | 4.5% |
Other values (12) | 1587 |
Other Punctuation
Value | Count | Frequency (%) |
. | 110 | |
' | 49 | |
, | 25 | 13.6% |
Uppercase Letter
Value | Count | Frequency (%) |
Y | 104 | |
T | 6 | 5.5% |
Space Separator
Value | Count | Frequency (%) |
1247 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 6760 | |
Common | 1431 | 17.5% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 798 | |
t | 710 | |
i | 553 | 8.2% |
c | 538 | 8.0% |
o | 523 | 7.7% |
a | 486 | 7.2% |
r | 485 | 7.2% |
n | 353 | 5.2% |
u | 320 | 4.7% |
h | 297 | 4.4% |
Other values (14) | 1697 |
Common
Value | Count | Frequency (%) |
1247 | ||
. | 110 | 7.7% |
' | 49 | 3.4% |
, | 25 | 1.7% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 8191 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1247 | ||
e | 798 | 9.7% |
t | 710 | 8.7% |
i | 553 | 6.8% |
c | 538 | 6.6% |
o | 523 | 6.4% |
a | 486 | 5.9% |
r | 485 | 5.9% |
n | 353 | 4.3% |
u | 320 | 3.9% |
Other values (18) | 2178 |
charged_method_for_rent_for_electricity
Categorical
Imbalance  Missing 
Distinct | 6 |
---|---|
Distinct (%) | 1.1% |
Missing | 3527 |
Missing (%) | 86.8% |
Memory size | 31.9 KiB |
You pay the full amount of the electricity bill. | |
---|---|
You don't pay the owner for electricity consumption. | 17 |
You pay a fixed amount to the owner every month for electricity. | 13 |
You pay a varied amount to the owner every month for electricity. The amount paid varies depending on the variance of the bill. | 6 |
You don't pay a specific amount for electricity, but pay a fixed amount for all the utilities such as electricity, water etc. | 3 |
Length
Max length | 197 |
---|---|
Median length | 48 |
Mean length | 50.108209 |
Min length | 48 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | 0.2% |
Sample
1st row | You pay the full amount of the electricity bill. |
---|---|
2nd row | You pay the full amount of the electricity bill. |
3rd row | You don't pay the owner for electricity consumption. |
4th row | You pay the full amount of the electricity bill. |
5th row | You pay the full amount of the electricity bill. |
Common Values
Value | Count | Frequency (%) |
You pay the full amount of the electricity bill. | 496 | 12.2% |
You don't pay the owner for electricity consumption. | 17 | 0.4% |
You pay a fixed amount to the owner every month for electricity. | 13 | 0.3% |
You pay a varied amount to the owner every month for electricity. The amount paid varies depending on the variance of the bill. | 6 | 0.1% |
You don't pay a specific amount for electricity, but pay a fixed amount for all the utilities such as electricity, water etc. | 3 | 0.1% |
You don't pay a specific amount for electricity, but pay a varied amount for all the utilities such as electricity, water etc. The amount paid varies depending on the variance of the utility bills. | 1 | < 0.1% |
(Missing) | 3527 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
the | 1053 | |
electricity | 540 | |
pay | 540 | |
you | 536 | |
amount | 530 | |
of | 503 | |
bill | 502 | |
full | 496 | |
for | 44 | 0.9% |
owner | 36 | 0.7% |
Other values (23) | 214 | 4.3% |
Most occurring characters
Value | Count | Frequency (%) |
4458 | ||
t | 2754 | |
l | 2551 | 9.5% |
e | 2274 | 8.5% |
o | 1749 | 6.5% |
i | 1673 | 6.2% |
u | 1592 | 5.9% |
a | 1144 | 4.3% |
c | 1120 | 4.2% |
y | 1100 | 4.1% |
Other values (18) | 6443 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 21285 | |
Space Separator | 4458 | 16.6% |
Other Punctuation | 572 | 2.1% |
Uppercase Letter | 543 | 2.0% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
t | 2754 | |
l | 2551 | |
e | 2274 | |
o | 1749 | |
i | 1673 | 7.9% |
u | 1592 | 7.5% |
a | 1144 | 5.4% |
c | 1120 | 5.3% |
y | 1100 | 5.2% |
h | 1076 | 5.1% |
Other values (12) | 4252 |
Other Punctuation
Value | Count | Frequency (%) |
. | 543 | |
' | 21 | 3.7% |
, | 8 | 1.4% |
Uppercase Letter
Value | Count | Frequency (%) |
Y | 536 | |
T | 7 | 1.3% |
Space Separator
Value | Count | Frequency (%) |
4458 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 21828 | |
Common | 5030 | 18.7% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
t | 2754 | |
l | 2551 | |
e | 2274 | |
o | 1749 | 8.0% |
i | 1673 | 7.7% |
u | 1592 | 7.3% |
a | 1144 | 5.2% |
c | 1120 | 5.1% |
y | 1100 | 5.0% |
h | 1076 | 4.9% |
Other values (14) | 4795 |
Common
Value | Count | Frequency (%) |
4458 | ||
. | 543 | 10.8% |
' | 21 | 0.4% |
, | 8 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 26858 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
4458 | ||
t | 2754 | |
l | 2551 | 9.5% |
e | 2274 | 8.5% |
o | 1749 | 6.5% |
i | 1673 | 6.2% |
u | 1592 | 5.9% |
a | 1144 | 4.3% |
c | 1120 | 4.2% |
y | 1100 | 4.1% |
Other values (18) | 6443 |
is_there_business_carried_out_in_the_household
Boolean
High correlation  Imbalance 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.1 KiB |
False | |
---|---|
True | 186 |
Value | Count | Frequency (%) |
False | 3877 | |
True | 186 | 4.6% |
type_of_business
Categorical
High correlation  Missing 
Distinct | 3 |
---|---|
Distinct (%) | 1.6% |
Missing | 3877 |
Missing (%) | 95.4% |
Memory size | 31.9 KiB |
Other | |
---|---|
A shop | |
A communication | 2 |
Common Values
Value | Count | Frequency (%) |
Other | 121 | 3.0% |
A shop | 63 | 1.6% |
A communication | 2 | < 0.1% |
(Missing) | 3877 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
other | 121 | |
a | 65 | |
shop | 63 | |
communication | 2 | 0.8% |
Most occurring characters
Value | Count | Frequency (%) |
h | 184 | |
t | 123 | |
O | 121 | |
e | 121 | |
r | 121 | |
o | 67 | 6.6% |
65 | 6.4% | |
A | 65 | 6.4% |
s | 63 | 6.2% |
p | 63 | 6.2% |
Other values (6) | 20 | 2.0% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 762 | |
Uppercase Letter | 186 | 18.4% |
Space Separator | 65 | 6.4% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
h | 184 | |
t | 123 | |
e | 121 | |
r | 121 | |
o | 67 | 8.8% |
s | 63 | 8.3% |
p | 63 | 8.3% |
c | 4 | 0.5% |
m | 4 | 0.5% |
n | 4 | 0.5% |
Other values (3) | 8 | 1.0% |
Uppercase Letter
Value | Count | Frequency (%) |
O | 121 | |
A | 65 |
Space Separator
Value | Count | Frequency (%) |
65 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 948 | |
Common | 65 | 6.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
h | 184 | |
t | 123 | |
O | 121 | |
e | 121 | |
r | 121 | |
o | 67 | 7.1% |
A | 65 | 6.9% |
s | 63 | 6.6% |
p | 63 | 6.6% |
c | 4 | 0.4% |
Other values (5) | 16 | 1.7% |
Common
Value | Count | Frequency (%) |
65 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1013 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
h | 184 | |
t | 123 | |
O | 121 | |
e | 121 | |
r | 121 | |
o | 67 | 6.6% |
65 | 6.4% | |
A | 65 | 6.4% |
s | 63 | 6.2% |
p | 63 | 6.2% |
Other values (6) | 20 | 2.0% |
whom_or_how_the_house_was_designed
Categorical
Missing 
Distinct | 6 |
---|---|
Distinct (%) | 0.2% |
Missing | 1280 |
Missing (%) | 31.5% |
Memory size | 31.9 KiB |
The house is designed by a certified architect. | |
---|---|
I am not aware of that. | |
The house plan is not done by an architect, nor checked by a certified architect or engineer. The house was not designed keeping in mind the legal requirements of the local authorities. The house was designed only to suit our needs. | |
The house plan is not done by an architect, nor checked by a certified architect / engineer, the house is designed to barely pass the legal requirements of the local authorities. | 125 |
The house plan was not done by an architect but checked by a certified architect or engineer. | 117 |
Length
Max length | 232 |
---|---|
Median length | 178 |
Mean length | 72.238951 |
Min length | 23 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | The house is designed by a certified architect. |
---|---|
2nd row | This is a house provided by the government. |
3rd row | The house is designed by a certified architect. |
4th row | The house is designed by a certified architect. |
5th row | The house is designed by a certified architect. |
Common Values
Value | Count | Frequency (%) |
The house is designed by a certified architect. | 1389 | |
I am not aware of that. | 738 | |
The house plan is not done by an architect, nor checked by a certified architect or engineer. The house was not designed keeping in mind the legal requirements of the local authorities. The house was designed only to suit our needs. | 359 | 8.8% |
The house plan is not done by an architect, nor checked by a certified architect / engineer, the house is designed to barely pass the legal requirements of the local authorities. | 125 | 3.1% |
The house plan was not done by an architect but checked by a certified architect or engineer. | 117 | 2.9% |
This is a house provided by the government. | 55 | 1.4% |
(Missing) | 1280 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
the | 3856 | 10.5% |
house | 2888 | 7.9% |
by | 2646 | 7.2% |
architect | 2591 | 7.1% |
designed | 2232 | 6.1% |
is | 2053 | 5.6% |
a | 2045 | 5.6% |
certified | 1990 | 5.4% |
not | 1698 | 4.6% |
of | 1222 | 3.3% |
Other values (31) | 13342 |
Most occurring characters
Value | Count | Frequency (%) |
33780 | ||
e | 26269 | |
i | 14455 | 7.2% |
t | 13961 | 6.9% |
a | 11327 | 5.6% |
h | 11213 | 5.6% |
s | 9999 | 5.0% |
n | 9808 | 4.9% |
o | 9649 | 4.8% |
r | 8926 | 4.4% |
Other values (19) | 51654 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 159525 | |
Space Separator | 33780 | 16.8% |
Other Punctuation | 4235 | 2.1% |
Uppercase Letter | 3501 | 1.7% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 26269 | |
i | 14455 | |
t | 13961 | 8.8% |
a | 11327 | 7.1% |
h | 11213 | 7.0% |
s | 9999 | 6.3% |
n | 9808 | 6.1% |
o | 9649 | 6.0% |
r | 8926 | 5.6% |
c | 8858 | 5.6% |
Other values (13) | 35060 |
Other Punctuation
Value | Count | Frequency (%) |
. | 3501 | |
, | 609 | 14.4% |
/ | 125 | 3.0% |
Uppercase Letter
Value | Count | Frequency (%) |
T | 2763 | |
I | 738 | 21.1% |
Space Separator
Value | Count | Frequency (%) |
33780 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 163026 | |
Common | 38015 | 18.9% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 26269 | |
i | 14455 | 8.9% |
t | 13961 | 8.6% |
a | 11327 | 6.9% |
h | 11213 | 6.9% |
s | 9999 | 6.1% |
n | 9808 | 6.0% |
o | 9649 | 5.9% |
r | 8926 | 5.5% |
c | 8858 | 5.4% |
Other values (15) | 38561 |
Common
Value | Count | Frequency (%) |
33780 | ||
. | 3501 | 9.2% |
, | 609 | 1.6% |
/ | 125 | 0.3% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 201041 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
33780 | ||
e | 26269 | |
i | 14455 | 7.2% |
t | 13961 | 6.9% |
a | 11327 | 5.6% |
h | 11213 | 5.6% |
s | 9999 | 5.0% |
n | 9808 | 4.9% |
o | 9649 | 4.8% |
r | 8926 | 4.4% |
Other values (19) | 51654 |
availability_of_certificate_of_compliance
Categorical
Missing 
Distinct | 3 |
---|---|
Distinct (%) | 0.1% |
Missing | 1280 |
Missing (%) | 31.5% |
Memory size | 31.9 KiB |
Yes | |
---|---|
Don't know | |
No |
Common Values
Value | Count | Frequency (%) |
Yes | 1268 | |
Don't know | 851 | |
No | 664 | |
(Missing) | 1280 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
yes | 1268 | |
don't | 851 | |
know | 851 | |
no | 664 |
Most occurring characters
Value | Count | Frequency (%) |
o | 2366 | |
n | 1702 | |
Y | 1268 | |
e | 1268 | |
s | 1268 | |
D | 851 | 6.2% |
' | 851 | 6.2% |
t | 851 | 6.2% |
851 | 6.2% | |
k | 851 | 6.2% |
Other values (2) | 1515 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 9157 | |
Uppercase Letter | 2783 | 20.4% |
Other Punctuation | 851 | 6.2% |
Space Separator | 851 | 6.2% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 2366 | |
n | 1702 | |
e | 1268 | |
s | 1268 | |
t | 851 | 9.3% |
k | 851 | 9.3% |
w | 851 | 9.3% |
Uppercase Letter
Value | Count | Frequency (%) |
Y | 1268 | |
D | 851 | |
N | 664 |
Other Punctuation
Value | Count | Frequency (%) |
' | 851 |
Space Separator
Value | Count | Frequency (%) |
851 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 11940 | |
Common | 1702 | 12.5% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 2366 | |
n | 1702 | |
Y | 1268 | |
e | 1268 | |
s | 1268 | |
D | 851 | 7.1% |
t | 851 | 7.1% |
k | 851 | 7.1% |
w | 851 | 7.1% |
N | 664 | 5.6% |
Common
Value | Count | Frequency (%) |
' | 851 | |
851 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 13642 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
o | 2366 | |
n | 1702 | |
Y | 1268 | |
e | 1268 | |
s | 1268 | |
D | 851 | 6.2% |
' | 851 | 6.2% |
t | 851 | 6.2% |
851 | 6.2% | |
k | 851 | 6.2% |
Other values (2) | 1515 |
main_material_used_for_walls_of_the_house
Categorical
Distinct | 10 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.9 KiB |
Cement Block | |
---|---|
Brick | |
I am not aware of that | |
Cabook | |
Wood / Takaran / Asbestos | 49 |
Other values (5) | 66 |
Length
Max length | 25 |
---|---|
Median length | 22 |
Mean length | 9.8624169 |
Min length | 3 |
Unique
Unique | 1 ? |
---|---|
Unique (%) | < 0.1% |
Sample
1st row | Brick |
---|---|
2nd row | Brick |
3rd row | Brick |
4th row | Cement Block |
5th row | I am not aware of that |
Common Values
Value | Count | Frequency (%) |
Cement Block | 1837 | |
Brick | 1621 | |
I am not aware of that | 299 | 7.4% |
Cabook | 191 | 4.7% |
Wood / Takaran / Asbestos | 49 | 1.2% |
Pressed soil blocks | 29 | 0.7% |
Stones/Cube stones | 19 | 0.5% |
Other | 9 | 0.2% |
Mud | 8 | 0.2% |
Metal Sheet | 1 | < 0.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
cement | 1837 | |
block | 1837 | |
brick | 1621 | |
i | 299 | 3.9% |
am | 299 | 3.9% |
not | 299 | 3.9% |
aware | 299 | 3.9% |
of | 299 | 3.9% |
that | 299 | 3.9% |
cabook | 191 | 2.5% |
Other values (13) | 389 | 5.1% |
Most occurring characters
Value | Count | Frequency (%) |
e | 4149 | |
k | 3727 | 9.3% |
3606 | 9.0% | |
c | 3487 | 8.7% |
B | 3458 | 8.6% |
o | 3060 | 7.6% |
t | 2832 | 7.1% |
n | 2223 | 5.5% |
m | 2136 | 5.3% |
C | 2047 | 5.1% |
Other values (20) | 9346 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 30330 | |
Uppercase Letter | 6018 | 15.0% |
Space Separator | 3606 | 9.0% |
Other Punctuation | 117 | 0.3% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 4149 | |
k | 3727 | |
c | 3487 | |
o | 3060 | |
t | 2832 | |
n | 2223 | |
m | 2136 | |
r | 2007 | |
l | 1896 | |
i | 1650 | 5.4% |
Other values (8) | 3163 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 3458 | |
C | 2047 | |
I | 299 | 5.0% |
W | 49 | 0.8% |
T | 49 | 0.8% |
A | 49 | 0.8% |
P | 29 | 0.5% |
S | 20 | 0.3% |
O | 9 | 0.1% |
M | 9 | 0.1% |
Space Separator
Value | Count | Frequency (%) |
3606 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 117 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 36348 | |
Common | 3723 | 9.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 4149 | |
k | 3727 | |
c | 3487 | |
B | 3458 | |
o | 3060 | |
t | 2832 | 7.8% |
n | 2223 | 6.1% |
m | 2136 | 5.9% |
C | 2047 | 5.6% |
r | 2007 | 5.5% |
Other values (18) | 7222 |
Common
Value | Count | Frequency (%) |
3606 | ||
/ | 117 | 3.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 40071 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 4149 | |
k | 3727 | 9.3% |
3606 | 9.0% | |
c | 3487 | 8.7% |
B | 3458 | 8.6% |
o | 3060 | 7.6% |
t | 2832 | 7.1% |
n | 2223 | 5.5% |
m | 2136 | 5.3% |
C | 2047 | 5.1% |
Other values (20) | 9346 |
main_material_used_for_roof_of_the_house
Categorical
Imbalance  Missing 
Distinct | 8 |
---|---|
Distinct (%) | 0.3% |
Missing | 1280 |
Missing (%) | 31.5% |
Memory size | 31.9 KiB |
Asbestos | |
---|---|
Concrete | |
Tile | |
Takaran | 19 |
Metal Sheet | 14 |
Other values (3) | 14 |
Common Values
Value | Count | Frequency (%) |
Asbestos | 1665 | |
Concrete | 577 | 14.2% |
Tile | 494 | 12.2% |
Takaran | 19 | 0.5% |
Metal Sheet | 14 | 0.3% |
Other | 8 | 0.2% |
Plastic sheets | 5 | 0.1% |
Tent | 1 | < 0.1% |
(Missing) | 1280 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
asbestos | 1665 | |
concrete | 577 | 20.6% |
tile | 494 | 17.6% |
takaran | 19 | 0.7% |
metal | 14 | 0.5% |
sheet | 14 | 0.5% |
other | 8 | 0.3% |
plastic | 5 | 0.2% |
sheets | 5 | 0.2% |
tent | 1 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
s | 5010 | |
e | 3374 | |
t | 2289 | |
o | 2242 | |
A | 1665 | 8.2% |
b | 1665 | 8.2% |
r | 604 | 3.0% |
n | 597 | 2.9% |
c | 582 | 2.9% |
C | 577 | 2.8% |
Other values (11) | 1708 | 8.4% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 17497 | |
Uppercase Letter | 2797 | 13.8% |
Space Separator | 19 | 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
s | 5010 | |
e | 3374 | |
t | 2289 | |
o | 2242 | |
b | 1665 | 9.5% |
r | 604 | 3.5% |
n | 597 | 3.4% |
c | 582 | 3.3% |
l | 513 | 2.9% |
i | 499 | 2.9% |
Other values (3) | 122 | 0.7% |
Uppercase Letter
Value | Count | Frequency (%) |
A | 1665 | |
C | 577 | 20.6% |
T | 514 | 18.4% |
M | 14 | 0.5% |
S | 14 | 0.5% |
O | 8 | 0.3% |
P | 5 | 0.2% |
Space Separator
Value | Count | Frequency (%) |
19 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 20294 | |
Common | 19 | 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
s | 5010 | |
e | 3374 | |
t | 2289 | |
o | 2242 | |
A | 1665 | 8.2% |
b | 1665 | 8.2% |
r | 604 | 3.0% |
n | 597 | 2.9% |
c | 582 | 2.9% |
C | 577 | 2.8% |
Other values (10) | 1689 | 8.3% |
Common
Value | Count | Frequency (%) |
19 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 20313 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
s | 5010 | |
e | 3374 | |
t | 2289 | |
o | 2242 | |
A | 1665 | 8.2% |
b | 1665 | 8.2% |
r | 604 | 3.0% |
n | 597 | 2.9% |
c | 582 | 2.9% |
C | 577 | 2.8% |
Other values (11) | 1708 | 8.4% |
any_constructions_or_renovations_in_the_household
Boolean
Imbalance 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.1 KiB |
False | |
---|---|
True | 204 |
Value | Count | Frequency (%) |
False | 3859 | |
True | 204 | 5.0% |
highest_level_of_education_of_the_chief_wage_earner
Categorical
High correlation 
Distinct | 7 |
---|---|
Distinct (%) | 0.2% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.9 KiB |
O/L or A/L pending / Passed | |
---|---|
Schooling up to Grade 6 - 9 | |
Diploma with O/L or A/L (Non graduate) | |
Graduate / Post-Grads / Degree level professional qualification | |
Other professional certificates with O/L or A/L / Part qualification (Non graduate) | |
Other values (2) | 149 |
Length
Max length | 83 |
---|---|
Median length | 27 |
Mean length | 36.543441 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | O/L or A/L pending / Passed |
---|---|
2nd row | Diploma with O/L or A/L (Non graduate) |
3rd row | Graduate / Post-Grads / Degree level professional qualification |
4th row | Other professional certificates with O/L or A/L / Part qualification (Non graduate) |
5th row | Schooling up to Grade 6 - 9 |
Common Values
Value | Count | Frequency (%) |
O/L or A/L pending / Passed | 1879 | |
Schooling up to Grade 6 - 9 | 672 | 16.5% |
Diploma with O/L or A/L (Non graduate) | 563 | 13.9% |
Graduate / Post-Grads / Degree level professional qualification | 528 | 13.0% |
Other professional certificates with O/L or A/L / Part qualification (Non graduate) | 272 | 6.7% |
Primary Education | 125 | 3.1% |
Illiterate | 24 | 0.6% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
3879 | ||
or | 2714 | 9.8% |
o/l | 2714 | 9.8% |
a/l | 2714 | 9.8% |
pending | 1879 | 6.8% |
passed | 1879 | 6.8% |
graduate | 1363 | 4.9% |
non | 835 | 3.0% |
with | 835 | 3.0% |
professional | 800 | 2.9% |
Other values (17) | 8069 |
Most occurring characters
Value | Count | Frequency (%) |
23618 | ||
e | 10097 | 6.8% |
a | 9586 | 6.5% |
o | 9181 | 6.2% |
/ | 8635 | 5.8% |
i | 7967 | 5.4% |
r | 7695 | 5.2% |
n | 6990 | 4.7% |
s | 6686 | 4.5% |
d | 6446 | 4.3% |
Other values (28) | 51575 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 93602 | |
Space Separator | 23618 | 15.9% |
Uppercase Letter | 18407 | 12.4% |
Other Punctuation | 8635 | 5.8% |
Decimal Number | 1344 | 0.9% |
Dash Punctuation | 1200 | 0.8% |
Open Punctuation | 835 | 0.6% |
Close Punctuation | 835 | 0.6% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 10097 | |
a | 9586 | |
o | 9181 | |
i | 7967 | |
r | 7695 | 8.2% |
n | 6990 | 7.5% |
s | 6686 | 7.1% |
d | 6446 | 6.9% |
t | 5459 | 5.8% |
l | 3939 | 4.2% |
Other values (11) | 19556 |
Uppercase Letter
Value | Count | Frequency (%) |
L | 5428 | |
O | 2986 | |
P | 2804 | |
A | 2714 | |
G | 1728 | 9.4% |
D | 1091 | 5.9% |
N | 835 | 4.5% |
S | 672 | 3.7% |
E | 125 | 0.7% |
I | 24 | 0.1% |
Decimal Number
Value | Count | Frequency (%) |
6 | 672 | |
9 | 672 |
Space Separator
Value | Count | Frequency (%) |
23618 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 8635 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 1200 |
Open Punctuation
Value | Count | Frequency (%) |
( | 835 |
Close Punctuation
Value | Count | Frequency (%) |
) | 835 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 112009 | |
Common | 36467 | 24.6% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 10097 | 9.0% |
a | 9586 | 8.6% |
o | 9181 | 8.2% |
i | 7967 | 7.1% |
r | 7695 | 6.9% |
n | 6990 | 6.2% |
s | 6686 | 6.0% |
d | 6446 | 5.8% |
t | 5459 | 4.9% |
L | 5428 | 4.8% |
Other values (21) | 36474 |
Common
Value | Count | Frequency (%) |
23618 | ||
/ | 8635 | 23.7% |
- | 1200 | 3.3% |
( | 835 | 2.3% |
) | 835 | 2.3% |
6 | 672 | 1.8% |
9 | 672 | 1.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 148476 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
23618 | ||
e | 10097 | 6.8% |
a | 9586 | 6.5% |
o | 9181 | 6.2% |
/ | 8635 | 5.8% |
i | 7967 | 5.4% |
r | 7695 | 5.2% |
n | 6990 | 4.7% |
s | 6686 | 4.5% |
d | 6446 | 4.3% |
Other values (28) | 51575 |
occupation_of_the_chief_wage_earner
Categorical
High correlation 
Distinct | 19 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.9 KiB |
Skilled Worker | |
---|---|
Small Businessman / Self employed (Non professional) | |
Manager / Professional | |
Clerk / Salesman grades | |
Unskilled Worker | |
Other values (14) |
Length
Max length | 52 |
---|---|
Median length | 43 |
Mean length | 24.075068 |
Min length | 12 |
Unique
Unique | 3 ? |
---|---|
Unique (%) | 0.1% |
Sample
1st row | Skilled Worker |
---|---|
2nd row | Unskilled Worker |
3rd row | Middle and Senior executive |
4th row | 1-9 Employed |
5th row | Skilled Worker |
Common Values
Value | Count | Frequency (%) |
Skilled Worker | 1239 | |
Small Businessman / Self employed (Non professional) | 483 | 11.9% |
Manager / Professional | 452 | 11.1% |
Clerk / Salesman grades | 398 | 9.8% |
Unskilled Worker | 334 | 8.2% |
Self employed (Professional) - No employees | 262 | 6.4% |
Junior executive / Executive | 251 | 6.2% |
Middle and Senior executive | 237 | 5.8% |
1-9 Employed | 160 | 3.9% |
Supervisor grades | 153 | 3.8% |
Other values (9) | 94 | 2.3% |
Length
Value | Count | Frequency (%) |
1879 | ||
worker | 1589 | 11.4% |
skilled | 1239 | 8.9% |
professional | 1197 | 8.6% |
employed | 921 | 6.6% |
self | 745 | 5.4% |
executive | 739 | 5.3% |
grades | 551 | 4.0% |
non | 483 | 3.5% |
small | 483 | 3.5% |
Other values (34) | 4068 |
Most occurring characters
Value | Count | Frequency (%) |
e | 12577 | |
9831 | 10.1% | |
l | 8319 | 8.5% |
r | 6720 | 6.9% |
o | 6687 | 6.8% |
s | 5545 | 5.7% |
i | 4950 | 5.1% |
a | 4698 | 4.8% |
n | 4629 | 4.7% |
d | 3760 | 3.8% |
Other values (42) | 30101 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 75099 | |
Space Separator | 9831 | 10.1% |
Uppercase Letter | 8974 | 9.2% |
Other Punctuation | 1607 | 1.6% |
Close Punctuation | 745 | 0.8% |
Open Punctuation | 745 | 0.8% |
Dash Punctuation | 435 | 0.4% |
Decimal Number | 362 | 0.4% |
Math Symbol | 16 | < 0.1% |
Other Number | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
e | 12577 | |
l | 8319 | |
r | 6720 | |
o | 6687 | |
s | 5545 | |
i | 4950 | 6.6% |
a | 4698 | 6.3% |
n | 4629 | 6.2% |
d | 3760 | 5.0% |
k | 3560 | 4.7% |
Other values (14) | 13654 |
Uppercase Letter
Value | Count | Frequency (%) |
S | 3255 | |
W | 1589 | |
N | 745 | 8.3% |
P | 714 | 8.0% |
M | 689 | 7.7% |
B | 536 | 6.0% |
E | 427 | 4.8% |
C | 398 | 4.4% |
U | 334 | 3.7% |
J | 251 | 2.8% |
Other values (5) | 36 | 0.4% |
Decimal Number
Value | Count | Frequency (%) |
1 | 180 | |
9 | 160 | |
0 | 16 | 4.4% |
2 | 3 | 0.8% |
5 | 3 | 0.8% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 432 | |
– | 3 | 0.7% |
Space Separator
Value | Count | Frequency (%) |
9831 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 1607 |
Close Punctuation
Value | Count | Frequency (%) |
) | 745 |
Open Punctuation
Value | Count | Frequency (%) |
( | 745 |
Math Symbol
Value | Count | Frequency (%) |
+ | 16 |
Other Number
Value | Count | Frequency (%) |
½ | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 84073 | |
Common | 13744 | 14.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
e | 12577 | |
l | 8319 | 9.9% |
r | 6720 | 8.0% |
o | 6687 | 8.0% |
s | 5545 | 6.6% |
i | 4950 | 5.9% |
a | 4698 | 5.6% |
n | 4629 | 5.5% |
d | 3760 | 4.5% |
k | 3560 | 4.2% |
Other values (29) | 22628 |
Common
Value | Count | Frequency (%) |
9831 | ||
/ | 1607 | 11.7% |
) | 745 | 5.4% |
( | 745 | 5.4% |
- | 432 | 3.1% |
1 | 180 | 1.3% |
9 | 160 | 1.2% |
0 | 16 | 0.1% |
+ | 16 | 0.1% |
– | 3 | < 0.1% |
Other values (3) | 9 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 97811 | |
Punctuation | 3 | < 0.1% |
None | 3 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
e | 12577 | |
9831 | 10.1% | |
l | 8319 | 8.5% |
r | 6720 | 6.9% |
o | 6687 | 6.8% |
s | 5545 | 5.7% |
i | 4950 | 5.1% |
a | 4698 | 4.8% |
n | 4629 | 4.7% |
d | 3760 | 3.8% |
Other values (40) | 30095 |
Punctuation
Value | Count | Frequency (%) |
– | 3 |
None
Value | Count | Frequency (%) |
½ | 3 |
socio_economic_class
Categorical
High correlation 
Distinct | 5 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.9 KiB |
SEC C | |
---|---|
SEC B | |
SEC A | |
SEC D | |
SEC E |
Common Values
Value | Count | Frequency (%) |
SEC C | 1485 | |
SEC B | 868 | |
SEC A | 786 | |
SEC D | 669 | |
SEC E | 255 | 6.3% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
sec | 4063 | |
c | 1485 | 18.3% |
b | 868 | 10.7% |
a | 786 | 9.7% |
d | 669 | 8.2% |
e | 255 | 3.1% |
Most occurring characters
Value | Count | Frequency (%) |
C | 5548 | |
E | 4318 | |
S | 4063 | |
4063 | ||
B | 868 | 4.3% |
A | 786 | 3.9% |
D | 669 | 3.3% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 16252 | |
Space Separator | 4063 | 20.0% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
C | 5548 | |
E | 4318 | |
S | 4063 | |
B | 868 | 5.3% |
A | 786 | 4.8% |
D | 669 | 4.1% |
Space Separator
Value | Count | Frequency (%) |
4063 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 16252 | |
Common | 4063 | 20.0% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
C | 5548 | |
E | 4318 | |
S | 4063 | |
B | 868 | 5.3% |
A | 786 | 4.8% |
D | 669 | 4.1% |
Common
Value | Count | Frequency (%) |
4063 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 20315 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
C | 5548 | |
E | 4318 | |
S | 4063 | |
4063 | ||
B | 868 | 4.3% |
A | 786 | 3.9% |
D | 669 | 3.3% |
total_monthly_expenditure_of_last_month
Real number (ℝ)
Missing 
Distinct | 85 |
---|---|
Distinct (%) | 2.2% |
Missing | 135 |
Missing (%) | 3.3% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 71327.253 |
Minimum | 5000 |
---|---|
Maximum | 275000 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 31.9 KiB |
Quantile statistics
Minimum | 5000 |
---|---|
5-th percentile | 20000 |
Q1 | 40000 |
median | 60000 |
Q3 | 100000 |
95-th percentile | 150000 |
Maximum | 275000 |
Range | 270000 |
Interquartile range (IQR) | 60000 |
Descriptive statistics
Standard deviation | 44311.381 |
---|---|
Coefficient of variation (CV) | 0.62124054 |
Kurtosis | 2.7624431 |
Mean | 71327.253 |
Median Absolute Deviation (MAD) | 20000 |
Skewness | 1.5241384 |
Sum | 2.8017345 × 108 |
Variance | 1.9634985 × 109 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
50000 | 535 | |
100000 | 498 | |
60000 | 417 | |
40000 | 294 | 7.2% |
30000 | 249 | 6.1% |
80000 | 223 | 5.5% |
70000 | 218 | 5.4% |
150000 | 191 | 4.7% |
75000 | 158 | 3.9% |
35000 | 127 | 3.1% |
Other values (75) | 1018 | |
(Missing) | 135 | 3.3% |
Value | Count | Frequency (%) |
5000 | 6 | 0.1% |
6000 | 2 | < 0.1% |
7000 | 1 | < 0.1% |
7500 | 2 | < 0.1% |
8000 | 3 | 0.1% |
10000 | 36 | |
11000 | 1 | < 0.1% |
12000 | 10 | 0.2% |
13000 | 1 | < 0.1% |
15000 | 64 |
Value | Count | Frequency (%) |
275000 | 2 | < 0.1% |
270000 | 1 | < 0.1% |
250000 | 31 | 0.8% |
230000 | 3 | 0.1% |
225000 | 2 | < 0.1% |
220000 | 1 | < 0.1% |
215000 | 1 | < 0.1% |
200000 | 113 | |
180000 | 6 | 0.1% |
175000 | 6 | 0.1% |
type_of_electricity_meter
Categorical
High correlation 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.9 KiB |
Smart meter | |
---|---|
Non smart meter |
Length
Max length | 15 |
---|---|
Median length | 11 |
Mean length | 12.847896 |
Min length | 11 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Non smart meter |
---|---|
2nd row | Non smart meter |
3rd row | Smart meter |
4th row | Smart meter |
5th row | Smart meter |
Common Values
Value | Count | Frequency (%) |
Smart meter | 2186 | |
Non smart meter | 1877 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
smart | 4063 | |
meter | 4063 | |
non | 1877 |
Most occurring characters
Value | Count | Frequency (%) |
m | 8126 | |
t | 8126 | |
r | 8126 | |
e | 8126 | |
5940 | ||
a | 4063 | |
S | 2186 | 4.2% |
N | 1877 | 3.6% |
o | 1877 | 3.6% |
n | 1877 | 3.6% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 42198 | |
Space Separator | 5940 | 11.4% |
Uppercase Letter | 4063 | 7.8% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
m | 8126 | |
t | 8126 | |
r | 8126 | |
e | 8126 | |
a | 4063 | |
o | 1877 | 4.4% |
n | 1877 | 4.4% |
s | 1877 | 4.4% |
Uppercase Letter
Value | Count | Frequency (%) |
S | 2186 | |
N | 1877 |
Space Separator
Value | Count | Frequency (%) |
5940 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 46261 | |
Common | 5940 | 11.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
m | 8126 | |
t | 8126 | |
r | 8126 | |
e | 8126 | |
a | 4063 | |
S | 2186 | 4.7% |
N | 1877 | 4.1% |
o | 1877 | 4.1% |
n | 1877 | 4.1% |
s | 1877 | 4.1% |
Common
Value | Count | Frequency (%) |
5940 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 52201 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
m | 8126 | |
t | 8126 | |
r | 8126 | |
e | 8126 | |
5940 | ||
a | 4063 | |
S | 2186 | 4.2% |
N | 1877 | 3.6% |
o | 1877 | 3.6% |
n | 1877 | 3.6% |
Interactions
Correlations
any_constructions_or_renovations_in_the_household | availability_of_certificate_of_compliance | built_year_of_the_house | charged_method_for_rent_for_electricity | charging_method_of_renters_for_electricity | electricity_provider_csc_area | floor_area | floor_which_house_located | highest_level_of_education_of_the_chief_wage_earner | is_there_business_carried_out_in_the_household | main_material_used_for_roof_of_the_house | main_material_used_for_walls_of_the_house | no_of_electricity_meters | no_of_household_members | no_of_storeys | occupation_of_the_chief_wage_earner | occupy_renters_boarders | own_the_house_or_living_on_rent | socio_economic_class | total_monthly_expenditure_of_last_month | type_of_business | type_of_electricity_meter | type_of_house | whom_or_how_the_house_was_designed | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
any_constructions_or_renovations_in_the_household | 1.000 | 0.064 | 0.150 | 0.000 | 0.150 | 0.062 | 0.058 | 0.000 | 0.000 | 0.041 | 0.071 | 0.063 | 0.000 | 0.000 | 0.000 | 0.036 | 0.009 | 0.078 | 0.000 | 0.035 | 0.000 | 0.000 | 0.047 | 0.101 |
availability_of_certificate_of_compliance | 0.064 | 1.000 | 0.285 | 0.182 | 0.000 | 0.240 | 0.213 | 0.279 | 0.213 | 0.005 | 0.117 | 0.268 | 0.078 | 0.022 | 0.238 | 0.197 | 0.041 | 0.278 | 0.217 | 0.154 | 0.000 | 0.125 | 0.228 | 0.435 |
built_year_of_the_house | 0.150 | 0.285 | 1.000 | 0.000 | 0.020 | 0.079 | 0.037 | 0.068 | 0.036 | 0.039 | 0.139 | 0.290 | 0.000 | 0.048 | 0.172 | 0.034 | 0.012 | 0.442 | 0.023 | 0.036 | 0.093 | 0.075 | 0.146 | 0.221 |
charged_method_for_rent_for_electricity | 0.000 | 0.182 | 0.000 | 1.000 | 0.000 | 0.076 | 0.032 | 0.000 | 0.000 | 0.000 | 0.000 | 0.129 | 0.000 | 0.115 | 0.107 | 0.000 | 0.000 | 0.267 | 0.030 | 0.000 | 0.000 | 0.047 | 0.000 | 0.000 |
charging_method_of_renters_for_electricity | 0.150 | 0.000 | 0.020 | 0.000 | 1.000 | 0.171 | 0.000 | 0.000 | 0.156 | 0.075 | 0.000 | 0.131 | 0.000 | 0.163 | 0.577 | 0.013 | 0.361 | 1.000 | 0.103 | 0.043 | 0.000 | 0.000 | 0.200 | 0.000 |
electricity_provider_csc_area | 0.062 | 0.240 | 0.079 | 0.076 | 0.171 | 1.000 | 0.144 | 0.000 | 0.177 | 0.028 | 0.158 | 0.125 | 0.057 | 0.058 | 0.193 | 0.105 | 0.075 | 0.108 | 0.177 | 0.089 | 0.172 | 0.502 | 0.124 | 0.181 |
floor_area | 0.058 | 0.213 | 0.037 | 0.032 | 0.000 | 0.144 | 1.000 | 0.114 | 0.140 | 0.000 | 0.027 | 0.097 | 0.087 | 0.026 | 0.403 | 0.123 | 0.000 | 0.030 | 0.176 | 0.278 | 0.030 | 0.155 | 0.141 | 0.150 |
floor_which_house_located | 0.000 | 0.279 | 0.068 | 0.000 | 0.000 | 0.000 | 0.114 | 1.000 | 0.154 | 0.000 | 0.000 | 0.000 | 0.014 | 0.039 | 0.015 | 0.137 | 1.000 | 0.000 | 0.116 | 0.073 | NaN | 0.118 | 0.373 | 0.000 |
highest_level_of_education_of_the_chief_wage_earner | 0.000 | 0.213 | 0.036 | 0.000 | 0.156 | 0.177 | 0.140 | 0.154 | 1.000 | 0.042 | 0.049 | 0.101 | 0.021 | 0.048 | 0.159 | 0.306 | 0.000 | 0.036 | 0.650 | 0.155 | 0.159 | 0.176 | 0.138 | 0.156 |
is_there_business_carried_out_in_the_household | 0.041 | 0.005 | 0.039 | 0.000 | 0.075 | 0.028 | 0.000 | 0.000 | 0.042 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.174 | 0.000 | 0.000 | 0.052 | 0.000 | 1.000 | 0.008 | 0.000 | 0.000 |
main_material_used_for_roof_of_the_house | 0.071 | 0.117 | 0.139 | 0.000 | 0.000 | 0.158 | 0.027 | 0.000 | 0.049 | 0.000 | 1.000 | 0.168 | 0.103 | 0.000 | 0.181 | 0.028 | 0.106 | 0.000 | 0.057 | 0.000 | 0.000 | 0.160 | 0.151 | 0.066 |
main_material_used_for_walls_of_the_house | 0.063 | 0.268 | 0.290 | 0.129 | 0.131 | 0.125 | 0.097 | 0.000 | 0.101 | 0.000 | 0.168 | 1.000 | 0.000 | 0.012 | 0.262 | 0.085 | 0.000 | 0.272 | 0.134 | 0.053 | 0.000 | 0.086 | 0.151 | 0.222 |
no_of_electricity_meters | 0.000 | 0.078 | 0.000 | 0.000 | 0.000 | 0.057 | 0.087 | 0.014 | 0.021 | 0.000 | 0.103 | 0.000 | 1.000 | -0.035 | 0.100 | 0.070 | 0.000 | 0.000 | 0.048 | 0.048 | 0.051 | 0.101 | 0.158 | 0.063 |
no_of_household_members | 0.000 | 0.022 | 0.048 | 0.115 | 0.163 | 0.058 | 0.026 | 0.039 | 0.048 | 0.000 | 0.000 | 0.012 | -0.035 | 1.000 | 0.018 | 0.029 | 0.098 | 0.021 | 0.063 | 0.275 | 0.000 | 0.000 | 0.045 | 0.029 |
no_of_storeys | 0.000 | 0.238 | 0.172 | 0.107 | 0.577 | 0.193 | 0.403 | 0.015 | 0.159 | 0.000 | 0.181 | 0.262 | 0.100 | 0.018 | 1.000 | 0.168 | 0.000 | 0.114 | 0.157 | 0.388 | 1.000 | 0.138 | 0.352 | 0.327 |
occupation_of_the_chief_wage_earner | 0.036 | 0.197 | 0.034 | 0.000 | 0.013 | 0.105 | 0.123 | 0.137 | 0.306 | 0.174 | 0.028 | 0.085 | 0.070 | 0.029 | 0.168 | 1.000 | 0.000 | 0.033 | 0.625 | 0.139 | 0.242 | 0.167 | 0.115 | 0.124 |
occupy_renters_boarders | 0.009 | 0.041 | 0.012 | 0.000 | 0.361 | 0.075 | 0.000 | 1.000 | 0.000 | 0.000 | 0.106 | 0.000 | 0.000 | 0.098 | 0.000 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
own_the_house_or_living_on_rent | 0.078 | 0.278 | 0.442 | 0.267 | 1.000 | 0.108 | 0.030 | 0.000 | 0.036 | 0.000 | 0.000 | 0.272 | 0.000 | 0.021 | 0.114 | 0.033 | 1.000 | 1.000 | 0.039 | 0.003 | 0.000 | 0.033 | 0.063 | 0.261 |
socio_economic_class | 0.000 | 0.217 | 0.023 | 0.030 | 0.103 | 0.177 | 0.176 | 0.116 | 0.650 | 0.052 | 0.057 | 0.134 | 0.048 | 0.063 | 0.157 | 0.625 | 0.000 | 0.039 | 1.000 | 0.218 | 0.128 | 0.192 | 0.184 | 0.168 |
total_monthly_expenditure_of_last_month | 0.035 | 0.154 | 0.036 | 0.000 | 0.043 | 0.089 | 0.278 | 0.073 | 0.155 | 0.000 | 0.000 | 0.053 | 0.048 | 0.275 | 0.388 | 0.139 | 0.000 | 0.003 | 0.218 | 1.000 | 0.000 | 0.148 | 0.086 | 0.103 |
type_of_business | 0.000 | 0.000 | 0.093 | 0.000 | 0.000 | 0.172 | 0.030 | NaN | 0.159 | 1.000 | 0.000 | 0.000 | 0.051 | 0.000 | 1.000 | 0.242 | 0.000 | 0.000 | 0.128 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
type_of_electricity_meter | 0.000 | 0.125 | 0.075 | 0.047 | 0.000 | 0.502 | 0.155 | 0.118 | 0.176 | 0.008 | 0.160 | 0.086 | 0.101 | 0.000 | 0.138 | 0.167 | 0.000 | 0.033 | 0.192 | 0.148 | 0.000 | 1.000 | 0.220 | 0.101 |
type_of_house | 0.047 | 0.228 | 0.146 | 0.000 | 0.200 | 0.124 | 0.141 | 0.373 | 0.138 | 0.000 | 0.151 | 0.151 | 0.158 | 0.045 | 0.352 | 0.115 | 0.000 | 0.063 | 0.184 | 0.086 | 0.000 | 0.220 | 1.000 | 0.211 |
whom_or_how_the_house_was_designed | 0.101 | 0.435 | 0.221 | 0.000 | 0.000 | 0.181 | 0.150 | 0.000 | 0.156 | 0.000 | 0.066 | 0.222 | 0.063 | 0.029 | 0.327 | 0.124 | 0.000 | 0.261 | 0.168 | 0.103 | 0.000 | 0.101 | 0.211 | 1.000 |
Missing values
Sample
household_ID | no_of_electricity_meters | electricity_provider_csc_area | own_the_house_or_living_on_rent | occupy_renters_boarders | awareness_of_electricity_consumption_of_renters | built_year_of_the_house | type_of_house | floor_which_house_located | no_of_storeys | floor_area | no_of_household_members | charging_method_of_renters_for_electricity | charged_method_for_rent_for_electricity | is_there_business_carried_out_in_the_household | type_of_business | whom_or_how_the_house_was_designed | availability_of_certificate_of_compliance | main_material_used_for_walls_of_the_house | main_material_used_for_roof_of_the_house | any_constructions_or_renovations_in_the_household | highest_level_of_education_of_the_chief_wage_earner | occupation_of_the_chief_wage_earner | socio_economic_class | total_monthly_expenditure_of_last_month | type_of_electricity_meter | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | ID0001 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2000-2009 | Single House - Double Floor | NaN | NaN | 1500.0 | 4 | NaN | NaN | No | NaN | The house is designed by a certified architect. | No | Brick | Asbestos | No | O/L or A/L pending / Passed | Skilled Worker | SEC C | 35000.0 | Non smart meter |
1 | ID0002 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | Before 1980 | Single House - Single Floor | NaN | NaN | 440.0 | 3 | NaN | NaN | No | NaN | This is a house provided by the government. | Yes | Brick | Asbestos | No | Diploma with O/L or A/L (Non graduate) | Unskilled Worker | SEC D | 40000.0 | Non smart meter |
2 | ID0003 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 1980-1989 | Single House - Single Floor | NaN | NaN | 2500.0 | 4 | NaN | NaN | No | NaN | The house is designed by a certified architect. | No | Brick | Tile | No | Graduate / Post-Grads / Degree level professional qualification | Middle and Senior executive | SEC A | 250000.0 | Smart meter |
3 | ID0004 | 1 | BORALASGAMUWA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2010-2019 | Single House - Double Floor | NaN | NaN | 2600.0 | 4 | NaN | NaN | No | NaN | The house is designed by a certified architect. | No | Cement Block | Concrete | No | Other professional certificates with O/L or A/L / Part qualification (Non graduate) | 1-9 Employed | SEC A | 100000.0 | Smart meter |
4 | ID0005 | 1 | KOLONNAWA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2010-2019 | Flat | 10.0 | 1.0 | 480.0 | 2 | NaN | NaN | No | NaN | The house is designed by a certified architect. | Yes | I am not aware of that | Concrete | No | Schooling up to Grade 6 - 9 | Skilled Worker | SEC D | 60000.0 | Smart meter |
5 | ID0006 | 1 | KOLONNAWA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2010-2019 | Flat | 1.0 | 2.0 | 440.0 | 6 | NaN | NaN | No | NaN | This is a house provided by the government. | Yes | Cement Block | Concrete | No | Schooling up to Grade 6 - 9 | Unskilled Worker | SEC E | 100000.0 | Smart meter |
6 | ID0007 | 1 | KOLONNAWA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2010-2019 | Flat | 2.0 | 1.0 | 480.0 | 4 | NaN | NaN | No | NaN | This is a house provided by the government. | Yes | Cement Block | Concrete | No | O/L or A/L pending / Passed | Small Businessman / Self employed (Non professional) | SEC C | 60000.0 | Smart meter |
7 | ID0008 | 2 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2010-2019 | Single House - Double Floor | NaN | NaN | 1400.0 | 5 | NaN | NaN | No | NaN | The house is designed by a certified architect. | Yes | Cement Block | Asbestos | No | Other professional certificates with O/L or A/L / Part qualification (Non graduate) | Clerk / Salesman grades | SEC B | 150000.0 | Smart meter |
8 | ID0009 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2000-2009 | Single House - Single Floor | NaN | NaN | 350.0 | 2 | NaN | NaN | No | NaN | The house is designed by a certified architect. | No | Brick | Asbestos | No | O/L or A/L pending / Passed | Skilled Worker | SEC C | 15000.0 | Non smart meter |
9 | ID0010 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 1990-1999 | Single House - Single Floor | NaN | NaN | 1000.0 | 7 | NaN | NaN | No | NaN | The house is designed by a certified architect. | No | Brick | Tile | No | Schooling up to Grade 6 - 9 | Unskilled Worker | SEC E | 50000.0 | Non smart meter |
household_ID | no_of_electricity_meters | electricity_provider_csc_area | own_the_house_or_living_on_rent | occupy_renters_boarders | awareness_of_electricity_consumption_of_renters | built_year_of_the_house | type_of_house | floor_which_house_located | no_of_storeys | floor_area | no_of_household_members | charging_method_of_renters_for_electricity | charged_method_for_rent_for_electricity | is_there_business_carried_out_in_the_household | type_of_business | whom_or_how_the_house_was_designed | availability_of_certificate_of_compliance | main_material_used_for_walls_of_the_house | main_material_used_for_roof_of_the_house | any_constructions_or_renovations_in_the_household | highest_level_of_education_of_the_chief_wage_earner | occupation_of_the_chief_wage_earner | socio_economic_class | total_monthly_expenditure_of_last_month | type_of_electricity_meter | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
4053 | ID4054 | 1 | NEGOMBO | No, I am living on rent and the rent is paid by me or a household member. | NaN | NaN | 1980-1989 | Single House - Single Floor | NaN | NaN | 1250.0 | 6 | NaN | You pay the full amount of the electricity bill. | No | NaN | NaN | NaN | Brick | NaN | No | O/L or A/L pending / Passed | Skilled Worker | SEC C | 70000.0 | Smart meter |
4054 | ID4055 | 1 | NEGOMBO | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2000-2009 | Single House - Double Floor | NaN | NaN | 1500.0 | 3 | NaN | NaN | No | NaN | NaN | NaN | Brick | NaN | No | O/L or A/L pending / Passed | Middle and Senior executive | SEC B | 60000.0 | Smart meter |
4055 | ID4056 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2000-2009 | Single House - Single Floor | NaN | NaN | 2400.0 | 3 | NaN | NaN | No | NaN | NaN | NaN | Cement Block | NaN | No | Schooling up to Grade 6 - 9 | Unskilled Worker | SEC E | NaN | Non smart meter |
4056 | ID4057 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | Before 1980 | Single House - Single Floor | NaN | NaN | 2400.0 | 2 | NaN | NaN | No | NaN | NaN | NaN | Cabook | NaN | No | O/L or A/L pending / Passed | Skilled Worker | SEC C | 25000.0 | Non smart meter |
4057 | ID4058 | 1 | GALLE | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 1990-1999 | Single House - Single Floor | NaN | NaN | 3000.0 | 6 | NaN | NaN | No | NaN | NaN | NaN | Cement Block | NaN | No | Diploma with O/L or A/L (Non graduate) | Skilled Worker | SEC C | 8000.0 | Non smart meter |
4058 | ID4059 | 1 | NUGEGODA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | In 2020 or After 2020 | Single House - Double Floor | NaN | NaN | 400.0 | 4 | NaN | NaN | No | NaN | NaN | NaN | Brick | NaN | No | Graduate / Post-Grads / Degree level professional qualification | Clerk / Salesman grades | SEC B | 150000.0 | Smart meter |
4059 | ID4060 | 1 | HIKKADUWA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | Before 1980 | Single House - Single Floor | NaN | NaN | 3000.0 | 1 | NaN | NaN | No | NaN | NaN | NaN | Pressed soil blocks | NaN | No | O/L or A/L pending / Passed | Boutique owner | SEC B | 50000.0 | Non smart meter |
4060 | ID4061 | 1 | WATTALA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2000-2009 | Single House - Single Floor | NaN | NaN | 680.0 | 2 | NaN | NaN | No | NaN | NaN | NaN | Cement Block | NaN | No | O/L or A/L pending / Passed | Unskilled Worker | SEC D | 20000.0 | Non smart meter |
4061 | ID4062 | 1 | ALUTHGAMA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 1980-1989 | Single House - Single Floor | NaN | NaN | 700.0 | 2 | NaN | NaN | No | NaN | NaN | NaN | Cabook | NaN | No | O/L or A/L pending / Passed | Self employed (Professional) - No employees | SEC B | 30000.0 | Non smart meter |
4062 | ID4063 | 1 | ALUTHGAMA | Yes, I or a household member owns it. | I don't occupy any of the above. | NaN | 2010-2019 | Single House - Double Floor | NaN | NaN | 2400.0 | 5 | NaN | NaN | No | NaN | NaN | NaN | Cement Block | NaN | No | Schooling up to Grade 6 - 9 | Skilled Worker | SEC D | 100000.0 | Non smart meter |