Overview
Brought to you by YData
Dataset statistics
Number of variables | 36 |
---|---|
Number of observations | 4,063 |
Missing cells | 52,098 |
Missing cells (%) | 35.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.1 MiB |
Average record size in memory | 288.0 B |
Variable types
Text | 1 |
---|---|
Boolean | 29 |
Numeric | 1 |
Categorical | 5 |
biogas_used_for_cooking has constant value "False" | Constant |
aware_of_no_of_units_generated_by_solar_system is highly overall correlated with coconut_shells_or_charcoal_used_for_cooking and 3 other fields | High correlation |
boil_water_before_drinking is highly overall correlated with source_of_energy_for_boiling_drinking_water | High correlation |
coconut_shells_or_charcoal_used_for_cooking is highly overall correlated with aware_of_no_of_units_generated_by_solar_system and 14 other fields | High correlation |
does_water_heating_equipment_serve_other_housing_units is highly overall correlated with coconut_shells_or_charcoal_used_for_cooking and 1 other fields | High correlation |
firewood_used_for_cooking is highly overall correlated with source_of_energy_for_boiling_drinking_water | High correlation |
gas_used_for_cooking is highly overall correlated with source_of_energy_for_boiling_drinking_water | High correlation |
generate_electicity_using_mini_hydropower is highly overall correlated with no_of_units_generated_by_solar_system | High correlation |
generate_electicity_using_solar_energy is highly overall correlated with aware_of_no_of_units_generated_by_solar_system and 12 other fields | High correlation |
generate_electicity_using_wind_power is highly overall correlated with no_of_units_generated_by_solar_system | High correlation |
household_members_used_hot_water_last_week is highly overall correlated with coconut_shells_or_charcoal_used_for_cooking | High correlation |
no_of_units_generated_by_solar_system is highly overall correlated with aware_of_no_of_units_generated_by_solar_system and 6 other fields | High correlation |
sawdust_or_paddy_husk_used_for_cooking is highly overall correlated with aware_of_no_of_units_generated_by_solar_system and 13 other fields | High correlation |
solar_energy_used_for_agricultural_systems is highly overall correlated with coconut_shells_or_charcoal_used_for_cooking and 4 other fields | High correlation |
solar_energy_used_for_all_above is highly overall correlated with coconut_shells_or_charcoal_used_for_cooking and 2 other fields | High correlation |
solar_energy_used_for_car_charging is highly overall correlated with coconut_shells_or_charcoal_used_for_cooking and 2 other fields | High correlation |
solar_energy_used_for_cooking is highly overall correlated with coconut_shells_or_charcoal_used_for_cooking and 2 other fields | High correlation |
solar_energy_used_for_other_purposes is highly overall correlated with coconut_shells_or_charcoal_used_for_cooking and 2 other fields | High correlation |
solar_energy_used_for_outdoor_lighting is highly overall correlated with coconut_shells_or_charcoal_used_for_cooking and 3 other fields | High correlation |
solar_energy_used_for_water_heating is highly overall correlated with coconut_shells_or_charcoal_used_for_cooking and 3 other fields | High correlation |
solar_system_invertor_or_noninvertor is highly overall correlated with coconut_shells_or_charcoal_used_for_cooking and 2 other fields | High correlation |
solar_system_ongrid_or_offgird is highly overall correlated with coconut_shells_or_charcoal_used_for_cooking and 4 other fields | High correlation |
source_of_energy_for_boiling_drinking_water is highly overall correlated with boil_water_before_drinking and 5 other fields | High correlation |
water_heating_method_for_bathing is highly overall correlated with generate_electicity_using_solar_energy | High correlation |
when_was_solar_system_installed is highly overall correlated with coconut_shells_or_charcoal_used_for_cooking and 2 other fields | High correlation |
have_backup_generator is highly imbalanced (81.7%) | Imbalance |
generate_electicity_using_solar_energy is highly imbalanced (53.2%) | Imbalance |
generate_electicity_using_bio_energy is highly imbalanced (95.1%) | Imbalance |
generate_electicity_using_mini_hydropower is highly imbalanced (97.9%) | Imbalance |
generate_electicity_using_wind_power is highly imbalanced (97.5%) | Imbalance |
generate_electicity_using_other_methods is highly imbalanced (96.5%) | Imbalance |
solar_energy_used_for_water_heating is highly imbalanced (60.1%) | Imbalance |
solar_energy_used_for_cooking is highly imbalanced (82.0%) | Imbalance |
solar_energy_used_for_outdoor_lighting is highly imbalanced (59.3%) | Imbalance |
solar_energy_used_for_car_charging is highly imbalanced (93.7%) | Imbalance |
solar_energy_used_for_agricultural_systems is highly imbalanced (97.5%) | Imbalance |
solar_energy_used_for_all_above is highly imbalanced (86.0%) | Imbalance |
solar_energy_used_for_other_purposes is highly imbalanced (78.3%) | Imbalance |
have_system_to_store_backup_energy is highly imbalanced (61.6%) | Imbalance |
method_of_receiving_water is highly imbalanced (72.2%) | Imbalance |
does_water_heating_equipment_serve_other_housing_units is highly imbalanced (57.0%) | Imbalance |
electricity_generated_using_solar_energy_used_for_cooking is highly imbalanced (90.9%) | Imbalance |
kerosene_used_for_cooking is highly imbalanced (89.1%) | Imbalance |
sawdust_or_paddy_husk_used_for_cooking is highly imbalanced (97.9%) | Imbalance |
coconut_shells_or_charcoal_used_for_cooking is highly imbalanced (99.7%) | Imbalance |
other_methods_used_for_cooking is highly imbalanced (90.0%) | Imbalance |
solar_system_ongrid_or_offgird has 3658 (90.0%) missing values | Missing |
solar_system_invertor_or_noninvertor has 3658 (90.0%) missing values | Missing |
solar_energy_used_for_water_heating has 3658 (90.0%) missing values | Missing |
solar_energy_used_for_cooking has 3658 (90.0%) missing values | Missing |
solar_energy_used_for_outdoor_lighting has 3658 (90.0%) missing values | Missing |
solar_energy_used_for_car_charging has 3658 (90.0%) missing values | Missing |
solar_energy_used_for_agricultural_systems has 3658 (90.0%) missing values | Missing |
solar_energy_used_for_all_above has 3658 (90.0%) missing values | Missing |
solar_energy_used_for_other_purposes has 3658 (90.0%) missing values | Missing |
aware_of_no_of_units_generated_by_solar_system has 3658 (90.0%) missing values | Missing |
no_of_units_generated_by_solar_system has 3868 (95.2%) missing values | Missing |
when_was_solar_system_installed has 3658 (90.0%) missing values | Missing |
does_water_heating_equipment_serve_other_housing_units has 3302 (81.3%) missing values | Missing |
household_members_used_hot_water_last_week has 2457 (60.5%) missing values | Missing |
source_of_energy_for_boiling_drinking_water has 2233 (55.0%) missing values | Missing |
household_ID has unique values | Unique |
Reproduction
Analysis started | 2024-12-06 05:55:00.385407 |
---|---|
Analysis finished | 2024-12-06 05:55:04.072929 |
Duration | 3.69 seconds |
Software version | ydata-profiling vv4.11.0 |
Download configuration | config.json |
Variables
household_ID
Text
Unique 
Distinct | 4063 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 31.9 KiB |
Value | Count | Frequency (%) |
id0039 | 1 | < 0.1% |
id4063 | 1 | < 0.1% |
id0001 | 1 | < 0.1% |
id0002 | 1 | < 0.1% |
id0003 | 1 | < 0.1% |
id0004 | 1 | < 0.1% |
id0005 | 1 | < 0.1% |
id0006 | 1 | < 0.1% |
id0007 | 1 | < 0.1% |
id0008 | 1 | < 0.1% |
Other values (4053) | 4053 |
Most occurring characters
Value | Count | Frequency (%) |
I | 4063 | |
D | 4063 | |
0 | 2277 | |
3 | 2217 | |
2 | 2217 | |
1 | 2217 | |
4 | 1280 | 5.3% |
5 | 1216 | 5.0% |
6 | 1210 | 5.0% |
7 | 1206 | 4.9% |
Other values (2) | 2412 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 16252 | |
Uppercase Letter | 8126 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 2277 | |
3 | 2217 | |
2 | 2217 | |
1 | 2217 | |
4 | 1280 | |
5 | 1216 | |
6 | 1210 | |
7 | 1206 | |
8 | 1206 | |
9 | 1206 |
Uppercase Letter
Value | Count | Frequency (%) |
I | 4063 | |
D | 4063 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 16252 | |
Latin | 8126 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 2277 | |
3 | 2217 | |
2 | 2217 | |
1 | 2217 | |
4 | 1280 | |
5 | 1216 | |
6 | 1210 | |
7 | 1206 | |
8 | 1206 | |
9 | 1206 |
Latin
Value | Count | Frequency (%) |
I | 4063 | |
D | 4063 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 24378 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
I | 4063 | |
D | 4063 | |
0 | 2277 | |
3 | 2217 | |
2 | 2217 | |
1 | 2217 | |
4 | 1280 | 5.3% |
5 | 1216 | 5.0% |
6 | 1210 | 5.0% |
7 | 1206 | 4.9% |
Other values (2) | 2412 |
have_backup_generator
Boolean
Imbalance 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.1 KiB |
False | |
---|---|
True | 113 |
Value | Count | Frequency (%) |
False | 3950 | |
True | 113 | 2.8% |
generate_electicity_using_solar_energy
Boolean
High correlation  Imbalance 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.1 KiB |
False | |
---|---|
True |
Value | Count | Frequency (%) |
False | 3658 | |
True | 405 | 10.0% |
generate_electicity_using_bio_energy
Boolean
Imbalance 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.1 KiB |
False | |
---|---|
True | 22 |
Value | Count | Frequency (%) |
False | 4041 | |
True | 22 | 0.5% |
generate_electicity_using_mini_hydropower
Boolean
High correlation  Imbalance 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.1 KiB |
False | |
---|---|
True | 8 |
Value | Count | Frequency (%) |
False | 4055 | |
True | 8 | 0.2% |
generate_electicity_using_wind_power
Boolean
High correlation  Imbalance 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.1 KiB |
False | |
---|---|
True | 10 |
Value | Count | Frequency (%) |
False | 4053 | |
True | 10 | 0.2% |
generate_electicity_using_other_methods
Boolean
Imbalance 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.1 KiB |
False | |
---|---|
True | 15 |
Value | Count | Frequency (%) |
False | 4048 | |
True | 15 | 0.4% |
solar_system_ongrid_or_offgird
Boolean
High correlation  Missing 
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 3658 |
Missing (%) | 90.0% |
Memory size | 8.1 KiB |
True | 330 |
---|---|
False | 75 |
(Missing) |
Value | Count | Frequency (%) |
True | 330 | 8.1% |
False | 75 | 1.8% |
(Missing) | 3658 |
solar_system_invertor_or_noninvertor
Boolean
High correlation  Missing 
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 3658 |
Missing (%) | 90.0% |
Memory size | 8.1 KiB |
True | 308 |
---|---|
False | 97 |
(Missing) |
Value | Count | Frequency (%) |
True | 308 | 7.6% |
False | 97 | 2.4% |
(Missing) | 3658 |
solar_energy_used_for_water_heating
Boolean
High correlation  Imbalance  Missing 
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 3658 |
Missing (%) | 90.0% |
Memory size | 8.1 KiB |
False | |
---|---|
True | 32 |
(Missing) |
Value | Count | Frequency (%) |
False | 373 | 9.2% |
True | 32 | 0.8% |
(Missing) | 3658 |
solar_energy_used_for_cooking
Boolean
High correlation  Imbalance  Missing 
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 3658 |
Missing (%) | 90.0% |
Memory size | 8.1 KiB |
False | |
---|---|
True | 11 |
(Missing) |
Value | Count | Frequency (%) |
False | 394 | 9.7% |
True | 11 | 0.3% |
(Missing) | 3658 |
solar_energy_used_for_outdoor_lighting
Boolean
High correlation  Imbalance  Missing 
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 3658 |
Missing (%) | 90.0% |
Memory size | 8.1 KiB |
False | |
---|---|
True | 33 |
(Missing) |
Value | Count | Frequency (%) |
False | 372 | 9.2% |
True | 33 | 0.8% |
(Missing) | 3658 |
solar_energy_used_for_car_charging
Boolean
High correlation  Imbalance  Missing 
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 3658 |
Missing (%) | 90.0% |
Memory size | 8.1 KiB |
False | |
---|---|
True | 3 |
(Missing) |
Value | Count | Frequency (%) |
False | 402 | 9.9% |
True | 3 | 0.1% |
(Missing) | 3658 |
solar_energy_used_for_agricultural_systems
Boolean
High correlation  Imbalance  Missing 
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 3658 |
Missing (%) | 90.0% |
Memory size | 8.1 KiB |
False | |
---|---|
True | 1 |
(Missing) |
Value | Count | Frequency (%) |
False | 404 | 9.9% |
True | 1 | < 0.1% |
(Missing) | 3658 |
solar_energy_used_for_all_above
Boolean
High correlation  Imbalance  Missing 
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 3658 |
Missing (%) | 90.0% |
Memory size | 8.1 KiB |
False | |
---|---|
True | 8 |
(Missing) |
Value | Count | Frequency (%) |
False | 397 | 9.8% |
True | 8 | 0.2% |
(Missing) | 3658 |
solar_energy_used_for_other_purposes
Boolean
High correlation  Imbalance  Missing 
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 3658 |
Missing (%) | 90.0% |
Memory size | 8.1 KiB |
False | |
---|---|
True | 14 |
(Missing) |
Value | Count | Frequency (%) |
False | 391 | 9.6% |
True | 14 | 0.3% |
(Missing) | 3658 |
aware_of_no_of_units_generated_by_solar_system
Boolean
High correlation  Missing 
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 3658 |
Missing (%) | 90.0% |
Memory size | 8.1 KiB |
False | 210 |
---|---|
True | 195 |
(Missing) |
Value | Count | Frequency (%) |
False | 210 | 5.2% |
True | 195 | 4.8% |
(Missing) | 3658 |
no_of_units_generated_by_solar_system
Real number (ℝ)
High correlation  Missing 
Distinct | 102 |
---|---|
Distinct (%) | 52.3% |
Missing | 3868 |
Missing (%) | 95.2% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 445.34718 |
Minimum | 0 |
---|---|
Maximum | 2500 |
Zeros | 22 |
Zeros (%) | 0.5% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 31.9 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 240 |
median | 400 |
Q3 | 600 |
95-th percentile | 1045 |
Maximum | 2500 |
Range | 2500 |
Interquartile range (IQR) | 360 |
Descriptive statistics
Standard deviation | 381.12807 |
---|---|
Coefficient of variation (CV) | 0.8557999 |
Kurtosis | 8.4346225 |
Mean | 445.34718 |
Median Absolute Deviation (MAD) | 180 |
Skewness | 2.3187772 |
Sum | 86842.7 |
Variance | 145258.61 |
Monotonicity | Not monotonic |