Overview
Brought to you by YData
Dataset statistics
Number of variables | 15 |
---|---|
Number of observations | 16270 |
Missing cells | 45187 |
Missing cells (%) | 18.5% |
Duplicate rows | 542 |
Duplicate rows (%) | 3.3% |
Total size in memory | 2.5 MiB |
Average record size in memory | 160.5 B |
Variable types
Categorical | 4 |
---|---|
Numeric | 11 |
Dataset has 542 (3.3%) duplicate rows | Duplicates |
current_attendance_in_any_education_instituition has 418 (2.6%) missing values | Missing |
highest_level_of_education has 775 (4.8%) missing values | Missing |
main_activity_engaged_in has 2133 (13.1%) missing values | Missing |
main_occupation has 9902 (60.9%) missing values | Missing |
daily_wage_owner_or_not has 10090 (62.0%) missing values | Missing |
employment_status_of_the_main_occupation has 9902 (60.9%) missing values | Missing |
member_went_out_for_work_or_not_during_last_week has 11967 (73.6%) missing values | Missing |
highest_level_of_education has 216 (1.3%) zeros | Zeros |
no_of_hours_stayed_at_home_during_last_week has 699 (4.3%) zeros | Zeros |
Reproduction
Analysis started | 2024-11-18 08:36:49.218635 |
---|---|
Analysis finished | 2024-11-18 08:37:01.677034 |
Duration | 12.46 seconds |
Software version | ydata-profiling vv4.11.0 |
Download configuration | config.json |
Variables
member_ID
Categorical
Distinct | 13 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 770.3 KiB |
I_1 | |
---|---|
I_2 | |
I_3 | |
I_4 | |
I_5 | |
Other values (8) |
Common Values
Value | Count | Frequency (%) |
I_1 | 4063 | |
I_2 | 3877 | |
I_3 | 3275 | |
I_4 | 2457 | |
I_5 | 1443 | 8.9% |
I_6 | 671 | 4.1% |
I_7 | 264 | 1.6% |
I_8 | 120 | 0.7% |
I_9 | 55 | 0.3% |
I_10 | 26 | 0.2% |
Other values (3) | 19 | 0.1% |
Length
Value | Count | Frequency (%) |
i_1 | 4063 | |
i_2 | 3877 | |
i_3 | 3275 | |
i_4 | 2457 | |
i_5 | 1443 | 8.9% |
i_6 | 671 | 4.1% |
i_7 | 264 | 1.6% |
i_8 | 120 | 0.7% |
i_9 | 55 | 0.3% |
i_10 | 26 | 0.2% |
Other values (3) | 19 | 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
I | 16270 | |
_ | 16270 | |
1 | 4119 | 8.4% |
2 | 3883 | 7.9% |
3 | 3277 | 6.7% |
4 | 2457 | 5.0% |
5 | 1443 | 3.0% |
6 | 671 | 1.4% |
7 | 264 | 0.5% |
8 | 120 | 0.2% |
Other values (2) | 81 | 0.2% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 48855 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
I | 16270 | |
_ | 16270 | |
1 | 4119 | 8.4% |
2 | 3883 | 7.9% |
3 | 3277 | 6.7% |
4 | 2457 | 5.0% |
5 | 1443 | 3.0% |
6 | 671 | 1.4% |
7 | 264 | 0.5% |
8 | 120 | 0.2% |
Other values (2) | 81 | 0.2% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 48855 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
I | 16270 | |
_ | 16270 | |
1 | 4119 | 8.4% |
2 | 3883 | 7.9% |
3 | 3277 | 6.7% |
4 | 2457 | 5.0% |
5 | 1443 | 3.0% |
6 | 671 | 1.4% |
7 | 264 | 0.5% |
8 | 120 | 0.2% |
Other values (2) | 81 | 0.2% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 48855 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
I | 16270 | |
_ | 16270 | |
1 | 4119 | 8.4% |
2 | 3883 | 7.9% |
3 | 3277 | 6.7% |
4 | 2457 | 5.0% |
5 | 1443 | 3.0% |
6 | 671 | 1.4% |
7 | 264 | 0.5% |
8 | 120 | 0.2% |
Other values (2) | 81 | 0.2% |
age
Real number (ℝ)
Distinct | 97 |
---|---|
Distinct (%) | 0.6% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38.39287 |
Minimum | 0 |
---|---|
Maximum | 98 |
Zeros | 149 |
Zeros (%) | 0.9% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 770.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 5 |
Q1 | 19 |
median | 38 |
Q3 | 56 |
95-th percentile | 75 |
Maximum | 98 |
Range | 98 |
Interquartile range (IQR) | 37 |
Descriptive statistics
Standard deviation | 22.075172 |
---|---|
Coefficient of variation (CV) | 0.57498103 |
Kurtosis | -0.99725102 |
Mean | 38.39287 |
Median Absolute Deviation (MAD) | 18 |
Skewness | 0.14195376 |
Sum | 624652 |
Variance | 487.31322 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
19 | 282 | 1.7% |
17 | 278 | 1.7% |
23 | 276 | 1.7% |
15 | 267 | 1.6% |
18 | 266 | 1.6% |
20 | 263 | 1.6% |
16 | 256 | 1.6% |
42 | 252 | 1.5% |
22 | 251 | 1.5% |
45 | 245 | 1.5% |
Other values (87) | 13634 |
Value | Count | Frequency (%) |
0 | 149 | |
1 | 131 | |
2 | 138 | |
3 | 186 | |
4 | 171 | |
5 | 177 | |
6 | 163 | |
7 | 164 | |
8 | 196 | |
9 | 198 |
Value | Count | Frequency (%) |
98 | 1 | < 0.1% |
96 | 1 | < 0.1% |
95 | 3 | < 0.1% |
93 | 8 | < 0.1% |
92 | 5 | < 0.1% |
91 | 7 | < 0.1% |
90 | 17 | |
89 | 20 | |
88 | 16 | |
87 | 14 |
relationship_to_the_head_of_household
Real number (ℝ)
Distinct | 12 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 7.8720344 |
Minimum | 1 |
---|---|
Maximum | 109 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 770.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 3 |
Q3 | 3 |
95-th percentile | 75 |
Maximum | 109 |
Range | 108 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 20.1174 |
---|---|
Coefficient of variation (CV) | 2.5555529 |
Kurtosis | 11.327254 |
Mean | 7.8720344 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 3.5888571 |
Sum | 128078 |
Variance | 404.70979 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 5654 | |
1 | 4012 | |
2 | 3226 | |
5 | 1198 | 7.4% |
75 | 685 | 4.2% |
6 | 666 | 4.1% |
4 | 434 | 2.7% |
97 | 237 | 1.5% |
86 | 101 | 0.6% |
109 | 52 | 0.3% |
Other values (2) | 5 | < 0.1% |
Value | Count | Frequency (%) |
1 | 4012 | |
2 | 3226 | |
3 | 5654 | |
4 | 434 | 2.7% |
5 | 1198 | 7.4% |
6 | 666 | 4.1% |
12 | 3 | < 0.1% |
75 | 685 | 4.2% |
86 | 101 | 0.6% |
88 | 2 | < 0.1% |
Value | Count | Frequency (%) |
109 | 52 | 0.3% |
97 | 237 | 1.5% |
88 | 2 | < 0.1% |
86 | 101 | 0.6% |
75 | 685 | 4.2% |
12 | 3 | < 0.1% |
6 | 666 | 4.1% |
5 | 1198 | 7.4% |
4 | 434 | 2.7% |
3 | 5654 |
Common Values
Value | Count | Frequency (%) |
1 | 8386 | |
0 | 7884 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1 | 8386 | |
0 | 7884 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 8386 | |
0 | 7884 |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 16270 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
1 | 8386 | |
0 | 7884 |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 16270 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
1 | 8386 | |
0 | 7884 |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 16270 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
1 | 8386 | |
0 | 7884 |
ethnicity
Real number (ℝ)
Distinct | 7 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.4357714 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 770.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 1 |
95-th percentile | 4 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 0 |
Descriptive statistics
Standard deviation | 1.0512815 |
---|---|
Coefficient of variation (CV) | 0.73220675 |
Kurtosis | 4.8198478 |
Mean | 1.4357714 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 2.326844 |
Sum | 23360 |
Variance | 1.1051928 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 13560 | |
4 | 1968 | 12.1% |
2 | 572 | 3.5% |
3 | 82 | 0.5% |
6 | 42 | 0.3% |
5 | 32 | 0.2% |
9 | 14 | 0.1% |
Value | Count | Frequency (%) |
1 | 13560 | |
2 | 572 | 3.5% |
3 | 82 | 0.5% |
4 | 1968 | 12.1% |
5 | 32 | 0.2% |
6 | 42 | 0.3% |
9 | 14 | 0.1% |
Value | Count | Frequency (%) |
9 | 14 | 0.1% |
6 | 42 | 0.3% |
5 | 32 | 0.2% |
4 | 1968 | 12.1% |
3 | 82 | 0.5% |
2 | 572 | 3.5% |
1 | 13560 |
religion
Real number (ℝ)
Distinct | 7 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.8679164 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 770.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 3 |
95-th percentile | 4 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 1.306145 |
---|---|
Coefficient of variation (CV) | 0.69925236 |
Kurtosis | -0.14726716 |
Mean | 1.8679164 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 1.0956388 |
Sum | 30391 |
Variance | 1.7060147 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 10807 | |
4 | 2442 | 15.0% |
3 | 2053 | 12.6% |
5 | 547 | 3.4% |
2 | 407 | 2.5% |
9 | 8 | < 0.1% |
6 | 6 | < 0.1% |
Value | Count | Frequency (%) |
1 | 10807 | |
2 | 407 | 2.5% |
3 | 2053 | 12.6% |
4 | 2442 | 15.0% |
5 | 547 | 3.4% |
6 | 6 | < 0.1% |
9 | 8 | < 0.1% |
Value | Count | Frequency (%) |
9 | 8 | < 0.1% |
6 | 6 | < 0.1% |
5 | 547 | 3.4% |
4 | 2442 | 15.0% |
3 | 2053 | 12.6% |
2 | 407 | 2.5% |
1 | 10807 |
marital_status
Real number (ℝ)
Distinct | 9 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.9127843 |
Minimum | 1 |
---|---|
Maximum | 9 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 770.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 2 |
Q3 | 2 |
95-th percentile | 4 |
Maximum | 9 |
Range | 8 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 1.1643284 |
---|---|
Coefficient of variation (CV) | 0.60870866 |
Kurtosis | 13.576298 |
Mean | 1.9127843 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 2.9990392 |
Sum | 31121 |
Variance | 1.3556605 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
2 | 7417 | |
1 | 6330 | |
3 | 1311 | 8.1% |
4 | 893 | 5.5% |
9 | 138 | 0.8% |
7 | 64 | 0.4% |
8 | 52 | 0.3% |
5 | 44 | 0.3% |
6 | 21 | 0.1% |
Value | Count | Frequency (%) |
1 | 6330 | |
2 | 7417 | |
3 | 1311 | 8.1% |
4 | 893 | 5.5% |
5 | 44 | 0.3% |
6 | 21 | 0.1% |
7 | 64 | 0.4% |
8 | 52 | 0.3% |
9 | 138 | 0.8% |
Value | Count | Frequency (%) |
9 | 138 | 0.8% |
8 | 52 | 0.3% |
7 | 64 | 0.4% |
6 | 21 | 0.1% |
5 | 44 | 0.3% |
4 | 893 | 5.5% |
3 | 1311 | 8.1% |
2 | 7417 | |
1 | 6330 |
current_attendance_in_any_education_instituition
Real number (ℝ)
Missing 
Distinct | 8 |
---|---|
Distinct (%) | 0.1% |
Missing | 418 |
Missing (%) | 2.6% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 6.4575448 |
Minimum | 1 |
---|---|
Maximum | 8 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 770.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 2 |
Q1 | 4 |
median | 8 |
Q3 | 8 |
95-th percentile | 8 |
Maximum | 8 |
Range | 7 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 2.5556012 |
---|---|
Coefficient of variation (CV) | 0.39575432 |
Kurtosis | -0.64997855 |
Mean | 6.4575448 |
Median Absolute Deviation (MAD) | 0 |
Skewness | -1.1196853 |
Sum | 102365 |
Variance | 6.5310976 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
8 | 11435 | |
2 | 2964 | 18.2% |
3 | 560 | 3.4% |
1 | 298 | 1.8% |
4 | 233 | 1.4% |
5 | 191 | 1.2% |
6 | 105 | 0.6% |
7 | 66 | 0.4% |
(Missing) | 418 | 2.6% |
Value | Count | Frequency (%) |
1 | 298 | 1.8% |
2 | 2964 | 18.2% |
3 | 560 | 3.4% |
4 | 233 | 1.4% |
5 | 191 | 1.2% |
6 | 105 | 0.6% |
7 | 66 | 0.4% |
8 | 11435 |
Value | Count | Frequency (%) |
8 | 11435 | |
7 | 66 | 0.4% |
6 | 105 | 0.6% |
5 | 191 | 1.2% |
4 | 233 | 1.4% |
3 | 560 | 3.4% |
2 | 2964 | 18.2% |
1 | 298 | 1.8% |
highest_level_of_education
Real number (ℝ)
Missing  Zeros 
Distinct | 20 |
---|---|
Distinct (%) | 0.1% |
Missing | 775 |
Missing (%) | 4.8% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 11.294095 |
Minimum | 0 |
---|---|
Maximum | 19 |
Zeros | 216 |
Zeros (%) | 1.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 770.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 4 |
Q1 | 10 |
median | 12 |
Q3 | 14 |
95-th percentile | 16 |
Maximum | 19 |
Range | 19 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 3.6555127 |
---|---|
Coefficient of variation (CV) | 0.32366584 |
Kurtosis | 0.90851411 |
Mean | 11.294095 |
Median Absolute Deviation (MAD) | 2 |
Skewness | -1.0071066 |
Sum | 175002 |
Variance | 13.362773 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
14 | 2989 | |
12 | 2697 | |
11 | 2484 | |
16 | 1333 | |
13 | 1219 | |
9 | 772 | 4.7% |
10 | 638 | 3.9% |
6 | 567 | 3.5% |
8 | 509 | 3.1% |
7 | 407 | 2.5% |
Other values (10) | 1880 | |
(Missing) | 775 | 4.8% |
Value | Count | Frequency (%) |
0 | 216 | 1.3% |
1 | 143 | 0.9% |
2 | 161 | 1.0% |
3 | 233 | 1.4% |
4 | 277 | 1.7% |
5 | 327 | |
6 | 567 | |
7 | 407 | |
8 | 509 | |
9 | 772 |
Value | Count | Frequency (%) |
19 | 42 | 0.3% |
18 | 84 | 0.5% |
17 | 235 | 1.4% |
16 | 1333 | |
15 | 162 | 1.0% |
14 | 2989 | |
13 | 1219 | |
12 | 2697 | |
11 | 2484 | |
10 | 638 | 3.9% |
main_activity_engaged_in
Real number (ℝ)
Missing 
Distinct | 10 |
---|---|
Distinct (%) | 0.1% |
Missing | 2133 |
Missing (%) | 13.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 4.5640518 |
Minimum | 1 |
---|---|
Maximum | 10 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 770.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 6 |
Q3 | 7 |
95-th percentile | 9 |
Maximum | 10 |
Range | 9 |
Interquartile range (IQR) | 6 |
Descriptive statistics
Standard deviation | 3.210986 |
---|---|
Coefficient of variation (CV) | 0.70353846 |
Kurtosis | -1.7877888 |
Mean | 4.5640518 |
Median Absolute Deviation (MAD) | 3 |
Skewness | -0.030884242 |
Sum | 64522 |
Variance | 10.310431 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 5473 | |
7 | 3465 | |
8 | 2513 | |
9 | 859 | 5.3% |
2 | 707 | 4.3% |
4 | 364 | 2.2% |
5 | 266 | 1.6% |
3 | 215 | 1.3% |
6 | 159 | 1.0% |
10 | 116 | 0.7% |
(Missing) | 2133 | 13.1% |
Value | Count | Frequency (%) |
1 | 5473 | |
2 | 707 | 4.3% |
3 | 215 | 1.3% |
4 | 364 | 2.2% |
5 | 266 | 1.6% |
6 | 159 | 1.0% |
7 | 3465 | |
8 | 2513 | |
9 | 859 | 5.3% |
10 | 116 | 0.7% |
Value | Count | Frequency (%) |
10 | 116 | 0.7% |
9 | 859 | 5.3% |
8 | 2513 | |
7 | 3465 | |
6 | 159 | 1.0% |
5 | 266 | 1.6% |
4 | 364 | 2.2% |
3 | 215 | 1.3% |
2 | 707 | 4.3% |
1 | 5473 |
main_occupation
Real number (ℝ)
Missing 
Distinct | 11 |
---|---|
Distinct (%) | 0.2% |
Missing | 9902 |
Missing (%) | 60.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.8406093 |
Minimum | 1 |
---|---|
Maximum | 99 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 770.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 2 |
median | 5 |
Q3 | 6 |
95-th percentile | 11 |
Maximum | 99 |
Range | 98 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 10.312635 |
---|---|
Coefficient of variation (CV) | 1.7656779 |
Kurtosis | 72.386715 |
Mean | 5.8406093 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 8.3275904 |
Sum | 37193 |
Variance | 106.35043 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
5 | 1777 | 10.9% |
2 | 1272 | 7.8% |
9 | 605 | 3.7% |
4 | 527 | 3.2% |
3 | 501 | 3.1% |
1 | 469 | 2.9% |
6 | 333 | 2.0% |
11 | 296 | 1.8% |
7 | 271 | 1.7% |
8 | 245 | 1.5% |
(Missing) | 9902 |
Value | Count | Frequency (%) |
1 | 469 | 2.9% |
2 | 1272 | |
3 | 501 | 3.1% |
4 | 527 | 3.2% |
5 | 1777 | |
6 | 333 | 2.0% |
7 | 271 | 1.7% |
8 | 245 | 1.5% |
9 | 605 | 3.7% |
11 | 296 | 1.8% |
Value | Count | Frequency (%) |
99 | 72 | 0.4% |
11 | 296 | 1.8% |
9 | 605 | 3.7% |
8 | 245 | 1.5% |
7 | 271 | 1.7% |
6 | 333 | 2.0% |
5 | 1777 | |
4 | 527 | 3.2% |
3 | 501 | 3.1% |
2 | 1272 |
daily_wage_owner_or_not
Categorical
Missing 
Distinct | 2 |
---|---|
Distinct (%) | < 0.1% |
Missing | 10090 |
Missing (%) | 62.0% |
Memory size | 770.3 KiB |
2.0 | |
---|---|
1.0 |
Common Values
Value | Count | Frequency (%) |
2.0 | 4064 | |
1.0 | 2116 | 13.0% |
(Missing) | 10090 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2.0 | 4064 | |
1.0 | 2116 |
Most occurring characters
Value | Count | Frequency (%) |
. | 6180 | |
0 | 6180 | |
2 | 4064 | |
1 | 2116 | 11.4% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 18540 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
. | 6180 | |
0 | 6180 | |
2 | 4064 | |
1 | 2116 | 11.4% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 18540 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
. | 6180 | |
0 | 6180 | |
2 | 4064 | |
1 | 2116 | 11.4% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 18540 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
. | 6180 | |
0 | 6180 | |
2 | 4064 | |
1 | 2116 | 11.4% |
employment_status_of_the_main_occupation
Real number (ℝ)
Missing 
Distinct | 6 |
---|---|
Distinct (%) | 0.1% |
Missing | 9902 |
Missing (%) | 60.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 3.182946 |
Minimum | 1 |
---|---|
Maximum | 6 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 770.3 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 3 |
Q3 | 4 |
95-th percentile | 5 |
Maximum | 6 |
Range | 5 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 1.2221368 |
---|---|
Coefficient of variation (CV) | 0.38396404 |
Kurtosis | -0.079889298 |
Mean | 3.182946 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 0.079925575 |
Sum | 20269 |
Variance | 1.4936183 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
3 | 3698 | 22.7% |
5 | 1084 | 6.7% |
1 | 859 | 5.3% |
4 | 415 | 2.6% |
2 | 159 | 1.0% |
6 | 153 | 0.9% |
(Missing) | 9902 |
Value | Count | Frequency (%) |
1 | 859 | 5.3% |
2 | 159 | 1.0% |
3 | 3698 | |
4 | 415 | 2.6% |
5 | 1084 | 6.7% |
6 | 153 | 0.9% |
Value | Count | Frequency (%) |
6 | 153 | 0.9% |
5 | 1084 | 6.7% |
4 | 415 | 2.6% |
3 | 3698 | |
2 | 159 | 1.0% |
1 | 859 | 5.3% |
no_of_hours_stayed_at_home_during_last_week
Real number (ℝ)
Zeros 
Distinct | 326 |
---|---|
Distinct (%) | 2.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 124.84774 |
Minimum | 0 |
---|---|
Maximum | 168 |
Zeros | 699 |
Zeros (%) | 4.3% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 770.3 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 10 |
Q1 | 96 |
median | 140 |
Q3 | 168 |
95-th percentile | 168 |
Maximum | 168 |
Range | 168 |
Interquartile range (IQR) | 72 |
Descriptive statistics
Standard deviation | 47.768165 |
---|---|
Coefficient of variation (CV) | 0.38261138 |
Kurtosis | 0.29131003 |
Mean | 124.84774 |
Median Absolute Deviation (MAD) | 28 |
Skewness | -1.0383694 |
Sum | 2031272.7 |
Variance | 2281.7976 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
168 | 5380 | |
84 | 761 | 4.7% |
0 | 699 | 4.3% |
160 | 449 | 2.8% |
150 | 439 | 2.7% |
120 | 422 | 2.6% |
140 | 360 | 2.2% |
100 | 333 | 2.0% |
108 | 301 | 1.9% |
96 | 282 | 1.7% |
Other values (316) | 6844 |
Value | Count | Frequency (%) |
0 | 699 | |
0.142 | 1 | < 0.1% |
0.147 | 1 | < 0.1% |
0.159 | 1 | < 0.1% |
0.168 | 1 | < 0.1% |
0.25 | 1 | < 0.1% |
0.3 | 1 | < 0.1% |
1 | 13 | 0.1% |
2 | 13 | 0.1% |
2.3 | 2 | < 0.1% |
Value | Count | Frequency (%) |
168 | 5380 | |
167.5 | 1 | < 0.1% |
167.3 | 1 | < 0.1% |
167.25 | 1 | < 0.1% |
167 | 36 | 0.2% |
166.5 | 1 | < 0.1% |
166 | 87 | 0.5% |
165.9 | 1 | < 0.1% |
165.75 | 1 | < 0.1% |
165.7 | 1 | < 0.1% |
member_went_out_for_work_or_not_during_last_week
Categorical
Missing 
Distinct | 3 |
---|---|
Distinct (%) | 0.1% |
Missing | 11967 |
Missing (%) | 73.6% |
Memory size | 770.3 KiB |
1.0 | |
---|---|
3.0 | |
2.0 |
Common Values
Value | Count | Frequency (%) |
1.0 | 2764 | 17.0% |
3.0 | 857 | 5.3% |
2.0 | 682 | 4.2% |
(Missing) | 11967 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
1.0 | 2764 | |
3.0 | 857 | 19.9% |
2.0 | 682 | 15.8% |
Most occurring characters
Value | Count | Frequency (%) |
. | 4303 | |
0 | 4303 | |
1 | 2764 | |
3 | 857 | 6.6% |
2 | 682 | 5.3% |
Most occurring categories
Value | Count | Frequency (%) |
(unknown) | 12909 |
Most frequent character per category
(unknown)
Value | Count | Frequency (%) |
. | 4303 | |
0 | 4303 | |
1 | 2764 | |
3 | 857 | 6.6% |
2 | 682 | 5.3% |
Most occurring scripts
Value | Count | Frequency (%) |
(unknown) | 12909 |
Most frequent character per script
(unknown)
Value | Count | Frequency (%) |
. | 4303 | |
0 | 4303 | |
1 | 2764 | |
3 | 857 | 6.6% |
2 | 682 | 5.3% |
Most occurring blocks
Value | Count | Frequency (%) |
(unknown) | 12909 |
Most frequent character per block
(unknown)
Value | Count | Frequency (%) |
. | 4303 | |
0 | 4303 | |
1 | 2764 | |
3 | 857 | 6.6% |
2 | 682 | 5.3% |
Interactions
Missing values
Sample
member_ID | age | relationship_to_the_head_of_household | gender | ethnicity | religion | marital_status | current_attendance_in_any_education_instituition | highest_level_of_education | main_activity_engaged_in | main_occupation | daily_wage_owner_or_not | employment_status_of_the_main_occupation | no_of_hours_stayed_at_home_during_last_week | member_went_out_for_work_or_not_during_last_week | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
household_ID | |||||||||||||||
ID0001 | I_1 | 71 | 1 | 0 | 1 | 1 | 2 | 8.0 | 14.0 | 2.0 | 99.0 | 2.0 | 1.0 | 168.0 | 3.0 |
ID0001 | I_2 | 66 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN |
ID0001 | I_3 | 32 | 3 | 0 | 1 | 1 | 2 | 8.0 | 17.0 | 1.0 | 2.0 | 2.0 | 1.0 | 70.0 | 1.0 |
ID0001 | I_4 | 30 | 4 | 1 | 1 | 1 | 2 | 8.0 | 17.0 | 1.0 | 2.0 | 2.0 | 1.0 | 150.0 | 1.0 |
ID0002 | I_1 | 85 | 1 | 0 | 1 | 1 | 2 | 8.0 | 7.0 | 4.0 | NaN | NaN | NaN | 168.0 | NaN |
ID0002 | I_2 | 66 | 5 | 0 | 1 | 1 | 2 | 8.0 | 14.0 | 2.0 | 7.0 | 2.0 | 3.0 | 0.0 | 2.0 |
ID0002 | I_3 | 59 | 3 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 2.0 | 4.0 | 2.0 | 1.0 | 168.0 | 3.0 |
ID0003 | I_1 | 44 | 1 | 0 | 1 | 1 | 2 | 8.0 | 16.0 | 2.0 | 2.0 | 2.0 | 1.0 | 100.0 | 1.0 |
ID0003 | I_2 | 41 | 2 | 1 | 1 | 1 | 2 | 8.0 | 17.0 | 2.0 | 2.0 | 2.0 | 1.0 | 100.0 | 1.0 |
ID0003 | I_3 | 74 | 5 | 1 | 1 | 1 | 4 | 8.0 | 16.0 | 4.0 | NaN | NaN | NaN | 168.0 | NaN |
member_ID | age | relationship_to_the_head_of_household | gender | ethnicity | religion | marital_status | current_attendance_in_any_education_instituition | highest_level_of_education | main_activity_engaged_in | main_occupation | daily_wage_owner_or_not | employment_status_of_the_main_occupation | no_of_hours_stayed_at_home_during_last_week | member_went_out_for_work_or_not_during_last_week | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
household_ID | |||||||||||||||
ID4060 | I_1 | 78 | 1 | 1 | 1 | 1 | 1 | 8.0 | 11.0 | 7.0 | NaN | NaN | NaN | 130.0 | NaN |
ID4061 | I_1 | 82 | 1 | 1 | 1 | 1 | 4 | 8.0 | 12.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN |
ID4061 | I_2 | 53 | 3 | 0 | 1 | 1 | 1 | 8.0 | 12.0 | 1.0 | 9.0 | 1.0 | 3.0 | 70.0 | NaN |
ID4062 | I_1 | 73 | 1 | 0 | 1 | 1 | 2 | 8.0 | 12.0 | 1.0 | 5.0 | 1.0 | 5.0 | 168.0 | NaN |
ID4062 | I_2 | 66 | 2 | 1 | 1 | 1 | 2 | 8.0 | 12.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN |
ID4063 | I_1 | 62 | 1 | 1 | 1 | 1 | 4 | 8.0 | 11.0 | 7.0 | NaN | NaN | NaN | 48.0 | NaN |
ID4063 | I_2 | 49 | 4 | 0 | 1 | 1 | 2 | 8.0 | 11.0 | 1.0 | 5.0 | 1.0 | 3.0 | 120.0 | NaN |
ID4063 | I_3 | 42 | 3 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 48.0 | NaN |
ID4063 | I_4 | 37 | 3 | 0 | 1 | 1 | 1 | 8.0 | 11.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN |
ID4063 | I_5 | 36 | 3 | 0 | 1 | 1 | 1 | 8.0 | 11.0 | 1.0 | 3.0 | 2.0 | 3.0 | 0.0 | NaN |
Duplicate rows
Most frequently occurring
member_ID | age | relationship_to_the_head_of_household | gender | ethnicity | religion | marital_status | current_attendance_in_any_education_instituition | highest_level_of_education | main_activity_engaged_in | main_occupation | daily_wage_owner_or_not | employment_status_of_the_main_occupation | no_of_hours_stayed_at_home_during_last_week | member_went_out_for_work_or_not_during_last_week | # duplicates | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
103 | I_2 | 42 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 9 |
163 | I_2 | 51 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 9 |
356 | I_4 | 0 | 3 | 1 | 1 | 1 | 1 | NaN | NaN | NaN | NaN | NaN | NaN | 168.0 | NaN | 9 |
130 | I_2 | 46 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 8 |
245 | I_2 | 67 | 2 | 1 | 1 | 1 | 2 | 8.0 | 12.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 8 |
261 | I_3 | 1 | 3 | 0 | 1 | 1 | 1 | NaN | NaN | NaN | NaN | NaN | NaN | 168.0 | NaN | 8 |
366 | I_4 | 2 | 3 | 1 | 1 | 1 | 1 | NaN | NaN | NaN | NaN | NaN | NaN | 168.0 | NaN | 8 |
79 | I_2 | 38 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 7 |
146 | I_2 | 48 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 7 |
167 | I_2 | 52 | 2 | 1 | 1 | 1 | 2 | 8.0 | 14.0 | 7.0 | NaN | NaN | NaN | 168.0 | NaN | 7 |