Overview

Brought to you by YData

Dataset statistics

Number of variables4
Number of observations53,599
Missing cells21,780
Missing cells (%)10.2%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.6 MiB
Average record size in memory32.0 B

Variable types

Text3
Numeric1

Alerts

no_of_hours_used_during_last_week has 21780 (40.6%) missing values Missing
no_of_hours_used_during_last_week has 7837 (14.6%) zeros Zeros

Reproduction

Analysis started2024-12-06 05:54:32.951821
Analysis finished2024-12-06 05:54:33.577925
Duration0.63 seconds
Software versionydata-profiling vv4.11.0
Download configurationconfig.json

Variables

Distinct4055
Distinct (%)7.6%
Missing0
Missing (%)0.0%
Memory size418.9 KiB
2024-12-06T11:24:33.762266image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length6
Median length6
Mean length6
Min length6

Characters and Unicode

Total characters321,594
Distinct characters12
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique53 ?
Unique (%)0.1%

Sample

1st rowID0001
2nd rowID0001
3rd rowID0001
4th rowID0001
5th rowID0001
ValueCountFrequency (%)
id0255 66
 
0.1%
id0772 59
 
0.1%
id3663 55
 
0.1%
id2774 54
 
0.1%
id2262 53
 
0.1%
id3068 52
 
0.1%
id0841 52
 
0.1%
id3787 52
 
0.1%
id3910 51
 
0.1%
id2420 51
 
0.1%
Other values (4045) 53054
99.0%
2024-12-06T11:24:34.309345image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
I 53599
16.7%
D 53599
16.7%
3 30301
9.4%
0 29498
9.2%
2 29272
9.1%
1 29216
9.1%
7 16459
 
5.1%
4 16396
 
5.1%
6 16312
 
5.1%
8 15797
 
4.9%
Other values (2) 31145
9.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 214396
66.7%
Uppercase Letter 107198
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 30301
14.1%
0 29498
13.8%
2 29272
13.7%
1 29216
13.6%
7 16459
7.7%
4 16396
7.6%
6 16312
7.6%
8 15797
7.4%
9 15598
7.3%
5 15547
7.3%
Uppercase Letter
ValueCountFrequency (%)
I 53599
50.0%
D 53599
50.0%

Most occurring scripts

ValueCountFrequency (%)
Common 214396
66.7%
Latin 107198
33.3%

Most frequent character per script

Common
ValueCountFrequency (%)
3 30301
14.1%
0 29498
13.8%
2 29272
13.7%
1 29216
13.6%
7 16459
7.7%
4 16396
7.6%
6 16312
7.6%
8 15797
7.4%
9 15598
7.3%
5 15547
7.3%
Latin
ValueCountFrequency (%)
I 53599
50.0%
D 53599
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 321594
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
I 53599
16.7%
D 53599
16.7%
3 30301
9.4%
0 29498
9.2%
2 29272
9.1%
1 29216
9.1%
7 16459
 
5.1%
4 16396
 
5.1%
6 16312
 
5.1%
8 15797
 
4.9%
Other values (2) 31145
9.7%
Distinct210
Distinct (%)0.4%
Missing0
Missing (%)0.0%
Memory size418.9 KiB
2024-12-06T11:24:34.564292image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/

Length

Max length5
Median length5
Mean length4.8323663
Min length4

Characters and Unicode

Total characters259,010
Distinct characters12
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)0.1%

Sample

1st rowO1_1
2nd rowO12_1
3rd rowO26_1
4th rowO31_1
5th rowO45_1
ValueCountFrequency (%)
o45_1 3575
 
6.7%
o26_1 3482
 
6.5%
o1_1 3433
 
6.4%
o31_1 3230
 
6.0%
o12_1 3059
 
5.7%
o8_1 2965
 
5.5%
o45_2 2523
 
4.7%
o24_1 2351
 
4.4%
o33_1 1677
 
3.1%
o13_1 1586
 
3.0%
Other values (200) 25718
48.0%
2024-12-06T11:24:34.910773image/svg+xmlMatplotlib v3.9.2, https://matplotlib.org/