Dataset statistics
Number of variables | 17 |
---|---|
Number of observations | 383 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 16 |
Duplicate rows (%) | 4.2% |
Total size in memory | 51.0 KiB |
Average record size in memory | 136.3 B |
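The percentage rows in this table follow directly from the counts. A minimal sketch, assuming the report rounds to one decimal place; the helper name `pct` is hypothetical and not part of ydata-profiling:

```python
def pct(count: int, total: int) -> str:
    """Format count/total as a percentage with one decimal place."""
    return f"{100 * count / total:.1f}%"


n_variables = 17       # "Number of variables"
n_observations = 383   # "Number of observations"
n_duplicates = 16      # "Duplicate rows"
n_missing = 0          # "Missing cells"

print(pct(n_duplicates, n_observations))            # 4.2%
print(pct(n_missing, n_variables * n_observations)) # 0.0%
```

The same helper reproduces the Frequency (%) columns further down, for example `pct(71, 383)` for the 71 male records.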
Variable types
Numeric | 1 |
---|---|
Categorical | 12 |
Boolean | 4 |
Alerts
Dataset has 16 (4.2%) duplicate rows | Duplicates |
Hx Smoking is highly imbalanced (62.3%) | Imbalance |
Hx Radiothreapy is highly imbalanced (86.8%) | Imbalance |
Thyroid Function is highly imbalanced (64.9%) | Imbalance |
M is highly imbalanced (72.6%) | Imbalance |
Stage is highly imbalanced (67.9%) | Imbalance |
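The imbalance percentages in these alerts appear consistent with one minus the normalized Shannon entropy of the class counts. This is an assumption inferred from the numbers, not something the report states. A minimal sketch with a hypothetical helper `imbalance_score`, using the value counts listed later for Hx Smoking and Hx Radiothreapy:

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k): H is the Shannon entropy in bits of the
    class proportions, k is the number of classes."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Value counts copied from the variable sections of this report.
hx_smoking = [355, 28]
hx_radiothreapy = [376, 7]

print(f"{imbalance_score(hx_smoking):.1%}")       # 62.3%
print(f"{imbalance_score(hx_radiothreapy):.1%}")  # 86.8%
```

A perfectly balanced variable scores 0 and a variable dominated by one class approaches 1, which is why Hx Radiothreapy (7 of 383 positive) scores highest.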
Reproduction
Analysis started | 2024-06-05 22:15:53.096619 |
---|---|
Analysis finished | 2024-06-05 22:15:53.725874 |
Duration | 0.63 seconds |
Software version | ydata-profiling v4.8.3 |
Download configuration | config.json |
Age
Real number (ℝ)
Distinct | 65 |
---|---|
Distinct (%) | 17.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40.866841 |
Minimum | 15 |
Maximum | 82 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 3.1 KiB |
Quantile statistics
Minimum | 15 |
---|---|
5-th percentile | 21 |
Q1 | 29 |
median | 37 |
Q3 | 51 |
95-th percentile | 70 |
Maximum | 82 |
Range | 67 |
Interquartile range (IQR) | 22 |
Descriptive statistics
Standard deviation | 15.134494 |
---|---|
Coefficient of variation (CV) | 0.37033677 |
Kurtosis | -0.27154098 |
Mean | 40.866841 |
Median Absolute Deviation (MAD) | 10 |
Skewness | 0.71973186 |
Sum | 15652 |
Variance | 229.0529 |
Monotonicity | Not monotonic |
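Several of the Age statistics above are tied together by simple identities, which gives a quick consistency check. A minimal sketch using only the figures printed in this report (skewness, kurtosis and MAD need the raw data and are not checked):

```python
n = 383                  # number of observations
total = 15652            # Sum
std = 15.134494          # Standard deviation
minimum, maximum = 15, 82
q1, q3 = 29, 51

mean = total / n                 # should match the reported Mean
cv = std / mean                  # Coefficient of variation (CV)
variance = std ** 2              # Variance
value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range (IQR)

print(round(mean, 6), round(cv, 6), round(variance, 4), value_range, iqr)
```

The mean (40.87) sits above the median (37), which agrees with the positive skewness reported above.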
Common Values
Value | Count | Frequency (%) |
31 | 22 | 5.7% |
27 | 13 | 3.4% |
40 | 12 | 3.1% |
26 | 12 | 3.1% |
28 | 12 | 3.1% |
35 | 12 | 3.1% |
30 | 12 | 3.1% |
33 | 12 | 3.1% |
34 | 11 | 2.9% |
29 | 11 | 2.9% |
Other values (55) | 254 | 66.3% |
Minimum 10 values
Value | Count | Frequency (%) |
15 | 1 | 0.3% |
17 | 3 | 0.8% |
18 | 1 | 0.3% |
19 | 2 | 0.5% |
20 | 6 | 1.6% |
21 | 9 | 2.3% |
22 | 6 | 1.6% |
23 | 6 | 1.6% |
24 | 8 | 2.1% |
25 | 7 | 1.8% |
Maximum 10 values
Value | Count | Frequency (%) |
82 | 1 | 0.3% |
81 | 2 | 0.5% |
80 | 3 | 0.8% |
79 | 1 | 0.3% |
78 | 1 | 0.3% |
76 | 1 | 0.3% |
75 | 2 | 0.5% |
73 | 3 | 0.8% |
72 | 3 | 0.8% |
71 | 2 | 0.5% |
Gender
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.1 KiB |
Common Values
Value | Count | Frequency (%) |
F | 312 | 81.5% |
M | 71 | 18.5% |
Words
Value | Count | Frequency (%) |
f | 312 | 81.5% |
m | 71 | 18.5% |
Most occurring characters
Value | Count | Frequency (%) |
F | 312 | 81.5% |
M | 71 | 18.5% |
Smoking
Boolean
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 515.0 B |
Value | Count | Frequency (%) |
False | 334 | 87.2% |
True | 49 | 12.8% |
Hx Smoking
Boolean
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 515.0 B |
Value | Count | Frequency (%) |
False | 355 | 92.7% |
True | 28 | 7.3% |
Hx Radiothreapy
Boolean
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 515.0 B |
Value | Count | Frequency (%) |
False | 376 | 98.2% |
True | 7 | 1.8% |
Thyroid Function
Categorical
IMBALANCE
Distinct | 5 |
---|---|
Distinct (%) | 1.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.1 KiB |
Length
Max length | 27 |
---|---|
Median length | 9 |
Mean length | 11.078329 |
Min length | 9 |
Characters and Unicode
Total characters | 4243 |
---|---|
Distinct characters | 22 |
Distinct categories | 1 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
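The Length and Characters figures for Thyroid Function can be re-derived from the category counts alone. A minimal sketch, assuming the counts from this variable's Common Values table (332, 20, 14, 12, 5):

```python
from statistics import median

# Category counts copied from the Common Values table for Thyroid Function.
value_counts = {
    "Euthyroid": 332,
    "Clinical Hyperthyroidism": 20,
    "Subclinical Hypothyroidism": 14,
    "Clinical Hypothyroidism": 12,
    "Subclinical Hyperthyroidism": 5,
}

# Expand to one string length per observation.
lengths = [len(value) for value, count in value_counts.items() for _ in range(count)]

total_characters = sum(lengths)
mean_length = total_characters / len(lengths)
distinct_characters = len(set("".join(value_counts)))

print(max(lengths), median(lengths), round(mean_length, 6), min(lengths))
print(total_characters, distinct_characters)
```

Because 332 of the 383 values are the nine-character string "Euthyroid", both the median and the minimum length are 9 while the mean is pulled up to about 11.08.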
Sample
1st row | Euthyroid |
---|---|
2nd row | Euthyroid |
3rd row | Euthyroid |
4th row | Euthyroid |
5th row | Euthyroid |
Common Values
Value | Count | Frequency (%) |
Euthyroid | 332 | 86.7% |
Clinical Hyperthyroidism | 20 | 5.2% |
Subclinical Hypothyroidism | 14 | 3.7% |
Clinical Hypothyroidism | 12 | 3.1% |
Subclinical Hyperthyroidism | 5 | 1.3% |
Words
Value | Count | Frequency (%) |
euthyroid | 332 | 76.5% |
clinical | 32 | 7.4% |
hypothyroidism | 26 | 6.0% |
hyperthyroidism | 25 | 5.8% |
subclinical | 19 | 4.4% |
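The lowercase tokens in the table above look like the result of lowercasing each value and splitting it on whitespace, with frequencies taken over all tokens rather than over rows. That tokenization is an assumption inferred from the counts. A minimal sketch:

```python
from collections import Counter

# Category counts copied from the Common Values table for Thyroid Function.
value_counts = {
    "Euthyroid": 332,
    "Clinical Hyperthyroidism": 20,
    "Subclinical Hypothyroidism": 14,
    "Clinical Hypothyroidism": 12,
    "Subclinical Hyperthyroidism": 5,
}

word_counts = Counter()
for value, count in value_counts.items():
    for word in value.lower().split():  # whitespace tokens, lowercased
        word_counts[word] += count

total_words = sum(word_counts.values())
for word, count in word_counts.most_common():
    print(f"{word} | {count} | {100 * count / total_words:.1f}% |")
```

The denominator here is 434 tokens, not 383 rows, which is why "clinical" (32 occurrences) shows 7.4%.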
Most occurring characters
Value | Count | Frequency (%) |
i | 536 | 12.6% |
y | 434 | 10.2% |
o | 409 | 9.6% |
r | 408 | 9.6% |
t | 383 | 9.0% |
h | 383 | 9.0% |
d | 383 | 9.0% |
u | 351 | 8.3% |
E | 332 | 7.8% |
l | 102 | 2.4% |
Other values (12) | 522 | 12.3% |
Physical Examination
Categorical
Distinct | 5 |
---|---|
Distinct (%) | 1.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.1 KiB |
Length
Max length | 27 |
---|---|
Median length | 26 |
Mean length | 23.221932 |
Min length | 6 |
Characters and Unicode
Total characters | 8894 |
---|---|
Distinct characters | 21 |
Distinct categories | 1 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Single nodular goiter-left |
---|---|
2nd row | Multinodular goiter |
3rd row | Single nodular goiter-right |
4th row | Single nodular goiter-right |
5th row | Multinodular goiter |
Common Values
Value | Count | Frequency (%) |
Multinodular goiter | 140 | 36.6% |
Single nodular goiter-right | 140 | 36.6% |
Single nodular goiter-left | 89 | 23.2% |
Normal | 7 | 1.8% |
Diffuse goiter | 7 | 1.8% |
Words
Value | Count | Frequency (%) |
single | 229 | 23.2% |
nodular | 229 | 23.2% |
goiter | 147 | 14.9% |
multinodular | 140 | 14.2% |
goiter-right | 140 | 14.2% |
goiter-left | 89 | 9.0% |
normal | 7 | 0.7% |
diffuse | 7 | 0.7% |
Most occurring characters
Value | Count | Frequency (%) |
i | 892 | 10.0% |
r | 892 | 10.0% |
l | 834 | 9.4% |
o | 752 | 8.5% |
t | 745 | 8.4% |
g | 745 | 8.4% |
e | 701 | 7.9% |
(space) | 605 | 6.8% |
n | 598 | 6.7% |
u | 516 | 5.8% |
Other values (11) | 1614 | 18.1% |
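The character table above can be reproduced by counting every character of every value, weighted by how often the value occurs. A minimal sketch using the Physical Examination counts from this report; it also shows that the row with count 605 corresponds to the space character:

```python
from collections import Counter

# Category counts copied from the Common Values table for Physical Examination.
value_counts = {
    "Multinodular goiter": 140,
    "Single nodular goiter-right": 140,
    "Single nodular goiter-left": 89,
    "Normal": 7,
    "Diffuse goiter": 7,
}

char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:  # case-sensitive, spaces and hyphens included
        char_counts[ch] += count

total_characters = sum(char_counts.values())
for ch, count in char_counts.most_common(10):
    label = "(space)" if ch == " " else ch
    print(f"{label} | {count} | {100 * count / total_characters:.1f}% |")
```

Characters with equal counts (for example "i" and "r") may print in a different order than in the report, since ties are not ordered in any documented way here.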
Adenopathy
Categorical
Distinct | 6 |
---|---|
Distinct (%) | 1.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.1 KiB |
Common Values
Value | Count | Frequency (%) |
No | 277 | 72.3% |
Right | 48 | 12.5% |
Bilateral | 32 | 8.4% |
Left | 17 | 4.4% |
Extensive | 7 | 1.8% |
Posterior | 2 | 0.5% |
Words
Value | Count | Frequency (%) |
no | 277 | 72.3% |
right | 48 | 12.5% |
bilateral | 32 | 8.4% |
left | 17 | 4.4% |
extensive | 7 | 1.8% |
posterior | 2 | 0.5% |
Most occurring characters
Value | Count | Frequency (%) |
o | 281 | 22.8% |
N | 277 | 22.5% |
t | 106 | 8.6% |
i | 89 | 7.2% |
e | 65 | 5.3% |
l | 64 | 5.2% |
a | 64 | 5.2% |
h | 48 | 3.9% |
g | 48 | 3.9% |
R | 48 | 3.9% |
Other values (10) | 141 | 11.5% |
Pathology
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.1 KiB |
Length
Max length | 14 |
---|---|
Median length | 9 |
Mean length | 9.8563969 |
Min length | 9 |
Characters and Unicode
Total characters | 3775 |
---|---|
Distinct characters | 17 |
Distinct categories | 1 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Micropapillary |
---|---|
2nd row | Micropapillary |
3rd row | Micropapillary |
4th row | Micropapillary |
5th row | Micropapillary |
Common Values
Value | Count | Frequency (%) |
Papillary | 287 | 74.9% |
Micropapillary | 48 | 12.5% |
Follicular | 28 | 7.3% |
Hurthel cell | 20 | 5.2% |
Words
Value | Count | Frequency (%) |
papillary | 287 | 71.2% |
micropapillary | 48 | 11.9% |
follicular | 28 | 6.9% |
hurthel | 20 | 5.0% |
cell | 20 | 5.0% |
Most occurring characters
Value | Count | Frequency (%) |
l | 814 | 21.6% |
a | 698 | 18.5% |
r | 431 | 11.4% |
i | 411 | 10.9% |
p | 383 | 10.1% |
y | 335 | 8.9% |
P | 287 | 7.6% |
c | 96 | 2.5% |
o | 76 | 2.0% |
M | 48 | 1.3% |
Other values (7) | 196 | 5.2% |
Focality
Categorical
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.1 KiB |
Length
Max length | 11 |
---|---|
Median length | 9 |
Mean length | 9.7101828 |
Min length | 9 |
Characters and Unicode
Total characters | 3719 |
---|---|
Distinct characters | 12 |
Distinct categories | 1 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Uni-Focal |
---|---|
2nd row | Uni-Focal |
3rd row | Uni-Focal |
4th row | Uni-Focal |
5th row | Multi-Focal |
Common Values
Value | Count | Frequency (%) |
Uni-Focal | 247 | 64.5% |
Multi-Focal | 136 | 35.5% |
Words
Value | Count | Frequency (%) |
uni-focal | 247 | 64.5% |
multi-focal | 136 | 35.5% |
Most occurring characters
Value | Count | Frequency (%) |
l | 519 | 14.0% |
i | 383 | 10.3% |
- | 383 | 10.3% |
F | 383 | 10.3% |
o | 383 | 10.3% |
c | 383 | 10.3% |
a | 383 | 10.3% |
U | 247 | 6.6% |
n | 247 | 6.6% |
M | 136 | 3.7% |
Other values (2) | 272 | 7.3% |
Risk
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.1 KiB |
Common Values
Value | Count | Frequency (%) |
Low | 249 | 65.0% |
Intermediate | 102 | 26.6% |
High | 32 | 8.4% |
Words
Value | Count | Frequency (%) |
low | 249 | 65.0% |
intermediate | 102 | 26.6% |
high | 32 | 8.4% |
Most occurring characters
Value | Count | Frequency (%) |
e | 306 | 14.6% |
L | 249 | 11.9% |
o | 249 | 11.9% |
w | 249 | 11.9% |
t | 204 | 9.7% |
i | 134 | 6.4% |
I | 102 | 4.9% |
n | 102 | 4.9% |
r | 102 | 4.9% |
m | 102 | 4.9% |
Other values (5) | 300 | 14.3% |
T
Categorical
Distinct | 7 |
---|---|
Distinct (%) | 1.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.1 KiB |
Common Values
Value | Count | Frequency (%) |
T2 | 151 | 39.4% |
T3a | 96 | 25.1% |
T1a | 49 | 12.8% |
T1b | 43 | 11.2% |
T4a | 20 | 5.2% |
T3b | 16 | 4.2% |
T4b | 8 | 2.1% |
Words
Value | Count | Frequency (%) |
t2 | 151 | 39.4% |
t3a | 96 | 25.1% |
t1a | 49 | 12.8% |
t1b | 43 | 11.2% |
t4a | 20 | 5.2% |
t3b | 16 | 4.2% |
t4b | 8 | 2.1% |
Most occurring characters
Value | Count | Frequency (%) |
T | 383 | 38.4% |
a | 165 | 16.5% |
2 | 151 | 15.1% |
3 | 112 | 11.2% |
1 | 92 | 9.2% |
b | 67 | 6.7% |
4 | 28 | 2.8% |
N
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.1 KiB |
Common Values
Value | Count | Frequency (%) |
N0 | 268 | 70.0% |
N1b | 93 | 24.3% |
N1a | 22 | 5.7% |
Words
Value | Count | Frequency (%) |
n0 | 268 | 70.0% |
n1b | 93 | 24.3% |
n1a | 22 | 5.7% |
Most occurring characters
Value | Count | Frequency (%) |
N | 383 | 43.5% |
0 | 268 | 30.4% |
1 | 115 | 13.1% |
b | 93 | 10.6% |
a | 22 | 2.5% |
M
Categorical
IMBALANCE
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.1 KiB |
Common Values
Value | Count | Frequency (%) |
M0 | 365 | 95.3% |
M1 | 18 | 4.7% |
Words
Value | Count | Frequency (%) |
m0 | 365 | 95.3% |
m1 | 18 | 4.7% |
Most occurring characters
Value | Count | Frequency (%) |
M | 383 | 50.0% |
0 | 365 | 47.7% |
1 | 18 | 2.3% |
Stage
Categorical
IMBALANCE
Distinct | 5 |
---|---|
Distinct (%) | 1.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.1 KiB |
Common Values
Value | Count | Frequency (%) |
I | 333 | 86.9% |
II | 32 | 8.4% |
IVB | 11 | 2.9% |
III | 4 | 1.0% |
IVA | 3 | 0.8% |
Words
Value | Count | Frequency (%) |
i | 333 | 86.9% |
ii | 32 | 8.4% |
ivb | 11 | 2.9% |
iii | 4 | 1.0% |
iva | 3 | 0.8% |
Most occurring characters
Value | Count | Frequency (%) |
I | 423 | 93.8% |
V | 14 | 3.1% |
B | 11 | 2.4% |
A | 3 | 0.7% |
Response
Categorical
Distinct | 4 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.1 KiB |
Length
Max length | 22 |
---|---|
Median length | 9 |
Mean length | 13.26893 |
Min length | 9 |
Characters and Unicode
Total characters | 5082 |
---|---|
Distinct characters | 20 |
Distinct categories | 1 |
Distinct scripts | 1 |
Distinct blocks | 1 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Indeterminate |
---|---|
2nd row | Excellent |
3rd row | Excellent |
4th row | Excellent |
5th row | Excellent |
Common Values
Value | Count | Frequency (%) |
Excellent | 208 | 54.3% |
Structural Incomplete | 91 | 23.8% |
Indeterminate | 61 | 15.9% |
Biochemical Incomplete | 23 | 6.0% |
Words
Value | Count | Frequency (%) |
excellent | 208 | 41.9% |
incomplete | 114 | 22.9% |
structural | 91 | 18.3% |
indeterminate | 61 | 12.3% |
biochemical | 23 | 4.6% |
Most occurring characters
Value | Count | Frequency (%) |
e | 850 | 16.7% |
l | 644 | 12.7% |
t | 626 | 12.3% |
c | 459 | 9.0% |
n | 444 | 8.7% |
r | 243 | 4.8% |
E | 208 | 4.1% |
x | 208 | 4.1% |
m | 198 | 3.9% |
u | 182 | 3.6% |
Other values (10) | 1020 | 20.1% |
Recurred
Boolean
Distinct | 2 |
---|---|
Distinct (%) | 0.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 515.0 B |
Value | Count | Frequency (%) |
False | 275 | 71.8% |
True | 108 | 28.2% |
First rows
Index | Age | Gender | Smoking | Hx Smoking | Hx Radiothreapy | Thyroid Function | Physical Examination | Adenopathy | Pathology | Focality | Risk | T | N | M | Stage | Response | Recurred |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 27 | F | No | No | No | Euthyroid | Single nodular goiter-left | No | Micropapillary | Uni-Focal | Low | T1a | N0 | M0 | I | Indeterminate | No |
1 | 34 | F | No | Yes | No | Euthyroid | Multinodular goiter | No | Micropapillary | Uni-Focal | Low | T1a | N0 | M0 | I | Excellent | No |
2 | 30 | F | No | No | No | Euthyroid | Single nodular goiter-right | No | Micropapillary | Uni-Focal | Low | T1a | N0 | M0 | I | Excellent | No |
3 | 62 | F | No | No | No | Euthyroid | Single nodular goiter-right | No | Micropapillary | Uni-Focal | Low | T1a | N0 | M0 | I | Excellent | No |
4 | 62 | F | No | No | No | Euthyroid | Multinodular goiter | No | Micropapillary | Multi-Focal | Low | T1a | N0 | M0 | I | Excellent | No |
5 | 52 | M | Yes | No | No | Euthyroid | Multinodular goiter | No | Micropapillary | Multi-Focal | Low | T1a | N0 | M0 | I | Indeterminate | No |
6 | 41 | F | No | Yes | No | Clinical Hyperthyroidism | Single nodular goiter-right | No | Micropapillary | Uni-Focal | Low | T1a | N0 | M0 | I | Excellent | No |
7 | 46 | F | No | No | No | Euthyroid | Single nodular goiter-right | No | Micropapillary | Uni-Focal | Low | T1a | N0 | M0 | I | Excellent | No |
8 | 51 | F | No | No | No | Euthyroid | Single nodular goiter-right | No | Micropapillary | Uni-Focal | Low | T1a | N0 | M0 | I | Excellent | No |
9 | 40 | F | No | No | No | Euthyroid | Single nodular goiter-right | No | Micropapillary | Uni-Focal | Low | T1a | N0 | M0 | I | Excellent | No |
Last rows
Index | Age | Gender | Smoking | Hx Smoking | Hx Radiothreapy | Thyroid Function | Physical Examination | Adenopathy | Pathology | Focality | Risk | T | N | M | Stage | Response | Recurred |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
373 | 31 | M | Yes | No | Yes | Euthyroid | Single nodular goiter-left | Extensive | Papillary | Multi-Focal | High | T4a | N1b | M1 | II | Structural Incomplete | Yes |
374 | 62 | M | Yes | No | No | Euthyroid | Single nodular goiter-left | Left | Papillary | Multi-Focal | High | T4a | N1b | M1 | IVB | Structural Incomplete | Yes |
375 | 59 | F | No | No | No | Euthyroid | Multinodular goiter | Bilateral | Papillary | Multi-Focal | High | T4b | N1b | M0 | IVB | Structural Incomplete | Yes |
376 | 40 | M | Yes | No | No | Euthyroid | Multinodular goiter | Bilateral | Papillary | Multi-Focal | High | T4b | N1b | M0 | I | Structural Incomplete | Yes |
377 | 46 | M | Yes | No | No | Euthyroid | Single nodular goiter-left | Bilateral | Follicular | Uni-Focal | High | T4b | N1b | M1 | II | Structural Incomplete | Yes |
378 | 72 | M | Yes | Yes | Yes | Euthyroid | Single nodular goiter-right | Right | Papillary | Uni-Focal | High | T4b | N1b | M1 | IVB | Biochemical Incomplete | Yes |
379 | 81 | M | Yes | No | Yes | Euthyroid | Multinodular goiter | Extensive | Papillary | Multi-Focal | High | T4b | N1b | M1 | IVB | Structural Incomplete | Yes |
380 | 72 | M | Yes | Yes | No | Euthyroid | Multinodular goiter | Bilateral | Papillary | Multi-Focal | High | T4b | N1b | M1 | IVB | Structural Incomplete | Yes |
381 | 61 | M | Yes | Yes | Yes | Clinical Hyperthyroidism | Multinodular goiter | Extensive | Hurthel cell | Multi-Focal | High | T4b | N1b | M0 | IVA | Structural Incomplete | Yes |
382 | 67 | M | Yes | No | No | Euthyroid | Multinodular goiter | Bilateral | Papillary | Multi-Focal | High | T4b | N1b | M0 | IVA | Structural Incomplete | Yes |
Duplicate rows
Most frequently occurring
Index | Age | Gender | Smoking | Hx Smoking | Hx Radiothreapy | Thyroid Function | Physical Examination | Adenopathy | Pathology | Focality | Risk | T | N | M | Stage | Response | Recurred | # duplicates |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2 | 26 | F | No | No | No | Euthyroid | Multinodular goiter | No | Papillary | Uni-Focal | Low | T2 | N0 | M0 | I | Excellent | No | 4 |
6 | 32 | F | No | No | No | Euthyroid | Single nodular goiter-right | No | Papillary | Uni-Focal | Low | T2 | N0 | M0 | I | Excellent | No | 3 |
0 | 21 | F | No | No | No | Euthyroid | Single nodular goiter-right | No | Papillary | Uni-Focal | Low | T2 | N0 | M0 | I | Excellent | No | 2 |
1 | 22 | F | No | No | No | Euthyroid | Single nodular goiter-right | No | Papillary | Uni-Focal | Low | T2 | N0 | M0 | I | Excellent | No | 2 |
3 | 28 | F | No | No | No | Euthyroid | Single nodular goiter-right | No | Papillary | Uni-Focal | Low | T2 | N0 | M0 | I | Excellent | No | 2 |
4 | 29 | F | No | No | No | Euthyroid | Single nodular goiter-right | No | Papillary | Uni-Focal | Low | T1b | N0 | M0 | I | Excellent | No | 2 |
5 | 31 | F | No | No | No | Euthyroid | Single nodular goiter-right | No | Papillary | Uni-Focal | Low | T2 | N0 | M0 | I | Excellent | No | 2 |
7 | 34 | F | No | No | No | Euthyroid | Multinodular goiter | No | Papillary | Uni-Focal | Low | T2 | N0 | M0 | I | Excellent | No | 2 |
8 | 35 | F | No | No | No | Euthyroid | Single nodular goiter-right | No | Papillary | Uni-Focal | Low | T1b | N0 | M0 | I | Excellent | No | 2 |
9 | 36 | F | No | No | No | Euthyroid | Single nodular goiter-right | No | Micropapillary | Uni-Focal | Low | T1a | N0 | M0 | I | Excellent | No | 2 |
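The "# duplicates" column appears to give how many times each distinct row occurs in the dataset. A minimal sketch of that kind of grouping, using toy rows with hypothetical values rather than the real records; the report does not say whether its headline figure of 16 counts duplicate groups or surplus rows, so both are computed:

```python
from collections import Counter

# Toy rows standing in for dataset records (hypothetical values).
rows = [
    (26, "F", "Papillary"),
    (26, "F", "Papillary"),
    (26, "F", "Papillary"),
    (32, "F", "Papillary"),
    (32, "F", "Papillary"),
    (40, "M", "Follicular"),
]

counts = Counter(rows)

# Keep only rows that occur more than once.
duplicate_groups = {row: n for row, n in counts.items() if n > 1}

# Rows that would be dropped if only one copy of each group were kept.
surplus_rows = sum(n - 1 for n in duplicate_groups.values())

for row, n in sorted(duplicate_groups.items(), key=lambda item: -item[1]):
    print(row, n)
print(len(duplicate_groups), surplus_rows)
```

In a clinical dataset without patient identifiers, identical rows may be distinct patients with the same profile, so dropping duplicates is a modelling decision rather than a pure cleaning step.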