Overview
| Metric | Value |
|---|---|
| Rows | 97,009 |
| Columns | 23 |
| Missing Values | 1.61% (36,018 total) |
| Duplicate Rows | 8 |
| Memory Usage | 25.63 MB |
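The missing-value percentage in the overview appears to be computed over all cells (rows times columns) rather than over rows. The short check below reproduces it from the reported counts; the function name `missing_cell_pct` is illustrative and not part of the profiling tool.

```python
def missing_cell_pct(missing_cells: int, n_rows: int, n_cols: int) -> float:
    """Percentage of all cells (rows x columns) that are missing."""
    return 100.0 * missing_cells / (n_rows * n_cols)


# Figures taken from the Overview: 36,018 missing cells, 97,009 rows, 23 columns.
pct = missing_cell_pct(36_018, 97_009, 23)
print(f"{pct:.2f}%")  # 1.61%
```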
Data Quality
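Missing Values by Column
Counts are the 97,009 rows minus each column's non-null Count in Summary Statistics; the three columns below account for all 36,018 missing values.
| Column | Missing | % |
|---|---|---|
| risk_factor | 34,346 | 35.40% |
| C_previous | 836 | 0.86% |
| duration_previous | 836 | 0.86% |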
Outlier Summary
| Column | Outliers | % | Lower Bound | Upper Bound |
|---|---|---|---|---|
| shopping_pt | 50 | 0.05% | 0.50 | 12.50 |
| group_size | 21,359 | 22.02% | 1.00 | 1.00 |
| car_age | 913 | 0.94% | -10.50 | 25.50 |
| married_couple | 20,536 | 21.17% | 0.00 | 0.00 |
| A | 37,427 | 38.58% | 1.00 | 1.00 |
| cost | 700 | 0.72% | 518.00 | 750.00 |
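The bounds in the outlier table are consistent with the usual 1.5 × IQR fences applied to each column's quartiles: for shopping_pt, Q1 = 5 and Q3 = 8 give 5 − 4.5 = 0.50 and 8 + 4.5 = 12.50. The sketch below reproduces the reported bounds under that assumption. The multiplier `k=1.5` and the linear-interpolation quantile are inferred from the numbers, not confirmed by the report, and the helper names are illustrative. It also shows why group_size, married_couple, and A have such high outlier rates: when Q1 equals Q3 the IQR is zero, the fences collapse to a single value, and every other value is flagged.

```python
def quantile(sorted_vals, q):
    """Quantile by linear interpolation between the two nearest order statistics."""
    pos = q * (len(sorted_vals) - 1)
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (pos - lo) * (sorted_vals[hi] - sorted_vals[lo])


def fences_from_quartiles(q1, q3, k=1.5):
    """Lower and upper outlier fences: Q1 - k*IQR and Q3 + k*IQR."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def count_outliers(values, k=1.5):
    """Number of values strictly outside the IQR fences."""
    vals = sorted(values)
    lower, upper = fences_from_quartiles(quantile(vals, 0.25), quantile(vals, 0.75), k)
    return sum(1 for v in values if v < lower or v > upper)


# Quartiles as reported in the box-plot statistics of this report.
print(fences_from_quartiles(5, 8))      # shopping_pt -> (0.5, 12.5)
print(fences_from_quartiles(3, 12))     # car_age     -> (-10.5, 25.5)
print(fences_from_quartiles(605, 663))  # cost        -> (518.0, 750.0)
print(fences_from_quartiles(1, 1))      # group_size  -> (1.0, 1.0)

# Toy column (not from the dataset): zero IQR flags every value that is not 1.
toy = [1] * 8 + [2, 3]
print(count_outliers(toy))  # 2
```

For discrete or binary columns such as these, the flagged share is simply the share of rows that differ from the dominant value (for example, 21.17% for married_couple, matching its mean of 0.21), so it is better read as a class frequency than as a count of anomalies.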
Distributions
Distribution and box plot per numeric column (charts not reproduced). The box-plot statistics are listed below. Min and Max are the whisker ends, so values beyond the Outlier Summary bounds are excluded and these figures can differ from Min and Max in Summary Statistics.
| Column | Min | Q1 | Med | Q3 | Max |
|---|---|---|---|---|---|
| shopping_pt | 3.00 | 5.00 | 7.00 | 8.00 | 12.00 |
| record_type | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| day | 0.00 | 1.00 | 2.00 | 3.00 | 6.00 |
| location | 10001.00 | 10937.00 | 12031.00 | 13429.00 | 16580.00 |
| group_size | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| homeowner | 0.00 | 0.00 | 1.00 | 1.00 | 1.00 |
| car_age | 0.00 | 3.00 | 8.00 | 12.00 | 25.00 |
| risk_factor | 1.00 | 2.00 | 3.00 | 4.00 | 4.00 |
| age_oldest | 18.00 | 29.00 | 44.00 | 60.00 | 75.00 |
| age_youngest | 16.00 | 26.00 | 40.00 | 57.00 | 75.00 |
| married_couple | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| C_previous | 1.00 | 1.00 | 3.00 | 3.00 | 4.00 |
| duration_previous | 0.00 | 2.00 | 5.00 | 9.00 | 15.00 |
| A | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| B | 0.00 | 0.00 | 0.00 | 1.00 | 1.00 |
| C | 1.00 | 1.00 | 2.00 | 3.00 | 4.00 |
| D | 1.00 | 2.00 | 3.00 | 3.00 | 3.00 |
| E | 0.00 | 0.00 | 0.00 | 1.00 | 1.00 |
| F | 0.00 | 0.00 | 1.00 | 2.00 | 3.00 |
| G | 1.00 | 2.00 | 2.00 | 3.00 | 4.00 |
| cost | 518.00 | 605.00 | 634.00 | 663.00 | 750.00 |
Summary Statistics
| Column | Count | Mean | Std | Min | Median | Max | Skewness | Excess Kurtosis |
|---|---|---|---|---|---|---|---|---|
| shopping_pt | 97,009 | 6.86 | 2.00 | 3.00 | 7.00 | 13.00 | -0.04 | -0.54 |
| record_type | 97,009 | 1.00 | 0.00 | 1.00 | 1.00 | 1.00 | 0.00 | 0.00 |
| day | 97,009 | 2.08 | 1.47 | 0.00 | 2.00 | 6.00 | 0.04 | -1.22 |
| location | 97,009 | 12272.87 | 1564.61 | 10001.00 | 12031.00 | 16580.00 | 0.48 | -0.74 |
| group_size | 97,009 | 1.24 | 0.46 | 1.00 | 1.00 | 4.00 | 1.81 | 2.86 |
| homeowner | 97,009 | 0.55 | 0.50 | 0.00 | 1.00 | 1.00 | -0.18 | -1.97 |
| car_age | 97,009 | 8.18 | 5.80 | 0.00 | 8.00 | 85.00 | 1.22 | 4.12 |
| risk_factor | 62,663 | 2.56 | 1.12 | 1.00 | 3.00 | 4.00 | -0.10 | -1.35 |
| age_oldest | 97,009 | 45.18 | 17.39 | 18.00 | 44.00 | 75.00 | 0.21 | -1.23 |
| age_youngest | 97,009 | 42.68 | 17.49 | 16.00 | 40.00 | 75.00 | 0.35 | -1.15 |
| married_couple | 97,009 | 0.21 | 0.41 | 0.00 | 0.00 | 1.00 | 1.41 | -0.01 |
| C_previous | 96,173 | 2.45 | 1.03 | 1.00 | 3.00 | 4.00 | -0.19 | -1.19 |
| duration_previous | 96,173 | 6.09 | 4.70 | 0.00 | 5.00 | 15.00 | 0.72 | -0.70 |
| A | 97,009 | 0.95 | 0.62 | 0.00 | 1.00 | 2.00 | 0.04 | -0.40 |
| B | 97,009 | 0.48 | 0.50 | 0.00 | 0.00 | 1.00 | 0.10 | -1.99 |
| C | 97,009 | 2.29 | 1.00 | 1.00 | 2.00 | 4.00 | -0.01 | -1.23 |
| D | 97,009 | 2.52 | 0.71 | 1.00 | 3.00 | 3.00 | -1.13 | -0.16 |
| E | 97,009 | 0.46 | 0.50 | 0.00 | 0.00 | 1.00 | 0.15 | -1.98 |
| F | 97,009 | 1.17 | 0.95 | 0.00 | 1.00 | 3.00 | 0.07 | -1.23 |
| G | 97,009 | 2.28 | 0.89 | 1.00 | 2.00 | 4.00 | 0.16 | -0.76 |
| cost | 97,009 | 634.47 | 43.07 | 263.00 | 634.00 | 839.00 | -0.07 | 0.96 |
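The kurtosis column appears to report excess kurtosis (kurtosis minus 3): the near-balanced binary columns homeowner, B, and E sit close to −2, which is the excess kurtosis of a 50/50 binary variable. The sketch below uses plain population moments to illustrate this. The profiling tool may use bias-corrected sample estimators instead, so small differences are expected. Returning 0 for a constant column is an assumption chosen to match the record_type row.

```python
def skew_and_excess_kurtosis(values):
    """Population skewness and excess kurtosis from central moments."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    if m2 == 0:
        # Constant column: both statistics are undefined; report 0 by convention.
        return 0.0, 0.0
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return m3 / m2 ** 1.5, m4 / m2 ** 2 - 3.0


# Toy data (not from the dataset): a perfectly balanced binary column.
balanced = [0, 1] * 50
print(skew_and_excess_kurtosis(balanced))  # (0.0, -2.0)

# A constant column, like record_type in this report.
print(skew_and_excess_kurtosis([1] * 10))  # (0.0, 0.0)
```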
Correlations
Pearson Correlation Matrix (heatmap not reproduced; color scale runs from -1 to +1)
Top Correlated Pairs
| Variable 1 | Variable 2 | Correlation |
|---|---|---|
| age_oldest | age_youngest | 0.9105 |
| group_size | married_couple | 0.7621 |
| C_previous | C | 0.7054 |
| C | D | 0.6083 |
| A | F | 0.5276 |
| car_age | A | -0.5137 |
| C_previous | D | 0.4483 |
| homeowner | age_oldest | 0.4113 |
| B | E | 0.3994 |
| homeowner | age_youngest | 0.3594 |
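The pair list appears to be ranked by absolute correlation, since car_age and A (−0.5137) sit between 0.5276 and 0.4483. A minimal sketch of that ranking follows, assuming complete columns with no missing values. The functions `pearson` and `top_pairs` and the toy columns are hypothetical and are not taken from the profiling tool or the dataset. Constant columns (such as record_type here) have an undefined correlation and are skipped.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient; NaN if either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / sqrt(sxx * syy)


def top_pairs(columns, k=10):
    """Return up to k (name1, name2, r) tuples sorted by |r|, largest first."""
    pairs = []
    for (name_a, col_a), (name_b, col_b) in combinations(columns.items(), 2):
        r = pearson(col_a, col_b)
        if r == r:  # NaN is the only value not equal to itself; skip it
            pairs.append((name_a, name_b, r))
    return sorted(pairs, key=lambda p: abs(p[2]), reverse=True)[:k]


# Toy columns, purely illustrative.
toy = {
    "x": [1, 2, 3, 4, 5],
    "y": [2, 4, 6, 8, 10],  # perfectly correlated with x
    "z": [5, 3, 4, 1, 2],   # negatively correlated with x and y
    "const": [1, 1, 1, 1, 1],
}
for name_a, name_b, r in top_pairs(toy):
    print(f"{name_a:>5} {name_b:>5} {r:+.4f}")
```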
Cramér's V (Categorical Associations)
| Variable 1 | Variable 2 | Cramér's V |
|---|---|---|
| state | car_value | 0.0474 |
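Cramér's V rescales the chi-squared statistic of a contingency table to the range 0 to 1, so the 0.0474 reported for state and car_value indicates a very weak association. The sketch below computes the uncorrected form, V = sqrt(χ² / (n · (min(r, c) − 1))), on toy data. Whether the profiling tool applies a bias correction is not stated in the report, so treat this as one plausible definition rather than the tool's exact method.

```python
from collections import Counter
from math import sqrt


def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two categorical sequences of equal length."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    for a, row_n in row_totals.items():
        for b, col_n in col_totals.items():
            expected = row_n * col_n / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return float("nan")  # undefined when either variable has one category
    return sqrt(chi2 / (n * k))


# Toy data, purely illustrative.
print(cramers_v(["a", "a", "b", "b"], ["u", "u", "v", "v"]))  # 1.0 (perfect association)
print(cramers_v(["a", "a", "b", "b"], ["u", "v", "u", "v"]))  # 0.0 (independent)
```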