Dashboard

Overview

Rows: 97,009
Columns: 23
Missing Values: 1.61% (36,018 total)
Duplicate Rows: 8
Memory Usage: 25.63 MB
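The missing-value figures can be cross-checked against the Count column under Summary Statistics. The sketch below assumes the percentage is computed over all cells (rows × columns), which the dashboard does not state. The function name `missing_summary` is illustrative, not part of any tool.

```python
def missing_summary(n_rows, n_columns, non_null_counts):
    """Return (missing per column, total missing cells, percent of all cells)."""
    missing_by_column = {
        name: n_rows - count
        for name, count in non_null_counts.items()
        if count < n_rows
    }
    total = sum(missing_by_column.values())
    pct = 100.0 * total / (n_rows * n_columns)
    return missing_by_column, total, pct


# Non-null counts copied from the Summary Statistics table; only the
# columns whose Count is below 97,009 are listed here.
counts = {"risk_factor": 62_663, "C_previous": 96_173, "duration_previous": 96_173}
by_column, total, pct = missing_summary(97_009, 23, counts)
print(by_column)         # {'risk_factor': 34346, 'C_previous': 836, 'duration_previous': 836}
print(total)             # 36018
print(f"{pct:.2f}%")     # 1.61%
```

Under that assumption the three columns account for all 36,018 missing cells, so the remaining columns would contribute none.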

Data Quality

Missing Values by Column

Outlier Summary

Column          Outliers  %       Lower Bound  Upper Bound
shopping_pt     50        0.05%   0.50         12.50
group_size      21,359    22.02%  1.00         1.00
car_age         913       0.94%   -10.50       25.50
married_couple  20,536    21.17%  0.00         0.00
A               37,427    38.58%  1.00         1.00
cost            700       0.72%   518.00       750.00
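The dashboard does not name its outlier rule, but the bounds above match Tukey's fences (Q1 - 1.5 × IQR, Q3 + 1.5 × IQR) when fed the quartiles shown in the box plot panels. A minimal sketch under that assumption; `tukey_fences` is an illustrative name:

```python
def tukey_fences(q1, q3, k=1.5):
    """Lower and upper outlier bounds from the quartiles, using k * IQR."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# (Q1, Q3) pairs read from the box plot panels of this dashboard
quartiles = {
    "shopping_pt": (5.0, 8.0),
    "group_size": (1.0, 1.0),
    "car_age": (3.0, 12.0),
    "cost": (605.0, 663.0),
}
for name, (q1, q3) in quartiles.items():
    lower, upper = tukey_fences(q1, q3)
    print(f"{name}: [{lower:.2f}, {upper:.2f}]")
# shopping_pt: [0.50, 12.50]
# group_size: [1.00, 1.00]
# car_age: [-10.50, 25.50]
# cost: [518.00, 750.00]
```

When Q1 equals Q3, as for group_size, married_couple and A, the IQR is zero and both fences collapse onto a single value, so every row with a different value is flagged. That would explain outlier rates above 20% for those columns. It would also explain why the box plot Min and Max differ from the Summary Statistics Min and Max, if the panels report whisker ends clipped to the fences.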

Distributions

shopping_pt — Distribution

shopping_pt — Box Plot

Min: 3.00, Q1: 5.00, Med: 7.00, Q3: 8.00, Max: 12.00

record_type — Distribution

record_type — Box Plot

Min: 1.00, Q1: 1.00, Med: 1.00, Q3: 1.00, Max: 1.00

day — Distribution

day — Box Plot

Min: 0.00, Q1: 1.00, Med: 2.00, Q3: 3.00, Max: 6.00

location — Distribution

location — Box Plot

Min: 10001.00, Q1: 10937.00, Med: 12031.00, Q3: 13429.00, Max: 16580.00

group_size — Distribution

group_size — Box Plot

Min: 1.00, Q1: 1.00, Med: 1.00, Q3: 1.00, Max: 1.00

homeowner — Distribution

homeowner — Box Plot

Min: 0.00, Q1: 0.00, Med: 1.00, Q3: 1.00, Max: 1.00

car_age — Distribution

car_age — Box Plot

Min: 0.00, Q1: 3.00, Med: 8.00, Q3: 12.00, Max: 25.00

risk_factor — Distribution

risk_factor — Box Plot

Min: 1.00, Q1: 2.00, Med: 3.00, Q3: 4.00, Max: 4.00

age_oldest — Distribution

age_oldest — Box Plot

Min: 18.00, Q1: 29.00, Med: 44.00, Q3: 60.00, Max: 75.00

age_youngest — Distribution

age_youngest — Box Plot

Min: 16.00, Q1: 26.00, Med: 40.00, Q3: 57.00, Max: 75.00

married_couple — Distribution

married_couple — Box Plot

Min: 0.00, Q1: 0.00, Med: 0.00, Q3: 0.00, Max: 0.00

C_previous — Distribution

C_previous — Box Plot

Min: 1.00, Q1: 1.00, Med: 3.00, Q3: 3.00, Max: 4.00

duration_previous — Distribution

duration_previous — Box Plot

Min: 0.00, Q1: 2.00, Med: 5.00, Q3: 9.00, Max: 15.00

A — Distribution

A — Box Plot

Min: 1.00, Q1: 1.00, Med: 1.00, Q3: 1.00, Max: 1.00

B — Distribution

B — Box Plot

Min: 0.00, Q1: 0.00, Med: 0.00, Q3: 1.00, Max: 1.00

C — Distribution

C — Box Plot

Min: 1.00, Q1: 1.00, Med: 2.00, Q3: 3.00, Max: 4.00

D — Distribution

D — Box Plot

Min: 1.00, Q1: 2.00, Med: 3.00, Q3: 3.00, Max: 3.00

E — Distribution

E — Box Plot

Min: 0.00, Q1: 0.00, Med: 0.00, Q3: 1.00, Max: 1.00

F — Distribution

F — Box Plot

Min: 0.00, Q1: 0.00, Med: 1.00, Q3: 2.00, Max: 3.00

G — Distribution

G — Box Plot

Min: 1.00, Q1: 2.00, Med: 2.00, Q3: 3.00, Max: 4.00

cost — Distribution

cost — Box Plot

Min: 518.00, Q1: 605.00, Med: 634.00, Q3: 663.00, Max: 750.00

Summary Statistics

Column             Count   Mean      Std      Min       Median    Max       Skew   Kurt
shopping_pt        97,009  6.86      2.00     3.00      7.00      13.00     -0.04  -0.54
record_type        97,009  1.00      0.00     1.00      1.00      1.00      0.00   0.00
day                97,009  2.08      1.47     0.00      2.00      6.00      0.04   -1.22
location           97,009  12272.87  1564.61  10001.00  12031.00  16580.00  0.48   -0.74
group_size         97,009  1.24      0.46     1.00      1.00      4.00      1.81   2.86
homeowner          97,009  0.55      0.50     0.00      1.00      1.00      -0.18  -1.97
car_age            97,009  8.18      5.80     0.00      8.00      85.00     1.22   4.12
risk_factor        62,663  2.56      1.12     1.00      3.00      4.00      -0.10  -1.35
age_oldest         97,009  45.18     17.39    18.00     44.00     75.00     0.21   -1.23
age_youngest       97,009  42.68     17.49    16.00     40.00     75.00     0.35   -1.15
married_couple     97,009  0.21      0.41     0.00      0.00      1.00      1.41   -0.01
C_previous         96,173  2.45      1.03     1.00      3.00      4.00      -0.19  -1.19
duration_previous  96,173  6.09      4.70     0.00      5.00      15.00     0.72   -0.70
A                  97,009  0.95      0.62     0.00      1.00      2.00      0.04   -0.40
B                  97,009  0.48      0.50     0.00      0.00      1.00      0.10   -1.99
C                  97,009  2.29      1.00     1.00      2.00      4.00      -0.01  -1.23
D                  97,009  2.52      0.71     1.00      3.00      3.00      -1.13  -0.16
E                  97,009  0.46      0.50     0.00      0.00      1.00      0.15   -1.98
F                  97,009  1.17      0.95     0.00      1.00      3.00      0.07   -1.23
G                  97,009  2.28      0.89     1.00      2.00      4.00      0.16   -0.76
cost               97,009  634.47    43.07    263.00    634.00    839.00    -0.07  0.96
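The negative Kurt values (for example -1.99 for B) show that the column reports excess kurtosis, where a normal distribution scores 0, since raw kurtosis cannot fall below 1. Which estimator the dashboard uses is not stated. The sketch below uses the plain moment-based definitions, which differ only negligibly from bias-corrected variants at this sample size. The function name is illustrative.

```python
def skew_and_excess_kurtosis(values):
    """Moment-based skewness and excess kurtosis (normal distribution = 0)."""
    n = len(values)
    if n == 0:
        raise ValueError("values must not be empty")
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    if m2 == 0:
        # Constant column: both statistics are undefined; report 0.0,
        # mirroring the record_type row of the table (an assumption).
        return 0.0, 0.0
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return m3 / m2 ** 1.5, m4 / m2 ** 2 - 3.0


# Synthetic binary column with mean 0.48, matching the Mean reported for B
binary = [1] * 48 + [0] * 52
skew, kurt = skew_and_excess_kurtosis(binary)
print(f"skew={skew:.2f} kurt={kurt:.2f}")
```

A binary column with mean near 0.5 always lands close to -2 on this scale, which is consistent with the rows for homeowner, B and E.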

Correlations

Pearson Correlation Matrix

[Heatmap of pairwise Pearson correlations among the 21 numeric columns, shopping_pt through cost; color scale from -1 to +1]

Top Correlated Pairs

Variable 1   Variable 2      Correlation
age_oldest   age_youngest    0.9105
group_size   married_couple  0.7621
C_previous   C               0.7054
C            D               0.6083
A            F               0.5276
car_age      A               -0.5137
C_previous   D               0.4483
homeowner    age_oldest      0.4113
B            E               0.3994
homeowner    age_youngest    0.3594
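The pairs above appear to be ranked by absolute correlation, since car_age/A at -0.5137 sits between 0.5276 and 0.4483. A minimal sketch of that ranking in plain Python follows. The helper names `pearson` and `top_pairs` and the toy data are illustrative, and missing values are assumed to have been dropped beforehand.

```python
import math
from itertools import combinations


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences; NaN if either is constant."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / math.sqrt(sxx * syy)


def top_pairs(columns, k=10):
    """Rank column pairs by |r|, skipping pairs with undefined correlation."""
    pairs = []
    for (a, xs), (b, ys) in combinations(columns.items(), 2):
        r = pearson(xs, ys)
        if not math.isnan(r):
            pairs.append((a, b, r))
    pairs.sort(key=lambda p: abs(p[2]), reverse=True)
    return pairs[:k]


# Toy data (hypothetical values, not taken from the dataset)
toy = {"x": [1, 2, 3, 4], "y": [2, 4, 6, 8], "z": [4, 3, 2, 2], "const": [1, 1, 1, 1]}
for a, b, r in top_pairs(toy):
    print(f"{a:6s} {b:6s} {r:+.4f}")
```

A constant column such as record_type has zero variance, so its correlation with any other column is undefined and it drops out of the ranking.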

Cramer's V (Categorical Associations)

Variable 1  Variable 2  Cramer's V
state       car_value   0.0474
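For reference, Cramer's V rescales the chi-squared statistic of a contingency table to the range 0 to 1, so a value of 0.0474 indicates a very weak association between state and car_value. The sketch below uses the uncorrected formula V = sqrt(chi2 / (n * min(r - 1, c - 1))). Whether the dashboard applies a bias correction is not stated, and the function name and toy data are illustrative.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramer's V for two equal-length categorical sequences."""
    n = len(xs)
    if n == 0 or n != len(ys):
        raise ValueError("need two non-empty sequences of equal length")
    joint = Counter(zip(xs, ys))
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    chi2 = 0.0
    for a, row_n in row_totals.items():
        for b, col_n in col_totals.items():
            expected = row_n * col_n / n
            observed = joint.get((a, b), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    if k == 0:
        return float("nan")  # one variable has a single category
    return math.sqrt(chi2 / (n * k))


# Toy data (hypothetical values, not taken from the dataset)
print(cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"]))  # 1.0
print(cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"]))  # 0.0
```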