Dataset Information & Creation
Two Datasets Will Be Created:
- All variables after categorical filters
- Includes variables with missing data
- Used for individual outlier detection
- Only variables meeting completion threshold
- Only complete cases (no missing values)
- Used for Mahalanobis, PCA, etc.
Data Filtering
Step 2: Subset selection
- Variables with completion ≥ threshold → Used in multivariate analysis
- Variables with completion < threshold → Only used in univariate analysis
- Red line in plot shows current threshold
📊 Dataset for Univariate Outliers (Overall View)
Characteristics:
- All subjects after categorical filtering
- Includes all variables (even with missing data)
- Used for detecting outliers in individual variables
Data Availability Pyramid (Comparison by Sex):
🔍 Filter Detailed Statistics
📋 Variable Statistics Details
Top Variables by Missing Rate:
Multivariate Dataset (B) Summary Statistics:
Outlier Detection Settings
Current Status:
Boxplot Settings
Outlier Detection Method:
Subject Outlier Summary
Outlier Matrix (First 10 variables):
Variable Boxplots
Comprehensive Outlier Analysis Settings
Run All Outlier Detection Methods
Methods Included:
- IQR Method (1.5*IQR)
- Standard Deviation (3 SD)
- Percentile (1st-99th)
- Box-Cox
- Reference Interval
Download Summary Table
Outlier Detection Summary
Visualization of Outlier Counts (Subjects Perspective)
Visualization of Outlier Counts (Variables Perspective)
Method Agreement Analysis
Consistency Across Methods:
Mahalanobis Distance Plot
Outlier Controls
Subject Profile (Normalized Parameters)
Click on a point in the Mahalanobis plots above to view that subject's detailed profile across both sexes.