Can someone help me decipher this margin plot?
This plot is based on Titanic data set. After exploring this data, I decided to plot age vs cabin, since they have the highest number of missing values. After I installed VIM package:
library(VIM) marginplot(train[c(6,11)], col = c('Blue','Purple'))
I tried to dig deeper by checking its documentation, I came to know that Blue color represents the available data while purple represents missing data. Yet, I am unable to form a story from this plot.
Along with margin plot, I tried aggr plot as well, which looks like this:
aggr1_plot <- aggr(train, col = c('Blue','Purple'), numbers = TRUE, sortVars = TRUE, cex.axis = .8, gap = 3, ylab = c('Histogram of Missing Data','Pattern'))
For some reason, in this aggr plot, variable name ‘age’ is missing on x axis.
Please help me decipher the margin plot and pattern(aggr plot).