Understanding and modeling data extremes