Synthetic data, when properly used, can enhance patterns in real data and thus provide insights into different problems. Here, the estimation of tail probabilities of rare events from a moderately large number of observations is considered. The problem is approached by a large number of augmentations or fusions of the real data with computer-generated synthetic samples. The tail probability of interest is approximated by subsequences created by a novel iterative process. The estimates are found to be quite precise.
Keywords: B-curve; Density ratio model; Iterative process; Repeated out of sample fusion; Residential radon; Upper bounds.
© Grace Scientific Publishing 2021.