We propose an automated approach to rank the most salient variables related to a certain clinical phenomenon from scientific literature. Our solution is an automated approach to improve the efficiency of the collection of different health-related measures from a population, and to accelerate the discovery of novel associations and dependencies between health-related concepts.
Keywords: Digital health; feature selection; natural language processing.