Population Data BC: Supporting population data science in British Columbia

Int J Popul Data Sci. 2020 Mar 26;4(2):1133. doi: 10.23889/ijpds.v5i1.1133.

Abstract

Background: Population Data BC (PopData) was established as a multi-university data and education resource to support training and education, data linkage, and access to individual level, de-identified data for research in a wide variety of areas including human and community development and well-being.

Approach: A combination of deterministic and probabilistic linkage is conducted based on the quality and availability of identifiers for data linkage. PopData utilizes a harmonized data request and approval process for data stewards and researchers to increase efficiency and ease of access to linked data. Researchers access linked data through a secure research environment (SRE) that is equipped with a wide variety of tools for analysis. The SRE also allows for ongoing management and control of data. PopData continues to expand its data holdings and to evolve its services as well as governance and data access process.

Discussion: PopData has provided efficient and cost-effective access to linked data sets for research. After two decades of learning, future planned developments for the organization include, but are not limited to, policies to facilitate programs of research, access to reusable datasets, evaluation and use of new data linkage techniques such as privacy preserving record linkage (PPRL).

Conclusion: PopData continues to maintain and grow the number and type of data holdings available for research. Its existing models support a number of large-scale research projects and demonstrate the benefits of having a third-party data linkage and provisioning center for research purposes. Building further connections with existing data holders and governing bodies will be important to ensure ongoing access to data and changes in policy exist to facilitate access for researchers.