"Is a picture really worth a thousand words?": A case study on classifying user attributes on Instagram

PLoS One. 2018 Oct 5;13(10):e0204938. doi: 10.1371/journal.pone.0204938. eCollection 2018.

Abstract

Because using social media has become a major part of people's daily lives, many of their personal characteristics are often implicitly or explicitly reflected in the content they share. We present a study of two personal characteristics-age and gender-related to user engagement on Instagram that can be determined through the characterization of images and tags. We demonstrate the strong influence of age and gender on Instagram use in terms of topical and content differences. We then build age and gender classification models that yield F1 scores of up to 88% and 74% in the detection of age and gender, respectively, and that better characterize users by images than by tags. We further demonstrate the robustness of our models using a new set of test data, with which the models exhibit greater overall performance than human raters. Our study highlights that future research should look to exploit images to a greater degree because they complement text and there are many unexamined images with no embedded text available.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Age Factors
  • Female
  • Humans
  • Machine Learning
  • Male
  • Models, Theoretical
  • Sex Factors
  • Social Media / statistics & numerical data*
  • Surveys and Questionnaires

Grants and funding

This work was supported by S-2017-G0001-00040: Ajou University (Kyungsik Han, https://www.ajou.ac.kr/); NRF-2017R1A2B3004581: National Research Foundation of Korea (Sang-Wook Kim, https://nrf.re.kr/); NRF-2017R1C1B5017391: National Research Foundation of Korea (Kyungsik Han, https://nrf.re.kr/); NRF-2017M3C4A7069440: National Research Foundation of Korea (Sang-Wook Kim; https://nrf.re.kr/); NRF-2017M3C4A7083529: National Research Foundation of Korea (Kyungsik Han, https://nrf.re.kr/); NSF CNS-1422215: National Science Foundation (Dongwon Lee, https://www.nsf.gov/); NSF CNS-1742702: National Science Foundation (Dongwon Lee, https://www.nsf.gov/); NSF IUSE-1525601: National Science Foundation (Dongwon Lee, https://www.nsf.gov/); and Samsung GRO 2015 awards (Dongwon Lee).