This page has only limited features, please log in for full access.
This research proposes a new feature extraction algorithm using aggregated user engagements on social media in order to achieve demographics and personality discovery tasks. Our proposed framework can discover seven essential attributes, including gender identity, age group, residential area, education level, political affiliation, religious belief, and personality type. Multiple feature sets are developed, including comment text, community activity, and hybrid features. Various machine learning algorithms are explored, such as support vector machines, random forest, multi-layer perceptron, and naïve Bayes. An empirical analysis is performed on various aspects, including correctness, robustness, training time, and the class imbalance problem. We obtained the highest prediction performance by using our proposed feature extraction algorithm. The result on personality type prediction was 87.18%. For the demographic attribute prediction task, our feature sets also outperformed the baseline at 98.1% for residential area, 94.7% for education level, 92.1% for gender identity, 91.5% for political affiliation, 60.6% for religious belief, and 52.0% for the age group. Moreover, this paper provides the guideline for the choice of classifiers with appropriate feature sets.
Sarach Tuomchomtam; Nuanwan Soonthornphisaj. Demographics and Personality Discovery on Social Media: A Machine Learning Approach. Information 2021, 12, 353 .
AMA StyleSarach Tuomchomtam, Nuanwan Soonthornphisaj. Demographics and Personality Discovery on Social Media: A Machine Learning Approach. Information. 2021; 12 (9):353.
Chicago/Turabian StyleSarach Tuomchomtam; Nuanwan Soonthornphisaj. 2021. "Demographics and Personality Discovery on Social Media: A Machine Learning Approach." Information 12, no. 9: 353.
Sarach Tuomchomtam; Nuanwan Soonthornphisaj. Community recommendation for text post in social media: A case study on Reddit. Intelligent Data Analysis 2019, 23, 407 -424.
AMA StyleSarach Tuomchomtam, Nuanwan Soonthornphisaj. Community recommendation for text post in social media: A case study on Reddit. Intelligent Data Analysis. 2019; 23 (2):407-424.
Chicago/Turabian StyleSarach Tuomchomtam; Nuanwan Soonthornphisaj. 2019. "Community recommendation for text post in social media: A case study on Reddit." Intelligent Data Analysis 23, no. 2: 407-424.