Prediction of Academic Talent Capacity Based on Gradient Boosting Decision Tree
Applied and Computational Mathematics
Volume 8, Issue 4, August 2019, Pages: 75-81
Received: Aug. 5, 2019; Published: Sep. 27, 2019
Views 645      Downloads 203
Shunshun Shi, School of Information, Zhejiang University of Finance and Economics, Hangzhou, China
Mingzhou Chen, School of Information, Zhejiang University of Finance and Economics, Hangzhou, China
Rui Feng, School of Information, Zhejiang University of Finance and Economics, Hangzhou, China
Hua Zhang, School of Information, Zhejiang University of Finance and Economics, Hangzhou, China
Shuai Zhang, School of Information, Zhejiang University of Finance and Economics, Hangzhou, China
Article Tools
Follow on us
Talent introduction is an important force of academic development in universities. As the core of talent introduction, prediction of academic talent capacity is an essential and valuable research. However, it is hard to apply traditional statistical methods to extract knowledge from the mass and multi-dimensional talent information. Data mining approaches as up-to-date and efficient technologies are good at analyzing information, extracting patterns or rules from a big dataset and then making a prediction based on the relationship among extracted information. In this study, a series of data mining approaches are employed to evaluate the academic capacity of talent and to analyze the correlation between features. The Principal Component Analysis and Random Forest are used to feature extraction for improving the accuracy of prediction. A classical classification model, Gradient Boosting Decision Tree, is used as the primary analytic model to prediction. In order to validate the effectiveness of the model, other five classification models are used to conduct a comparative experiment based on prediction accuracy values and the F-measure metric. Further, to investigate the contribution of some important features, we make a marginal utility analysis of important features which have a high correlation with academic talent capacity. The experiment results reveals the important features for academic capacity and the positive factors for the academic production of talents.
Data Mining, Classification Models, Prediction, Talent Introduction, Academic Talent Capacity
To cite this article
Shunshun Shi, Mingzhou Chen, Rui Feng, Hua Zhang, Shuai Zhang, Prediction of Academic Talent Capacity Based on Gradient Boosting Decision Tree, Applied and Computational Mathematics. Vol. 8, No. 4, 2019, pp. 75-81. doi: 10.11648/j.acm.20190804.12
Copyright © 2019 Authors retain the copyright of this article.
This article is an open access article distributed under the Creative Commons Attribution License ( which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Hanif, M. I. & Yunfei, S. (2013), The role of talent management and HR generic strategies for talent retention, African Journal of Business Management, 7, 2827-2835.
Kellogg, R. P. (2012), China’s brain gain: Attitudes and future plans of overseas Chinese students in the US, Journal of Chinese Overseas, 8, 83-104.
Tharenou, P. & Seet, P. S. (2014), China's reverse brain drain: regaining and retaining talent, International Studies of Management and Organization, 44, 55-74.
Ma, Y. P. & Pan, S. Y. (2015), Chinese returnees from overseas study: An understanding of brain gain and brain circulation in the age of globalization, Frontiers of Education in China, 10, 306-329.
Lievens, K. van Dam, & Anderson, N. (2002), Recent trends and challenges in personnel selection, Personnel Review, 31, 580-601.
Friedman, J. H. (2001), Greedy function approximation: A gradient boosting machine, Annals of Statistics, 29, 1189-1232.
Quinlan, J. R. (1987), Simplifying decision trees, International Journal of Man-machine Studies, 27, 221-234.
Breiman, L. (2001), Random forests, Machine learning, 45, 5-32.
Jain, A. K., Mao, J., & Mohiuddin, K. M. (1996), Artificial neural networks: A tutorial, Computer, 29, 31-44.
Chen, J., Huang, H., Tian, S., & Qu, Y. (2009), Feature selection for text classification with Naive Bayes, Expert Systems with Applications, 36, 5432-5435.
Suykens J. A. & Vandewalle, J. (1999), Least squares support vector machine classifiers, Neural Processing Letters, 9, 293-300.
Shaw, M. J., Subramaniam, C., Tan, G. W., & Welge, M. E. (2001), Knowledge management and data mining for marketing, Decision Support Systems, 31, 127-137.
Hormozi, A. M. & Giles, S. (2004), Data mining: A competitive weapon for banking and retail industries, Information Systems Management, 21, 62-71.
Koh, H. C. & Tan, G. (2011), Data mining applications in healthcare, Journal of Healthcare Information Management, 19, 65-72.
Romero, C. & Ventura, S. (2013), Data mining in education, Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 3, 12-27.
Chien, C. F. & Chen, L. F. (2008), Data mining to improve personnel selection and enhance human capital: A case study in high-technology industry, Expert Systems with Applications, 34, 280-290.
Ranjan, J., Goyal, D. P. & Ahson, S. I. (2008), Data mining techniques for better decisions in human resource management systems, International Journal of Business Information Systems, 3, 464-481.
Gupta, S., Mokashi, U. M., & Suma, V. (2017). Entropy-based discretisation for performance prediction of employee: strategy for improving software quality, International Journal of Productivity and Quality Management, 21, 411-428.
Huang, M. J., Tsou, Y. L. & Lee, S. C. (2006), Integrating fuzzy data mining and fuzzy artificial neural networks for discovering implicit knowledge, Knowledge-Based Systems, 19, 396-403.
Han, Y. (2016). Improved BIRCH Clustering Algorithm and Human Resource Management Efficiency: An Organizational Learning Perspective. International Journal of Security and Its Applications, 10 (8), 385-394.
Fadhil, R., Djatna, T., & Maarif, M. S. (2017). Analysis and Design of a Human Resources Performance Measurement System for the Nutmeg Oil Agro-industry in Aceh. Journal of Regional and City Planning, 28 (2), 99-110.
Chien, C. F. & Chen, L. F. (2007), Using rough set theory to recruit and retain high-potential talents for semiconductor manufacturing, IEEE Transactions on Semiconductor Manufacturing, 20, 528-541.
Saron, M. & Othman, Z. A. (2012), Academic talent model based on human resource data mart, International Journal of Research in Computer Science, 2, 29-35.
Science Publishing Group
1 Rockefeller Plaza,
10th and 11th Floors,
New York, NY 10020
Tel: (001)347-983-5186