International Journal on Data Science and Technology
Volume 4, Issue 2, June 2018, Pages: 42-48
Received: Jun. 25, 2018;
Published: Jun. 26, 2018
Views 681 Downloads 44
Hui Zhang, Computer Engineering and Computer Science Department, University of Louisville, Louisville, USA
R has been adopted as a popular data analysis and mining tool in many domain fields over the past decade. As Big Data overwhelms those fields, the computational needs and workload of existing R solutions increases significantly. With recent hardware and software developments, it is possible to enable massive parallelism with existing R solutions with little to no modification. In this paper, three different approaches are evaluated to speed up R computations with the utilization of the multiple cores, the Intel Xeon Phi SE10P Co-processor, and the general purpose graphic processing unit (GPGPU). Performance engineering and evaluation efforts in this study are based on a popular R benchmark script. The paper presents preliminary results on running R-benchmark with the above packages and hardware technology combinations.
Performance Engineering for Scientific Computing with R, International Journal on Data Science and Technology.
Vol. 4, No. 2,
2018, pp. 42-48.
Accelerating the intel math kernel library, 2007. M. Intel. Intel math kernel library, 2007.
A hardware accelerator for the Intel Math Kernel. J. L. Gustafson and B. S. Greer. ClearSpeed whitepaper.
Y. El-Khamra, N. Gaffney, D. Walling, E. Wernert, W. Xu, and H. Zhang. Performance evaluation of r with intel xeon phicoprocessor. In Big Data, 2013 IEEE International Conference on, pages 23–30. IEEE, 2013.
Hui Zhang, Sidharth Thakur, and Andrew J. Hanson. Haptic exploration of mathematical knots. In ISVC (1), pages 745–756, 2007.
Lin Jing, Xipei Huang, Yiwen Zhong, Yin Wu, and Hui Zhang. Python based 4d visualization environment. International Journal of Advancements in Computing Technology, 4 (16):460–469, September 2012.
Hui Zhang, Jianguang Weng, and Andrew J. Hanson. A pseudo-haptic knot diagram interface. In Proc. SPIE, volume 7868, pages 786807–786807–14, 2011.
Guangchen Ruan and Hui Zhang. Conquering Big Data with High Performance Computing, chapter Large-Scale Multimodal Data Exploration with Human in the Loop. Springer International Publishing, Springer International Publishing Switzerland, 2016.
Jian Zou and Hui Zhang. Conquering Big Data with High Performance Computing, chapter High-Frequency Financial Analysis through High Performance Computing. Springer International Publishing, Springer International Publishing Switzerland, 2016.
Weijia Xu, Ruizhu Huang, and Hui Zhang. Conquering Big Data with High Performance Computing, chapter Empowering R with High Performance Computing Resources for Big Data Analytics. Springer International Publishing, Springer International Publishing Switzerland, 2016.
Hui Zhang, Huian Li, Michael J. Boyles, Robert Henschel, Eduardo Kazuo Kohara, and Masatoshi Ando. Exploiting hpc resources for the 3d-time series analysis of caries lesion activity. In Proceedings of the 1st Conference of the Extreme Science and Engineering Discovery Environment: Bridging from the eXtreme to the Campus and Beyond, XSEDE ’12, pages 19:1–19:8, New York, NY, USA, 2012. ACM.
Hui Zhang, Michael J. Boyles, Guangchen Ruan, Huian Li, Hongwei Shen, and Masatoshi Ando. Xsede-enabled highthroughput lesion activity assessment. In Proceedings of the Conference on Extreme Science and Engineering Discovery Environment: Gateway to Discovery, XSEDE ’13, pages 10:1–10:8, New York, NY, USA, 2013. ACM.
Hui Zhang, Jianguang Weng, and Guangchen Ruan. Visualizing 2-dimensional manifolds with curve handles in 4d. IEEE Transactions on Visualization and Computer Graphics, 20 (12):2575–2584, Dec 2014.
Riqing Chen and Hui Zhang. Large-scale 3D Reconstruction with an R-based Analysis Workflow. In Proceedings of the Fourth IEEE/ACM International Conference on Big Data Computing, Applications and Technologies (BDCAT '17). ACM, New York, NY, USA.
Hui Zhang, Yiwen. Zhong and Juan Lin, Divide-and-conquer strategies for large-scale simulations in R, 2017 IEEE International Conference on Big Data (Big Data), Boston, MA, 2017, pp. 3517-3523.