| Peer-Reviewed

Analysis and Prediction of COVID-19 Data Quality Based on Benford's Law-- Take Data from 51 Countries and Regions as an Example

Received: 8 March 2021    Accepted:     Published: 26 April 2021
Views:       Downloads:
Abstract

In the absence of effective drugs to control the COVID-19 epidemic, the main intervention of human factors, namely strict isolation, may be the best prevention and control method at present. The conclusion of data empirical study using Benford's Law is of great significance.Research purpose of this paper analyze COVID - 19 data to predict the authenticity and reliability, and on this basis, the method is to use Benford's Law and the panel model for 51 countries or regions COVID - 19 data statistical analysis, the results of the study found that "other areas" unreliable data, Australia, Pakistan and global data are greatly influenced by artificial factors, Africa, Oceania data with several other states Data according to have significant difference, compared the southern hemisphere and northern hemisphere, the first phase of the data and the second stage also has the obvious difference between the data, The COVID-19 data are also predicted to suggest that the outbreak may have multiple iterations.In conclusion, in most cases, when COVID-19 data deviates from Benford's Law, epidemic prevention and control is better; otherwise, it is worse.

Published in Science Innovation (Volume 9, Issue 2)
DOI 10.11648/j.si.20210902.14
Page(s) 53-62
Creative Commons

This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited.

Copyright

Copyright © The Author(s), 2024. Published by Science Publishing Group

Keywords

Benford's Law, COVID-19, Data Quality, Prediction

References
[1] Collins, C. (2017). Using Excel and Benford’s Law to Detect Fraud. Journal of Accountancy, April.
[2] Crocetti, E., & Randi, G. (2016). Using the Benford’s Law as a first step to assess the quality of the cancer registry data. Frontiers in public health, 4, 225.
[3] Daniels, J., Caetano, S. J., Huyer, D., Stephen, A., Fernandes, J., Lytwyn, A., & Hoppe, F. M. (2017). Benford's law for quality assurance of manner of death counts in small and large databases. Journal of forensic sciences, 62(5), 1326-1331.
[4] Goodman, W. (2016). The promises and pitfalls of Benford's law. Significance, 13(3), 38-41.
[5] Idrovo, A. J., Fernández-Niño, J. A., Bojórquez-Chapela, I., & Moreno-Montoya, J. (2011). Performance of public health surveillance systems during the influenza A (H1N1) pandemic in the Americas: testing a new method based on Benford's Law. Epidemiology & Infection, 139(12), 1827-1834.
[6] Kuruppu, N., & Muscat, O. (2019). The Application of Benford’s Law in Fraud Detection: A Systematic Methodology. International Business Research, 12(10), 1-10.
[7] Lu, F., & Boritz, J. E. (2005). Detecting fraud in health insurance data: Learning to model incomplete Benford’s Law distributions. In European Conference on Machine Learning (pp. 633-640). Springer, Berlin, Heidelberg.
[8] Manrique-Hernandez, E. F., Fernandez-Nino, J. A., & Idrovo, A. J. (2017). Global performance of epidemiologic surveillance of Zika virus: rapid assessment of an ongoing epidemic. Public health, 143, 14-16.
[9] Nigrini, M. J. (2019). The Patterns of the Numbers used in Occupational Fraud Schemes. Managerial Auditing Journal, 34(5), 606-626.
[10] Pomykacz, M., Olmsted, C., & Tantinan, K. (2017). Benford’s Law in Appraisal. The Appraisal Journal, Fall, 274-284.
[11] Sambridge, M., Tkalčić, H., & Jackson, A. (2010). Benford's law in the natural sciences. Geophysical research letters, 37(22).
[12] Singleton, T. W. (2011). Understanding and Applying Benford’s Law. ISACA Journal, 3, 4.
[13] Zhang, J. (2020). Testing Case Number of Coronavirus Disease 2019 in China with Newcomb-Benford Law. arXiv preprint arXiv:2002.05695.
[14] 杜玉忠,张铭驱,范秀红,卢文涛,曾茜茜,黄燕,黄燕琼.基于向量自回归模型的清远市手足口病发病预测分析[J].实用预防医学,2019(2):247:250.
[15] 韩兆洲,程学伟.GDP统计数据质量实证研究:基于Benford法则和空间面板模型[J].数理统计与管理,2019,38(03):394-404.
[16] 刘云霞,曾五一.关于综合利用Benford法则与其他方法评估统计数据质量的进一步研究[J].统计研究,2013,30(08):3-9.
[17] 罗玉波,张冬霞,吴亚炳.Benford’s Law在财务审计中的实证研究[J]. 中国审计评论,2015(2):69-78.
[18] 吕娜,邹薇.健康人力资本投资与居民收入——基于私人和公共部门健康支出的实证分析[J].中国地质大学学报(社会科学版),2015,15(01):113-119.
[19] 许存兴,王大江,张芙蓉.上市公司审计意见实证分析——基于Benford法则的造假检测[J]南京财经大学学报,2009(4):56:60.
Cite This Article
  • APA Style

    Cheng Han Leung, Yu Bo Luo, Tan Cheng Lok, Zi Chen Luo. (2021). Analysis and Prediction of COVID-19 Data Quality Based on Benford's Law-- Take Data from 51 Countries and Regions as an Example. Science Innovation, 9(2), 53-62. https://doi.org/10.11648/j.si.20210902.14

    Copy | Download

    ACS Style

    Cheng Han Leung; Yu Bo Luo; Tan Cheng Lok; Zi Chen Luo. Analysis and Prediction of COVID-19 Data Quality Based on Benford's Law-- Take Data from 51 Countries and Regions as an Example. Sci. Innov. 2021, 9(2), 53-62. doi: 10.11648/j.si.20210902.14

    Copy | Download

    AMA Style

    Cheng Han Leung, Yu Bo Luo, Tan Cheng Lok, Zi Chen Luo. Analysis and Prediction of COVID-19 Data Quality Based on Benford's Law-- Take Data from 51 Countries and Regions as an Example. Sci Innov. 2021;9(2):53-62. doi: 10.11648/j.si.20210902.14

    Copy | Download

  • @article{10.11648/j.si.20210902.14,
      author = {Cheng Han Leung and Yu Bo Luo and Tan Cheng Lok and Zi Chen Luo},
      title = {Analysis and Prediction of COVID-19 Data Quality Based on Benford's Law-- Take Data from 51 Countries and Regions as an Example},
      journal = {Science Innovation},
      volume = {9},
      number = {2},
      pages = {53-62},
      doi = {10.11648/j.si.20210902.14},
      url = {https://doi.org/10.11648/j.si.20210902.14},
      eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.si.20210902.14},
      abstract = {In the absence of effective drugs to control the COVID-19 epidemic, the main intervention of human factors, namely strict isolation, may be the best prevention and control method at present. The conclusion of data empirical study using Benford's Law is of great significance.Research purpose of this paper analyze COVID - 19 data to predict the authenticity and reliability, and on this basis, the method is to use Benford's Law and the panel model for 51 countries or regions COVID - 19 data statistical analysis, the results of the study found that "other areas" unreliable data, Australia, Pakistan and global data are greatly influenced by artificial factors, Africa, Oceania data with several other states Data according to have significant difference, compared the southern hemisphere and northern hemisphere, the first phase of the data and the second stage also has the obvious difference between the data, The COVID-19 data are also predicted to suggest that the outbreak may have multiple iterations.In conclusion, in most cases, when COVID-19 data deviates from Benford's Law, epidemic prevention and control is better; otherwise, it is worse.},
     year = {2021}
    }
    

    Copy | Download

  • TY  - JOUR
    T1  - Analysis and Prediction of COVID-19 Data Quality Based on Benford's Law-- Take Data from 51 Countries and Regions as an Example
    AU  - Cheng Han Leung
    AU  - Yu Bo Luo
    AU  - Tan Cheng Lok
    AU  - Zi Chen Luo
    Y1  - 2021/04/26
    PY  - 2021
    N1  - https://doi.org/10.11648/j.si.20210902.14
    DO  - 10.11648/j.si.20210902.14
    T2  - Science Innovation
    JF  - Science Innovation
    JO  - Science Innovation
    SP  - 53
    EP  - 62
    PB  - Science Publishing Group
    SN  - 2328-787X
    UR  - https://doi.org/10.11648/j.si.20210902.14
    AB  - In the absence of effective drugs to control the COVID-19 epidemic, the main intervention of human factors, namely strict isolation, may be the best prevention and control method at present. The conclusion of data empirical study using Benford's Law is of great significance.Research purpose of this paper analyze COVID - 19 data to predict the authenticity and reliability, and on this basis, the method is to use Benford's Law and the panel model for 51 countries or regions COVID - 19 data statistical analysis, the results of the study found that "other areas" unreliable data, Australia, Pakistan and global data are greatly influenced by artificial factors, Africa, Oceania data with several other states Data according to have significant difference, compared the southern hemisphere and northern hemisphere, the first phase of the data and the second stage also has the obvious difference between the data, The COVID-19 data are also predicted to suggest that the outbreak may have multiple iterations.In conclusion, in most cases, when COVID-19 data deviates from Benford's Law, epidemic prevention and control is better; otherwise, it is worse.
    VL  - 9
    IS  - 2
    ER  - 

    Copy | Download

Author Information
  • School of Business, City University of Macau, Macau, China

  • School of Business, City University of Macau, Macau, China

  • School of Business, City University of Macau, Macau, China

  • Institute of Data Science, City University of Macau, Macau, China

  • Sections