A business-driven reference architecture for big data analytics implementation by public sector organizations: a case study of Uganda
Matendo Didas, Frederick Henri Chali and Noe Elisa
Abstract
Advanced data analytics especially big data analytics (BDA) has proved to have a great potential to improve public service delivery both in urban and rural or remote areas. However, many of the BDA implementation projects in public sector organizations are failing. Early efforts to the successful implementation of BDA, suggest that business-driven reference architecture (RA) for these projects is among the viable solutions to have BDA project failure rate addressed in public sector organizations. The current study aimed at designing business-driven RA for big data analytics implementation (BRABDAI) to act as a frame of reference (blueprint) for delivering big data analytics implementation projects in public sector organizations. Uganda as a case study, three public organizations were considered (Ministry of Health, Ministry of Education and Sports, and Uganda Bureau of Statics). To accomplish the study's goals, the design science research methodology (DSRM) was used supported by a questionnaire survey and mini literature review for data collection. The developed RA offers a mechanism to systematically consider business use cases and processes in the first place before assembling BDA technical aspects and procedures. Hence, the main result of this paper is BRABDAI for public sector organizations. The practical implication of the paper is that feasibility study, requirement engineering, design, implementation, deployments, and maintenance of big data analytics projects in public sector organizations can now be driven by the need of the organization in question, other than assembling technology first which can result into detrimental misalignment between organizational vision and technology.
Keyword
Big data analytics, Reference architecture, Design science research methodology , Business-driven reference architecture.
Cite this article
Didas M, Chali FH, Elisa N.A business-driven reference architecture for big data analytics implementation by public sector organizations: a case study of Uganda. International Journal of Advanced Computer Research. 2024;14(69):125-149. DOI:10.19101/IJACR.2023.1362047
Refference
[1]Hidalgo-sanchis P. UN global pulse: a UN innovation initiative with a multiplier effect. Data Science for Social Good: Philanthropy and Social Impact in a Complex World. 2021:29-40.
[2]Klievink B, Romijn BJ, Cunningham S, De BH. Big data in the public sector: uncertainties and readiness. Information Systems Frontiers. 2017; 19(2):267-83.
[3]Henrion M. Why most big data analytics projects fail: how to succeed by engaging with your clients. ORMS Today, Informs. 2019.
[4]Rahmani AM, Azhir E, Ali S, Mohammadi M, Ahmed OH, Ghafour MY, et al. Artificial intelligence approaches and mechanisms for big data analytics: a systematic study. Peer J Computer Science. 2021; 7:e488.
[5]Chen B, Tu Y, Song Y, Theobald DM, Zhang T, Ren Z, et al. Mapping essential urban land use categories with open big data: results for five metropolitan areas in the United States of America. ISPRS Journal of Photogrammetry and Remote Sensing. 2021; 178:203-18.
[6]Guenduez AA, Mettler T, Schedler K. Technological frames in public administration: what do public managers think of big data. Government Information Quarterly. 2020; 37(1):101406.
[7]Desouza KC, Jacob B. Big data in the public sector: lessons for practitioners and scholars. Administration & Society. 2017; 49(7):1043-64.
[8]Munne R. Big data in the public sector. New Horizons for a Data-driven Economy. 2016:195-208. Springer Open.
[9]Wang Y, Kung L, Byrd TA. Big data analytics: understanding its capabilities and potential benefits for healthcare organizations. Technological Forecasting and Social Change. 2018; 126:3-13.
[10]Aggarwal S, Manuel N. Big data analytics should be driven by business needs, not technology. McKinsey & Co; 2016.
[11]Ataei P, Litchfield AT. Big data reference architectures, a systematic literature review. Australasian conference on information systems 2020 (pp. 1-11).
[12]Gorton I, Klein J. Distribution, data, deployment: software architecture convergence in big data systems. IEEE Software. 2014; 32(3):78-85.
[13]Interoperability NB. NIST big data public working group definitions and taxonomies subgroup in: framework: definitions. NIST Special Publication. 2015.
[14]Muchenje G, Seppänen M. Unpacking task-technology fit to explore the business value of big data analytics. International Journal of Information Management. 2023; 69:102619.
[15]Ntawiha W, Anderson B. Ugandas data ecosystem. 2016.
[16]Grover V, Chiang RH, Liang TP, Zhang D. Creating strategic business value from big data analytics: a research framework. Journal of Management Information Systems. 2018; 35(2):388-423.
[17]Gärtner B, Hiebl MR. Issues with big data. In the Routledge companion to accounting information systems 2017 (pp. 161-72). Routledge.
[18]Van DVHG, Klievink AJ, Arnaboldi M, Meijer AJ. Rationality and politics of algorithms. Will the promise of big data survive the dynamics of public decision making. Government Information Quarterly. 2019; 36(1):27-38.
[19]Marr B. Data strategy: how to profit from a world of big data, analytics and the internet of things. Kogan Page Publishers; 2017.
[20]Davenport TH, Dyché J. Big data in big companies. International Institute for Analytics. 2013.
[21]Mishra D, Gunasekaran A, Papadopoulos T, Childe SJ. Big data and supply chain management: a review and bibliometric analysis. Annals of Operations Research. 2018; 270:313-36.
[22]Barham H, Daim T. The use of readiness assessment for big data projects. Sustainable Cities and Society. 2020; 60:102233.
[23]Kaufmann M. Big data management canvas: a reference model for value creation from data. Big Data and Cognitive Computing. 2019; 3(1):19.
[24]Weibl J, Hess T. Success or failure of big data: insights of managerial challenges from a technology assimilation perspective. In MKWI 2018 (pp. 47-58).
[25]Alharthi A, Krotov V, Bowman M. Addressing barriers to big data. Business Horizons. 2017; 60(3):285-92.
[26]Spangenberg N, Wilke M, Franczyk B. A big data architecture for intra-surgical remaining time predictions. Procedia Computer Science. 2017; 113:310-7.
[27]Naganathan V. Comparative analysis of big data, big data analytics: challenges and trends. International Research Journal of Engineering and Technology. 2018; 5(05):1948-64.
[28]Li L, Ma S, Wang R, Wang Y, Zheng Y. Citizen participation in the co-production of urban natural resource assets: analysis based on social media big data. Journal of Global Information Management. 2021; 30(6):1-21.
[29]Zhuang YT, Wu F, Chen C, Pan YH. Challenges and opportunities: from big data to knowledge in AI 2.0. Frontiers of Information Technology & Electronic Engineering. 2017; 18:3-14.
[30]Jordan L. The problem with big data in translational medicine. a review of where we've been and the possibilities ahead. Applied Translational Genomics. 2015; 6:3-6.
[31]Henninger M. The value and challenges of public sector information. Cosmopolitan Civil Societies: An Interdisciplinary Journal. 2013; 5(3):75-95.
[32]Lande D, Subach I, Puchkov A. A system for analysis of big data from social media. Information & Security. 2020; 47(1):44-61.
[33]Kim GH, Trimi S, Chung JH. Big-data applications in the government sector. Communications of the ACM. 2014; 57(3):78-85.
[34]Bassoo V, Ramnarain-seetohul V, Hurbungs V, Fowdur TP, Beeharry Y. Big data analytics for smart cities. Internet of Things and Big Data Analytics Toward Next-Generation Intelligence. 2018:359-79.
[35]Dash S, Shakyawar SK, Sharma M, Kaushik S. Big data in healthcare: management, analysis and future prospects. Journal of Big Data. 2019; 6(1):1-25.
[36]Mcdermott S, Turk MA. What are the implications of the big data paradigm shift for disability and health. Disability and Health Journal. 2015; 8(3):303-4.
[37]Elsheikh AS. A brief outlook of enterprise architecture role in the digital age. In proceedings of the federated Africa and middle east conference on software engineering 2022 (pp. 51-58). ACM.
[38]Sofyana L, Putera AR. Business architecture planning with TOGAF framework. In journal of physics: conference series 2019 (pp. 1-10). IOP Publishing.
[39]Saputra H, Gustiana I, Soeryanto SE. Design of enterprise information system using TOGAF framework for public service agency. Journal of Engineering Science and Technology. 2022; 17(4).
[40]KR RB. Design and development of QoS guaranteed cloud scheduling. Doctoral Dissertation, Cochin University of Science and Technology. 2019.
[41]Tamke M, Nicholas P, Zwierzycki M. Machine learning for architectural design: practices and infrastructure. International Journal of Architectural Computing. 2018; 16(2):123-43.
[42]Breivold HP. Internet-of-things and cloud computing for smart industry: a systematic mapping study. In 5th international conference on enterprise systems (ES) 2017 (pp. 299-304). IEEE.
[43]Peterson L, Al-shabibi A, Anshutz T, Baker S, Bavier A, Das S, et al. Central office re-architected as a data center. IEEE Communications Magazine. 2016; 54(10):96-101.
[44]https://www.nttdata.com/global/ja/-/media/nttdataglobal-ja/files/investors/library/ar/ar12_j_all.pdf. Accessed 28 June 2024.
[45]Moreno J, Serrano MA, Fernandez-medina E, Fernandez EB. Towards a security reference architecture for big data. In DOLAP 2018 (pp. 1-9).
[46]Velásquez N, Estevez EC, Pesado PM. Cloud computing, big data and the industry 4.0 reference architectures. Journal of Computer Science & Technology. 2018.
[47]Varghese B, Buyya R. Next generation cloud computing: new trends and research directions. Future Generation Computer Systems. 2018; 79:849-61.
[48]Nirmala MB, Raj P. Big data computing and the reference architecture. In big data: concepts, methodologies, tools, and applications 2016 (pp. 56-72). IGI Global.
[49]Singh N, Lai KH, Vejvar M, Cheng TC. Big data technology: challenges, prospects, and realities. IEEE Engineering Management Review. 2019; 47(1):58-66.
[50]García S, Ramírez-gallego S, Luengo J, Benítez JM, Herrera F. Big data preprocessing: methods and prospects. Big Data Analytics. 2016; 1:1-22.
[51]Becker DK. Predicting outcomes for big data projects: big data project dynamics (BDPD): research in progress. In international conference on big data (big data) 2017 (pp. 2320-30). IEEE.
[52]Baran ML. Mixed methods research design. In research anthology on innovative research methodologies and utilization across multiple disciplines 2022 (pp. 312-33). IGI Global.
[53]Xing W, Whalley J. The golden tax project, value-added tax statistics, and the analysis of internal trade in China. China Economic Review. 2014; 30:448-58.
[54]Saltz J, Shamshurin I, Connors C. A framework for describing big data projects. In business information systems workshops: BIS 2016 international workshops, Leipzig, Germany 2017 (pp. 183-95). Springer International Publishing.
[55]Ibrahim SA, Hamzah N, Wahab AR, Abdullah JM, Malim NH, Sumari P, et al. Big brain data initiative in universiti sains Malaysia: challenges in brain mapping for Malaysia. The Malaysian Journal of Medical Sciences: MJMS. 2020; 27(4):1-8.
[56]Wieringa RJ. Design science methodology for information systems and software engineering. Springer; 2014.
[57]Bartram B. Using questionnaires. In practical research methods in education 2019 (pp. 1-11). Routledge.
[58]Ikart EM. Survey questionnaire survey pretesting method: an evaluation of survey questionnaire via expert reviews technique. Asian Journal of Social Science Studies. 2019; 4(2).
[59]Krosnick JA. Questionnaire design. The Palgrave handbook of survey research. 2018:439-55.
[60]Lund B. The questionnaire method in systems research: an overview of sample sizes, response rates and statistical approaches utilized in studies. VINE Journal of Information and Knowledge Management Systems. 2023; 53(1):1-10.
[61]Ong MH, Puteh F. Quantitative data analysis: choosing between SPSS, PLS, and AMOS in social science research. International Interdisciplinary Journal of Scientific Research. 2017; 3(1):14-25.
[62]Yang M. Big data: issues, challenges, tools. Thesis, Centria University of Applied Sciences. 2020.
[63]Alzahrani SM. Big data analytics tools: twitter API and spark. In international conference of women in data science at Taif university 2021 (pp. 1-6). IEEE.
[64]Hiebl MR. Sample selection in systematic literature reviews of management research. Organizational Research Methods. 2023; 26(2):229-61.
[65]Booth A, James MS, Clowes M, Sutton A. Systematic approaches to a successful literature review. Sage Publications; 2021.
[66]Lame G. Systematic literature reviews: an introduction. In proceedings of the design society: international conference on engineering design 2019 (pp. 1633-42). Cambridge University Press.
[67]Rethlefsen ML, Kirtley S, Waffenschmidt S, Ayala AP, Moher D, Page MJ, et al. PRISMA-S: an extension to the PRISMA statement for reporting literature searches in systematic reviews. Systematic Reviews. 2021; 10:1-9.
[68]Nti IK, Quarcoo JA, Aning J, Fosu GK. A mini-review of machine learning in big data analytics: applications, challenges, and prospects. Big Data Mining and Analytics. 2022; 5(2):81-97.
[69]Mohd SMH, Fitri PA, Ngalimat MS, Sim KJ. A mini literature review on current advancements in protein purification techniques. In biology and life sciences forum 2022 (pp. 1-10). MDPI.
[70]Bree RT, Gallagher G. Using microsoft excel to code and thematically analyse qualitative data: a simple, cost-effective approach. All Ireland Journal of Higher Education. 2016; 8(2).
[71]Gibbs GR. Analyzing qualitative data. Sage Publications; 2018.
[72]Prayag G. Post-positivism and mixed methods research in rural tourism. In handbook on tourism and rural community development 2023 (pp. 62-75). Edward Elgar Publishing.
[73]Didas M, Chali F, Elisa N, Matogoro J. Requirements for building a business-driven reference architecture for implementation of big data analytics by public sector organizations: a case study of Uganda. In first international conference on the advancements of artificial intelligence in African context 2023 (pp. 1-12). IEEE.
[74]Andrade C. The inconvenient truth about convenience and purposive samples. Indian Journal of Psychological Medicine. 2021; 43(1):86-8.
[75]Baltes S, Ralph P. Sampling in software engineering research: a critical review and guidelines. Empirical Software Engineering. 2022; 27(4):94.
[76]Walters WH. Survey design, sampling, and significance testing: key issues. The Journal of Academic Librarianship. 202; 47(3):102344.
[77]Sharma G. Pros and cons of different sampling techniques. International Journal of Applied Research. 2017; 3(7):749-52.
[78]Nanjundeswaraswamy TS, Divakar S. Determination of sample size and sampling methods in applied research. Proceedings on Engineering Sciences. 2021; 3(1):25-32.
[79]Chaokromthong K, Sintao N. Sample size estimation using yamane and cochran and krejcie and morgan and green formulas and cohen statistical power analysis by G power and comparisions. Apheit International Journal. 2021; 10(2):76-86.
[80]Tamym L, Benyoucef L, Moh AN, El OMD. Big data analytics-based approach for robust, flexible and sustainable collaborative networked enterprises. Advanced Engineering Informatics. 2023; 55:101873.
[81]Shi Y. Advances in big data analytics. Adv Big Data Anal; 2022. Springer.
[82]Watson HJ. Tutorial: big data analytics: concepts, technologies, and applications. Communications of the Association for Information Systems. 2014; 34(1):65.
[83]Fredriksson C, Mubarak F, Tuohimaa M, Zhan M. Big data in the public sector: a systematic literature review. Scandinavian Journal of Public Administration. 2017; 21(3):39-61.
[84]Kiganda C, Akcayol MA. Forecasting the spread of COVID-19 using deep learning and big data analytics methods. SN Computer Science. 2023; 4(4):374.
[85]Coyne EM, Coyne JG, Walker KB. Big Data information governance by accountants. International Journal of Accounting & Information Management. 2018; 26(1):153-70.
[86]Joubert A, Murawski M, Bick M. Measuring the big data readiness of developing countries–index development and its application to Africa. Information Systems Frontiers. 2023; 25(1):327-50.
[87]Arora B. Big data analytics: the underlying technologies used by organizations for value generation. Understanding the Role of Business Analytics: Some Applications. 2019:9-30.
[88]Sang GM, Xu L, De VP. Simplifying big data analytics systems with a reference architecture. In collaboration in a data-rich world: 18th IFIP WG 5.5 working conference on virtual enterprises, Vicenza, Italy, 2017 (pp. 242-9). Springer International Publishing.