Efficient Auto Scaling and Cost-Effective Architecture in Apache Hadoop

No Thumbnail Available

Date

2022-01-24

Journal Title

Journal ISSN

Volume Title

Publisher

University of Eloued جامعة الوادي

Abstract

In the age of Big Data Analytics, Cloud Computing has been regarded as a feasible and applicable technology to address Big Data Challenges, from storage capacities to distributed processing computations. One of the keys of its success is its high scalability which refers to the ability of the system to increase its performance, resources and functionalities according to the workload. This flexibility has been seen as an appropriate way to decrease datacenters' energy consumption and thus assures cost-saving and efficiency without effecting performance of the system. In order to handle Big Data operations, Cloud Computing has implemented various platforms and tools such as Apache Hadoop and provides distributed processing of very large data sets across multiple clusters. This paper proposes an auto scaling architecture based on the framework of Hadoop; it adjusts automatically the computation resources depending on the workload. In order to validate the effectiveness of the proposed architecture, a case study about Twitter data analysis in a cloud simulated environment has been implemented to improve the cost-effectiveness and the efficiency of the system.

Description

Forum Intervention of Artificial Intelligence and Its Applications. Faculty of Exat science. University of Eloued

Keywords

Big Data, Cloud Computing, Apache Hadoop, Auto Scaling.

Citation

NEMOUCHI, Warda Ismahene, BOUDOUDA, Souheila. ZAROUR, Nacer eddine. Efficient Auto Scaling and Cost-Effective Architecture in Apache Hadoop. Forum of Artificial Intelligence and Its Applications. 24-26 Jan 2022. Faculty of Exat science. University of Eloued. [visited in ../../….]. available from [copy the link here]