Ajou University repository

Novel data-placement scheme for improving the data locality of Hadoop in heterogeneous environments
Citations

SCOPUS

11

Citation Export

DC Field Value Language
dc.contributor.authorBae, Minho-
dc.contributor.authorYeo, Sangho-
dc.contributor.authorPark, Gyudong-
dc.contributor.authorOh, Sangyoon-
dc.date.issued2021-09-25-
dc.identifier.urihttps://dspace.ajou.ac.kr/dev/handle/2018.oak/31246-
dc.description.abstractTo address the challenging needs of high-performance big data processing, parallel-distributed frameworks such as Hadoop are being utilized extensively. However, in heterogeneous environments, the performance of Hadoop clusters is below par. This is primarily because the blocks of the clusters are allocated equally to all nodes without regard to differences in the capability of individual nodes. This results in reduced data locality. Thus, a new data-placement scheme that enhances data locality is required for Hadoop in heterogeneous environments. This article proposes a new data placement scheme that preserves the same degree of data locality in heterogeneous environments as that of the standard Hadoop, with only a small amount of replicated data. In the proposed scheme, only those blocks with the highest probability of being accessed remotely are selected and replicated. The results of experiments conducted indicate that the proposed scheme incurs only a 20% disk space overhead and has virtually the same data locality ratio as the standard Hadoop, which has a replication factor of three and 200% disk space overhead.-
dc.description.sponsorshipThis research is supported by C2 integrating and interfacing technologies laboratory of Agency for Defense Development (UD180010ED).-
dc.language.isoeng-
dc.publisherJohn Wiley and Sons Ltd-
dc.subject.meshData locality-
dc.subject.meshData placement-
dc.subject.meshHadoop MapReduce-
dc.subject.meshHeterogeneous environments-
dc.subject.meshreplication-
dc.titleNovel data-placement scheme for improving the data locality of Hadoop in heterogeneous environments-
dc.typeConference Paper-
dc.citation.titleConcurrency and Computation: Practice and Experience-
dc.citation.volume33-
dc.identifier.bibliographicCitationConcurrency and Computation: Practice and Experience, Vol.33-
dc.identifier.doi10.1002/cpe.5752-
dc.identifier.scopusid2-s2.0-85082974056-
dc.identifier.urlhttp://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1532-0634-
dc.subject.keyworddata locality-
dc.subject.keyworddata placement-
dc.subject.keywordHadoop MapReduce-
dc.subject.keywordheterogeneous environment-
dc.subject.keywordreplication-
dc.description.isoafalse-
dc.subject.subareaSoftware-
dc.subject.subareaTheoretical Computer Science-
dc.subject.subareaComputer Science Applications-
dc.subject.subareaComputer Networks and Communications-
dc.subject.subareaComputational Theory and Mathematics-
Show simple item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Oh, Sangyoon Image
Oh, Sangyoon오상윤
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.