Ajou University repository

DSMM: A Dynamic Setting for Memory Management in Apache Spark
Citations

SCOPUS

0

Citation Export

DC Field Value Language
dc.contributor.authorChae, Suk Joo-
dc.contributor.authorChung, Tae Sun-
dc.date.issued2019-04-22-
dc.identifier.urihttps://aurora.ajou.ac.kr/handle/2018.oak/36459-
dc.identifier.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85065394150&origin=inward-
dc.description.abstractApache Spark (Spark) is a unified analytics engine for large-scale data processing. Unlike traditional data processing engines like Hadoop, Spark is a framework that caches data in memory. Therefore, memory management in Spark is importance. However, there are several factors that interfere with memory management. First, if users want to cache data in memory, they need to choose their own storage level. In this case, if they do not select the optimal storage level, Spark will be put a heavy burden on memory. Next, users need to select the ratio for spark memory directly within Spark. If they do not choose optimal ratio for spark memory, garbage collection overheads will be incurred. In this poster, we propose DSMM that dynamically select the above factors on the system for memory management. Our experimental result shows 13% execution time improvement as compared to standard Spark.-
dc.language.isoeng-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.subject.meshCache data-
dc.subject.meshDynamic settings-
dc.subject.meshGarbage collection-
dc.subject.meshLarge-scale data processing-
dc.subject.meshMemory management-
dc.subject.meshProcessing engine-
dc.subject.meshStorage level-
dc.titleDSMM: A Dynamic Setting for Memory Management in Apache Spark-
dc.typeConference-
dc.citation.conferenceDate2019.3.24. ~ 2019.3.26.-
dc.citation.conferenceName2019 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2019-
dc.citation.editionProceedings - 2019 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2019-
dc.citation.endPage144-
dc.citation.startPage143-
dc.citation.titleProceedings - 2019 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2019-
dc.identifier.bibliographicCitationProceedings - 2019 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2019, pp.143-144-
dc.identifier.doi10.1109/ispass.2019.00024-
dc.identifier.scopusid2-s2.0-85065394150-
dc.identifier.urlhttp://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=8686044-
dc.subject.keywordApache Spark-
dc.subject.keywordConfiguration Setting-
dc.subject.keywordSpark memory-
dc.subject.keywordStorage Levels-
dc.type.otherConference Paper-
dc.description.isoafalse-
dc.subject.subareaHardware and Architecture-
dc.subject.subareaSoftware-
dc.subject.subareaSafety, Risk, Reliability and Quality-
Show simple item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Chung, Tae-Sun Image
Chung, Tae-Sun정태선
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.