Citation Export
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Dey, Sangeeta | - |
dc.contributor.author | Lee, Seok Won | - |
dc.date.issued | 2023-03-27 | - |
dc.identifier.uri | https://aurora.ajou.ac.kr/handle/2018.oak/36991 | - |
dc.identifier.uri | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85162869276&origin=inward | - |
dc.description.abstract | In the days of AI, data-centric machine learning (ML) models are increasingly used in various complex systems. While many researchers are focusing on specifying ML-specific performance requirements, not enough guideline is provided to engineer the data requirements systematically involving diverse stakeholders. Lack of written agreement about the training data, collaboration bottlenecks, lack of data validation framework, etc. are posing new challenges to ensuring training data fitness for safety-critical ML components. To reduce these gaps, we propose a multi-layered framework that helps to perceive and elicit data requirements. We provide a template for verifiable data requirements specifications. Moreover, we show how such requirements can facilitate an evidence-driven assessment of the training data quality based on the experts' judgments about the satisfaction of the requirements. We use Dempster Shafer's theory to combine experts' subjective opinions in the process. A preliminary case study on the CityPersons dataset for the pedestrian detection feature of autonomous cars shows the usefulness of the proposed framework for data requirements understanding and the confidence assessment of the dataset. | - |
dc.description.sponsorship | This work was supported by the BK21 FOUR program of the National Research Foundation (NRF) of Korea funded by the Ministry of Education (NRF5199991014091) and the Basic Science Research Program through the NRF funded by the Ministry of Science and ICT (NRF-2020R1F1A1075605). | - |
dc.language.iso | eng | - |
dc.publisher | Association for Computing Machinery | - |
dc.subject.mesh | Collaborative framework | - |
dc.subject.mesh | Data centric | - |
dc.subject.mesh | Data requirements | - |
dc.subject.mesh | Machine learning models | - |
dc.subject.mesh | Machine-learning | - |
dc.subject.mesh | Multi-layered | - |
dc.subject.mesh | Requirement engineering | - |
dc.subject.mesh | Safety critical systems | - |
dc.subject.mesh | Training data | - |
dc.subject.mesh | Uncertainty | - |
dc.title | A Multi-layered Collaborative Framework for Evidence-driven Data Requirements Engineering for Machine Learning-based Safety-critical Systems | - |
dc.type | Conference | - |
dc.citation.conferenceDate | 2023.3.27. ~ 2023.3.31. | - |
dc.citation.conferenceName | 38th Annual ACM Symposium on Applied Computing, SAC 2023 | - |
dc.citation.edition | Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing, SAC 2023 | - |
dc.citation.endPage | 1413 | - |
dc.citation.startPage | 1404 | - |
dc.citation.title | Proceedings of the ACM Symposium on Applied Computing | - |
dc.identifier.bibliographicCitation | Proceedings of the ACM Symposium on Applied Computing, pp.1404-1413 | - |
dc.identifier.doi | 10.1145/3555776.3577647 | - |
dc.identifier.scopusid | 2-s2.0-85162869276 | - |
dc.subject.keyword | data requirements | - |
dc.subject.keyword | machine learning | - |
dc.subject.keyword | reliability | - |
dc.subject.keyword | safety | - |
dc.subject.keyword | uncertainty | - |
dc.type.other | Conference Paper | - |
dc.description.isoa | false | - |
dc.subject.subarea | Software | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.