Ajou University repository

Optimized Signature Selection for Efficient String Similarity Searchoa mark
Citations

SCOPUS

0

Citation Export

DC Field Value Language
dc.contributor.authorLee, Taegyoung-
dc.contributor.authorChung, Tae Sun-
dc.contributor.authorKim, Jongik-
dc.date.issued2020-01-01-
dc.identifier.issn2169-3536-
dc.identifier.urihttps://aurora.ajou.ac.kr/handle/2018.oak/31359-
dc.identifier.urihttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85086454823&origin=inward-
dc.description.abstractIn this paper, we study the problem of string similarity search to retrieve in a database all strings similar to a query string within a given threshold. To measure the similarity between strings, we use edit distance. Many algorithms have been proposed under a filtering-and-verification framework to solve the problem. To reduce the overhead of edit distance verification, it is crucial to efficiently generate a small number of candidates in the filtering phase. Recently, an index structure named HSTree has been proposed for efficiently generating candidate strings. To generate candidates, they select and utilize HSTree nodes at a specific level calculated from a given threshold. In this paper, we observe that there are many alternative ways to select HSTree nodes, and propose a novel technique that selects HSTree nodes in an optimized way based on the observation. We also propose a modified HSTree, named a threaded HSTree, which connects inverted lists of an HSTree node to inverted lists of its child nodes. With a threaded HSTree, we can reduce the overhead of index lookups in HSTree nodes while selecting optimal tree nodes. Experimental results show that the proposed technique significantly outperforms the existing technique using the HSTree.-
dc.description.sponsorshipThis work was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Korean Government (Ministry of Science and ICT) under Grant 2019R1F1A1059795 and Grant 2019R1F1A1058548.-
dc.language.isoeng-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.subject.meshEdit distance-
dc.subject.meshIndex structure-
dc.subject.meshInverted list-
dc.subject.meshNovel techniques-
dc.subject.meshQuery string-
dc.subject.meshSignature selections-
dc.subject.meshString similarity-
dc.subject.meshVerification framework-
dc.titleOptimized Signature Selection for Efficient String Similarity Search-
dc.typeArticle-
dc.citation.endPage98204-
dc.citation.startPage98193-
dc.citation.titleIEEE Access-
dc.citation.volume8-
dc.identifier.bibliographicCitationIEEE Access, Vol.8, pp.98193-98204-
dc.identifier.doi2-s2.0-85086454823-
dc.identifier.scopusid2-s2.0-85086454823-
dc.identifier.urlhttp://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639-
dc.subject.keywordEdit distance-
dc.subject.keywordHierarchical tree index-
dc.subject.keywordOptimized signature selection-
dc.subject.keywordPartition signature scheme-
dc.subject.keywordString similarity search-
dc.type.otherArticle-
dc.description.isoatrue-
dc.subject.subareaComputer Science (all)-
dc.subject.subareaMaterials Science (all)-
dc.subject.subareaEngineering (all)-
dc.subject.subareaElectrical and Electronic Engineering-
Show simple item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Chung, Tae-Sun Image
Chung, Tae-Sun정태선
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.