Ajou University repository

Optimized Signature Selection for Efficient String Similarity Searchoa mark
Citations

SCOPUS

0

Citation Export

Publication Year
2020-01-01
Journal
IEEE Access
Publisher
Institute of Electrical and Electronics Engineers Inc.
Citation
IEEE Access, Vol.8, pp.98193-98204
Keyword
Edit distanceHierarchical tree indexOptimized signature selectionPartition signature schemeString similarity search
Mesh Keyword
Edit distanceIndex structureInverted listNovel techniquesQuery stringSignature selectionsString similarityVerification framework
All Science Classification Codes (ASJC)
Computer Science (all)Materials Science (all)Engineering (all)Electrical and Electronic Engineering
Abstract
In this paper, we study the problem of string similarity search to retrieve in a database all strings similar to a query string within a given threshold. To measure the similarity between strings, we use edit distance. Many algorithms have been proposed under a filtering-and-verification framework to solve the problem. To reduce the overhead of edit distance verification, it is crucial to efficiently generate a small number of candidates in the filtering phase. Recently, an index structure named HSTree has been proposed for efficiently generating candidate strings. To generate candidates, they select and utilize HSTree nodes at a specific level calculated from a given threshold. In this paper, we observe that there are many alternative ways to select HSTree nodes, and propose a novel technique that selects HSTree nodes in an optimized way based on the observation. We also propose a modified HSTree, named a threaded HSTree, which connects inverted lists of an HSTree node to inverted lists of its child nodes. With a threaded HSTree, we can reduce the overhead of index lookups in HSTree nodes while selecting optimal tree nodes. Experimental results show that the proposed technique significantly outperforms the existing technique using the HSTree.
ISSN
2169-3536
Language
eng
URI
https://aurora.ajou.ac.kr/handle/2018.oak/31359
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85086454823&origin=inward
DOI
https://doi.org/2-s2.0-85086454823
Journal URL
http://ieeexplore.ieee.org/xpl/RecentIssue.jsp?punumber=6287639
Type
Article
Funding
This work was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Korean Government (Ministry of Science and ICT) under Grant 2019R1F1A1059795 and Grant 2019R1F1A1058548.
Show full item record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Chung, Tae-Sun Image
Chung, Tae-Sun정태선
Department of Software and Computer Engineering
Read More

Total Views & Downloads

File Download

  • There are no files associated with this item.