JISE


  [1] [2] [3] [4] [5] [6] [7] [8] [9] [10] [11] [12] [13] [14] [15] [16] [17] [18] [19] [20] [21] [22] [23] [24]


Journal of Information Science and Engineering, Vol. 26 No. 2, pp. 333-346


Testing Whether a Set of Code Words Satisfies a Given Set of Constraints


HSIN-WEN WEI, WAN-CHEN LU, PEI-CHI HUANG, WEI-KUAN SHIH AND MING-YANG KAO+
Department of Computer Science 
National Tsing Hua University 
Hsinchu 300, Taiwan 
E-mail: {bertha, wanchen, peggy, wshih}@rtlab.cs.nthu.edu.tw 
+Department of Electrical Engineering and Computer Science 
Northwestern University 
Evanston 60208, U.S.A. 
E-mail: kao@northwestern.edu


    This paper investigates the problem of testing whether a set of code words satisfies certain biologically motivated Hamming distance constraints. The paper provides three efficient techniques to verify the code words, namely, the Enumeration, Table Lookup, and Encoding methods, with applications to the design of DNA words. The Enumeration method enumerates all combinations of positions in a word, so that all the words in a set can be compared simultaneously and the testing process is improved. The Table Lookup method constructs a data table and divide each word into sub-words to reduce the time complexity of the testing process. The Encoding method which is similar to Table Lookup method uses a linked list to store necessary information in addition. The proposed methods run in O(n) - O(log log n) times faster than the naive method when l = O(log n), where n is the number of code words in a set and l is the length of a word.


Keywords: DNA verification, DNA word design, code word verification, distance constraint, testing algorithm

  Retrieve PDF document (JISE_201002_01.pdf)