Getting poster data...
Hector Zenil, Venkateshan Kannan, Jesper Tegnér (Karolinska Institutet)In this poster we are proposing what we call DNA or gene walks in analogy to random walks. They are constructed by folding DNA on a rectangular lattice where each of the 4 nucleotides determines a direction: up, down, left and right. This makes the DNA to fold into an artificial one dimensional shape, unique for every sequence. We present a folding metric defined by the average Euclidian distance between a DNA sequence walk and a randomized walk, that is a random walk of the same gene length. These gene walks can be used to visualize nucleotide skewness, for example GC-content, and other statistical differences among coding and non-coding DNA regions. In this poster there is a sample of 10 genes from human chromosome X poor in G and C as can be seen directly from the figures. The gene walks are coloured from red to yellow to indicate direction only, blue segments are protein coding sequences (gene exons). Yet to investigate is if the magnitudes of the distance and the skewness of coding versus non-coding regions can group genes and other regions in any useful way.