Selected ETH Polymer Physics publications

by
abstracts hide pdf's show images
matching keyword & author

1 selected entry
Article   A.N. Gorban, T.G. Popova, A.Y. Zinovyev
Self-Organizing Approach for Automated Gene Identification
Open Sys. Information Dyn. 10 (2003) 1-13
Self-training technique for automated gene recognition both in entire genomes and in unassembled ones is proposed. It is based on a simple measure (namely, the vector of frequencies of non-overlapping triplets in sliding window), and needs neither predetermined information, nor preliminary learning. The sliding window length is the only one tuning parameter. It should be chosen close to the average exon length typical to the DNA text under investigation. An essential feature of the technique proposed is preliminary visualization of the set of vectors in the subspace of the first three principal components. It was shown, the distribution of DNA sites has the bullet-like structure with one central cluster (corresponding to non-coding sites) and three or six ank ones (corresponding to protein-coding sites). The bullet-like structure itself revealed in the distribution seems to be very interesting illustration of triplet usage in DNA sequence. The method was examined on several genomes (mitochondrion of P.wickerhamii, bacteria C.crescentus and primitive eukaryot S.cerevisiae). The percentage of truly predicted nucleotides exceeds 90%.


for LaTeX users
@article{ANGorban2003-10,
 author = {A. N. Gorban and T. G. Popova and A. Y. Zinovyev},
 title = {Self-Organizing Approach for Automated Gene Identification},
 journal = {Open Sys. Information Dyn.},
 volume = {10},
 pages = {1-13},
 year = {2003}
}

\bibitem{ANGorban2003-10} A.N. Gorban, T.G. Popova, A.Y. Zinovyev,
Self-Organizing Approach for Automated Gene Identification,
Open Sys. Information Dyn. {\bf 10} (2003) 1-13.

ANGorban2003-10
A.N. Gorban, T.G. Popova, A.Y. Zinovyev
Self-Organizing Approach for Automated Gene Identification
Open Sys. Information Dyn.,10,2003,1-13


© 19 May 2024 mk@mat.ethz.ch      1 out of 816 entries requested [H-factor to-date: > 0]