•  
  •  
 

Abstract

Study on Generalization Capability of Support Vector Machine in Splice Site Type Recognition of DNA Sequence. Recently, support vector machine has become a popular model as machine learning. A particular advantage of SVM over other machine learning is that it can be analyzed theoretically and at same time can achieve a good performance when applied to real problems. This paper will describe analytically the using of SVM to solve pattern recognition problem with a preliminary case study in determining the type of splice site on the DNA sequence, particularity on the generalization capability. The result obtained show that SVM has a good generalization capability of around 95.4 %.

References

[1] J. Moody, C. Darken, Neural Computation 1 (1989) 281. [2] H. Ogawa, Proceeding of International Conference on Intelligent Information Processing System , Beijing, RRC, 1992. [3] C.J.C. Burges, Data Mining and Knowledge Discovery 2 (1998) 955. [4] M.A. Hearst, B. Schölkopf, S. Dumais. E. Osuna, J. Platt, IEEE Intelligent Systems. 13 (1998) 18. [5] N. Cristianini, J. Shawe-Taylor, An Introduction to Support Vector Machines and other kernel-based learning method, Cambridge University Press, New York, 2000. [6] J. Shawe-Taylor, N. Cristianini, Kernel Methods for Pattern Analysis, Cambridge University Press, New York, 2004. [7] V.N. Vapnik, The Nature of Statistical Learning Theory, Springer, New York, 1999. [8] M. Yamamura, O. Gotoh, Genome Informatics. 14 (2003) 426. [9] Molecular Biology Data Base, http://www.ics.edu/~mlearn/ Mlsummary. html, 2004

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.