Paper
15 October 2012 Feature selection from short amino acid sequences in phosphorylation prediction problem
Jakub Węcławski, Stanisław Jankowski, Zbigniew Szymański
Author Affiliations +
Proceedings Volume 8454, Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2012; 84541W (2012) https://doi.org/10.1117/12.2001270
Event: Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2012, 2012, Wilga, Poland
Abstract
The paper describes solution of feature selection from amino acid sequences in phosphorylation prediction problem. We show that even for short sequences the variable selection leads to better classification performance. Moreover, the final simplicity of models allows for better data understanding and can be used by an expert for further analysis. The feature selection process is divided into two parts: i) the classification tree is used for finding the most relevant positions in amino acid sequences, ii) then the contrast pattern kernel is applied for pattern selection. This work summarizes the research made on classification of short amino acid sequences. The results of the research allowed us to propose a general scheme of amino acid sequence analysis.
© (2012) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jakub Węcławski, Stanisław Jankowski, and Zbigniew Szymański "Feature selection from short amino acid sequences in phosphorylation prediction problem", Proc. SPIE 8454, Photonics Applications in Astronomy, Communications, Industry, and High-Energy Physics Experiments 2012, 84541W (15 October 2012); https://doi.org/10.1117/12.2001270
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature selection

Image classification

Analytical research

Data modeling

Proteins

Biological research

Machine learning

RELATED CONTENT

Contrast pattern kernel for strings
Proceedings of SPIE (October 06 2011)
Network link prediction based on machine learning methods
Proceedings of SPIE (October 15 2021)
A review of contrast pattern based data mining
Proceedings of SPIE (July 06 2015)
Storage, data management, and retrieval in bioinformatics
Proceedings of SPIE (December 19 2001)

Back to Top