Gene Pre-Processing Methodology In Cancer Identification System
Abstract
A novel pre-processing method for gene expression data is proposed, which emphasizes the filtering and normalization steps, since these steps determine the set of probes used in the subsequent analyses. Two parameters are set during filtering. First, a cut-off is established for the detection p-value: a probe is considered present in a sample if its detection p-value is smaller than the cut-off. Second, a threshold is established on the number of samples: for a probe to be included in the dataset, it must be present in at least that number of samples. The data can then be normalized after filtering. Several normalization methods are compared, and for the dataset studied, normalizing the data on the original scale produces the most stable results.
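The two-parameter filtering step described above can be illustrated with a minimal sketch. It assumes the detection p-values are available as a mapping from probe ID to one p-value per sample. The function name filter_probes, the parameter names p_cutoff and min_present, the default values, and the toy data are all hypothetical and chosen only for illustration; they are not taken from the paper.

```python
def filter_probes(detection_pvalues, p_cutoff=0.01, min_present=2):
    """Return the IDs of probes that pass the two-parameter filter.

    detection_pvalues: dict mapping probe ID -> list of detection
    p-values, one per sample (assumed input layout).
    p_cutoff: detection p-value cut-off (illustrative default).
    min_present: minimum number of samples in which the probe must
    be present (illustrative default).
    """
    kept = []
    for probe_id, pvalues in detection_pvalues.items():
        # A probe counts as present in a sample when its detection
        # p-value is smaller than the cut-off.
        n_present = sum(1 for p in pvalues if p < p_cutoff)
        # Keep the probe only if it is present in enough samples.
        if n_present >= min_present:
            kept.append(probe_id)
    return kept


# Toy data: three probes measured in three samples (made-up values).
toy = {
    "probe_A": [0.001, 0.002, 0.300],
    "probe_B": [0.200, 0.004, 0.500],
    "probe_C": [0.000, 0.000, 0.001],
}
kept = filter_probes(toy, p_cutoff=0.01, min_present=2)
print(kept)
```

With these toy values, probe_B is present in only one sample and is dropped, while the other two probes are retained. Raising min_present makes the filter stricter, so fewer probes reach the normalization step.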