References
- Computational cluster validation in post-genomic data analysis


Ankerst, M., Breunig, M., Kriegel, H.-P., Sander, J. (1999) OPTICS: ordering points to identify the clustering structure. In Delis, A. (Ed.), et al. Proceedings of the 1999 International Conference on Management of Data. New York: ACM Press, pp. 49–60.

Bandyopadhyay, S. and Maulik, U. (2001) Nonparametric genetic clustering: comparison of validity indices. IEEE Trans. Syst. Man Cybernet., 31, 120–125.

Ben-Dor, A., Friedman, N., Yakhini, Z. (2002) Overabundance analysis and class discovery in gene expression data. Technical report. Palo Alto: Agilent Laboratories.

Ben-Hur, A., Elisseeff, A., Guyon, I. (2002) A stability based method for discovering structure in clustered data. In Altman, R.B. (Ed.), et al. Pacific Symposium on Biocomputing. New Jersey: World Scientific Publishing Co.

Bezdek, J. and Pal, N. (1998) Some new indexes of cluster validity. IEEE Trans. Syst. Man Cybernet., 28, 301–315.

Bilu, Y. and Linial, M. (2002) The advantage of functional prediction based on clustering of yeast genes and its correlation with non-sequence based classification. J. Comput. Biol., 9, 193–210.

Bittner, M., et al. (2000) Molecular classification of cutaneous malignant melanoma by gene expression profiling. Nature, 406, 536–540.

Bolshakova, N. and Azuaje, F. (2003) Cluster validation techniques for genome expression data. Signal Processing, 83, 825–833.

Bolshakova, N., et al. (2005) An integrated tool for microarray data clustering and cluster validity assessment. Bioinformatics, 21, 451–455.

Breckenridge, J. (1989) Replicating cluster analysis: method, consistency and validity. Multivar. Behav. Res., 24, 147–161.

Breckenridge, J. (2000) Validating cluster analysis: consistent replication and symmetry. Multivar. Behav. Res., 35, 261–285.

Datta, S. and Datta, S. (2003) Comparison and validation of statistical clustering techniques for microarray gene expression data. Bioinformatics, 19, 459–466.

Davies, D.L. and Bouldin, D.W. (1979) A cluster separation measure. IEEE Trans. Pattern Anal. Machine Intell., 1, 224–227.

Ding, C. and He, C. (2004) K-nearest neighbor consistency in data clustering: incorporating local information into global optimization. In Haddad, H.M. (Ed.), et al. Proceedings of the 2004 ACM Symposium on Applied Computing. New York: ACM Press, pp. 584–589.

Dubes, R. and Jain, A.K. (1979) Validity studies in clustering methodologies. Pattern Recog., 11, 235–254.

Duda, R.O., Hart, P.E., Stork, D.G. (2001) Pattern Classification, 2nd edn. John Wiley and Sons Ltd.

Dunn, J.C. (1974) Well separated clusters and fuzzy partitions. J. Cybernet., 4, 95–104.

Edwards, A.L. (1967) The Correlation Coefficient. W.H. Freeman, pp. 33–46.

Efron, B. and Tibshirani, R.J. (1993) An Introduction to the Bootstrap. Chapman and Hall.

Eisen, M.B., et al. (1998) Cluster analysis and display of genome-wide expression patterns. Proc. Natl Acad. Sci. USA, 95, 14863–14868.

Ester, M., Kriegel, H.-P., Sander, J. (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In Simoudis, E. (Ed.), et al. Proceedings of the Second International Conference on Knowledge Discovery and Data-Mining. Menlo Park: AAAI Press.

Estivill-Castro, V. (2002) Why so many clustering algorithms: a position paper. ACM SIGKDD Explor. Newslett., 4, 65–75.

Everitt, B.S. (1993) Cluster Analysis. Edward Arnold.

Fonseca, C.M. and Fleming, P.J. (1996) On the performance assessment and comparison of stochastic multiobjective optimizers. In Voigt, H.M. (Ed.), et al. Proceedings of the Fourth International Conference on Parallel Problem Solving from Nature. Berlin: Springer-Verlag, pp. 584–593.

Fridlyand, J. and Dudoit, S. (2001) Applications of resampling methods to estimate the number of clusters and to improve the accuracy of a clustering method. Technical report. Berkeley: Department of Statistics.

Gasch, A.P. and Eisen, M.B. (2002) Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering. Genome Biol., 3, 1–22.

Gat-Viks, I., et al. (2003) Scoring clustering solutions by their biological relevance. Bioinformatics, 19, 2381–2389.

Golub, T.R., et al. (1999) Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science, 286, 531–537.

Goodacre, R., et al. (1998) Rapid identification of urinary tract infection bacteria using hyperspectral whole organism fingerprinting and artificial neural networks. Microbiology, 144, 1157–1170.

Gordon, A.D. (1999) Classification, 2nd edn. Chapman and Hall.

Halkidi, M., et al. (2001) On clustering validation techniques. J. Intell. Inform. Syst., 17, 107–145.

Handl, J. and Knowles, J. (2005) Exploiting the trade-off—the benefits of multiple objectives in data clustering. In Coello Coello, C.A. (Ed.), et al. Proceedings of the Third International Conference on Evolutionary Multicriterion Optimization. Berlin: Springer-Verlag, pp. 547–560.

Hastie, T., et al. (2000) Gene shaving as a method for identifying distinct sets of genes with similar expression patterns. Genome Biol., 1, 1–21.

Hastie, T., Tibshirani, R., Friedman, J. (2001) The Elements of Statistical Learning: Data Mining, Inference and Prediction. Springer-Verlag.

Herrero, J., et al. (2001) A hierarchical unsupervised growing neural network for clustering gene expression data. Bioinformatics, 17, 126–136.

Hubert, L. and Arabie, P. (1985) Comparing partitions. J. Classif., 2, 193–218.

Jaccard, P. (1908) Nouvelles recherches sur la distribution florale. Bull. Soc. Vaud. Sci. Nat., 44, 223–270.

Jain, A.K., et al. (1999) Data clustering: a review. ACM Comput. Surv., 31, 264–323.

Jardine, N. and Sibson, R. (1971) Mathematical Taxonomy. John Wiley and Sons.

Kaplan, N., et al. (2004) A functional hierarchical organization of the protein sequence space. BMC Bioinformatics, 5.

Kell, D.B. and Oliver, S.G. (2004) Here is the evidence, now what is the hypothesis? The complementary roles of inductive and hypothesis-driven science in the post-genomic era. Bioessays, 26, 99–105.

Kerr, M.K. and Churchill, G.A. (2001) Bootstrapping cluster analysis: assessing the reliability of conclusions from microarray experiments. Proc. Natl Acad. Sci. USA, 98, 8961–8965.

Kohonen, T. (2001) Self-Organizing Maps. Springer Series in Information Sciences, Vol. 30. Springer-Verlag.

Krasnogor, N. and Pelta, D.A. (2004) Measuring the similarity of protein structures by means of the universal similarity metric. Bioinformatics, 20, 1015–1021.

Krieger, A.M. and Green, P. (1999) A cautionary note on using internal crossvalidation. Psychometrika, 64, 341–353.

Lange, T., et al. (2004) Stability-based validation of clustering solutions. Neural Comput., 16, 1299–1323.

Lehmann, E.L. and D'Abrera, H.J.M. (1998) Nonparametrics: Statistical Methods Based on Ranks. Prentice-Hall.

Levine, E. and Domany, E. (2001) Resampling method for unsupervised estimation of cluster validity. Neural Comput., 13, 2573–2593.

Li, C. and Wong, W.H. (2001) Model-based analysis of oligonucleotide arrays: model validation, design issues and standard error application. Genome Biol., 2, 1–11.

MacQueen, J. (1967) Some methods for classification and analysis of multivariate observations. In Le Cam, L.M. (Ed.), et al. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability. Berkeley: University of California Press, pp. 281–297.

Madeira, S.C. and Oliveira, A.L. (2004) Biclustering algorithms for biological data analysis: a survey. IEEE Trans. Comput. Biol. Bioinformatics, 1, 24–45.

McLachlan, G. and Krishnan, T. (1997) The EM Algorithm and Extensions. John Wiley and Sons Ltd.

McShane, L.M., et al. (2002) Methods for assessing reproducibility of clustering patterns observed in analyses of microarray data. Bioinformatics, 18, 1462–1469.

Mendes, P., et al. (2003) Artificial gene networks for objective comparison of analysis algorithms. Bioinformatics, 19, 122–129.

Michaud, D.J., et al. (2003) eXPatGen: generating dynamic expression patterns for the systematic evaluation of analytical methods. Bioinformatics, 19, 1140–1146.

Milligan, G.W. and Cooper, M.C. (1986) A study of the comparability of external criteria for hierarchical cluster analysis. Multivar. Behav. Res., 21, 441–458.

Pal, N.R. and Bezdek, J.C. (1995) On cluster validity for the fuzzy c-means model. IEEE Trans. Fuzzy Syst., 3, 370–379.

Pareto, V. (1971) Manual of Political Economy, 1971 Translation of 1927 Edition. Augustus M. Kelley.

Quackenbush, J. (2001) Computational analysis of microarray data. Nat. Rev. Genet., 2, 418–427.

Rand, W. (1971) Objective criteria for the evaluation of clustering methods. J. Am. Stat. Assoc., 66, 846–850.

Rayward-Smith, V.J., Osman, I.H., Reeves, C.R., Smith, G.D. (1996) Modern Heuristic Search Methods. John Wiley and Sons Ltd.

Romesburg, H.C. (1984) Cluster Analysis for Researchers. Belmont.

Rousseeuw, P.J. (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math., 20, 53–65.

Shaw, A.D., et al. (1997) Discrimination of the variety and region of origin of extra virgin olive oils using C-13 NMR and multivariate calibration with variable reduction. Anal. Chim. Acta, 384, 357–374.

Slonim, D.K. (2002) From patterns to pathways: gene expression data analysis comes of age. Nat. Genet., 32, 502–508.

De Smet, F., et al. (2002) Adaptive quality-based clustering of gene expression profiles. Bioinformatics, 18, 735–746.

Tamayo, P., et al. (1999) Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. Proc. Natl Acad. Sci. USA, 96, 2907–2912.

Tavazoie, S., et al. (1999) Systematic determination of genetic network architecture. Nat. Genet., 22, 281–285.

Tibshirani, R., Walther, G., Botstein, D., Brown, P. (2001a) Cluster validation by prediction strength. Technical report. Department of Statistics, Stanford University, CA.

Tibshirani, R., et al. (2001b) Estimating the number of clusters in a data set via the gap statistic. J. R. Stat. Soc. Ser. B, 63, 411–423.

Toronen, P. (2004) Selection of informative clusters from hierarchical cluster tree with gene classes. BMC Bioinformatics, 5, 34.

van Rijsbergen, C. (1979) Information Retrieval, 2nd edn. Butterworths.

Voorhees, E. (1985) The effectiveness and efficiency of agglomerative hierarchical clustering in document retrieval. PhD thesis, Department of Computer Science, Cornell University.

Yeung, K.Y., et al. (2001a) Validating clustering for gene expression data. Bioinformatics, 17, 309–318.

Yeung, K.Y., et al. (2001b) Model-based clustering and data transformation for gene expression data. Bioinformatics, 17, 977–987.


Updated on: 16 Aug 2006
