Clustering of protein expression data: a benchmark of statistical and neural approaches

Abstract  Clustering issues are fundamental to exploratory analysis of bioinformatics data. This process may follow algorithms that
are reproducible but make assumptions about, for instance, the ability to estimate the global structure by su…

Abstract  

Clustering issues are fundamental to exploratory analysis of bioinformatics data. This process may follow algorithms that
are reproducible but make assumptions about, for instance, the ability to estimate the global structure by successful local
agglomeration or alternatively, they use pattern recognition methods that are sensitive to the initial conditions. This paper
reviews two clustering methodologies and highlights the differences that result from the changes in data representation, applied
to a protein expression data set for breast cancer (n = 1,076). The two clustering methodologies are a reproducible approach to model-free clustering and a probabilistic competitive
neural network. The results from the two methods are compared with existing studies of the same data set, and the preferred
clustering solutions are profiled for clinical interpretation.

  • Content Type Journal Article
  • DOI 10.1007/s00500-010-0596-9
  • Authors
    • I. H. Jarman, Liverpool John Moores University School of Computing and Mathematical Sciences Liverpool UK
    • T. A. Etchells, Liverpool John Moores University School of Computing and Mathematical Sciences Liverpool UK
    • D. Bacciu, University of Pisa Department of Computer Science Pisa Italy
    • J. M. Garibaldi, University of Nottingham School of Computer Science Nottingham UK
    • I. O. Ellis, University of Nottingham Department of Histopathology, School of Molecular Medical Sciences Nottingham UK
    • P. J. G. Lisboa, Liverpool John Moores University School of Computing and Mathematical Sciences Liverpool UK

Simulated annealing for supervised gene selection

Abstract  Genomic data, and more generally biomedical data, are often characterized by high dimensionality. An input selection procedure
can attain the two objectives of highlighting the relevant variables (genes) and possibly improving clas…

Abstract  

Genomic data, and more generally biomedical data, are often characterized by high dimensionality. An input selection procedure
can attain the two objectives of highlighting the relevant variables (genes) and possibly improving classification results.
In this paper, we propose a wrapper approach to gene selection in classification of gene expression data using simulated annealing
along with supervised classification. The proposed approach can perform global combinatorial searches through the space of
all possible input subsets, can handle cases with numerical, categorical or mixed inputs, and is able to find (sub-)optimal
subsets of inputs giving low classification errors. The method has been tested on publicly available bioinformatics data sets
using support vector machines and on a mixed type data set using classification trees. We also propose some heuristics able
to speed up the convergence. The experimental results highlight the ability of the method to select minimal sets of relevant
features.

  • Content Type Journal Article
  • DOI 10.1007/s00500-010-0597-8
  • Authors
    • Maurizio Filippone, University of Glasgow Department of Computing Science Sir Alwyn Williams Building G12 8QQ Glasgow UK
    • Francesco Masulli, University of Genova Department of Computer and Information Sciences Genoa Italy
    • Stefano Rovetta, University of Genova Department of Computer and Information Sciences Genoa Italy

SIGEVOlution Volume 4, Issue 4, is now available

The SIGEVOlution newsletter Volume 4 Issue 4 is now available for download from: http://www.sigevolution.orgThe new issue features:Galactic Arms Race by Erin J. Hastings & Kenneth O. StanleyA Perl Primer for EA Practitioners by Juan-Julián MereloN…

The SIGEVOlution newsletter Volume 4 Issue 4 is now available for download from: http://www.sigevolution.org
The new issue features:
  • Galactic Arms Race by Erin J. Hastings & Kenneth O. Stanley
  • A Perl Primer for EA Practitioners by Juan-Julián Merelo
  • New issues of journals
  • Calls & calendar
The newsletter is intended to be viewed electronically.
Thanks to Pier Luca Lanzi, SIGEvolution Editor-in-Chief.

Computing transparently: the independent sets in a graph

Abstract  A procedure is given for finding the independent sets in an undirected graph by xeroxing onto transparent plastic sheets.
Let an undirected graph having n vertices and m edges be given. A list of all the independent subsets of the …

Abstract  

A procedure is given for finding the independent sets in an undirected graph by xeroxing onto transparent plastic sheets.
Let an undirected graph having n vertices and m edges be given. A list of all the independent subsets of the set of vertices of the graph is constructed by using a xerox machine in a manner that requires
the formation of only n + m + 1 successive transparencies. An accompanying list of the counts of the elements in each independent set is then constructed
using only O(n
2) additional transparencies. The list with counts provides a list of all maximum independent sets. This gives an O(n
2) step solution for the classical problem of finding the cardinality of a maximal independent set in a graph. The applicability
of these procedures is limited, of course, by the increase in the information density on the transparencies when n is large. Our ultimate purpose here is to give hand tested ‘ultra parallel’ algorithmic procedures that may prove suitable for realization
using future optical technologies.

  • Content Type Journal Article
  • Pages 1-10
  • DOI 10.1007/s11047-010-9186-0
  • Authors
    • Tom Head, Binghamton University Mathematical Sciences Binghamton NY 13902-6000 USA

A relation by palindromic subwords

Abstract  We define a new relation on words by a finite series of insertions and/or deletions of palindromic subwords. In particular
we concentrate on insertion or deletion of Watson–Crick palindromes. We show that the new relation ∼θ i…

Abstract  

We define a new relation on words by a finite series of insertions and/or deletions of palindromic subwords. In particular
we concentrate on insertion or deletion of Watson–Crick palindromes. We show that the new relation ∼θ is, in fact, an equivalence relation where θ is any arbitrary antimorphic involution that is not the identity on the letters
of the alphabet. We also show that the set of all θ-palindromic free words defined in (Daley et al. in preparation) is ∼θ-independent. Using the relation we define a new subclass of primitive words which we call as ∼θ-primitive words and show that the class of all ∼θ-primitive words is closed under circular permutations. We also define ∼θ-conjugates and ∼θ-commutativity and study the properties of such words and show that they are similar to that of conjugate words and words
that commute.

  • Content Type Journal Article
  • DOI 10.1007/s11047-010-9179-z
  • Authors
    • Mark Daley, Department of Computer Science and Department of Biology, University of Western Ontario, London, ON N6A 5B7, Canada
    • Kalpana Mahalingam, Department of Mathematics, Indian Institute of Technology, Chennai, TN 600 042, India

SIGEVOlution Volume 4 Issue 4

The new issue of SIGEVOlution is now available for you to download from: http://www.sigevolution.org The issue features: Galactic Arms Race by Erin J. Hastings and Kenneth O. Stanley A Perl Primer for EA Practitioners by Juan-Julián Merelo New issues of … Continue reading

Galactic Arms Race

The new issue of SIGEVOlution is now available for you to download from:
http://www.sigevolution.org

The issue features:

  • Galactic Arms Race by Erin J. Hastings and Kenneth O. Stanley
  • A Perl Primer for EA Practitioners by Juan-Julián Merelo
  • New issues of journals
  • Calls & calendar

The newsletter is intended to be viewed electronically.

Pier Luca Lanzi (EIC)

SIGEVOlution Volume 4 Issue 4

The new issue of SIGEVOlution is now available for you to download from:
http://www.sigevolution.org
The issue features:

Galactic Arms Race by Erin J. Hastings and Kenneth O. Stanley
A Perl Primer for EA Practitioners by Juan-Julián Merelo
New issues of journals
Calls & calendar

The newsletter is intended to be viewed electronically.
Pier Luca Lanzi (EIC)
Related Posts

Galactic Arms Race

The new issue of SIGEVOlution is now available for you to download from:
http://www.sigevolution.org

The issue features:

  • Galactic Arms Race by Erin J. Hastings and Kenneth O. Stanley
  • A Perl Primer for EA Practitioners by Juan-Julián Merelo
  • New issues of journals
  • Calls & calendar

The newsletter is intended to be viewed electronically.

Pier Luca Lanzi (EIC)