Implicitly Controlling Bloat in Genetic Programming

During the evolution of solutions using genetic programming (GP) there is generally an increase in average tree size without a corresponding increase in fitness-a phenomenon commonly referred to as bloat. Although previously studied from theoretical an…

During the evolution of solutions using genetic programming (GP) there is generally an increase in average tree size without a corresponding increase in fitness-a phenomenon commonly referred to as bloat. Although previously studied from theoretical and practical viewpoints there has been little progress in deriving controls for bloat which do not explicitly refer to tree size. Here, the use of spatial population structure in combination with local elitist replacement is shown to reduce bloat without a subsequent loss of performance. Theoretical concepts regarding inbreeding and the role of elitism are used to support the described approach. The proposed system behavior is confirmed via extensive computer simulations on benchmark problems. The main practical result is that by placing a population on a torus, with selection defined by a Moore neighborhood and local elitist replacement, bloat can be substantially reduced without compromising performance.

Clustering of protein expression data: a benchmark of statistical and neural approaches

Abstract  Clustering issues are fundamental to exploratory analysis of bioinformatics data. This process may follow algorithms that
are reproducible but make assumptions about, for instance, the ability to estimate the global structure by su…

Abstract  

Clustering issues are fundamental to exploratory analysis of bioinformatics data. This process may follow algorithms that
are reproducible but make assumptions about, for instance, the ability to estimate the global structure by successful local
agglomeration or alternatively, they use pattern recognition methods that are sensitive to the initial conditions. This paper
reviews two clustering methodologies and highlights the differences that result from the changes in data representation, applied
to a protein expression data set for breast cancer (n = 1,076). The two clustering methodologies are a reproducible approach to model-free clustering and a probabilistic competitive
neural network. The results from the two methods are compared with existing studies of the same data set, and the preferred
clustering solutions are profiled for clinical interpretation.

  • Content Type Journal Article
  • DOI 10.1007/s00500-010-0596-9
  • Authors
    • I. H. Jarman, Liverpool John Moores University School of Computing and Mathematical Sciences Liverpool UK
    • T. A. Etchells, Liverpool John Moores University School of Computing and Mathematical Sciences Liverpool UK
    • D. Bacciu, University of Pisa Department of Computer Science Pisa Italy
    • J. M. Garibaldi, University of Nottingham School of Computer Science Nottingham UK
    • I. O. Ellis, University of Nottingham Department of Histopathology, School of Molecular Medical Sciences Nottingham UK
    • P. J. G. Lisboa, Liverpool John Moores University School of Computing and Mathematical Sciences Liverpool UK

Simulated annealing for supervised gene selection

Abstract  Genomic data, and more generally biomedical data, are often characterized by high dimensionality. An input selection procedure
can attain the two objectives of highlighting the relevant variables (genes) and possibly improving clas…

Abstract  

Genomic data, and more generally biomedical data, are often characterized by high dimensionality. An input selection procedure
can attain the two objectives of highlighting the relevant variables (genes) and possibly improving classification results.
In this paper, we propose a wrapper approach to gene selection in classification of gene expression data using simulated annealing
along with supervised classification. The proposed approach can perform global combinatorial searches through the space of
all possible input subsets, can handle cases with numerical, categorical or mixed inputs, and is able to find (sub-)optimal
subsets of inputs giving low classification errors. The method has been tested on publicly available bioinformatics data sets
using support vector machines and on a mixed type data set using classification trees. We also propose some heuristics able
to speed up the convergence. The experimental results highlight the ability of the method to select minimal sets of relevant
features.

  • Content Type Journal Article
  • DOI 10.1007/s00500-010-0597-8
  • Authors
    • Maurizio Filippone, University of Glasgow Department of Computing Science Sir Alwyn Williams Building G12 8QQ Glasgow UK
    • Francesco Masulli, University of Genova Department of Computer and Information Sciences Genoa Italy
    • Stefano Rovetta, University of Genova Department of Computer and Information Sciences Genoa Italy

Computing transparently: the independent sets in a graph

Abstract  A procedure is given for finding the independent sets in an undirected graph by xeroxing onto transparent plastic sheets.
Let an undirected graph having n vertices and m edges be given. A list of all the independent subsets of the …

Abstract  

A procedure is given for finding the independent sets in an undirected graph by xeroxing onto transparent plastic sheets.
Let an undirected graph having n vertices and m edges be given. A list of all the independent subsets of the set of vertices of the graph is constructed by using a xerox machine in a manner that requires
the formation of only n + m + 1 successive transparencies. An accompanying list of the counts of the elements in each independent set is then constructed
using only O(n
2) additional transparencies. The list with counts provides a list of all maximum independent sets. This gives an O(n
2) step solution for the classical problem of finding the cardinality of a maximal independent set in a graph. The applicability
of these procedures is limited, of course, by the increase in the information density on the transparencies when n is large. Our ultimate purpose here is to give hand tested ‘ultra parallel’ algorithmic procedures that may prove suitable for realization
using future optical technologies.

  • Content Type Journal Article
  • Pages 1-10
  • DOI 10.1007/s11047-010-9186-0
  • Authors
    • Tom Head, Binghamton University Mathematical Sciences Binghamton NY 13902-6000 USA

A relation by palindromic subwords

Abstract  We define a new relation on words by a finite series of insertions and/or deletions of palindromic subwords. In particular
we concentrate on insertion or deletion of Watson–Crick palindromes. We show that the new relation ∼θ i…

Abstract  

We define a new relation on words by a finite series of insertions and/or deletions of palindromic subwords. In particular
we concentrate on insertion or deletion of Watson–Crick palindromes. We show that the new relation ∼θ is, in fact, an equivalence relation where θ is any arbitrary antimorphic involution that is not the identity on the letters
of the alphabet. We also show that the set of all θ-palindromic free words defined in (Daley et al. in preparation) is ∼θ-independent. Using the relation we define a new subclass of primitive words which we call as ∼θ-primitive words and show that the class of all ∼θ-primitive words is closed under circular permutations. We also define ∼θ-conjugates and ∼θ-commutativity and study the properties of such words and show that they are similar to that of conjugate words and words
that commute.

  • Content Type Journal Article
  • DOI 10.1007/s11047-010-9179-z
  • Authors
    • Mark Daley, Department of Computer Science and Department of Biology, University of Western Ontario, London, ON N6A 5B7, Canada
    • Kalpana Mahalingam, Department of Mathematics, Indian Institute of Technology, Chennai, TN 600 042, India

An empirical evaluation of P system testing techniques

Abstract  This paper presents the existing techniques for P system testing and performs an empirical evaluation of their fault-detection
efficiency. The comparison is performed using mutation testing and, based on the results obtained, some …

Abstract  

This paper presents the existing techniques for P system testing and performs an empirical evaluation of their fault-detection
efficiency. The comparison is performed using mutation testing and, based on the results obtained, some improved testing methodologies
are proposed.

  • Content Type Journal Article
  • Pages 1-15
  • DOI 10.1007/s11047-010-9188-y
  • Authors
    • Raluca Lefticaru, University of Pitesti Department of Computer Science Str Targu din Vale 1 110040 Pitesti Romania
    • Marian Gheorghe, University of Pitesti Department of Computer Science Str Targu din Vale 1 110040 Pitesti Romania
    • Florentin Ipate, University of Pitesti Department of Computer Science Str Targu din Vale 1 110040 Pitesti Romania

Continuous-action reinforcement learning with fast policy search and adaptive basis function selection

Abstract  As an important approach to solving complex sequential decision problems, reinforcement learning (RL) has been widely studied
in the community of artificial intelligence and machine learning. However, the generalization ability of …

Abstract  

As an important approach to solving complex sequential decision problems, reinforcement learning (RL) has been widely studied
in the community of artificial intelligence and machine learning. However, the generalization ability of RL is still an open
problem and it is difficult for existing RL algorithms to solve Markov decision problems (MDPs) with both continuous state
and action spaces. In this paper, a novel RL approach with fast policy search and adaptive basis function selection, which
is called Continuous-action Approximate Policy Iteration (CAPI), is proposed for RL in MDPs with both continuous state and
action spaces. In CAPI, based on the value functions estimated by temporal-difference learning, a fast policy search technique
is suggested to search for optimal actions in continuous spaces, which is computationally efficient and easy to implement.
To improve the generalization ability and learning efficiency of CAPI, two adaptive basis function selection methods are developed
so that sparse approximation of value functions can be obtained efficiently both for linear function approximators and kernel
machines. Simulation results on benchmark learning control tasks with continuous state and action spaces show that the proposed
approach not only can converge to a near-optimal policy in a few iterations but also can obtain comparable or even better
performance than Sarsa-learning, and previous approximate policy iteration methods such as LSPI and KLSPI.

  • Content Type Journal Article
  • DOI 10.1007/s00500-010-0581-3
  • Authors
    • Xin Xu, National University of Defense Technology College of Mechatronics and Automation, Institute of Automation ChangSha Hunan 410073 People’s Republic of China
    • Chunming Liu, National University of Defense Technology College of Mechatronics and Automation, Institute of Automation ChangSha Hunan 410073 People’s Republic of China
    • Dewen Hu, National University of Defense Technology College of Mechatronics and Automation, Institute of Automation ChangSha Hunan 410073 People’s Republic of China

A computational modeling for real ecosystems based on P systems

Abstract  In this paper, a P systems based general framework for modeling ecosystems dynamics is presented. Particularly, ecosystems
are specified by means of multienvironment P systems composed of a finite number of environments, each of th…

Abstract  

In this paper, a P systems based general framework for modeling ecosystems dynamics is presented. Particularly, ecosystems
are specified by means of multienvironment P systems composed of a finite number of environments, each of them having an extended
P system with active membranes. The semantics is of a probabilistic type and it is implemented by assigning each rule of the
system a probabilistic constant which depends on the environment and the run time. As a case study, two real ecosystems are
described: scavenger birds in the Catalan Pyrenees and the zebra mussel (Dreissena Polymorpha) in Ribarroja reservoir (Spain).

  • Content Type Journal Article
  • Pages 1-15
  • DOI 10.1007/s11047-010-9191-3
  • Authors
    • Mónica Cardona, University of Lleida Department of Mathematics Av. Alcalde Rovira Roure, 191 25198 Lleida Spain
    • M. Angels Colomer, University of Lleida Department of Mathematics Av. Alcalde Rovira Roure, 191 25198 Lleida Spain
    • Antoni Margalida, Bearded Vulture Study & Protection Group Adpo. 43 25520 El Pont de Suert (Lleida) Spain
    • Antoni Palau, Dirección de Medio Ambiente y Desarrollo Sostenible Endesa Spain
    • Ignacio Pérez-Hurtado, University of Sevilla Research Group on Natural Computing, Department of Computer Science and Artificial Intelligence Avda. Reina Mercedes s/n 41012 Sevilla Spain
    • Mario J. Pérez-Jiménez, University of Sevilla Research Group on Natural Computing, Department of Computer Science and Artificial Intelligence Avda. Reina Mercedes s/n 41012 Sevilla Spain
    • Delfí Sanuy, University of Lleida Department of Animal Production Av. Alcalde Rovira Roure, 191 25198 Lleida Spain

A dynamic game of coalition formation under ambiguity

Abstract  In a previous paper, we generalized to the mixed strategy case the γ model of coalition formation (introduced by Hart and
Kurz in Econometrica 51(4):1047–1064, 1983) for situations in which players have ambiguous expectations ab…

Abstract  

In a previous paper, we generalized to the mixed strategy case the γ model of coalition formation (introduced by Hart and
Kurz in Econometrica 51(4):1047–1064, 1983) for situations in which players have ambiguous expectations about the formation of the coalitions in which they are not
involved; then we analyzed the corresponding evolutionary games. In this paper, we embody into the model rationality of the
players; it follows that allowing for mixed strategies makes it impossible to construct unequivocally a von Neumann–Morgestein
expected utility function coherent (in the sense of de Finetti B in Sul Significato Soggettivo della Probabilità, Fundamenta Mathematicae, T, vol XVIII, pp
298–329, 1931) to every strategy profile. We find out that if the multiplicity of coherent beliefs problem is approached by considering
“ambiguity loving” players then existence results for classical static equilibria can be obtained in this model. Moreover,
we provide conditions for the game to be dynamically playable and we find how the coalition structure beliefs might evolve
coherently (according) to the evolution of the strategies.

  • Content Type Journal Article
  • DOI 10.1007/s00500-010-0573-3
  • Authors
    • Giuseppe De Marco, Università di Napoli Parthenope Dipartimento di Statistica e Matematica per la Ricerca Economica Via Medina 40 80133 Naples Italy
    • Maria Romaniello, Seconda Università di Napoli Dipartimento di Strategie Aziendali e Metodologie Quantitative Corso Gran Priorato di Malta 81043 Capua Italy