Automatic Reproduction of a Genius Algorithm: Strassen’s Algorithm Revisited by Genetic Search

In 1968, Volker Strassen, a young German mathematician, announced a clever algorithm to reduce the asymptotic complexity of nÿn matrix multiplication from the order of n 3 to n 2.81. It soon became one of the most famous scientific discoveries in the …

In 1968, Volker Strassen, a young German mathematician, announced a clever algorithm to reduce the asymptotic complexity of nÿn matrix multiplication from the order of n 3 to n 2.81. It soon became one of the most famous scientific discoveries in the 20th century and provoked numerous studies by other mathematicians to improve upon it. Although a number of improvements have been made, Strassen’s algorithm is still optimal in his original framework, the bilinear systems of 2 ÿ 2 matrix multiplication, and people are still curious how Strassen developed his algorithm. We examined it to see if we could automatically reproduce Strassen’s discovery using a search algorithm and find other algorithms of the same quality. In total, we found 608 algorithms that have the same quality as Strassen’s, including Strassen’s original algorithm. We partitioned the algorithms into nine different groups based on the way they are constructed. This paper was made possible by the combination of genetic search and linear-algebraic techniques. To the best of our knowledge, this is the first work that automatically reproduced Strassen’s algorithm, and furthermore, discovered new algorithms with equivalent asymptotic complexity using a search algorithm.

The CMA-ES on Riemannian Manifolds to Reconstruct Shapes in 3-D Voxel Images

The covariance matrix adaptation evolution strategy (CMA-ES) has been successfully used to minimize functionals on vector spaces. We generalize the concept of the CMA-ES to Riemannian manifolds and evaluate its performance in two experiments. First, we…

The covariance matrix adaptation evolution strategy (CMA-ES) has been successfully used to minimize functionals on vector spaces. We generalize the concept of the CMA-ES to Riemannian manifolds and evaluate its performance in two experiments. First, we minimize synthetic functionals on the 2-D sphere. Second, we consider the reconstruction of shapes in 3-D voxel data. A novel formulation of this problem leads to the minimization of edge and region-based segmentation functionals on the Riemannian manifold of parametric 3-D medial axis representation. We compare the results to gradient-based methods on manifolds and particle swarm optimization on tangent spaces and differential evolution.

Computational Evolutionary Embryogeny

Evolutionary and developmental processes are used to evolve the configurations of 3-D structures in silico to achieve desired performances. Natural systems utilize the combination of both evolution and development processes to produce remarkable perfor…

Evolutionary and developmental processes are used to evolve the configurations of 3-D structures in silico to achieve desired performances. Natural systems utilize the combination of both evolution and development processes to produce remarkable performance and diversity. However, this approach has not yet been applied extensively to the design of continuous 3-D load-supporting structures. Beginning with a single artificial cell containing information analogous to a DNA sequence, a structure is grown according to the rules encoded in the sequence. Each artificial cell in the structure contains the same sequence of growth and development rules, and each artificial cell is an element in a finite element mesh representing the structure of the mature individual. Rule sequences are evolved over many generations through selection and survival of individuals in a population. Modularity and symmetry are visible in nearly every natural and engineered structure. An understanding of the evolution and expression of symmetry and modularity is emerging from recent biological research. Initial evidence of these attributes is present in the phenotypes that are developed from the artificial evolution, although neither characteristic is imposed nor selected-for directly. The computational evolutionary development approach presented here shows promise for synthesizing novel configurations of high-performance systems. The approach may advance the system design to a new paradigm, where current design strategies have difficulty producing useful solutions.

Nonlinear Network Optimization—An Embedding Vector Space Approach

This paper proposes a normed-space vector representation of networks which allows defining evolutionary operators for network optimization that resemble continuous-space operators. These operators are employed here to build a genetic algorithm which be…

This paper proposes a normed-space vector representation of networks which allows defining evolutionary operators for network optimization that resemble continuous-space operators. These operators are employed here to build a genetic algorithm which becomes generic for the optimization of tree networks, without the requirement of any special encoding scheme. Such a genetic algorithm has been compared with several encoding-based genetic algorithms, on 25 and 50-node instances of the optimal communication spanning tree and of the quadratic minimum spanning tree, and has been shown to outperform all other algorithms in a stochastic dominance analysis. The proposed approach has also been applied to an electric power distribution network design (a multibranch problem), outperforming the results presented in a former reference (which have been obtained with an Ant Colony algorithm). The results of some landscape dispersion analysis suggest that the proposed normed-space network vector representation is analogous to some continuous-variable space dilation operations, which define favorable space coordinates for optimization.

On the Importance of Data Balancing for Symbolic Regression

Symbolic regression of input-output data conventionally treats data records equally. We suggest a framework for automatic assignment of weights to data samples, which takes into account the sample’s relative importance. In this paper, we study the poss…

Symbolic regression of input-output data conventionally treats data records equally. We suggest a framework for automatic assignment of weights to data samples, which takes into account the sample’s relative importance. In this paper, we study the possibilities of improving symbolic regression on real-life data by incorporating weights into the fitness function. We introduce four weighting schemes defining the importance of a point relative to proximity, surrounding, remoteness, and nonlinear deviation from k nearest-in-the-input-space neighbors. For enhanced analysis and modeling of large imbalanced data sets we introduce a simple multidimensional iterative technique for subsampling. This technique allows a sensible partitioning (and compression) of data to nested subsets of an arbitrary size in such a way that the subsets are balanced with respect to either of the presented weighting schemes. For cases where a given input-output data set contains some redundancy, we suggest an approach to considerably improve the effectiveness of regression by applying more modeling effort to a smaller subset of the data set that has a similar information content. Such improvement is achieved due to better exploration of the search space of potential solutions at the same number of function evaluations. We compare different approaches to regression on five benchmark problems with a fixed budget allocation. We demonstrate that the significant improvement in the quality of the regression models can be obtained either with the weighted regression, exploratory regression using a compressed subset with a similar information content, or exploratory weighted regression on the compressed subset, which is weighted with one of the proposed weighting schemes.

A Favorable Weight-Based Evolutionary Algorithm for Multiple Criteria Problems

In this paper, we present a favorable weight-based evolutionary algorithm for multiple criteria problems. The algorithm tries to both approximate the Pareto frontier and evenly distribute the solutions over the frontier. These two goals are common for …

In this paper, we present a favorable weight-based evolutionary algorithm for multiple criteria problems. The algorithm tries to both approximate the Pareto frontier and evenly distribute the solutions over the frontier. These two goals are common for many multiobjective evolutionary algorithms. To achieve these goals in our algorithm, each member selects its own weights for a weighted Tchebycheff distance function to define its fitness score. The fitness scores favor solutions that are closer to the Pareto frontier and that are located at underrepresented regions. We compare the performance of the algorithm with two leading evolutionary algorithms on various continuous test problems having different number of criteria.

Implicitly Controlling Bloat in Genetic Programming

During the evolution of solutions using genetic programming (GP) there is generally an increase in average tree size without a corresponding increase in fitness-a phenomenon commonly referred to as bloat. Although previously studied from theoretical an…

During the evolution of solutions using genetic programming (GP) there is generally an increase in average tree size without a corresponding increase in fitness-a phenomenon commonly referred to as bloat. Although previously studied from theoretical and practical viewpoints there has been little progress in deriving controls for bloat which do not explicitly refer to tree size. Here, the use of spatial population structure in combination with local elitist replacement is shown to reduce bloat without a subsequent loss of performance. Theoretical concepts regarding inbreeding and the role of elitism are used to support the described approach. The proposed system behavior is confirmed via extensive computer simulations on benchmark problems. The main practical result is that by placing a population on a torus, with selection defined by a Moore neighborhood and local elitist replacement, bloat can be substantially reduced without compromising performance.

A Novel Set-Based Particle Swarm Optimization Method for Discrete Optimization Problems

Particle swarm optimization (PSO) is predominately used to find solutions for continuous optimization problems. As the operators of PSO are originally designed in an n-dimensional continuous space, the advancement of using PSO to find solutions in a di…

Particle swarm optimization (PSO) is predominately used to find solutions for continuous optimization problems. As the operators of PSO are originally designed in an n-dimensional continuous space, the advancement of using PSO to find solutions in a discrete space is at a slow pace. In this paper, a novel set-based PSO (S-PSO) method for the solutions of some combinatorial optimization problems (COPs) in discrete space is presented. The proposed S-PSO features the following characteristics. First, it is based on using a set-based representation scheme that enables S-PSO to characterize the discrete search space of COPs. Second, the candidate solution and velocity are defined as a crisp set, and a set with possibilities, respectively. All arithmetic operators in the velocity and position updating rules used in the original PSO are replaced by the operators and procedures defined on crisp sets, and sets with possibilities in S-PSO. The S-PSO method can thus follow a similar structure to the original PSO for searching in a discrete space. Based on the proposed S-PSO method, most of the existing PSO variants, such as the global version PSO, the local version PSO with different topologies, and the comprehensive learning PSO (CLPSO), can be extended to their corresponding discrete versions. These discrete PSO versions based on S-PSO are tested on two famous COPs: the traveling salesman problem and the multidimensional knapsack problem. Experimental results show that the discrete version of the CLPSO algorithm based on S-PSO is promising.