LV EN

DEGREE

PROGRAMME

FACULTY

YEAR

LANGUAGE

KEYWORDS

Improvement of machine leaning algorithms performance by data set dimensionality reduction using cellular automata

A significant challenge in Machine Learning is dealing with high-dimensional data. Complexity knowns as the "curse of dimensionality" results in deterioration оf Machine Learning algorithms performance as the dimensionality and dataset size increases. Cellular automata are a dynamical discrete computational system with mathematical functions knows as rules that result in complex global behaviour. We used one-dimensional elementary cellular automata as a tool for dataset size. Model variables were selected for initial status vector generation and its further transformation to format that is suitable for cellular automata rules application known in cellular automata theory as configuration. Then model iterated through all possible cellular automata rules and various epochs variations were applied. Model performance for reduced dataset was compared with benchmark results of original dataset after standard dimensionality reduction technics used. It was concluded that applied cellular automata rules can be used as alternative methods for dataset size reduction without deteriorating model performance.

Author: Alexey Kuchvalskiy

Supervisor: Dmitry Pavlyuk

Degree: Master

Year: 2024

Work Language: English

Study programme: Computer Sciences

More...

Table View
Text View