Classifying Potentially Unbounded Hierarchical Data Streams with Incremental Gaussian Naive Bayes

Resumo


Hierarchical data stream classification inherits the properties and constraints of hierarchical classification and data stream classification concomitantly. Therefore, it requires novel approaches that (i) can handle class hierarchies, (ii) can be updated over time, and (iii) are computationally light-weighted regarding processing time and memory usage. In this study, we propose the Gaussian Naive Bayes for Hierarchical Data Streams (GNB-hDS) method: an incremental Gaussian Naive Bayes for classifying potentially unbounded hierarchical data streams. GNB-hDS uses statistical summaries of the data stream instead of storing actual instances. These statistical summaries allow more efficient data storage, keep constant computational time and memory, and calculate the probability of an instance belonging to a specific class via the Bayes’ Theorem. We compare our method against a technique that stores raw instances, and results show that our method obtains equivalent prediction rates while being significantly faster.
Palavras-chave: Hierarchical classification, Data stream classification, Gaussian Naive Bayes, Incremental learning
Publicado
29/11/2021
TIEPPO, Eduardo; BARDDAL, Jean Paul; NIEVOLA, Júlio Cesar. Classifying Potentially Unbounded Hierarchical Data Streams with Incremental Gaussian Naive Bayes. In: BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 10. , 2021, Online. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2021 . ISSN 2643-6264.