Internal and collective interpretation for improving human interpretability of multi-layered neural networks

The present paper aims to propose a new type of information-theoretic method to interpret the inference mechanism of neural networks. We interpret the internal inference mechanism for itself without any external methods such as symbolic or fuzzy rules. In addition, we make interpretation processes a...

Full description

Saved in:
Bibliographic Details
Main Author: Kamimura, Ryotaro (Author)
Format: EJournal Article
Published: Universitas Ahmad Dahlan, 2019-10-29.
Subjects:
Online Access:Get Fulltext
Tags: Add Tag
No Tags, Be the first to tag this record!
LEADER 02427 am a22002653u 4500
001 IJAIN_420_ijain_v5i3_p179-192
042 |a dc 
100 1 0 |a Kamimura, Ryotaro  |e author 
100 1 0 |e contributor 
245 0 0 |a Internal and collective interpretation for improving human interpretability of multi-layered neural networks 
260 |b Universitas Ahmad Dahlan,   |c 2019-10-29. 
500 |a https://ijain.org/index.php/IJAIN/article/view/420 
520 |a The present paper aims to propose a new type of information-theoretic method to interpret the inference mechanism of neural networks. We interpret the internal inference mechanism for itself without any external methods such as symbolic or fuzzy rules. In addition, we make interpretation processes as stable as possible. This means that we interpret the inference mechanism, considering all internal representations, created by those different conditions and patterns. To make the internal interpretation possible, we try to compress multi-layered neural networks into the simplest ones without hidden layers. Then, the natural information loss in the process of compression is complemented by the introduction of a mutual information augmentation component. The method was applied to two data sets, namely, the glass data set and the pregnancy data set. In both data sets, information augmentation and compression methods could improve generalization performance. In addition, compressed or collective weights from the multi-layered networks tended to produce weights, ironically, similar to the linear correlation coefficients between inputs and targets, while the conventional methods such as the logistic regression analysis failed to do so. 
540 |a Copyright (c) 2019 Ryotaro Kamimura 
540 |a https://creativecommons.org/licenses/by-sa/4.0 
546 |a eng 
690 |a Mutual information; Internal interpretation; Collective interpretation; Inference mechanism; Generalization 
655 7 |a info:eu-repo/semantics/article  |2 local 
655 7 |a info:eu-repo/semantics/publishedVersion  |2 local 
655 7 |2 local 
786 0 |n International Journal of Advances in Intelligent Informatics; Vol 5, No 3 (2019): November 2019; 179-192 
786 0 |n 2548-3161 
786 0 |n 2442-6571 
787 0 |n https://ijain.org/index.php/IJAIN/article/view/420/ijain_v5i3_p179-192 
856 4 1 |u https://ijain.org/index.php/IJAIN/article/view/420/ijain_v5i3_p179-192  |z Get Fulltext