Author/Editor     Vidmar, Gaj
Title     Pixelisation-based statistical visualisation for categorical datasets with spreadsheet software
Type     članek
Source     Lect Notes Comput Sci
Vol. and No.     Letnik 4370
Publication year     2007
Volume     str. 48-54
Language     eng
Abstract     A heat-map type of chart for depicting large number of cases and up to twenty-five categorical variables with spreadsheet software is presented. It is implemented in Microsoft Excel using standard formulas, sorting and simple VBA code. The motivating example depicts accuracy of automated assignment of MeSH descriptor headings to abstracts of medical articles. Within each abstract, predicted support for each heading is ranked, then for each heading actually assigned/non-assigned by human specialist (depicted by black/white cell), high/low support is depicted on nine-point two-colour scale. Thus, each case (abstract) is depicted by one row of a table and each variable (heading) with two adjacent columns. Rank-based classification accuracy measure is calculated for each case, and rows are sorted in increasing accuracy order downwards. Based on analogous measure, variables are sorted in increasing prediction accuracy order rightwards. Another biomedical dataset is presented with a similar chart. Different methods for predicting binary outcomes can be visualised, and the procedure is easily extended to polytomous variables.
Descriptors     DATA INTERPRETATION, STATISTICAL
COMPUTER GRAPHICS
VOCABULARY, CONTROLLED