Avtor/Urednik     Kastrin, Andrej; Hristovski, Dimitar
Naslov     Hiter in preprost algoritem za razdvoumljanje simbolov genov
Prevedeni naslov     A fast and simple document classification algorithm for gene symbol disambiguation
Tip     članek
Vir     Inform Med Slov
Vol. in št.     Letnik 13, št. 1
Leto izdaje     2008
Obseg     str. 1-8
Jezik     slo
Abstrakt     Gene symbol disambiguation is an important problem for biomedical text mining systems. When detecting gene symbols in MEDLINE® citations one of the biggest challenges is the fact that many gene symbols also denote other, more general biomedical concepts (e.g. CT, MR). Our approach to this problem is first to classify the citations into genetic and nongenetic domains and then to detect gene symbols only in the genetic domain. We used ontological information provided by Medical Subject Headings (MeSH®) for this classification task. The proposed algorithm is fast and is able to process the full MEDLINE distribution in a few hours. It achieves predictive accuracy of 0,91. The algorithm is currently implemented in the BITOLA literature-based discovery support system.
Deskriptorji     GENES
NOMENCLATURE
MEDLINE
SUBJECT HEADINGS
ALGORITHMS