OAK

Joint triplet loss with semi-hard constraint for data augmentation and disease prediction using gene expression data

Metadata Downloads
Abstract
"The accurate prediction of patients with complex diseases, such as Alzheimers disease (AD), as well as disease stages, including early- and late-stage cancer, is challenging owing to substantial variability among patients and limited availability of clinical data. Deep metric learning has emerged as a promising approach for addressing these challenges by improving data representation. In this study, we propose a joint triplet loss model with a semi-hard constraint (JTSC) to represent data in a small number of samples. JTSC strictly selects semi-hard samples by switching anchors and positive samples during the learning process in triplet embedding and combines a triplet loss function with an angular loss function. Our results indicate that JTSC significantly improves the number of appropriately represented samples during training when applied to the gene expression data of AD and to cancer stage prediction tasks. Furthermore, we demonstrate that using an embedding vector from JTSC as an input to the classifiers for AD and cancer stage prediction significantly improves classification performance by extracting more accurate features. In conclusion, we show that feature embedding through JTSC can aid in classification when there are a small number of samples compared to a larger number of features. ? 2023, Springer Nature Limited.
Author(s)
Chung, YeonwooLee, Hyunju
Issued Date
2023-10
Type
Article
DOI
10.1038/s41598-023-45467-8
URI
https://scholar.gist.ac.kr/handle/local/9945
Publisher
Nature Research
Citation
Scientific Reports, v.13, no.1
ISSN
2045-2322
Appears in Collections:
Department of AI Convergence > 1. Journal Articles
공개 및 라이선스
  • 공개 구분공개
파일 목록

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.