Abstract: Efficient drug discovery increasingly relies on computational prediction of compound-protein interactions (CPI), which is vital given the vast chemical space and the multitude of protein types. Traditional structure-based and physics-based methods for predicting CPIs, while useful, often fail when 3D structural data are unavailable. To circumvent this limitation, our study introduces a deep learning-based model. To address the cold-start problem commonly encountered in CPI prediction, we implemented two key strategies: (i) domain-related embedding methods, which generate a generalized representation of each drug and protein; and (ii) contrastive learning, which refines the model's capacity to predict affinities for unseen data by minimizing vector distances for positive samples and maximizing them for negative samples. Our model outperforms baseline models, as evidenced by lower root mean squared error (RMSE) and higher Pearson correlation coefficients. Ablation studies further underscore the value of the embedding methods and contrastive learning in enhancing the model's accuracy and generalizability.
In summary, this study presents a novel approach that applies established embedding methods to generate protein and compound embeddings separately and uses contrastive learning to integrate them effectively.
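The abstract does not state the exact form of the contrastive objective, so the following is only a minimal sketch of one common margin-based formulation, together with the two evaluation metrics it names (RMSE and the Pearson correlation coefficient). The function names, the Euclidean distance, and the `margin` parameter are all illustrative assumptions, not the thesis's implementation.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def contrastive_loss(u, v, is_positive, margin=1.0):
    """Margin-based pairwise contrastive loss (assumed form, not from the source).

    Positive pairs are penalized by their squared distance, pulling them together.
    Negative pairs are penalized only while closer than `margin`, pushing them apart.
    """
    d = euclidean(u, v)
    if is_positive:
        return d ** 2
    return max(0.0, margin - d) ** 2


def rmse(y_true, y_pred):
    """Root mean squared error between measured and predicted affinities."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)


def pearson(y_true, y_pred):
    """Pearson correlation coefficient between measured and predicted affinities."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    std_t = math.sqrt(sum((t - mean_t) ** 2 for t in y_true))
    std_p = math.sqrt(sum((p - mean_p) ** 2 for p in y_pred))
    return cov / (std_t * std_p)
```

In this sketch, a lower RMSE and a Pearson coefficient closer to 1 both indicate predictions that track the measured affinities more closely, which is the sense in which the abstract compares the model with its baselines.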

Appears in Collections: Department of Electrical Engineering and Computer Science > 3. Theses (Master)

