DTA

Archivio Digitale delle Tesi e degli elaborati finali elettronici

 

Tesi etd-10102025-173022

Tipo di tesi
Corso Ordinario Secondo Livello
Autore
RONCOLI, ANDREA
URN
etd-10102025-173022
Titolo
Transfer Learning of Molecular Representations: Strings to Graphs
Struttura
Classe Scienze Sperimentali
Corso di studi
INGEGNERIA - INGEGNERIA
Relatori
Tutor Prof. STEFANINI, CESARE
Relatore Prof. AVIZZANO, CARLO ALBERTO
Parole chiave
  • chemical language models
  • molecular machine learning
  • transfer learning
Data inizio appello
12/12/2025;
Disponibilità
completa
Riassunto analitico
This work explores the potential of transfer learning between different molecular representations by leveraging pretrained language models on SMILES strings to enhance downstream graph-based machine learning models. We investigate whether representations learned in the sequence (string) domain can provide useful inductive biases for models operating on molecular graphs, thereby bridging two common paradigms in molecular representation learning. To enable this cross-representation transfer, we develop a method to align SMILES tokens with corresponding graph atoms, allowing pretrained embeddings from language models to be integrated as node features in graph neural networks.
File