Treffer: High-throughput deep learning variant effect prediction with Sequence UNET

Title:
High-throughput deep learning variant effect prediction with Sequence UNET
Publication Year:
2023
Collection:
Columbia University: Academic Commons
Document Type:
Fachzeitschrift article in journal/newspaper
Language:
English
DOI:
10.7916/sfyd-q097
Accession Number:
edsbas.2D5F7C05
Database:
BASE

Weitere Informationen

Understanding coding mutations is important for many applications in biology and medicine but the vast mutation space makes comprehensive experimental characterisation impossible. Current predictors are often computationally intensive and difficult to scale, including recent deep learning models. We introduce Sequence UNET, a highly scalable deep learning architecture that classifies and predicts variant frequency from sequence alone using multi-scale representations from a fully convolutional compression/expansion architecture. It achieves comparable pathogenicity prediction to recent methods. We demonstrate scalability by analysing 8.3B variants in 904,134 proteins detected through large-scale proteomics. Sequence UNET runs on modest hardware with a simple Python package.