Summary of "Variant to Function (V2F) Symposium: Johanes Linder (2025)"

Summary of Johannes Linder’s Talk at the Variant to Function (V2F) Symposium (2025)

Johannes Linder presented on the development and application of machine learning models to predict gene regulatory functions directly from DNA sequence, focusing on interpreting genetic variation and its effects on gene regulation. His work is conducted within David Kelly’s group at Calico Life Sciences.


Main Ideas and Concepts


Methodology and Model Details

Borsoy Model

Interpretation Techniques

Applications

Borsoy Prime Model


Key Lessons and Insights


Discussion Highlights


Summary of Methodology / Instructions

Training a Gene Regulatory Prediction Model (e.g., Borsoy)

  1. Collect large-scale sequencing coverage datasets (RNA-seq, epigenomics, CAGE) across tissues/species.
  2. Prepare large genomic sequence inputs (~500 kb).
  3. Design a deep neural network with convolutional layers, subsampling, self-attention, and upsampling.
  4. Train to predict one-dimensional coverage profiles at fine resolution.
  5. Use alternating batches of human and mouse data for training.
  6. Validate on held-out genomic regions.

Interpreting Model Predictions

Extending to Cell Type-Specific Predictions


Speakers / Sources Featured

Other Referenced Models and Groups


This summary captures the core content, methodology, and discussion points from Johannes Linder’s presentation on predicting gene regulatory function from DNA sequence using deep learning.

Category ?

Educational

Share this summary

Video