@MachineLearningLifeScience

Machine Learning in Life Science

Welcome to the GitHub page for the Center for Basic Machine Learning Research in Life Science

We conduct the basic machine learning research needed to estimate representations of biomedical data that are

  • Robust
  • Interpretable
  • Data efficient
  • Reflective of inherent data uncertainty
  • Able to leverage existing knowledge

These representations support both prediction and knowledge discovery tasks.

Research

Our research focuses on four themes. Each theme advances a different aspect of representation learning for life science, and the themes support one another:

  1. Meaningful representation of data, and development of the computational and mathematical tools needed to realize it.
  2. Geometric constructions to incorporate existing knowledge into representations and ensure that the result is understandable by humans.
  3. Representation of data types that often appear in life science, such as trees, graphs, and sequences.
  4. Inclusion of real data that is “noisy” and investigation of how associated uncertainty is best encoded.

Pinned

  1. meaningful-protein-representations Public

    Jupyter Notebook 107 7

  2. stochman Public

    Algorithms for computations on random manifolds made easier

    Python 86 11

  3. BEND Public

    Forked from frederikkemarin/BEND

    Benchmarking DNA Language Models on Biologically Meaningful Tasks

    Python 1

  4. poli Public

    A library of discrete objectives

    Python 17 1

  5. hdbo_benchmark Public

    Code for "A survey and benchmark of high-dimensional Bayesian optimization of discrete sequences"

    Python 9

  6. torchplot Public

    Plotting pytorch tensors made easy!

    Python 14 1

Repositories

Showing 10 of 12 repositories
  • hdbo_benchmark Public

    Code for "A survey and benchmark of high-dimensional Bayesian optimization of discrete sequences"

    Python 9 0 1 1 Updated Jan 6, 2025
  • poli-baselines Public

    A collection of objective functions and black box optimization algorithms related to proteins and small molecules

    Python 5 MIT 3 16 (1 issue needs help) 4 Updated Dec 22, 2024
  • poli-docs Public

    Documentation for poli and poli-baselines

    5 0 5 2 Updated Dec 22, 2024
  • poli Public

    A library of discrete objectives

    Python 17 MIT 1 49 5 Updated Dec 2, 2024
  • BEND Public Forked from frederikkemarin/BEND

    Benchmarking DNA Language Models on Biologically Meaningful Tasks

    Python 1 BSD-3-Clause 13 1 0 Updated Oct 31, 2024
  • poli-assets Public

    Assets and datasets for `poli` and `poli-baselines`

    0 0 0 0 Updated Sep 12, 2024
  • protein_regression Public

    The codebase to replicate the analysis of "A systematic analysis of regression models for protein engineering" (2024).

    Jupyter Notebook 4 MIT 2 0 0 Updated Jun 12, 2024
  • corel Public
    Python 2 MIT 1 4 0 Updated Apr 12, 2024
  • stochman Public

    Algorithms for computations on random manifolds made easier

    Python 86 Apache-2.0 11 10 0 Updated Dec 4, 2023
  • .github Public
    1 0 0 0 Updated Aug 18, 2023
