cerberus

Attentive multi-headed and highly trained companion to sequence-to-function deep-learning models.

Cerberus is a PyTorch-based framework for genomic sequence-to-function (S2F) model training. It implements efficient data loading infrastructure for handling genomic intervals, DNA sequences (FASTA), and functional signal tracks (BigWig/BigBed). The library provides composable sampling strategies—including sliding windows and weighted multi-source mixing—and on-the-fly data transformations such as jittering and reverse-complement augmentation. By abstracting these components into a unified pipeline, Cerberus facilitates the training of deep learning models with complex input/output architectures on large-scale genomic datasets.

Installation

Cerberus requires Python 3.12 or later. It is recommended to install Cerberus in a virtual environment.

1. Create and activate a virtual environment

python3.12 -m venv .venv
source .venv/bin/activate

2. Install Cerberus

To install Cerberus and its dependencies, use pip from the root directory of the repository.

Standard Installation

pip install .

Editable Installation (recommended for development)

pip install -e .

Development

Install Development Dependencies

To run tests and contribute, install the development dependencies:

pip install -e .[dev]

Running Tests

Run the standard test suite with pytest:

pytest tests/

Running Slow Tests

Some tests require downloading large genomic files and are skipped by default. To run these tests, set the RUN_SLOW_TESTS environment variable:

RUN_SLOW_TESTS=1 pytest tests/

Name		Name	Last commit message	Last commit date
Latest commit History 457 Commits
.github/workflows		.github/workflows
docs		docs
examples		examples
notebooks		notebooks
pretrained		pretrained
src/cerberus		src/cerberus
tests		tests
tools		tools
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
README.md		README.md
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
pyrightconfig.json		pyrightconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

cerberus

Installation

Development

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

cerberus

Installation

Development

About

Resources

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages