SCALLOPS

SCALLOPS (Scalable Library for Optical Pooled Screens) is a comprehensive Python package designed to streamline and scale the analysis of Optical Pooled Screens (OPS) for biological data. With a focus on handling large-scale, high-throughput screening data, SCALLOPS provides tools for efficiently processing, analyzing, and interpreting OPS data, leveraging modern distributed computing frameworks like Dask.

Documentation

The full documentation, API reference, and tutorials can be found at: http://scallops.readthedocs.io

Repository Structure

scallops/
├── .github/             # CI/CD workflows
├── docs/                # Documentation source (Sphinx)
├── scallops/            # Main Python package source
│   ├── cli/             # CLI entry points
│   ├── core/            # Core processing logic
│   └── utils/           # Utilities
├── wdl/                 # WDL pipeline definitions
├── Dockerfile           # Docker image definition
├── pyproject.toml       # Build metadata
├── requirements.txt     # Main dependencies
├── setup.py             # Installation script
└── README.md            # Project overview

Getting Started

Prerequisites

SCALLOPS requires Python 3.11 or newer.

1. Environment Setup (Recommended)

We recommend using uv for high-performance Python environment management. You will need uv installed on your system. Installation instructions can be found here: https://docs.astral.sh/uv/

To set up a virtual environment:

# Create a virtual environment with a specific Python version
uv venv --python 3.12

# Activate the environment
# On macOS/Linux:
source .venv/bin/activate
# On Windows:
.venv\Scripts\activate

2. Installation and Usage Options

Option 1: Install from PyPI (Standard)

The easiest way to install the stable version is via pip (or uv pip):

uv pip install scallops

Option 2: Run via Docker (GitHub Container Registry)

SCALLOPS is available as a containerized image via the GitHub Container Registry (GHCR). This is the best option for ensuring environment consistency.

# Pull the latest image
docker pull ghcr.io/genentech/scallops:latest

# Run the CLI directly
docker run --rm ghcr.io/genentech/scallops:latest scallops --help

Option 3: Install from Source (Development)

If you wish to contribute to the codebase or need the latest unreleased changes:

Clone the repository:

git clone [https://github.com/Genentech/scallops.git](https://github.com/Genentech/scallops.git)
cd scallops

Install in editable mode:

uv pip install -r requirements.txt -e .

Main Focus Areas

High-Throughput Data Processing: Designed to manage massive datasets typical of OPS experiments across multiple scales.
Scalability and Performance: Optimized for both local and cloud-based distributed environments using Dask.
Modular Workflows: Includes customizable WDL workflows for cloud platforms like Terra or Cromwell.

Key Features

Efficient Data Handling: Advanced memory management and lazy evaluation to minimize resource usage.
Command-Line Interface (CLI): Automates batch processing for seamless pipeline integration.
Customizable Outputs: Generates versatile data visualizations and summary statistics.
Notebook Examples: Practical Jupyter notebooks are included to guide users through real-world workflows.
Rich API: A comprehensive API that allows for the creation of fully customized biological data pipelines.

Typical Use Cases

Large-Scale Screening: Handling the immense data loads of genome-wide OPS projects.
Biological Discovery: Identifying and quantifying biological perturbations from high-throughput imaging.

Contributing to SCALLOPS

We welcome all forms of contributions, including bug reports, documentation improvements, and feature enhancements.

Name		Name	Last commit message	Last commit date
Latest commit History 550 Commits
.github/workflows		.github/workflows
docs		docs
scallops		scallops
wdl		wdl
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yaml		.readthedocs.yaml
Dockerfile		Dockerfile
LICENSE		LICENSE
Manifest.in		Manifest.in
README.md		README.md
pyproject.toml		pyproject.toml
requirements.cellpose.txt		requirements.cellpose.txt
requirements.doc.txt		requirements.doc.txt
requirements.txt		requirements.txt
requirements.ufish.txt		requirements.ufish.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SCALLOPS

Documentation

Repository Structure

Getting Started

Prerequisites

1. Environment Setup (Recommended)

2. Installation and Usage Options

Option 1: Install from PyPI (Standard)

Option 2: Run via Docker (GitHub Container Registry)

Option 3: Install from Source (Development)

Main Focus Areas

Key Features

Typical Use Cases

Contributing to SCALLOPS

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SCALLOPS

Documentation

Repository Structure

Getting Started

Prerequisites

1. Environment Setup (Recommended)

2. Installation and Usage Options

Option 1: Install from PyPI (Standard)

Option 2: Run via Docker (GitHub Container Registry)

Option 3: Install from Source (Development)

Main Focus Areas

Key Features

Typical Use Cases

Contributing to SCALLOPS

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages