Add support of Grain input pipeline for DPO. by igorts-git · Pull Request #4009 · AI-Hypercomputer/maxtext

igorts-git · 2026-05-28T16:53:41Z

Description

Add support for reading input with Grain when using the new Tunix-based DPO/ORPO.
Modify dpo.yml to provide reasonable defaults so that DPO can be invoked without extra config parameters.

Correct the case where config.dpo.max_prompt_length is not set. The default value is max_target_length // 2. The same value must be passed to both the input pipeline and the Tunix DPOTrainer class, so I moved its computation to types.py.
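
A minimal sketch of the idea, not the code in this PR: resolve the default once and hand the same resolved value to both consumers. `resolve_max_prompt_length`, `PipelineSettings`, `TrainerSettings`, and `build_settings` are hypothetical names, and the range check is an added assumption; only the `max_target_length // 2` rule comes from the description above.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


def resolve_max_prompt_length(max_target_length: int, max_prompt_length: Optional[int] = None) -> int:
  """Returns the explicit value if set, otherwise max_target_length // 2."""
  if max_prompt_length is None:
    return max_target_length // 2
  # Assumption for this sketch: a prompt must leave room for a response.
  if not 0 < max_prompt_length < max_target_length:
    raise ValueError("max_prompt_length must be in (0, max_target_length)")
  return max_prompt_length


@dataclass(frozen=True)
class PipelineSettings:
  """Hypothetical stand-in for what the input pipeline receives."""
  max_prompt_length: int


@dataclass(frozen=True)
class TrainerSettings:
  """Hypothetical stand-in for what the trainer receives."""
  max_prompt_length: int


def build_settings(
    max_target_length: int, max_prompt_length: Optional[int] = None
) -> Tuple[PipelineSettings, TrainerSettings]:
  # Resolve once so the two consumers cannot drift apart.
  resolved = resolve_max_prompt_length(max_target_length, max_prompt_length)
  return PipelineSettings(resolved), TrainerSettings(resolved)
```

Resolving in a single place means neither consumer needs its own fallback, which is the failure mode the description is addressing.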

In a follow-up PR I will add a detailed logits comparison test for DPO.

BUGS: b/485626968

Tests

CI tests.
Ran DPO/ORPO while reading using Grain from parquet files.

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

codecov · 2026-05-28T16:58:56Z

Codecov Report

❌ Patch coverage is 78.26087% with 5 lines in your changes missing coverage. Please review.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| ...rc/maxtext/input_pipeline/grain_data_processing.py | 70.58% | 2 Missing and 3 partials ⚠️ |
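
As a sanity check on the figures above, the sketch below searches for the total line count that reproduces each percentage. It assumes patch coverage is fully covered lines divided by total changed lines, and that misses and partials both count as not covered. The totals it finds are inferences from the reported percentages, not numbers stated in the report, and `totals_matching` is a hypothetical helper.

```python
from typing import List


def totals_matching(percent: float, uncovered: int, tol: float, max_total: int = 1000) -> List[int]:
  """Returns every total line count whose coverage matches `percent` within `tol`."""
  return [
      total
      for total in range(uncovered + 1, max_total + 1)
      if abs(100.0 * (total - uncovered) / total - percent) < tol
  ]


# Whole patch: 78.26087% with 5 lines not covered.
patch_totals = totals_matching(78.26087, uncovered=5, tol=1e-4)
print(patch_totals)  # [23], i.e. 18 of 23 lines covered

# Single file: 70.58% with 2 missing + 3 partial lines.
# The looser tolerance allows for the two-decimal display.
file_totals = totals_matching(70.58, uncovered=5, tol=0.01)
print(file_totals)  # [17], i.e. 12 of 17 lines covered
```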

github-actions · 2026-06-02T15:17:29Z

🤖 Hi @igorts-git, I've received your request, and I'm working on it now! You can track my progress in the logs for more details.

github-actions

## 📋 Review Summary

This pull request adds Grain input pipeline support for DPO (Direct Preference Optimization). The implementation is consistent with the project's existing Grain-based SFT patterns and includes comprehensive unit tests for different DPO data formats.

🔍 General Feedback

- Good Coverage: The addition of TestGrainDPOPipelineProcessing in dpo_data_processing_test.py covers key edge cases like 2-column (common prefix) and 3-column datasets.
- Defensive Design: The validation check in src/maxtext/configs/types.py prevents unsupported configurations early.
- Defaults: The updated defaults in dpo.yml provide a smoother "out-of-the-box" experience for Tunix-based DPO.
- Batching Consistency: It is recommended to use the get_local_batch_size utility to ensure all global configuration flags (like real data expansion) are respected during batching.
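
To illustrate the two data layouts named in the coverage note: a 3-column record carries the prompt separately, while a 2-column record repeats the prompt at the start of both responses, so the prompt can be recovered as their common prefix. The sketch below shows one way to do that on token ID lists. It is an illustration under assumptions, not the PR's implementation; the function name and the `prompt`, `chosen`, and `rejected` keys are hypothetical.

```python
from typing import Dict, List


def split_common_prefix(chosen: List[int], rejected: List[int]) -> Dict[str, List[int]]:
  """Converts a 2-column (chosen, rejected) record into a 3-column record."""
  prefix_len = 0
  # Walk both sequences together until the first differing token.
  for chosen_token, rejected_token in zip(chosen, rejected):
    if chosen_token != rejected_token:
      break
    prefix_len += 1
  return {
      "prompt": chosen[:prefix_len],
      "chosen": chosen[prefix_len:],
      "rejected": rejected[prefix_len:],
  }


record = split_common_prefix([1, 2, 3, 4], [1, 2, 5])
print(record)  # {'prompt': [1, 2], 'chosen': [3, 4], 'rejected': [5]}
```

One caveat with this approach: if the two responses happen to begin with the same tokens, those tokens are attributed to the prompt, which is why an explicit 3-column format is less ambiguous.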

igorts-git · 2026-06-02T17:20:29Z

  max_prompt_length: int | None = None

+  def __post_init__(self):
+    if self.max_prompt_length is None:


The new logic that I added in types.py guarantees that this value is not None. However, for cases like unit tests it is still useful to have this default computed. Let me know if you want it removed.

igorts-git force-pushed the igorts/dpo-grain-pipeline branch 4 times, most recently from d63a9cc to 3b27bbf Compare June 2, 2026 04:09

igorts-git marked this pull request as ready for review June 2, 2026 04:11

igorts-git added the gemini-review label Jun 2, 2026

github-actions Bot reviewed Jun 2, 2026

Comment thread src/maxtext/input_pipeline/grain_data_processing.py

igorts-git force-pushed the igorts/dpo-grain-pipeline branch from 3b27bbf to 86f715f Compare June 2, 2026 16:39

Enable Tunix-based DPO input processing for Grain

89d9e3c

igorts-git force-pushed the igorts/dpo-grain-pipeline branch from 86f715f to 89d9e3c Compare June 2, 2026 17:08

igorts-git commented Jun 2, 2026

igorts-git wants to merge 1 commit into main from igorts/dpo-grain-pipeline
