[SPARK-57103][SQL] Wire ordering for nanosecond timestamp types by stevomitric · Pull Request #56207 · apache/spark

stevomitric · 2026-05-29T16:25:28Z

### What changes were proposed in this pull request?

Implement `Ordering` for `TimestampNTZNanosType(p)` and `TimestampLTZNanosType(p)`, both in the interpreted path (`PhysicalDataType.ordering`) and the codegen path (`CodeGenerator.genComp`).

This is the second of three PRs for SPARK-57103. The first (`[SPARK-57103][SQL] Add Comparable to TimestampNanosVal`, commit 0b0ffb7) made the value class `Comparable`. The remaining PR will extend `hash`, `xxhash64`, and `murmur3` for the two nanos types.

Changes:

- `PhysicalDataType.scala`: replace the two `orderedOperationUnsupportedByDataTypeError` throws with `implicitly[Ordering[InternalType]]`, following the `PhysicalGeographyType` / `PhysicalGeometryType` precedent. Resolves via `scala.math.Ordering.ordered[T <: Comparable[T]]` now that `TimestampNanosVal` is `Comparable`.
- `CodeGenerator.scala`: add an explicit `genComp` arm that calls `compareTo`. Required because the existing AtomicType fallback would emit `c1.compare(c2)`, which fails to compile on `TimestampNanosVal` (it has `compareTo`, not `compare`).
- Updated the scaladoc on both physical types to note that only hash remains as future work.

### Why are the changes needed?

Without ordering, SQL operators that need a total order on the type (`ORDER BY`, sort-merge join, sort-based `GROUP BY`, `DISTINCT`) cannot execute against nanos-precision columns. The two physical types previously threw at runtime.

### Does this PR introduce _any_ user-facing change?

No. The nanos types remain gated behind `spark.sql.timestampNanosTypes.enabled`; this PR only fills in the ordering hole their `PhysicalDataType` had.

### How was this patch tested?

Added 10 unit tests in `OrderingSuite` (5 cases × 2 types):

- equal values
- `epochMicros` primary key
- `nanosWithinMicro` tie-breaker
- `Long.MinValue` / `Long.MaxValue` boundary
- pre-epoch (negative `epochMicros`)

Each case verifies both `InterpretedOrdering` and `LazilyGeneratedOrdering` agree on ASC and DESC, and that `compare(a, a) == 0`.

```
build/mvn test -pl sql/catalyst \
  -DwildcardSuites=org.apache.spark.sql.catalyst.expressions.OrderingSuite
```

Tests: 66/66 passing (10 new). Also ran `DataTypeSuite` and the catalyst-side `TimestampNanos*Suite` set: 344/344 passing.

Not adding nanos types to `DataTypeTestUtils.atomicTypes` yet because the generic `GenerateOrdering with $dataType` test there uses `RandomDataGenerator`, which does not yet support nanos types (tracked in SPARK-57034).

### Was this patch authored or co-authored using generative AI tooling?

Generated-by: Claude Code (Claude Opus 4.7)
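For readers unfamiliar with how the two paths fit together, here is a minimal standalone sketch of the idea in the description above. It is not the code from this PR: `NanosVal`, its field types, `NanosOrderingSketch`, and `genCompSketch` are hypothetical stand-ins invented for illustration, and the real `TimestampNanosVal` and `CodeGenerator.genComp` may differ in shape. It only assumes what the description states: `epochMicros` is the primary key, `nanosWithinMicro` breaks ties, and a `Comparable` value class lets `implicitly[Ordering[...]]` resolve from the Scala standard library.

```scala
// Hypothetical stand-in for Spark's TimestampNanosVal; the field types are assumptions.
final case class NanosVal(epochMicros: Long, nanosWithinMicro: Int)
    extends Comparable[NanosVal] {

  // epochMicros is the primary key; nanosWithinMicro only breaks ties.
  // Long.compare avoids the overflow that subtraction would hit near
  // Long.MinValue / Long.MaxValue.
  override def compareTo(other: NanosVal): Int = {
    val byMicros = java.lang.Long.compare(epochMicros, other.epochMicros)
    if (byMicros != 0) byMicros
    else java.lang.Integer.compare(nanosWithinMicro, other.nanosWithinMicro)
  }
}

object NanosOrderingSketch {
  // Interpreted path: the Ordering is resolved from the standard library's
  // Comparable-based implicit, so no hand-written Ordering instance is needed.
  val ordering: Ordering[NanosVal] = implicitly[Ordering[NanosVal]]

  // Codegen path (toy version, not Spark's genComp): emit a Java expression that
  // compares two variables. The value class exposes compareTo, not compare,
  // so it needs its own arm instead of the generic fallback.
  def genCompSketch(isNanosType: Boolean, c1: String, c2: String): String =
    if (isNanosType) s"$c1.compareTo($c2)" else s"$c1.compare($c2)"

  def main(args: Array[String]): Unit = {
    val values = Seq(
      NanosVal(1L, 500),
      NanosVal(-1L, 0), // pre-epoch value
      NanosVal(1L, 0),
      NanosVal(Long.MaxValue, 999),
      NanosVal(Long.MinValue, 0)
    )
    println(values.sorted(ordering))         // ascending
    println(values.sorted(ordering.reverse)) // descending
    println(genCompSketch(isNanosType = true, "c1", "c2"))
  }
}
```

Comparing with `Long.compare` rather than subtracting the two `epochMicros` values is what keeps a boundary case like `Long.MinValue` versus `Long.MaxValue` correct in this sketch, which is the kind of input the boundary test case in the description exercises.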

stevomitric · 2026-05-29T16:26:06Z

@MaxGekk please take a look at this PR.

[SPARK-57103][SQL] Drop comment on genComp nanos arm

66cba81

MaxGekk reviewed May 29, 2026

Comment thread ...atalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/CodeGenerator.scala

Comment thread sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/expressions/OrderingSuite.scala

resolve comments

fcb00de

stevomitric requested a review from MaxGekk May 31, 2026 21:47

stevomitric wants to merge 3 commits into apache:master from stevomitric:stevomitric/SPARK-57103-ordering
