BUG: Fix Series.combine_first silently ignoring duplicate indices (#66009) by krishgarg344 · Pull Request #66025 · pandas-dev/pandas

krishgarg344 · 2026-06-25T19:00:40Z

The fastpath in Series.combine_first (triggered when dtypes and indices match) was skipping the duplicate label check that normally occurs during .reindex(). This caused the method to silently return positionally misaligned results instead of correctly raising a ValueError.
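To make the two code paths concrete, here is a minimal sketch of the primitives described above, not the actual body of Series.combine_first. The variable names (idx, left, right, filled, reindex_error) are illustrative. It assumes the fastpath amounts to a mask() fill on identical indices, as discussed later in this thread, and that the error in the general path comes from .reindex() on a duplicate-labelled axis.

```python
import numpy as np
import pandas as pd

# Two Series sharing the same index, which contains the duplicate label "a".
idx = pd.Index(["a", "a", "b"])
left = pd.Series([1.0, np.nan, 3.0], index=idx)
right = pd.Series([10.0, 20.0, 30.0], index=idx)

# Fastpath-style fill: the indices are equal, so mask() pairs values by
# position and no label lookup (and no duplicate check) takes place.
filled = left.mask(left.isna(), right)

# General-path-style step: reindexing a duplicate-labelled Series to a
# different target is ambiguous, so pandas raises a ValueError.
try:
    left.reindex(["a", "b"])
    reindex_error = None
except ValueError as exc:
    reindex_error = exc

print(filled.tolist())
print(type(reindex_error).__name__)
```

Whether combine_first itself takes the first or the second route for a given input depends on the dtype and index checks in the pandas version under test, which is the inconsistency this PR is about.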

Modifications:

pandas/core/series.py: Added an is_unique check to the fastpath so that duplicate indices are routed to the standard error-raising path.
pandas/tests/series/methods/test_combine_first.py: Added a test to ensure duplicate indices raise a ValueError.
doc/source/whatsnew/v3.1.0.rst: Added a release note under the Indexing section.

Closes #66009

(Note: As a first time contributor, I used an LLM as a pair programming tutor to help me navigate the codebase and structure this PR. All logic, testing, and sandbox verifications were executed and validated manually locally before submission.)

krishgarg344 · 2026-06-25T19:50:13Z

pre-commit.ci autofix

for more information, see https://pre-commit.ci

rhshadrach

I'm not convinced this behavior should change; left a comment in the issue.

krishgarg344 · 2026-06-25T20:38:26Z

Thanks for taking a look and providing the architectural context, @rhshadrach!

That logic makes complete sense: if the indices are strictly identical, positional alignment via .mask() avoids the ambiguity that standard .reindex() faces, making the output perfectly predictable.

The original catalyst for this PR was the inconsistency in the "slow lane": if you pass the same identically indexed Series but force a dtype mismatch (e.g., int64 vs float64), the call falls through to .reindex() and raises the ValueError.

If the fastpath behavior is the intended standard for identical indices, should the slow lane be updated to match it (perhaps bypassing the raise if self.index.equals(other.index) regardless of dtype)?

Happy to either pivot this PR to address that inconsistency, or simply close this out if the current split behavior is accepted as is!
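For illustration, the alternative floated above could look roughly like the following. This is a hypothetical standalone helper (combine_first_equal_index is not a pandas function and is not part of this PR), sketched under the assumption that a positional mask() fill is acceptable whenever the two indices are equal, regardless of dtype.

```python
import numpy as np
import pandas as pd


def combine_first_equal_index(left: pd.Series, right: pd.Series) -> pd.Series:
    """Hypothetical helper: fill NaNs in `left` from `right`.

    When the indices are equal, values are paired by position, even if the
    dtypes differ or the index holds duplicate labels. Otherwise the call is
    delegated to the regular Series.combine_first.
    """
    if left.index.equals(right.index):
        # Equal indices: positional fill, no reindexing involved.
        return left.mask(left.isna(), right)
    return left.combine_first(right)


idx = pd.Index(["a", "a", "b"])  # duplicate label "a"
left = pd.Series([1.0, np.nan, 3.0], index=idx)  # float64
right = pd.Series([10, 20, 30], index=idx)       # int64, so dtypes differ

result = combine_first_equal_index(left, right)
print(result.tolist())
```

The trade-off is the one raised in review: this makes the dtype-mismatch case consistent with the fastpath, at the cost of never raising for duplicate labels when the indices match.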

krishgarg344 added 3 commits June 26, 2026 00:17

BUG: Fix Series.combine_first silently ignoring duplicate indices

450e415

TST: Add test for Series.combine_first with duplicate indices

b9692bb

DOC: Add whatsnew entry for Series.combine_first duplicate index fix

fc35f41

[pre-commit.ci] auto fixes from pre-commit.com hooks

37bd85e

for more information, see https://pre-commit.ci

rhshadrach requested changes Jun 25, 2026

krishgarg344 wants to merge 4 commits into pandas-dev:main from krishgarg344:fix-combine-first-duplicate-index
