Add DatasetSchema instances to all dataset classes #249

Open: GaganNarula wants to merge 1 commit into gagan/schema-tests from gagan/schema-datasets
@GaganNarula
Collaborator

Summary

  • Adds a schema = DatasetSchema(...) class attribute to all 34 dataset classes
  • Each schema lists the expected columns, dtypes, and required status
  • Calls self._validate_schema() after loading data in each dataset
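
The pattern described above can be sketched roughly as follows. This is a minimal, standard-library-only illustration, not the code in this PR: `ColumnSpec`, the fields of `DatasetSchema`, `ExampleDataset`, and the dict-of-lists data layout are all assumptions made for the example. Only the names `schema`, `DatasetSchema`, `_load()`, and `_validate_schema()` come from the description.

```python
from __future__ import annotations

from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class ColumnSpec:
    """Hypothetical description of one column: name, dtype, required flag."""
    name: str
    dtype: type
    required: bool = True


@dataclass(frozen=True)
class DatasetSchema:
    """Hypothetical stand-in; the real DatasetSchema may look different."""
    columns: tuple[ColumnSpec, ...]

    def validate(self, data: dict[str, list[Any]]) -> None:
        for col in self.columns:
            if col.name not in data:
                # Optional columns may be absent; required ones may not.
                if col.required:
                    raise ValueError(f"missing required column: {col.name}")
                continue
            bad = [v for v in data[col.name] if not isinstance(v, col.dtype)]
            if bad:
                raise TypeError(
                    f"column {col.name!r} expected {col.dtype.__name__}, "
                    f"got {type(bad[0]).__name__}"
                )


class ExampleDataset:
    """Hypothetical dataset class carrying a schema class attribute."""

    schema = DatasetSchema(
        columns=(
            ColumnSpec("id", int),
            ColumnSpec("label", str),
            ColumnSpec("score", float, required=False),
        )
    )

    def __init__(self, raw: dict[str, list[Any]]) -> None:
        self._raw = raw
        self.data: dict[str, list[Any]] | None = None

    def _load(self) -> None:
        self.data = self._raw  # stands in for reading the real source
        self._validate_schema()  # validation runs right after loading

    def _validate_schema(self) -> None:
        self.schema.validate(self.data)
```

Under these assumptions, loading data that lacks a required column or holds a value of the wrong type raises immediately inside `_load()`, instead of failing later in downstream code.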

Part of

Stacked PR 3/4 for issue #227. Depends on PR #248 (schema tests).

Test plan

  • uv run pytest — existing dataset tests should still pass
  • Schema validation runs automatically on _load() for each dataset
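
One way to guard the "all dataset classes" claim in tests is to walk the class hierarchy and report any class that does not declare a schema. The sketch below is only an illustration: `BaseDataset`, `ExampleDatasetA`, `all_subclasses`, and `missing_schema` are hypothetical names, and a plain dict stands in for a `DatasetSchema` instance.

```python
class BaseDataset:
    """Hypothetical common base class; the real hierarchy may differ."""
    schema = None


class ExampleDatasetA(BaseDataset):
    # A plain dict stands in for a DatasetSchema instance here.
    schema = {"id": int}


def all_subclasses(cls):
    """Return every direct and indirect subclass of cls."""
    found = []
    for sub in cls.__subclasses__():
        found.append(sub)
        found.extend(all_subclasses(sub))
    return found


def missing_schema(base):
    """Names of subclasses whose schema attribute is still unset."""
    return [
        c.__name__
        for c in all_subclasses(base)
        if getattr(c, "schema", None) is None
    ]


def test_every_dataset_declares_schema():
    # pytest collects this by name; it also runs as a plain function.
    assert missing_schema(BaseDataset) == []
```

A check of this shape fails as soon as a new dataset class is added without a schema, which a per-dataset loading test would not catch.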
