Fix reconcile schema fetch failure on Foreign Catalogs (Lakehouse Federation) #2367
moomindani wants to merge 3 commits into databrickslabs:main
Conversation
…eration)
Foreign Catalogs created via Lakehouse Federation do not have the Databricks-specific `full_data_type` column in `information_schema.columns`, causing an `UNRESOLVED_COLUMN` error for all report types. This adds a `DESCRIBE TABLE` fallback when the `full_data_type` column is not found.
Co-authored-by: Isaac
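Roughly, the fallback described here amounts to catching the failed `information_schema` lookup and retrying with `DESCRIBE TABLE`. A minimal sketch of that shape, assuming a hypothetical `_fetch_schema` helper on the data source class (only `log_and_throw_exception` and the `full_data_type` check come from the actual diff):

```python
def get_schema(self, catalog: str, schema: str, table: str):
    schema_query = _get_schema_query(catalog, schema, table)
    try:
        return self._fetch_schema(schema_query)  # hypothetical helper that runs the query
    except Exception as e:
        # Foreign Catalogs (Lakehouse Federation) lack full_data_type in
        # information_schema.columns, so the SELECT fails with UNRESOLVED_COLUMN.
        if "full_data_type" not in str(e):
            return self.log_and_throw_exception(e, "schema", schema_query)
        # Fallback to DESCRIBE TABLE for catalogs that lack the column
        describe_query = f"describe table {catalog}.{schema}.{table}"
        return self._fetch_schema(describe_query)
```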
@BesikiML and @bishwajit-db can you take a look at this when you get a chance
bishwajit-db left a comment
Overall LGTM. Please have a look at the comment https://github.com/databrickslabs/lakebridge/pull/2367/changes#r3056652489
| if "full_data_type" not in str(e): | ||
| return self.log_and_throw_exception(e, "schema", schema_query) | ||
|
|
||
| # Fallback to DESCRIBE TABLE for catalogs that lack the full_data_type column |
Can we add an optional parameter to _get_schema_query?
```python
def _get_schema_query(catalog: str, schema: str, table: str, force_describe: bool = False):
    if catalog == "hive_metastore" or force_describe:
        return f"describe table {catalog}.{schema}.{table}"
    ...
```

Then the fallback becomes:

```python
describe_query = _get_schema_query(catalog_str, schema, table, force_describe=True)
```
Thanks for the review! Updated to reuse _get_schema_query with a force_describe parameter instead of a separate function. Please take another look.
Address review feedback from @bishwajit-db — reuse _get_schema_query with a force_describe parameter instead of a separate _get_describe_table_query function. Co-authored-by: Isaac
m-abulazm left a comment
Thanks for raising this. I documented a different approach, since scaling this might require dedicated logic for Hive vs foreign catalog vs native catalog. To stay open for future changes, I would prefer a subclass instead of conditional control.
```diff
-def _get_schema_query(catalog: str, schema: str, table: str):
+def _get_schema_query(catalog: str, schema: str, table: str, force_describe: bool = False):
```
I would prefer to split this into `_get_schema_query_for_source`, which uses DESCRIBE TABLE exclusively, since we know Databricks sources can only be Hive, global views, or Foreign Catalogs, which only support DESCRIBE TABLE;
and `_get_schema_query_for_target`, which uses the SELECT query only, as that is always the needed behavior.
This requires distinguishing source vs target Databricks data sources, so I would implement a subclass instead of nested conditionals and control through exception handling.
Also to note: this rubs a bit against PR #2362. It should not block us on this PR, but keep in mind that there might be conflicts.
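A rough sketch of the proposed split, with illustrative class and method names (the exact information_schema query is also an assumption, not the repo's actual SQL):

```python
class DatabricksDataSource:
    def get_schema_query(self, catalog: str, schema: str, table: str) -> str:
        raise NotImplementedError

class DatabricksSourceDataSource(DatabricksDataSource):
    # Databricks sources can only be hive_metastore, global views, or
    # Foreign Catalogs, all of which support DESCRIBE TABLE.
    def get_schema_query(self, catalog: str, schema: str, table: str) -> str:
        return f"describe table {catalog}.{schema}.{table}"

class DatabricksTargetDataSource(DatabricksDataSource):
    # Targets are native Unity Catalog tables, where selecting
    # full_data_type from information_schema.columns always works.
    def get_schema_query(self, catalog: str, schema: str, table: str) -> str:
        return (
            f"select column_name, full_data_type "
            f"from {catalog}.information_schema.columns "
            f"where table_schema = '{schema}' and table_name = '{table}'"
        )
```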
Thanks for the detailed feedback! Refactored to split `DatabricksDataSource` into two subclasses:

- `DatabricksSourceDataSource`: always uses `DESCRIBE TABLE` (works for hive_metastore, global_temp, and Foreign Catalogs)
- `DatabricksTargetDataSource`: uses `information_schema` with `full_data_type` (for native UC tables)

The `source_adapter` and `utils` now create the appropriate subclass based on source vs target role. This eliminates the need for the exception-based fallback.
Also verified with an integration test — reconcile against a Foreign Catalog (Lakebase PostgreSQL via Lakehouse Federation) succeeded with the Foreign Catalog as the direct source.
Noted the potential conflict with #2362 — will keep an eye on it.
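The role-based wiring described above could look roughly like this (the function name and parameters are assumptions for illustration, not the actual `source_adapter` signature):

```python
def create_adapter(engine: str, spark, ws, secret_scope, role: str = "target"):
    # Pick the Databricks subclass by reconcile role; in the real code the
    # subclass presumably receives spark/ws/secret_scope, which the minimal
    # sketch above omits.
    if engine == "databricks":
        return DatabricksSourceDataSource() if role == "source" else DatabricksTargetDataSource()
    raise ValueError(f"Unsupported engine: {engine}")
```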
Address review feedback from @m-abulazm — instead of exception-based fallback, split DatabricksDataSource into:
- DatabricksSourceDataSource: uses DESCRIBE TABLE (works for hive, global_temp, and Foreign Catalogs via Lakehouse Federation)
- DatabricksTargetDataSource: uses information_schema with full_data_type (optimal for native Unity Catalog tables)

The source_adapter and utils are updated to create the appropriate subclass based on source vs target role.
Co-authored-by: Isaac
Summary
- Foreign Catalogs created via Lakehouse Federation lack the Databricks-specific `full_data_type` column in `information_schema.columns`, causing `UNRESOLVED_COLUMN` errors for all reconcile report types (`schema`, `data`, `row`, `all`)
- Split `DatabricksDataSource` into two subclasses per review feedback:
  - `DatabricksSourceDataSource`: uses `DESCRIBE TABLE` (works for hive_metastore, global_temp, and Foreign Catalogs)
  - `DatabricksTargetDataSource`: uses `information_schema` with `full_data_type` (for native Unity Catalog tables)
- `source_adapter` and `utils` now create the appropriate subclass based on source vs target role

Test plan
- `test_get_schema_target`, `test_get_schema_source`, `test_get_schema_source_foreign_catalog`, `test_get_schema_target_exception_handling`, `test_get_schema_source_exception_handling`
- `test_create_adapter_for_databricks_dialect_source`, `test_create_adapter_for_databricks_dialect_target`
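For illustration, a test like `test_get_schema_source_foreign_catalog` might reduce to asserting that the source subclass emits DESCRIBE TABLE for a federated catalog. A sketch against the minimal classes above (the catalog/schema/table names are made up; the repo's actual test fixtures will differ):

```python
def test_get_schema_source_foreign_catalog():
    # The source subclass should never touch information_schema.columns,
    # so a Foreign Catalog name must yield a DESCRIBE TABLE statement.
    source = DatabricksSourceDataSource()
    query = source.get_schema_query("federated_pg", "public", "orders")
    assert query == "describe table federated_pg.public.orders"
```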