Column lineage: Empty field dataset after dbt-ol run #2998

Open
amzar96 opened this issue Dec 30, 2024 · 1 comment

amzar96 commented Dec 30, 2024

Hi,

I'm not sure whether this is a bug or a limitation.

I am having trouble generating column lineage: no fields appear for the dataset, and Marquez logs the following error:

ERROR [2024-12-26 09:41:14,327] marquez.db.OpenLineageDao: Cannot produce column lineage for missing output field in output dataset: xxxx

Here is the output facet:

{
  "fields": {
    "entity_code": {
      "inputFields": [
        {
          "name": "db.schema.table01",
          "field": "entity_code",
          "namespace": "redshift",
          "transformations": []
        }
      ],
      "transformationType": "",
      "transformationDescription": ""
    }
  }
}

When I check the dataset, no fields are attached to it. Somehow, OpenLineage (OL) is unable to detect the fields.
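
To illustrate the mismatch the error message describes, here is a rough Python sketch that compares the output fields named in the facet above with the fields listed in a dataset's schema. This is not Marquez's actual code; the function name `missing_output_fields` and the sample schema entry (including its `varchar` type) are hypothetical and only for illustration.

```python
import json


def missing_output_fields(column_lineage_facet, schema_fields):
    """Return facet output fields that are absent from the dataset schema."""
    known = {field["name"] for field in schema_fields}
    facet_fields = column_lineage_facet.get("fields", {})
    return sorted(name for name in facet_fields if name not in known)


# The facet from this issue, copied as-is.
facet = json.loads("""
{
  "fields": {
    "entity_code": {
      "inputFields": [
        {
          "name": "db.schema.table01",
          "field": "entity_code",
          "namespace": "redshift",
          "transformations": []
        }
      ],
      "transformationType": "",
      "transformationDescription": ""
    }
  }
}
""")

# Dataset with no fields attached, as in my case.
print(missing_output_fields(facet, []))

# Hypothetical dataset whose schema does list the column.
print(missing_output_fields(facet, [{"name": "entity_code", "type": "varchar"}]))
```

With an empty schema, `entity_code` is reported as missing, which matches the situation where the dataset has no fields attached.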

After a few rounds of testing, I noticed that all fields must be specified in source.yml and schema.yml. Also, I cannot use SELECT * FROM xxxxx in the model SQL. I need to list the columns explicitly, e.g. SELECT entity_code, xxx, xxx FROM xxx, for the column lineage to be generated.
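
As a workaround aid, here is a small Python sketch for spotting models that still start their select list with `*`. It is a plain regex heuristic I put together for illustration, not part of dbt or OpenLineage; the name `uses_select_star` is hypothetical, and the check only catches a `*` (or `alias.*`) that comes first in the select list.

```python
import re

# Matches `select *`, `select distinct *`, or `select alias.*`.
# Heuristic only: a `*` later in the select list is not detected.
SELECT_STAR = re.compile(r"\bselect\s+(?:distinct\s+)?(?:\w+\.)?\*", re.IGNORECASE)

# Matches `-- line comments` and `/* block comments */`.
SQL_COMMENT = re.compile(r"--[^\n]*|/\*.*?\*/", re.DOTALL)


def uses_select_star(sql: str) -> bool:
    """Return True if the SQL text appears to start a select list with `*`."""
    without_comments = SQL_COMMENT.sub(" ", sql)
    return bool(SELECT_STAR.search(without_comments))


print(uses_select_star("SELECT * FROM xxxxx"))
print(uses_select_star("SELECT entity_code, xxx, xxx FROM xxx"))
```

The first call flags the query and the second does not, so the same function could be run over each model's SQL text before a dbt-ol run.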

boring-cyborg bot commented Dec 30, 2024

Thanks for opening your first issue in the Marquez project! Please be sure to follow the issue template!
