Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sync fork #21

Merged
merged 128 commits into from
Sep 19, 2023
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
128 commits
Select commit Hold shift + click to select a range
b0cb990
tests(search): more golden tests (#8683)
eboneil Aug 22, 2023
439cf4d
test(ingest/vertica): Skip integration test failing CI; support arm M…
asikowitz Aug 22, 2023
d6e36f1
ci: add `needs_artifact_download` output for ingestion image (#8695)
hsheth2 Aug 23, 2023
4116716
logs(ingestion/unity): Hide stack trace on sql parse failure logs (#8…
asikowitz Aug 23, 2023
8ee58af
feat(ingestion/powerbi): support multiple tables as upstream in nativ…
siddiquebagwan-gslab Aug 23, 2023
68abf9c
build(ingest): Bump pydantic pin (#8660)
asikowitz Aug 23, 2023
8141e2d
remove(ingest/snowflake): Remove legacy snowflake lineage (#8653)
asikowitz Aug 23, 2023
01ae5d9
fix(ingest/ldap): Handle case when 'objectClass' not in attrs (#8658)
asikowitz Aug 23, 2023
52f1e32
fix(ui) Remove new Role entity from searchable entity types (#8655)
chriscollins3456 Aug 23, 2023
a4cb81c
fix(java) Use alias for name search sorting and fix missing mappings …
chriscollins3456 Aug 23, 2023
e12d910
feat(ui) Create page for managing home page posts (#8707)
chriscollins3456 Aug 24, 2023
a97548c
fix(ingest/powerbi): add sqlglot python dep (#8704)
hsheth2 Aug 24, 2023
090f8af
ci(ingest): make ingestion caching rules correct (#8685)
hsheth2 Aug 24, 2023
9472636
fix(cleanup): cleanup of 1 sub-module (#8678)
anshbansal Aug 24, 2023
aab5b6a
fix(policies): fix concurrent modification exception (#8681)
RyanHolstien Aug 24, 2023
22c35f1
fix(ingest/bigquery): Add config option to create DataPlatformInstanc…
asikowitz Aug 24, 2023
bcef25a
feat(ingest/looker): Record observed lineage timestamps for Looker an…
ANich Aug 24, 2023
43d48dd
feat(ingest/mssql): load jobs and stored procedures (#5363)
RChygir Aug 24, 2023
d6a935e
fix(ingestion/kafka-connect): update retrieval of database name in De…
Starkie Aug 24, 2023
e285da3
feat(ingest/snowflake): tables from snowflake shares as siblings (#8531)
mayurinehate Aug 24, 2023
6659ff2
feat(ingest/sql-queries): Add sql queries source, SqlParsingBuilder, …
asikowitz Aug 24, 2023
a78e72c
highlight matched fields in search results (#8651)
Aug 24, 2023
d15f080
Add links to glossary term cards without counts (#8705)
Aug 24, 2023
58dfb88
fix non sibling document links (#8724)
Aug 24, 2023
8648126
refactor(policies): Rename edit all privilege to edit entity (#8722)
jjoyce0510 Aug 24, 2023
b6141f5
feat(java/ui) Add search suggestions to our search experience (#8710)
chriscollins3456 Aug 24, 2023
bca914d
fix(cypress) Fix login.js cypress test (#8719)
chriscollins3456 Aug 24, 2023
bf5499e
Fixes for faling login.js and managing_groups.js Cypress tests (#8725)
kkorchak Aug 25, 2023
15f7a12
fix(kafka-setup): remove dependency confluent docker utils (#8715)
lix-mms Aug 25, 2023
04ecf4f
docs(docs): add native versioning (#8714)
yoonhyejin Aug 25, 2023
d169ee8
config(ingest/rest): Update rest sink defaults to retry more often (#…
asikowitz Aug 26, 2023
6bfd94c
chore(jackson): update to released version of jackson (#8674)
david-leifker Aug 26, 2023
47571e6
fix(examples): fix typo in business glossary bootstrap yml (#8703)
mayurinehate Aug 28, 2023
9c5ab1b
fix(schemaRegistry): change api servlet check to only apply to intern…
RyanHolstien Aug 28, 2023
cc94ffb
fix(ingest): stateful redundant run skip handler (#8467)
mayurinehate Aug 28, 2023
ada40e6
fix(superset): get alternate platform value if sqlalchemy_uri param i…
akhil7philip Aug 28, 2023
7b66c32
feat(ingest): support writing configs to files (#8696)
hsheth2 Aug 28, 2023
2f11f24
feat(search): De-duplicate scale factors across entities (#8718)
iprentic Aug 28, 2023
97019d8
test(lineage): Add test for scroll across lineage (#8728)
iprentic Aug 28, 2023
3acd25b
feat(ingest/metabase): detect source table for cards sourced from oth…
k-popov Aug 28, 2023
437b787
(ingestion) bug fix: emit platform instance aspect for dataset in Dat…
jinlintt Aug 28, 2023
7b0ebe6
feat(config): Turn on new search & browse experience by default (#8737)
iprentic Aug 29, 2023
d86b336
chore(ingest/s3) Bump Deequ and Pyspark version (#8638)
treff7es Aug 29, 2023
04bf886
docs(ingest/openapi): Downgrade status from CERTIFIED to INCUBATING (…
asikowitz Aug 29, 2023
4539a1c
feat(health): Adding Entity Health Status to the Lineage Graph View …
jjoyce0510 Aug 29, 2023
19ce003
build(ingest): Pin mypy-boto3-sagemaker directly (#8746)
asikowitz Aug 29, 2023
40d17f0
feat(ingest/datahub): Improvements, bug fixes, and docs (#8735)
asikowitz Aug 29, 2023
df0113c
docs(obseve): Adding Volume Assertion Guide (#8706)
jjoyce0510 Aug 29, 2023
2776903
fix(ingest/okta): Removed code closing okta's event_loop (#8675)
skrydal Aug 29, 2023
44c9ba4
fix(highlight): disable full name highlight (#8750)
Aug 29, 2023
2eb93b9
fix(ui): hide pages from web crawlers (#8738)
hsheth2 Aug 29, 2023
97a77d8
docs: add index pages for feature/deployment guides (#8723)
hsheth2 Aug 29, 2023
bebee88
feat(docs): move versioned_sidebars to static-assets (#8743)
yoonhyejin Aug 30, 2023
dee1bc8
docs(observe): DataHub Operation freshness assertion guide (#8749)
zmcnellis Aug 30, 2023
5032af9
feat(cli): support recursive deletes (#8709)
hsheth2 Aug 30, 2023
fa0c43c
fix(ingest/bigquery): Handle null view_definition; remove view defini…
asikowitz Aug 30, 2023
026f7ab
feat(ingest/usage): Make cumulative query character limit configurabl…
asikowitz Aug 30, 2023
c193b1d
fix(ingest/athena): Fixing db container id (#8689)
treff7es Aug 30, 2023
1282e5b
feat(systemMetadata): add pipeline names to system metadata (#8684)
hsheth2 Aug 30, 2023
e867dbc
ci: separate airflow build and test (#8688)
mayurinehate Aug 30, 2023
4c69f9a
fix(ingest/athena): fix container linting (#8761)
hsheth2 Aug 30, 2023
e7d140f
fix(datahub-frontend) Give permission for start.sh so it can run (#8594)
rtekal Aug 31, 2023
21b2851
feat(sql-parser): schema-aware output column casing (#8760)
hsheth2 Aug 31, 2023
a4e7268
fix(ingest/bigquery): Filter out fine grained lineage with no upstrea…
asikowitz Aug 31, 2023
6fe60a2
feat(iceberg): Upgrade Iceberg ingestion source to pyiceberg 0.4.0 (#…
cccs-eric Aug 31, 2023
d78a790
Allow frontend to use http proxy (#8691)
githendrik Aug 31, 2023
ee06a65
docs(observe): Dataset Profile volume assertion guide (#8764)
zmcnellis Aug 31, 2023
5458c6b
docs:fix broken img links under managed-datahub (#8769)
yoonhyejin Sep 1, 2023
31fc687
fix:small typo on graphql tutorial (#8741)
yoonhyejin Sep 1, 2023
7dd6e09
refactor(build): upgrade to gradle 7 & guava update (#8745)
david-leifker Sep 1, 2023
23f24f4
fix(siblings): space icons out (#8767)
Sep 1, 2023
59b59c2
chore(build): upgrade gradle wrapper (#8776)
hsheth2 Sep 2, 2023
1b79142
feat(EntityService): batched transactions and ebean updates (#8456)
david-leifker Sep 3, 2023
d308986
fix(frontend): Fix"Logout with OIDC not working" (#8773)
FirKys Sep 3, 2023
801208e
docs:upgrade docusaurus version (#8770)
yoonhyejin Sep 4, 2023
065a290
fix:change global graph url to static-assets (#8742)
yoonhyejin Sep 4, 2023
9f223c0
doc(tests): fix endpoint param to push results (#8783)
anshbansal Sep 5, 2023
c38bb91
fix(elastic): improve error handling for profiling (#8785)
anshbansal Sep 5, 2023
7b443e5
chore(analytics): bump version (#8786)
Sep 5, 2023
5b7487d
docs(session): add documentation for session token duration and fix d…
RyanHolstien Sep 6, 2023
ac025e5
fix(ingest/datahub): Support postgres; build(postgres): Modernize pos…
asikowitz Sep 6, 2023
f649d6f
feat(airflow-plugin): add package type information (#8795)
mayurinehate Sep 6, 2023
94fa514
feat(systemMetadata): Adding a lastRunId field system metadata (#8672)
jjoyce0510 Sep 6, 2023
25148f4
refactor(ingest): Add support for group-owners in dataflow entities (…
dnks23 Sep 6, 2023
8bf28bf
fix(ingest/tableau): fix tableau native CLL for snowflake, add type a…
mayurinehate Sep 6, 2023
e680a97
fix(ingest/bigquery): fix partition and median queries for profiling …
mayurinehate Sep 6, 2023
f353bc7
docs: add datahub source to integrations page (#8787)
hsheth2 Sep 6, 2023
4ffad4d
chore(ingest): upgrade sqlglot fork (#8775)
hsheth2 Sep 6, 2023
6a1a1dd
docs: minor fix on versioning navbar and dropdown (#8790)
jeffmerrick Sep 7, 2023
0e8000c
feat(ingest): drop sql_metadata parser (#8765)
hsheth2 Sep 7, 2023
eb4107a
fix(ingest): drop `wrap_aspect_as_workunit` method (#8766)
hsheth2 Sep 7, 2023
cf16684
feat(search): Also de-duplicate the field queries based on field name…
iprentic Sep 7, 2023
56c00f5
feat(openapi): entity endpoints & analytics raw (#8537)
david-leifker Sep 7, 2023
60f1fa9
docs(db-retention): update with default setting (#8797)
david-leifker Sep 7, 2023
519b36d
fix(custom-search): fix custom search to be able to use unquoted quer…
david-leifker Sep 7, 2023
90bceb7
feat: add feedback widget (#8732)
yoonhyejin Sep 8, 2023
794eb03
fix(gms): Fixed Recently Viewed section for users with '@' in the URN…
skrydal Sep 8, 2023
68ae3bf
fix(spark-test): upgrade gradle and fix spark smoke test (#8777)
david-leifker Sep 8, 2023
75252a3
feat(UI): AccessManagement UI to access the role metadata for a datas…
Ramendra761 Sep 8, 2023
aef0f27
Glossary Navigation Cypress test (#8804)
kkorchak Sep 8, 2023
6c55eb8
ci: upgrade python to 3.10 for builds (#8808)
hsheth2 Sep 8, 2023
95b2d43
feat(ingestion/looker): Add view file-path as option in view_naming_p…
siddiquebagwan-gslab Sep 11, 2023
0f3819b
feat(upgrade): add ability to provide a startingOffset for RestoreInd…
ukayani Sep 11, 2023
1efc5d9
fix(index): Do not override the search analyzer for ngram fields (#8818)
iprentic Sep 11, 2023
486bd86
test(managed_ingestion): fix managed ingestion test by fixing actions…
david-leifker Sep 11, 2023
5a0ce38
docs: add 0.11 docs to docs site (#8813)
hsheth2 Sep 11, 2023
aee1e68
docs(release): Update updating-datahub.md for 0.11.0 release (#8821)
iprentic Sep 11, 2023
a021053
fix(ingest/mssql): Add UNIQUEIDENTIFIER data type as String (#8642)
cjm98332 Sep 12, 2023
303a2d0
build(ingest): upgrade to sqlalchemy 1.4, drop 1.3 support (#8810)
mayurinehate Sep 12, 2023
f7fee74
fix(ingest): use epoch 1 for dev build versions (#8824)
hsheth2 Sep 12, 2023
449cc9b
ci: make wheel builds more robust (#8815)
hsheth2 Sep 12, 2023
138f6c0
feat(cli): fix upload ingest cli endpoint (#8826)
pedro93 Sep 12, 2023
3cc0f76
docs(transformer): fix names in sample code of 'pattern_add_dataset_d…
Starkie Sep 12, 2023
785ab77
fix(siblingsHook): check number of dbtUpstreams instead of all upStre…
ethan-cartwright Sep 13, 2023
e9b4727
fix(java) Update DataProductMapper to always return a name (#8832)
chriscollins3456 Sep 13, 2023
1474ac0
build(ingest): Bump jsonschema for Python >= 3.8 (#8836)
asikowitz Sep 13, 2023
493d315
feat(ingest/rest-emitter): Do not raise error on retry failure to get…
asikowitz Sep 13, 2023
31abf38
ci: add markdown-link-check (#8771)
yoonhyejin Sep 14, 2023
654b502
docs(managed datahub): release notes 0.2.11 (#8830)
anshbansal Sep 14, 2023
e75900b
build(ingest): Remove constraint on jsonschema for Python >= 3.8 (#8842)
asikowitz Sep 14, 2023
3193421
fix(build): clean task cleanup generated src (#8844)
anshbansal Sep 15, 2023
ec714fc
feat(ci): disable ingestion smoke build (#8845)
anshbansal Sep 15, 2023
0f77447
fix: fix quickstart page (#8784)
yoonhyejin Sep 15, 2023
cdb9f5b
feat(bigquery): add better timers around every API call (#8626)
mayurinehate Sep 15, 2023
f4da939
feat(ingestion/dynamodb): Add DynamoDB as new metadata ingestion sour…
TonyOuyangGit Sep 15, 2023
99d7eb7
feat(ingest/bigquery): support bigquery profiling with sampling (#8794)
mayurinehate Sep 15, 2023
5882fe4
Fix for edit_documentation and glossary_navigation cypress tests (#8838)
kkorchak Sep 18, 2023
85fa5a1
feat(ui/java) Update domains to be nested (#8841)
chriscollins3456 Sep 18, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
85 changes: 85 additions & 0 deletions .github/workflows/airflow-plugin.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,85 @@
name: Airflow Plugin
on:
push:
branches:
- master
paths:
- ".github/workflows/airflow-plugin.yml"
- "metadata-ingestion-modules/airflow-plugin/**"
- "metadata-ingestion/**"
- "metadata-models/**"
pull_request:
branches:
- master
paths:
- ".github/**"
- "metadata-ingestion-modules/airflow-plugin/**"
- "metadata-ingestion/**"
- "metadata-models/**"
release:
types: [published]

concurrency:
group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}
cancel-in-progress: true

jobs:
airflow-plugin:
runs-on: ubuntu-latest
env:
SPARK_VERSION: 3.0.3
DATAHUB_TELEMETRY_ENABLED: false
strategy:
matrix:
include:
- python-version: "3.7"
extraPythonRequirement: "apache-airflow~=2.1.0"
- python-version: "3.7"
extraPythonRequirement: "apache-airflow~=2.2.0"
- python-version: "3.10"
extraPythonRequirement: "apache-airflow~=2.4.0"
- python-version: "3.10"
extraPythonRequirement: "apache-airflow~=2.6.0"
- python-version: "3.10"
extraPythonRequirement: "apache-airflow>2.6.0"
fail-fast: false
steps:
- uses: actions/checkout@v3
- uses: actions/setup-python@v4
with:
python-version: ${{ matrix.python-version }}
cache: "pip"
- name: Install dependencies
run: ./metadata-ingestion/scripts/install_deps.sh
- name: Install airflow package and test (extras ${{ matrix.extraPythonRequirement }})
run: ./gradlew -Pextra_pip_requirements='${{ matrix.extraPythonRequirement }}' :metadata-ingestion-modules:airflow-plugin:lint :metadata-ingestion-modules:airflow-plugin:testQuick
- name: pip freeze show list installed
if: always()
run: source metadata-ingestion-modules/airflow-plugin/venv/bin/activate && pip freeze
- uses: actions/upload-artifact@v3
if: ${{ always() && matrix.python-version == '3.10' && matrix.extraPythonRequirement == 'apache-airflow>2.6.0' }}
with:
name: Test Results (Airflow Plugin ${{ matrix.python-version}})
path: |
**/build/reports/tests/test/**
**/build/test-results/test/**
**/junit.*.xml
- name: Upload coverage to Codecov
if: always()
uses: codecov/codecov-action@v3
with:
token: ${{ secrets.CODECOV_TOKEN }}
directory: .
fail_ci_if_error: false
flags: airflow-${{ matrix.python-version }}-${{ matrix.extraPythonRequirement }}
name: pytest-airflow
verbose: true

event-file:
runs-on: ubuntu-latest
steps:
- name: Upload
uses: actions/upload-artifact@v3
with:
name: Event File
path: ${{ github.event_path }}
2 changes: 1 addition & 1 deletion .github/workflows/check-datahub-jars.yml
Original file line number Diff line number Diff line change
Expand Up @@ -46,7 +46,7 @@ jobs:
java-version: 11
- uses: actions/setup-python@v4
with:
python-version: "3.7"
python-version: "3.10"
- name: check ${{ matrix.command }} jar
run: |
./gradlew :metadata-integration:java:${{ matrix.command }}:build --info
Expand Down
2 changes: 0 additions & 2 deletions .github/workflows/docker-ingestion-smoke.yml
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,6 @@ on:
release:
types: [published]
push:
branches:
- master
paths:
- "docker/datahub-ingestion-base/**"
- "smoke-test/**"
Expand Down
22 changes: 14 additions & 8 deletions .github/workflows/docker-unified.yml
Original file line number Diff line number Diff line change
Expand Up @@ -45,6 +45,7 @@ jobs:
unique_slim_tag: ${{ steps.tag.outputs.unique_slim_tag }}
unique_full_tag: ${{ steps.tag.outputs.unique_full_tag }}
publish: ${{ steps.publish.outputs.publish }}
python_release_version: ${{ steps.tag.outputs.python_release_version }}
steps:
- name: Checkout
uses: actions/checkout@v3
Expand All @@ -58,6 +59,7 @@ jobs:
echo "unique_tag=$(get_unique_tag)" >> $GITHUB_OUTPUT
echo "unique_slim_tag=$(get_unique_tag)-slim" >> $GITHUB_OUTPUT
echo "unique_full_tag=$(get_unique_tag)-full" >> $GITHUB_OUTPUT
echo "python_release_version=$(get_python_docker_release_v)" >> $GITHUB_OUTPUT
- name: Check whether publishing enabled
id: publish
env:
Expand Down Expand Up @@ -549,6 +551,7 @@ jobs:
runs-on: ubuntu-latest
outputs:
tag: ${{ steps.tag.outputs.tag }}
needs_artifact_download: ${{ (steps.filter.outputs.datahub-ingestion-base == 'true' || steps.filter.outputs.datahub-ingestion == 'true') && needs.setup.outputs.publish != 'true' }}
needs: [setup, datahub_ingestion_base_slim_build]
steps:
- name: Check out the repo
Expand All @@ -572,7 +575,7 @@ jobs:
with:
image: ${{ env.DATAHUB_INGESTION_BASE_IMAGE }}:${{ steps.filter.outputs.datahub-ingestion-base == 'true' && needs.setup.outputs.unique_slim_tag || 'head' }}
- name: Build and push Slim Image
if: ${{ steps.filter.outputs.datahub-ingestion-base == 'true' || steps.filter.outputs.datahub-ingestion == 'true' }}
if: ${{ steps.filter.outputs.datahub-ingestion-base == 'true' || steps.filter.outputs.datahub-ingestion == 'true' || needs.setup.outputs.publish }}
uses: ./.github/actions/docker-custom-build-and-push
with:
target: final
Expand All @@ -581,6 +584,7 @@ jobs:
build-args: |
BASE_IMAGE=${{ env.DATAHUB_INGESTION_BASE_IMAGE }}
DOCKER_VERSION=${{ steps.filter.outputs.datahub-ingestion-base == 'true' && needs.setup.outputs.unique_slim_tag || 'head' }}
RELEASE_VERSION=${{ needs.setup.outputs.python_release_version }}
APP_ENV=slim
tags: ${{ needs.setup.outputs.slim_tag }}
username: ${{ secrets.ACRYL_DOCKER_USERNAME }}
Expand All @@ -605,7 +609,7 @@ jobs:
uses: actions/checkout@v3
- name: Download image Slim Image
uses: ishworkh/docker-image-artifact-download@v1
if: ${{ needs.setup.outputs.publish != 'true' }}
if: ${{ needs.datahub_ingestion_slim_build.outputs.needs_artifact_download == 'true' }}
with:
image: ${{ env.DATAHUB_INGESTION_IMAGE }}:${{ needs.datahub_ingestion_slim_build.outputs.tag }}
- name: Run Trivy vulnerability scanner Slim Image
Expand All @@ -630,6 +634,7 @@ jobs:
runs-on: ubuntu-latest
outputs:
tag: ${{ steps.tag.outputs.tag }}
needs_artifact_download: ${{ (steps.filter.outputs.datahub-ingestion-base == 'true' || steps.filter.outputs.datahub-ingestion == 'true') && needs.setup.outputs.publish != 'true' }}
needs: [setup, datahub_ingestion_base_full_build]
steps:
- name: Check out the repo
Expand All @@ -653,7 +658,7 @@ jobs:
with:
image: ${{ env.DATAHUB_INGESTION_BASE_IMAGE }}:${{ steps.filter.outputs.datahub-ingestion-base == 'true' && needs.setup.outputs.unique_full_tag || 'head' }}
- name: Build and push Full Image
if: ${{ steps.filter.outputs.datahub-ingestion-base == 'true' || steps.filter.outputs.datahub-ingestion == 'true' }}
if: ${{ steps.filter.outputs.datahub-ingestion-base == 'true' || steps.filter.outputs.datahub-ingestion == 'true' || needs.setup.outputs.publish }}
uses: ./.github/actions/docker-custom-build-and-push
with:
target: final
Expand All @@ -662,6 +667,7 @@ jobs:
build-args: |
BASE_IMAGE=${{ env.DATAHUB_INGESTION_BASE_IMAGE }}
DOCKER_VERSION=${{ steps.filter.outputs.datahub-ingestion-base == 'true' && needs.setup.outputs.unique_full_tag || 'head' }}
RELEASE_VERSION=${{ needs.setup.outputs.python_release_version }}
tags: ${{ needs.setup.outputs.unique_full_tag }}
username: ${{ secrets.ACRYL_DOCKER_USERNAME }}
password: ${{ secrets.ACRYL_DOCKER_PASSWORD }}
Expand All @@ -671,7 +677,7 @@ jobs:
platforms: linux/amd64,linux/arm64/v8
- name: Compute Tag (Full)
id: tag
run: echo "tag=${{ steps.filter.outputs.datahub-ingestion-base == 'true' && needs.setup.outputs.unique_full_tag || 'head' }}" >> $GITHUB_OUTPUT
run: echo "tag=${{ (steps.filter.outputs.datahub-ingestion-base == 'true' || steps.filter.outputs.datahub-ingestion == 'true') && needs.setup.outputs.unique_full_tag || 'head' }}" >> $GITHUB_OUTPUT
datahub_ingestion_full_scan:
permissions:
contents: read # for actions/checkout to fetch code
Expand All @@ -685,7 +691,7 @@ jobs:
uses: actions/checkout@v3
- name: Download image Full Image
uses: ishworkh/docker-image-artifact-download@v1
if: ${{ needs.setup.outputs.publish != 'true' }}
if: ${{ needs.datahub_ingestion_full_build.outputs.needs_artifact_download == 'true' }}
with:
image: ${{ env.DATAHUB_INGESTION_IMAGE }}:${{ needs.datahub_ingestion_full_build.outputs.tag }}
- name: Run Trivy vulnerability scanner Full Image
Expand Down Expand Up @@ -737,7 +743,7 @@ jobs:
java-version: 11
- uses: actions/setup-python@v4
with:
python-version: "3.7"
python-version: "3.10"
cache: "pip"
- name: Install dependencies
run: ./metadata-ingestion/scripts/install_deps.sh
Expand Down Expand Up @@ -792,7 +798,7 @@ jobs:
image: ${{ env.DATAHUB_UPGRADE_IMAGE }}:${{ needs.setup.outputs.unique_tag }}
- name: Download datahub-ingestion-slim image
uses: ishworkh/docker-image-artifact-download@v1
if: ${{ needs.setup.outputs.publish != 'true' }}
if: ${{ needs.datahub_ingestion_slim_build.outputs.needs_artifact_download == 'true' }}
with:
image: ${{ env.DATAHUB_INGESTION_IMAGE }}:${{ needs.datahub_ingestion_slim_build.outputs.tag }}
- name: Disk Check
Expand All @@ -803,7 +809,7 @@ jobs:
DATAHUB_VERSION: ${{ needs.setup.outputs.unique_tag }}
DATAHUB_ACTIONS_IMAGE: ${{ env.DATAHUB_INGESTION_IMAGE }}
ACTIONS_VERSION: ${{ needs.datahub_ingestion_slim_build.outputs.tag }}
ACTIONS_EXTRA_PACKAGES: 'acryl-datahub-actions[executor] acryl-datahub-actions'
ACTIONS_EXTRA_PACKAGES: 'acryl-datahub-actions[executor]==0.0.13 acryl-datahub-actions==0.0.13 acryl-datahub==0.10.5'
ACTIONS_CONFIG: 'https://raw.githubusercontent.com/acryldata/datahub-actions/main/docker/config/executor.yaml'
run: |
./smoke-test/run-quickstart.sh
Expand Down
1 change: 1 addition & 0 deletions .github/workflows/documentation.yml
Original file line number Diff line number Diff line change
Expand Up @@ -27,6 +27,7 @@ jobs:
- uses: actions/setup-python@v4
with:
python-version: "3.10"
cache: pip
- name: Install Python dependencies
run: ./metadata-ingestion/scripts/install_deps.sh
- name: Build Docs
Expand Down
9 changes: 3 additions & 6 deletions .github/workflows/metadata-ingestion.yml
Original file line number Diff line number Diff line change
Expand Up @@ -25,7 +25,7 @@ jobs:
metadata-ingestion:
runs-on: ubuntu-latest
env:
SPARK_VERSION: 3.0.3
SPARK_VERSION: 3.3.2
DATAHUB_TELEMETRY_ENABLED: false
# TODO: Enable this once the test is fixed.
# DATAHUB_LOOKML_GIT_TEST_SSH_KEY: ${{ secrets.DATAHUB_LOOKML_GIT_TEST_SSH_KEY }}
Expand All @@ -42,9 +42,7 @@ jobs:
]
include:
- python-version: "3.7"
extraPythonRequirement: "sqlalchemy==1.3.24 apache-airflow~=2.2.0"
- python-version: "3.10"
extraPythonRequirement: "sqlalchemy~=1.4.0 apache-airflow>=2.4.0"
fail-fast: false
steps:
- uses: actions/checkout@v3
Expand All @@ -56,8 +54,8 @@ jobs:
run: ./metadata-ingestion/scripts/install_deps.sh
- name: Install package
run: ./gradlew :metadata-ingestion:installPackageOnly
- name: Run metadata-ingestion tests (extras ${{ matrix.extraPythonRequirement }})
run: ./gradlew -Pextra_pip_requirements='${{ matrix.extraPythonRequirement }}' :metadata-ingestion:${{ matrix.command }}
- name: Run metadata-ingestion tests
run: ./gradlew :metadata-ingestion:${{ matrix.command }}
- name: pip freeze show list installed
if: always()
run: source metadata-ingestion/venv/bin/activate && pip freeze
Expand All @@ -80,7 +78,6 @@ jobs:
name: pytest-${{ matrix.command }}
verbose: true


event-file:
runs-on: ubuntu-latest
steps:
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/metadata-io.yml
Original file line number Diff line number Diff line change
Expand Up @@ -36,7 +36,7 @@ jobs:
java-version: 11
- uses: actions/setup-python@v4
with:
python-version: "3.7"
python-version: "3.10"
- name: Gradle build (and test)
# there is some race condition in gradle build, which makes gradle never terminate in ~30% of the runs
# running build first without datahub-web-react:yarnBuild and then with it is 100% stable
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/publish-datahub-jars.yml
Original file line number Diff line number Diff line change
Expand Up @@ -58,7 +58,7 @@ jobs:
java-version: 11
- uses: actions/setup-python@v4
with:
python-version: "3.7"
python-version: "3.10"
- name: checkout upstream repo
run: |
git remote add upstream https://github.com/datahub-project/datahub.git
Expand Down
7 changes: 6 additions & 1 deletion .github/workflows/spark-smoke-test.yml
Original file line number Diff line number Diff line change
Expand Up @@ -30,16 +30,21 @@ jobs:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
with:
fetch-depth: 800
fetch-tags: true
- name: Set up JDK 11
uses: actions/setup-java@v3
with:
distribution: "zulu"
java-version: 11
- uses: actions/setup-python@v4
with:
python-version: "3.7"
python-version: "3.10"
- name: Install dependencies
run: ./metadata-ingestion/scripts/install_deps.sh
- name: Remove images
run: docker image prune -a -f || true
- name: Smoke test
run: |
./gradlew :metadata-integration:java:spark-lineage:integrationTest \
Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/test-results.yml
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@ name: Test Results

on:
workflow_run:
workflows: ["build & test", "metadata ingestion"]
workflows: ["build & test", "metadata ingestion", "Airflow Plugin"]
types:
- completed

Expand Down
2 changes: 2 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -69,6 +69,8 @@ metadata-ingestion/generated/**

# docs
docs/generated/
docs-website/versioned_docs/
docs-website/versioned_sidebars/
tmp*
temp/**

Expand Down
12 changes: 8 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ export const Logo = (props) => {
<div style={{ display: "flex", justifyContent: "center", padding: "20px", height: "190px" }}>
<img
alt="DataHub Logo"
src={useBaseUrl("/img/datahub-logo-color-mark.svg")}
src="https://raw.githubusercontent.com/datahub-project/static-assets/main/imgs/datahub-logo-color-mark.svg"
{...props}
/>
</div>
Expand All @@ -18,7 +18,7 @@ export const Logo = (props) => {
<!--
HOSTED_DOCS_ONLY-->
<p align="center">
<img alt="DataHub" src="docs/imgs/datahub-logo-color-mark.svg" height="150" />
<img alt="DataHub" src="https://raw.githubusercontent.com/datahub-project/static-assets/main/imgs/datahub-logo-color-mark.svg" height="150" />
</p>
<!-- -->

Expand Down Expand Up @@ -80,7 +80,11 @@ Please follow the [DataHub Quickstart Guide](https://datahubproject.io/docs/quic

If you're looking to build & modify datahub please take a look at our [Development Guide](https://datahubproject.io/docs/developers).

[![DataHub Demo GIF](docs/imgs/entity.png)](https://demo.datahubproject.io/)
<p align="center">
<a href="https://demo.datahubproject.io/">
<img width="70%" src="https://raw.githubusercontent.com/datahub-project/static-assets/main/imgs/entity.png"/>
</a>
</p>

## Source Code and Repositories

Expand Down Expand Up @@ -152,7 +156,7 @@ Here are the companies that have officially adopted DataHub. Please feel free to

- [DataHub Blog](https://blog.datahubproject.io/)
- [DataHub YouTube Channel](https://www.youtube.com/channel/UC3qFQC5IiwR5fvWEqi_tJ5w)
- [Optum: Data Mesh via DataHub](https://optum.github.io/blog/2022/03/23/data-mesh-via-datahub/)
- [Optum: Data Mesh via DataHub](https://opensource.optum.com/blog/2022/03/23/data-mesh-via-datahub)
- [Saxo Bank: Enabling Data Discovery in Data Mesh](https://medium.com/datahub-project/enabling-data-discovery-in-a-data-mesh-the-saxo-journey-451b06969c8f)
- [Bringing The Power Of The DataHub Real-Time Metadata Graph To Everyone At Acryl Data](https://www.dataengineeringpodcast.com/acryl-data-datahub-metadata-graph-episode-230/)
- [DataHub: Popular Metadata Architectures Explained](https://engineering.linkedin.com/blog/2020/datahub-popular-metadata-architectures-explained)
Expand Down
Loading